Wiktionary

Up to date as of January 15, 2010
(Redirected to Wiktionary:Beer parlour article)

Definition from Wiktionary, a free dictionary


Welcome, all, to the Beer Parlour! This is the place where many a historic decision has been made and where important discussions are being held daily. If you have a question about fundamental Wiktionary aspects—that is, about policies, proposals and other community-wide features—please place it at the bottom of the list (click on Start a new discussion), and it will be considered. Please keep in mind the rules of discussion: remain civil, don't make personal attacks, don't change other people's posts, and sign your comments with four tildes (~~~~), which produces your name with timestamp. Also keep in mind the purpose of this page. There are various other discussion rooms which may serve the idea behind your questions better. Please take a look to see which is most appropriate.

Sometimes discussion identifies an issue as an idea for policy development or rewriting. Such discussions may be taken out of the Beer parlour to the relevant policy page, or a brand new one may be created. See Category:Policies - Wiktionary Top Level for identified policy pages. Some of these may be inactive. Usually, the active policy pages will be listed in one of the sections below. See also the policy development page.

Shortcuts:
WT:BP
WT:BEER

Questions and answers will not remain on this page indefinitely, as it would very soon become too long to be editable. After a period of time with no further activity (usually a couple of weeks), information will be moved to the archives. We make a point to preserve all discussions that were started here in the archives. However, talk that is clearly not intended for this page may be moved and will not end up in the archives. Enjoy the Beer parlour!

Beer parlour archives
2002
December
2003
  • January-March
  • April-June
  • July-September
  • October-December
2004
  • January-March
  • April-June
  • July-September
  • October-December
2005
  • January-March
  • April-June
  • July-September
  • October-December
2006
  • January
  • February
  • March
  • April
  • May
  • June
  • July
  • August
  • September
  • October
  • November
  • December
2007
  • January
  • February
  • March
  • April
  • May
  • June
  • July
  • August
  • September
  • October
  • November
  • December
2008
  • January
  • February
  • March
  • April
  • May
  • June
  • July
  • August
  • September
  • October
  • November
  • December
2009
  • January
  • February
  • March
  • April
  • May
  • June
  • July
  • August
  • September
  • October
  • November
  • December
All subject headings

October 2009

CFI Clarification required

(copy of entry in Wiktionary_talk:Criteria_for_inclusion#Clarification_Required) The CFI (Criteria for Inclusion) need clarification on one point:-

Attestation.

“Attested” means verified through
  • Clearly widespread use,
  • Usage in a well-known work,
  • Appearance in a refereed academic journal, or
  • Usage in permanently recorded media, conveying meaning, in at least three independent instances spanning at least a year.

Are those 4 attestation criteria joined by OR, or by AND?

My personal view is that they should be joined by an OR, so that a term needs to meet ANY one of the criteria and does not need to meet ALL of them.

I would suggest a change of the paragraph to

“Attested” means verified through meeting ANY of the following conditions

  • Clearly widespread use,
  • Usage in a well-known work,
  • Appearance in a refereed academic journal, or
  • Usage in permanently recorded media, conveying meaning, in at least three independent instances spanning at least a year.

I cannot be bothered to mount a campaign or vote on my own. Anyone agree enough to take it on ? --Richardb 14:28, 1 October 2009 (UTC)

They are joined by an "or". It's right there in the text. I agree that this is not sufficiently clear on a first reading; in fact just a little while ago I was modifying Wiktionary:Editable CFI in a way similar to your suggestion. Please feel free to modify that page further -- it is there to be edited. -- Visviva 14:34, 1 October 2009 (UTC)
Whilst I can see the little "or" buried in there, there are clearly those who cannot. And as to being bold and -- it is there to be edited--- there is CLEARLY at the top of the page "This is a Wiktionary policy, guideline or common practices page. It should not be modified without a VOTE." I've been around long enough, and been battered enough, not to try messing with CFI without a VOTE.--Richardb 15:44, 1 October 2009 (UTC)
Wiktionary:Editable CFI is a separate page. Conrad.Irwin 17:29, 1 October 2009 (UTC)
Yeh ? I was still beaten up for editing that page too!--Richardb 07:39, 30 November 2009 (UTC)
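
For readers who skim past the small "or": the "any of" reading confirmed in the replies above can be sketched in Python. This is only an illustration, not policy text. The Evidence class, its field names, and the reading of "spanning at least a year" as 365 days are assumptions made for the example.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List


@dataclass
class Evidence:
    """Hypothetical record of what has been found for a term."""
    widespread_use: bool = False
    well_known_work: bool = False
    refereed_journal: bool = False
    # Dates of independent uses in permanently recorded media.
    citation_dates: List[date] = field(default_factory=list)


def three_citations_spanning_a_year(dates: List[date]) -> bool:
    # Assumption: "spanning at least a year" is read as >= 365 days
    # between the earliest and the latest citation.
    if len(dates) < 3:
        return False
    return (max(dates) - min(dates)).days >= 365


def is_attested(ev: Evidence) -> bool:
    # The four criteria are joined by OR: meeting any one is enough.
    return any([
        ev.widespread_use,
        ev.well_known_work,
        ev.refereed_journal,
        three_citations_spanning_a_year(ev.citation_dates),
    ])
```

Under this reading, a term used in a single well-known work passes even with no other evidence, which is exactly what an "AND" reading would forbid.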

Note on some unusable sources.

There are a few publishing houses out there, notably Alphascript Publishing and Global Vision Publishing House (that I've seen, at least) which are putting out "books" which are nothing more than collections of Wikipedia articles plucked and thrown together. Obviously, such works should not be counted towards the CFI requirement for demonstrating usage of words within them any more than the copied Wikipedia articles themselves. Cheers! bd2412 T 17:12, 3 October 2009 (UTC)

Well, obviously they would only count once ... but if they are in fact printed paper books, I'm not sure why we wouldn't accept them as citations. We would accept a printed quote from a Wikipedia article, wouldn't we? -- Visviva 04:14, 4 October 2009 (UTC)
A printed quote in a peer-reviewed source, I'm sure we could use. However, if someone is just grabbing a handful of Wikipedia titles and calling the collection a book (for $60-70, no less) they are no more reliable for our purposes than an openly editable Wikipedia article. From what I'm told, I should add, these printed volumes come complete with whatever spelling/syntax errors are to be found in the articles grabbed, and misprints of certain characters. See User:PrimeHunter/Alphascript Publishing sells free articles as expensive books for details, but my particular concern is that these books do come up in word searches on Google Books. Cheers! bd2412 T 23:04, 4 October 2009 (UTC)
We are only looking for durably archived examples of usage. WP can't rely on a printed copy of itself. We could rely on a printed copy of articles from WP because it overcomes the durable archiving issue. I wonder under what circumstances we could rely on printed copies of our own usage examples for attestation! Both sources run the risk that a savvy person could game attestation, but we already face that problem with Usenet. A savvy PR person could plant terms in the press and in testimony to game the system as well. DCDuring TALK 01:16, 5 October 2009 (UTC)
Our own usage examples are mentions, not uses. But our discussions (as here in the BP) are uses.—msh210 17:24, 7 October 2009 (UTC)
With the advent of self-publishing mechanisms like Lulu, the divide between "books as properly reviewed and edited texts" and "books as anything anyone can write" (à la blog) is becoming more and more blurry. I foresee this becoming a problem in the near future. Equinox ◑ 21:45, 5 October 2009 (UTC)
Not sure why somebody reverted that edit. To clarify: I don't mean that the man on the street should never, under any circumstances, be allowed to write a book, but that books without skilled editing are liable to be a mess, and that our WT:CFI currently relies on a tacit assumption that books don't have the failings of blogs and Web pages. Equinox ◑ 22:27, 5 October 2009 (UTC)

Extended Wiktionary queries now available

I've been working on a tool which makes it possible to make many new kinds of queries on the English Wiktionary. It doesn't yet have a public front-end but for now you can submit queries to me and I will try to fulfil them.

For an example of what is possible see User talk:Vahagn Petrosyan/Armenian nouns lacking declension sections.

Basically anything involving page titles, languages, section headings, and section heading levels. It's also possible to compare and count.

I would like to add categories and information other than section headings too and if anybody would like to help me improve the tool that would also be great. — hippietrail 08:03, 6 October 2009 (UTC)

i'dlike 2find it[+ipa pl--史凡>voice-MSN/skypeme!RSI>typin=hard! 10:17, 6 October 2009 (UTC)
I'd like to find italian entries containing IPA please
L☺g☺maniac chat? 13:30, 6 October 2009 (UTC) +--史凡>voice-MSN/skypeme!RSI>typin=hard! 14:40, 6 October 2009 (UTC)
Sorry but IPA doesn't involve a section heading. I could find all Italian entries with a Pronunciation section if you like. — hippietrail 17:01, 6 October 2009 (UTC)

k+same4cantones pl--史凡>voice-MSN/skypeme!RSI>typin=hard! 01:32, 7 October 2009 (UTC)

I like it, this will make many cleanup lists easier to devise and keep updated. Can't wait for a front-end. --Bequw → ¢ • τ 16:38, 7 October 2009 (UTC)
A front-end is difficult because SQL is difficult and allowing users to enter arbitrary SQL would be dangerous. That means I have to wrap it in some generalized code. But to do that I need to know what kinds of queries people will want. So please ask for some and the front-end will get closer.
I have categories working now too so for another example I was able to find all the differences between pages with Armenian Noun headings and Armenian noun categories. — hippietrail 00:51, 8 October 2009 (UTC)
Using just headword, section headers and categories: Sample queries: English entries with a "homophones" header or in homophones category with no Pronunciation section; English PoS headers not in corresponding English PoS category; English entries in category English Prepositional phrases with only Adverb headers in English section, with Phrase header, etc. English phrase headers not in any of a set of categories. Single-word English entries with Proverb, Phrase, or Idiom headers.
Many of these would need a "count" run, followed by a "sample" run of 10 or 20 (not necessarily a random sample, but not always beginning with "a"). HTH. DCDuring TALK 01:47, 8 October 2009 (UTC)
  • User:HippieBot/English homophones without pronunciations — hippietrail 04:10, 8 October 2009 (UTC)
  • User:HippieBot/Single-word English entries with Proverb, Phrase, or Idiom headers — hippietrail 14:28, 8 October 2009 (UTC)
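
For anyone curious what such a query might look like, here is a rough sketch of the "homophones without pronunciations" request, with the "count" run and the "sample" run asked for above. It is not the actual tool: the headings table, its columns, and the demo rows are all invented for illustration, and SQLite (via Python's sqlite3 module) simply stands in for whatever database the real metadata tables live in.

```python
import sqlite3

# Hypothetical schema: one row per section heading found in a dump.
SCHEMA = "CREATE TABLE headings (title TEXT, lang TEXT, heading TEXT, level INTEGER)"

# Made-up demo rows, not the real state of any entry.
DEMO_ROWS = [
    ("knight", "English", "Pronunciation", 3),
    ("knight", "English", "Homophones", 4),
    ("night", "English", "Homophones", 3),   # no Pronunciation section
    ("nuit", "French", "Homophones", 3),     # other language, ignored
]

HOMOPHONES_WITHOUT_PRONUNCIATION = """
    SELECT DISTINCT h.title AS title
    FROM headings AS h
    WHERE h.lang = 'English'
      AND h.heading = 'Homophones'
      AND NOT EXISTS (
          SELECT 1 FROM headings AS p
          WHERE p.title = h.title
            AND p.lang = h.lang
            AND p.heading = 'Pronunciation'
      )
"""


def build_demo_db():
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    conn.executemany("INSERT INTO headings VALUES (?, ?, ?, ?)", DEMO_ROWS)
    return conn


def count_run(conn):
    # "Count" run: how many entries match at all.
    sql = "SELECT COUNT(*) FROM (" + HOMOPHONES_WITHOUT_PRONUNCIATION + ")"
    return conn.execute(sql).fetchone()[0]


def sample_run(conn, n=20):
    # "Sample" run: up to n titles in random order, so the list
    # does not always begin with "a".
    sql = ("SELECT title FROM (" + HOMOPHONES_WITHOUT_PRONUNCIATION + ") "
           "ORDER BY RANDOM() LIMIT ?")
    return [title for (title,) in conn.execute(sql, (n,))]
```

Keeping the query text fixed and passing only the sample size as a bound parameter is one way to avoid letting users enter arbitrary SQL, which is the danger mentioned above.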
What an interesting effort. Can I have a list of words in Category:1000 English basic words that lack the etymology header? If yes, you can post it to my user space or anywhere else fit.
Does the tool have a home page here in wiki? --Dan Polansky 07:44, 8 October 2009 (UTC)
  • User:HippieBot/English basic words without etymologies
    No home page yet. Rather than being a tool I have several tools which create metadata database tables from a dump file and for now I'm figuring out SQL queries for those tables. When I get the hang of it I will attempt to craft it into a tool with a web front end. — hippietrail 14:08, 8 October 2009 (UTC)
    Thanks! Nice that you have also posted the SQL statement to the result page. --Dan Polansky 14:49, 8 October 2009 (UTC)
    Thanks very much for generating these extremely useful cleanup lists. One good use would be to make our PoS categories match our PoS headers so the categories could be relied upon as being nearly complete matches with the headers.
Being able to match templates and context tags with headers would be nice too, when as and if you get the chance. In the meantime, the lists that you can now generate make it easier to systematically do lots of cleanup. DCDuring TALK 15:45, 8 October 2009 (UTC)
A great mega-family of cleanup lists would be, for each language, entries with headers of a given PoS, but not in the corresponding categories. This is close to what an Ullman bot generates I think, but only in pursuit of changed items. DCDuring TALK 20:56, 8 October 2009 (UTC)
I have completed cleanup on the two lists you provided. I would like to work through some of the smaller English PoS categories: prepositions, determiners, conjunctions, pronouns, interjections, idioms, phrases, proverbs. The cleanup would be those with the headers indicated, but not in the categories indicated. DCDuring TALK 17:23, 10 October 2009 (UTC)
  • User:HippieBot/English determiners with mismatch between heading and category — hippietrail 03:04, 14 October 2009 (UTC)
    Thanks. Done, with one item needing discussion (own Adj per CGEL or Det per Quirk et al?). DCDuring TALK 08:54, 14 October 2009 (UTC)
  • User:HippieBot/English prepositions with mismatch between heading and category
  • User:HippieBot/English conjunctions with mismatch between heading and category
  • User:HippieBot/English interjections with mismatch between heading and category
  • User:HippieBot/English idioms with mismatch between heading and category
  • User:HippieBot/English phrases with mismatch between heading and category
  • User:HippieBot/English proverbs with mismatch between heading and category
    Many thanks. Prepositions done. The others should keep me busy for a while. DCDuring TALK 17:37, 14 October 2009 (UTC)
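
The "mismatch between heading and category" lists above could come from a pair of queries along the following lines. Again this is a sketch under assumptions: the headings and categories tables, their columns, and the three demo entries are hypothetical and say nothing about how the real entries are classified.

```python
import sqlite3


def build_demo_db():
    # Hypothetical metadata tables; the rows are made up for illustration.
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE headings (title TEXT, lang TEXT, heading TEXT);
        CREATE TABLE categories (title TEXT, category TEXT);
    """)
    conn.executemany("INSERT INTO headings VALUES (?, ?, ?)", [
        ("both", "English", "Determiner"),
        ("every", "English", "Determiner"),   # heading, but no category
        ("own", "English", "Adjective"),      # category, but no heading
    ])
    conn.executemany("INSERT INTO categories VALUES (?, ?)", [
        ("both", "English determiners"),
        ("own", "English determiners"),
    ])
    return conn


def find_mismatches(conn, lang, heading, category):
    """Return (titles with the heading but not the category,
               titles with the category but not the heading)."""
    missing_category = conn.execute(
        """
        SELECT DISTINCT h.title FROM headings AS h
        WHERE h.lang = ? AND h.heading = ?
          AND NOT EXISTS (
              SELECT 1 FROM categories AS c
              WHERE c.title = h.title AND c.category = ?
          )
        ORDER BY h.title
        """,
        (lang, heading, category),
    ).fetchall()
    missing_heading = conn.execute(
        """
        SELECT DISTINCT c.title FROM categories AS c
        WHERE c.category = ?
          AND NOT EXISTS (
              SELECT 1 FROM headings AS h
              WHERE h.title = c.title AND h.lang = ? AND h.heading = ?
          )
        ORDER BY c.title
        """,
        (category, lang, heading),
    ).fetchall()
    return ([t for (t,) in missing_category],
            [t for (t,) in missing_heading])
```

Calling find_mismatches(build_demo_db(), "English", "Determiner", "English determiners") on the demo data reports "every" as lacking the category and "own" as lacking the heading. Running the same function with other heading and category names would give the per-PoS lists requested above.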

Here's another one I thought of: User:HippieBot/English terms in unknown etymology category but without etymology section

Categorization of Abbreviations, Acronyms and Initialism

Shouldn't the categorizing templates ({{abbreviation}}, {{acronym}}, {{initialism}}) assume "English" unless otherwise specified, just like the context templates do? There's a whole mess of entries in Category:Abbreviations, acronyms and initialisms that are English and therefore should be in Category: English abbreviations, acronyms and initialisms but are miscategorized because people don't pass in lang=English/en. --Bequw → ¢ • τ 19:13, 8 October 2009 (UTC)

Makes sense. L☺g☺maniac chat? 19:18, 8 October 2009 (UTC)
Yes. English gets to be the default, but has the responsibility for cleaning up. Shouldn't there be a pseudolanguage code for "language to be assigned"? Is there one already? DCDuring TALK 20:13, 8 October 2009 (UTC)
{{und}} Undetermined is the relevant ISO 639-3 code you are thinking of. — Carolina wren discussió 20:32, 8 October 2009 (UTC)
Thanks, CW. DCDuring TALK 20:42, 8 October 2009 (UTC)
There are 6270 or so in that category. All the ones beginning with "." (>200) are translingual. Many of the others are translingual. Perhaps Hippietrail should make a gift of a cleanup list linked to the About pages for each language of all entries with an abbreviation header in a language section and not an entry in the appropriate category. If some items now have "en" as a parameter, I would guess it would be more efficient if the master list of misclassified items remained where it is. DCDuring TALK 20:42, 8 October 2009 (UTC)
Implemented. Now time for cleanup. --Bequw → ¢ • τ 00:25, 9 October 2009 (UTC)
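
To make the before and after behaviour concrete, here is a small Python model of the change. It is not the template code itself: the function names and the tiny code-to-name table are invented for the example, and only the two category names come from the discussion above.

```python
GENERIC_CATEGORY = "Abbreviations, acronyms and initialisms"

# Assumption: a minimal code-to-name table, just enough for the example.
LANGUAGE_NAMES = {"en": "English"}


def category_before(lang=None):
    # Old behaviour as described above: no lang parameter means the
    # entry lands in the language-neutral category.
    if lang is None:
        return GENERIC_CATEGORY
    name = LANGUAGE_NAMES.get(lang, lang)
    return name + " abbreviations, acronyms and initialisms"


def category_after(lang="en"):
    # Proposed behaviour: English is assumed unless a language is given.
    name = LANGUAGE_NAMES.get(lang, lang)
    return name + " abbreviations, acronyms and initialisms"
```

The cleanup burden mentioned above follows directly from the default: every translingual or non-English entry that never passed a language now lands in the English category until someone fixes it.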

Reconstructed terms in attested languages?

How should we handle reconstructed terms in attested languages?

These arise frequently in detailed etymologies – for example, firkin comes from conjectured Middle Dutch *vierdekijn, diminutive of vierde (fourth).

These differ from reconstructed terms in reconstructed languages in that only the term is reconstructed – the language itself is attested; thus *vierdekijn is a conjectured unattested term in the language Middle Dutch, not in “Proto-Dutch” or “Proto-Middle Dutch”.

Specifically, how should we file and link to pages for such terms?

The existing policy page, Wiktionary:Reconstructed terms (WT:RT), largely deals with reconstructed languages; the category for reconstructions, Category:Reconstructions, contains only reconstructed languages; and the {{reconstructed}} template is used only for terms in reconstructed languages – not one single reconstructed term in an attested language is flagged or categorized as such, hence there is no existing practice on which to base policy.

Question 1 – where do they go?

Do reconstructed terms (in existing languages) go in the main namespace or in Appendix:? Reconstructed terms fail WT:CFI as they are not attested, but WT:RT: Entries for terms states:

A Latin reconstruction should be clearly marked as a reconstruction, but goes in the main namespace as any other Latin term would, following normal rules for inclusion.

This appears to be a mistake – following “normal rules for inclusion”, a reconstructed term does not meet CFI, because it is unattested.

We should either:

  • Amend WT:CFI to allow reconstructed terms in attested languages;
    Dubious – reconstructed terms are subject to revision as theories change: “attested” is part of CFI for good reason.
  • Clarify at WT:RT and WT:CFI that reconstructed terms do not go in the main namespace, whether the language is or is not reconstructed.

Based on existing practice, I’d assume that *vierdekijn would be filed in:

  • Appendix:Middle Dutch *vierdekijn

What should the parent category be called?

  • Category:Reconstructed terms in Middle Dutch, or
  • Category:Middle Dutch reconstructed terms?

(There’d be a “nouns” in between, natch.)

Question 2 – how to link?

Existing policy at WT:RT: References from Etymologies only refers to reconstructed languages, and prescribes the use of {{proto}}.

How to link to *vierdekijn (and other reconstructed terms)?

  • Using {term} in some way – say, link to vierdekijn but display as *vierdekijn,
  • Using {proto} in some way – namely, (optionally) removing the required “Proto-”.
  • Using a new template, say {{conjectured}}, which functions like {term} but links to Appendix, optionally adds the *, and some wording like “conjectured”.

My thoughts:

  • Overloading {term} any further seems a mistake – it’s very basic.
  • Adding a noproto argument to {proto} would be the easiest answer, and fine if we don’t want to distinguish reconstructed terms from reconstructed languages.
  • A new {conjectured} template would allow finest control; also, one could use existing ISO codes for languages.

Simple summary of my conclusion:

  • reconstructed terms in Appendix:, even if language attested;
  • add noproto to {proto}, or make new template {conjectured}.

What do people think?

—Nils von Barth (nbarth) (talk) 21:09, 10 October 2009 (UTC)
BTW, previous discussions at:
  • WT:RT: Rollback, and
  • UT:Hippietrail: Links to reconstructed terms in non-proto languages
—Nils von Barth (nbarth) (talk) 21:15, 10 October 2009 (UTC)
As I said on some of the abovelinked discussions, I see no difference between protolanguages, and what others call "reconstructed terms in existing languages". Proto-languages are collections of proto-terms, and these are the etymons that are hypothesized to have existed on the basis of comparative evidence, in order to yield existing, actually attested forms, and *vierdekijn is exactly such "proto-term". The certainty by which *vierdekijn is reconstructed is no less than some of the reconstructions in ancestral languages that are not attested at all, or attested much more scarcely than Middle Dutch.
Also: I don't see much gained by putting such reconstructed middle-forms yielding an attested form in only one language (or only one form in one language) in the appendix, and IMHO the appendix namespace should be used only for major proto-languages which have many descendants (in this case Proto-Germanic where MD reconstruction would be listed at the appropriate clade in the hierarchy). Vulgar Latin (Proto-Romance) forms that were not attested could be added as the descendants in the most-close Latin etymon (and it won't be too wrong in this particular case to list D firkin as if descending from MD vierde).
In etymologies I've been so far formatting these reconstructed non-protolanguage terms as *{{term||<form>|meaning}}. I wouldn't have anything against using some newly-written {conjectured} template, tho enforcing it may be an overkill for this whole issue, as {term} seems to be up to the task for it. --Ivan Štambuk 09:53, 12 October 2009 (UTC)
I think Ivan has pretty well summarized my opinions as well. --EncycloPetey 02:28, 13 October 2009 (UTC)

Ivan, thanks for the thoughtful write-up (and EP for concurrence). If I may summarize, both to verify that I’ve understood and to make a concrete proposal:

  • Question 1 – where do they go?
Reconstructed terms in attested languages – especially intermediate forms in etymologies – should not have an entry (a page), either in the main namespace (because they do not meet CFI) or in the Appendix (as that is reserved for Proto-languages).
Rather, they should be listed in the entry for the descendants used to reconstruct them (in the “Etymology” section), using *{{term||<form>|meaning}} (which yields *<form> (meaning) – formatted, but no link), and in the entry for the closest older term (in the “Descendants” section; e.g., Vulgar Latin in the Latin entry).
  • Question 2 – how to link?
(Don’t link; {term} formats correctly.)

This sounds like an excellent solution – it lists the form for etymology (which is their function), but skips having an entry whose only function is to fill in a chain of etymology.

—Nils von Barth (nbarth) (talk) 01:59, 14 October 2009 (UTC)
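
For bot or script authors, the agreed pattern *{{term||<form>|meaning}} can be generated mechanically. The helper below is a hypothetical sketch (the function name is made up); it only builds the wikitext string given in the discussion, with the second parameter left empty so that nothing is linked.

```python
def format_reconstructed(form: str, meaning: str) -> str:
    """Build wikitext for an unlinked, asterisked reconstructed term."""
    # Drop a leading asterisk if the caller already supplied one,
    # so the output never starts with "**".
    bare_form = form.lstrip("*")
    # Empty first positional parameter: the term is formatted but not linked.
    return "*{{term||" + bare_form + "|" + meaning + "}}"
```

For example, format_reconstructed("*vierdekijn", "gloss") gives *{{term||vierdekijn|gloss}}, where "gloss" is a placeholder for whatever meaning the etymology supplies.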

Stock symbols

Do we even want these?
We currently have Category:Stock symbols for companies with feeder template {{stock symbol}}, but I strongly doubt we want either the category or the template, while the entries themselves should all be sent to RFV or simply deleted, with the possible exception of T (AT&T). Not only are the ticker symbols duplicated on various exchanges worldwide, they change over time. For example, C was once Chrysler, and now is Citigroup. — Carolina wren discussió 17:54, 11 October 2009 (UTC)
What is the reason? That they have an average life of less than 30 years? There might be books (even literature) that refer to then-current ticker symbols, especially the ones with cutesy names or that were for popular stocks, like "T". We seem to find it easy to justify including all kinds of abbreviations from realms that are more familiar (or ideologically congenial ?), like ISO-639 codes, E numbers, etc. I would be interested to hear the arguments distinguishing this case from those others. DCDuring TALK 16:12, 12 October 2009 (UTC)
It seems to me that the case is basically identical with ISO codes, except that there is more than one registering authority. Which is to say, I wouldn't really lose any sleep over it if they all got shuffled off to Appendix-land, but I do think we are a more useful reference work for having them (and for having them in mainspace). If we got rid of these, I would want to see a global purge of all the other initialisms that don't meet normal-word criteria. But I don't really think that would be a good use of anyone's time. -- Visviva 16:35, 12 October 2009 (UTC)
I say, apply the CFI for brand names. Iff they meet it, they stay in. If not, appendicize. bd2412 T 17:40, 12 October 2009 (UTC)
But they're not brand names; why would we treat them like they are? Also, that bit about not being written "about the type of product in general" is rather problematic. Surely no one would expect to find these anywhere but in publications that are at least generally related to the stock market. -- Visviva 11:08, 13 October 2009 (UTC)
What is a stock ticker symbol? It's a stand-in for the company name, a proper noun. I have no objection whatsoever to including them in an appendix, but I'm not sure what purpose is served by reporting them as words. Someone who comes across a stock ticker symbol will almost certainly be looking at a stock ticker or the like, and will immediately know what quality of symbol they are looking at. bd2412 T 19:38, 13 October 2009 (UTC)
Unlike ISO-639 or E numbers, we've got the issues of multiple issuing authorities and no ban on reassignment of previously used symbols. Indeed, Citigroup took over C the same year Chrysler merged with Daimler. At a minimum, if generally kept in mainspace, we need to revamp the category and template, so that it could handle the multiple issuing authority concern. — Carolina wren discussió 20:14, 12 October 2009 (UTC)
But that actually suggests to me that this would be more useful than other similar classes of entries. A zeptogram will never be anything but zepto- + -gram, and en will never be the ISO 639-1 code for anything but English; but a stock symbol may have wildly different associations over time and space, associations that may not be satisfactorily documented anywhere else. Someone reading an older financial text that refers to "C" might come away with a very flawed interpretation if they assume it is referring to Citigroup rather than Chrysler. -- Visviva 11:08, 13 October 2009 (UTC)
I'll agree that a source that collects historical ticker symbols would have value, but are they used in non-tabular text, unaccompanied by the name of the company whose stock they represent? Generally, the answer to that is no, with occasional exceptions such as T which could then pass the normal CFI. E numbers are a marginal case, but they will show up commonly enough with such words as peas, carrots, and potatoes in ingredient lists. — Carolina wren discussió 17:42, 13 October 2009 (UTC)
They are in widespread use in the financial press in articles that discuss stock prices. I'm not really sure that the use of language codes occurs much more in running text than stock symbols do. We haven't been very demanding on any of the 1900 headwords that have an abbreviation-type heading. Items from the world of commerce often attract hostile attention that analogous words from the worlds of IM, computer gaming, computing generally, and linguistics do not. Consequently they can serve as a kind of miners' canary, providing useful information about entry classes for which our standards have otherwise been overly lax. DCDuring TALK 18:00, 13 October 2009 (UTC)
In the financial press I read, I've generally only seen them used in conjunction with the name of a company, never independently, as a way to enable easy lookup of data by interested readers, since financial data often is set up based on the assumption that the ticker symbol will be used to access it. — Carolina wren discussió 19:49, 13 October 2009 (UTC)
... or areas where we have an unreasonable prejudice against business. (-: -- Visviva 18:21, 13 October 2009 (UTC)
If the entries only said "stock ticker symbol", I agree that would be fairly useless. But most seem to have at least the name of the company and a link to the pertinent Wikipedia article. Information about the exchange and period of use would also be helpful. Of course, all of this is rather encyclopedic... but it's also somewhat dictionaric, and even if it were included somewhere in Wikipedia it wouldn't be likely to be easily found. So we once again face the choice of whether we would rather be useful or pure. I'm less than convinced of the community's ability to make the right choice on such matters, based on past experience; maybe I should just go out and create a few thousand of these myself, to help create some momentum in the "right" direction. ;-) -- Visviva 18:21, 13 October 2009 (UTC)
The meaningful symbols each have issuing authorities appropriate for the scope. There is some effort currently to differentiate via prefixes or suffixes securities from markets on other exchanges or similar entities. In some ways, the polysemy and duplication is reminiscent of what occurs in other realms, such as ordinary words in ordinary languages. DCDuring TALK 14:05, 13 October 2009 (UTC)

I just thought of another reason to not include most of them. In the case of ISO 639, en is a symbol for English. In the case of E numbers, E175 is a symbol for gold. In the case of stock ticker symbols, KFT is a symbol for Kraft Foods, which unlike English or gold, likely does not meet CFI. How can a symbol for something which has not met CFI, meet CFI itself? — Carolina wren discussió 19:49, 13 October 2009 (UTC)

We don’t have for your information, but we do have fyi. There certainly are cases where something might not merit inclusion, but an abbreviation or symbol for it probably would. —Stephen 20:07, 13 October 2009 (UTC)
Agreed. Furthermore, I'd dispute the claim that "en", "E175", and "KFT" are symbols for "English", "gold", and "Kraft Foods"; rather, I think they're symbols for English (the language), gold (the substance), and Kraft Foods (the company). That is, these symbols have the same referents as the corresponding plain English, and two of them are even derived from said plain English, but they are not actually symbols for the plain English. —RuakhTALK 20:23, 13 October 2009 (UTC)
Ticker symbols are for various securities. Most well known are the ticker symbols for the common equity of operating businesses, but they also exist for other securities of those companies, and for options, closed-end funds and exchange-traded funds (ETFs). I'm sure I'm missing other categories. In the financial press, the common stock ticker symbol may be used to refer to the underlying company, but the divergence between the company and its equity becomes clear whenever bankruptcy is an issue.
Just as with ISO 639 codes, the need to avoid duplication sometimes forces a choice of ticker symbol whose connection with the referent is obscure or even arbitrary.
Many of our abbreviations refer to organization names that do not meet CFI or to SoP phrases. (Though Pawley would suggest that the existence of an abbreviation was evidence of idiomaticity.) DCDuring TALK 22:53, 13 October 2009 (UTC)
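
On the practical point raised earlier in this thread, that a category or template for ticker symbols would need to handle multiple issuing authorities and reassignment over time, a data model might key each assignment by exchange, symbol, and period. The sketch below is only an illustration. The record layout and function name are invented, and the year 1998 is supplied from general knowledge for the example; the thread itself says only that Citigroup took over C in the same year that Chrysler merged with Daimler.

```python
from typing import List, NamedTuple, Optional


class Assignment(NamedTuple):
    exchange: str
    symbol: str
    referent: str
    start: Optional[int]  # first year of use; None if unknown or open
    end: Optional[int]    # year the symbol was given up; None if current


# Illustrative data only; see the caveat about the year above.
ASSIGNMENTS: List[Assignment] = [
    Assignment("NYSE", "C", "Chrysler", None, 1998),
    Assignment("NYSE", "C", "Citigroup", 1998, None),
]


def lookup(exchange: str, symbol: str, year: int) -> Optional[str]:
    """Return what a symbol referred to on an exchange in a given year."""
    for a in ASSIGNMENTS:
        if a.exchange != exchange or a.symbol != symbol:
            continue
        after_start = a.start is None or a.start <= year
        before_end = a.end is None or year < a.end
        if after_start and before_end:
            return a.referent
    return None
```

With this layout, a reader of an older financial text can ask what "C" meant in, say, 1990 and get Chrysler rather than Citigroup, which is the misreading warned about above.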

Restoring Bosnian, Croatian, Serbian sections

In a few days I will be running code to restore the standard language sections deleted with no consensus. There will be no changes made to other sections, and no changes made to the restored sections other than to tag some of them for further attention.

There will be several tests (8-10 entries) run in the meantime for inspection (;-) Robert Ullmann 01:45, 13 October 2009 (UTC)

Robert, many of those entries were created originally by Ivan Štambuk himself or other contributors who support the change to Serbo-Croatian, and then changed from Bosnian, Croatian, Serbian to Serbo-Croatian. If you are not contributing in these language(s), I don't think it's right. Anatoli 02:19, 13 October 2009 (UTC)
Once an entry (or any content) is added to the wiki, it is not the property of any contributor; deleting it without consensus is not permissible, and it must be restored. (I'll quote Štambuk himself: "... ti ne posjeduješ ovaj projekt. On nije tvoje vlasništvo, ni jednog jedinog bajta." [you do not possess/own this project, not one single byte], from hr.wikt Kafić ;-) ) The alternative is that the various contributors, including myself, will have to correct 6000+ entries manually, which we are not looking forward to. Robert Ullmann 02:40, 13 October 2009 (UTC)
Uhm, I don't remember writing anything like that in the Kafić. Please stop spreading your dirty lies.
As for the "deleting" - absolutely nothing of content was deleted. I've explained that to you many, many times. Can your brain understand that? Pretty much all of the merged entries were heavily expanded and rectified in the process.
"deleting it without consensus" - there was a consensus for 4 months while the merger was ongoing, and which you happily ignored, until you imagined it was some kind of "linguistic genocide" (and sadly, too many ignorants have succumbed to your FUD)
You can manually add individual B/C/S/M sections from the merged ones, and add the new ones from the new SC entries, I have absolutely no problem with that. What I have a problem with is this unilateral bot-running of yours, for languages you are completely ignorant of. It would generate faulty entries, and it would generate 15-20 000 more entries, all of which would have to be manually checked. It would generate absolutely no useful content at all, as all of the information of the "new" entries is already contained in the merged ones (since they're, you know, the same language). --Ivan Štambuk 09:34, 13 October 2009 (UTC)
...ti ne posjeduješ ovaj projekt. On nije tvoje vlasništvo, ni jednog jedinog bajta... was written by an IP, but in the style of Ivan Štambuk (which is unique on hr wiki), so anybody could make an error and assume that it was Ivan, when in reality somebody could easily pose as him just by using his violent style. SpeedyGonsales 13:27, 14 October 2009 (UTC)
It is, in fact, his (fixed assignment) IP address at the University of Zagreb. Robert Ullmann 05:35, 15 October 2009 (UTC)
Uhm, FYI I have dynamic IP address thru several providers, and when I edit Wiktionary (and other Wikimedia projects) from any external connection (from college or public computer) I always do it tunneled through my home computer (which is always on) for safety reasons. I've never been to Slavonia or Požega (where the above IP traces itself to), and even though I do use CARNET as one of my providers, it is also used by at least 100 000 other people in the Academia.
What is really interesting to me is that you Robert can't understand an iota of any Slavic language (Serbo-Croatian included), and yet you somehow claim out of thin air that some randomly quoted IP, that posted on Croatian Wikipedia discussion board several weeks ago, is me. How on earth did you reach that conclusion? SpeedyGonsales (desysoped and banned bureaucrat from Croatian Wikipedia) claims that it apparently bears resemblance to my style of writing, which I find silly since anyone who has read any prosaic writings of mine on Croatian Wikipedia knows very well that my preferred style of writing (which I tend to rotate depending on the occasion, mood, and the addressee) is long, baroquesque sentences with many archaic and unusual words, metaphors and idioms (I abhor the so-called "standard language", and strive to "break" it wherever I can ^_^). You couldn't have possibly made out the stylemes of my writing by means of Google Translate, or similar automatic-translation tools, which brings me to the conclusion that it was suggested to you by somebody else in off-wiki communication. Gee, I never thought I'd someday be so important that people would translate even the suspected writings of mine to some external parties by e-mail!
And as for this whole "issue", which needlessly diverged into whether that IP address is me or not: It was addressed already in June when DCDuring here in Beer Parlour expressed concern on whether the "merged" entries would belong to the contributors who opposed the "unified treatment". Back then basically all (99%) of the individual B/C/S entries were in fact created by contributors who were pro unified treatment, and I also added an aside that that issue is irrelevant in any case, because people do not "own" the entries they created, but that we might as well take it into consideration, as a token of good intentions towards the potential (back-then, non-existent) SC contributors who might oppose the common treatment, to only merge the entries created/edited by those supporting the proposal (which in practice meant pretty much all of the entries). And that has been in practice ever since.
However, reducing the legitimacy of a creation of these entries from page histories to whether we could add them or not is simply clouding the issue altogether: A great deal of these entries have some problems (see below), absolutely none of them adds anything useful to the already-present ==Serbo-Croatian== entries (which even Elephantus (talk • contribs) figured out soon, when he started re-creating ==Croatian== entries not from page history, as he did initially, but by copy-pasting ==Serbo-Croatian== entries to ==Croatian==, the only change being the switch of sh [the Serbo-Croatian ISO code] to hr [Croatian ISO code]), and the whole effort seems to be an evil-minded exercise in "how vicious can we be to Štambuk", with you unilaterally announcing it here in Beer-Parlour, not notifying the relevant contributors on the relevant talk-pages for feedback, ignoring the requests to ask for a vote sanction first, which is required not only per bot policy, but also given the controversy of all this, and esp. doing all so in an accusatory tone (quoting an alleged IP of mine, which was supposed to retort to counter-arguments), in the company of a couple who, to put it mildly, don't really suggest good intentions of your actions. --Ivan Štambuk 12:53, 15 October 2009 (UTC)
So long as the Serbo-Croatian sections aren't deleted when the Bosnian, Croatian, and Serbian sections are restored I don't see a major problem. That same lack of consensus over how to treat Southwest Slavic works both ways. I do see a minor problem depending on how you intend to manage the restoration. If the tool will be looking through page histories for Bosnian, Croatian, and Serbian sections deleted when a Serbo-Croatian section was added in their place, and adding them exactly as they were prior to deletion, that's one thing, but I can't see trusting a bot, even one that is being human monitored, to recreate Bosnian, Croatian, and Serbian sections from a Serbo-Croatian section. After all, the whole raison d'être of having them as separate sections is that there may be differences. — Carolina wren discussió 03:31, 13 October 2009 (UTC)
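For readers unfamiliar with what "restoring sections from page history" would involve, here is a minimal sketch in Python. It is not the code of any actual bot discussed here: the function names are hypothetical, fetching revisions from the wiki is left out entirely, and it assumes language sections start with a level-2 `==Language==` header and are separated by `----` lines. It only copies old sections back verbatim (the first approach described above); it does not derive anything from the Serbo-Croatian section.

```python
import re

# Level-2 header such as "==Croatian=="; "===Noun===" must not match.
L2_HEADER = re.compile(r"^==([^=].*?)==\s*$", re.MULTILINE)


def split_l2_sections(wikitext):
    """Return {language: section text} for each ==Language== section."""
    matches = list(L2_HEADER.finditer(wikitext))
    sections = {}
    for i, m in enumerate(matches):
        end = matches[i + 1].start() if i + 1 < len(matches) else len(wikitext)
        text = wikitext[m.start():end]
        # Drop a trailing "----" separator so sections can be re-joined cleanly.
        text = re.sub(r"\s*-{4,}\s*$", "", text).rstrip() + "\n"
        sections[m.group(1).strip()] = text
    return sections


def restore_missing_sections(old_text, new_text, languages):
    """Append sections present in the old revision but absent from the new one.

    Existing sections in new_text are never modified or removed.
    Restored sections are appended at the end, not sorted alphabetically.
    """
    old = split_l2_sections(old_text)
    new = split_l2_sections(new_text)
    restored = new_text.rstrip() + "\n"
    for lang in languages:
        if lang in old and lang not in new:
            restored += "\n----\n\n" + old[lang]
    return restored
```

Called with an old revision containing ==Croatian== and ==Serbian== and a current revision containing only ==Serbo-Croatian==, this would leave the Serbo-Croatian section untouched and append the two old sections after it. Whether the restored text is still accurate is a question no such script can answer.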
Tell me Carolina Wren, will you be helping to clean up the dreck generated by Ullmann's bot, or are you simply seeing no problem with it because it wouldn't affect your work here at all? --Ivan Štambuk 09:36, 13 October 2009 (UTC)
If anybody doesn't know why I write above that Ivan Štambuk is violent, here is proof (just one of many): "cleaning up the dreck generated by Ullmann's bot", a clear violation of "assume good faith", because Ivan assumes that the bot will make errors even before agreement is reached on whether any action will be done by the bot. I know it is hard when admin is breaking the rules of project, but some measure of civility should be held. SpeedyGonsales 13:27, 14 October 2009 (UTC)
I don't simply "assume" that bot will make errors, I know it will, and I already listed below (in reply to Visviva) some cases where it would introduce errors requiring manual cleanup. I gave an example of one of the test edits, where bot restored the wrong etymology, which has been fixed in the merged entry. It would also restore some of the obsolete templates and sections, not to mention ambiguous and sometimes downright wrong definition lines (which were fixed and expanded in the merged entries), introducing not only ethnic imbalance (by not generating sections for all the four modern Serbo-Croatian standards), but also misleading the poor reader of Wiktionary that there actually are some differences in the meanings among standards, where there are none (cf. rog#Bosnian and rog#Croatian). The question on whether the bot will make errors or not is completely orthogonal to the question on whether the agreement is reached of bot being run or not - as long as all the bot does is unintelligently restoring old sections from page history, there would be errors, no doubt about that.
"I know it is hard when admin is breaking the rules of project, but some measure of civility should be held." - Uhm, what rules of the project? :) AGF principle applies prevalently to the very first edits of the newly-arrived contributors (before they gain experience of how things are handled here, not only because lots of common practice is not written in the help pages, but because making quality edits requires a bit more technical knowledge than on other Wikimedia projects), and to what appears to be disruptive edits by some of the regulars. What you are trying to do here is to "prove" that I am somehow "violent", and should be "sanctioned" :) My reply to Carolina Wren was along the lines of the old saying lako je tuđim kurcem gloginje mlatiti :) The point is, that those whose edits here are not being directly affected by the consequences of running this bot, esp. those not even knowing the language in question thus being unable to assess the cleanup effort it would induce, don't really have the moral ground to bless its running. Have you checked the validity of those test edits by UllmannBot (talk • contribs)? If you haven't, I don't really care what you think :) --Ivan Štambuk 13:18, 15 October 2009 (UTC)
Yes, don't delete Serbo-Croatian sections. Some questions:
  1. Will this bot deal with Cyrillic entries? Mr. Štambuk created equivalent Cyrillic entries of the Latin ones. And the bot should try to keep the Cyrillic & Latin entries in sync, otherwise confusion will reign.
  2. Will this bot deal with Template:sr-noun and sh-noun since they aren't the same? Instrumental & locative are swapped around.
  3. What will it do with accent markers?
  4. Does your bot know what to do with yat reflexes?
Hopefully, one day, Croats will stop using Serbian vukopis. Serbs don't use Kajkavian & Chakavian and Croats should embrace these dialects more. I feel bad for Mr. Štambuk, since this is mostly his work, but at least this bot could save him time if it will keep Cyrillic and Latin entries in sync.--Pepsi Lite 10:00, 13 October 2009 (UTC)
What do you mean by "Serbian vukopis"? Does that alphabet have a Serbian ethnical marker attached to it? Latin script used for Serbo-Croatian today is mostly a result of the work of Ljudevit Gaj and his associates.
I generate Cyrillic-script Serbo-Croatian entries from Latin ones automatically by means of the program I wrote - first I write the Latin-script entry manually, copy it to clipboard, click on the Cyrillic-script redlink linked to in the inflection line, and do CTRL+V (i.e. "paste") - it comes out as Cyrillic. That process can be 99.99% automated. It handles accent marks, sc=Cyrl, various templates ({term}, {l}..) and context markers. All the edits to either of the scripts are kept in sync manually that way.
Kajkavian and Čakavian are dying out for the last 5 centuries, and will have no native speakers at all by the end of this century. There is a diglossia with Štokavian wherever Kajkavian and Čakavian are spoken, and that's only in the rural areas; considering the ever-increasing urbanization, and esp. the fact that they're non-literary, with negligible actual written literary output, they're interesting today only as historical devices of communication to linguists (and a badge of "Croatdom" for some Croatian nationalists). --Ivan Štambuk 10:40, 13 October 2009 (UTC)
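The Latin-to-Cyrillic conversion described above can be illustrated with a short Python sketch. This is not the program mentioned in the comment, which is not shown anywhere in this discussion; the function name and structure are assumptions made for illustration. It covers only the plain letter mapping between Gaj's Latin alphabet and Serbian Cyrillic, including the three digraphs, and leaves everything else (combining accent marks, punctuation, template names) untouched.

```python
# Digraphs of Gaj's Latin alphabet that map to a single Cyrillic letter.
DIGRAPHS = {"lj": "љ", "nj": "њ", "dž": "џ"}

# One-to-one letter correspondences.
SINGLE = {
    "a": "а", "b": "б", "c": "ц", "č": "ч", "ć": "ћ", "d": "д", "đ": "ђ",
    "e": "е", "f": "ф", "g": "г", "h": "х", "i": "и", "j": "ј", "k": "к",
    "l": "л", "m": "м", "n": "н", "o": "о", "p": "п", "r": "р", "s": "с",
    "š": "ш", "t": "т", "u": "у", "v": "в", "z": "з", "ž": "ж",
}


def latin_to_cyrillic(text):
    """Transliterate Latin-script text to Cyrillic, letter by letter."""
    out = []
    i = 0
    while i < len(text):
        pair = text[i:i + 2]
        if pair.lower() in DIGRAPHS:
            cyr = DIGRAPHS[pair.lower()]
            # Capitalise when the first Latin letter is a capital (Lj, LJ).
            out.append(cyr.upper() if pair[0].isupper() else cyr)
            i += 2
            continue
        ch = text[i]
        cyr = SINGLE.get(ch.lower())
        if cyr is None:
            out.append(ch)  # not a mapped letter: copy unchanged
        else:
            out.append(cyr.upper() if ch.isupper() else cyr)
        i += 1
    return "".join(out)
```

A known limitation of this naive approach: it always treats "nj", "lj" and "dž" as digraphs, so words where the two letters belong to different morphemes (for example injekcija or nadživjeti) would come out wrong and need an exception list. Precomposed accented vowels are also passed through unconverted.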
Kajkavian and Čakavian are dying out for the last 5 centuries - a statement as false as "water is running uphill". After the work of Ljudevit Gaj (19th century), Croatian people chose the Štokavian dialect as the base of a unified language (in short). Ivan Štambuk thinks we are in the 24th century? Every language which is not used dies, that is obvious. But still today both the Kajkavian and Čakavian dialects are very much alive, maybe not so much in written form, but you just need to travel a bit, and you will hear as colorful a richness of Croatian dialects as you can imagine. People like Isaac Asimov assumed in their books that some day we will all speak some mixture of English and Russian, but those are SF books. Nobody really knows the future; we can make an educated guess about it, but it is still nothing more than an educated guess. Wiktionary (same as Wikipedia) should describe reality, not be used as a tool to enforce somebody's vision of how it should be. And in reality there are 3 different languages (Bosnian, Croatian, Serbian) and one forming (Montenegrin). And although they are similar to some extent, every language has the full right to be described until it ceases to exist, either by being joined to another language, or because there are no living speakers of it. And Latin is a nice example here: nobody will deny that it is a dead language, but as it is still used in some professions, it also has its Wikipedia. Štambuk use word nationalist every now and then to antagonize his opponents (which should be discouraged on Wikimedia projects), but we should only stick with reality, as it is the only thing that matters. SpeedyGonsales 13:59, 14 October 2009 (UTC)
"Današnje područje čakavštine znatno je manje negoli je bilo prije migracija izazvanih osmanskim osvajanjima velikoga dijela hrvatskoga jezičnog prostora, a čakavsko područje se i danas smanjuje pod pritiskom književnog standarda. Tako je prije jednog stoljeća, uoči 1. svj. rata čakavski govorilo blizu 1/4 Hrvata ili oko 23%, a danas dvostruko manje ili tek 12%, pa je to najugroženije narječje hrvatskog jezika, najbliže izumiranju."
Translation from the article on the distribution of Čakavian from Croatian Wikipedia:
"Today's spread of the Čakavian dialect is significantly lower than it used to be, prior to the migrations caused by the Ottoman conquest of a great deal of Croatian linguistic area, and Čakavian-speaking area is diminishing even today, under the influence of literary standard. Hence, a century ago, at the eve of WW1, Čakavian was spoken by approximately 1/4 or 23% of Croats, and today barely 12% of Croats speak it, which makes it the most endangered of all the Croatian dialects, the closest one to extinction."
This was the spread of Čakavian in the Middle Ages (an image from the book by Dalibor Brozović, renowned Croatian Slavist), and this is today. As one can see, it has been gradually decreasing in the territorial distribution, from hinterland towards the islands. Former exclusive Čakavian urban centres such as Split and Rijeka are today completely Štokavianised. The same thing can be said for the Kajkavian dialect area - the foremost example is the city of Zagreb, Croatia's capital, which used to be Kajkavian a few centuries ago, but is today 100% Štokavian (the only traces of Kajkavian left in the speech of its dwellers are the interrogative pronoun kaj, stress-based accentuation, and a few lexical items in the speech of common people, especially older folks). To my knowledge, in the course of centuries exactly nowhere has the reverse trend been observed - Štokavian speeches always ousted Čakavian and Kajkavian speeches. By exercising pure commonsense logic, taking into consideration the immense increase of mass media, urbanization, education of all classes of people as well as language standardization in the last half-century, and especially the fact that today Čakavian and Kajkavian are virtually dead as literary languages, it's safe to assume that in the following century they'd be reduced to the brink of extinction. That process is irreversible and unstoppable. It's just a matter of time before inertia and ignorance finish what the Turks initiated 600 years ago :)
Nicely put, but that only shows that the Kajkavian and Čakavian dialects are slowly dying, not that they have been dying for 5 centuries (if slower, even longer; if faster, much shorter). And as for the irreversible and unstoppable process, I would only mention the example of the Norwegian languages, which are nice examples of what can be done if people want to change their language. Once again, Wiktionary should describe today's reality, not something somebody thinks will be the case in 1, 5 or 10 centuries. SpeedyGonsales 03:23, 16 October 2009 (UTC)
And in reality there are 3 different languages (Bosnian, Croatian, Serbian) and one forming (Montenegrin). - In reality there is one linguistic entity, in dialectology usually called Neoshtokavian, more commonly known as Serbo-Croatian, actualized in 4 national standards whose mutual differences in grammar could fit on some 2 pages of text :) Now, whether you call these national standards "languages" or not is a matter of somebody's perception. Most foreign linguists would rather treat them as one pluricentric language with regional variants, such as we already have for English, Spanish, German, Portuguese... In the Balkans the word language bears much more value of identity than it does in the rest of the world, so insisting that one can talk of "different languages", strictly on the basis of what are - let's be honest - trivial differences in spelling and pronunciation, becomes a confirmation of self-identity. You speak of "sticking with reality" - that's exactly what we're doing! :) Linguistic reality is that there are 4 dialectal systems (really 4 languages) - Čakavian, Kajkavian, Štokavian and Torlakian. The third one on the list, a particular innovative speech of it to be more precise (Neo-Štokavian), is used as the basis of the standard, codified language of all the 4 nations of that 4-part dialect cluster. If we were to truly describe linguistic reality, we'd have L2 ==Shtokavian==, ==Kajkavian==, ==Chakavian== and ==Torlakian== (as Millosh once suggested), but that would be pointless as we're not building a dialectological dictionary but a dictionary that people would use to learn the language of communication, i.e. the standard language.
Štambuk use word nationalist every now and then to antagonize his opponents - I use it because the bulk of the objections against the SC proposal come/came on nationalist grounds, as I explained above. Voting "no" on the Unified Serbo-Croatian vote page was perceived as a step to re-affirming one's "Croatdom" (hrvatstvo) by many of the voters. Some even requested that, can you imagine that, they'd be apologised to, due to "being insulted" :) Most of the Balkans still lives in the 19th century state-language-nation fairy tales, a state of mind commonly described by the word nationalist. Sorry, but to ignore the nationalist dimension in all this would be to simply play dumb. FFS, people deliberately lie when discussing the history of "Croatian language", as if the choice of the Neoštokavian dialect as the literary one had absolutely nothing to do with Karadžić and Serbs, which is a ridiculous fabrication of history. Vienna Literary Agreement? Never happened! :)--Ivan Štambuk 15:12, 15 October 2009 (UTC)
Sadly, you explained nothing. To ignore the national dimension when we are talking about language is more than dumb, but to ignore the rules of Wikipedia & Wikimedia, as you tend to, is rude. You don't have the right to call users of a Wikimedia project nationalists! Say what is true or false in the words of others, but stop giving others tags of nationalists or any other ists; it's rude, FFS. SpeedyGonsales 03:23, 16 October 2009 (UTC)

Ullmann, you need to pass a vote to do any kind of such large-scale modifications in the main namespace, esp. for languages you have absolutely no bloody clue about. You can start a proposal, and then I'll explain to you why your brain-damaged bot wouldn't work (as I've already partially done, but you seem to simply ignore any kind of discussion). --Ivan Štambuk 09:23, 13 October 2009 (UTC)

I agree that this would require a vote. (If he wants to go through and do it manually, that's another issue — there's no consensus to forbid it — but bots are only for implementing consensus, not for exploiting the lack of it.) —RuakhTALK 16:15, 15 October 2009 (UTC)
I would like a vote: is such language as "brain-damaged bot" allowed on this project? Or can I start calling every bot I dislike brain-damaged? Ruakh, maybe you have a bot. Is it damaged? Or? SpeedyGonsales 03:23, 16 October 2009 (UTC)
(don't worry, the bot doesn't care if it is called "brain-damaged". And since he applies that word to everyone who disagrees with him, it is mostly meaningless; "brain-damaged bigot", etc ;-) Robert Ullmann 06:12, 16 October 2009 (UTC)
"you need to pass a vote to do any kind of such large-scale modifications in the main namespace" Quite so. I entirely concur. When you set out to make "large-scale modifications" by deleting 3 languages from the Wiktionary, and forcibly merging them into "Serbo-Croatian", you discovered that it was contentious, and disputed, and yet continued. You then set up a vote, and continued making the modifications while the vote was running. When the vote failed, and your modifications were rejected, you continued as if the vote had not occurred.
Now you have the absolute temerity to demand that your entirely unauthorized "large-scale modifications" not be undone?
The 5,427 standard language content sections improperly deleted must be restored. The brain-damaged bot can be used to restore them to the status quo ante. If Mr Štambuk wants to insist that they must instead be restored manually, then I believe we must insist that Mr Štambuk personally do all of those manual restorations to his desired standard of quality before being permitted to engage in any other activity on the Wiktionary. Robert Ullmann 06:12, 16 October 2009 (UTC)

Missing Citations page should link to Wiktionary:Citations

To a casual user who just encountered a new citation, which might never be recorded in wiktionary if not recorded now, the Citations page seems to be the place to put it, something like sending a letter to the OED in the old days.

However, it seems that putting such a note on the citations page is likely to get it deleted without a trace, causing the citation to be lost forever. See for example the recently deleted Citations:paronomasiac which cited the NYT Magazine On Language column.

BTW I am not going to add it back. IMHO that is SemperBlotto's job.

So there should be more instructions indicating how to get a citation recorded permanently by someone using a mobile device which just can't accomplish a properly formatted entry.

—This unsigned comment was added by Archimerged (talk • contribs) 23:50, 13 October 2009 (UTC).

Well, seeing as you didn't include the actual quotation, misspelled the referent's name (he's William Safire, not william saffire), and seem to have gotten the headline wrong (at least, the one you included does not match that on the NYT Web-site), your contribution was not so much a citation as a vague pointer to where we might find one — fodder less for the citations page than for the discussion page. And given that the word gets well over a hundred hits on Google Book Search, I can't say that such a pointer is really all that necessary, anyway; so, I don't blame SemperBlotto for having deleted it. Recent-changes patrolling always involves this sort of trade-off; you don't get anywhere if you try to fix every entry, so you always have to make a judgment call about whether what's there is better than a redlink. In this case, a reader clicking the bluelinked "citations" tab would have been sorely disappointed; the citation in the entry was far more useful. (However, in this case, I've restored the page, and tracked down the citation and fixed the page, so you can see what sort of thing we have in mind.) —RuakhTALK 01:50, 14 October 2009 (UTC)
Thanks for fixing the entry. It was written using a kindle. You can't make capital letters (the shift key is ignored by the experimental web browser). Any quotations have to be retyped by hand on a tiny keyboard from memory, and there is no spell check. While reading the paper on the kindle and away from a computer, I looked up "paronomasiac" wondering if it was a new word, and because the entry doesn't link to "paronomasia" I thought it might be a word coined in 2004 by Spider Robinson, so seeing it used in the NYT would be an important piece of data. I had a choice: try to find out elsewhere if the citation is important (but then I wouldn't have time to record it), or enter it on the citations page. There is no indication that the citations page isn't supposed to be something like the talk page anyway.
Anyway, I think the important thing is that the page displayed from a red "Citations" link ought to say a little about what citations pages are for and link to the official policy page (which is quite difficult to find). Also it ought to link to a place where pointers to citations can be recorded (or say they should be recorded on the talk page), for people using mobile devices who encounter possibly important citations. After all, a dictionary is not just for people who want to look up words. It is for recording the slow alteration of the language. — Archimerged 01:52, 15 October 2009 (UTC)
This is a good point; we should have something similar to the current warning message for templates. Does anyone know where that text that appears on a template edit page comes from? I can't find it anywhere in MediaWiki or template-space. -- Visviva 02:58, 16 October 2009 (UTC)

Useless link at Wiktionary:Discussion rooms

Wiktionary:Discussion rooms lists Bug reports under "Other places to congregate" which was eliminated four months ago and now redirects to the Grease pit. Anyone have any ideas on the best way to get rid of it without making the entire table look terrible? --Yair rand 13:50, 15 October 2009 (UTC)

Better? —msh210 16:42, 15 October 2009 (UTC)
Great. --Yair rand 18:19, 15 October 2009 (UTC)

Names of specific entities (again)

I thought I'd copy the URL to save time (here). The several problems I can find (for me, anyway) are:

  1. What is a specific entity? Proper nouns aren't always specific entities, like Stephen, as you can have several Stephens. So it refers to things that there is only one of, right? Still, ambiguous.
  2. Used attributively. Okay I'll spare you use#Verb but attributively and attributive aren't very detailed and give almost no help. If it's just the grammatical sense of attributively, then almost any place name will meet CFI because place names (in English) don't have adjectival forms. So I'm from Leeds. A Leeds taxi, a Leeds restaurant. All of these attributive, right? I'd think as long as the place name is not extremely, the attributive form won't be either.
  3. Widely understood meaning. A debate broke out over Daffy Duck that even though it can be cited attributively, it doesn't have a widely understood meaning. What does that even mean? I mean Daffy Duck means a cartoon duck, right? In the same way that Leeds means a city in West Yorkshire. Is there another meaning of Leeds? Or Confucius means a 5th Century BC philosopher. AFAICT "widely understood meaning" doesn't put any limits on what the meaning is, it just has to have one.

Admittedly (and I can barely stress this enough), I don't really have a better idea, but maybe someone else does? Mglovesfun (talk) 10:10, 19 October 2009 (UTC)

I have boldly edited Wiktionary:Editable CFI#Names of specific entities to reflect my understanding of this annoyingly oracular passage. I hope that those who disagree will boldly revert or revise my edit, and perhaps we can eventually work out something that will have a working consensus behind it. Here's the problem as I see it: the lack of clarity of the current wording has given everyone cover to read whatever they want into it (or to ignore it entirely). Any efforts to clarify it thus mean that someone's ox will get gored, and so are voted down for the most ridiculously disingenuous reasons. It's worth reiterating that there was never any consensus for CFI to be set in stone; the current state of {{policy}} is purely the product of some erratic editing by Richardb and Connel in 2006. -- Visviva 10:57, 19 October 2009 (UTC)
That's already about 50 times better. If we could have a few more people edit it and then have a vote on it, I would be pretty happy. But at the very least, it has to be worded in a way that is a lot less ambiguous, I tend to think that everything is at least a little bit ambiguous. But hey, c'est la vie. Mglovesfun (talk) 11:16, 19 October 2009 (UTC)
For (1) see Appendix:English proper nouns#Proper nouns as common nouns. Visviva got my understanding of (2+3) right (a definition independent of the referent). --Bequw → ¢ • τ 14:17, 19 October 2009 (UTC)

Until we can agree on sets of proper nouns that should be included in their own right, I don't see a successful vote to make the change. The current proposal makes one criterion clearer, but rewords it in such a way as to exclude most names of countries, etc., which is a set of proper nouns most community editors have supported for inclusion in the past. I have made one small edit to the last sentence, since the way that it was worded would require that we include Thomas Jefferson (or any other full name) if there were more than one individual with that name, which is probably not what was intended. --EncycloPetey 01:52, 20 October 2009 (UTC)

Daily dump RSS feed

For anyone interested, I have created an RSS feed for our daily dumps. You can subscribe at http://www.devtionary.org/cgi-bin/feed.pl - please report any bugs or feature requests. I've tried to make it as similar as possible to the RSS feeds for the official dumps from Wikimedia. — hippietrail 00:09, 20 October 2009 (UTC)

Translingual translations

Corvidae. On the surface this seems rather contradictory. If there are translations, then how can it be called 'Translingual'? Nadando 21:12, 20 October 2009 (UTC)

These terms are translingual by virtue of being understood by biologists the world over. HOWEVER, they are, in practice, New Latin, and as such pretty much all the taxonomic levels higher than genus have mostly standardised forms in vernacular languages. Usually this is a standard form for the suffix combined with agreed rules of transliteration, although many languages function differently in assigning vernacular terms. Circeus 01:33, 21 October 2009 (UTC)
I supported these once, but looking at Corvidae#Translations, it's obvious how badly this can go wrong. At best, these seem like translations of "crow family"; some, however, bear a strong resemblance to wikispecies:Corvidae#Vernacular names, which suggests that they just mean "corvid". So either we come up with a standard header template for these that makes very clear that only non-Roman exact equivalents should be entered, or we just need to remove them entirely. The linked entries can get by on their own, I think; appendices can take care of the cross-referencing. -- Visviva 02:26, 21 October 2009 (UTC)
Come to think of it, isn't crow family idiomatic enough for inclusion? Why don't we just move all of the "translations" thither? -- Visviva 02:32, 21 October 2009 (UTC)
Crow family is hardly idiomatic in English. We already have corvid#Noun.
But, in principle, couldn't there be even multiple transcriptions of these taxonomic names into the various scripts. And since we have translations into languages, not scripts, we could have multiple ones for each language using, say, the Cyrillic script. Is it more standardized than that? DCDuring TALK 02:48, 21 October 2009 (UTC)
How is it not idiomatic? It's a proper noun, even. (Or if it isn't, we have a whole lot of mislabeled entries.) How would someone guess the specific meaning from crow + family? Or how would a non-English speaker know that it was the "crow family" and not, say, the "jackdaw family"? On the other hand, it's difficult to say that Corvidae has a distinct meaning apart from "the corvids", so perhaps there is no real need for a translation handle. On the third hand, 까마귓과 means "Corvidae" and definitely not corvid. So there is a risk of orphan translations (not that that's a huge problem in itself).
I don't really know how these issues are handled outside of the CJK scripts, but w:Corvidae has (unsurprisingly) very different interwikis for Russian and Kazakh. Neither is a transliteration. -- Visviva 03:33, 21 October 2009 (UTC)

Surely the correct solution is that foreign words meaning corvid belong at corvid#Translations. The only translations that should be at Corvidae#Translations are scientific names in non-Latin languages, such as Врановые and カラス科. (Note that these are not always transliterations. Russian Врановые, for example — Vránovye — is not a transliteration of Corvidae, but instead appears to be natively Slavic.)

Of course, this raises a related issue: the question of how to link from Corvidae to corvid. Users certainly should be able to navigate directly from Corvidae to corvid, but where should the link go? The answer is not as obvious as one might suppose, and a few possibilities occur:

  1. Within the definition itself. Thus the definition for Corvidae would be something like one of the following:
  2. As a descendant. Etymologically, corvid derives directly from Corvidae; it is common for vernacular names to derive directly from scientific names in this way, when no vernacular name already exists. Of course, this is not always true; animal does not derive from Animalia. Also, users looking for a vernacular name might not expect to find it under "Descendants".
  3. As a related term. Seems sensible, except that WT:ELE#Related terms indicates that the "Related terms" section is exclusively for terms that are in the same language.

—Caesura(t) 13:12, 1 December 2009 (UTC)

Morse code

How should we treat Morse code (and related codes such as Russian Morse code)? In its basic form, Morse code is just a character encoding scheme. Some encoding schemes don't get entries for their individual characters, such as signal flags or semaphores, while others do, such as Braille (because they have their own Unicode codepoints) and ASL manual alphabet (because we have a consistent transcription scheme and because it's part of real language like ASL). Aside from single letter encodings, Morse code can represent control characters (with Prosigns), and special letter combinations have specific meanings (special abbreviations, Q codes, and Z codes). As I see it, there are several things that should be decided.

  1. Currently, our Latin letter pages (and "Variations of .." pages) show Morse code but link to the audio representation. Should we extend this to multi-letter terms? If so, would we extend the other encoding systems (e.g. semaphores) to multi-letter terms as well?
  2. Do we have entries for individual Morse code encodings ("..." = 'S')? One problem is that there is no standard textual transcription of Morse code (see Wikipedia discussion). For "dots" people use periods '.' (which is problematic for linking), bullets '•', and middots '·'. For "dashes" some use the hyphen-minus '-' and some the em-dash '—'.
  3. Do we list special letter combinations, and if so which ones? Some, like SOS and CQ are common outside of Morse code, but many are not. If we do include them, do we represent them in Latin characters or as Morse code sequences? ... --- ... ("SOS") used to exist but was deleted for being "encyclopedic".

I'd say for #1 that we should only do single letters. I'm not sure about #2. For #3 I'd like the terms listed in Latin letters but not in Morse code. Anyone else have thoughts? --Bequw → ¢ • τ 16:50, 22 October 2009 (UTC)
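To make the transcription problem in #2 concrete, here is a minimal sketch of how variant dot and dash characters could be folded into one ASCII form before lookup. The table holds only a few letters for illustration, and the function names (normalize, encode, decode) are hypothetical, not part of any existing Wiktionary tool.

```python
# Partial International Morse table, for illustration only.
MORSE = {"S": "...", "O": "---", "C": "-.-.", "Q": "--.-"}

# Variant characters mentioned in the discussion, mapped to ASCII.
DOT_VARIANTS = "\u2022\u00b7"   # bullet, middle dot
DASH_VARIANTS = "\u2014"        # em dash

def normalize(code: str) -> str:
    """Map variant dot/dash characters to '.' and '-'."""
    table = {ord(c): "." for c in DOT_VARIANTS}
    table.update({ord(c): "-" for c in DASH_VARIANTS})
    return code.translate(table)

def encode(text: str) -> str:
    """Encode Latin letters as space-separated Morse symbols."""
    return " ".join(MORSE[ch] for ch in text.upper())

def decode(code: str) -> str:
    """Decode space-separated Morse symbols, accepting variant characters."""
    reverse = {symbols: letter for letter, symbols in MORSE.items()}
    return "".join(reverse[sym] for sym in normalize(code).split())
```

With a normalization step like this, any of the competing transcriptions would resolve to the same letters, whichever one is chosen for page names.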

[e/c] I think we should not include Morse-code pagenames (#2), in part because (per Bequw) there's no unicodification of the Morse code. Things like QSL and SOS (#3) we should have; attestation can be easy, by means of transcribed telegrams quoted in durably archived sources. For #1, Bequw seems to be asking whether an entry like [[foo]] should include the Morse-code representation of foo as... an =Alternative spelling=? a =Trivia= fact? Either way, I think not. That's an easy one-to-one correspondence anyone can figure out by checking a table of codes, so there's no reason to include it; and a downside is the wasted screen space.​—msh210 17:14, 22 October 2009 (UTC)
Can't hurt to have entries for the code for individual letters. I'd support having the entry for the code for something like SOS if citations could be found that indicate that a reader might come across it 'in the wild' without a referent to the letters for which it was a code. bd2412 T 17:03, 22 October 2009 (UTC)
Wiktionary is not Unicode's bitch. All for inclusion of letters by whichever transcription is preferred. Terms would need to be cited where they are intended to be discernible. DAVilla 06:31, 9 November 2009 (UTC)
I agree (with DAVilla). —RuakhTALK 11:32, 9 November 2009 (UTC)
I seem to remember adding some of the individual letters some time ago - but they all got deleted, "Morse code" not being a "language". So I just added the table to our entry for Morse code instead. I wasn't able to get the dots and dashes to align properly though. SemperBlotto 11:39, 9 November 2009 (UTC)

Replace Category:Old French plurals with Category:Old French noun forms

Basically it's the simplest way of dealing with cases like none, which is the nominative singular of nonain. I've had a go at updating the relevant template ({{fro-noun}}); if I've screwed anything up, do fix it or tell me. User:Widsith is the only other user (that I know of) that knows anything about Old French. So, anyone object? Mglovesfun (talk) 20:40, 24 October 2009 (UTC)

I'm strongly in favor of anything that gets us more Middle French, Norman French, or Old French entries because of their importance to English etymology. For that purpose, any simplifications that lead to more lemma entries would be great. DCDuring TALK 22:35, 24 October 2009 (UTC)
Do we have anything even close to policy describing when it is appropriate to ditch plurals and go with noun forms instead? I'd been thinking of nominating Category:Catalan plurals for deletion and simply leaving all the non-lemma forms in Category:Catalan noun forms (and Category:Catalan adjective forms, since at present the plural adjective forms are ending up in there as well). — Carolina wren discussió 23:14, 24 October 2009 (UTC)
Well Old French has a case system, (nominative and oblique). I sometimes wonder what to do about stuff like joueuse which is just classed as a noun right now, which is okay, but it could be considered a noun form. It's easy to see with Russian, Latin, Greek and whatnot, that when there's a case system, the only other sensible option would be Category:Latin genitive singular forms (and about 11 others) which is why we have Category:Latin noun forms. Other input? Mglovesfun (talk) 13:22, 25 October 2009 (UTC)
Support. I think that [[Category:langname plurals]] only makes sense for a language like English, where only one part of speech has a plural form, and said plural form cannot be marked for anything else (such as definiteness or case). Something like [[Category:langname plural nouns]] or [[Category:langname noun plurals]] could work for a language like Catalan or Modern French (especially if we treat pairs like cousin ~ cousine as being two related lemmata; I know that some people consider it to be one noun with an inflection to indicate its referent's natural gender), but the more general solution of [[Category:langname noun forms]] seems best. (Personally, I wouldn't oppose an all-out split — stuff like [[Category:Old French nominative plural nouns]] — but when we've discussed this in the past for verbs, the general attitude has seemed to be in favor of one big catch-all category for non-lemma forms of a given POS.) —RuakhTALK 13:03, 26 October 2009 (UTC)
Such categories were created on the basis of the English category model for parts of speech. There are these for many more languages that have additionally marked plural forms, and which also ought to be deleted. Some of these grew quite large (e.g. Category:Hungarian plurals). Deletion of these should best be discussed on an individual language basis, but the creation of new such categories for languages which inflect nouns for more than 1 plural number should IMHO be strongly discouraged, especially if there is any kind of syncretism (e.g. nominative plural and accusative plural forms are the same). --Ivan Štambuk 15:25, 28 October 2009 (UTC)
  • I don't object. I have always read "[lang] plurals" as "[lang] nominative plurals" anyway, but given that there are only two cases in OF, it does seem a bit weird, I agree. Ƿidsiþ 13:59, 26 October 2009 (UTC)
    • If nobody objects, I may as well get on with it. Mglovesfun (talk) 08:51, 28 October 2009 (UTC)

More advanced Wiktionary queries

I've been hard at work finding more stuff to index. I now have working which scripts are used in page titles and I'm working on Unicode Collation Algorithm sort keys. Here's the first new result:

  • User:HippieBot/Pages with mixed-script titles

Let me know if you have some more ideas. — hippietrail 03:28, 26 October 2009 (UTC)
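As a rough illustration of the mixed-script check described above, here is a minimal sketch that approximates a character's script from the first word of its Unicode name. This is an assumption made for brevity and is not how HippieBot works; a real implementation would use the Unicode Script property, and this approximation would also flag ordinary Japanese titles that mix kanji and kana.

```python
import unicodedata

def scripts_in(title: str) -> set[str]:
    """Approximate the scripts in a title from Unicode character names."""
    found = set()
    for ch in title:
        if not ch.isalpha():
            continue  # skip digits, spaces, punctuation, combining marks
        name = unicodedata.name(ch, "")
        if name:
            # First word of the name, e.g. 'LATIN', 'CYRILLIC', 'GREEK'.
            found.add(name.split()[0])
    return found

def is_mixed_script(title: str) -> bool:
    """True if the title's letters appear to come from more than one script."""
    return len(scripts_in(title)) > 1
```

A title such as "cаt" written with a Cyrillic "а" would be reported as mixed, which is the kind of hard-to-spot case such a list is useful for.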

Have you gotten to templates and content-level items yet or are we still operating on headwords, categories, and headings? I'm still plenty busy on the product of your last runs. DCDuring TALK 09:33, 26 October 2009 (UTC)

Verifying rare languages

See WT:RFV#tingo. Languages that are little used or not often written are, by nature, hard to cite with three durably archived cites. Apparently our only other Rapa Nui word is hehe, which I imagine is equally difficult to cite. There is no www.google.rap by the way. Is there any reasonable way to combat this? I can't think of one. Mglovesfun (talk) 20:52, 27 October 2009 (UTC)

I don't think the three cites rule fits smaller languages particularly well, since they're not only rarely printed, but the prints that exist are hardly available on the Internet. I think we should accept definitions in dictionaries or scholarly works for those languages. -- Prince Kassad 21:05, 27 October 2009 (UTC)
Well if it's written and has any kind of literature, I don't think we should stray away much from the usual CFI (we could e.g. lower it to only one attestation, because in those cases when language is spoken by a tiny community there is little chance that the recorded word is not actually used). If a language has no written literature at all, and is only described in scholarly works in some form of transcription (or worse, in multiple incompatible transcriptions, depending on how the speech was analyzed by the linguists who investigated it), then it should IMHO belong only to the appendix namespace. A quick Web search yields several websites containing written Rapanui, so there's no excuse to remove the RfV label. --Ivan Štambuk 21:37, 27 October 2009 (UTC)
That would exclude all languages without written tradition from appearing in Wiktionary at all. Given that there are about 5,000 languages without written tradition, this is very major. -- Prince Kassad 13:30, 28 October 2009 (UTC)
Regardless, we shouldn't be adding them at all in the main namespace if there is no standardized orthography or transcription. The loss is minimal, as all of these languages will be extinct by the end of the century, and they interest barely anyone outside the academia. --Ivan Štambuk 15:11, 28 October 2009 (UTC)
That might be a fairer way to define what we currently include under "appeared in a well-known work" (or did when I last read this stuff) - if it appears as a mention in a scholarly article (or two, or three), then we can take Wikipedia's approach and define it in terms of what is given there (given that there are not enough cites to define it for ourselves) - though we might need more rigorous criteria to define when such mentions are acceptable, yadayada. Conrad.Irwin 21:41, 27 October 2009 (UTC)
In the case of Rapa Nui, there is http://www.rongorongo.org/index.html, but this dictionary does not have tingo. To me, the definition looks fishy. The book by Adam Jacot de Boinod seems to have been poorly researched and inaccurate. Looked at some of his German inclusions...they are either just plain wrong or, in some cases, do not even exist. I say, delete tingo posthaste. —Stephen 22:30, 27 October 2009 (UTC)
Google Books suggests a possible single reference outside of the controversial The Meaning of Tingo reference on "page 11" of Pacific Studies, Vol. 3-4, a publication by Brigham Young University--Hawaii Campus's Institute for Polynesian Studies, possibly being earlier than the books were published. (OCLC link) This particular reference, while perhaps rescuing the use, is still not well supported at best, and there's no way for me right now to easily confirm that this reference predates The Meaning of Tingo in any case. It does appear to discuss it in context with a similar meaning ("[a reciprocity system] abused by the unscrupulous who might make excessive demands"), however, which may be promising if it does appear to be independent from the word collection. Finding an actual Rapa Nui source may be difficult, however. --Pipian 07:09, 21 November 2009 (UTC)

Tbot mess with Chinese translations - this must stop

I left a message on Robert Ullmann's talk page but have had no reply yet.

This edit [1] created such a mess with this Chinese translation! Can this be stopped please? I don't know how many entries are affected but whatever it's doing, it's wrong! --Anatoli 22:42, 27 October 2009 (UTC)

It was probably caused by things like [[wo|wǒ]]. I think all of those are wrong and, if they are linked at all, should simply be [[wǒ]]. The tone marks should not be ignored. —Stephen 23:26, 27 October 2009 (UTC)
Yes, Stephen, that looked like an attempt to wikify pinyin linking to pages without tone marks but it all went wrong. The transliteration should follow "|tr=", so the result was just a piece of ugly looking code. Anyway, I don't see much benefit to linking transliteration to pinyin syllables with or without tone marks, besides, transliteration should be left alone. Anatoli 23:45, 27 October 2009 (UTC)
I have (just now ;-) replied, and yes, the problem is a bad attempt to wikilink to the forms w/o tones. Either they should be linked properly to the forms with tones, or, as is usual with transliterations, not linked at all, as transliterations are usually not also written forms. As Stephen says, lose the piped links and link to the forms with tones. Robert Ullmann 23:58, 27 October 2009 (UTC)
Please don't link at all. Anatoli 08:16, 28 October 2009 (UTC)
I agree, the Pinyin transliterations should be unlinked. Delete the links when you see them. —Stephen 05:43, 29 October 2009 (UTC)
Robert, as you said on your talk page (I got lost in that discussion, sorry), it seems Tbot is trying to link the first part of Japanese transliteration, assuming it is "|tr=(Hiragana), Rōmaji|". I don't think it's a good idea either. The transliteration is a free form, could be more than one reading, mixing Hiragana and Rōmaji or only Rōmaji. The Hiragana entry doesn't have to exist, if the word is seldom written in Hiragana. Even if there is a value in it, I'd leave it for humans to add a link. --Anatoli 22:04, 29 October 2009 (UTC)
  • I don't feel qualified to make a comment about the scripting issues, but I gotta say, those translations for "I must go" (我必須去 and 我應該去) are pretty awkward. This is because they use literal translations of the word "go", when in fact the implicature is not "go" but "leave". 我必須走了 would be much more natural. Tooironic 08:21, 2 November 2009 (UTC)
    • Although if I may just say one thing about the scripting thing, I recently added both "literal" and "natural" (though less accurate) translations for antipasto, please let me know if I've done the formatting correctly. Cheers. Tooironic 09:54, 2 November 2009 (UTC)
Thanks, I agree with your translation and I have changed it. True, I haven't added the explanation that "I must leave" is implied. Anatoli 22:36, 3 November 2009 (UTC)
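The unlinking suggested above for piped Pinyin links such as [[wo|wǒ]] can be sketched as follows. This is only an illustration, not Tbot's code: the function name unlink is hypothetical, and it is assumed to be applied to the transliteration string alone, not to a whole entry.

```python
import re

# Matches [[target|label]] and [[target]]; the label group may be absent.
LINK = re.compile(r"\[\[([^\[\]|]+)(?:\|([^\[\]|]+))?\]\]")

def unlink(text: str) -> str:
    """Replace each wikilink with its visible label, keeping tone marks."""
    return LINK.sub(lambda m: m.group(2) or m.group(1), text)
```

Because the visible label is kept and the toneless target is dropped, the transliteration reads exactly as a human editor typed it.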

November 2009

Bot idea

I'd like to know if there would be consensus for me (or someone else) to write and run a bot to move sister-project links (eg {{wikipedia}}) and images that are above the first language header to directly underneath that header. Information above the language sections should really be language-independent. The two main examples of appropriate above-the-first-L2 elements are {{also}} (which links to entries that could be in any language) and {{character info}} (which shows encoding related info). Images and sister-project links are inherently language specific. Images are representations of language-specific definitions (I'll steer clear of images in templates such as {{stroke order}} or {{character info}} which actually represent the character and not its definition) and the sister-project boxes link to language-specific projects (eg en.wikipedia.org). These elements therefore should be moved into the language section they correspond to, which is, as far as I've seen, always the top language section. This will maintain semantic order for the page and follow existing guidelines (eg Template_talk:wikipedia#Placement). Additionally, these offending elements mess with the right-hand side Table of Contents. Are there any notable deviations that I'm currently not aware of? If there is consensus I would code up a bot and submit it for a vote. --Bequw → ¢ • τ 22:51, 2 November 2009 (UTC)

I like the idea with one class of exceptions: links to WP disambiguation pages (and only dab pages) that actually exist. Such links should be in a more compact format than our big WP boxes. Those pages are more like our {{also}} links. They provide another opportunity for users to locate, for example, Proper nouns and Abbreviations which we do not cover. I am not sure whether this is desirable for other PoS entries. Perhaps something similar should be done for entries whose top L2 is not English, linking to a transliteration at WP, if there is one. I don't think we should ever direct users from a landing page to a non-English pedia or sister project unless it is from the corresponding language section. I certainly doubt that we would want multiple sister project links above the first L2 header. DCDuring TALK 23:32, 2 November 2009 (UTC)
I'd support that. And personally, I don't see a need for WP disambig boxes to be above the first L2 header. I don't know for sure, but I'd guess that banner blindness probably prevents users from seeing boxes up there, anyway. —RuakhTALK 01:19, 3 November 2009 (UTC)
That would be a reason to get rid of everything above the first L2. We've got browser frame, site banner, site tabs, and the {{also}} row. It is the relatively empty middle of the "also" row that would be my target. I would favor preventing the also row from ever pushing the rhs ToC down, by forcing it to remain within a frame that did not interfere with ToC, if that were possible. DCDuring TALK 03:06, 3 November 2009 (UTC)
Sorry, but if you have an image above the first L2, so, as you note, it's not tied in any bot-readable way to any language, how will the bot know where to put it?​—msh210 15:24, 3 November 2009 (UTC)
Everyone I've seen has always pertained to the top L2 (usually English). It's pretty obvious if it doesn't pertain to the top one, and I think editors spot and fix these quickly. --Bequw → ¢ • τ 18:57, 3 November 2009 (UTC)
I support the idea. I support that all English WP boxes that are now above L2 heading come under the L2 heading, including disambiguating ones, as they pertain only to English regardless of their being disambiguating.
Images: The bot should make the assumption that any image that is above the first heading in an entry that has an English section is an image belonging to the English section, as my experience wholly confirms this assumption; the exceptions will have to be manually corrected.
An alternative to {{wikipedia}} below L2 heading would be {{pedialite}} in See also section. But I propose to leave this aside: whatever the preferences regarding the two templates, robotic moving {{wikipedia}} from above L2-heading below it does not increase the number of {{wikipedia}} instances. --Dan Polansky 16:34, 3 November 2009 (UTC)
The wikipedia links still pertain to only English so I think the proper way to make those links prominent on pages where wikipedia has definitions we don't is through a layout change in the English section or adding them to also somehow. The boxes shouldn't be left outside any L2. These possible solutions could be handled afterwards with no problem, assuming there's a consensus for that. For number's sake, out of about 12k {{wikipedia}} invocations that are above the first L2, about 1k are explicitly disambiguation pages. --Bequw → ¢ • τ 18:54, 3 November 2009 (UTC)
"Explicitly" meaning that that's how {wikipedia} refers to them? — or meaning that the WP page is categorized in a disambiguation category or has a disambiguation-page template?​—msh210 18:57, 3 November 2009 (UTC)
I meant "explicit" in the template call ({{wikipedia|dab=}}), so not perfect, just a rough estimate.--Bequw → ¢ • τ 14:41, 4 November 2009 (UTC)
Is it possible to specify a location that is just below the first L2 and just to the left of the ToC? That would be fine. Another location is alternative forms, though the header would be a misnomer. I am hoping to find a solution that doesn't take up vertical screen space either for the main flow of content (left justified) or for the rhs ToC. Does or can the ToC have a maximum width with excess text wrapping to the line below at a reasonable resource burden? DCDuring TALK 21:32, 3 November 2009 (UTC)
Yeah, that could be done with some modifications. See this Sandbox version which works with the latest versions of FF, IE, and Chrome. Maybe an option could be passed into {{slim-wikipedia}} which would make it float next to the ToC and set the width to be variable rather than fixed. Do you like that? I do, it's prominent and still inside the L2. As for the ToC, its div tag can have a fixed width, but on overflow, the div will just present scrollbars rather than having the content wrap. But I don't think this is a huge problem because our Headers are generally just one or two words. --Bequw → ¢ • τ 14:41, 4 November 2009 (UTC)
That is beautiful. Exactly what I had in mind. Thanks.
Does anyone object to that placement for WP links to dab pages? It seems adequate for users who might be searching for proper noun senses or encyclopedic senses of a word and consistent with the logic of our heading structure. Should it be limited to use with dab pages in that location? What should it be called as a template? DCDuring TALK 15:30, 4 November 2009 (UTC)
Would something like {{temp|also|wp=}} work? generating "see also bing, bong, and Wikipedia articles" (for the disambiguation pages at least). Conrad.Irwin 21:47, 3 November 2009 (UTC)
{{also}} is for language-independent or cross-language linking, which is why I think that links to English (language-dependent) Wikipedia should not be on the same line with it, but rather should be somewhere below the "==English==" L2 heading: either in a box or in a pedialite link. An example of an entry with two boxes is "word"; an edit that the bot would make would be like this one made to "word", resulting in two Wikipedia boxes below "==English==" heading. --Dan Polansky 08:41, 4 November 2009 (UTC)
Do you think that the two wikipedia boxes stacked on top of each other is a problem? They are almost identical (the disambig one says "articles") which is not ideal. Anything to be done about these? --Bequw → ¢ • τ 14:44, 4 November 2009 (UTC)
I see no problem in having two boxes, but other people may differ. If the robot does not change the number of boxes, fine with me. If the robot keeps only one box in case there are more than one boxes, and it keeps the disambiguating one, fine with me. --Dan Polansky 18:02, 4 November 2009 (UTC)
Building off Dan's comment: It should be possible to link to two different articles with a single box, rather than stacking two boxes. Alternatively, the {{pedialite}} link can be used instead of the box, when the box template becomes cumbersome. --EncycloPetey 00:11, 8 November 2009 (UTC)
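The move proposed at the top of this thread can be sketched roughly as below. This is not Bequw's bot: the function name and both patterns are assumptions for illustration. It moves only lines that begin with {{wikipedia or an [[Image:/[[File: link, relies on the assumption voiced above that such elements belong to the top language section, and leaves the page untouched if it has no L2 header.

```python
import re

# A level-2 (language) header such as ==English==, but not ===Noun===.
L2_HEADER = re.compile(r"^==[^=].*?==\s*$")
# Lines the sketch treats as movable: {{wikipedia...}} boxes and image links.
MOVABLE = re.compile(r"^(\{\{wikipedia\b|\[\[(?:Image|File):)", re.IGNORECASE)

def move_below_first_l2(wikitext: str) -> str:
    """Move sister-project boxes and images to just under the first L2 header."""
    lines = wikitext.split("\n")
    first = next((i for i, ln in enumerate(lines) if L2_HEADER.match(ln)), None)
    if first is None:
        return wikitext  # no language section; leave the page alone
    head, header, rest = lines[:first], lines[first], lines[first + 1:]
    moved = [ln for ln in head if MOVABLE.match(ln)]
    kept = [ln for ln in head if not MOVABLE.match(ln)]  # e.g. {{also}}
    return "\n".join(kept + [header] + moved + rest)
```

Elements such as {{also}} stay where they are, and the number of {{wikipedia}} boxes is unchanged, which matches the conditions several editors set out above.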

Translations of the attributive form of nouns

Many English nouns may be used attributively as adjectives. In many cases the foreign-language translation is different to that of the noun sense. How do we show that? As an example, the translation table of folklore has folclore as an Italian translation - but nothing to point to folcloristico. SemperBlotto 22:46, 4 November 2009 (UTC)

Italian: {{t|it|folclore}} {{pos n}}, {{t|it|folcloristico}} {{pos a}} {{qualifier|in attributive use}}?​—msh210 22:58, 4 November 2009 (UTC)
Probably on folkloric xD However, in general this is a very delicate issue, because there are also other adjectival senses that are not (or cannot be) expressed in English but can be relatively regularly derived in a FL, and that are being missed in the translation tables simply because they all translate as English attributive noun usage. E.g. "mother's", "of or pertaining to a mother", "like a mother" etc. I think that we should either:
  1. include separate translation tables for attributive usages of English nouns, which note that these translate as adjectives in many FL
  2. not add adjectives to the translation tables of English nouns at all, because of the cluttering and because these would prob. wrongly get generated as nouns on many FL wiktionaries that bot-generate mainspace entries from the English Wiktionary translation tables (I know this is not our problem, but still..). Also, in most of the cases, such adjectives are regular morphological derivations from the base noun, and could be listed in the FL noun entry in the ====Derived terms==== section. (In this case, folcloristico at folclore) --Ivan Štambuk 23:18, 4 November 2009 (UTC)
OK - I've added a second translation table to folklore - does that seem reasonable? SemperBlotto 08:00, 5 November 2009 (UTC)
Yes, indeed. "Attributive use" might not be clear enough as a table header line; perhaps "adjectives (translating the attributive use of the noun)"? (Well, that's worded badly, but something else explicit.)​—msh210 15:40, 5 November 2009 (UTC)

AWB please?

Hello. Can I please be approved for AWB use here please? Thanks, Razorflame 23:10, 5 November 2009 (UTC)

Done: Wiktionary:AutoWikiBrowser/CheckPage#Approved users. —RuakhTALK 23:21, 5 November 2009 (UTC)
Same here please, although I don't know how to use it, I will need to know how. Mglovesfun (talk) 11:54, 6 November 2009 (UTC)
You should have access automatically as you are an administrator - if not you can authorize yourself. There is documentation somewhere on Wikipedia, I think. Conrad.Irwin 13:36, 6 November 2009 (UTC)

Valencian -iste

First the background so you all can understand where I'm coming from. In the IEC's standard for Catalan, the suffix that corresponds to -ist is -ista, which is both masculine and feminine. That's not too surprising: while nouns ending in 'a' are usually feminine in Catalan, the Latin suffix -ista this descends from is masculine in Latin, and similar cases where a masculine Latin noun stem ends in 'a' are usually masculine nouns in Catalan. However, in the Valencian (AVL) standard, they allow either -ista as the masculine singular or, as a backformation, -iste. There doesn't seem to be a similar case of a to e backformation for other words ending in 'a' that are both masculine and feminine (or if there is, it is rare). Therefore I'm planning on writing some specialized templates for this situation. I don't need help with the coding, but a potential policy type issue relating to the coding came to mind as I was writing one of them.

The template in question is the one I plan to use to generate the definition line for the alternate masculine singular form. Since that definition line will always have the {{Valencia}} regional context label, and never any other context label, my initial inclination was to have the new template include {{Valencia}}. The minor advantages of doing so are that in the unlikely event we change how we do regional context labelling, the change could be done via one edit to the template, or in the equally unlikely event of the acceptance of -iste by the IEC as an alternate form for the masculine singular, the label could be removed with one edit. The only meaningful advantage is not having to type or paste {{Valencia}} into the wikitext.

As to the potential disadvantages, first off there is a mild side effect that having the new template include {{Valencia}} would cause it to be a member of Category:Regional context labels. However, the concern that brought me here to bug you with this is do we want context labels buried inside non-context templates or should context templates always be present in the wikitext itself? I don't think it would be a concern, but before I write the template and make use of it, I thought I'd get input from others. — Carolina wren discussió 01:53, 6 November 2009 (UTC)

Bot request

Hello. I would like to propose a bot to be used here. I currently use it for interwikis, but I wouldn't mind reworking it to be used for AWB tasks. Here is what I would like to propose:

  • My username: Razorflame
  • My bots' username: Darkicebot
  • Software used: AWB
  • Proposed task: Both Prince-Kassad and hippietrail said that {{top}}, {{mid}}, and {{bottom}} templates on entries needed to be changed to {{trans-top}}, {{trans-mid}}, and {{trans-bottom}}. This task would easily be able to be done using Darkicebot and AWB, as it needs to be done, according to the two users above. If you would rather a human do it, I would also be happy to do it, but I would require AWB access in order to do it, a point of contention amongst me and Ruakh.

Thanks, Razorflame 06:00, 10 November 2009 (UTC)

Doesn't AutoFormat already do that? --Yair rand 06:06, 10 November 2009 (UTC)
Not that I know of. There seems to be a long list of entries on the What links here for the templates that need to be done, so....Razorflame 06:10, 10 November 2009 (UTC)
AutoFormat only does it when the gloss is present. It doesn't do it when a gloss is not present. Razorflame 06:11, 10 November 2009 (UTC)
No. Any mass conversion to {{trans-top}} needs a human to generate glosses and determine whether the translations need to be split to conform with senses. A bot would only create more work in Category:Translation table header lacks gloss. Nadando 06:13, 10 November 2009 (UTC)
1. AutoFormat does not add a gloss either and nobody seems to complain against that, 2. however AutoFormat only reads the recent changes, so we need a dedicated bot for {{top}} replacement. -- Prince Kassad 16:51, 10 November 2009 (UTC)
As Nadando said above, {{top}} et al can't be converted wholesale to {{trans-top}} et al because they aren't 1:1. Top is used for more than just translation sections and the translation sections using top can't just have the templates swapped, as trans-top has a different and more involved usage. This is work best done by humans, assisted by AWB probably but not done blindly. - TheDaveRoss 17:05, 10 November 2009 (UTC)
It does say AWB right at the beginning. -- Prince Kassad 18:11, 10 November 2009 (UTC)
AWB is not a bot, it is a browser. This is a bot request. You don't need -- nor should you have -- bot status to run AWB in manual mode. - TheDaveRoss 20:42, 11 November 2009 (UTC)
Then can I turn this into an AWB request to use AWB myself on this account to do this? Cheers, Razorflame 16:16, 12 November 2009 (UTC)
You don't need permission to use AWB, and you don't need a special account. As long as you know what you are doing and what you are doing is in accordance with en.wikt policies etc. you can just do it. If you are going to be doing a lot of edits quickly you ought to mark them as minor, but other than that no special consideration is needed. - TheDaveRoss 22:00, 12 November 2009 (UTC)
Ruakh gave me AWB access a few days ago, but took it back when s/he said that I needed to talk with the community before any big changes...Razorflame 22:01, 12 November 2009 (UTC)
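Since the objection above is that {{top}} is also used outside translation sections and cannot be swapped blindly, a helper for a human-assisted run might only locate candidates for review. The sketch below is an assumption about how that could look, not AutoFormat's or AWB's behaviour; the function name is hypothetical and it only recognises a bare {{top}} at the start of a line.

```python
import re

# Any wiki header: equal numbers of '=' on both sides of the title.
HEADER = re.compile(r"^(=+)\s*(.*?)\s*\1\s*$")

def top_in_translations(wikitext: str) -> list[int]:
    """Return 1-based line numbers of {{top}} calls inside Translations sections."""
    hits, in_translations = [], False
    for number, line in enumerate(wikitext.split("\n"), start=1):
        m = HEADER.match(line)
        if m:
            # Entering a new section; remember whether it is Translations.
            in_translations = m.group(2) == "Translations"
        elif in_translations and line.strip().startswith("{{top}}"):
            hits.append(number)
    return hits
```

A list like this leaves the choice of gloss, and any splitting of the table by sense, to the editor doing the conversion.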

Huh?

What in the world is the Wikipedia logo doing at the top of every single Wiktionary page with a link to a page titled "Wikipedia Forever"? --Yair rand 05:19, 11 November 2009 (UTC)

It's yet another (badly implemented) advertising banner. It's slowed my page loads down to a crawl on both Wiktionary and Wikipedia. --EncycloPetey 05:58, 11 November 2009 (UTC)
You can adblock this banner, if it bothers you. -- Prince Kassad 06:01, 11 November 2009 (UTC)
I have my preferences already set to not show them, but it pops up for a fraction of a second anyway, and the loading delay prior to that is still a problem. --EncycloPetey 06:10, 11 November 2009 (UTC)

If others are willing, I would suggest adding the following to MediaWiki:Common.css so that it looks less like we have been hacked (it is about the same, but a third of the size, and with internal link colour instead of external link colour). I have no idea who was responsible for this abomination, but I hope they're reading the comments from all-over the wiki-sphere. Conrad.Irwin 13:43, 11 November 2009 (UTC)

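/* Shrink the banner container. */
div.siteNoticeBig {
    height: 30px;
    margin-top: 3px;
}
/* Keep the toggle box at normal text size. */
div.siteNoticeBig .toggle-box {
    font-size: 1em;
    padding-right: 5px;
}
/* Scale the logo down to fit the shorter banner. */
div.siteNoticeBig img {
    height: 20px;
    padding: 0 10px;
    margin-top: 5px;
}
/* The "forever" element: smaller, bold, in the internal link colour. */
div.siteNoticeBig #forever {
    margin-top: -50px;
    font-size: 12px;
    font-weight: bold;
    color: #002BB8;
}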
The banner has been removed due to IE6/7 breakage. Apparently there has been a huge amount of opposition to the banner: w:Wikipedia:PROPS#Abolish the silly headers, m:Fundraising_2009/Launch_Feedback, n:Wikinews:Water_cooler/technical#Ugly_Ass_central_site_notice. There is now a discussion going on to make alternative banners. --Yair rand 17:54, 11 November 2009 (UTC)
The need for project-localized banners was raised earlier, by me and by others. I expect this is just a sign of being in a hurry; there is interest in making sure this happens. Please add your voice to the need for suitable Wiktionary banners on the launch feedback page. +sj + 11:56, 12 November 2009 (UTC)

Request for clarification of policy / convention in re inflected forms of English multi-word idioms

What is the present policy / convention governing inflected forms of English multi-word idioms such as damn by association and fudge the issue? It was my understanding that we aren’t meant to give such terms full entries (or even soft-redirect entries), but that they should rather hard-redirect to their lemmata. This question stems from uncertainty in a discussion between msh210 and me. How are they meant to be treated?  (u):Raifʻhār (t):Doremítzwr﴿ 18:17, 11 November 2009 (UTC)

NVRM; Ruakh has resolved this confusion: Such forms are meant to hard-redirect, per Wiktionary:Redirections#Redirecting between different forms of idioms.  (u):Raifʻhār (t):Doremítzwr﴿ 18:22, 11 November 2009 (UTC)
An aside: the title of this thread should IMHO have been "Inflected forms of English multi-word idioms" or something of the sort, to be usefully short. --Dan Polansky 20:01, 11 November 2009 (UTC)
If we have the technology to put a man on the Moon (or [your favorite technological milestone here]), we should have the technology to do word wraps in the floating right-hand ToC. That would get rid of one bad consequence of overly long section heads. DCDuring TALK 23:07, 11 November 2009 (UTC)

Move to WT:AEN DCDuring TALK 17:29, 4 December 2009 (UTC)

Could all entries have a non-bolded repeat of the word?

Hello,

I am a user. What I am finding is a need for your word entries to have a non-bolded copy of the word, so I can simply copy and paste it into my document. I don't look up words I already know how to spell or that the spell checker in MS Word already knows. So the words I am looking up need to be copied for speedy processing. But once I copy/paste, I always have to go back and un-bold them. This seems a waste of time to me. Giving me such a copy of the word would enhance the usefulness of the site.

Thanks for your consideration. —This comment was unsigned.

  • Time and effort involved - masses.
  • Would help - one person.
  • Would piss off - many people.
  • We probably won't do it. SemperBlotto 14:54, 12 November 2009 (UTC)
As far as I know, only certain browsers like Firefox copy the style of a word/links/whatever else. I'm not really sure what the big issue with unbolding a word is...lazy person. — [ R·I·C ] opiaterein — 15:22, 12 November 2009 (UTC)
Use Opera. It does not copy formatting. --Vahagn Petrosyan 16:10, 12 November 2009 (UTC)
The title is non-bolded, though it might copy it out title-sized. Conrad.Irwin 16:28, 12 November 2009 (UTC)
They could use Paste Special, Unformatted, or even write a little macro so that it is a single keystroke combination. Richardb

Darkicebot inflection testing

Hi there all. I am going to be testing my inflection bot Darkicebot for a while. It won't make too many edits. I just need to test it while I get it working properly. I am only going to be making an edit or two, stopping it, then repeating the process until it works. I plan on fixing all mistakes it makes in the meantime. Thanks, Razorflame 17:13, 12 November 2009 (UTC)

Testing has been completed and a vote is now active. Thanks Razorflame 20:52, 12 November 2009 (UTC)
I notice you have several templates lined up for use with the bot, but the VOTE only mentions Esperanto verbs - which languages and parts of speech will Darkicebot service? Conrad.Irwin 23:40, 12 November 2009 (UTC)
I've gone over the vote and added in the information which you are looking for. I've stated on the vote page that I will be creating Esperanto verb form-of entries, batch creating Esperanto noun form-of entries from a text file, and batch creating Ido noun plurals from a text file. Those are my goals. I will probably add in Esperanto adjectives, too, but I think that I will wait for that for a while longer. Cheers, Razorflame 23:43, 12 November 2009 (UTC)

My userbox

Hello there. After reading over two archives about userboxes, I noticed that you said that controversial userboxes are not allowed here on the English Wiktionary. I made User:Razorflame/Count so that I could keep track of how many edits that I have made here, and I don't believe that it was offensive towards anyone, much less controversial. Ruakh removed it from my userpage (made it invisible), and I went ahead and fully deleted it. I just wanted to ask here if I could be allowed to use this userbox as it isn't doing any harm to anyone. Thanks, Razorflame 20:52, 12 November 2009 (UTC)

Not for the unaware: the relevant policy is (for some inconceivable reason) Wiktionary:Neutral point of view, which states that "All other [non–language-proficiency] userboxes are currently forbidden (though specific exceptions may be made, after discussion)." I believe Razorflame is asking for such a specific exception; if so, I'd be O.K. with that. The only POV it seems to reflect is that it's good to contribute to Wiktionary, which I think is a POV we can all accept. (Arguably it implies that quantity is more important than quality, but personally, I'd much rather a number-of-contributions userbox than a my-contributions-are-better-than-yours userbox.) —RuakhTALK 22:49, 12 November 2009 (UTC)
I don't see it as a good thing to boast about; the number of contributions is almost completely irrelevant. The idea of achieving a high "edit count" can encourage people to make many edits of meagre substance - and while there are plenty of that kind of edit to do, I feel it is more useful to encourage people to make more beneficial edits. As some edits are considerably harder, and may take thousands of times as long as quick-fixes or javascript edits, the "amount of contribution" of two users with a similar edit count could be well over 100% different, so it would seem misleading if they wore the same badge. The policy got co-opted into WT:USER, which, while not voted on, is I think reasonably accepted - though we seem quite lax about signatures in some cases. Conrad.Irwin 23:36, 12 November 2009 (UTC)
I suppose Conrad's right that it'd be inappropriately promoting quantity over quality, but I don't see that as a big deal. What I see as a bigger deal is the slippery slope toward infection by the bacterium Userboxophilis uikipedii.​—msh210 16:19, 13 November 2009 (UTC)
It isn't a question of being proud of your edit count or displaying a statistic. Both Stephen and I link to this site which keeps track of such things. The concern among the community has always been the problem of what's happened to UserBoxes on Wikipedia. There, the boxes have become silly, numerous, and (therefore) often uninformative. On a multilingual dictionary project like this, the most relevant information to have in a standardized format is language proficiency, with script proficiency just a bit behind that. Your politics, ideologies, diet, and other idiosyncrasies have no bearing on the project, and we chose to therefore limit the use of Userboxes. You are still free to describe yourself in text on your user page, just not to create userboxes or their accompanying categories. --EncycloPetey 02:00, 16 November 2009 (UTC)
Ruakh is correct. Now, as it seems relevant to the project, I would allow it. I don't think a discussion is in order on the merits of counting edits. If Razorflame is proud of it, and again as it's relevant to his contribution, he can ask for an exception. That would apply to a specific template, I think, unless he wants to be more general in his proposal. I doubt this has to be very formal either. 70.112.24.181 06:21, 18 November 2009 (UTC)
I don't particularly like the idea of a contributions userbox. However, if Razorflame would like to display the userbox, it is his userspace after all which we shouldn't have too much say over. Ditto Conrad.Irwin and msh210's comments; let's not get into promoting quantity over quality please....... L☺g☺maniac chat? 15:23, 18 November 2009 (UTC)
How could it possibly be wrong for a user to have a labor-saving way of displaying their edit count? Any way individual users can motivate themselves to do something productive seems fine. Is there any metric that would provide the correct incentives in all regards? No. But all metrics have some value. I am sure I am not alone in using the zero target for the number of English-language or any-language entries in various cleanup categories, both community/official and user-created. (BTW, It would be nice to have some metrics for entry quality by language.)
OTOH, competitive edit counting seems to go in the wrong direction. DCDuring TALK 15:40, 18 November 2009 (UTC)
The implication of this last point is clearly that we should forbid access to this site or have it destroyed. Just knowing that it exists is making my adrenalin flow.
The concern about keeping main user pages "plain" seems to befit this community's monkish approach to the project. The question of appearance is wholly distinct from the edit-count question, is it not? DCDuring TALK 15:54, 18 November 2009 (UTC)
I think it's pointless posturing and would personally oppose any loosening of the current userbox rules. Equinox ◑ 22:40, 18 November 2009 (UTC)
I 100% agree with Equinox. Unfortunately, too often people here do mindless botwork in order to pump their editcount, when they should be focusing on missing content itself. Allowing editcount userboxes would amount to openly promoting that kind of degenerate metric. --Ivan Štambuk 04:29, 19 November 2009 (UTC)
If the mindless botwork were actually being done by mindless bots then we wouldn't have to worry about recruiting and motivating people. There are plenty of cases of one kind of flaw appearing in 100 entries. Getting a bot to do that is often not worth it. In fact the content gaps and the format blunders and the poor quality all need work. One good way for people to learn is to do a lot of less complicated work so that they get to see a large number of entries with the inevitable large number of good and bad features. Then they can proceed to more complicated work and eventually to noting new problems and making major improvements. It certainly would be nice if everyone were brilliant from their arrival here, but in fact we are stuck with defective human contributors, some deficient in people skills, some deficient in energy, some deficient in perspective, some deficient in humor, some deficient in language skills, some deficient in tolerance, some deficient in patience. DCDuring TALK 04:52, 19 November 2009 (UTC)
And you think that editcount userboxes would help foster a culture of newbies gradually turning eager to make quality edits after making 10^x trivial ones? :) I'd say quite the opposite. Lexicography is one of the most boring and arduous tasks imaginable, and the real motivation comes from "within".
God, I'd hate for us to turn into Wikipedia, with everyone posturing with their fancy "awards" and brag-articles. --Ivan Štambuk 05:18, 19 November 2009 (UTC)
Ugh yes. I don't care if anyone (newbie or not) wants to do the mindless botwork; it needs to be done, and it won't harm anybody. However, they shouldn't be showing off about it. This page has a lot more useful information on it than just user contributions, and I don't see it as a bad thing that should be forbidden linking to or deleted. I saw it before I started to contribute here and it was encouraging to see that so many people were dedicated enough to this project to contribute that much. (However, I don't like the column on that table that shows position change i.e. this user has moved up one spot in the last month. That gives the air that this is a competition :P) L☺g☺maniac chat? 15:19, 19 November 2009 (UTC)

Wiktionary:Requests for deletion/Others#Category:US slang

I put a link here as it looks like a policy decision more than just a deletion one. Mglovesfun (talk) 17:21, 14 November 2009 (UTC)
Move to WT:AEN DCDuring TALK 17:30, 4 December 2009 (UTC)

Wikisaurus - inclusion criteria

Richardb thinks that Wikisaurus should have less strict inclusion criteria than the Wiktionary mainspace, and, without starting a discussion, has been entering his proposal directly to Wiktionary:Editable CFI and Wiktionary:Wikisaurus. I have reverted his edits, but I have no interest in keeping an edit war with him.

Please provide your input on what you think the inclusion criteria for Wikisaurus should be. I think they should be basically the same as those for the mainspace, with possible exceptions that are left unarticulated. --Dan Polansky 16:36, 15 November 2009 (UTC)

No, there should not be exceptions. Both Wikisaurus and the mainspace should allow all words, as the first sentence of CFI (before the table of contents) states it. Lmaltier 17:19, 15 November 2009 (UTC)
So you think the same CFI should apply to both and separately disagree with the restrictiveness of the current CFI? Or do think that since the current CFI is too restrictive that it shouldn't apply to Wikisaurus? --Bequw → ¢ • τ 23:35, 15 November 2009 (UTC)
I think the same CFI should apply to both and separately disagree with the restrictiveness of the current CFI. Lmaltier 19:13, 18 November 2009 (UTC)
Same CFI for both. They both strive to be general lexical reference works. --Bequw → ¢ • τ 23:35, 15 November 2009 (UTC)
If Wikisaurus were to include references to appendices that have terms that do not meet CFI, we might be able to have our cake and eat it too. At present the recommendation to put such terms in appendices is transparently (to all but the most naive newbies) consigning them to oblivion. Wikisaurus could be a window onto such appendices, which might include common SoP terms as well as protologisms and unattestable colloquialisms that don't rise to the "widespread-use" exception to CFI. DCDuring TALK 23:57, 15 November 2009 (UTC)
This argument seems to fall into a broad category of arguments of the form, "the CFI are, or WT:CFI is, broken, but it's too hard to change them/it the right way, so let's 'fix' them/it in a way that doesn't solve the underlying problems". Such arguments most often come up with place-names; also frequently with personal names (especially, for whatever reason, in Chinese); and occasionally with languages that aren't well attested. I don't dispute that the CFI are broken — not that I have much of a gripe with any particular CFI, but collectively they do not have, and AFAICT at no point have they ever had, consensus, and they're riddled with ambiguities that we can't pass a vote to resolve one way or the other, and enforcement is very inconsistent, etc., etc. etc. — but I don't think that proposals like "let's let Richardb use Wikisaurus as a dumping-ground for garbage we won't allow in mainspace, because no one actually cares about Wikisaurus" are really very productive. Especially since, as it happens, one editor does care about Wikisaurus. —RuakhTALK 15:42, 17 November 2009 (UTC)

Argument for altering CFI to meet current practice, for Wikisaurus

--Richardb 13:15, 16 November 2009 (UTC)

There is "Added Value" in capturing as many words as possible. But there can be a "Cost" in having a Wiktionary cluttered with entries of questionable value, neologisms etc. But in Wikisaurus, especially in the /more pages, there is good added value in just simply having a word (not a page or an entry), be it a neologism or a piece of technical jargon, associated with other related words, with very little "cost". Especially once the more dubious words in the more contentious pages are hived off to the /more pages. Indeed, the /more pages are, of themselves, a kind of appendix, are they not?
Further, I make a point that the policy and current practice, in regard to Wikisaurus "words" (not headwords/pages/entries), do not match. Either the policy is wrong or the practice is wrong. So, either we make an alteration to CFI to allow Wikisaurus the same sort of latitude as given to appendices, or we wipe out a considerable chunk of value in Wikisaurus by deleting all the single word content that doesn't strictly meet CFI as it is now. I know which I feel Adds Value and which Costs.
Additionally, allowing some unattested words in Wikisaurus does serve a number of useful purposes.
  1. It gives a space for protologisms, neologisms etc. to go, without cluttering the main space, and yet they are still searchable. To some extent we lose credibility if a person sees a new word somewhere, comes to Wiktionary, and either can't find it at all, or finds it only in a list of "neologisms". Whereas, if he finds the word in a Wikisaurus page, he is going to get a fair idea of what the word means through its associations.
  2. Equally, we make it pretty daunting for newbies to add a new word, having to meet CFI, formatting, template use etc. It's quite a bit easier to be able to put the new word they have discovered into Wikisaurus. And now, all of a sudden, we don't just have a newbie reader, we have a new contributor, who actually has invested something into Wiktionary, and has some, very small, feeling of ownership, of sharing in the experience of building Wiktionary. Some of these new contributors may go on to be very significant contributors. That beats having the first response to their attempt to add a word somewhere be: No, it doesn't meet CFI, it isn't formatted properly, etc. I know in the past that we have lost a number of new, interested contributors, because of the "Wiktionary Police" approach of one or two over-pedantic admins.
Allowing words in lists within Wikisaurus that don't necessarily meet full CFI(present) adds a lot of value, at little cost. So why stop it? Why have a policy that doesn't meet current practice?
PS. I have noted in the past, and I think it is still the case, that there are many archaic and obscure words in Wiktionary that only on the slimmest of measures meet CFI. I myself know I have added one or two words from Jane Austen works that I could not find in use (as opposed to in dictionaries) in any other book. Why should words like that be admitted, while we have huge battles over words like "Chillaxin" or "Bouncebackability", clearly in widespread use?
Finally, I have also proposed another notionally very minor clarification change to CFI, which, if accepted, would quite possibly avoid this whole argument anyway, at least to a large extent. That is to clearly spell out that words need only meet ANY one of the CFI, not ALL of them. Too often in the past that too subtle little "or" in the criteria has been ignored. "No, it's not good enough that "chillaxin" has hundreds of thousands of hits on Google, it must also appear in print somewhere". If that clarifying modification, the very clear reference to meeting ANY of the criteria, is accepted, then perhaps we don't have much to worry over after all. Except, of course, for the very top policy of all - all words in all languages!

--Richardb 13:15, 16 November 2009 (UTC)

Richard, can you give an instance of a Wikisaurus entry at which current practice does not match CFI?
Well, since virtually all words meet CFI IMHO, I'm guessing you are looking for an example where there is a clearly valid entry in Wikisaurus that is not in the Main namespace. How about off one's trolley in wikisaurus:insane. Google hits - "Results 1 - 10 of about 96,400 for "off one's trolley" ". Clearly a phrase understood by most people in Britain. But no entry in Main namespace. Not even a mention in trolley. So, with very little effort, certainly no effort to ensure it meets CFI, a user has contributed a term to Wikisaurus that adds real value. --Richardb 00:11, 30 November 2009 (UTC)
Really, it's about the way CFI is interpreted. If a word is a bit new, or a bit obscene, the bowdlerisers delete it on the grounds it does not meet strict CFI, clearly applying a much harder test than for many other words. Actually, as pointed out in the discussions on CFI, I feel things would be improved just by the clear expression that the CFI Criteria are to be applied with an OR, not an AND condition. Change that, and get the exclusionists to stick by that change, and pretty much the argument about having lesser CFI for Wiktionary would be irrelevant. Though the /more pages would still be useful. Richardb 07:25, 30 November 2009 (UTC)
I agree that the subpages of Wikisaurus entries ending in "/more" such as "Wikisaurus:breasts/more" can be left unregulated by criteria for inclusion, as is currently the case. I have added a sentence to that effect into Wiktionary:Wikisaurus.
Other than that, Wikisaurus should not become an inbox for malformatted and unattested content. If editors of Wiktionary wish to create an inbox for quickly added malformatted and unattested entries, we can create one.
Wikisaurus is a thesaurus and a namespace for semantic network, not a bin for malformatted and unverified content. Wikisaurus is striving at no less accuracy and being well-formatted than the mainspace. --Dan Polansky 14:06, 16 November 2009 (UTC)
Sorry - "Wikisaurus is a thesaurus and a namespace for semantic network"? When was that decided? Funny thing is, I thought it was for users, people. Please expand on your idea. Perhaps outside this immediate discussion though. --Richardb 00:11, 30 November 2009 (UTC)
Let me be specific. In this edit to Wikisaurus:die, I have removed "become brown bread", "get off one's twig", etc., as these terms have almost no Google hits. Do you wish that these unattested invented terms are included in a Wikisaurus entry?
I have to question your objectivity. Even if I use the precise phrase "become brown bread", hardly the commonest form of usage, I get "Results 1 - 10 of about 18,800 for "become brown bread"" from Google. So those 18,800 hits are, in your humble opinion, not worth anything in support of the inclusion of the phrase. OK, in reality, there may be only a few usages there, but, nevertheless, if a non-English speaking person sees the phrase and looks it up on Wiktionary, what will they find? Nothing, or something helpful? --Richardb 23:51, 29 November 2009 (UTC)
My mistake. I admit that "become brown bread" has gained 18,000, which it had not when I had removed it. I retract "these terms have almost no Google hits", as its use of present tense is incorrect; the terms had almost no Google hits when I had removed them. Nevertheless, the term "become brown bread" can be readded when it becomes clear that it meets CFI. --Dan Polansky 09:11, 30 November 2009 (UTC)
For a context, there was a vote on the "/more" pages at Wikisaurus: Wiktionary:Votes/2006-09/Wikisaurus semi-protection. --Dan Polansky 14:29, 16 November 2009 (UTC)
It feels like this is at least the third time this discussion has come up. Historically it has been decided that we do care about the Wikisaurus namespace being more than an Urban Dictionary mirror. It will continue to be my vote that verification and attestation as rigorous as the main namespace be applied to all material meant to be used as reference material included in this project. The giant list of utterly dubious "synonyms" listed in the current incarnation are barely worth the bits they are stored in. - TheDaveRoss 20:30, 19 November 2009 (UTC)
Even if they are "barely worth the bits they are stored in", they are still worth more than it costs. But what you really mean is "in my opinion barely worth the bits they are stored in". It seems that you, as with many other exclusionists, take the judgement that if you think it is not worth anything, then sod those who do think it is worth something. I believe there is more value in inclusiveness, and have made the argument. I cannot recall seeing a cogent argument for making this an exclusive, academic work. --Richardb 23:36, 29 November 2009 (UTC)

Richardb, I can't see how you call this a good-faith proposal, when you ignore things I've generated for you like User:Connel MacKenzie/thesaurus. The truth here, is that you aren't interested in building a thesaurus, you are interested in dumping lists of obscene terms in the wikisaurus namespace. Since I've spent at least ten times the amount of time on wikisaurus than you have, I'm shocked to see you portrayed as "caring" about it. But I'm not surprised to see the same proposal now, that was rejected numerous times in numerous forms, in the past. --Connel MacKenzie 23:24, 19 November 2009 (UTC)

As far as I can see, the only person who's used the word "caring" in this discussion is me, and I can assure that you I did not mean Richardb. Since Amina left, the only editor putting much effort into Wikisaurus has been Dan Polansky. —RuakhTALK 01:51, 20 November 2009 (UTC)
How is having User:Connel MacKenzie/thesaurus any use to someone searching the Wikisaurus namespace? And it is a total and slanderous misrepresentation to say "you are interested in dumping lists of obscene terms in the wikisaurus namespace". I am interested in protecting the value that is in all of Wikisaurus, contributed by a great number of people. It just so happens a lot of common usage words and phrases are obscene (and sometimes I put a few back after the bowdlerisers have been active). This is not an argument about "obscene" or otherwise. It's about inclusiveness or exclusion. I believe there is more value in inclusiveness, and have made the argument. I cannot recall seeing a cogent argument for making this an exclusive, academic work. --Richardb 23:36, 29 November 2009 (UTC)

Responding to some of the postings above as regards whether the current practice of Wikisaurus matches CFI: The example given by Richardb—off one's trolley in wikisaurus:insane—has meanwhile been created in the mainspace, as it meets CFI. To demonstrate that the current practice of Wikisaurus does not match the principle of including in Wikisaurus only CFI-meeting terms, it would be necessary to list a considerable number of terms that are currently included in Wikisaurus and yet do not meet CFI; that a term is not yet in the mainspace does not prove that it does not meet CFI. To see whether a term meets CFI, the first thing to do is to search for it in Google books, and check further criteria. Whether a term is vulgar or obscene is beside the point; CFI allows inclusion of vulgar or obscene terms. The demonstration is unlikely to be delivered, as there is a large number of entries that I have either created or cleaned up. That is, Richardb's claim posted in this thread that "...the policy and current practice, in regard to Wikisaurus "words" (not headwords/pages/entries), do not match" is untrue. I am unsure whether this subject is still relevant, though, since several people in this thread expressed their support for the principle of keeping in Wikisaurus only terms meeting CFI, while only one person disagreed. Unless some more people turn out to disagree, this thread documents that the principle has community support. If it turns out that some Wikisaurus entries contain CFI-non-meeting terms, this can be easily amended by removing the offending terms from Wikisaurus. Let me remind everyone that this discussion does not pertain to the "/more" subpages in Wikisaurus, whose content is, for now, allowed not to meet CFI. --Dan Polansky 08:56, 30 November 2009 (UTC)

Move to WT:AEN DCDuring TALK 17:31, 4 December 2009 (UTC)

Collaboration

The Collaboration of the week has been inactive for quite a while, probably because there isn't enough interest in it to have a new one every week. I think there were really some flaws in the concept in the first place, mainly that a new one was started whether the last was good enough or not. However, collaboration is very useful in building entries.

Therefore, I propose that a new project be started, that would work on one word, and not start a new one until the current one is about as good as it can possibly get. That means great and complete definitions, as many pronunciation guides as possible, audio files, synonyms and antonyms (and possibly a Wikisaurus entry as well), a good etymology section, example sentences, large translation tables with as many de-redlinked and well-formatted translations as possible, references and citations for everything. Basically, work on this word would not stop until the entry is as much of an example of a "perfect entry" as possible. This will probably take longer than one week per word. I don't know how we could decide whether an entry is "perfect" or not (Polls? Comments? Specific criteria?) nor do I have an idea for a name, but I really think that trying to get as many near-perfect entries as possible is rather important and that we often make quantity of entries too much of a priority over quality.

Any thoughts? --Yair rand 20:29, 15 November 2009 (UTC)

Sounds like a good idea in general, though I would imagine a time-limit is necessary (we can always re-start the clock if there's still enthusiasm for a particular entry). The only really successful "explicit" collaboration we've had was User:Mutante's emptying of Special:UncategorizedPages, which went from tens of thousands to tens in a few weeks. I think it was successful because everyone could take part in their own way (even the people who contributed only to one language), there were definite success criteria, and there was a strong driving force. If some definite targets could be set, and you were willing to whip people a little bit, then I can imagine this working very well. Conrad.Irwin 21:45, 15 November 2009 (UTC)
One of the reasons I suggested this is so we could have a goal, rather than a deadline. I like the idea of working on one word until it's done, rather than working on it until a certain amount of time has passed. I think the simplest way would be some form of poll, maybe stay with the current word until a majority including at least a certain number of people (3, 4, 5?) think we should switch to a new one. --Yair rand 00:58, 16 November 2009 (UTC)
I started a page for it in my userspace (User:Yair rand/Current Collaboration). Edits to it are welcome. I hope to move it to the Wiktionary namespace after a name is decided and there is sufficient interest in having a collaboration project. Personally, I'd love Wiktionary to have some entries that would just make readers go "wow". I think this is the best way to accomplish that. --Yair rand 05:45, 17 November 2009 (UTC)
You mean like we now have for listen, parrot, etc.? That was the idea behind the Model Pages, except that the models were chosen to have a limited number of parts of speech and possible senses, to avoid confusion when using them as models. My most recent attempt at "complete" page was biceps. I selected that word in part because (1) there is a common sense often missing from other dictionaries (the informal sense #3), and (2) the word exists in several languages, so we now have a new Model Page that models additional languages beyond English. If we picked a few such multi-lingual entries for collaboration, that might spur more interest. That is, pick pagenames that ought to contain entries in multiple languages, even if the meanings of the words in those languages are unrelated. --EncycloPetey 03:46, 21 November 2009 (UTC)
Yes, those are the kind of entries I was talking about. The idea was really just to have some pages where everything immediately around it was just how it should be. I guess that really should include words in other languages that have the same name as the English word. --Yair rand 06:16, 24 November 2009 (UTC)

WT:APR

I decided to go ahead and make WT:APR active. Any feedback is appreciated. --The New Mikemoral ♪♫ 01:40, 16 November 2009 (UTC)

Note the existence of {{rfap}}, which I think is the more conventional way of requesting an audio file for an entry. While I don't know, I'm guessing that the ease of use of that template and its automatic categorization are why WT:APR was {{inactive}} (as redundant).​—msh210 17:20, 16 November 2009 (UTC)
This is more to get audio wanted immediately recorded. I am currently working on clearing the backlog at the category. --The New Mikemoral ♪♫ 06:09, 17 November 2009 (UTC)
I like the idea. I'm not knowledgeable about pronunciation and sometimes have questions that relate to multiple entries, for which the rfp/rfap fora are not exactly right. DCDuring TALK 17:34, 4 December 2009 (UTC)

Renaming of *Topics categories

Per the recent BP and RFDO conversations I was going to edit {{topic cat}} et al. to rename the "*Topics" categories to "All topics". If you have strong feelings that this isn't correct (e.g. if you think it should just be "Topics"), please discuss now before lots of categories are renamed. --Bequw → ¢ • τ 20:04, 16 November 2009 (UTC)

WT:EDIT allows editing of translation table glosses

In order to help try to clear the backlog at Category:Translation table header lacks gloss, I've implemented another feature in WT:EDIT which allows you to modify the gloss without bothering with that horrible edit-page bit [2] and [3]. More detail, transcluded from the talk page is below. Conrad.Irwin 03:07, 17 November 2009 (UTC)

I should perhaps have mentioned that this feature won't work unless you click the "Enable" button that should appear after you clear your cache (ctrl+shift+F5). I wasn't planning to make this available "by default", as I'm not sure non-editors will understand exactly what to do (even after we've fixed all the bugs). Conrad.Irwin 03:09, 17 November 2009 (UTC)

Table Labelling

If you enable this feature, each translation table has a ± in the corner.

  • Clicking on this brings up an edit box for the table's header.
  • Type what you want in, normally the first few words in the definition (without linking or formatting)
  • Click "Preview" - if you are not happy with the result, click ± again - otherwise click "Save" in the top corner

WARNING: This feature has not been well tested in all browsers yet. Please notify me if you have any problems.

Excellent. Very convenient. Now bug number 1: I can't edit Marxist, says "Could not find translation table". Also, I have a question: do you plan to develop this tool to cover cases like this, i.e. add new tables, and ttbcify the translations of the first one? It's unbelievably hard to do by hand. --Vahagn Petrosyan 12:00, 17 November 2009 (UTC)
I thought I had fixed the Marxist bug a while back, I'll look again. I was planning to allow adding new tables (at some point) - I suppose I could try ttbc-ifying old ones, it doesn't look too hard for a computer, but it wasn't an immediate goal. Conrad.Irwin 12:03, 17 November 2009 (UTC)
Maybe once the backlog of Category:Translation table header lacks gloss is cleared out a bit we can start automatically converting {{top}}{{top2}}/{{trans-top}} and {{mid}}{{mid2}}/{{trans-mid}} so those 3-letter templates can be used fully as language codes. Also a quick scan shows that the majority of the entries in this category have a single definition line meaning there's lots of low-hanging fruit. --Bequw → ¢ • τ 15:07, 17 November 2009 (UTC)
Yes, that would be a good idea - I just caught most of the low-hangers from the "b" section of Category:Translation table header lacks gloss. I was able to do 88/120. So, this is only going to cut down the category from ~3000 -> ~1000. The rest needs much more attention, by far the most common remaining problem is that there is one translation table, but multiple definitions - perhaps we could automagically ttbc-ify all tables in that situation (or maybe only those with more than one translation, and ask people who speak the language of that one translation to re-name the table)? Conrad.Irwin 17:26, 17 November 2009 (UTC)
I think we'd lose some valid translations that could be kept if the conversion was done manually. On a page with 2 senses and only one translation box, a human can look through the page history to find when the second sense was added. S/he can then keep all translations up to that point in history as valid for the original sense. Translations added after there were two senses would have to be ttbc'ed of course, but at least we're not dumping them all in there. Maybe this could be automated, but it seems a bit trickier. --Bequw → ¢ • τ 20:21, 17 November 2009 (UTC)
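The triage discussed above (one definition line with one unglossed table is "easy"; several definitions sharing one table need a human or ttbc-ifying) could be sketched roughly as follows. This is only an illustration, not the code WT:EDIT or any bot actually uses: the function name `classify_entry` and the returned labels are hypothetical, and it assumes definition lines start with a single `#` and that an unglossed table appears as a bare `{{trans-top}}` or `{{top}}`.

```python
import re

# Assumption: an unglossed table is a parameterless {{trans-top}} or {{top}}.
UNGLOSSED_TABLE = re.compile(r"\{\{(?:trans-top|top)\}\}")
# Assumption: a definition line is "#" followed by anything except
# "#" (subsense), ":" (example) or "*" (quotation).
DEFINITION_LINE = re.compile(r"#[^#:*]")


def classify_entry(wikitext: str) -> str:
    """Roughly sort an entry into 'nothing to do', 'easy' or 'needs attention'.

    Hypothetical helper for illustration only.
    """
    definitions = [ln for ln in wikitext.splitlines() if DEFINITION_LINE.match(ln)]
    unglossed = UNGLOSSED_TABLE.findall(wikitext)
    if not unglossed:
        return "nothing to do"
    if len(definitions) == 1 and len(unglossed) == 1:
        # One sense, one table: the gloss can be a short summary of the definition.
        return "easy"
    # Several senses (or several tables): which sense each translation
    # belongs to cannot be decided from the wikitext alone.
    return "needs attention"
```

A sketch like this only finds candidates. As noted above, deciding which translations predate a second sense still means looking through the page history.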

Category:Translation table header lacks gloss

It'd be good to catch all the easy-to-fix problems in this category: (well, and all the others too!). To catch the easy ones, you just open up lots of tabs with pages from Category:Translation table header lacks gloss and add a short summary of the definition to the translation table using the ± button enablable above. Conrad.Irwin 17:26, 17 November 2009 (UTC)

  •  !->Z: easy done Vahagn Petrosyan
  • a: easy done
  • b: done the easy ones. Conrad.Irwin
  • c: easy done
  • d: easy done Bequw
  • e: easy done
  • f: done the easy ones​—msh210
  • g: easy done.
  • h: done the easy ones​—msh210
  • i: easy done.
  • j: easy done.
  • k: easy done Conrad.Irwin
  • l: easy done.
  • m: easy done
  • n: easy done
  • o: easy done
  • p: easy done
  • q: easy done Conrad.Irwin
  • r: easy done
  • s: easy done
  • t: easy done.
  • u: all done
  • v: easy done
  • w: easy done
  • x: done by big V
  • y: done by big V
  • z: done by big V

Any advice about how to write a trans table gloss? Simply the definition given, or copy it? Help:Translation gloss?? --Volants 13:50, 19 November 2009 (UTC)

IMO, glosses should be short enough to fit in one line for most screen resolutions, and be particular enough to clearly correspond to a single definition/sense in the parent PoS. It is not required to define the term all over again and when the definition is medium to long in length it is appropriate to simplify it for the gloss. When there's only one sense for a certain PoS, the gloss can be trivial to produce, but as more senses are added one should take care to ensure that the glosses still identify unique senses. If someone else concurs maybe they can add something to Wiktionary:Translations or maybe WT:STYLE. --Bequw → ¢ • τ 14:20, 19 November 2009 (UTC)
I've written my thoughts at Help:Glosses, I think we mainly agree though you emphasise quoting the definition more than I - please feel free to update the help page. Conrad.Irwin 16:53, 19 November 2009 (UTC)

The category is now down to 1,600ish from 2,900ish. Thanks to everyone, and I hope we can get the remaining gaps of "easy done" plugged in the next day or two (it only takes a few tens of seconds per-page, and is most satisfying)! Conrad.Irwin 16:53, 19 November 2009 (UTC)

What do we think about HTML "comment" glosses? For example, pond uses <!‐‐DefAtlOcean‐‐> (on the definition line) and acceleration has <!‐‐DefB‐‐>. I think if we are going to be having people add translation glosses w/o seeing the underlying wikitext (and associated HTML comments) then these "comment" glosses should be converted to normal glosses. --Bequw → ¢ • τ 18:35, 19 November 2009 (UTC)
Yes, I agree. The comment-style glosses are only useful for people editing the entire page (and/or robots) not for readers nor sectional editors. Conrad.Irwin 22:48, 19 November 2009 (UTC)

Now that all the "easy" ones are done and we are down to ~800, I say we convert the {{top}}/{{mid}}s that are in translation sections to {{trans-top}}/{{trans-mid}}. It will allow more people to work on the problem(s). Also, since most of {{top}} ones are "easy" (one sense) after clearing out the easy ones, this category shouldn't be that much bigger. Anyone object? --Bequw → ¢ • τ 05:30, 3 December 2009 (UTC)

I brought this up earlier, because I wanted to help convert all the {{top}}, {{mid}}, and {{bottom}} to their respective translation table templates, but people didn't want that to happen. I am all for it, though, especially now that we have an easy way to update/correct/fix them. Razorflame 05:34, 3 December 2009 (UTC)
Can {{bottom}} be used with {{trans-top}} and {{trans-mid}} without any negative effects? Is it identical to {{trans-bottom}}? --Yair rand 05:38, 3 December 2009 (UTC)
It appears not. It seems that {{bottom}} is not the same as {{trans-bottom}}. Razorflame 05:42, 3 December 2009 (UTC)
{{bottom}} is exactly, 1:1 identical to {{trans-bottom}}. I don't know why one doesn't just redirect to another. -- Prince Kassad 05:43, 3 December 2009 (UTC)
Really? The </div> stuff in {{trans-bottom}} doesn't do anything? Why is it there then? --Yair rand 05:51, 3 December 2009 (UTC)
Hm, good point. It's not displayed by default and easy to miss. -- Prince Kassad 05:54, 3 December 2009 (UTC)
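For illustration, the conversion proposed above could be sketched as a small text transformation that only touches lines under a Translations heading. This is an assumption-laden sketch, not an existing bot: the function name `convert_translation_tables` is hypothetical, it assumes the heading text is exactly "Translations", and it renames {{bottom}} together with {{top}} and {{mid}} so that the three templates stay matched, given the doubt above about whether {{bottom}} and {{trans-bottom}} are interchangeable.

```python
import re

# Old generic templates and their translation-specific counterparts.
RENAMES = {"top": "trans-top", "mid": "trans-mid", "bottom": "trans-bottom"}
TEMPLATE = re.compile(r"\{\{(top|mid|bottom)\}\}")
# Any wiki heading line, e.g. "====Translations====".
HEADING = re.compile(r"^=+\s*(.*?)\s*=+\s*$")


def convert_translation_tables(wikitext: str) -> str:
    """Rename {{top}}/{{mid}}/{{bottom}} only inside Translations sections.

    Hypothetical helper for illustration only.
    """
    out = []
    in_translations = False
    for line in wikitext.splitlines():
        heading = HEADING.match(line)
        if heading:
            # Every heading starts a new section; remember whether it is Translations.
            in_translations = heading.group(1) == "Translations"
        elif in_translations:
            line = TEMPLATE.sub(lambda m: "{{" + RENAMES[m.group(1)] + "}}", line)
        out.append(line)
    return "\n".join(out)
```

Tables in other sections (Derived terms, Synonyms and so on) are left alone, since {{top}} there has nothing to do with translation glosses.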

What's the deal with the logo?

I was surprised to see that you guys are still using the old crappy Wiktionary logo since most of the other Wiktionaries have upgraded. (Especially since the vote was 3 years ago.) When I tried to find out why, I learned that "While there was a consensus to approve the logo on Meta, there is a substantial opposition to changing the Wiktionary logo entirely from, primarily, the English Wiktionary project, and therefore, the Wiktionary logo has not been changed at all." Now there is total chaos on Meta about what to do regarding the Wiktionary logos. (It looks like there are about 20 different proposals with little to no organization.) What's the deal guys? Why don't you just upgrade your logo instead of monkey-wrenching the process for everyone? It would definitely be an improvement over what you're currently using (which looks more like an HTML rendering mistake). Think it over. Kaldari 21:46, 17 November 2009 (UTC)

There was consensus on the English Wiktionary not to use the tile logo. There is not "little or no organization", nor is there any chaos on Meta. The mess on the logo page is completely due to User:Richardb's decision two days ago to wipe half the page and replace it with his own comments. That vote will begin as soon as the rest of the translations are complete. After that, and the vote deciding whether to use the winning logo, hopefully we will have a new logo, which will most likely be a lot better than the "upgrade" of the scrabble imitation logo. --Yair rand 22:10, 17 November 2009 (UTC)
That is bloody slanderous. I did not delete anything. I am NOT, NOT, NOT a deletionist!!!. All I did was
  1. moved all the info about Voting into one section, under voting
  2. researched the history of the matter, and put in a section with a short summary of that history.
  3. put my opinion, that, based on reviewing the history, there was a clear misrepresentation about the outcomes of previous voting. The traditional logo (as used by English wiktionary) had failed to gain much support at all.


I'd appreciate a withdrawal of the slander from Yair Rand. But I won't hold my breath!--Richardb 08:37, 30 November 2009 (UTC)
PS: Far from organising a vote, you couldn't organise a piss-up in a brewery. You have absolutely confused the issue by the way you have handled it. No clarity at all.--Richardb 08:37, 30 November 2009 (UTC)

Uh-huh. It seems from these edits (which were all yours except for an IP saying something about a Corel Word Perfect logo) that you did remove substantial parts of the page and you did replace parts of other people's comments with your own. Furthermore, you added all of your "History and explanation" to the top of the page overwriting what was there before, which is entirely inappropriate. --Yair rand 17:13, 2 December 2009 (UTC)

Which consensus? AFAIK, no vote has been organized here. Lmaltier 19:11, 18 November 2009 (UTC)

From what I can see in the Beer Parlour archives, it was clear long before it could have come to a vote that nobody wanted the tile logo. More recently, a poll was held on Meta, and there were 71 votes to start from scratch rather than use the text logo or the tile logo. The point of whether the tile logo is better than the text logo is now irrelevant, as the logo vote will be starting soon, as soon as German, Japanese, Turkish, Lithuanian and Vietnamese translations of the voting page are added, the Russian translation is completed, and the French and Finnish translations are proofread. --Yair rand 20:59, 18 November 2009 (UTC)
Well, when fr.wikt has organized a vote, its results were very surprising. A few negative comments don't mean that most users are negative. Lmaltier 22:14, 18 November 2009 (UTC)

Wow, what an embarrassment. After 3 years of debate, Wiktionary still hasn't decided whether or not it wants Goatse.cx as its logo. Why hasn't The Register written an article about this yet? Kaldari 19:11, 20 November 2009 (UTC)

Also, what's the point of having a new vote on meta? Won't the results just be ignored again? It kind of reminds me of elections in Burma. Every once in a while they have an election, but since the military dictator never wins the election, they just keep ignoring the results. Kaldari 19:20, 20 November 2009 (UTC)
If you had fully read the voting page you would have noticed the section where it says that "Following [the vote], each language Wiktionary will hold their own vote on whether to accept the winning logo. In the event that less than 60% of the Wiktionaries approve of the logo, none of the Wiktionaries will use the logo." This way, we have a chance at unifying the logos and the result will not be ignored. --Yair rand 19:28, 20 November 2009 (UTC)
Your stupid ad-hominem approach to pushing the process forward doesn't endear you to me. Equinox ◑ 00:48, 21 November 2009 (UTC)
Bit strong there, Equinox. Although I do agree that Kaldari's points are not particularly helpful. The logo hasn't changed because there wasn't a consensus right? That's how the system works. At the end of the day, an attractive logo does not change anything about the project. People use wiktionary because they find it useful, not to marvel at its graphic design. If someone comes up with a logo that everyone really can agree on then all the power to them. But that day hasn't come yet, has it? Tooironic 02:30, 22 November 2009 (UTC)
Right! By looking at those hundreds of kilobytes long "discussions" on meta on whether it's better to use logo with a line over here or a tile over there, one begins to wonder whether those people have anything better to do in their lives. WMF should hire a professional designer and get this "problem" over with, so that everyone can waste their energies more constructively. Building a "consensus" over sth that isn't the problem in the first place (picking a favorite logo is really the same as picking a favorite flavor of ice cream) will inevitably leave most of the voters dissatisfied, esp. when choice needs to be made from 30+ proposals. With these pointless votes, methinks meta is slowly turning into a giant self-purposing bureaucratic machinery, losing touch with the real world. FWIW, I'm perfectly satisfied with our "embarrassing crappy old logo" and more concerned with nailing down the 10k lemmata missing on Wiktionary that I have on my TODO list. --Ivan Štambuk 02:52, 22 November 2009 (UTC)
"More recently, a poll was held on Meta, and there were 71 votes to start from scratch rather than use the text logo or the tile logo." In my research of the history of this matter, I did not come across this. I did however come across a reference to something like a a vote on English Wiktionary Beer Parlour, a vote of 71 votes to retain the traditional logo. Can you please give a link to your sources, so we can add that in to the history of the "logo project".--Richardb 08:37, 30 November 2009 (UTC)
The poll on meta is here. There never was a poll in the Beer parlour on the logo; I think it was assumed to be unnecessary. --Yair rand 17:13, 2 December 2009 (UTC)

Missing categories

I can't help noticing that the necessary templates like {{en-noun}} or at worst {{infl|en|noun}} (usually for less common languages) are missing from a lot of words. acceleration due to gravity (currently at rfd) is in the physics category, but not English nouns (or English anything). Entries with context tags like this, or rfc tags don't get listed in Special:UncategorizedPages (corrected link Mglovesfun (talk) 12:16, 21 November 2009 (UTC)), which means they are harder to find. Mglovesfun (talk) 22:23, 19 November 2009 (UTC)

For Spanish at least there is Category:Spanish entries lacking inflection, but I don't know if any other languages have this. Could entries without {{infl}} or a standard inflection template be tagged by Autoformat? Nadando 22:33, 19 November 2009 (UTC)
I think what you want is Special:WantedCategories. Besides Special:SpecialPages (on the left frame) Ullman, Hippiebot, and perhaps Conrad may have subpages that contain useful problem-entry lists. CM's haven't been run recently AFAICT and may not be runnable or may have been replaced. There may be more. DCDuring TALK 00:19, 20 November 2009 (UTC)
Wanted categories are for red linked categories. I'm talking about English nouns that aren't in the category English nouns (as an example). Mglovesfun (talk) 12:16, 21 November 2009 (UTC)
User:Conrad.Irwin/English_nouns_without_categories is a list of all 7613 entries with ===Noun=== under ==English that aren't in Category:English nouns, Category:English plurals, Category:English alternative spellings, Category:English misspellings, Category:Misspellings, Category:Alternative spellings. It paints quite a depressing picture, containing things like tastier and antecedently which aren't nouns at all. :(. Conrad.Irwin 14:36, 21 November 2009 (UTC)
Indeed, and that's just the nouns in English. I find verbs and adjectives as well, of course. Mglovesfun (talk) 11:36, 23 November 2009 (UTC)
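A list like the one described above amounts to a set difference: entries with an English Noun heading, minus entries that sit in at least one of the expected categories. The sketch below shows the idea on made-up in-memory data. It is not how User:Conrad.Irwin's list was actually generated; the function name, the input shape and the sample entries are all hypothetical.

```python
# Categories named in the discussion above as acceptable homes for an English noun entry.
EXPECTED_CATEGORIES = {
    "English nouns",
    "English plurals",
    "English alternative spellings",
    "English misspellings",
    "Misspellings",
    "Alternative spellings",
}


def nouns_without_categories(entries):
    """Return titles that have an English Noun heading but none of the expected categories.

    `entries` maps a page title to a pair:
    (has_english_noun_heading, set_of_category_names). Hypothetical input shape.
    """
    missing = [
        title
        for title, (has_noun_heading, categories) in entries.items()
        if has_noun_heading and not (categories & EXPECTED_CATEGORIES)
    ]
    return sorted(missing)


# Made-up sample data for illustration only.
sample = {
    "apple": (True, {"English nouns", "Fruits"}),
    "acceleration due to gravity": (True, {"Physics"}),
    "quickly": (False, {"English adverbs"}),
}
```

With the sample above, only "acceleration due to gravity" would be reported: it has a noun heading and a topical category, but no part of speech category.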

Wiktionary:Requests for cleanup#Category:United States of America

A debate over on RFC concerning whether to use Category:United States of America or Category:United States. Result should affect other topical categories using one or the other of the two ways of referring to the country by name. — Carolina wren discussió 22:40, 20 November 2009 (UTC)

What's in a name?

Currently Category:Names is a topical category that exists betwixt and between our topical categories and our part of speech categories in terms of naming. There's the subcategories Category:Given names and Category:Surnames that use the part of speech category naming system (i.e, French given names and French surnames) and Category:Demonyms and Category:Place names that use the topical category naming system (i.e, fr:Demonyms and fr:Place names). It also has the topical category Category:Onomastics as its sole parent. There are also a few anomalous categories:

So, what to do? First off, while related, a good case can be made that Demonyms aren't Names per se. Unlike Given names, Surnames, or Place names, they aren't proper nouns and in English at least, generally do double duty as adjective and noun. So I recommend changing the parentage of Demonyms by replacing Names with Onomastics. That's easy enough to do (or undo) and if it were the only recommendation I had, I'd likely go ahead and do it without bothering the parlor. However, that leaves us with what to do with place names.

Changing Category:Place names and its subcategories over to the part of speech category naming system would be a lot of effort, and we'd end up with some extremely verbose category names such as English names of states of the United States for some of the subcategories. Plus, unlike the given names and surnames which combine to form a compound name that is used without commas (at least in English), place names generally aren't used to form compound names, so it can be argued that structurally they act differently. So here's what I'd like to propose:

  1. Change Category:Demonyms from having Category:Names as a parent to having Category:Onomastics.
  2. Change Category:Place names from having Category:Names as a parent to having Category:Onomastics.
  3. Create a new category that would follow the part of speech category naming convention: Category:Personal names.
  4. Change Category:Given names from having Category:Names as a parent to having Category:Personal names.
  5. Change Category:Surnames from having Category:Names as a parent to having Category:Personal names.
  6. Delete Category:Names and include in the descriptions of Category:Onomastics and Category:Personal names the relationship between the two categories.

The reason I want to delete Category:Names is that I feel that the category name is ambiguous, since apple is the name of a type of fruit, etc., plus once Demonyms and Place names are moved to Onomastics, elements used to form personal names would be all that are left in the category.

It might be worth renaming Category:Place names to Category:Toponyms to reduce potential confusion if Category:Names is retained, but it is not essential to the proposal, and renaming can be considered as part of the Gazetteer proposal that has been floating around these discussion pages. — Carolina wren discussió 00:46, 21 November 2009 (UTC)

This is a good proposal. Category:Names , and all its topic and POS forms in various languages (like Category:fr:Names and Category:French names) are confusing and do more harm than good. That is why I gave the "Xxxn given names/surnames" categories "Xxxn language" as the parent when I created them a year ago. Only those categories with the new Template:namecatboiler have "Xxxn names" as a parent. Why not change it into "Category:Xxxn personal names"? This template does not include Category:Given names/Surnames by language either, though they would seem essential to me.
The Australian, German and Irish categories are accidental left-overs from the topic category days, and should be deleted. I would keep Jewish surnames ( but not Jewish names) and Indian names, in order to direct all the anon Indian contributors into correctly formatted entries.
If I were you I wouldn't hurry changing "Place names" into "Toponyms". We'll probably have a hundred more discussions about the CFI for place names, so anything could happen.--Makaokalani 15:24, 23 November 2009 (UTC)

Unprotect WT:CFI

My argument would be:

We don't actually use the criteria listed there, because we can't! They're too vague. I can pick apart almost every sentence and show how vague it is. The "names of specific entries" bit has been cited ad nauseam, so how about "idiomaticity"?

An expression is “idiomatic” if its full meaning cannot be easily derived from the meaning of its separate components.
For example, this is a door is not idiomatic, but shut up and red herring are.
Compounds are generally idiomatic, even when the meaning can be clearly expressed in terms of the parts. The reason is that the parts often have several possible senses, but the compound is often restricted to only some combinations of them.

It does actually say "expression", which we interpret as "more than one word". I think words with hyphens and apostrophes are ambiguous. For example rod-shaped, one word or two? don't, one word or two? What about l'ai or l'appelle in French? If those are one word, there's no doubt they can be attested. llámame in Spanish, is that one word or two?

I'm not convinced that expression does mean "more than one word", that's just the Wiktionary norm. I'd quite like a 24 hour period to edit WT:CFI, just because I don't think anyone actually knows what it means, or uses it seriously! Mglovesfun (talk) 12:32, 21 November 2009 (UTC)

Wiktionary:Editable CFI was set up to deal with these issues, one presumes that the proponents thereof will eventually instigate proceedings to have the new version become authoritative, as and when the bugs have been removed. I personally would prefer a much less prosaic set of inclusion and exclusion rules against which words can be argued to match, with the rules being updatable by discussion at WT:RFD/WT:RFV, but I'm not sure how well this would work in practice. Conrad.Irwin 13:04, 21 November 2009 (UTC)
I'm tempted to put {{rfc}} at the top of it, but I suspect I'd get a block for that. Whatever happened to Visvisa proposing some stuff to vote on? Or at least, that was the idea I had in my head. Mglovesfun (talk) 12:05, 22 November 2009 (UTC)
If you have a specific change you'd like to see, then by all means propose the specific change, have the discussion, and possibly put it for a vote. A 24 hour free-for-all of editing one of our core policy pages is a bad idea. The page is protected to keep more people from getting themselves in trouble for editing a page that tells them in a big banner they don't see at the top not to do so. --EncycloPetey 15:52, 22 November 2009 (UTC)

Let's talk about sex, baby!

I don't think that Category:Sex and Category:Sexuality are redundant to each other, but they most definitely need some clean up. I also stumbled on Category:Sexual deviance which seems a bit POV. It might need renaming, but then again, to what? Mglovesfun (talk) 12:53, 21 November 2009 (UTC)

There are things that belong in cat:sexuality (like sexual orientations) that don't belong in cat:sex. Category:Sexual deviance should definitely go, though. I'm gonna rfdo it. — [ R·I·C ] opiaterein — 17:15, 21 November 2009 (UTC)
What about stuff like Mile High Club, is that sex or sexuality? Certainly anything biological should go in sex. AFAICT stuff like penis and vagina should be in there, I'll have a quick look (so to speak). Mglovesfun (talk) 21:56, 21 November 2009 (UTC)
I would say the Mile High Club should go in sex, but I can see how it could go into both. I think it best to reserve sex for sexual activities, aids, toys, etc. while sexuality should be for sexual preferences, gender identities, things like that. — [ R·I·C ] opiaterein — 22:17, 21 November 2009 (UTC)
Suits me. Mglovesfun (talk) 22:24, 21 November 2009 (UTC)

Logo Vote

The Wiktionary logo vote is set to start 2009-12-07 00:01. The first round will continue until 2009-12-31 23:59 at which point the second round will last until 2010-01-31 23:59. Anyone who understands a foreign language that the voting page has not been translated into, please consider contributing a translation. Other Wiktionaries still need to be notified about the vote, so please help post messages into other Wiktionary Beer parlours (or equivalent). Thank you. --Yair rand 01:48, 22 November 2009 (UTC)

There are at least four discussions of the logo on this beer parlour alone. But you make no effort to connect them together, to put this announcement into the other discussions. And at the top of every Wiktionary page there is a link [Discuss new logo proposals for Wiktionary.], but you don't even mention this new voting schedule prominently on that page!!!! OK, way down the page I could eventually find a minute reference - I have added the schedule to the voting pages. --Yair rand 23:49, 21 November 2009 (UTC). Not even a sodding link to the voting pages! You need to do serious publicity if you want this process to have any credibility.
Again I ask, by what authority, and for what reason, are you running this vote (or rather, attempting to run it) ? It seems clear to me that you are pushing this all on your own, making lots of unilateral decisions, without ever once declaring why ? Please explain yourself.
Before your sudden pronouncement about the latest voting schedule, your last comment on the matter that I can find on Beer Parlour was "It is clear that this discussion is getting us nowhere. This apparently pointless debate is now over. --Yair rand 15:02, 26 October 2009 (UTC) " On that we can agree!--Richardb 09:36, 30 November 2009 (UTC)
The schedule was suggested by Conrad.Irwin on the meta discussion page, it's right above the comment I added about adding the schedule to the voting page. The debate on the beer parlour that I managed to end was about whether it matters that the discussion was started on Meta, rather than here. An extra link to the voting page on the meta discussion page was not needed as they are sprinkled throughout the page. There's a link not only at the top of the page, but right above the current discussion under the words "Current Status". Further publicity on Meta isn't needed; the Wiktionaries are being notified through their Beer parlour equivalents. If you would like, for some reason, to post the schedule and a link to the voting page once again on meta, feel free. And no, I am not "running" this vote, even though it may seem like that as I might be the most active contributor to the vote; this is a wiki system. And what do you mean by "authority"? The reason for the vote is that 71 who voted for having a vote want it. --Yair rand 16:52, 30 November 2009 (UTC)

Subidioms in the inflection line

I believe that this:

"to give what for"

shows, in concept, how multi-word entries that include idiomatic components should appear in the inflection line. One desirable alteration might be a fainter-appearing underline or perhaps simply an underline between "what" and "for". A faint underline could appear under all elements of multiword terms or even all terms if that were simpler or more resource-thrifty to implement.

I expect that HTML in the inflection line is really bad, if not explicitly forbidden. So I think this needs some technical support if we agree that this is a good idea.

This particular idiom illustrates the need fairly well. Linking to each individual word gives users no clue about the construction of this and may lead them to look to insert a question mark or to add an object to the preposition. DCDuring TALK 13:11, 22 November 2009 (UTC)

Help:Writing definitions

I suggest a help page like Help:Writing definitions/Help:Defining terms/Help:Definitions, to give advice on good ways to go about writing good definitions. AFAICT, we don't have such a page, and there are a handful of very good definition writers here and a couple of very good help-page writers too. --Rising Sun 17:39, 22 November 2009 (UTC)

I sure could use a page like that. DCDuring TALK 18:18, 22 November 2009 (UTC)
I've started a page with three central principles (which can eventually be expounded upon with examples both right and wrong). I've also hinted at two areas I think ought to be included in the page, but which will require quite a bit of work to assemble. --EncycloPetey 05:32, 23 November 2009 (UTC)
It occurred to me as I've been writing, that the results look more like a page for the Wiktionary: namespace than for the Help: namespace. The Help: namespace is usually for technical issues, and I can't think of very many that apply to definition writing (other than the initial hash, no blank lines between defs, context tags). Should we move/redirect the page to Wiktionary:Writing definitions? What do other people think? --EncycloPetey 06:36, 23 November 2009 (UTC)
The Wiktionary tutorial is badly in need of a rewrite. I suggest that this be made into a section of the tutorial. Right now all we have in it is basically a copy of the Wikipedia tutorial which is mostly irrelevant to Wiktionary. This would be a good starting point for redoing the tutorial. --Yair rand 06:09, 23 November 2009 (UTC)
Yes, and no. The tutorial is supposed to teach the basic mechanics of a wiki, and not teach style or dictionary-specific skills. --EncycloPetey 06:36, 23 November 2009 (UTC)
Oh. Never mind, then. --Yair rand 06:44, 23 November 2009 (UTC)

Category:Lojban language

It would be rather rare to nominate every word in a language for cleanup, but Lojban needs it. The definitions are written in a Lojban style that I can't understand at all. This needs some sort of communal decision. Btw is Lojban actually used as a language? I suspect most constructed languages appear more often in dictionaries than in texts. Mglovesfun (talk) 17:47, 22 November 2009 (UTC)

Lojban makes for a very sticky situation here. We need to have the entries written in English and they need to be as comprehensible as possible, but unfortunately this is virtually impossible to do. With the languages that have the same parts of speech as English, it's very simple to explain because English words exist for those parts of speech and definitions. What are we supposed to do with a language with completely foreign concepts that English words aren't really well-suited for? How do we define a word for which we can't simply say "to do x", "a x", "having x characteristics", "in a x manner" or use English parts of speech? I can't think of any solution better than to use what we currently have and assume anyone reading this has some understanding of the language. (And I don't have a clue whether Lojban is actually used as a language but I suspect the folks over at the Lojban Wikipedia and Wiktionary use it to some extent.) --Yair rand 05:31, 23 November 2009 (UTC)

ttbc (Translations to be checked) tags on translations

Please refrain from flagging translations with "ttbc" lightly, if you make any changes to the English entries, especially adding/modifying senses. Please respect the translators' work! They may not be available any more. I personally find it annoying and discouraging. It's a lot easier to change the English entry than to find and fix the translations into other languages! Anatoli 01:31, 23 November 2009 (UTC)

I don't know when this entry: that is going to be checked. I think there are nicer ways of handling the situation with new senses in the English entry. --Anatoli 01:38, 23 November 2009 (UTC)
What would you recommend? DCDuring TALK 03:52, 23 November 2009 (UTC)
The original translations (the first that appeared in the entry) were the translations of the most common or default sense of the word, in this case "connecting noun clause" (that). If you can't contact the translators to verify, be brave and leave in the original sense. My point is, the translators have already taken the effort to "check" those translations and added their translations, and they do not necessarily "watch" this entry or have the time or desire to look at it again. Additional senses in translations may be left blank or with trreq tags (translation requests). Careful examination of the original entry may save having to redo a collective effort. --Anatoli 04:01, 23 November 2009 (UTC)
Those who move these words lightly to ttbc sections should be mindful that some languages have very few or no active contributors and may never be reviewed. Taos, for example, is a finished project, and it is unlikely that anyone will ever add any more. What we have now is all we will ever have for that language, since the Taos speakers do not want it to be written down or published and will not contribute. If Taos words are moved to ttbc, they will remain there forever. (Likewise, if anyone messes with Taos contexts, categories, or templates, they should be very careful, because nobody else is ever going to clean up after them.)
You should be careful about using ttbc, especially for minority languages. I think anyone who moves words to ttbc should also be heavily involved in the checking and restoring of at least one of the languages. That is the only way that you can have a sense of the job that you are creating. If you don’t care to check ttbc translations yourself, don’t expect others to do it for you. —Stephen 08:17, 23 November 2009 (UTC)
Unfortunately, English is not a finished project either. In one common situation English senses that are best separated are initially combined. When they are separated what should be done?
And, of course, there are many other kinds of errors in the supposed main definitions. The entries for the non-English words are often not much help because they are usually one-word glosses rather than full definitions (by policy as I understand it). They are sometimes glossed with obsolete terms or with highly polysemic English words. It could well be that translation should not be commenced for an entry until it meets some minimal quality standard in terms of having senses that are distinct, with usage examples that correspond. Without some kind of process to note entries and senses that are "ready" for translation (which noting may be wrong or simply changed because this is a wiki), there will be many cases where English entry changes make the glosses in the translation tables no longer match the senses. Should a new "trans" be set up that does correspond? How should the no-longer-corresponding transtable be marked? DCDuring TALK 11:19, 23 November 2009 (UTC)
The ttbc tables' main purpose now seems to be "we don't know which table this translation should be in" rather than "we don't know how accurate the translation is", perhaps it would be better to reserve a translation gloss (something like "Other translations") which could be used for this situation - with a short hat-note much as ttbc already has to explain to anyone who does know how to fix the problem. Conrad.Irwin 13:27, 23 November 2009 (UTC)
Is the problem really with the use of {{ttbc}}, or is it with the moving down to the "checktrans" box? Personally, when I radically alter a sense, I'll frequently tag the citations with {{ttbc}}, but leave them in place; if they're still fine, then it seems like very little work for a translator to remove the tag. Is that still problematic? —RuakhTALK 15:07, 27 November 2009 (UTC)
It is not the use of {{ttbc}} per se, but the moving down to the "checktrans" box. When the words are contributed, they usually have the correct meaning for the existing definition. When you move words to checktrans, it strips them of all sense and syntax and they cannot be restored or used in any way unless a knowledgeable speaker restores them. For languages that have no or almost no contributors, it is tantamount to deleting the words in question. —Stephen 15:57, 27 November 2009 (UTC)
Understood. Robert made a change to Tbot (or Autoformat?) a while back that makes it support the use of {{ttbc}} within the regular translations tables, so people should probably just take advantage of that when at all possible. —RuakhTALK 21:55, 27 November 2009 (UTC)
BTW, the glossless translation table I just found at surely reminded me that almost all of the {{checktrans}} insertions that I have done have been cases where there was no gloss. Translations sometimes predated additional senses, but in other cases translators blithely added translations to multi-sense words that did not have translation tables for each individual sense. I seem to recall that it has taken several senior contributors some time to make a serious dent in the number of glossless trans tables that remained until recently. If we add to that the translation tables using some variant of {{top}}, which seem to have glosses less than half the time, it will be quite some time before this problem is behind us. I also wonder why folks bother to translate any entries that have the Webster 1913 warning or otherwise show serious signs of obsolete wording (eg, only literary usage examples in the Webster's format). They do not meet the most basic standards of being satisfactory for translations that will not have to be subsequently reviewed and the work seems likely to be harder and less fruitful.
The best hope we have is the liberal use of {{trans-see}}. DCDuring TALK 01:30, 28 November 2009 (UTC)

Plurals vs. nouns forms

As pointed out, one of the stickier issues on the Wiktionary. For example I've just now discovered Category:French noun forms which AFAICT contains only plurals, with about 10 exceptions. I think that the Catalan and Spanish noun form categories are also up for deletion, right? As pointed out by Carolina wren, we don't have anything close to policy on this? FWIW I think it's pure bureaucracy, because I don't think anyone uses these categories to look for words. What do other people think? Mglovesfun (talk) 11:34, 23 November 2009 (UTC)

Well, in French there are feminine noun forms of words like gardienne and tueuse, although I don't think they are classed as "noun forms" by any authority. I (along with, I assume, other French contributors) would be happy to get rid of this category, sticking with the Category:French plurals category. --Rising Sun 20:30, 23 November 2009 (UTC)
Yeah but paper dictionaries don't classify these at all, and most online dictionaries just redirect to the masculine singular. So we have something of a "carte blanche" to do what we want. Opi, Rising Sun and I are happy to delete Category:French noun forms and sort into the two categories above. Who says no? Mglovesfun (talk) 11:55, 24 November 2009 (UTC)
I might have said no at one time, but I no longer see the merit in this category. Delete. —Internoob (Disc.•Cont.) 23:09, 24 November 2009 (UTC)
Yes, these categories for Western Romance languages are ridiculous... nouns that have corresponding masculine and feminine forms like gardien and gardienne are in the minority. To compare languages that have a majority of nouns with two forms, one being the lemma, to languages like Russian (12 forms), Lithuanian (14 forms) and Hungarian (which has well over 30 forms) is just silly. I actually put Category:Spanish noun forms and Category:Catalan noun forms on WT:RFDO a while back, but that never went anywhere. — [ R·I·C ] opiaterein — 00:29, 25 November 2009 (UTC)
Okay, see Wiktionary:Requests for deletion/Others#Category:French noun forms. Mglovesfun (talk) 11:41, 25 November 2009 (UTC)
@R·I·C - It's more silly having a category named Fooian plurals that applies only to noun plurals and not adjective plurals in the Romance languages. However, I have no objection to having a properly named category for noun plurals, and a category for the feminine singular noun forms of those nouns that have two distinct gender forms, with the noun form category itself either deleted or used solely as a holding category that should be empty save for the two subcategories. — Carolina wren discussió 02:53, 27 November 2009 (UTC)
What about a category like Category:Catalan nouns with both masculine and feminine forms? --EncycloPetey 03:01, 27 November 2009 (UTC)
What I'm interested in is a simple category for words like anglesa. The masculine form, anglès, has two common noun senses. In addition, anglesa is also the adjective form of the male lemma, so to me it makes sense to mark it as a feminine form. That's to avoid a third repetition of the shared meanings (The Valencian masculine singular is anglés), so that it doesn't go into the index which an independent noun entry would, and because of the parallelism with the adjective form. Also, the name you proposed sounds more like something for nouns like bèstia which take either gender (and hence either masculine or feminine adjectives, articles, or pronouns) but have no change of form. Dual gendered nouns, either with (anglès / anglesa), or without (bèstia) separate forms are typical for animate nouns in Catalan. The relative rarity of nouns with dual gender is due mainly to the fact that most nouns are not animate. There needs to be a category for noun forms like anglesa, with my own preference being something like Category:Catalan feminine singular noun forms or even Category:Catalan singular noun forms if we were to treat word pairs such as cabra / cabrot as having a feminine lemma and a masculine noun form. (I don't favor treating cabrot that way because I think that those few nouns that don't use the -∅/-a paradigm of the adjective masculine/feminine forms shouldn't be classified as having forms.)

requesting a bot for manual of style enforcement.

Can I suggest we create a bot to remove the "----"? First of all, the correct way to separate things in either Wikipedia or Wiktionary is to use == title ==; these markers will be shown in the Table of Contents.

The division lines don't make the contents any easier to read. Users just use them to stylize whatever dictionary terms they are interested in, and the abuse is becoming so bad that it degrades Wiktionary as a formal dictionary, never mind launching projects such as Visual Thesaurus, etc. Per Google Search (1.97 million results), each page has about 7~8 abusive uses, which is way over 60%.

some of the horrible usage can be seen below.

--75.154.186.99 18:10, 23 November 2009 (UTC)

See WT:ELE and User:AutoFormat, the first is our layout policy, the second enforces it. Conrad.Irwin 18:13, 23 November 2009 (UTC)
The reasoning behind the lines is simple, if people want to extract a language section from the dump, they have to merely look for ==English==.*(----|$). Without these, they have to scan the whole thing linearly to find the next language, which is much slower and much more complicated. It is a matter of personal taste whether the entries look better or worse without them, as we've been using them on every page for many years, I think a more convincing reason than "I don't like them" is called for. Conrad.Irwin 18:34, 23 November 2009 (UTC)
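For anyone wanting to try the extraction described above, here is a minimal Python sketch. It assumes the page wikitext is already in a string, that language headers are written exactly as ==Language== on their own line, and that sections are separated by a line containing only ----. The function name extract_language_section is made up for illustration and is not part of any existing tool.

```python
import re
from typing import Optional


def extract_language_section(wikitext: str, language: str) -> Optional[str]:
    """Return the ==language== section of an entry, or None if it is absent.

    Assumption: sections end at a line consisting only of '----'
    or at the end of the page text.
    """
    pattern = re.compile(
        # Level-2 header for the requested language on its own line ...
        r"^==" + re.escape(language) + r"==[ \t]*\n"
        # ... then everything (lazily) up to the next '----' line or end of text.
        r"(.*?)(?=^----[ \t]*$|\Z)",
        re.DOTALL | re.MULTILINE,
    )
    match = pattern.search(wikitext)
    return match.group(0).rstrip() if match else None


# Hypothetical two-language entry, for illustration only.
SAMPLE = (
    "==English==\n===Noun===\n# a thing\n\n"
    "----\n\n"
    "==German==\n===Noun===\n# ein Ding\n"
)

if __name__ == "__main__":
    print(extract_language_section(SAMPLE, "English"))
```

Without the ---- lines, the lookahead would have to recognise the next level-2 header instead, which is the extra scanning work mentioned above.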
I can't see anything horrible about this. Does this appear in WT:ELE? I think not, so keep them. Mglovesfun (talk) 11:57, 24 November 2009 (UTC)

They're, um, lines. Call me crazy, but I think we have better things to worry about as Wiktionarians than header dividers. Tooironic 09:26, 26 November 2009 (UTC)

Uh, I don't think we even have a "manual of style". --Yair rand 06:32, 30 November 2009 (UTC)

Cleanup project

Is there any single page for discussing specific cleanup projects: orphaning templates and categories that have failed RFD, correcting categories, templates, etc.? Does this exist? If not, surely it should. But under what name? Mglovesfun (talk) 14:06, 24 November 2009 (UTC)

No. When someone starts such a project, they create their own page for it and announce it in the WT:BP, but there is no other central place used for that. Such projects come along so rarely, that it hasn't been worthwhile to have another page for them. There is also a Category:Wiktionary Projects, but it doesn't seem to be much used. --EncycloPetey 15:48, 24 November 2009 (UTC)
I went for Wiktionary:Cleanup and deletion process/Requests. Please anyone, add stuff from your "to do" pages so everyone can see. Mglovesfun (talk) 17:27, 24 November 2009 (UTC)

misspellings

I thought we used to have a policy not to include misspellings. Did that change, or was it never the case? —scs 17:14, 26 November 2009 (UTC)

I think about this quite a lot. We include "common misspellings" but we have no criteria to define what that is, meaning that almost anything can be a misspelling of something. Right now, anything that goes on WT:RFD is just a pure vote. Mglovesfun (talk) 17:32, 26 November 2009 (UTC)

Annoyances: "This page has been deleted" for capitalised words

I use OneLook as my dictionary portal. It gives a potted definition and a list of urls to the word in a collection of actual dictionaries. When the potted definition isn't sufficient then I usually use the Wiktionary link because the other dictionaries suffer from bloat, ads or other deficiencies.

Unfortunately there's a problem with Wiktionary that makes it very annoying to access. This is the fact that OneLook uses capitalisation for the word that they reference and Wiktionary shows a "This page has been deleted" entry.

Why is this a problem? Because I don't care two hoots that a capitalised word's page has been deleted. I don't want to stop there and read it, the same text (bar the word in question) for every reference from OneLook. What I do want is to go straight to the definition page. I don't mind if the page has a "capitalised: Redirected from Capitalised" line, I just don't want to have to be pointlessly told that a page has been deleted.

Should OneLook fix this? No, because there will be cases when Wiktionary does have an actual entry for the capitalised word, something that OneLook cannot be expected to know.

I would be very pleased if you would delete "This page has been deleted" from the lookup process. Hopefully, if you consider the amount of time that these pages get in the way versus how often they serve a useful purpose (ie. one that someone would thank you for) then you'll agree to the need.

Thank you, on behalf of all OneLook-referred users.

That isn't technically possible, since it's part of the MediaWiki software for all projects including Wikipedia. Yes, there will be capitalized entries, but these will often be the German noun. If OneLook relies on matching spelling, but does not deal with capitalization, then users will have to stop and read a German entry when looking for English. This really is a OneLook issue, not a Wiktionary issue. --EncycloPetey 19:00, 26 November 2009 (UTC)
I'm not sure that you understand what I am asking. Going to, say, a German word that is capitalised is not the problem; not once has this occurred. What happens is that the page that Wiktionary presents is one saying that the prior capitalised entry has been deleted and giving a link to the uncapitalised word. This repeatedly hinders access to the target definition yet I have yet to observe any useful purpose in being informed that a page has been deleted. My request is that this unnecessary obstruction be removed. 78.151.155.41 13:28, 29 November 2009 (UTC)
I understood your question, but you don't seem to have understood my response. As I explained at the outset of my previous response, that isn't technically possible. Wiktionary has no control over that aspect of our software. MediaWiki controls that; we don't. The problem must be dealt with at either the level of MediaWiki (who develop the software for Wiktionary, Wikipedia, Wikisource, etc.) or it must be dealt with by OneLook. We can't make the kind of change you are asking. The simplest solution is for OneLook to recognize that there is a difference between lower-case and capital letters. --EncycloPetey 15:04, 29 November 2009 (UTC)
Why can't we just put REDIRECTs in, instead of saying that the prior capitalised entry has been deleted and giving a link to the uncapitalised word ? --Richardb 23:19, 29 November 2009 (UTC)
Because it's against the redirection policy at Wiktionary:Redirections, which should explain why it is unwise to use them that way. Equinox ◑ 23:27, 29 November 2009 (UTC)
It may be against the redirection policy that you have in your memory, but it's not against the policy as recorded at Wiktionary:Redirections. That states:-
  • .... Work could redirect to work, although this is unnecessary.
  • .... leaving a redirection for external links (such as those from other language Wiktionaries or Wikipedia, or mirror sites.)
So clearly we can have a redirection, and clearly it allows for redirections for links from external sites.--Richardb 06:26, 30 November 2009 (UTC)
But, surely, the question is, since "If one enters the uppercase word in the search box, the software automatically redirects to the lowercase article (unless the uppercase exists)", then why do we have the page there with a deletion message? Just get rid of the page fully, and then it will automatically redirect. The deletion message is of no use, and is a definite hindrance.--Richardb 06:26, 30 November 2009 (UTC)
I will email the OneLook guys again, and see if they can fix it on their end. The reason they capitalize everything like that is because Wikipedia does, the list Connel generates for them is appropriately capitalized. They just need to be told that we don't do it the same way. - TheDaveRoss 00:20, 30 November 2009 (UTC)
unindent, for what it's worth, for those with javascript visiting http://en.wiktionary.org/wiki/Work will redirect to http://en.wiktionary.org/wiki/work?rdfrom=Work which contains a link to http://en.wiktionary.org/wiki/Work?redirect=no to turn off this behaviour. The best solution would be for everyone who wants to link to us to link: http://en.wiktionary.org/wiki/Work which will work no matter what case Work has. Conrad.Irwin 12:55, 30 November 2009 (UTC)
Are there good reasons why this is not part of the default? Obviously, if it is a OneLook only problem and OneLook can and will fix it, then any resource cost or implementation risk is probably not worth it. But is this an indication of a more widespread problem? DCDuring TALK 15:59, 30 November 2009 (UTC)
Most wikis are not case sensitive (for a very good reason :p) so it is only a concern on wiktionaries. This javascript behaviour is the default on en.wiktionary for all non-existent pages where an entry exists at {{lc: {{PAGENAME}} }}, {{uc: {{PAGENAME}} }}, or {{ucfirst: {{PAGENAME}} }}, as we can't easily query other case combinations. This exists only for those who follow broken links to the site (and note that most sites, when they re-arrange content, simply leave 404 messages behind; at least ours tell you of the existence of the right page, and visit it automatically if you have javascript). The expected user behaviour is to use the search box and not the url bar, which doesn't have this problem at all. Conrad.Irwin 16:06, 30 November 2009 (UTC)
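To make the lookup order concrete, here is a rough Python sketch of the idea. It is not the site's actual JavaScript: the function names are invented for illustration, the set of existing titles stands in for a real page-existence query, and Python's lower(), upper() and first-letter capitalisation are only assumed to approximate {{lc:}}, {{uc:}} and {{ucfirst:}}.

```python
from typing import Iterable, List, Optional


def case_variants(title: str) -> List[str]:
    """Candidate titles in the order lc, uc, ucfirst."""
    return [
        title.lower(),                   # roughly {{lc:PAGENAME}}
        title.upper(),                   # roughly {{uc:PAGENAME}}
        title[:1].upper() + title[1:],   # roughly {{ucfirst:PAGENAME}}
    ]


def resolve_title(title: str, existing: Iterable[str]) -> Optional[str]:
    """Return the page to show for a requested title, or None.

    An exact match always wins, so a real capitalised entry
    (for example a German noun) is never redirected away from.
    """
    pages = set(existing)
    if title in pages:
        return title
    for candidate in case_variants(title):
        if candidate in pages:
            return candidate
    return None  # other mixed-case combinations are not tried


if __name__ == "__main__":
    print(resolve_title("Work", {"work"}))
```

Only three variants are checked, so a link with some other mix of upper and lower case still lands on the "page does not exist" message.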
I was asking a question which I now realize you had already answered. The answer to the question I was trying to ask is: The default js already effectively directs users to an existing entry with different capitalization if the user's/portal site's capitalization does not yield an entry. Then can we infer that the user with the complaint is one of those without Javascript? Do we have information about what portion of relevant Web usage is via browsers without JS? DCDuring TALK 16:50, 30 November 2009 (UTC)
The chance of people using a browser with javascript turned off is very small; I'd extrapolate that around 99% have it enabled, from [4] and [5]. Given that many large companies don't have the patience/time/money for supporting javascript-less users (Google, Flickr, MySpace, though Facebook recently added support) I have no concerns that we are being overly discriminating in making them click a whole extra time (particularly given that they are following a broken url from another site). Conrad.Irwin 23:36, 30 November 2009 (UTC)
All three computers in this house have js disabled by default, it is the only way to browse. Anyhow this isn't our problem at all, the problem is incorrect URIs generated by third parties. - TheDaveRoss 20:49, 4 December 2009 (UTC)

A dictionary is not for punctuation

The West Frisian word ús has two articles. The other one is for Us. This second article is unnecessary; it is not a separate word, and has no different meaning. It seems to exist only to show people that, in West Frisian, diacritics are usually not written on capital letters. Going by that, each West Frisian word that begins with an accented vowel would need a separate entry for its capitalized form, which is wholly unnecessary. Explaining orthographical rules, as this article's creator seemingly means to do, is not a dictionary's purpose.

Eal

Yes, I agree. Deleted. —RuakhTALK 17:05, 28 November 2009 (UTC)
I also found Citations:; earlier, which I found really odd. Mglovesfun (talk) 17:50, 28 November 2009 (UTC)
I completely disagree. People unfamiliar with a certain foreign language are likely to be unaware that an accent appears with one form and not on another, and remember that in some languages, accented letters are considered different letters entirely from their accentless counterparts. Where such a thing occurs, we should either have an entry for the accentless capitalized version, or a usage note on the page of the lemma. bd2412 T 15:29, 25 December 2009 (UTC)
I believe that in Greek accents are also optional (usually left out) on capital letters... but at any rate, I don't think diacritics are punctuation. — [ R·I·C ] opiaterein — 18:23, 28 November 2009 (UTC)

Christmas Competition 2009

This year's Christmas Competition is announced and is open to all contributors!
--EncycloPetey 07:47, 29 November 2009 (UTC)

Genitive and Swedish

The question may seem ridiculous, especially as coming from a native. But: Does Swedish have a genitive case, or at least, should we claim so? Why don't I know this? Well, when I started to add entries, I followed what layman knowledge I had, namely that possession is indicated by the genitive case. This is what is taught in school (at least when I went to school), this is what (most, afaik) other dictionaries state, this is what encyclopedias state (e.g. NE [6]), this is what textbooks claim even today, based on what I find on the net. But it has also been claimed that a more thorough analysis reveals that it isn't a case at all any longer, and should be considered as a possessive form, just as in English (see e.g. this discussion on sv:wp, and this change here on wikt). One motivation is that it is so vastly more common to write the phrase The Queen of England's men as Drottningen av Englands män than Drottningens av England män, even though the latter still is in some very limited use. (The recommendation is actually to use the former, which would correspond better to a "possessive" analysis, even from those who aren't ashamed to call the form genitive.)

So: should we follow what is the academically correct description (i.e. contrasting a "base form" to a "possessive form"), or the description which is vastly more common (i.e. "nominative" versus "genitive")? \Mike 10:54, 29 November 2009 (UTC)

Swedish is usually no longer considered to have a case system. In the possessive form of a noun, an "s" is appended, but this is nowadays used as a clitic particle, just like the English "'s", and not a case ending. Your example about "the Queen of England's men" is spot on, since your first example (Drottningen av Englands män) is overwhelmingly more common today. The second (Drottningens av England män) is seen very rarely and might even be considered wrong by many Swedish speakers. Some grammar texts still refer to this "s" as a case inflection though. This issue is discussed to some extent in the Swedish Wikipedia article on the genitive case. As a comparison, Norwegian and Danish have completely done away with the case systems and consider the "s" ending a clitic particle and not a case inflection, but Swedish has slightly more remnants of a case system. Freezer Twelve 13:53, 6 December 2009 (UTC)

Wiktionary:Phrasebook

I've made a start on this, but it's tough going. We probably need some discussion about CFI for the phrasebook versus "languages", as it were. Mglovesfun (talk) 15:55, 29 November 2009 (UTC)

Wiktionary:CFI - has something gone missing ?

I thought this page used to have some sort of banner on it directing people who wanted to contribute ideas for change to Wiktionary:Editable CFI. There is no banner now. Am I just imagining this, or does anyone else remember there being such a "banner" ? --Richardb 07:56, 30 November 2009 (UTC)

Well, according to the history page, CFI has not been edited since a month before the creation of Wiktionary:Editable CFI, so apparently there never was such a banner. --Yair rand 08:09, 30 November 2009 (UTC)
Ah, but you don't know how devious some administrators can be to cover their tracks. Of course I checked the history, and the visible deletes. Doh!--Richardb 08:55, 30 November 2009 (UTC)
If you want to make changes to CFI, bring it up here. Duh. — [ R·I·C ] opiaterein — 17:02, 30 November 2009 (UTC)
So, why do we even bother having the page Wiktionary:Editable CFI???. And anyway, your response does not at all answer the question. Richardb 12:24, 1 December 2009 (UTC)
You are just imagining it. I pointed you at Editable CFI from the beer parlour, maybe that's what you remember? Conrad.Irwin 12:54, 1 December 2009 (UTC)
OK. Seems a likely explanation for my memory. But then the question is, should we have a banner/pointer from CFI to editable CFI ? And if so, how the heck would we ever get that approved to be put up on CFI ?--Richardb 02:19, 6 December 2009 (UTC)

December 2009

Template help

Please see the discussion here. If you're good at wikisyntax, please help. --The New Mikemoral ♪♫ 01:22, 1 December 2009 (UTC)

done. Conrad.Irwin 12:36, 1 December 2009 (UTC)

User page - time zone

I would like to encourage that people put a time zone box on their user page, like the one that I have on my user page: "UTC+1". I find this useful for estimating the time frame of a response in a conversation with a Wiktionary editor. If you find it a poor idea, please let me know. --Dan Polansky 11:29, 1 December 2009 (UTC)

Yes, these are very useful, but don't they count as userboxes? We really need a policy page for what userboxes are allowed. (I think global account userboxes should also be allowed.) --Yair rand 20:20, 1 December 2009 (UTC)
That information is listed on Wiktionary:Administrators, though I see no reason to prohibit people from using a timezone box, perhaps {{user timezone}}? I thought we already had a global account userbox sneakily created by some visiting Wiki*edian, {{User unified login}} (you can tell it's an import by the uppercase first letter ;) Conrad.Irwin 00:21, 2 December 2009 (UTC)
Yes, the timezone box is a userbox. It seems a natural extension of the userboxes for the languages a user speaks. It does not fall into the group of advertising edit counts, political affiliation, and other practically useless information. In the page Wiktionary:Administrators, timezone is listed alongside the spoken languages as key information about a user.
It would already help if people had the timezone in a paragraph text rather than in a userbox. It is just that it seems to me that entering timezone on one's user page should be almost as much encouraged as entering spoken languages, and that it should be entered on a standardized location, much like the spoken languages. --Dan Polansky 07:40, 2 December 2009 (UTC)
We seem to be migrating toward an explicit policy on userboxes of the form: "Everything that is not strongly encouraged is forbidden." I personally think timezone of normal location would be useful as part of an extended Babel box. DCDuring TALK 10:55, 2 December 2009 (UTC)
We're not migrating towards that; it's been in print for a long time already. Our Wiktionary:User pages draft policy currently says user pages "must not contain any Wikipedia-style userboxes other than Babel templates, though specific userboxes may be allowed after discussion." That was kind of the community consensus when we set up the draft. However, there are a number of NPOV things like UTC boxes that I think ought not to be forbidden, and if we have a discussion allowing UTC boxes, script boxes (also useful, such as those who know Cyrillic or IPA), or the like, then we can use them without even having to redraft the draft policy. --EncycloPetey 03:46, 11 December 2009 (UTC)
Went ahead and made {{User time zone}}, which also allows one to specify whether their district observes Daylight Savings Time. --Bequw → ¢ • τ 15:15, 11 December 2009 (UTC)

Wiktionary talk:About French#Accents and ligatures

Comment if you have an opinion. Mglovesfun (talk) 12:17, 1 December 2009 (UTC)

Gathering opinion on "Is a deleted page more value than a redirect"

Example shotgun marriage.

Can't remember exactly, but I was looking at the entry shotgun wedding, then must have gone to wedding, then to marriage. In marriage there was a red link to shotgun marriage, but, naturally, no mention of shotgun wedding.

So I created the page shotgun marriage with just a redirect to shotgun wedding. So, a user who is looking up the phrase shotgun marriage would at least get the definition, etymology etc of shotgun wedding, instead of nothing.

However, an admin, not long after, deleted the page, so we were back to square one. A red link in marriage, and no clues for anyone looking up shotgun marriage.

So, my question is. Which "Adds more value" to Wiktionary?

  1. To have the redirect to shotgun wedding as the sole content of shotgun marriage, and thus no red link in marriage.
  2. To have the page deleted.

--Richardb 12:20, 1 December 2009 (UTC)

I know one argument for having the deleted page, i.e. no page, and a red link, may well be that some people try to track red links (thus shotgun marriage appears in pages User:Brian0918/Hotlist/S5 and User:Msh210/Duesentrieb/xjf). But can this not also be achieved by somehow having a list of pages that have REDIRECTS? --Richardb 12:20, 1 December 2009 (UTC)

Redirects are considered problematic because they don't give much information; we could quite easily add pronunciation, quotations, usage notes, anagrams etc. to alternative form entries. Having the redirects reduces the (already low) perceived need to create such entries. I would strongly encourage people to use the search box, and not edit urls directly; then the problem almost completely disappears (providing the alternative form is listed on the page with the definition) as the correct entry is in the search results. If people want to edit urls or follow broken links, then they should accept the slight degradation in behaviour that results. Conrad.Irwin 12:41, 1 December 2009 (UTC)
But that goes against laziness, which is to say that it is not user-friendly. There is no amount of exhortation on this page that will reach the bulk of unregistered users. Is there any way to have a default in which red-links are treated as searches, at least for the JS enabled. This might be accepted if our experienced contributors could opt out (I'd prefer to be able to opt out.). DCDuring TALK 14:38, 1 December 2009 (UTC)
So what should be done with all redirects left over from the Conversion script (e.g. Inflammation)? Many already have been deleted, and it seems odd to keep some around. Should they be cleaned up or left? --Bequw → ¢ • τ 15:26, 2 December 2009 (UTC)
The ones that were linked to from the main namespace were not removed. Special:WhatLinksHere/Inflammation. Conrad.Irwin 15:59, 2 December 2009 (UTC)
Usually I remove the links for the main space as I find them, then delete. Mglovesfun (talk) 18:52, 2 December 2009 (UTC)
But why?? How is that adding value, not taking value away? To me you are, in a sense, being a vandal, taking away value for no good reason. If you have a reason, what is the reason? How is that action "adding value"? --Richardb 23:47, 4 December 2009 (UTC)

Alphagrams

(from RFD)
Someone (or a bot) has systematically been adding alphagrams to pages (see doula for an example). I don't see any point in these. Why are they being included? They become all the more pointless when added to words like ab. — Paul G 17:48, 2 December 2009 (UTC)

Their non-wikiish style also causes contributors to place L4 translation sections after them, triggering rfc-structure tags. DCDuring TALK 18:47, 2 December 2009 (UTC)
Conrad.bot, no? Anyway, this is in the wrong place as you're not nominating the word Alphagrams for deletion. Hence I'll move to the Beer Parlor. Mglovesfun (talk) 18:50, 2 December 2009 (UTC)
It's all per Wiktionary:Votes/bt-2009-09/User:Conrad.Bot to do anagrams. Though the vote page doesn't explicitly mention alphagrams, they were part of the showcased edits. --Bequw → ¢ • τ 19:07, 2 December 2009 (UTC)
If there's a structure problem, that problem should be fixed, but the whole projects shouldn't just be abandoned. Mglovesfun (talk) 19:10, 2 December 2009 (UTC)
The problem is inherent in the use of a template. Imbecilic contributors see the template, can't imagine that it needs to be the last thing in the entry or that such a technical matter should be of concern to them, and insert an L4 translations header. (Imbecile that I am, I wasted a minute myself once on an erroneously placed translation section I added.) Then the bot marks it and I clean it up. It hasn't happened often but the departure from our policy against that kind of template has not turned out to be without cost. I'll likely vote against such a departure next time. DCDuring TALK 22:29, 2 December 2009 (UTC)
The templates have since been (mainly) replaced by real sections; I also learned the hard way. A list of the entries remaining in the wrong style is at [7], which I'll go through and correct manually if the bot doesn't catch them this time. Conrad.Irwin 23:06, 2 December 2009 (UTC)
I feel much better now. Really. DCDuring TALK 23:17, 2 December 2009 (UTC)
Alphagrams were in most anagrams sections before I started; I assumed the rationale is that people wanting to find anagrams might type the alphagram into the search box. If people want them excluded from the entry when the entry itself is the alphagram, that can be done. As with the indexes, I don't claim to know the best way to do anagrams, so if people want to suggest improvements, I'm all ears. Conrad.Irwin 23:06, 2 December 2009 (UTC)
As I'm for the idea I've launched the French speakers debate in parallel. JackPotte 18:20, 7 December 2009 (UTC)
Alphagrams are explicitly allowed within an Anagrams section by WT:ELE. As long as there aren't entries being created for meaningless alphagrams, I don't see a problem. --EncycloPetey 03:39, 11 December 2009 (UTC)
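For readers unfamiliar with the term: an alphagram is a word's letters rearranged into alphabetical order, so every anagram of a word shares the same alphagram. The following is only a minimal sketch of that idea in Python; the function names are hypothetical and this is not the code Conrad.Bot actually runs.

```python
from collections import defaultdict


def alphagram(word):
    """Return the letters of `word` in alphabetical order (lowercased)."""
    return "".join(sorted(word.lower()))


def group_anagrams(words):
    """Group words by alphagram, keeping only groups with 2+ members.

    Hypothetical helper for illustration: words that share an alphagram
    are anagrams of one another.
    """
    groups = defaultdict(list)
    for w in words:
        groups[alphagram(w)].append(w)
    return {key: members for key, members in groups.items() if len(members) > 1}


print(alphagram("doula"))
print(group_anagrams(["doula", "aloud", "ab"]))
```

Note that a word such as ab is already in alphabetical order, so it is its own alphagram, which is the case raised above where listing the alphagram adds nothing.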

Letter as Part of Speech

I'm curious as to why Letter is a Part-of-Speech subheading and I couldn't find it discussed anywhere. Thinking of the PoS of a letter is a bit odd because it's one of the few times when both the name for the thing and thing itself are in the dictionary (the name cee and thing c). A quick review of other dictionaries reveals that the Concise OED, Webster's, and Random House all classify as nouns both the letters and letter names (when included). Should we? Interestingly they all subdivide the letter definition differently. The OED keeps it unified, Random House has separate senses for 1) the letter and 2) the speech counterpart of the letter, and Webster's has subsenses for a) the letter, b) the glyph, and c) the speech counterpart. I'd lean towards using noun as, to me, the letterness (that it combines with others from the alphabet to form words) is part of its definition. What do others think? --Bequw → ¢ • τ 20:58, 2 December 2009 (UTC)

We treat letters proper (as opposed to the names for them) as translingual, which I think precludes using any real part of speech as the header, since real parts of speech are somewhat language-specific. (IIRC, EncycloPetey has mentioned that language names are adverbs in Slovenian — like, "I speak Englishly" to mean "I speak English".) —RuakhTALK 21:50, 2 December 2009 (UTC)
There are both Translingual and language-specific entries for the letters (e.g. a#Translingual and a#English). The Translingual ones use Letter and English ones usually use Letter but sometimes Noun. I think the English ones should all be Noun (other languages can decide for themselves). Translingual is trickier, but don't we treat the species names (e.g. E. coli) as Translingual nouns (or is there no consensus on that)? If there are some languages that use the Latin alphabet but don't label the letters as nouns, then maybe keeping Letter is fine. Does anyone have an example? If there aren't any, I think they could be changed to Noun as well. --Bequw → ¢ • τ 21:06, 3 December 2009 (UTC)
Translingual species names are Proper nouns. They are defined that way in the international codes that govern their usage. The problem with labelling all the Letter entries as Noun is that letters are not always used as nouns; they're most often used as letters within words. The only time they're used as nouns is when they are being mentioned rather than used. So, if we define "L" as a noun because of sentences like "The L is silent.", then we have to define "you" as a noun for sentences like "The you in that command is understood." and we have to define "the" as a noun for sentences like "I forgot to write a the in front of the noun." However, these are all mentions, not uses. On Wiktionary, we normally do not group entries in part of speech by their mentions, but by their usages. --EncycloPetey 16:06, 5 December 2009 (UTC)
Though I think of the abstract idea of a letter as a thing, you're right that we have to base definitions on usage. So if not noun, would it make sense to label them as Symbols with "letter" as part of the definitions? It would be analogous to saying the period (.) was a Symbol with "punctuation mark" as part of the definition. This could be used not just for English, but also Translingual. --Bequw → ¢ • τ 23:23, 5 December 2009 (UTC)
I'm not strongly attached to either system, but I think there is a very important and useful "grammatical" distinction between a Letter and Symbol. A letter is inherently a part of a word, where a Symbol can represent a sound, word, phrase, or idea. The function of a letter is thus specific, while a symbol can have any of a wide variety of functions. Since the English language is one written in letters, and we are the English Wiktionary, I think that makes the distinction even more useful. If I come across an entry labelled as "Letter" for its POS, then I know immediately a great deal about what it is. If I come across a "Symbol" then I expect to have to do a lot more reading to figure out what I've got. Were this the Mandarin Wiktionary, I'm not so sure that distinction would mean as much, but I think it does for a Wiktionary in a language that uses an alphabet. --EncycloPetey 03:37, 11 December 2009 (UTC)

(unindent) I see the distinction you're making. I wouldn't necessarily call it "grammatical", though, but more "syntactic" since it's about how to compose units of meaning, not how those units get fit together in a grammatically correct phrase. As you mention, the distinction between letter and non-letter characters is a bit fuzzy. Even in Latin-script languages you have words that contain ( / ), ( ' ), and ( - ) (plus some more rare ones). Both letter and w:Letter classify them as "Symbols". Certain classes of symbols have more rigid rules on their usage. Letters have many, but also mathematical and punctuation symbols do as well. I'd still classify them as all symbols (since nouns are out:) More importantly, though, they should at least be consistent. --Bequw → ¢ • τ 15:46, 11 December 2009 (UTC)

Template:nonstandard spelling of

See Wiktionary talk:About French. Seemed like the quickest, best solution. Should be used for other languages, like we have {{obsolete spelling of}} and others. I actually thought this might already exist, but I can't find it. Mglovesfun (talk) 21:30, 3 December 2009 (UTC)

See {{form of}}, which allows some flexibility, valuable before standard wording is settled on. It allows more precise and less (or more) pejorative wording: "Obsolete form of", "Typographical variant", "Pedantic variant of", etc. DCDuring TALK 22:22, 3 December 2009 (UTC)

Chinese categories

At WT:RFDO#Category:yue-hk:Verbs Carolina Wren brought up renaming the Chinese categories, and I thought it was a good idea. Her basic idea was to:

  1. Use the language name for PoS categories:
    • zh-cn:Nouns → Mandarin (Simplified) Nouns.  zh-tw:Nouns → Mandarin (Traditional) Nouns.  zh:Nouns → Mandarin (Pinyin) Nouns.  Mandarin Nouns - stays the same (already contains all scripts) and would be the parent of the previous three.
  2. Replace the country code extensions (-cn, -tw/-hk) with the proper script codes (-Hans and -Hant) for the remaining topical categories:
    • zh-cn:All topics → zh-Hans:All topics.  zh-tw:All topics → zh-Hant:All topics.  zh:All topics → zh-Latn:All topics.  Presumably there'd be a new zh:All topics that was the parent of all three.
    2a) An alternative here would be to note the script in the name.
    • zh-cn:All topics → zh:(Simplified) All topics. etc.

Is the wording clear on (2a)? If so, that would be preferable since it keeps the prefix for only the language code, but the wording would have to be clear. Whichever is chosen, though, I think these changes would be an improvement. Hopefully they would make the categories easier to understand for casual editors. How do others feel? Are there any problems with these types of renamings? --Bequw → ¢ • τ 21:58, 3 December 2009 (UTC)

As one of the half a dozen or so active Chinese contributors here I feel like I should give some input. Please forgive my lack of understanding of scripts and such, but I will try my best to grasp what it is that is being proposed.
Firstly, to be honest, I don't really see the point of changing "Zh-cn" to "Mandarin (Simplified)". To anyone who speaks Mandarin, the phrasing is extremely awkward. "Mandarin" (either taken as the group of Northern Chinese dialects OR Standard Mandarin Chinese) was never "simplified"; the script it was written in was. No one (certainly no one I know or in anything I have read) refers to the Simplified script as "Mandarin (Simplified)" or any variation of that. It just sounds... bizarre to me.
Secondly, what are these "Hant" and "Hans" script codes you speak of and why are they considered "proper"? No, really, I have no idea! And why should they be considered "preferable"? The wording is no clearer than "-cn", "-tw" and "-hk". I think most Chinese speakers could guess what the latter mean at first glance. "Hant" and "Hans" though - again - I have to say, "WTF"?
Lastly, I would like to ask, are any of the people proposing these changes Chinese speakers themselves? Given the awkward rephrasing they are proposing, my guess would be no? I don't mean to be flippant, but seriously, if any kind of major reform of Chinese is going to happen on wiktionary, IMO it should be done by (or in co-ordination with) the people who actually have some knowledge of what is being discussed. This is not to diss anyone, I'm just concerned that someone will change all this complicated script stuff and leave us poor (and less IT-capable) editors to pick up the pieces. Tooironic 07:42, 4 December 2009 (UTC)
I second Tooironic's opinion. --Anatoli 08:31, 4 December 2009 (UTC)
I don't mean to sound unilateral at all. No change should be made without the agreement of the active Chinese contributors. I bring the matter up on WT:BP, rather than on WT:AZH, however, because the goal is to find a structure that will enable casual editors to improve Chinese entries/categories. I and at least a few others steer clear of Chinese entries/categories because the categorization is so different than all other languages here on wiktionary. Chinese categories need not be structured exactly the same as other languages, but I hope there's room for improvement. Let me explain the reasoning behind these changes, and hopefully someone will be able to find better solutions than the ones I proposed. The basic confusion I see arises from the implementation of the (good and proper) desire to have per-script categories for simplified, traditional, and romanized characters. Here's what I think is confusing to some editors:
-cn,-tw/-hk codes: These ISO 3166 (country) codes are currently used to distinguish categories for simplified and traditional characters (Category:yue-cn:Nouns for simplified characters vs. Category:yue-hk:Nouns for traditional characters).
  1. These give unfair emphasis to specific countries. Traditional characters aren't just used in Taiwan and Hong Kong, but in Macau and many overseas communities. Likewise, simplified characters are used not just in (mainland) China, but Malaysia and Indonesia. Additionally, it completely ignores historical differences within those countries.
  2. It's not obvious that the lack of extension stands for the category of the usual romanization (Category:yue:Insects is for Jyutping romanization, while Category:yue-cn:Insects is for simplified and Category:yue-hk:Insects is for traditional characters).
I think it would be better to switch to categorization that is somehow explicitly by script. Hant and Hans are the ISO 15924 script codes for Han (Traditional variant) and Han (Simplified variant) respectively. If that is too obscure, an alternative option would be writing the script out. If my wording choice was awkward, perhaps you may provide a better one.
Organization between the script-differing categories.
  1. Currently the name of the Part-of-Speech category that holds all scripts is much different than the ones that contain single scripts (Category:Cantonese adjectives which contains all three vs Category:yue:Adjectives, Category:yue-cn:Adjectives, and Category:yue-hk:Adjectives). Additionally, with the language prefix, these latter three confuse people into thinking they are topical categories about "Adjectives", like Category:ja:Adjectives.
  2. There's no category links (or hierarchy) between the pan-script and single-script categories.
  3. There's no topical category designated to be for all scripts like there is on the PoS side (since the extensionless form is for romanizations only).
There is no reason we can't have consistent names for both pan-script and single-script categories. If my wording was awkward, could you suggest alternatives (that hopefully could allow for pan-script topical categories)? It would seem to me, as one who does much cleanup, that the pan-script categories should be parents of the single-script categories. Does that seem logical?
I do hope that something can be improved. And I would be willing to help in any way possible. Hopefully people won't TLDR this:) --Bequw → ¢ • τ 17:10, 4 December 2009 (UTC)
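To make option (2) from the original proposal concrete, here is a rough Python sketch of the renaming rule: the country-code extension is swapped for the ISO 15924 script code, and a name with no extension is treated as the romanization category. The function name and the table are hypothetical illustrations, and the sketch assumes category names are passed without the "Category:" namespace prefix; it is not an existing bot or tool.

```python
# Country-code extensions currently in use, mapped to ISO 15924 script codes
# (Hans = Han Simplified, Hant = Han Traditional), as described above.
COUNTRY_TO_SCRIPT = {"cn": "Hans", "tw": "Hant", "hk": "Hant"}


def to_script_category(name):
    """Rename e.g. 'yue-cn:Nouns' to 'yue-Hans:Nouns'.

    Hypothetical helper: expects '<lang>[-<country>]:<topic>' with no
    'Category:' prefix. A missing extension is taken to mean the
    romanized (Latn) category.
    """
    prefix, sep, topic = name.partition(":")
    if not sep:
        raise ValueError(f"no language prefix in {name!r}")
    lang, _, region = prefix.partition("-")
    if region == "":
        script = "Latn"
    elif region in COUNTRY_TO_SCRIPT:
        script = COUNTRY_TO_SCRIPT[region]
    else:
        raise ValueError(f"unknown extension {region!r} in {name!r}")
    return f"{lang}-{script}:{topic}"


for old in ["zh-cn:All topics", "zh-tw:All topics", "zh:All topics"]:
    print(old, "->", to_script_category(old))
```

Because -tw and -hk both map to Hant, the rule removes the country distinction that point 1 above objects to.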
Thank you for your detailed explanation. Very helpful, though my head is still spinning :P What is "pan-script"?
I do get what you mean though about the pinyin and character entries getting mixed in together. This is something I raised a few months ago but no action was taken.
Let's take nouns and one possible way to categorise them:
  • Mandarin Nouns (Simplified Script)
  • Mandarin Nouns (Traditional Script)
  • Mandarin Nouns (Pinyin Script)
  • Mandarin Nouns (Wade-Giles Script) (do we even have these?)
  • Mandarin Nouns (Yale Script) (do we even have these?)
  • Mandarin Nouns (Bopomofo Script) (do we even have these?)
Note, I can only speak for my own dialect, Mandarin. Cantonese and Min Nan speakers on wiktionary will have to devise their own category names. (That is, I don't know what romanisation schemes are used for those dialects, nor how to categorise them.) Anyway, I think these categories would prevent that whole country-bias thing you were talking about (.cn, .tw, .hk), plus make it a lot easier for casual users to understand. However, when writing the actual script, we could simply use abbreviations to make our job easier:
  • Mandarin Nouns (Simplified Script) → Category:Mans:Nouns
  • Mandarin Nouns (Traditional Script) → Category:Mant:Nouns
  • Mandarin Nouns (Pinyin Script) → Category:Manp:Nouns
  • Mandarin Nouns (Wade-Giles Script) → Category:Manw:Nouns
  • Mandarin Nouns (Yale Script) → Category:Many:Nouns
  • Mandarin Nouns (Bopomofo Script) → Category:Manb:Nouns
As for vocab categories, these could be displayed like this:
  • Category:Mandarin Nouns (Simplified Script):Insects
  • Category:Mandarin Nouns (Traditional Script):Insects
  • Category:Mandarin Nouns (Pinyin Script):Insects
  • Category:Mandarin Nouns (Wade-Giles Script):Insects
  • Category:Mandarin Nouns (Yale Script):Insects
  • Category:Mandarin Nouns (Bopomofo Script):Insects
But written as script (again to make our lives easier) like this:
  • Category:Mans:Insects
  • Category:Mant:Insects
  • Category:Manp:Insects
  • Category:Manw:Insects
  • Category:Many:Insects
  • Category:Manb:Insects
Are the programming whizzes able to do this? Do let me know if I've overlooked something, this whole scripting thing does my head in. Tooironic 00:13, 5 December 2009 (UTC)
I was asked to chime in on this, so here goes. First, some points upon which we probably all agree:
  1. Any modification should be more intuitive than the current arrangement, not less.
  2. Any modification should respect the preferences of the Chinese speakers who are actually creating new Chinese entries.
The following represents my own opinion on the subject:
  1. I don't like categories that include parentheses.
  2. I second Tooironic's opinion that Hant and Hans are not an improvement over zh-tw and zh-cn, in terms of user friendliness.
  3. If I were going to change the current scheme, and I'm not convinced that it would be worth it at this point, I would probably do something like:
    Category:Mandarin Nouns in Simplified script
    Category: Mandarin Nouns in Traditional script
    Category: Mandarin Nouns in Pinyin script
    Category: Min Nan Nouns in POJ script
The one thing that I don't like about spelling out the categories is that the categories become excessively wordy. On the other hand, I don't think replacing one abbreviation for another necessarily helps anything, and may actually turn out to be a step backwards. The argument for keeping zh-tw and zh-cn would be that, rightly or wrongly, their usage is long established in the computer world. In short, I think we should leave it like it is, unless we can come up with a much better scheme. Otherwise, it could mean a lot of work for a very minor gain. -- A-cai 12:58, 5 December 2009 (UTC)
P.S. Are we also proposing to change "zh-tw:Fruits" and "zh-cn:Christianity" to "Mandarin Fruits in Traditional script" and "Mandarin Christianity in Simplified Chinese"? If so, that just sounds weird to me :) -- A-cai 13:05, 5 December 2009 (UTC)
Yeah I totally get what you mean. I think some people forget how far behind we are in Chinese entries on wiktionary (despite our hard work to this date). I mean I just created an entry for "rectangle" (長方形) yesterday! I'm afraid that all this stuffing around with scripts and whatnot might just be a big waste of time, and create more issues than it fixes in the long run. Tooironic 21:39, 5 December 2009 (UTC)

(unindent)@Tooironic. Thank you for posting suggestions. By "pan-script" I meant categories like Category:Mandarin nouns that contain entries written in all three scripts: Simplified, Traditional, and Latin (all the romanization schemes are transliterations into Latin characters so it's technically one script for all which is coded as Latn). Per WT:AZH, we currently only have entries for the main romanization scheme for each language (so we have Pinyin for Mandarin but not the others that you mention). For the topical categories, we shouldn't invent script/language codes (such as Manp, Manw, Manb) so I don't think that's the route to take.

@A-cai, zh-cn et al. are localization language codes (see w:Language localization#Language tags and codes). They specify the country (whose code is correctly capitalized but still sometimes lowercased) to target a dialect (e.g. en-GB = "British English"), not to differentiate scripts like we are trying to do here. (We usually note dialect differences on the definition line.) So there's no reason to use them for this purpose here.

I think A-cai's idea for the PoS category naming is good. It's both consistent across the different Chinese script categories and it's similar to the PoS categories of other languages. If desired, a parent category (eg Category:Mandarin Nouns) could be used to relate the script-specific categories together and possibly also to contain entries in all script forms.

Is there something we can do for the topical categories? Following A-cai's preference (which is reasonable) away from codes and parentheses, would a suitable replacement for Category:yue-cn:Insects be something like Category:yue:Insects in Simplified script? This type of naming (which could be changed a bit if it sounds awkward) would fix all the confusions that I listed above. --Bequw → ¢ • τ 00:20, 6 December 2009 (UTC)
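As an illustration of the spelled-out naming suggested here, the following Python sketch turns a current single-script topical category into the "in ... script" form and also returns the pan-script parent it would sit under. The function name and table are hypothetical, only the character-script extensions are handled (the name for the romanization categories is not settled in this discussion), and names are assumed to come without the "Category:" prefix.

```python
# Extensions for the two character scripts, spelled out as in the suggestion.
SCRIPT_NAMES = {"cn": "Simplified", "tw": "Traditional", "hk": "Traditional"}


def spelled_out_topical(name):
    """Map 'yue-cn:Insects' to its spelled-out name and pan-script parent.

    Hypothetical helper: returns a (new_name, parent_name) pair, e.g.
    ('yue:Insects in Simplified script', 'yue:Insects').
    """
    prefix, sep, topic = name.partition(":")
    lang, _, region = prefix.partition("-")
    if not sep or region not in SCRIPT_NAMES:
        raise ValueError(f"not a single-script topical category: {name!r}")
    parent = f"{lang}:{topic}"
    return f"{parent} in {SCRIPT_NAMES[region]} script", parent


print(spelled_out_topical("yue-cn:Insects"))
print(spelled_out_topical("yue-hk:Insects"))
```

The parent name keeps the bare language-code prefix, so it stays consistent with topical categories in other languages such as Category:ja:Adjectives.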

Per your suggestions about Category:yue:Insects in Simplified script, why not do all of them that way? For example, we could have Category:cmn:Nouns in Simplified script and Category:cmn:Insects in Simplified script? The "pan-script" categories, if you still want them, would be Category:cmn:Nouns and Category:cmn:Insects. -- A-cai 13:05, 6 December 2009 (UTC)
Because using the language code as the prefix for only the topical categories distinguishes them from Part-of-Speech categories. It allows for a distinction between a category of words that are nouns ("dog", "house") and a category of words about nouns ("uncountable", "collective noun"). See for instance the difference between Category:Japanese adjectives and Category:ja:Adjectives. Additionally, since this is the system used wiktionary-wide it would help other users edit Chinese categories. --Bequw → ¢ • τ 18:00, 6 December 2009 (UTC)
Fair point. In that case, allowing for your argument, why not have Category:cmn:Mandarin Nouns in Simplified Chinese, Category:cmn:Words about Nouns in Simplified Chinese and Category:cmn:Insects in Simplified Chinese? Simply having a category called Category:Nouns for English words about nouns does not strike me as being all that intuitive to the average user (and has caused much confusion and debate, over the last few years). Shouldn't we call it what it is, i.e. Category:Words about Nouns? -- A-cai 18:27, 6 December 2009 (UTC)
I agree that the naming isn't always obvious, but changing the policy Wiktionary-wide is a separate discussion. In the long run it's most useful to have consistency among languages. Towards that end, were my previous suggestions natural-enough sounding? --Bequw → ¢ • τ 00:06, 7 December 2009 (UTC)
They seem reasonable to me, but this is something that we may need to bring to a vote before proceeding. -- A-cai 12:08, 8 December 2009 (UTC)
Wouldn't hurt. I'll set it up and then post announcements. --Bequw → ¢ • τ 15:23, 10 December 2009 (UTC)
Created Wiktionary:Votes/2009-12/Chinese categories. It won't start for a week. Please note corrections or changes. With regards to case, should it be "simplified script", "Simplified script", or "Simplified Script"? I see mixed usage across the internet. --Bequw → ¢ • τ 16:34, 11 December 2009 (UTC)
That's a good question. Now that I think of it, perhaps lower case would be better, since "simplified script" isn't really a proper noun. Also, just to clarify, the proposal is to change all Chinese dialects to this new format, correct? -- A-cai 00:43, 12 December 2009 (UTC)

Beer Parlour

I don't know if this has been bothering anyone else, but the Beer Parlour is most likely more than twice as used as the next largest discussion room. A ridiculous amount of stuff goes in here, and nothing stays near the bottom for more than a day or two. In an attempt to fix this situation, I propose that the Beer Parlour be split into two separate discussion rooms. To summarize the stuff that the Beer Parlour is currently used for:

  • Proposals for new policies
  • Proposals to change current policies
  • Proposals to start/change anything else
  • Bot requests
  • Discussions about current policies, and how to interpret them.
  • Discussions on the current practices.
  • Discussions on anything that affects Wiktionary as a whole (main page, logo, etc.)
  • Requests for permissions. (AWB, etc.)
  • Announcements, notices...
  • "Random stuff" (meaning everything else), which occupies a very large portion of the discussions.

Now, I would be okay with any restructure, but what I think would be best would be to split the first four of those items into a new discussion room, which would be set up with subpages, and the rest to stay in the Beer Parlour.

Comments? --Yair rand 00:20, 4 December 2009 (UTC)

How much could be off-loaded to language-specific discussions? In particular items that are English-specific. I understand that there are reasons to get cross-language expertise involved in languages that have few active contributors, but English has the least problem in that regard. I would expect that much of the "random stuff" is arguably language-specific. DCDuring TALK 00:37, 4 December 2009 (UTC)
The English stuff could go at Wiktionary_talk:About_English. --Bequw → ¢ • τ 17:11, 4 December 2009 (UTC)
I have just inserted "Move to WT:AEN" at the entries that seemed to have principally English related content. There were not very many. Perhaps some other items could be moved to English, but they potentially have implications for other languages. Such subject areas include "translations" and RfV. By policy only English sections have translations. I haven't noticed too many RfVs for non-English terms. I don't recollect whether that is by policy. Including these would at least double the amount that WT:AEN would take. DCDuring TALK 17:45, 4 December 2009 (UTC)
Should Wikisaurus discussions fall under AEN? Technically they are only English, but AEN doesn't seem to be the best place. Maybe they should go to Wiktionary talk:Wikisaurus. --Yair rand 17:51, 4 December 2009 (UTC)
I had assumed if the discussion was here, there was a desire for a broader audience. I hope that WT:AEN could be broad enough. DCDuring TALK 18:02, 4 December 2009 (UTC)
Well, I don't really care about the amount of stuff in BP, but one thing I wish we would do is have subpages for discussions, like we have on the voting pages. It'd make tracking and contributing to your preferred discussions A WHOLE SHITLOAD easier. — [ R·I·C ] opiaterein — 17:39, 4 December 2009 (UTC)
That might be a big help as soon as a discussion passed a threshold level, probably well before someone would have felt compelled to add an arbitrary edit link. Some of the most voluminous discussions would not readily belong on any other root discussion page. I'm not so sure it should be done by default. It would be handy also if such a move automatically added the page to the watchlist of all (registered users ?) who had participated in the discussion or even to all admins or all registered users who request or don't opt out of such treatment. DCDuring TALK 17:59, 4 December 2009 (UTC)
I quite like the way Beer Parlour works now - there are already several forums to watch, including Tea Room, RFD, RFDO, and RFV. Following the individual threads in Beer Parlour is a bit harder, but I don't see how manageable the alternative of having one subpage per thread would be. There is already an established process of archiving BP, which works quite well thanks to dedicated archivers. OTOH, maybe watching all the currently running discussions in subpages could be done using the "related changes" function, given all the subpages would be linked to or transcluded in Beer Parlour. I don't like the option of creating several subforums managed in a way similar to the current BP, with threads being directly in the discussion page.
What could be created, however, is something identified clearly as a non-binding chatroom. Any old rant could be posted to the chatroom. The room would not be a problem-solving one but rather a digressive off-topic one. Right now, none of the forums is setup for the purpose, each focused on solving different kinds of problems. Once the discussion would grow serious, and would clearly pertain to policy making, it could be brought to "Beer Parlour". One of the problems is that the name "Beer Parlour" suggests that Beer Parlour is such a chatroom. The chatroom could be called "Ramble bar", or not. The content of the chatroom could be of IRC quality. Once the chatroom would be there, any posting to Beer Parlour that does not relate to policy making could be quickly redirected to Ramble bar. The present posting could, too, at least in part, be redirected to Ramble bar ;). --Dan Polansky 18:11, 4 December 2009 (UTC)
I dunno. It seems like too many proposals to start something new just die when they get too high up in the beer parlour. (Whatever happened to the Portals discussion?) If these were moved to a separate subpage-based room it seems more likely that they would actually succeed. --Yair rand 18:27, 4 December 2009 (UTC)
Many of my trial balloons have turned out to be leaden. I don't think that is a fault of BP layout. DCDuring TALK 18:40, 4 December 2009 (UTC)
The portals proposal generated very little response. I personally was not interested in that proposal. If someone wants to start something, he has to show he is able to actually build it instead of only proposing it. He has to create a model portal as a showcase in his user space or something of the sort. But even that may be insufficient. In any case, a proposal needs to be backed with energy to work on it and bring it to a useful minimal state. If someone else responds positively to the proposal, the proposal can gain traction. But even when the proposal does have a dedicated page, it can grow stale. A case in point is Wiktionary:Picture dictionary. It got started by an enthusiastic person who stayed with the proposal only shortly. --Dan Polansky 18:53, 4 December 2009 (UTC)
I personally find myself forgetting the names of discussions and not wanting to take the time to find them. Subpages = better. — [ R·I·C ] opiaterein — 18:47, 4 December 2009 (UTC)

I have personally long argued that "discussion pages" are where most of the discussions should take place, with just an "advert" placed in BP. If we are discussing CFI, let's do it at the CFI discussion page, if discussing Wikisaurus, at a Wikisaurus discussion page. My analogy is this - Beer Parlour is like a rowdy pub. Everyone gets a bit overheated on a topic for a short while, then forgets about it when the next topic comes along. Meanwhile, the real long term, careful work should be done in the backroom committee rooms. Anyone from the Bar can call into the committee room to make their contribution, but can also first easily inform themselves of all the past discussion recorded in the discussion room, not scattered over many BP archives. Every so often someone can go to the bar and let everyone know the discussion is on, if they are interested. In addition, a category such as "Active WT Discussion" could be used to identify all these discussion rooms, so you wouldn't have to remember where they all are.--Richardb 01:53, 6 December 2009 (UTC)

It would certainly take out a huge chunk of the discussions were the ELE, CFI, and Wikisaurus pages to handle discussion about them, but we still have a huge amount going through here. Basically, we have three things left that go here which should really be split: new proposals, bot requests, and general discussions that can't go anywhere else. If we were to split proposals and bot requests into a new room and move discussions that already have pages for them, the Beer Parlour would become a usable size. The "Active WT Discussion" idea sounds great if possible. --Yair rand 02:56, 6 December 2009 (UTC)
One other thing I just noticed: We could also move some of the stuff about discussion pages themselves to the talk pages of those discussion pages. Which means we could move this discussion to Wiktionary talk:Beer parlour. --Yair rand 03:07, 6 December 2009 (UTC)
Would anyone object if I were to move this discussion to Wiktionary talk:Beer parlour? --Yair rand 04:29, 7 December 2009 (UTC)
That sounds a bit like "which came first, the chicken or the egg". Eventually, yes, it might make sense. But not until the question is resolved, and implemented. But I'm not sure much would be achieved for this one special instance. Beer Parlour already is a discussion page. Do we need a discussion page for a discussion page? In this case, I'd say don't jump to doing it until the principle is established and implemented. --Richardb 09:25, 7 December 2009 (UTC)
Plus - be a bit patient. Don't rush this before you have enough people having viewed what you are intending.--Richardb 09:25, 7 December 2009 (UTC)
If this was only about the beer parlour, then it should be at Talk:Beer parlour, but it's not, it's about all discussion rooms. I don't see that splitting off bot proposals would do anything, we get maybe 1 every few months (we could always put them on WT:GP as opposed to inventing somewhere new). Policy proposals are "supposed" to be here, read the heading. A much better solution to this problem is archiving discussions better, I am of the opinion that we should archive them all to [[Wiktionary:Beer parlour/topic heading]] and (but only if someone is keen enough) to keep a log of discussions by date. The other thing that does not help is that people start lots of very similar discussions in multiple sections on this page, instead of going back and re-opening old discussions. Move them to the bottom again if you want to give them more attention. Conrad.Irwin 14:59, 7 December 2009 (UTC)

Javascript archiving

Administrators can enable "Add Archive links to Beer Parlour sections" at WT:PREFS (after clearing your cache (ctrl+shift+F5)) or by adding importScript('User:Conrad.Irwin/beerArchiver.js'); to your monobook.js. Simply click the "Archive" button, click "OK", and wait. Comments, bugs, improvements, please let me know. Conrad.Irwin 03:04, 10 December 2009 (UTC)

Wow, this will certainly be very helpful. Just one question: why does it archive to Wiktionary:Beer parlour archive/Month Year instead of Wiktionary:Beer parlour archive/Year/Month? --Yair rand 03:33, 10 December 2009 (UTC)

Categories for morphemes: un/productive

Is the productivity or non-productivity of a morpheme in a language stable enough to make for a worthwhile set of categories such as Category:English productive suffixes and Category:English nonproductive suffixes? It would have some value in helping with the creation of morphology-type etymology sections for some of our derived terms. It would also be useful for detecting certain errors in etymology (claiming modern English derivation for terms borrowed mostly formed from other languages).

One problem is that only some etymologies/senses are productive for some affixes (eg, -ly, -s). This is symptomatic of the more general problem that a category is a language-entry attribute, rather than an etymology-language-entry attribute or a sense-language-entry attribute. Another problem is the intentional formation of archaic words of limited, though sometimes attestable, use (eg, be- ?). Thoughts? DCDuring TALK 00:28, 4 December 2009 (UTC)

Hi DC – it definitely seems worthwhile and legit to distinguish between -ly (regular adverb) formation on the one hand, and be- or -en (plural) on the other, and AFAICT this is pretty clear and stable, outside of conscious archaisms such as boxen.
One may distinguish some of these categories, as in Category:English plurals ending in "-en" (which could use some subcategories for “men” and “children” – you can hardly find the oxen in it), and one could meaningfully use Category:English adverbs suffixed with -s, to incorporate POS in the category name.
—Nils von Barth (nbarth) (talk) 08:23, 14 December 2009 (UTC)

The doublewiki on a single page Wikisource extension

Utilisation example on Wikisource: to my mind we should install it and synchronise the paragraph order of all the Wiktionaries to make it clearer. JackPotte 12:48, 5 December 2009 (UTC)

French version. JackPotte 12:51, 5 December 2009 (UTC)
Expansion/explication by Amgine: The DoubleWiki Extension adds a small linked '⇔' symbol next to interwiki links. If clicked, the current page you are viewing is displayed side-by-side with that of the other language. An example of the extension in use is Engels & Marx's Introduction of the Communist Manifesto in English and Greek. Currently all namespaces of all Wikisource languages have the extension enabled. URLs add a parameter "match=[lang code]".
The extension can enforce exact alignment. For example, language headers can be aligned across Wiktionaries allowing readers to quickly find and compare translations or shared terms. This will require cross-language cooperation to insure a single standard for aligning marks is used. The extension is otherwise atomic, assuming the articles on two separate wikis have exactly the same elements and order of elements and aligning them as they appear. - Amgine/talk 17:13, 5 December 2009 (UTC)
Doesn't seem to have any real advantage over just clicking on the interwiki and then clicking the 'back' arrow. Mglovesfun (talk) 20:43, 5 December 2009 (UTC)
But it would if a user had to compare more than one thing. Side-by-side would be easier than clicking 'iwiki link', 'back', 'forward', 'back', etc. Not sure if this is currently common, but it could be. Imagine trying to see if to#English had the same basic separation of senses on two different wiktionaries. --Bequw → ¢ • τ 21:00, 5 December 2009 (UTC)
Not that useful, you can open multiple windows, then again, not intrusive, so whatever people want. I would personally much prefer we got the transliterator extension that we've been waiting for since August. Conrad.Irwin 00:06, 6 December 2009 (UTC)
"This will require cross-language cooperation to insure a single standard for aligning marks is used" - that seems like a very heavy cost for something that seems to have little use or support. I think you might say "biting off more than you can chew". Perhaps it needs the next version development of Wiktionary. What happended to that ?--Richardb 02:02, 6 December 2009 (UTC)
I'm not familiar with all this stuff, but wouldn't the standard aligning marks be headers? Almost all languages use the same basic format of ==Language== (sometimes with CSS) followed by definitions. Bawolff 20:57, 7 December 2009 (UTC)
Amgine, does that mean that this can only be used for single-language entries, and other entries where the order of languages happens to be the same between two Wiktionaries? —RuakhTALK 21:40, 7 December 2009 (UTC)
Sorry about the delay in response. No, the current implementation defaults to atomic alignment - each block element will be aligned sequentially. The cool element is that, with a tiny invisible div tag, the language headers can be aligned, and the parts of speech, and the etymologies, and so on. The problem is starting the process of adding these small pieces. For en.Wiktionary it would be easy to do with Autoformat bot. It would likely be a bit more challenging to add such standardized tags on every other Wiktionary, but by no means insurmountable. - Amgine/talk 05:28, 12 December 2009 (UTC)
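To make the difference between "atomic" alignment and marker-based alignment concrete, here is a minimal Python sketch. It is not the DoubleWiki extension's actual algorithm; the function names, the (mark, text) block representation, and the choice to follow the left page's section order are all assumptions made purely for illustration.

```python
from itertools import zip_longest


def align_atomic(left, right):
    """Pair block texts strictly by position, padding the shorter side with ''."""
    return list(zip_longest(left, right, fillvalue=""))


def split_at_marks(blocks):
    """Group (mark, text) blocks into sections; a block with a mark starts a new section."""
    sections = []  # list of (mark, [texts])
    for mark, text in blocks:
        if mark is not None or not sections:
            sections.append((mark, []))
        sections[-1][1].append(text)
    return sections


def align_marked(left, right):
    """Align sections sharing the same mark (hypothetical stand-in for the invisible div tags).

    Sections follow the left page's order; sections found only on the
    right page are appended at the end, facing blanks.
    """
    right_sections = split_at_marks(right)
    right_by_mark = {mark: texts for mark, texts in right_sections}
    rows, used = [], set()
    for mark, texts in split_at_marks(left):
        used.add(mark)
        rows.extend(align_atomic(texts, right_by_mark.get(mark, [])))
    for mark, texts in right_sections:
        if mark not in used:
            rows.extend(align_atomic([], texts))
    return rows


# Two invented pages whose language sections appear in a different order.
left = [("en", "==English=="), (None, "to (preposition)"),
        ("fr", "==French=="), (None, "tôt (adverb)")]
right = [("fr", "==Français=="), (None, "tôt (adverbe)"),
         ("en", "==Anglais=="), (None, "to (préposition)")]

for row in align_marked(left, right):
    print(row)
```

With purely positional pairing, "==English==" would sit opposite "==Français=="; with shared marks, each language section faces its counterpart even when the two pages order their languages differently.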
A point I forgot to mention: Although the comparison page shows the local article plus one other interwikied article, the edit link is only to the local page, so one can be a single back/forward click from the editor. Of course, one of the primary consumers of this would be readers as well as translators; the ability to read a more expansive native definition as well as a localized translation is often helpful. - Amgine/talk 16:34, 12 December 2009 (UTC)
For the concrete question, going to WT:PREFS and checking the box that reads:
Show an interwiki link under the language heading when one exists in the sidebar.
…yields the very simple and practical linking of Wiktionary entry in a foreign language to the corresponding entry in the foreign language Wiktionary – this is simple, low-tech, and works well for certain purposes.
The larger goal of “synchronizing all Wiktionaries” is a worthy goal, but rather better suited to a database backed dictionary, notably OmegaWiki, no? (Manually lining up paragraphs seems…baroque?)
—Nils von Barth (nbarth) (talk) 08:38, 14 December 2009 (UTC)

Template:bird (et al.)

Stuff like {{flower}} and {{fish}} that I can think of. Instead of proposing them for deletion, can't we just change the head word to (ornithology), (botany) and (marine life) or something like that? We still have 700 pages using bird, but it's failed RFDO. Mglovesfun (talk) 14:32, 5 December 2009 (UTC)

No we can't, since these words are not restricted to a particular field of science but used in everyday life. So these must be deleted by hand (not doable by bot either) -- Prince Kassad 15:27, 5 December 2009 (UTC)
I oppose deletion of these templates, and I oppose their removal from entries. They can be turned into category templates, such ones that are visible depending upon user customization while hidden by default. That is, these templates would be placed exactly where restrained-context templates are, would categorize the entry as they currently do, and would be invisible by default using CSS, but would have a CSS class "category-template" that makes it possible for a user like me to make them visible by placing appropriate CSS code to my custom monobook.css. On categorization, the template "bird" should categorize (as it does) an entry into the category of birds, which is a topical category rather than a restrained-context category, as follows from its name.
We should not change the showed label to "ornithology", "botany" or similar, as that would indicate restrained context. --Dan Polansky 15:52, 5 December 2009 (UTC)
You are welcome to propose a vote for such a policy, but this template currently is already in the process of deletion in accordance with a previous vote on the issue. --EncycloPetey 15:59, 5 December 2009 (UTC)
I do not know of any vote on the mass deletion of these templates, but I'll gladly stand corrected.
I know of the vote Wiktionary:Votes/pl-2009-03/Context labels in ELE v2, which does not propose a deletion of templates. If it did, I would have opposed.
I also know of Wiktionary:Requests_for_deletion/Others#Template:bird created on 29 April 2009, which, technically, is not a vote. It is, admittedly, a precedent-making decision made outside of the full light of Beer Parlour.
And I know of Wiktionary:Beer_parlour#template:mammal, October 2009.
Anyway, I've created a prototype at the template "mammal" by placing the code "<span class="category-label" style="display: none;">(''mammal'')</span>" in there. The category label is invisible by default. The user can make the category labels visible by placing "span.category-label { display: inline ! important }" to his "monobook.css".
Other than that, votes are supposed to confirm current practice. The current practice has to grow from somewhere; it does not start with a vote. So my understanding anyway. --Dan Polansky 16:25, 5 December 2009 (UTC)
The vote you have linked above explicitly says "they must not be used merely for categorization". You are proposing using a context label merely for categorization, which is prohibited as a result of the vote.
Template:Mammal was deleted in accordance with a full discussion on WT:RFDO. Deletion discussions are not duplicated in other locations. People may choose to watch those discussions or not, but items that fail RFD(O) should not be recreated. --EncycloPetey 16:30, 5 December 2009 (UTC)
(unindent) I am proposing that those templates that are not restricted-context templates should be turned into category templates on the model of template:mammal. I am proposing the introduction of category labels for categorization. I find these category labels convenient, and some other users like them too. The category labels are hidden by default. What I am proposing is thus in no contradiction to "... context label templates ... must not be used merely for categorization"; once the templates become "category templates" they exit the scope of the term "context label templates".
On another note, if the intended consequence of this vote was to get all the templates deleted, that should have been made very clear, so that I could, in good conscience and for very good reason, have opposed in the vote. --Dan Polansky 16:47, 5 December 2009 (UTC)
The template:mammal was recreated by Stephen G. Brown, edited by me today, and deleted by EncycloPetey a short while ago, on 5 December 2009. Thus, people interested in what I was proposing and what it looked like cannot have a look at it. The template:mammal was deleted by the RFDO vote, in which three people voted for the deletion: EncycloPetey, Visviva and Mzajac. In that RFDO vote, msh210 proposed a redirection, although msh210 executed the deletion which implies agreement. This vote of three people is supposed to prevent any further recreation of the template by those who happen to disagree after the process. That is ridiculous. The RFDO process is meant to prevent too hasty deletions, not too hasty recreations by experienced and major contributors such as Stephen G. Brown. --Dan Polansky 17:00, 5 December 2009 (UTC)
As I pointed out, your proposal would require a vote to go into effect. Your proposed "context" templates are for categorization only, which was prohibited by the previous vote. You can speculate about the purpose of RFDO, and you can be frustrated by it if you like, but please note that one of the reasons for deletion states "Failed RFD or RFDO; do not re-enter", and so these items should not be recreated. This is long-standing Wiktionary practice. The fact that only three people bothered to explicitly comment is sad, yes, but that's how most RfD discussions work. Most people either agree and don't bother to comment, or don't care either way in the matter, or else don't participate. --EncycloPetey 18:53, 5 December 2009 (UTC)
Exactly. I agreed with the deletion of mammal, but since there was no displayed controversy, didn't feel I had to weigh in. Start up another vote if you want to see the policy changed. --Bequw → ¢ • τ 21:12, 5 December 2009 (UTC)
(unindent) I do not want to get any policy changed. Categorization templates are not context templates; thus the result of a vote that prohibits certain uses of context templates does not apply to them. Categorization templates can be invisible by default. I have as yet heard no reasoned objection to categorization templates, no explanation of what harm they do, other than that they are allegedly context templates--which they are not--and that a vote on context templates forbids them. --Dan Polansky 11:57, 6 December 2009 (UTC)
I just rfd'd {{flower}}, mainly to see what would happen. If we can word a vote right on "non-context context templates" then we can delete them per the vote, rather than by individual rfds. I just found {{city}} by chance too. Mglovesfun (talk) 14:51, 7 December 2009 (UTC)
Dan, you are free to make a pure-categorization template, but I don't think that's exactly what you are intending. Categorizing on the sense line is controversial so such a template would be best put at the bottom of the entry. The name should be unlike context label templates for clarity (perhaps birds-cat). Finally, I imagine it would not display anything, ever (no CSS-optional display). But with so little benefit over simply typing [[Category:Birds]] it's no wonder that there are practically no entry-categorization templates in Wiktionary. What you proposed seemed to be a context label template for pro users (and it wouldn't even display right for them because it couldn't be "stacked" in correctly with other context labels). It would be confusing to normal editors who could see its context-like name & placement but couldn't see any output. I would be against such a template. What exactly is the problem with just putting the plain-old cat at the bottom of the entry? What problem are you trying to correct? --Bequw → ¢ • τ 15:13, 7 December 2009 (UTC)
Bequw, I have used wrong words when speaking of "categorization templates". The right term is "category label templates". The point really is to show the label to pro users rather than merely categorize the entry. I do appreciate that the category labels should better look different from context labels. OTOH it is clear that "(bird)" is not a context label, as "bird" is not a restricted context. If there is a will for finding a way how to make category labels workable, there are surely several technical and formatting options I have not looked into. On the other hand, if I get outvoted in the RFDO of template:flower that MG has just started, the discussion is over anyway. --Dan Polansky 11:45, 8 December 2009 (UTC)
It looks like you want to circumvent the vote for Pro users. "Category label templates" were discussed in the Vote, and to my (and others') reading explicitly forbidden. --Bequw → ¢ • τ 14:49, 8 December 2009 (UTC)
I do not know how you and others read the subject of the vote, but category label templates are not explicitly forbidden in the vote AFAICS. And the phrase "to my (and others') reading explicitly forbidden" seems self-contradictory to me; either things need reading in the sense of guessing the intention and meaning, or they are explicit, but not both.
The vote was on approving the following text:
"A context label identifies a definition which only applies in a restricted context. Such labels indicate, for example, that the following definition occurs in a limited geographic region or temporal period, or is used only by specialists in a particular field and not by the general population. Many context label templates also place an entry into a relevant category, but they must not be used merely for categorization (see category links, below) ... ."
In this text I see no mention of deleting several templates, neither do I see explicit forbidding of category label templates; I see forbidding the use of context templates such as "geography" for mere categorization of, say, "river" or "mountain".
But this disagreement seems academic anyway. I get easily outvoted at RFDO of flower, which is going to formally confirm that the intention of the voters at the discussed vote really was to forbid category label templates. --Dan Polansky 18:01, 8 December 2009 (UTC)
Dan Polansky, is your intention to help the reader, or just to avoid deleting these templates because it will be very time-consuming? Mglovesfun (talk) 18:08, 8 December 2009 (UTC)
MG, my intention is of course not to prevent anything merely because it seems time-consuming; and it is not all that time-consuming with the help of a bot that, with the help of a straightforward regexp replace statement, replaces each use or invocation of the template with a category assignment.
My intention is to have senses tagged with topical category labels, invisible by default to satisfy those who find this idea too pioneering or unusual. --Dan Polansky 19:11, 8 December 2009 (UTC)
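The regexp replacement mentioned above could look roughly like the following Python sketch. It is only an illustration under stated assumptions: the template-to-category mapping is invented for the example, only bare invocations such as {{bird}} are handled (no lang= parameter, no use inside {{context|...}}), and the category links are simply appended at the end of the text.

```python
import re

# Hypothetical mapping from label template name to topical category name.
TEMPLATE_TO_CATEGORY = {"bird": "Birds", "fish": "Fish", "flower": "Flowers"}

# Matches a bare invocation such as "{{bird}} " together with trailing spaces or tabs.
TEMPLATE_RE = re.compile(
    r"\{\{(%s)\}\}[ \t]*" % "|".join(map(re.escape, TEMPLATE_TO_CATEGORY))
)


def templates_to_categories(wikitext):
    """Remove the label templates and append the matching category links."""
    found = []

    def drop(match):
        category = TEMPLATE_TO_CATEGORY[match.group(1)]
        if category not in found:
            found.append(category)
        return ""

    stripped = TEMPLATE_RE.sub(drop, wikitext)
    if not found:
        return wikitext
    links = "\n".join("[[Category:%s]]" % name for name in found)
    return stripped.rstrip("\n") + "\n\n" + links + "\n"


entry = "==English==\n===Noun===\n# {{bird}} A small songbird.\n"
print(templates_to_categories(entry))
```

Note that this moves the information from the sense line to the entry as a whole, which is exactly the loss of sense-level tagging debated in this thread.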
We already have a way to categorize that is "hidden" by default (straight categorization), so what you are proposing to change is to have the option of seeing the categorization at the start of the definition line. This is technically messy...very messy, since it requires coding to check all manner of things before it displays. If placed at the start of the definition line, such a template would have to interact with {{context}}, in case people started using the expected code like {{context|archaic|Australia|mammal}}, and so would have to be a kind of context template. You can't get around that by claiming it's not a context template. You are intending it to do the same job, but to be hidden from most users and from all anons. --EncycloPetey 15:03, 9 December 2009 (UTC)
I've got another idea. We could use category tags like those proposed by Dan, but limited to non-English entries and added at the end of senses. This in order to avoid or limit the need to add a lot of definitions to non-English entries (like it happens in the underneath example of фаланга). You can see how this would work in User:Barmar/Italian N1, an example that I have created with an invented Italian word with invented meanings and categories. Basically, you have a categorization label ie {{tree||lang=x}} that includes a white space for comments or scientific names for plants and animals (IMHO useful because often different plants or animals share the same common name) and a language parameter. This category label would add at the same time definitions to different meanings of a given word and categories at the end of the entry. Any thoughts? --Barmar 15:28, 21 December 2009 (UTC)
Hi there (and sorry for my written English that is not very good). Here's my two cents. And what about using for common plant/animal names something like (comm., botany) or (comm., zoology) where comm. means common name, like some dictionaries do? I mean, the template {{fish}} applied to the word shark would add (comm., zoology) at the beginning of the definition and the category [[Category:Fish]] at its end. The template {{plant}} applied to tansy would add the (comm., botany) at the beginning of the definition and the category [[Category:Plant]] at its end. And so on for mammals, trees, mollusks and so on. This would be very beneficial for non-English entries, where there is more need of categorization because we usually don't use definitions. Without category tags like botany or zoology or whatever would become more difficult to distinguish between the many meanings of a single word. --Barmar 09:28, 20 December 2009 (UTC)
I want to second Barmar here. I disagree with the results of the Wiktionary:Votes/pl-2009-03/Context_labels_in_ELE_v2 vote, whereby context labels now “indicate usage as jargon within the indicated field”. This may make sense in an En-En dictionary, where the definitions are fully written, but not so for FL entries. A context (zoology) would help me immediately to understand that the “shark” in акула refers to the fish and not the figurative senses. Also, with contexts (history) and (anatomy) I can easily define фаланга without explaining in parentheses that the first sense is about the military unit, the second about the bone; and the word gets categorized at the same time. In short, I think {{fish}} should display as (zoology) and categorize into Category:xx:Fish; {{flower}} should display as (botany) and categorize into Category:xx:Flowers; that’s what 90% of my bilingual dictionaries do.
Also, let's not act like Wiktionary:Votes/pl-2009-03/Context_labels_in_ELE_v2's 7-2-1 result is a genuine consensus. I know at least three more people who haven't voted but would like the labels behave like I described. --Vahagn Petrosyan 11:24, 20 December 2009 (UTC)
Vahagn, I can tell you from personal experience that botanists do not use the common names of flowers, so to label them with (botany) would be positively misleading. Those words are not part of botanical jargon; they are used by gardeners, horticulturalists, and amateur nature lovers, but not by botanists. Worse, some of the so-called "common names" aren't actually used by anyone except publishers of field guides. --EncycloPetey 21:59, 20 December 2009 (UTC)
My point is that we should not use context labels to mark words that are part of a jargon in a certain field. To me, a context label is a means of distinguishing senses and translations of French grue. I just looked up my fr-en dictionaries: all of them mark the bird sense as (zoology) or (ornithology), and the machine sense as (technology). I cannot see why you would want to replace this useful purpose of context labels (accepted by most bilingual dictionaries) with the function of merely indicating certain-area-jargon. --Vahagn Petrosyan 13:28, 25 December 2009 (UTC)
I would support reversing that vote, which I regret not participating in. The problem seems to me that as it is being implemented, it has the effect of wiping out content by converting sense-level information to language-level information. I would have hoped that this would bother people as they were doing it. At the very least the process of template deletion has to stop now. DCDuring TALK 11:52, 20 December 2009 (UTC)
@Vahagn Petrosyan: your example about акула is perfect. Actually with tags like (comm. botany) where comm. means common name or (comm. zoology) I was proposing something different: a compromise between context label = technical jargon and context label = common use/sense of the word. @DCDuring: could you please explain what you mean by 'language-level information'? (sorry, I've not understood) --Barmar 13:54, 20 December 2009 (UTC)
Sorry that I relied so much on context. Thanks for the opportunity to clarify. A category is naturally associated with a page, in principal namespace, an entry. By our practice, a context is associated with a sense. By our practice and category structure, we also have specialized our categories to be language-specific. I was referring to this as "language level". The narrowest specification of category information is an L2 language header. This is certainly useful. But, for long language sections (most commonly, English), it is not at all easy to determine which sense might be associated with a particular category.
For me the problem arises in associating grammatical or grammatical/semantic categories with particular senses. I am using these to attempt to improve the quality of the definitions, usage examples and synonyms for adverbs, for example. I cannot do it without context tags which are visible to me, short of some kind of rectal tonsillectomy, which is beyond both my surgical and technical skills. DCDuring TALK 15:39, 20 December 2009 (UTC)
Can you give a specific example so that maybe a technical solution can be found? Carolina put forth the idea of categorizing on the sense-line. This accomplishes the goal of relating cats to senses, but it's slightly hidden in the wikitext. --Bequw → ¢ • τ 01:45, 23 December 2009 (UTC)
I was hoping for something visible without opening the editbox. I can insert as comment a terse abbreviation that reminds me of the category and doesn't waste edit box space (long full category names), but it is double work.
See the subcategories under Categories:English adverbs, such as Category:English degree adverbs. "Degree adverb" is a characteristic associated with a sense. It is arguably not a satisfactory term for normal users, but it is very useful for those who might be hoping to improve the entry. There is nothing technically complicated about having it appear as a context label and having the context template assign the headword to a category. The problem is just in whether we want to exclude that orienting information from appearing because it violates our idea of the purpose of a context label. It looks too much like scaffolding and chalklines, not like the facade of a finished building. To me we are still a construction site: entries that seemed great two years ago already could stand some major upgrades. So perhaps what I'm looking for is a class of sense templates that assign a category and provide a label that can be seen by registered users who wish to see them. If this turns out to be too complicated or of insufficiently wide interest, then I can make do, as HippieTrails seems to have, with embedded comments visible only in the edit box. DCDuring TALK * Holiday Greetings! 03:44, 23 December 2009 (UTC)
But Barmar, shark is not a common name of any species; it is an everyday word which does not quite correspond to any taxonomic category. Your label would therefore be incorrect. Even more problematic, there is no longer any taxonomically defined group that corresponds to fish; zoologists use technical terms to refer to certain groups of fish, but "fish" is no longer a taxonomic category and thus "fish" is not the common name of any current zoological taxon. What you are proposing we do, then, is to inappropriately label items that are simply everyday words. Also, what would you do for a word that translates as "plant" (in the sense of the organism)? There are at least three different botanical meanings of that term in use that are very, very different. Labelling as (plant) or (botany) would not help clarify such a situation. It is better by far to put the definition in the definition, rather than try to cheat by putting some of the definition into a category tag. --EncycloPetey 21:55, 20 December 2009 (UTC)
Ok some dictionaries don't agree on that, but I definitely prefer editing to discussing :), so if I have understood it right, in non-English entries we must add definitions to translations. I.e. фаланга (example of Vahagn Petrosyan) would become
===Noun===
фаланга (falánga) f.
  1. phalanx (ancient Greek military unit)
  2. phalanx (bone of the finger or toe)
  3. sun spider, wind scorpion (insect of the genus Solifugae<-or whatever it is)

Can you confirm this? --Barmar 08:16, 21 December 2009 (UTC)

It could, yes, except the last could simply be glossed (arachnid)...it isn't an insect. However, I rather doubt that either sun spider or wind scorpion have any other meaning in English, so no additional gloss should be necessary at all. Also, depending on its actual usage, phalanx (2) could be labelled (anatomy) as it currently is. That depends on whether it's an everyday word or anatomical jargon. If it's jargon, like it is in English, then the (anatomy) context tag is correct as it stands in the entry. --EncycloPetey 15:52, 21 December 2009 (UTC)

User:JackBot

For a few days, we need to make apply the terms consensus in our categories in order to correct a grammatical error. After an IRC confirmation, I've begun manually today, and for Category:English_terms_spelled_with_ligatures I could also test automatically with replace.py, before being stopped by EncycloPetey (I apologize for the time I made lost to him), because I've abused to let the script modify its 200 entries whereas our policy tells 10-100 entries enough clearly.

As far I can see there is now around 1,000 articles to change with a bot.

If nobody wants to do it I propose to continue 1 week with a bot flag:

  1. Your user name: User:JackPotte.
  2. The proposed bot user name: User:JackBot (already flagged on fr.wikt).
  3. The bot software you'll be using, with pointers to its source and documentation if it's not standard: replace.py.
  4. An indication of the task or tasks you'll be performing with the bot: replace the string of characters "words" by "terms" in a few categories in around 1,000 articles.

JackPotte 20:21, 5 December 2009 (UTC)

Sounds okay. Create the bot User page, and indicate the replacement patterns (regexp) if you would. --Bequw → ¢ • τ 21:15, 5 December 2009 (UTC)
Done, with the converter, it's "Category:English words spelled with ligatures" replaced by "Category:English terms spelled with ligatures" in the pages described on my bot request profile. JackPotte 22:12, 5 December 2009 (UTC)
  • I noticed no prior consensus. I also can't understand what is being proposed. That is, I do not understand the sentences as English. The word "consensus" doesn't match my understanding. I have no idea who the referent for "we" is. I don't recognize "make apply". I have no idea what any of the discussion has ever had to do with a "grammatical error". And that's just the first sentence. It is very difficult to trust any process so dreadfully explained. DCDuring TALK 01:09, 6 December 2009 (UTC)
Clearly, "hors d'œuvre" isn't a single word, and "hors" doesn't include any ligature. Yesterday I wasn't the only one to understand this current grammatical problem to let it in Category:English words spelled with ligatures, on this page, the French equivalent and on the #Wiktionary, for more than 15 days. That's why I begun to correct it. JackPotte 09:52, 6 December 2009 (UTC)
JackPotte, that's almost a punny use of grammatical, the issue is in the choice of word and the word is about grammar (you'd need a whole sentence to find a grammatical error), also you need a WT:VOTE to get a bot flag. DCDuring, I think this is "consensus" as in "lack of disagreement despite publicity" (the discussion was on WT:BP after all). The process is simple enough, replace the many occurrences of the word "words" in our category names by the word "terms"; the rationale is that a "word" is commonly (mis)understood to mean a set of letters without spaces, whereas a "term" has a broader intuitive meaning. Conrad.Irwin 10:06, 6 December 2009 (UTC)
Ahh. Intelligibility. Trust. DCDuring TALK 10:29, 6 December 2009 (UTC)
If we have a bot vote, that will quite clearly discuss what the bot is actually doing, which will serve as a consensus in this case, right? Mglovesfun (talk) 10:58, 6 December 2009 (UTC)
Fulfil(l)ed. JackPotte 11:54, 6 December 2009 (UTC)
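
For readers following the thread, the rename described above amounts to one literal substitution per page. The following is only a minimal Python sketch of that substitution, not the actual replace.py run (whose patterns are listed on the bot's user page). The names OLD, NEW, PATTERN and rename_category are hypothetical, and accepting underscores in place of spaces is an assumption based on the underscore form of the category name quoted in the request.

```python
import re

# Category names quoted in the thread.
OLD = "Category:English words spelled with ligatures"
NEW = "Category:English terms spelled with ligatures"

# Assumption: accept either a space or an underscore between the words,
# since the request itself writes the name with underscores.
PATTERN = re.compile(r"[ _]".join(re.escape(word) for word in OLD.split(" ")))


def rename_category(wikitext: str) -> str:
    """Return wikitext with the old category name replaced by the new one."""
    return PATTERN.sub(NEW, wikitext)


if __name__ == "__main__":
    sample = "==English==\n[[Category:English words spelled with ligatures]]"
    print(rename_category(sample))
```

A literal pattern restricted to the full category name keeps the bot from touching the ordinary word "words" elsewhere in an entry, which seems to be the point of listing the exact replacement before the run.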

What happened to "the next version of Wiktionary"

In the past there was a proposal for a next version of Wiktionary, using software specifically developed to support Wiktionary, a semantic network etc. Instead we seem to have an ever increasingly complicated Wiktionary with a labyrinth of templates and layout rules, with serious arguments between those like me, interested in adding new words, new information, but not too fussed about format and layout, and those very concerned about layout and format.

  • Does anyone know what happened to that proposed new version ?
  • Should the community perhaps be more interested in developing such seemingly necessary new software, rather than making Wiktionary increasingly complex, complicated ?--Richardb 02:11, 6 December 2009 (UTC)
See http://omegawiki.org/ Such software would be nice, and I (and many others) have given a lot of thought to how to do it. To be successful, I think the following must remain true:
  1. It must be a smooth transition. (There's no way we can expect all users to unilaterally move to a new system that requires new training and skills, there's also no easy way to import the wealth of information we already have)
  2. It must be able to encode all the information we have now, even the stuff that means nothing to a computer and is just notes for humans.
  3. It must be flexible enough to allow non-technical users to add all the information they want to in a manner that is readable by a computer.
OmegaWiki took a different viewpoint and just started from scratch, they have lots of translations and definitions, and technical-looking pages (though I'd be the first to admit that ours are hardly less intimidating for the uninitiated).
The way I would like to see evolution in the immediate future is to stick with the Wiktionary we have in the middle, but to work downwards (towards machines) and upwards (towards humans). The downwards work would consist of adding a basic API, and then making the wikitext more regular and templatised; meanwhile the upwards work would be projects like WT:EDIT which allow the humans to add knowledge without needing knowledge of the formatting (though the wikitext will still be there for the advanced users and robots who wish/need to edit it). The difficulty with downwards development is doing it in a manner that Wikimedia will permit on their servers (i.e. it has to be reasonably efficient), the difficulty with upwards development is trying to design interfaces that are actually pleasant to use. (And of course the whole thing is made harder because you can't build in either direction individually, better editing tools need more structured wikitext, and more structured wikitext requires tools to help people edit). Conrad.Irwin 02:37, 6 December 2009 (UTC)
I'd say that working upwards is more important at the moment. It would be great to have something like WT:EDIT for Wikisaurus. And for topic categories... --Yair rand 03:11, 6 December 2009 (UTC)

Automatic TOC in categories

These changes were made by me some months ago, but in case you want to know, improve, criticize, praise or discuss:

  1. TOC templates are added automatically into every POS category (programmed with {{poscatboiler}}) and every affix category (programmed with {{affixcatboiler}}), considering the examples below.
  2. There is a {{ru-categoryTOC}} but not a {{ru-categoryTOC/full}}, so the existing template is added into all Russian categories that contain more than 200 members.
  3. There is a {{pt-categoryTOC}} and a {{pt-categoryTOC/full}}, so the former is added into Portuguese categories that contain more than 200 members, except those with 2500 or more members. When a Portuguese category has 2500 or more members, {{pt-categoryTOC/full}} is used instead.
  4. The exact naming scheme is "Template:xx-categoryTOC" and "Template:xx-categoryTOC/full", where xx is any language code. For consistency reasons, old messy template names such as hypothetical "Template:EnglishTOC" or "Template:TOC-en" wouldn't work.

--Daniel. 11:09, 6 December 2009 (UTC)
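
The selection rule in the list above can be sketched in a few lines of Python. This is only an illustration of the stated thresholds, not the code of {{poscatboiler}} or {{affixcatboiler}}. The function name choose_toc_template and the existing_templates set are hypothetical, and the case where only a "/full" template exists is not covered by the description, so its handling here is an assumption.

```python
from typing import Optional, Set


def choose_toc_template(lang_code: str, member_count: int,
                        existing_templates: Set[str]) -> Optional[str]:
    """Pick the TOC template for a category, following the thresholds above.

    existing_templates holds template names without the "Template:" prefix.
    """
    base = f"{lang_code}-categoryTOC"
    full = f"{base}/full"

    # 2500 or more members: use the full TOC when that template exists.
    if member_count >= 2500 and full in existing_templates:
        return full
    # More than 200 members: use the plain TOC. This also covers large
    # categories in languages that have no "/full" template (e.g. Russian).
    if member_count > 200 and base in existing_templates:
        return base
    # Small categories, or languages without any TOC template, get none.
    return None


if __name__ == "__main__":
    pt = {"pt-categoryTOC", "pt-categoryTOC/full"}
    ru = {"ru-categoryTOC"}
    print(choose_toc_template("pt", 3000, pt))
    print(choose_toc_template("ru", 3000, ru))
```

Because the names are derived from the language code alone, a template with an older name such as the hypothetical "Template:EnglishTOC" would never be found, which matches point 4.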

The big Finnish monster

Okay I was talking to Jyril about this yesterday and I think it's about time someone (namely me, at least partially) tackled this. What I propose to do is make it like Category:Hungarian noun forms. I don't know how hard it'll be but Jyril did say {{fi-form of}} or something was made in such a way that it'd be easy to do this. Thoughts? (and yes..., I know that a categorisation issue such as this is not Wiktionary as a whole's no. 1 priority but it's something I want to deal with) 50 Xylophone Players talk 19:55, 6 December 2009 (UTC)

mazel tov; whatever floats your boat - seriously. DCDuring TALK 20:10, 6 December 2009 (UTC)
I don't see any purpose in these numerous noun/verb forms subcategories. IMHO they should all be kept together in one giant PoS category like Finnish. And also noun plurals categories - they make no sense for languages with cases such as Finnish and Hungarian. It's not "plural form of <lemma>", it's "nominative plural form of <lemma>" (and usually some other case as well). --Ivan Štambuk 21:23, 6 December 2009 (UTC)
Well, I hate the idea of maybe someday having over a million entries in categories like this as I already stated on Jyril's talk page. As for the "(nominative) plural of", it makes some sense because nom. plural is the simple plural form indicating nothing but plurality, e.g. házak corresponds to "houses", whereas házakban corresponds to "in houses". Perhaps a note should be added to Category:Hungarian plurals to disambiguate the matter of exactly what kind of plurals they are. Category:Swedish plurals already has something doing the same kind of job. 50 Xylophone Players talk 15:22, 7 December 2009 (UTC)
Like Ivan, I see no purpose in micro-categorizing all the inflected form entries according to the specifics of their morphologies/grammar. --EncycloPetey 02:43, 10 December 2009 (UTC)
Also, even if you have separate subcategories for specific case forms (which might possibly come in handy, although a case in point is not evident to me at this time), I don't really see why the entries should not also be categorized in the main noun form category. I think there is greater need for a category that lists (ideally and hopefully eventually) all conceivable noun forms than noun forms within a specific case. So, to sum it up: keep the main noun forms category as it is, but by all means add the others if you find it useful. – Krun 11:29, 11 December 2009 (UTC)

Main Page redesign

The Main page redesign seems to have basically died down. Anyone want to try a six-day rush job, throwing in nominations for the new features and deciding all the last minute stuff, to attempt to make the redesign launch on Wiktionary Day? --Yair rand 20:08, 6 December 2009 (UTC)

Okay, that was rather ridiculous and totally impossible. We should at least try to get it done this year though, so it's still the 2009 redesign. --Yair rand 06:03, 9 December 2009 (UTC)
I don't think there is enough initiative to tackle something this big any more. Some of the proposed additions to the Main Page withered long ago, and would need sustained input over a long time in order to work. I've been doing Word of the Day nearly solo for several years now, and on the rare occasion that I fail to get the new WOTDs put in, it's been three or four days before the community even noticed this. Extending the Main Page to include several additional items that require such continued and regular attention doesn't seem likely to succeed, in my estimation. I'd like to be wrong about that, but that's my own experience. --EncycloPetey 02:50, 10 December 2009 (UTC)
Well, the "Interesting stuff" feature seems to mainly just pull stuff out of our existing stockpiles of rhymes, glossaries, appendices, random translations and a lot of other stuff lying around. I think if we trim the harder sections of Interesting stuff, and expand the standards of Word du Jour to include basically anything, this could work. But maybe not... --Yair rand 03:28, 10 December 2009 (UTC)

Logo voting

... is now open, so that "Vote on a new logo for Wiktionary" link at the top of the page actually means something now. L☺g☺maniac ☃ 00:10, 7 December 2009 (UTC)

I have advised the Russian Wiktionary. Please spread the word across other language projects. Anatoli 00:26, 7 December 2009 (UTC)
Posted in the Spanish Wiktionary's Café. L☺g☺maniac ☃ 15:33, 7 December 2009 (UTC)

Another place name deleted

Another place name has been deleted - Chiayi - a city in Taiwan, an administrative centre. It was rfv'ed, not rfd'ed. What has been achieved? How does it improve the Wiktionary? Does anybody care? I do and I am very upset. Anatoli 02:53, 7 December 2009 (UTC)

I am confused. This entry was RFV'd and no one was able to provide citations for it which means the city must not exist, right? --Yair rand 03:05, 7 December 2009 (UTC)
I am confused about the request for citation - why and what. Chiayi (Jiayi) 嘉義/嘉义 does exist, of course, not just in reality but in the dictionaries, Wikipedia, etc. The RFV serves as a signal to delete for some, which is a big worry. --Anatoli 03:16, 7 December 2009 (UTC)
Then the entry may be recreated once someone finds a citation that the city exists, right? --Yair rand 03:19, 7 December 2009 (UTC)
I am not sure what drives place name haters here. Will the citations be satisfactory? Will the sources be considered "reliable"? Are these the real reason for the deletion? As if a city will cease to exist if there are no satisfactory English citations. In my opinion, it's a misuse of the authority given. The place names from the English speaking world are also welcome, not from other places. I feel sorry for my time spent on the entry - finding translations and transliterations in other languages. The English only entry itself has little value and would not take a long time to recreate. --Anatoli 03:32, 7 December 2009 (UTC)
Wow, straw man much? We've had this argument enough times, I'd think you'd at least have a vague notion of what people have been telling you. Are you intentionally misrepresenting them, or were you just ignoring their explanations to begin with? —RuakhTALK 03:37, 7 December 2009 (UTC)
Not quite. According to our criteria for inclusion (CFI), the question isn't whether the city exists, but whether the English name Chiayi is used attributively, with a widely understood meaning. (One can argue that the CFI allow it to be listed just as a name, like we do with given names and surnames; but first of all, I'm not clear on exactly how that would work, and secondly, that wouldn't satisfy Anatoli.) Personally, I don't RFV real place-names, given that the "attributive use" criterion doesn't really seem to have consensus, but if someone lists them at RFV, and no one provides citations, I'm not sure what Anatoli wants me to do. We've had many discussions and votes towards addressing the issue, and none of them has accomplished anything. —RuakhTALK 03:37, 7 December 2009 (UTC)
Where was the vote to change CFI? I am not ignoring what I am being told, I disagree with what they say - I can only remember maximum about 5 users who would agree with you. People with this opinion (CFI based on attributive usage) are not in majority here but are the administrators with the right to delete. Yes, there were a lot of discussions and I could tell that the majority was for the increase of CFI, not for the decrease. The rule to base CFI on attributive usage is not followed, will only allow place names known to English speakers. If you want to be nice and don't know what to do when someone RFV's, place "missing citations" or other flags, add to discussion but why delete? Deletion is not productive, it's destructive, in any case. Do you personally doubt that the place exists and will have entries in dictionaries and pretending to be innocent? Anatoli 03:49, 7 December 2009 (UTC)
The simple logic is that it is not necessary that any place names be in a dictionary. It had been decided before I arrived here that they were to be excluded, except for those names that meant something beyond their literal meaning. "Golden Gate Bridge" is meaningful because its use as a jumping point for suicides appears in publications in ways beyond simple reporting of the fact of a suicide. This kind of usage is not very common, but some famous places have that kind of associated meaning. There have been various proposals to allow some kinds of places. The proposals always end up with some kind of non-lexicographic criteria that amount to notability.
But, frankly, there seems to be no one willing to assume responsibility for making an intelligent proposal, let alone implementing it. Until there are at least two or three people who seem willing to put in the work required and start to do so, I doubt that a vote to change policy will succeed. Right now, if people are even unwilling to take the trouble to find out what geography entries might be included under current and proposed standards and do the work to cite a few entries, I think there is no basis for expecting anyone to do the sustained work required. At this point many of the geographic entries that have been made do not even meet our formatting standards. If they do not, they show up on cleanup lists. When they do, I often tag them if they do not seem likely to meet our current standards.
There seem to be quite a few people here who are already favorably disposed to place-name entries. It probably would only take a core group to make a good effort to understand the issues and make a proposal to get things changed. Perhaps someone could take a stab at something at Wiktionary:Editable CFI. DCDuring TALK 04:06, 7 December 2009 (UTC)
Well suppose it doesn't? Suppose someone, as a joke, makes up a realistic sounding place name - let's say, Fjeurnsalooften, and claims it is a town in Norway. How can we protect the integrity of the dictionary from false entries without requesting proof? bd2412 T 04:09, 7 December 2009 (UTC)
Use an atlas/Google/Google Earth!!! That's how lol...50 Xylophone Players talk 15:25, 7 December 2009 (UTC)
So you don't think that atlases, Google, and Google Earth constitute "proof", but do think that we should base our CFI on them? Sorry, but I don't think that makes sense. (And anyway, decent atlases don't include such place-names as Valhalla, which surely merits an entry more than Chiayi does.) —RuakhTALK 15:50, 7 December 2009 (UTC)
WTH? I never said I didn't think atlases etc. constituted proof. o.O Where are you getting that from?? As for Valhalla, while it is a "place name", whether it merits inclusion or not should, IMO, be judged along with the likes of entries for Greek, Norse, etc. gods. 50 Xylophone Players talk 23:41, 8 December 2009 (UTC)
The place names situation is very similar to the given names and surnames, which also don't have an official policy. What we need here is for someone to start up a proposed policy page on place names, given names, and surnames, for everyone to work on, so we can have a vote and finally settle matters. --Yair rand 04:25, 7 December 2009 (UTC)
We've had 4 vote attempts on this issue so far, it's not such a simple matter. See Wiktionary:Votes/pl-2009-08/Common placenames get entries, Wiktionary:Votes/pl-2007-06/Placenames 2-A, Wiktionary:Votes/pl-2007-05/Placenames 2, Wiktionary:Votes/pl-2007-02/Placenames. If you want to create such a page do it, nothing on Wiktionary ever happens if the task is assigned to "someone". Conrad.Irwin 14:05, 7 December 2009 (UTC)
We also had this vote, Wiktionary:Votes/pl-2009-05/Names of specific entities, which I still believe to be the most sensible answer. Maybe it didn't pass as an overall solution to the proper names issue, yet it could pass in a narrower scope. I've never heard, and would hardly believe, an analogy such as "the Chiayi of..." so I'm fairly confident the term in question wouldn't pass in English. In Chinese it's probably a different story though. And if not for Chiayi, maybe for the Chinese spelling of Taoyuan. And if not for Taoyuan, then for a bigger city like Taichung. And if not for Taichung, then most certainly for Kaohsiung, depending on how inclusive or exclusive the criteria are. Apart from where the line is drawn for any single language, the question becomes, would we allow translations of place names that can be cited only in a different language? If we use citation to back entries, inevitably there will be some that are highly recognized abroad and all but unknown to English speakers.
I personally think that a vote on expansion of exclusions to include "Wiktionary is not a gazetteer" (in the sense of a geographic dictionary) could culminate in a decision to or not to include placenames that are only placenames and do not have additional meanings. That might be too simplistic an idea, though. --Ceyockey 05:08, 8 December 2009 (UTC)
Not to include any place names that have no further meaning, including New York, Africa, and Jupiter? --Yair rand 05:40, 8 December 2009 (UTC)
I think a suitable enough criterion is whether a translations section is possible, i.e. whether knowledge of the place name is well-known enough for translations to have developed. --Yair rand 05:43, 8 December 2009 (UTC)
"Wiktionary is not a gazetteer" would not be an absolute blockade on inclusion of standalone placenames, just as "Wiktionary is not an encyclopedia" does not dictate definition content. Rather, it would set a high bar for inclusion, albeit a bar which would be open to interpretation (for better or for worse). --Ceyockey 11:22, 8 December 2009 (UTC)
  • The most important sentence in our CFI is the first one "As an international dictionary, Wiktionary is intended to include “all words in all languages”". Perhaps some people would like to change that to "most words in most languages" - but I would prefer it to stay as it is. SemperBlotto 11:34, 8 December 2009 (UTC)
    • Is not the entire purpose of the CFI to qualify the aspiration of "all words in all languages"? --Ceyockey 23:25, 8 December 2009 (UTC)
      Indeed. Not a single constituent of our slogan is without qualification, including "in". Much of it is necessary so we have the time to upgrade the quality of our content so it approaches that of our competitors. In the long run, we may have enough contributors to be able to successfully include more. DCDuring TALK 23:49, 8 December 2009 (UTC)
      Well, what is a word? Is a place name a word if it's proven to be in use? Is it a word if it's part of one language, meaning that other languages have their own versions? Is it a word if it has been mentioned in published works, or if has been displayed in a map or an atlas? Are all place names assumed to be words, or just those that are integrated into a language as much as any regular word? --Yair rand 00:01, 9 December 2009 (UTC)
FYI, all toponyms in all languages are words. --Ivan Štambuk 01:06, 9 December 2009 (UTC)

Clarifying placename issues

To try and get some structure going here, let's assume we want "all words in all languages" and we have workable definitions for "all", "in", "language", just wishing to clarify "word" as it relates to placenames. After a few days/replies/when this whole structure disintegrates, hopefully we'll be able to see why we have disagreement; then we could try to solve it. Does anyone have short(ish) answers to the following questions, you don't have to answer them all, but try to avoid replying to answers (at least initially) any comments can probably go positively under the opposite section. Conrad.Irwin 00:37, 9 December 2009 (UTC)

Why is a placename not a word?
because it's a placename. Conrad.Irwin 00:37, 9 December 2009 (UTC)
Generally, a word is a phonetically or orthographically separable sequence of sounds (in most languages; in some, due to extreme sandhi or agglutination, only grammatically separable). Whether something is a placename or not has to do only with semantics. All placenames in all languages are words on their own. The whole "problem" of placenames as "non-words" was raised only to somehow degrade their status, as if they have nothing to do in a dictionary, which is in fact wrong as all modern dictionaries of all languages include at least some placenames. --Ivan Štambuk 01:31, 9 December 2009 (UTC)
re "because it's a placename": this is an empty reason, isn't it? It does not state any property of placenames from which their wordness or non-wordness could be inferred. --Dan Polansky 09:15, 9 December 2009 (UTC)
A placename, like a personal name, may be arbitrary and of no practical use to a dictionary. It seems obvious that if I decide to name my house Beedeevaynia, that would not merit an entry here (even though it is clearly a word that I have coined). On the other hand, it seems equally obvious (to me, at least, and to most of us, I think) that we ought to have entries for Chicago, Connecticut, Andorra, Gulf of Mexico, Mount Everest, and Ganymede, for example. The trick is explaining why we want the latter and pinning down the dividing line, a function I find to be fairly well served by the existing CFI. bd2412 T 04:16, 10 December 2009 (UTC)
A placename, like a personal name, may be arbitrary - Excuse me, but what exactly is this supposed to mean? How are toponyms "arbitrary"?
...and of no practical use to a dictionary. How can you say something like that after everything that was written in this discussion? Entries for toponyms are exactly like the "normal" entries minus the definition lines. They have their own pronunciations (often extremely unpredictable), etymologies, translations, obsolete spellings, archaic varieties, slang synonyms, their own derived forms (demonyms, relative adjectives, even verbs in some cases). They are also an extremely important source of etymological information (because they're usually attested before most of the languages were ever written. There are even some languages entirely reconstructed from toponyms). --Ivan Štambuk 13:16, 10 December 2009 (UTC)
If a placename is not a word, should Wiktionary include it anyway?
useful to readers. Conrad.Irwin 00:37, 9 December 2009 (UTC)
many of the same reasons as regular words: provide translations, pronunciation, etymology etc. --Yair rand 00:59, 9 December 2009 (UTC)
seconding input from Yair rand --Ceyockey 01:17, 9 December 2009 (UTC)
An entry without at least translations may be of little value but translations can't be added if the entry doesn't exist. A purpose of any dictionary for place names is to look up a name in another language in a convenient way. The (previously) deleted Chiayi entry contained this useful information. An example: how do you find out how to pronounce 嘉義 in Russian or Korean? What do the characters 嘉 and 義 mean? --Anatoli 02:39, 9 December 2009 (UTC)
Contributors can also contribute to the wikipedia w:Chiayi page. We aren't the only place people can add info to. And those pages probably have easier formatting details (though you'd have to write a few more words to make stub entries for those not already created). --Bequw → ¢ • τ 15:04, 9 December 2009 (UTC)
You cannot add translations to foreign languages to that page. In fact, you cannot add foreign-language entry info at all to Wikipedia because it's monolingual project and not multilingual like Wiktionary. We also have languages many of which don't even have an associated Wikipedia project. The type of content we're primarily interested in as a dictionary is of little or no value to encyclopedia and vice versa. The most logical conclusion is that they should complement each other. --Ivan Štambuk 15:35, 9 December 2009 (UTC)
Translations are done via the iwikis (works only for languages that have a wikipedia project). This does raise the bar for editing since to add a new translation someone has to create at least a stub article in the foreign language wikipedia before adding the iwiki. Admittedly, not perfect. --Bequw → ¢ • τ 19:26, 9 December 2009 (UTC)
Interwikis function very poorly as translations:
  1. For one, they're not translations at all. They're articles on FL wikipedias on the same topic or related topic. An article on English Wikipedia on some English village might interwiki to an article on FL wikipedia on the entire county. An article on a certain mountain peak on English Wikipedia might interwiki to an article on FL wikipedia on the mountain itself.
  2. Often interwikied articles are not in lemma forms. We deal with this all the time because users copy/paste interwikis from Wikipedia translations that happen to be in plural, in definite form, or similarly grammatically marked "title" form.
  3. Interwikis are bound to only one script, which is a major drawback in languages written in several scripts (like Mandarin - the most spoken language in the world). They also don't provide other additional information that our translations do: transliteration, gender and alternative display (with marked accents - often of utmost importance for proper pronunciation).
In general Wikipedia interwiki serves as a very bad source for the translations of toponyms. We could include all that I listed above plus more: historical and regional forms that will never have their own Wikipedia articles. Interwikis are opaque and decoding them requires intuition and lots of unnecessary assumptions. Reading checked Wiktionary translations OTOH provides information the user can firmly rely on. --Ivan Štambuk 20:17, 9 December 2009 (UTC)
Why is a placename a word?
clearly a "distinct unit of language" (cf. word). Conrad.Irwin 00:37, 9 December 2009 (UTC)
it is a proper noun or noun phrase with a distinct though historically mutable meaning. --Ceyockey 01:15, 9 December 2009 (UTC)
gets pronounced, written, typed, sometimes contains no spaces, is a proper noun and proper nouns are words; in other words, looks like a word or a multi-word term, is non-SOP (the location of "New York" cannot be determined from the location of "York" and the meaning of the common noun "new"). --Dan Polansky 09:10, 9 December 2009 (UTC)
Agree, but not all proper nouns are words (e.g. Winston Churchill is not a word, but two words). But most placenames are words, including New York. Lmaltier 22:45, 9 December 2009 (UTC)
I don't mean to sound like a broken record, but where do you get these rules from? You've provided no evidence that the "wordness" of one is different than the other. I think it's far more "fuzzy" than you assert what a word is. --Bequw → ¢ • τ 23:52, 9 December 2009 (UTC)
If a placename is a word, why should Wiktionary not include it?
waste of time, it's in Wikipedia. Conrad.Irwin 00:37, 9 December 2009 (UTC)
dog is in Wikipedia, too. But information included is not the same. Here, it should be linguistic info, just like all paper language dictionaries dedicated to placenames (they mainly deal with etymologies). Lmaltier 22:45, 9 December 2009 (UTC)
Especially since the default search here now searches Sister projects like Wikipedia. --Bequw → ¢ • τ 15:04, 9 December 2009 (UTC)
a distraction from core task of being as good a dictionary as our competitors. DCDuring TALK 00:52, 9 December 2009 (UTC)
We are competing with someone ? --Ivan Štambuk 00:57, 9 December 2009 (UTC)
Like all living things in this universe. DCDuring TALK 02:23, 9 December 2009 (UTC)
In other words: we are not competing with anyone, and you're simply using a blank and invalid argument. --Ivan Štambuk 02:38, 9 December 2009 (UTC)
I responded in kind to your vacuous question. DCDuring TALK 12:54, 9 December 2009 (UTC)
My question was hardly "vacuous". You gave a serious answer - that apparently inclusion of toponyms somehow "degrades" long-term goals of Wiktionary, as if we're effectively competing with commercial dictionaries. That is hardly the case. Wiktionary and all of the other Wikimedia projects are based on free, volunteering effort, and even tho certain goals may be more "desirable" from someone's perspective than some others based on the utility for the end-users (e.g. coverage of "big" and "important" languages as opposed to smaller and "less significant" ones), forcing contributors not to contribute valuable content is principally against Wiki principles of collaboration and self-managed creation of content. --Ivan Štambuk 14:54, 9 December 2009 (UTC)
It is gratifying to see you lay out this set of principles. My and your attempts to persuade are part of the "managed" part of self-managed creation of content. Collaboration among ourselves to compete with others is something I hardly can object to. I can think of at least a small number of "smallish" languages whose contributors I have tried to encourage. To prevent further OT divergence I have opened a heading for #Competition. DCDuring TALK 15:44, 9 December 2009 (UTC)
In the sense that we aim to provide useful information to users and there are other sites that offer similar content, yes we are competing. If we don't try and address the needs of the user, and DCDuring thinks there are more important ways we can do this than working on toponyms, then all of this is intellectual masturbation. --Bequw → ¢ • τ 14:25, 9 December 2009 (UTC)
This is a volunteer project, no one is getting paid to do anything, and contributors are free to contribute in any domains they like. It's preposterous to think of us "competing" with somebody. At this tempo Wiktionary won't reach the quality and coverage level of comprehensive English dictionaries in at least 5 years. I'm sure that DCDuring would like it to be sooner, but this kind of exclusivity fascism will not "force" anyone to contribute in the direction they don't feel like contributing, and DCDuring and his English-focused friends would. Toponyms are hardly an "intellectual masturbation" - their study is a well-established discipline in lexicography and historical linguistics. There is absolutely no reason why Wiktionary couldn't function as a dictionary of toponyms. --Ivan Štambuk 14:54, 9 December 2009 (UTC)
To prevent further departure of this discussion from its original subject I have started a new header #Competition. DCDuring TALK 15:54, 9 December 2009 (UTC)
Let me get this straight: You're basically saying:
  1. If we allow toponyms as entries we'd enter "a new arena of competition" that would somehow degrade the quality of "normal" (non-toponymic) entries?
  2. It is desirable to explicitly forbid creation of certain type of content if that measure would "force" contributors to focus on domains that are from someone's perspective more "important" as long-term goals ? --Ivan Štambuk 15:45, 9 December 2009 (UTC)
Instead of achieving pre-eminence in one field, which is very possible, we will remain the fifth best free online English dictionary. DCDuring TALK 15:54, 9 December 2009 (UTC)
And I see absolutely nothing wrong with that. We'll get to the quality of commercial dictionaries sooner or later (much sooner than Wikipedia will reach the quality of commercial encyclopedias). This is a free dictionary and explicitly forbidding certain type of content is against the tenets of free knowledge that the Wikimedia Foundations cherishes. It's all about choice. --Ivan Štambuk 16:02, 9 December 2009 (UTC)
their definitions are quite difficult to nail down with much accuracy without resorting to inclusion of encyclopedic content --Ceyockey 01:15, 9 December 2009 (UTC)
Can you give a few example toponyms whose definitions are "difficult to nail down"? --Ivan Štambuk 01:21, 9 December 2009 (UTC)
This appears to be addressed adequately below without my providing redundant input. --Ceyockey 03:58, 10 December 2009 (UTC)
Where below? Can you list a few such examples here? --Ivan Štambuk 13:17, 10 December 2009 (UTC)
How do you say "of or pertaining to Moscow" in Russian? Or "woman citizen of Moscow"? How do you pronounce it? What is the etymology of that word? How does it inflect? What is the translation of that word in languages that don't have Wikipedia articles on Moscow (or worse, don't even have Wikipedias at all)? Countless lexicographically relevant information can and should be included. The problem is not whether to allow toponyms or not, but to set the lowest bar of criteria for their inclusion. --Ivan Štambuk 01:03, 9 December 2009 (UTC)
Wiktionary cannot do justice to the special needs of placenames: special data structures, special data, maps, photos, without doing great violence to its existing content and seriously challenging its technical resources. DCDuring TALK 02:23, 9 December 2009 (UTC)
Not a good excuse, DCDuring, Wikipedia can do these things much better. Place names dictionaries don't have to do this. The linguistic information is all that's required - meaning (minimal info), spelling, grammar (gender, declensions, etc), pronunciation, etymology, translations, alternative names/spellings if I haven't missed anything. Anatoli 02:29, 9 December 2009 (UTC)
What special data structures? Photos would be nice (at most 1), maps are of no use (though linking to google maps or similar external resources in ====External links==== should be allowed). Absolutely everything else would simply follow the normal layout of WT:ELE. --Ivan Štambuk 02:38, 9 December 2009 (UTC)
Coordinates, borders, overlapping borders, changing borders. Without some geographic information this is of minimal value, especially to users in the host language. With geographic information, it will start and remain far behind WP, Google Earth, etc. DCDuring TALK 12:54, 9 December 2009 (UTC)
Coordinates can be trivially linked to in ====External links==== (they're static). Borders and other geographic information that you mention are completely irrelevant for our cause: this is a dictionary and we only focus on lexicographically relevant content. Users who want to find out how to pronounce, inflect, translate X, what is the demonym or relative adjective of X, or the etymology of it, would look it up in a dictionary. If they're interested in X's climate, population and industry - they'd look it up in an encyclopedia. You're really exaggerating when you claim that this is "of minimal value" - this type of information we could provide Wikipedia normally does not provide at all, and neither do Google Earth and others. We are not interested in providing the type of content they provide at all. --Ivan Štambuk 15:16, 9 December 2009 (UTC)
Yes, and in my opinion, Google Maps could be used to confirm the existence of the toponym. I don't see any difference in checking out the existence of place names from other words. Like with any word human errors are possible but a simple check is easy for people worrying about the integrity of entries. Real and significant names are easy to check, especially in English. Anatoli 02:45, 9 December 2009 (UTC)
Services such as Google Maps aren't "published" in the traditional sense. They can and do change the info in their mapping database. How then could we properly cite & reference this changing medium? --Bequw → ¢ • τ 14:51, 9 December 2009 (UTC)
Coordinates are static aren't they? We could possibly embed them in some kind of template that will generate link to several online Maps services (Google Maps, Bing Maps etc.) This is really something completely optional. --Ivan Štambuk 15:22, 9 December 2009 (UTC)
Clarifying placename issues — AEL
(a) they are traditionally excluded from dictionaries; only few place names are included by them if any;
(b) there are overwhelmingly many place names; (bi) they overflood the random-page function, and (bii) they overflood the next-page and previous-page functions, analogous to browsing a printed dictionary page by page;
(c) place names are not really a part of the vocabulary of a language; their knowledge is not needed for understanding of texts. I am not sure how valid these reasons are, though. --Dan Polansky 09:08, 9 December 2009 (UTC)
Though there are many advocates-in-principle of WikiGazetteer entries there seem to be no advocates-in-action. IOW, lots of talk, no work. If no one can be found to even put forth a proposal that anticipates and answers practical objections, why should we expect this project to be anything other than a waste of time on a scale vastly larger than these tedious discussions? DCDuring TALK 12:54, 9 December 2009 (UTC)
I'm shocked by this comment DCDuring. This whole discussion started after one of our editors did actual work, and complained after the entry was mercilessly deleted by one of the admins strictly abiding by the defective policy that basically prohibits 99.999% of world's toponyms. It's important to settle disputed points (which apparently range from "placenames are not words" to "placenames are worthless from lexicographical viewpoint") first before making any kind of formal proposal. The purpose of policy pages is not to settle objections but to codify established consensus. --Ivan Štambuk 15:30, 9 December 2009 (UTC)
We have a never-ending stream of vandals who do that kind of "work": adding entries or other material that doesn't meet CFI. The true work that has to be done is to make a proposal that defines the change to be made and anticipates and addresses the issues that will be faced once CFI is relaxed to include the entries that ought to be in a Wikigazetteer project. What you define as "work" seems to be whatever someone wants to enter as long as it is properly formatted, without regard to any policy, guidelines or practices concerning inclusion. The intent of the existing rules is to exclude most toponyms. It is not accidental.
The resort to pejorative labels makes this discussion more like some kind of public demonstration rather than something practical. DCDuring TALK 16:25, 9 December 2009 (UTC)
I've said more than once that I'd personally lock this project to registered users only. 99% of IP-generated content is either vandalism or so badly-formatted that it needs someone's attention to the point that the entry was better created from scratch. And the vandalism argument is not particularly strong IMHO: very rarely are the IP-generated entries toponyms. It is highly unlikely that there is a stream of some vandals out there that is eager to create thousands of low-quality toponym entries that would cause havoc on RfV. It is much more likely that it would stay confined to a group of dedicated regulars who are primarily interested in that type of content. Like we have Makaokalani and Alasdair for personal names.
Anyway, as I said, the only thing that I see that needs to be done, judging from this discussion, is raising the bar a bit higher, so that we evade mass creation of stubbish entries. --Ivan Štambuk 17:08, 9 December 2009 (UTC)
As we have no intention at present of locking the project to non-registered users, I suppose that the WikiGazetteer proposal could wait until that time.
It is specious to use the behavior of users now to suggest their behavior in the future in such a way. If users find that we have place names, they will be more likely to look up their own favorite places and, finding them missing, add them. These are just the kind of entries and users that we need, I suppose, to successfully achieve high coverage of place names.
I eagerly await the arrival of the users interested in that kind of content. Do we have three from existing active contributors? Are they willing to do the real work of making a credible proposal that anticipates and addresses problems and objections. Or is there just going to be more whining, blathering, and fencing. Fencing can be a fun diversion, but it doesn't accomplish much. Why doesn't someone who wants this start a project page and get this show on the road? DCDuring TALK 18:25, 9 December 2009 (UTC)
It is specious to use the behavior of users now to suggest their behavior in the future in such a way - no, it's perfectly reasonable and common-sense. My prediction is based on the observation of empirical data, yours on some irrational apocalyptic scenario that has no foundation in the actual history of IP edits.
As I said below, we could explicitly request that all toponym entries be created with citations or some kind of additional value. That would significantly throttle "creation only for the purpose of creation". No we're not interested in coverage of world's toponyms at all - we're interested in the coverage of lexicographically relevant data. Entry such as ==English== ===Noun==={{en-proper-noun}} # City in Southern Arizona is basically worthless. But, if it includes some kind of information that a dictionary is interested in - that's entirely different thing. Once again, we are not interested in defining toponyms (as you cannot "define" onomastics): we're interested in including absolutely everything else.
We already have at least 1 user that happens to be quite interested in placenames: Anatoli who initiated this discussion in the first place. How many are there is irrelevant to this discussion. Feedback is still being actively gathered and discussed. There is absolutely no hurry as we have 117 billion years ahead of us (that's how long universe is going to last). --Ivan Štambuk 20:01, 9 December 2009 (UTC)
It seems (and I hope I'm not over simplifying too much) that those for toponyms are arguing that the only real criterion that should be used for inclusion is whether or not it (a toponym in this case) is a word. Taking that argument ad absurdum, would they be for the inclusion of any person's name (eg Ben Affleck) as well? People's names seem to share all the same dictionary needs as toponyms: they have translations (some are transliterated, some go to the nearest cognate, and some are reproduced exactly in the original script), etymologies ("we named him after ..."), inflections (in some languages), they are proper nouns, and there are mononymous and polynymous members. I'm definitely against including non-attributive senses of people's names, so I'm suspicious of the "for toponym" arguments. If some people are for toponyms but against individual names, how then should the criteria for inclusion be expanded beyond the simple "is it an (unidiomatic & attested) word/term" test? --Bequw → ¢ • τ 14:47, 9 December 2009 (UTC)
Translations, pronunciations and inflections of personal names are almost always equal to translations, pronunciations and inflections of individual component parts combined, which we do allow as entries. I wouldn't really consider a factoid after whom sb was named that name's "etymology" - it's a result of non-linguistic, external sources that are not the topic of a dictionary. Except if that is somehow connected with the word's meaning (which usually is not in case of personal names, except in nicknames). Our inclusion criteria for toponyms (and all of onomastics in general) should be expanded to allow all toponyms in all languages, every village, hill, river, lake, mountain peak on Earth and elsewhere: every Martian canyon and Moon valley, asteroid or galaxy. We only need to agree what is the lower limit of quality for their creation, in order to only have quality content and not thousands of bot-generated stubs that are worthless (like Wikipedia does). Something like "at least one translation, link to Wikipedia, and coordinates linked to in Google Maps". --Ivan Štambuk 15:10, 9 December 2009 (UTC)
Hear, hear. --Vahagn Petrosyan 15:50, 9 December 2009 (UTC)
While this has its ground, Ivan, please take into account the demand for 3 durably archived quotations, which the entry should provide, if demanded. If I create entries for hills and rocks in the vicinity of the village of my grandparents, e. g. Голо бърдо (not that from the western part of the Macedonia region) or Черната скала, they most certainly would fail such a procedure, even under Bulgarian headers (not to speak about English). In my opinion, it would be acceptable to allow such entries, if they are attestable by three quotations (though I am not favourable of them being prædicative), thus making no præcedent in current policies. Why not set a limit of, say, 10 000 inhabitants? Or even 10 inhabitants, if you will, some threshold is indispensable. There is no use in accepting defunct small settlements (I am not talking about Pompeii, but about El'ginski for example). I personally like your point of view, I myself am an ardent supporter of Aut Caesar, aut nihil, but let this comply with the extant policies. The uſer hight Bogorm converſation 16:17, 9 December 2009 (UTC)
Mandatory requirement of 3 citations when creating an entry could also serve as a reasonable alternative to prevent automatic creation of stubbish entries with little or no lexicographical value. Population as a parameter is a very bad choice, because there are some villages that are depopulated or barely populated today but have centuries or even millennia old history. In what external way (e.g. population, economy, historical significance etc.) a certain toponym is "important" shouldn't be an argument: that's the prominence type of inclusion criteria used for an encyclopedia. We don't care about the properties of real-world objects that the toponyms refer to, simply because we're not interested in defining them at all (it is arguable whether onomastics terms can be defined at all, in the sense of normal gloss-definitions that we usually provide). In that respect, all toponyms are equally important. The more "important" and "less important" toponyms could have equally thorough entries and equally minimal definitions. The "important" toponyms are likely to get more attention and eventually rise in quality, and we should simply allow that process to occur spontaneously. --Ivan Štambuk 16:54, 9 December 2009 (UTC)
But Ivan, your reasoning just separates multi-word proper nouns from single-word proper nouns. Your comment amounts to saying that usually people's names are polynymous, and the constituent words are usually already in the dictionary. But, by applying that rule to places as well, you'd include "Moscow" (one word), but not "New York". I don't think that's the inclusion criterion you're looking for. Why should toponyms as a class of words be treated differently than people's names? --Bequw → ¢ • τ 19:22, 9 December 2009 (UTC)
No, it's impossible to e.g. translate "New York" to FL by translating "New" + "York". In most languages it's actually some kind of counter-intuitive phonetically adapted form that regularly needs to be learned. Or perhaps it is some kind of a calque, or possibly even some completely indigenous term (although that is very unlikely in case of New York. Perhaps in some Indian language or sth). In case of "John Doe" translation would literally amount to "John" + "Doe".
Also, I don't really see the point of drawing comparisons with polynymous personal names. We are not discussing their inclusion at all (and nobody wants them included anyway). Place names are special and distinct category and we should focus only on them. --Ivan Štambuk 20:28, 9 December 2009 (UTC)

OK, so say I'm reading a historical fiction book and come across a placename, say Ouagadougou. I get interested in the name and want to find information on a) how to say it, b) where it derived from, c) what it is, anyway, and d) how to say it in Spanish, the language that I'm learning at the moment. I don't want the Wikipedia article about the city itself, I want a dictionary entry which includes etymology, pronunciation, definition, translations, perhaps a map and a link to Wikipedia for information on the city itself. I look it up in Webster's Third, which gives pronunciation but really no other helpful information. Then I turn to Wiktionary, and find Ouagadougou with pronunciation, a definition and translations. Not perfect, but it does contain the information I was looking for and not much extra.
OK so that didn't really happen, but it could, and that would be why we would include such information. Right? Or did I misunderstand this whole discussion? L☺g☺maniac ☃ 16:34, 9 December 2009 (UTC)

Comment. For translations to be useful, it's not enough to include place-names; we have to include actual places, indexed by name, like a Wikipedia disambiguation page. For example, consider the place-name Paris. I imagine that most modern languages have a name for the capital of France; but do they all use that name when referring to Paris, Texas? Now, there's no intrinsic reason that we can't include places — that's what Anatoli has been pushing for — but before we make that leap, I think we should pause to consider whether we really want to do that. That's a lot further than we go with given names and surnames. (At least, it's further than we're supposed to go, according to the CFI. In practice, we do currently include a lot of specific people, just as we do currently include a lot of specific places.) —RuakhTALK 17:59, 9 December 2009 (UTC)

Sure. (And don't forget Paris, Missouri! And probably a boatload of others) Maybe then the definition would read "A placename used most commonly to refer to the capital city of France but also to several places in the US and elsewhere" and, if the languages have different words for different places, use maybe two trans-tables: one for Paris, France and one for others with explanations in the table as to which city each term refers to? L☺g☺maniac ☃ 18:12, 9 December 2009 (UTC)
Different toponyms that happen to have the same form in English must be separated. In FL they might have different forms: e.g. French Paris being borrowed from French and American from English. Some languages might have some "native" term, others might not and use some international that happens to coincide with some other placename. There are countless possibilities. To me the best would seem not to separate them in the definition lines, but in different subsections altogether. --Ivan Štambuk 19:47, 9 December 2009 (UTC)
Who said anything about "different toponyms that happen to have the same form in English"? Plenty of American cities are named after European cities — same toponym, no "happen"-ing at all. —RuakhTALK 20:25, 9 December 2009 (UTC)
Yes, toponyms may have several senses, derived from each other, just like other words. And linguistic info about these senses (pronunciation, demonyms, etc.) may be the same or different. Lmaltier 22:45, 9 December 2009 (UTC)
Yup, we agree. —RuakhTALK 04:35, 10 December 2009 (UTC)
That's a good point, Ruakh, that including placenames as names and including placenames as referring to specific places are two wholly different things. That was the idea behind the vote I proposed in August: I thought that names as names, at least, could be agreed on. (I was wrong, apparently, as the vote stalled due to opposition.) But the difference between names as bare names and names as referring to specific places is one that's not always made, leading to discussion at cross purposes. Specifically, I don't think everyone taking part in this discussion is talking about the same thing.—msh210 23:34, 9 December 2009 (UTC)

Comment. Just to qualify an "afore-made" comment, whereby someone said at least 1 person (Anatoli) was interested in this matter. There are at least two others, making for a total of three.

Allow me to explain separately:

First, there is myself; last year I created entries for two toponyms (each specifying a place in a different country) : Tarica and Nkinora. Sure, I did create them for the Christmas Competition, but there was also the underlying, ulterior motive of actually making a nice, little, new entry to add to Wiktionary's "menagerie". ;-) Sadly though, they were later obliterated after allegedly failing RFV, no doubt due to certain (IMO) half-assed parts of the deformed chimera that is CFI... N.B. these in anyone's eyes were surely not the worst thing to emerge in the name of someone who wanted to win the competition. I remember the undesirable and waaay too uncommon relatives of T. rex and E. coli all too well ;P
Secondly, there is SB. I cannot cite any specific place but I know that in the discussion of votes on matters like this and whatnot, he has called himself an inclusionist and thus pushed (if only slightly) for the inclusion of stuff like this. 50 Xylophone Players talk 22:28, 9 December 2009 (UTC)

Comment. The "attributive use" condition strikes me as possibly causing some somewhat strange effects: say that "Venice" is verified as being used attributively, so that the page may exist (rather: the 'city' sense of that entry). But may we then add translations of that word (in particular the city definition) without first checking that the translation passes the attribution test? Or should translations to language A (which never uses its word for "Venice" in an attributive manner) be unlinked? Or perpetually red? Also, I can't say I ever understood why attributive use would be relevant as a condition. CFI mentions "New York", and that it's included because of the existence of terms such as "New York delicatessen". Okay, I can see why that could motivate us to include the adjective. But why would the adjective motivate the proper noun? Why an all-or-nothing situation? Why either both adjective and proper noun, or none? Sorry, but I simply don't understand why the presence of an adjective is relevant to the presence of a proper noun. \Mike 23:21, 9 December 2009 (UTC)

Many dictionaries have similar restrictions on entries for people and places. The OED for instance only includes them if they are used attributively, possessively (eg Foucault's pendulum), figuratively, or allusively (to meet one's Waterloo). It's general practice to note the geographic/biographic referent and then explain the meanings that extend beyond that referent. --Bequw → ¢ • τ 00:00, 10 December 2009 (UTC)
This is because the OED does not want to include all words, its option is the traditional one in language dictionaries: excluding proper nouns (except when there is a specific reason to include them). But other language dictionaries are specialized in first names, other ones in surnames, other ones in toponym etymologies. We have no reason to specifically exclude placenames (no space limit). Including all words including toponyms might seem an issue for the random page feature, but this feature is not for those looking for information. Lmaltier 07:10, 10 December 2009 (UTC)
Lmaltier is correct: comparisons with traditional monolingual dictionaries are pointless and misleading. We are a multilingual multi-purpose dictionary, and there is no reason why we couldn't also function as a dictionary of onomastics (personal names, toponyms, *nyms of all kinds). These themselves already have their own specialized dictionaries, and there is a plethora of lexicographically relevant data that we can build on and integrate into "normal" entries. --Ivan Štambuk 13:07, 10 December 2009 (UTC)
On self-management and self-allocation of contributors as resources: Like some other editors and contributors above, I think each contributor should manage himself as a human resource rather than being managed by the collective of editors. If a person wants to allocate his scarce resources including attention and time to contributing toponyms, he should have the option. The only question should be whether that person's lexicographical toponym-documentation activity should be channeled to Wiktionary or to a dedicated Wikigazetteer project. Either way, the person is going to be spending part of their time and attention outside of English definitions. --Dan Polansky 11:17, 10 December 2009 (UTC)
To me the question is not simply about the time and enthusiasm of those who want toponyms, it is also with the loss of focus of the project as a whole. What technical resources we have will be spread yet thinner. Those who have the knowledge and experience to integrate a new class of entries into Wiktionary will find more and more demands on their time. There are likely to be a steady stream of compromises and confusions about the policies, guidelines and practices applicable to various classes of entries, as should already be evident in the discussion of attestation.
I think Wiktionary could play a role as an incubator for a portion of the content of Wikigazetteer. But for Wikigazetteer to be in any way limited by the structure, policies, guidelines, practices, habits, reputation, and volunteer base of Wiktionary or Wikipedia seems silly for an enterprise of such promise. Wiktionary is having difficulty in getting even Wikisaurus, a subproject close to the core of the function of project, to a level of coverage and use approaching that of the main dictionary. DCDuring TALK 11:56, 10 December 2009 (UTC)
To me the question is not simply about the time and enthusiasm of those who want toponyms - the question should be only about that. Absolutely everything else is irrelevant.
it is also with the loss of focus of the project as a whole - Again, there is no such thing as "project focus" that you speak about. People are free to contribute whatever interest them, whenever they want. There is no pan-project management and strategy. Common interest groups ("wikiprojects") and "could you help me with this"-type of wiki-friendships arise completely spontaneously.
Those who have the knowledge and experience to integrate a new class of entries into Wiktionary will find more and more demands on their time. - that is their problem not yours. We already happen to have people creating valuable toponymic information that is being deleted under the absurd CFI policy. That is much more relevant problem than any of your could-be scenarios, which I personally find very far-fetched. Who are we to forbid them to contribute what they like in their free time?
There are likely to be a steady stream of compromises and confusions about the policies, guidelines and practices applicable to various classes of entries, as should already be evident in the discussion of attestation. - Nothing more problematic than what has been done with "normal" non-onomastics entries. Again you're making ominous predictions without empirical data to substantiate them.
Wiktionary is having difficulty in getting even Wikisaurus, a subproject close to the core of the function of project, to a level of coverage and use approaching that of the main dictionary. - Wikisaurus receives as much attention as is statistical interest in it among Wiktionary contributors. Like every other part of this project. All of them are "having difficulties" as being undermanned and missing a bulk of essential information. But that is of absolutely no concern to the problem of forbidding toponymic entries on Wiktionary and I would be grateful if you finally stopped insinuating causal relationships between the two. --Ivan Štambuk 13:02, 10 December 2009 (UTC)

Active discussions

Richardb suggested that Wiktionary should have an "Active discussions" category a little while ago, which I thought would be really useful, so I made the {{discussion}} template, which, when placed on a discussion page, will (hopefully) place the discussion page in Category:Active discussions (not made yet, waiting to see if anyone proves the whole thing impossible) if the page has been edited in the past three days. The category will list the pages in order of how recently they were edited (I think).

Any comments, suggestion, criticism, proof that it will never work and is entirely impossible and/or ridiculous? (I've actually never had something like this work on the first try so it's pretty unlikely that it will work properly.) Could someone please try to review the template to see if I messed up before it's put on talk pages? --Yair rand 06:58, 7 December 2009 (UTC)

This category was deleted (by me) in 2007, along with {{active discussions}}. It does not work: it requires people to manually add it, manually remove it, and to manually look into the category, and it doesn't allow people to choose which discussions are relevant to them. The watchlist is a much saner way of doing this, and has been proven to work. Because of the way pages are cached, the template will only be re-processed when the page is edited, and so all pages it is added to will appear in the category sorted under 0. (The previous template just added the category). Conrad.Irwin 13:57, 7 December 2009 (UTC)
You mean it will stay in the category permanently?--Yair rand 16:32, 7 December 2009 (UTC)
Okay, how about this: a template that when subst'd will replace itself with the same template using subst'd nosubst ... that would subst REVISIONTIMESTAMP as the first parameter ... inside noincluded comment tags ... and will show parameter one ifeq:PAGENAME|active discussions ... with the rest of the page inside noinclude ... and the active discussion page could take the revision timestamp and calculate it there and show it if it's less than three days. Does that make any sense? --Yair rand 17:44, 7 December 2009 (UTC)
That only solves manual removal, not manual addition, nor classification of relevantness. It will also still only remove the page when someone else edits it (i.e. the chances are someone will find the discussion in the category, go and reply to it, and thus remove it from the category despite the fact that the discussion is renewed). Conrad.Irwin 18:28, 7 December 2009 (UTC)
If you have something like {{#ifexpr:{{#time:U|{{CURRENTTIMESTAMP}}}}-{{#time:U|{{Wiktionary talk:Wikisaurus}}}}<259200|[[Wiktionary talk:Wikisaurus]]}} on the active discussions page it still won't update automatically? --Yair rand 18:34, 7 December 2009 (UTC)
No, you can test at WT:SAND, which is 20091207201652. Conrad.Irwin 20:17, 7 December 2009 (UTC)
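The age check in the #ifexpr above is plain timestamp arithmetic, and it can be sketched outside the wiki to make the logic concrete. The following Python is a minimal illustration only, not anything that exists on Wiktionary: the function names are hypothetical, and it assumes timestamps in the 14-digit form quoted above (such as 20091207201652) interpreted as UTC. A script like this would have to be run by something external, such as a bot, since the caching problem described above applies to templates evaluated on the page itself.

```python
from datetime import datetime, timezone

# 259200 seconds = 3 days, the same constant used in the #ifexpr above.
THREE_DAYS = 259200


def parse_mw_timestamp(ts: str) -> datetime:
    """Parse a 14-digit timestamp (YYYYMMDDHHMMSS), assumed to be UTC."""
    return datetime.strptime(ts, "%Y%m%d%H%M%S").replace(tzinfo=timezone.utc)


def is_active(last_edit: str, now: str, window: int = THREE_DAYS) -> bool:
    """Return True if last_edit falls within `window` seconds before `now`."""
    delta = parse_mw_timestamp(now) - parse_mw_timestamp(last_edit)
    return 0 <= delta.total_seconds() < window


if __name__ == "__main__":
    # A page last edited at the WT:SAND timestamp quoted above,
    # checked a few hours later and then several days later.
    print(is_active("20091207201652", "20091208000000"))
    print(is_active("20091207201652", "20091211000000"))
```

Passing the current time in explicitly, rather than reading the clock inside the function, keeps the check repeatable when testing.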

Reporting ABUSE OF ADMINISTRATIVE POWER

In response to Block ID: 53325 initiated by Mglovesfun <http://en.wiktionary.org/wiki/User:Mglovesfun> .

After attempting to ADD CITATIONS to an entry WITHOUT ANY, I've been completely removed from your website. I can't even log on and edit the entry on dysafferentation that I STARTED AND CITED with my original username. It is not just a chiropractic word, and that misleading bit needs to be removed; but I can't.

I am sloppy, and editing my entries must be difficult and annoying to administrators. For that I apologize.

I study in Missouri, USA. I am earning a doctorate at a fully accredited institution with several hundred formal education hours under my belt. I know a lot about the history and use of the word subluxation as it is used everyday in the USA and around the world. I am aware of the controversy and disagreement within even the Chiropractic profession surrounding the word. So I attempted to begin to add bit by bit, every reliable source defining this word SPOKEN EVERY DAY FOR NEARLY 100 YEARS.

When the entry was changed back WITHOUT ANY EXPLANATION, I edited it again, with good intentions, using DIFFERENT AND NEW INFORMATION, and told those who disagree to read more of the scientific literature on the topic.

Originating in mid America, chiropractic is a legitimate scientific profession. I understand it is historically primarily an American phenomenon; just you wait. My entry may not apply in France where McFunGlove lives and studies, I don't know. But I didn't find any affiliation with the French Chiropractic Association on McFunGlove's user page. He left me without any way of contacting him.

An updated definition for subluxation is needed. Chiropractic may not be as big as the most trusted and used method of healthcare in the world, TRADITIONAL CHINESE MEDICINE. However, there are thousands of chiropractors with millions of benefactors who say the word SUBLUXATION multiple times a day. AND THEY DON'T MEAN "LESS THAN A DISLOCATION." If allowed to debate with FunGlove, we could perhaps have agreed on a new entry "chiropractic subluxation complex;" But nobody except maybe 1st year chiropractic students would say it that way.

I was called STUPIDLY STUBBORN, but McFunGlove gave no citation or even an adult explanation!

He explained "This is a dictionary. We do definitions, we don't do biographies or miscellaneous information" Too busy to read...

1. A publication, usually a book, with a list of words from one or more languages, normally ordered alphabetically and explaining each word's meaning and sometimes containing information on its etymology, usage, translations, and other data.

AS DEFINED BY WIKIPEDIA! "and sometimes containing information on its etymology, usage, translations, and other data."

I tried to make the connection between the French luxation leading to a word commonly used to mean "less than a dislocation." (Etymology) DENIED

I tried to make the connection between the Latin lux leading to a DIFFERENT word literally meaning "less light" (Etymology) and connotatively used to mean "a condition of less life" (Usage everyday in America, found in online advertising for chiropractic care) DENIED

So I got the message. And changed my approach.

I tried to add the historical usage as definition by the founder and by the developer of Chiropractic, with four requirements for a subluxation. (Other Data)

I tried to stimulate further study by adding that the osteopathic term "somatic dysfunction" describes a subluxation complex (Other Data) (Dr. Still, also part of mid American history and who collaborated with Dr. Palmer)

FOR THAT I WAS BLOCKED BY ONE PERSON, Martin Richard Gardner.

Somatic dysfunction is in the current literature and not on your website! I'd add it if I could.

I used recent sources and intended on collecting the newest agreement among chiropractic colleges in America as well as agreement among current chiropractic physicians. Including a recent poll of chiropractors, the majority of whom agreed they wanted to keep the word "subluxation."

In my explanation for my edit I stressed the importance and need for more multidisciplinary sources. THERE WEREN'T ANY before I came along. I TRIED WITH GOOD INTENTION TO JOIN YOUR ONLINE COMMUNITY. Until further notice I will understand I am unwanted here. YOUR LOSS. I just wasted some time, but it was worth it. Now I know to STOP DEFENDING AND RECOMMENDING WIKTIONARY to my peers.

I must add, in case anybody is blind to the fact, that a large population of humanoids are living in the dark ages; for lack of resources, time, ability, or EFFORT. I used to view Wiktipedia as a part of the solution. I was wrong.

THE ONLY CLINICAL ENTITY THAT CHIROPRACTIC CLAIMS TO TREAT IS SUBLUXATION! I'm not talking about partial dislocations. As a student I am hinting at fixations causing dysafferentation and therefore contributing to cord sensitization, neurogenic inflammation, and sympatheticotonia. Most importantly, as a future practitioner of straight chiropracTIC, I am thinking about how to remove a stranglehold on your brainstem.

WHAT A HEAVY, LOADED WORD! And McFunGlove would have you read one fragment about it.

MUCH LOVE,

AND READ THE LITERATURE!

To repeat what MG said on your talk page, this is a dictionary. We do definitions, we don't do biographies or miscellaneous information. The content you added was not appropriate for Wiktionary. And please don't type in capital letters so much. --Yair rand 22:07, 7 December 2009 (UTC)
  • I understand where you are coming from, typing-in-caps-supposedly-no-name-user. However that first usage note simply didn't make any sense, and the historical stuff you added after that was way, way too detailed. Wiktionary is supposed to help the average person understand how words are used - not give detailed linguistic, socio-cultural, professional and/or historical information. Although I'm not directly involved in this dispute, I can see how perhaps Mglovesfun could have communicated this to you better. However, it would appear he was merely following Wiktionary's guidelines. Tooironic 22:26, 7 December 2009 (UTC)
    Indeed. Long explanations that go into detail are for Wikipedia. Your contribution seemed to talk a lot about one specific person, which I thought might have been yourself as we get quite a lot of promotional material on here. If I had nothing to do on here but reply to contributors, my reply would have been much better. But instead, I created and checked some entries. Mglovesfun (talk) 22:29, 7 December 2009 (UTC)

Placement of example media

I am unsure of where on entries to put "example" media:

  • Sounds (not pronunciations): Where would I put an example sound file of a laugh at laugh?
  • Images: How do we put more than one image per Part-of-Speech without making the entry look too cluttered? One attempt at A#See also 3 puts all the images in an image gallery in a =See also= section, but I don't think that's what the header is really for.
  • Text (not quotations): Where should we put "The ball was kicked by Fred" on passive voice? We currently use {{examples-right}} a bit, but it's right-aligned (most real content should be at least able to be put inline), and makes some pages look weird, such as prepositional phrase.

As these all deal with the actual referents rather than the word, how would people be disposed to adding another heading to WT:ELE, such as =Examples=, or something better worded? There would of course be formatting to determine and placement (under each PoS or Etymology?). --Bequw → ¢ • τ 22:56, 7 December 2009 (UTC)

I am in favour of this, but also couldn't think of an amazing title for the section. Examples is probably good enough (at level 4). Conrad.Irwin 22:59, 7 December 2009 (UTC)
If I can remember it, I found an article where I checked the French from T-bot, but with the image and {{wikipedia}} notices, the English section interfered with the French. It should be in my last 500 edits, so I will look tomorrow when I wake up. Mglovesfun (talk) 23:03, 7 December 2009 (UTC)
The idea of a =Examples= header sounds good to me. L☺g☺maniac ☃ 23:12, 7 December 2009 (UTC)
Yes, sounds good.​—msh210 17:45, 8 December 2009 (UTC)
The additional sound file may interfere with the pronunciation audio when the index is generated. I had a sound file for cselló in the See also section and it was picked up as the pronunciation in the index. This entry does not have an audio file. --Panda10 23:24, 7 December 2009 (UTC)
I should fix that, and only look for audio templates under ===pronunciation=== ? Conrad.Irwin 02:55, 8 December 2009 (UTC)
Part of the rationale for {{examples-right}} is that its rhs positioning conserves vertical screen space. Images and examples both seem to fit into the same broad class of non-core content. I don't know how non-human users of our content handle such material but it is safe to assume that human users like to see useful content at the first screen they land on and some direction to what might lie below the fold. If examples have good content that illustrates a particular sense, then we should have the ability to place that content as close as practical to that sense, without radically disrupting user habits and expectations. I don't really see how a separate header accomplishes that better than our current non-standardized approach. DCDuring TALK 00:22, 8 December 2009 (UTC)
And many times that right-hand side layout is perfectly fine. We have decent ways of adding an image or a text example to the right-hand side and we should find a way to do audio like that as well. But my question is, what do we do when that layout looks bad, for instance when we get more than a couple media for a PoS? If there are too many quotations for a sense, some go in a quotation section. I think we should have the same facility for examples, and the decision to use the subheading would of course be on an article by article basis. --Bequw → ¢ • τ 00:51, 8 December 2009 (UTC)
It just occurred to me that we have had galleries of images in entries. See head. No header to set it off, just <gallery> HTML. DCDuring TALK 02:17, 8 December 2009 (UTC)
Technically, everything is under some heading (unless it's before the first L2 like {{also}}). In this case, the image gallery is under a =Quotation= heading, which seems just about as right/wrong as what was going on with A#See also 3. --Bequw → ¢ • τ 02:41, 8 December 2009 (UTC)
I've yet to see {{examples-right}} in a context it didn't look cluttered. It's wider than the images which makes it look out of place, coupled with the margins being wrong and the yellow bizarre, it's a truly dreadful effect. For audio examples, it is possible they will fit in a side-audio template like a sister-project box, but I'm not a huge fan of those either. Conrad.Irwin 02:55, 8 December 2009 (UTC)
Ruakh admittedly just cobbled something together, for which I am grateful. Now that it has been used a bit we have a population of situations that such a thing should handle and we can emend, amend, or replace it. We don't have so many uses that we couldn't virtually start over. The side-bar approach is widely used in contemporary textbook publishing as well as on the Web. We would probably benefit from looking at the approaches that have gained traction on the Web on sites that are comparable from a user PoV. DCDuring TALK 12:34, 8 December 2009 (UTC)
I have to admit that most of my previous complaints were removed by forcing the width to be like a box (though there are still some pages that explicitly set the width of it to 50%, yuck!) and by removing the right floating TOC stuff from my monobook.css (I'd forgotten that wasn't turned on for everyone). Conrad.Irwin 12:43, 8 December 2009 (UTC)
There are cases where the examples are long, too long for the default width. Whatever the weaknesses of the sidebar approach, having text examples (of figures of rhetoric or of any linguistic phenomena) appear unmarked and in-line confuses them with usage examples. That was and is the only essential point with respect to textual examples. Achieving consistency of presentation with other classes of example media seems desirable, as does conserving vertical screen space and some proximity between definition and example. The head example shows that we can't always get even three of the desiderata, but long entries are multiply problematic anyway. DCDuring TALK 13:00, 8 December 2009 (UTC)
I tried putting {{audio}} in {{examples-right}} on laugh. Works OK. --Bequw → ¢ • τ 01:12, 9 December 2009 (UTC)
Glad it worked. At least it's a good second step, now fairly well advertised. DCDuring TALK 02:29, 9 December 2009 (UTC)

Started Wiktionary:Votes/pl-2009-12/Addition of Examples header to ELE

Could we perhaps use "Illustrations" rather than "Examples", as a more general term? -- Visviva 01:36, 20 December 2009 (UTC)
That might look weird with example sound files. What about "Samples" or "Representations"? --Bequw → ¢ • τ 09:00, 26 December 2009 (UTC)

Why delete orphaned talk/citations pages?

Per Citations:American-born Chinese. Is there any advantage to deleting this, if so, what is it? At the very least archived deletion/cleanup/verification discussions should not be deleted when the article is. I notice this is in our policies, and I can see that in some case it is appropriate, but in others not. I sometimes feel a bit mean when I delete a legitimate, intelligent discussion because the entry failed RFD. Mglovesfun (talk) 16:05, 8 December 2009 (UTC)

It was copied without thought from Wikipedia. The talk page should be decided on its own merit, often there is nothing useful there, but sometimes there are previous discussions which it seems a shame to delete (that said the chances of anyone refinding them unless they are linked to is quite small). Conrad.Irwin 16:12, 8 December 2009 (UTC)
I'd favor keeping both the talk and citations components by default. They are a bit more findable in the talk and citation spaces than anywhere else I think.
Also citations are sometimes not for the exact spelling (spacing/hyphenation/ligature/diacritic) of the headword. If we are going to have these headwords, then ultimately the citations ought to be sorted appropriately. However, I would not want to burden the closing of RfDs with this. Perhaps we could bot-identify such citations, both in citations pages and in entries. Ultimately, all these fine (and some are not really "fine") distinctions might be worth respecting in full. DCDuring TALK 16:19, 8 December 2009 (UTC)
I agree we should by default keep citations , though for deleted-as-SoP entries they can move to the appropriate parts' citations' pages instead of remaining where they are if anyone wants to bother. (If we already have equally lexicographically interesting citations of all the parts, then the sum's citations can be deleted.) Talk pages are a good place for the RFD/RFV discussion to be archived; if the archivist goes another route than archiving there (we have no fixed process), then the talk page should be kept or deleted depending on content.​—msh210 17:43, 8 December 2009 (UTC) Actually, what Ruakh says below about consensus makes more sense. 19:30, 8 December 2009 (UTC)
We have no formal procedure for archiving, but everyone seems to use WT:PDE for deleted stuff, probably because if it's deleted we delete the talk page too, so it can't go to the talk page. I still think that's somewhat useful as it keeps the terms together by letter. And I suspect nobody is going to bother retrieving the thousand-or-so archived debates and putting them on talk pages, so it will have to stay! Mglovesfun (talk) 17:50, 8 December 2009 (UTC)
Re: "everyone seems to use WT:PDE [] we delete the talk page": Actually, no. In the past thirty days, you're the only one who's touched the PDE pages; and you're also the only person I've seen deleting these talk-pages. So the main problem is that you were doing it a different way from everyone else, destroying other people's efforts. (To be clear, I'm not blaming you; you didn't know better, and you seem to have stopped as soon as you realized. I'm just saying, you've already fixed the problem, and no further discussion is necessary on that subject.) —RuakhTALK 18:34, 8 December 2009 (UTC)
I don't think "orphaned" citations-pages should normally be deleted; after all, the major reason we created that namespace is to keep track of citations for entries that fail RFV or RFD. I don't think that all citations-pages are worth keeping, but our standards for them should be much, much lower than for entries. We deleted "American-born Chinese" because we (FSV of "we") decided it was SOP; but unless there was unanimous consensus that the term was obviously and unambiguously SOP, then I don't think that decision should carry over to the citations-page. —RuakhTALK 18:34, 8 December 2009 (UTC)
What Ruakh says makes sense to me. Keep citations-pages with exceptions; keep SoP citations pages unless there is unanimity about their SoPness. Keep talk pages with exceptions. --Dan Polansky 19:18, 8 December 2009 (UTC)
Agreed. And to add: “Citations pages for uncontestedly SoP phrases shall have their quotations copied to the citations pages of each of the phrase’s constituent words.” Agreed?  (u):Raifʻhār (t):Doremítzwr﴿ 20:42, 8 December 2009 (UTC)
I don't oppose "Citations pages for uncontestedly SoP phrases shall have their quotations copied to the citations pages of each of the phrase’s constituent words", yet it seems that copying citations of uncontested SoPs to the citations pages of the constituent terms should better be optional, especially for the case that the constituent citations pages are already full. --Dan Polansky 06:45, 9 December 2009 (UTC)
Citations pages can’t (in practice) be filled. How many is too many? Fifty? To my knowledge, there are maybe two such citations pages in the entire English Wiktionary. That simply isn’t a problem. OTOH, requiring such copying ensures that editors’ work and the sorely-needed citations resource (upon which a descriptive dictionary ought to be based, and of which we have an extreme paucity) aren’t wasted. Copying should definitely be obligatory.  (u):Raifʻhār (t):Doremítzwr﴿ 18:26, 11 December 2009 (UTC)
Err on the side of keeping Citations pages, and even when everyone agrees to delete, you can still keep the citations themselves by moving them to a different page. 70.112.24.181 05:50, 23 December 2009 (UTC)

Competition

A somewhat OT digression from the placenames discussion is the nature of the competition Wiktionary is engaged in. Rather than clog that discussion any more than it already is, I am attempting to lead this thread here.

The competition that Wiktionary is engaged in is not a voluntary thing. We are in it whether we choose to acknowledge or not. Depending on one's profession, one's living circumstances, one's life experience, and one's ideology, one may be more or less inclined to perceive it and accept it. Wiktionary competes for:

  1. the attention of users (against other online and other references);
  2. the time of qualified and willing volunteers (against a great number of language-oriented commercial, non-commercial, and semi-commercial (ad $) sites);
  3. for technical resources from WMF (against our sister projects);
  4. for financial support from WMF or potential donors (against other language-oriented educational vehicles).

Whether attempting to compete in a new arena against new competitors by entering a new universe of entry types will help us in any of these competitions for resources more than, say, quality improvement or targeted recruitment of volunteers is a yet undiscussed set of questions. DCDuring TALK 15:28, 9 December 2009 (UTC)

From what I gather, according to your personal opinion the most important part of Wiktionary is the English definitions: you want to forbid placenames so that people would focus more on that part. But Wiktionarians are not a herd of cattle; if you bar them in one direction, they won't necessarily flow to where you want. People who do placenames will do them; others who focus on en-content will focus on it. Besides, I don't think Wiktionary's English content can ever compete with commercial dictionaries: not in 5 years, not in 10. I know I never use it. Where we can compete is in translations, FL-entries, great etymologies, inflection information and lexicographical content about placenames. IMO. --Vahagn Petrosyan 16:18, 9 December 2009 (UTC)
Please keep the placename "discussion" under its heading above.
I personally believe that:
  1. the English language is the host language,
  2. English entries are the only ones that usually have full definitions and translations,
  3. English entries are what most users expect us to be good at,
  4. English entries are our most numerous entries.
  5. Many of our English entries are wrong, dated, or incomplete.
  6. Consequently, many of the translations are likely to be misleadingly wrong.
  7. If we continue to fail to get enough native English speakers to contribute to Wiktionary, then English Wiktionary will fail and, with it, all the translation work.
Wiktionary's main problem is the quality of the English-language definitions, the principal reason English speakers and, especially, writers use a dictionary. As these definitions are the targets of the translations, translations are unlikely to be superior in quality to the English definitions. There is no quantity of definitions and other work that is likely to overcome a significant English-definition quality deficit. DCDuring TALK 18:44, 9 December 2009 (UTC)
Sorry to break it down for you DCDuring, but in case you haven't noticed most of our native English contributors have rather limited contributions to English entries. In fact, they mostly focus on non-English entries in a language they happen to be studying at the moment. English entries have achieved enough quality and coverage to host FL entries, and only rather obscure and rarely used words remain to be added. (As well as many -ly, -ness, -ian, and similar secondary forms, as well as countless idioms that are being coined faster than any dictionary could record them). We cannot "fail", because our goal (all words in all languages) is pretty much impossible to achieve anyway. After some thinking, I pretty much agree with what Vahagn has written: as time goes on English entries will constitute an ever smaller share of total Wiktionary lemma entries, and most visitors from the Web will come here to look up inflection tables or a pronunciation of some Russian or Japanese verb because we happen to be the only site on the Internet that provides it.
I have no idea how to attract more native English contributors who are willing to contribute to English entries, but as time goes on and Wiktionary's general quality improves, the threshold for making quality contributions will only rise and only a rare sort of language freak will be left to attract.
I personally see little difference in a thoroughly covered FL and En entry - if the only difference is that the En term has definition lines (which FL term can have in terms of glosses). It would be IMHO extremely wrong to impose prejudice that English is somehow the "primary" target of this project and that absolutely everything else needs to be modeled around that thesis. No: We're here primarily to lexicalize words in all languages using English as a host language.
People are contributing in domains they feel like however they're able to. Our greatest potential lies in the inherent medium which is not bound by physical capacity, and the integrative approach. We simultaneously function as En-En, En-FL, FL-En dictionary, as a dictionary of names, etymological dictionary, we allow phrases and idioms, slang, proverbs, absolutely everything imaginable under the sky but which has never ever been compiled together.
I also firmly believe that in distant enough future EN Wiktionary will subsume all the other FL Wiktionaries. Perhaps in some OmegaWiki-type of common platform, or simply because we'll have 10 times more quality entries in their native languages than they do, and they'll get so demotivated that they'll simply relocate here. So it's sth to keep in mind for the long-term. --Ivan Štambuk 19:40, 9 December 2009 (UTC)
I am completely in agreement, and I take issue only with the 'impossibility' of achieving the goal. There are a finite number of words in all languages out there (so long as we are not compelled to 'define' the endless streams of gibberish uttered by madmen), and I'd bet that our 1.5 million entries cover a good chunk of all the words that have ever been that would qualify for inclusion. And if we add another twenty thousand place names to that total, so much the better, we'll draw more people to the site through them. bd2412 T 15:58, 11 December 2009 (UTC)
  • I think at the end of the day users will contribute however they want to; like Vahagn said, we're not "cattle". :P I'm not a fan of this apparently negative attitude, that the project somehow has the potential to "fail" because of supposed lack of quality in our entries. Now, I'd be the first to admit that Wiktionary has a long way to go to add new entries in English (not to mention the LOTEs) - but let's not forget how far we've come. I, for one, actually do use Wiktionary as a resource - it's quick and easy to search and, best of all, no ads. :) Moreover, I enjoy adding to the project and watching it grow, and see it more like a personal record of everything I learn in my language pair (Chinese/English), something which is very useful for me personally. As for this "competition" you speak of, DCDuring, isn't the whole point of volunteer programs that we don't have to desperately conform to commercial interests? Call me naive, but I'd rather just enjoy my time here and get on with the work that has to be done. Tooironic 21:43, 9 December 2009 (UTC)

Yes, there's competition considering a lot of things. Including, I'd like to add, competition for volunteers' time from different tasks which all have to be done, or at least could be done. Such tasks all have different weights for different users - if I may take myself as an example, the three tasks which compete most successfully for my attention while I'm here are Swedish definitions, Swedish translations and templates for Swedish, in that (approximate) order. Contributing to English definitions is quite far down, for the simple reason that I know my English is not quite advanced enough to do so. Would I instead like to add place names, to reconnect to the issue that started the whole discussion? Certainly not impossible, but most likely mainly Swedish. But: if these were to be banned - perhaps even further than they are today (I am e.g. not at all certain that Helsingfors would qualify under a strict interpretation of CFI - Swedish exceedingly rarely uses nouns attributively at all - if ever) - I would for certain not add more English definitions instead, for reasons already put forward.

So, in short I'm of the opinion that you can't be certain that people will work on what you think is important to work on just because you ban work on other things. Competition simply doesn't work that way. Instead we may see them go elsewhere where they *can* contribute with their special interests. \Mike 23:59, 9 December 2009 (UTC)

At this time, I'm only going to respond to DCD's #7 "If we continue to fail to get enough native English speakers to contribute to Wiktionary, then English Wiktionary will fail". The first website I spent my time contributing to allowed contributors the same kind of freedom (mostly) that Wiktionary or Wikipedia has, but existed long before either one. The small number of contributors that we had (about 6, but most of the content work was done by 4 people in their spare time) tended to focus on preparing content in smaller or lesser-known subjects, and we often overlooked the larger ones that people might normally expect users to want to find. What we discovered was that this bias actually contributed to our success. We kept very detailed statistics on how people arrived, what they went looking for, etc. In a growing internet, where most sites were focussing on general and big-picture items, we succeeded because our site floated to the top in searches for all the hard-to-find and obscure things people went looking for. We developed a reputation for being the place to look because of that, and the Smithsonian awarded our web site a medal. So, I have to disagree with DCD that failings with our big-ticket entries in English will lead necessarily to failure. It leads to problems (and often errors) in our Translations, but not failure. --EncycloPetey 02:38, 10 December 2009 (UTC)
What are the implications for us of your site's experience? The one I draw (of course) is that it would be really good to have good statistics on hits for anons, by source or source type (our own wikilinks, internal search box, sister-project link, portal (eg. OneLook), and search engine (Google)) and on which were the high-volume hits (eg, MILF).
Is the site still going in some form?
I don't know what you mean by big-ticket English entries? The long ones? The high-frequency-in-use ones? The high-frequency-of-hits ones? The high-number-of-translations ones? DCDuring TALK 18:01, 11 December 2009 (UTC)
Yes, the site is still running, although it's no longer growing like it used to, and has accordingly dropped in terms of its profile (relative to other similar sites). Policy and personnel changes that were made after a few years practically locked out additional growth of the site. Instead of free editing access to the various contributors, everything (new images, corrections, additions, minor edits) had to go through a single individual who had other responsibilities besides the site. As well, the various active contributors all left for other work around that time. The site does now have a professional graphics designer working to make things look nicer, but most "real" content has changed comparatively little since I was involved.
By "big-ticket" I mean those words that are common, have many meanings (even if we lack them), and usually have sizeable entries in print dictionaries as a result. --EncycloPetey 04:38, 12 December 2009 (UTC)

Toneless pinyin

There seems to be a consensus that toneless pinyin entries are not desirable; despite this they continue to be created. I've never seen it explicitly stated, so it would be nice to have something concrete. Nadando 19:26, 9 December 2009 (UTC)

They are quite ugly aren't they? And yet I suppose some beginner Mandarin learners might find such entries useful. Like many aspects of Chinese entries on Wiktionary, there is no consensus about this (AFAIK). I don't see anything wrong with them per se, as long as the actual tones are provided in the entry, along with tones in example sentences. (Unfortunately this is not the case with Shijieyu, renzao and some of yuyan, for example.) That being said, it would be hard to police such requirements as we Chinese contributors are stretched as it is and I think most of us probably don't want to spend our time adding tone marks to pinyin entries when we could be doing much more interesting things. So, yeah, there's two sides to it. Tooironic 21:50, 9 December 2009 (UTC)
For the record, I have been mainly responsible for deleting some of them and I have been doing so as per the precedents at talk:ou and talk:ojo. —Internoob (Disc.•Cont.) 01:03, 10 December 2009 (UTC)
Removing words easily attested in print, of the type that a reader is particularly likely to need help discerning, simply makes the dictionary less useful. bd2412 T 01:45, 10 December 2009 (UTC)
Also, the discussions you cite do not offer a "precedent" for wholesale deletion of thousands of entries. They involved very few members of the community, and ojo is not even Chinese. bd2412 T 01:59, 10 December 2009 (UTC)
It failed RFD. (Twice.) What can I say? I am aware that ojo is not Chinese. The very same arguments apply to Japanese as to Chinese with regards to romanization.
AFAICT, these words are not attested except in other-language texts. We don't use disambiguation pages on Wiktionary. That's why we have {{also}} and why the search feature displays hits for forms of the query with different diacritics. —Internoob (Disc.•Cont.) 02:21, 10 December 2009 (UTC)
here is an example of a book filled with toneless pinyin (worse yet, it mixes sections of toneless pinyin with sections of pinyin with tones, to guarantee maximum reader confusion). Toneless pinyin entries are not disambiguation, they are entries which reflect the actual use of words in the real world. If we prescriptively prohibit such uses, we merely prevent the dictionary from being useful to those most befuddled by the words they encounter. In any event, only two specific terms failed RfD, one of which is not a syllable, and is inapplicable to the specific discussion of pinyin syllables, which are the fundamental elements of the Chinese language. You offer no means of determining which particles are more or less likely to need such an entry, nor any means of assisting readers who come across a toneless term for which one option is an actual toneless entry (such as bo). Unless and until there is a statement of policy on this subject which achieves the broad consensus of the community (and not three people voting in an RfD), there is no basis for putting the resources into tearing down the hard work that went into making the hundreds of entries that have been made for this purpose. bd2412 T 03:31, 10 December 2009 (UTC)
If I remember correctly, we have had at least one or two discussions about this topic on Beer Parlor in the past. I believe the decision was to permit non-tone Pinyin entries, provided that they are properly formatted. However, non-tone entries are not a priority. Since most students of Chinese quickly move to Chinese characters within a few semesters, I originally decided to place an emphasis on non-tone entries for Beginning Mandarin words. In fact, I had made it through to the T's in the Beginning Mandarin list, before getting sidetracked with other things. Take a look at bangongshi for an example of a properly formatted non-tone Pinyin entry. Thanks. -- A-cai 02:35, 10 December 2009 (UTC)
To me, creating toneless pinyin entries seems not worth too much effort. I don't think toneless pinyin has no usage at all. E.g. since this is the official romanisation in China, company, product or street names may have this spelling. However, it is more important to have proper Chinese entries with pinyin with tones. If a word has become an English word (or a word in other languages) - like jiaozi, guanxi, pinyin, they are no longer just Chinese and the entry should say it. I already mentioned on abc123's page that toneless pinyin could be ambiguous like yaofang and we don't have the disambiguation pages. There are too many possible combinations, even pinyin with tones seems to be a waste of time but perhaps it could be used to find the proper Chinese entry or if search allowed to enter toneless pinyin (or with tones) to find the Chinese entry with ease. --Anatoli 00:53, 12 December 2009 (UTC)
I've thought this over, and think I have a solution. Instead of the current layout, we can simply use the {{misspelling of|}} template to correctly characterize toneless pinyin transliterations as a misspelling. bd2412 T 18:30, 10 December 2009 (UTC)
Different alternatives could be linked together as these examples: niuroumian, niúròumiàn, niu2rou4mian4, 牛肉麵, 牛肉面.
This format may be a practical solution for linking different alternatives of Chinese entry: {{cmn-noun|p|pin=[[niúròumiàn]] / [[niu2rou4mian4]] / [[niuroumian]]|pint=niu2rou4mian4|tra=牛肉麵|sim=牛肉面}}
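The relationship between the tone-marked and toneless forms linked above can be sketched in Python. This is only an illustration, not an existing Wiktionary tool: the function name strip_tones is hypothetical, and it assumes that only the four tone diacritics should be removed, so that the diaeresis in ü is kept.

```python
import unicodedata

# Combining marks used for the four Mandarin tones in pinyin:
# macron (1st), acute (2nd), caron (3rd), grave (4th).
TONE_MARKS = {"\u0304", "\u0301", "\u030c", "\u0300"}


def strip_tones(pinyin: str) -> str:
    """Return the toneless form of a tone-marked pinyin string.

    Hypothetical helper: only the four tone marks are removed, so the
    diaeresis of 'ü' survives (an assumption about the desired output).
    """
    # Decompose so that each tone mark becomes a separate code point.
    decomposed = unicodedata.normalize("NFD", pinyin)
    kept = "".join(ch for ch in decomposed if ch not in TONE_MARKS)
    # Recompose so that 'u' + diaeresis becomes a single 'ü' again.
    return unicodedata.normalize("NFC", kept)


print(strip_tones("niúròumiàn"))
```

A bot could use a helper like this to find which tone-marked entries a toneless page should point to, though the ambiguity raised above (one toneless form mapping to several tone-marked words) would still have to be handled in the entry itself.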
I'd go for that. As pointed out, these words are used. Are we denying that? Mglovesfun (talk) 18:34, 10 December 2009 (UTC)
That would be an RfV question - and it's not particularly hard to find volumes of toneless pinyin when you know what you're looking for. Can we get a bot to do the work though? Shouldn't be too hard to pluck out the forms that I added and replace them with templated lines. bd2412 T 18:47, 10 December 2009 (UTC)
An example of the lyrics of the Internationale in Romanized Chinese (non-tone-marked Pinyin)
One more example: Examples in Romanized Chinese (toneless Pinyin)
Romanized Chinese (non-tone Pinyin) are used in the Internet such as:
* Google hits: "niuroumian"
* Google hits: "kaoshi"
  • I don't think anyone really doubts that toneless pinyin transliterations exist. The question is, should we exclude them from the dictionary anyway, and, if we do include them, how do we do it. bd2412 T 02:29, 11 December 2009 (UTC)

I've changed a few to reflect them as common misspellings - see ai and bo, for example. bd2412 T 06:09, 11 December 2009 (UTC)

  • Maybe using the template "{{alternative spelling of|lang=cmn}}" is better, because toneless Pinyin spellings are formal spellings in some cases, such as this and this
  • Good point, although I fear we'd have to track down an official usage for each word to justify calling it an alternative spelling, rather than a mere misspelling. I suspect that if we look at enough street signs and official documents in China, we'd get every one of the 1,400+ base syllables. I'm game for whatever the community thinks is best. bd2412 T 15:49, 11 December 2009 (UTC)

My deletions and moves have all been reverted at this point. My sincerest apologies for acting without the due consensus. —Internoob (Disc.•Cont.) 00:13, 12 December 2009 (UTC)

I am going to open a vote fairly soon on how these issues should be addressed - should we use the current single-sense format, template these as misspellings, or template these as alternative spellings. bd2412 T 05:31, 14 December 2009 (UTC)

FWIW ou didn't fail RFD simply because it was toneless, but also because it acted as a disambiguation page and we don't do that here. So I wouldn't apply that example too easily to all other pinyin. Mglovesfun (talk) 11:36, 14 December 2009 (UTC)
Still working on a proposal - will take some time, as I'm pressed this week, and I'm aiming to get it right. Cheers! bd2412 T 05:13, 16 December 2009 (UTC)
Please take into account the efforts, the number of contributors we currently have and may have. Proper Chinese entries (in Hanzi, this is English, not toneless Pinyin) or translations from English into Chinese should be the priority. It may be fun to write sentences like "ni qu nar? wo qu mai dongxi" but this is not proper Chinese and we are a dictionary. Let me compare this with the discussion on place names. The English vocabulary here is in a very advanced state, we can add some extra stuff - expressions, place names, etc. Chinese content is still very low or not enough for a decent dictionary. As a result, we still need to save efforts and add more (proper) Chinese contents. When this is more or less achieved, we can worry about redirects, toneless pinyin and other features. --Anatoli 05:22, 16 December 2009 (UTC)
Actually, my primary concern is individual toneless pinyin syllables - of which there are only about 410, and for which I already made entries several years ago. The format of the entries that I made has been disputed, so the main question is whether to keep, delete, or change, those existing entries. bd2412 T 16:17, 16 December 2009 (UTC)

{{prefixsee}}, {{suffixsee}}

These templates transclude from Category:x pages suffixed with y, to insert a list of derived terms into prefix and suffix pages (see ante-). I'm hoping that this will make the categories more useful / visible. I don't know if it is possible to style so that it is not simply a long vertical list. Thoughts? Nadando 00:12, 10 December 2009 (UTC)

With CSS columns, this should be doable using <categorytree>, which according to the documentation takes the style attribute. Of course, columns aren't official yet, and older browsers (including I think current versions of IE and many others) don't support them.​—msh210 17:29, 10 December 2009 (UTC)

Loving this. Make it multi-column, please. --Vahagn Petrosyan 20:06, 17 December 2009 (UTC)

(British) Alternative spelling of

We have any number of entries that have {{UK}} {{alternative spelling of}} (or {{US}} {{alternative spelling of}}, but for brevity and clarity I'll stick to the UK ones). Some of these are actual UK alternative spellings: that is, they are alternative UK spellings of words that are, in the UK, primarily spelled a different way. But many — I suspect the vast majority — are actually the primary UK spellings of the word, but listed as {{UK}} {{alternative spelling of|some US spelling}}, I assume for want of a better way to wikify it. Well, that way now exists: Thanks largely to Conrad, {{context labelcat}} now exists, and — though I wish to avoid technical discussion here — can be used, if we slightly modify template:alternative spelling of, so that the latter template displays

  1. British spelling of foo.

linking 'British' (as {{UK}} does), and categorizing both as cat:English alternative spellings (as the template does now) and as cat:British English (as {{UK}} does). This method is already being successfully applied with {{eye dialect}}, q.v. This seems like a great boon, though care must be taken to only apply this change to primary UK spellings and not true alternative spellings (as I distinguish above), but I wanted to check here first to make sure there's no objection to this fairly major change.​—msh210 17:19, 10 December 2009 (UTC)

While the idea is fine in principle, I foresee a complication. We have a number of "UK" entries that become (UK, Aus) or (Commonwealth). Could this idea work with multiple regional contexts (and would that work well)? I also think this might be inconsistent formatting. There are some definitions whose usage is strictly "UK", so having a split in format style between those entries that have a restricted sense versus those that have a regionally restricted spelling could be confusing for both editors and users. --EncycloPetey 03:27, 11 December 2009 (UTC)
Re the "UK, Aus" issue: Should be doable by means of multiple parameters (e.g., from, from2, ..., from5) and code similar to that found in template:also. I'm not sure I understand your second issue, though, EP, so can you please give examples (made up, if you prefer) of the different kinds of senses that you say will confuse people?​—msh210 18:21, 14 December 2009 (UTC)
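The multiple-parameter idea (from, from2, ..., from5) can be illustrated with a short Python sketch of the text such a template might display. This is an assumption about the output wording, not the actual template code: the function spelling_label and the way the regions are joined are hypothetical.

```python
def spelling_label(target: str, *regions: str) -> str:
    """Build the definition line for a regional spelling.

    Hypothetical sketch: `regions` stands in for the template
    parameters from, from2, ..., from5; empty values are ignored.
    """
    named = [r for r in regions if r]
    if not named:
        # No region given: fall back to the plain wording.
        return f"Alternative spelling of {target}."
    if len(named) == 1:
        joined = named[0]
    else:
        # Join as "A, B and C".
        joined = ", ".join(named[:-1]) + " and " + named[-1]
    return f"{joined} spelling of {target}."


print(spelling_label("foo", "British"))
print(spelling_label("foo", "British", "Australian"))
```

With a single region this reproduces the proposed "British spelling of foo." line; how several regions should be worded and categorized is exactly the open question raised above.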
Two examples:
  1. If we say kerb is the "British spelling of curb", then how will the user know which meaning of curb is used in the UK?
  2. If we label kilometre as "British spelling of kilometer", there would be no doubt about the definition, but we'd be inconsistent in labelling. Consider that lift in the UK is the same as elevator in the US, but we can't say it's a "British spelling" because they're not the same word. We have to use (UK) to mark that definition of lift.
So, with the proposal above, we'd have some entries labelled with (UK), but others labelled with a non-gloss definition line. That's inconsistent and would confuse both editors and users. --EncycloPetey 04:02, 15 December 2009 (UTC)
There's currently no form-of template on [[kerb]]: there's a full definition. I have no problem leaving it there (nor, even, leaving full definitions on both kilometre and kilometer). I merely want to modify existing form-of entries, which brings me to your second objection. Yes, there will be inconsistency the way I envision it, but better inconsistency, imo, than inaccuracy, which is what we have now with "(British) Alternative spelling of..." on things that are not alternative in the British context (but, rather, the only British spelling). I'd even be happier with "(British) Spelling of..." than with what we have now, though I think that "British spelling of..." less ambiguous, referring more obviously to an only-spelling-of form.​—msh210 18:03, 15 December 2009 (UTC)
I like this idea, though I assume there are some cases where two spellings are British, so it really is {{British}} {{alternative spelling of}}; but this is certainly not the common case, and I would support moving to embedding the dialect in the template. Conrad.Irwin 00:28, 17 December 2009 (UTC)
Fine. I will implement this. Thanks for your input, folks.​—msh210 19:02, 28 December 2009 (UTC)

Category talk:Topical context labels

Give your two cents, please. Mglovesfun (talk) 10:23, 12 December 2009 (UTC)

You propose that you sort topical context labels currently found in Category:Topical context labels into subcategories, for which you are planning to create new subcategories in that category. The reason or stimulus for your proposal is that you find 567 items in the category for topical labels too much.
I think having a flat category for topical labels is just fine, so I'd prefer you don't. --Dan Polansky 14:50, 13 December 2009 (UTC)
I agree with Dan.​—msh210 18:17, 14 December 2009 (UTC)

bare-butt etc.

I'm not entirely happy about this "bare-butt" spanking stuff in example sentences (scroll down), as continually added by Verbo/Fastifex. They aren't natural-sounding examples and seem to exist only to serve some fetish. OTOH I don't know any Dutch to replace them with something saner. Should they be cleaned up? Equinox ◑ 07:41, 13 December 2009 (UTC)

IMO yes, we do not censor but there's no reason not to replace an example with a better example. This is a good example of one to replace, this one seems more justifiable as it's in the right context. It does seem to be a "fetish" of Verbo's as you say. Mglovesfun (talk) 11:29, 13 December 2009 (UTC)
This user has been blocked three times for this already, please do so again. Conrad.Irwin 13:07, 13 December 2009 (UTC)

Redirects for apostrophes

Irritatingly enough, the French Wiktionary uses the printers' apostrophe (’) so we typically use redirects to allow Interwicket to link to them by interwiki. As pointed out, rather than creating these redirects by hand, can't a bot do it? Example: violon d'Ingres. Click on the French interwiki, then click again to get back to English. Mglovesfun (talk) 11:33, 13 December 2009 (UTC)
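The core of such a bot is small. The sketch below only computes the redirect title and its wikitext; it does not talk to the wiki, and the function names curly_title and redirect_page are hypothetical. It assumes the redirect lives at the title with the printers' apostrophe and points back at the plain-apostrophe entry.

```python
from typing import Optional, Tuple


def curly_title(title: str) -> str:
    """Replace every typewriter apostrophe with the printers' apostrophe."""
    return title.replace("'", "\u2019")


def redirect_page(title: str) -> Optional[Tuple[str, str]]:
    """Return (redirect title, redirect wikitext), or None if not needed.

    Hypothetical helper: a real bot would also have to check that the
    redirect page does not already exist before saving it.
    """
    curly = curly_title(title)
    if curly == title:
        # No apostrophe in the title, so no redirect is required.
        return None
    return curly, f"#REDIRECT [[{title}]]"


print(redirect_page("violon d'Ingres"))
```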

Link to Foreign Language Wiktionary in inflection line?

On reflecting on the doublewiki vote: might we consider linking to the foreign language wiktionary in or near the inflection line, as we already do for translations?

For example, on the page for renard (fox (in French)), list the main line as something like:

renard(fr)

as is currently done by the translation {{t}} template?

This is because the most compelling reason to visit other language wiktionaries is to see their coverage of their own words – thus somehow highlighting this or making it easier would be very helpful, as we already do in the translation section.

Including a link in the inflection line may be distracting, but currently the link to the French entry for renard is relegated to the “in other languages” box, between فارسی (Farsi) and Galego – it would be great if it could be highlighted more prominently. Thoughts?

—Nils von Barth (nbarth) (talk) 07:59, 14 December 2009 (UTC)

There is a preference which does something similar. If you check the option "Show an interwiki link under the language heading when one exists in the sidebar.", you'll see it does something similar to what you ask for. Of course, one could consider doing this the default option... someone? \Mike 08:19, 14 December 2009 (UTC)
Thanks Mike! That’s exactly what I had in mind. It’s admittedly a bit buried – perhaps it’s useful enough that it should be default?
—Nils von Barth (nbarth) (talk) 08:27, 14 December 2009 (UTC)
The one drawback to this is that there's no guarantee that the entry will be for the same language. That is, our Galician entry for cantar might let you go to the Galician Wiktionary, but their entry might be only for the Portuguese or for the Spanish word with that spelling. Programming the links to go to the other Wiktionary and to the correct language section is much trickier. --EncycloPetey 03:56, 15 December 2009 (UTC)
This is a good point - there is no guarantee that a language (esp. a less-used language) will necessarily have entries for their own language (for terms that exist in their language and others), so I'd be careful about doing anything too formal or automated (too many moving parts), but it's a pretty good bet, and perhaps worth highlighting.
—Nils von Barth (nbarth) (talk) 04:23, 25 December 2009 (UTC)

Main page redesign

I've proposed some changes to the proposed Main Page redesign. Perhaps with these changes, the redesign could be doable. --Yair rand 06:52, 15 December 2009 (UTC)

My concern is for the Prather family history. I am finding less and less about them.

Somewhere somebody has the information. Why not let it be found here?

I am willing to help. I don't see where I need to commit myself to contributing any money. Times are hard.

Jack Prather

Not here, this is a dictionary
What you need is a genealogical database - try FamilySearch.org. Cheers! bd2412 T 05:12, 16 December 2009 (UTC)
Though, if anyone's interested, there is a proposed Wikimedia project called Rodovid which is a genealogical wiki. The demo site has 250,000 records. --Yair rand 19:52, 16 December 2009 (UTC)

Fundraiser (again)

For those who find the strangely centered, colour-pulsating, fundraiser banner as irritating as I do, the WT:PREF, "Hide the display of site-wide notices at the top of the screen." is bliss-inducing. Though a similar effect can be achieved by adding

#siteNotice, #fundraiser, .fundraiser-box {display:none !important;}

to your monobook files. Conrad.Irwin 02:28, 16 December 2009 (UTC)

The current banner doesn't show for me, but it does seem to interfere with the accelerated page creation. I usually have to attempt to create the page three or more times before it fills in the information correctly, which is very irritating. --EncycloPetey 04:23, 19 December 2009 (UTC)

Anagram layout

As noted by Bequw, the anagrams section of tesla was fairly monstrous, I've updated it to put everything on the same line, which uses about 5% of the amount of screen space. As this change goes against the example given in WT:ELE (though not against the text), I thought I'd ask your permission before updating Conrad.Bot to use the new horizontal format henceforth (old anagrams sections won't be updated unless they need anagrams added or removed). Are there any objections? Conrad.Irwin 02:28, 17 December 2009 (UTC)
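To make the two layouts concrete, here is a minimal Python sketch of how an anagrams section might be generated. It is not Conrad.Bot's code: the function names and the exact wikitext of each layout are assumptions for illustration.

```python
from typing import Iterable


def alphagram(word: str) -> str:
    """Letters of the word in alphabetical order; anagrams share this key."""
    return "".join(sorted(word.lower()))


def vertical_layout(anagrams: Iterable[str]) -> str:
    """One bulleted line per anagram (assumed form of the older layout)."""
    return "\n".join(f"* [[{w}]]" for w in sorted(anagrams))


def one_line_layout(anagrams: Iterable[str]) -> str:
    """All anagrams on a single bulleted line (assumed form of the new layout)."""
    return "* " + ", ".join(f"[[{w}]]" for w in sorted(anagrams))


words = ["stale", "least", "slate", "steal", "tales"]
print(alphagram("tesla"))
print(one_line_layout(words))
```

The vertical layout grows by one line per anagram, while the one-line layout stays at a single line however many anagrams a word has, which is where the saving in screen space comes from.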

The "everything on the same line" link above should be to http://en.wiktionary.org/?oldid=8078582, not what it links to now.​—msh210 02:59, 17 December 2009 (UTC)
I prefer the one-line way. It does seem to violate ELE, but I, for one, am willing to overlook that, or change ELE.​—msh210 02:59, 17 December 2009 (UTC)
I think they should be kept vertical unless it's longer than five or so. We have the same situation with the synonyms/antonyms lists, that they look a little ridiculous when they get really long. I don't think there should be a bot running in violation of ELE, so I think it might be a good idea to hold a vote to modify ELE. --Yair rand 03:11, 17 December 2009 (UTC)
From memory, synonyms are supposed to be in horizontal lists, one per sense. I did think about the "up to five" idea, but it seems fairly arbitrary; a word with five synonyms has five lines, a word with six has one; if the word with six was still on five lines it would feel more consistent, but then it wouldn't make much sense. Conrad.Irwin 13:34, 17 December 2009 (UTC)
Maybe the vote can trim the Anagram section of the ELE a bit, and have the majority at Wiktionary:Anagrams, that way if we want to make minor layout changes a vote won't be required. --Bequw → ¢ • τ 17:03, 17 December 2009 (UTC)
What if we did like this for large sections?:
The one-line proposal is fine too. I'm not fussy about the arbitrariness of what defines a "long section", personally. —Internoob (Disc.•Cont.) 03:24, 18 December 2009 (UTC)
I don't think that's worthwhile. I would prefer we be fully consistent, rather than collapsing some sections and not others, and the vast majority of anagram sections aren't long. We could use visible columns, if needed. --EncycloPetey 04:21, 19 December 2009 (UTC)


I am still under the impression that all on one line is more aesthetic; would people prefer multiple columns? Conrad.Irwin 12:58, 19 December 2009 (UTC)
Using one line for all anagrams entries is a possibility with me too, as long as we do it consistently for all entries. --EncycloPetey 13:06, 19 December 2009 (UTC)

Started Wiktionary:Votes/pl-2009-12/Modify anagram section of ELE. --Bequw → ¢ • τ 23:33, 19 December 2009 (UTC)

I prefer all on one line, as opposed to the two in the vote, but I'm not overly fussed. Conrad.Irwin 23:35, 19 December 2009 (UTC)
How would you prefer to show the alphagram (the only reason that I listed two lines instead of one). --Bequw → ¢ • τ 01:13, 23 December 2009 (UTC)
As above: Conrad.Irwin 11:33, 23 December 2009 (UTC)


{{idiomatic}}

Is this actually useful as a context template? For example, at [[Nuremberg defense]], what does the "idiomatic" tag tell me that's not otherwise obvious? —RuakhTALK 15:53, 17 December 2009 (UTC)

The fact that a given set of words is idiomatic is already given by its mere existence, as is explained at WT:CFI. However the template categorizes to [[:Category:<language> idioms]] which is useful, so it might just need a different name. -- Prince Kassad 16:45, 17 December 2009 (UTC)
That would be true were it not for the numerous actual exceptions to WT:CFI that actually exist, not all by accident or unintentional neglect. DCDuring TALK 17:59, 17 December 2009 (UTC)
I think it's possibly overused. There are certainly some pages where it is used well, putting heart under {{anatomy}} is pretty much stating the obvious too. There are cases like mettre en bouteille where, for me, the tag helps. Mglovesfun (talk) 16:56, 17 December 2009 (UTC)
Some uses and beneficiaries:
  1. It is useful for editors as a marker of that justification for inclusion (or lack).
  2. It is useful to all to distinguish idiomatic from non-idiomatic senses of a headword.
  3. It has been useful for English editors as a tool to migrate away from use of Idiom as a PoS header in favor of more grammatical PoS headers. It might be useful in the same way in other languages if they migrate away from the Idiom PoS header.
  4. It is useful to new users as a marker of a departure in en.wikt's approach to idioms (separate headwords) vs. what most general print dictionaries do (placement under the first and/or "heaviest" component word).
  5. It is useful, mostly for editors, because it creates a category which can provide a useful list for scanning or for narrowing clean up lists using the intersection search tools.
As a matter of policy we might do well to not assume the identity between being idiomatic and meeting WT:CFI. We already have a few entries that are explicitly only translation targets and more "phrasebook" entries. DCDuring TALK 17:59, 17 December 2009 (UTC)
As DCDuring implies, since we (arguably de jure, and certainly de facto) include unidiomatic senses of idiomatic expressions, this is useful.​—msh210 23:11, 17 December 2009 (UTC)
Really? O.K., if we assume that "unidiomatic" means "not being an idiom" (which isn't what it means in real life, but is basically how we use it here), then I'll grant that we do include some "unidiomatic" senses. But aren't those the exception rather than the rule? Shouldn't those senses be the ones set apart (by giving definitions such as {{non-gloss definition|Used literally; see this, is, not, an, idiom}})? I mean, if we were only using {{idiomatic}} at entries that have one or more "unidiomatic" senses, that would be one thing (though not my preferred approach), but at an entry like [[Nuremberg defense]], how is a reader supposed to guess the purpose (and therefore meaning) of the tag? (BTW, I don't think "Nuremberg defense" is really an idiom in the ordinary sense of that term. I suppose it's an idiom in that it's multiple words and not guessable from its parts, but it's actually more of an "allusion". I don't think anyone uses it without knowing what it refers to.) —RuakhTALK 23:26, 17 December 2009 (UTC)
I agree with Ruakh here.  (u):Raifʻhār (t):Doremítzwr﴿ 23:44, 17 December 2009 (UTC)
No, the only appropriate use I can think of for this tag is on idioms' entries that have "unidiomatic" (as we use it here, yes, Ruakh) senses. But now that you mention it, marking the unidiomatic senses does make more sense.​—msh210 00:45, 18 December 2009 (UTC)
There is a great deal to criticize in our definition and use of this tag. I would welcome a recommendation in detail as to how this could be done better and how any change could be accomplished without the process being entirely manual. It would be nice if we had some concept that did not require large-scale revision in one or two years. —This unsigned comment was added by DCDuring (talk • contribs).
I'll look for some examples of where this is used in Latin. I find the {{idiomatic}} and {{figurative}} tags very useful for Latin, since there are many Latin words that have an "everyday" or literal use, but also have one or two senses that are highly idiomatic. Marking them as such shows much more about meaning in Latin than not marking them. --EncycloPetey 02:43, 18 December 2009 (UTC)
Thanks, that would be useful to see. —RuakhTALK 00:04, 19 December 2009 (UTC)
Hmmm... On search and reflection, I guess I haven't used {{idiomatic}} much for Latin after all. I found only four entries where it was used, and two of those don't need it. The one that I think most benefits is amabo te, which literally means "I will love you", but translates more often as please. If examples for {{figurative}} would still be helpful, I know I have done lots of those and could pull a few. --EncycloPetey 04:19, 19 December 2009 (UTC)
Nah, don't worry about it. I use {{figurative}} all the time, too. Thanks for looking. :-)   —RuakhTALK 13:41, 19 December 2009 (UTC)

Licensing of audio files

Hi,

This question is not specific to Wiktionary only. Please let me know who to ask if it's the wrong place.

What are the licensing arrangements for the audio files? Can they be downloaded and used on other websites? Or can they only be linked to with a direct link? --Anatoli 01:46, 18 December 2009 (UTC)

Each file may be different. You should check the file's page on the Commons for its license.​—msh210 01:54, 18 December 2009 (UTC)
Either. See commons:Commons:Reusing_content_outside_Wikimedia for details. If the other site is running MediaWiki it can be configured to use files from wikimedia commons directly. Commons does not allow files that cannot be reused, with the exception of some of the wikimedia logos. Conrad.Irwin 01:58, 18 December 2009 (UTC)
But — and this is the reason I said to check licenses — you may have to do something to use the file elsewhere, such as include (or link to) a copy of the license.​—msh210 02:20, 18 December 2009 (UTC)

Thank you for the answers! --Anatoli 02:55, 18 December 2009 (UTC)

Greek → Modern Greek (again)

Previous discussion: Wiktionary:Beer parlour archive/2009/August#Greek derivations.

Please take note of this. It seems that, despite the use of {{etyl|el}} and {{etyl|grc}} in place of {{Gr.}} and {{AGr.}}, contributors still confuse Modern Greek (displayed as Greek) with Ancient Greek. This is probably down to the fact that most people don’t think to distinguish the two, and it really doesn’t help that a lot of dictionaries write “Greek” (or an abbreviation thereof) in their etymology sections when they mean “Ancient Greek”. I’m guessing that the switch to ISO codes hasn’t helped much because:

  1. A contributor wants the language-linky code thing that he’s seen in other entries for the etymology section he’s writing (if he doesn’t, then the language ends up as Greek, written in plain text only, and still suggesting derivation from Modern Greek), but he doesn’t know the ISO code.
  2. He searches for “Greek” on Wikipedia, which gets him to this disambiguation page.
  3. He scrolls down, and more often than not, seeing “language” in the name of the link (perhaps using Ctrl+F to look for it), clicks on the link for Greek language.
  4. He looks for the language’s ISO code in the section entitled “Language codes” in the box on the right (or uses Ctrl+F for “iso”), finds the ISO 639-1 code el, and uses that.
  5. The result looks fine, so he carries on oblivious, thus introducing an avoidable error to the project.

The best solution I can see would be to “outlaw” the use of el as a language code on Wiktionary; use of el would autocategorise that entry into a clean-up category, whose members would require having that ISO 639-1 substituted with any of the seven more specific ISO 639-3 codes, viz. grc, ell, pnt, gmy, gkm, cpg, and tsd (for Ancient Greek, Modern Greek, Pontic Greek, Mycenaean Greek, Byzantine Greek, Cappadocian Greek, and Tsakonian Greek, respectively). The initial clean-up effort could be as sluggish as it wants if the clean-up category is made a HIDDENCAT and the displayed language of {{etyl|el}} isn’t changed; however, this leads to the eventual elimination of this error and allows its easy detection in future. Does that sound like a good idea to everyone?  (u):Raifʻhār (t):Doremítzwr﴿ 20:39, 18 December 2009 (UTC)
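The detection half of this proposal could be prototyped outside the template system. The Python sketch below is only an illustration: the function name needs_greek_cleanup and the regular expression are assumptions, and the code table simply restates the seven ISO 639-3 codes listed above.

```python
import re

# The seven more specific ISO 639-3 codes named in the proposal.
GREEK_CODES = {
    "grc": "Ancient Greek",
    "ell": "Modern Greek",
    "pnt": "Pontic Greek",
    "gmy": "Mycenaean Greek",
    "gkm": "Byzantine Greek",
    "cpg": "Cappadocian Greek",
    "tsd": "Tsakonian Greek",
}

# Matches {{etyl|el}} and {{etyl|el|...}}, but not {{etyl|ell}}.
ETYL_EL = re.compile(r"\{\{\s*etyl\s*\|\s*el\s*(\||\}\})")


def needs_greek_cleanup(wikitext: str) -> bool:
    """True if the text uses the bare ISO 639-1 code 'el' in {{etyl}}."""
    return bool(ETYL_EL.search(wikitext))


print(needs_greek_cleanup("From {{etyl|el}} [[word]]"))
print(needs_greek_cleanup("From {{etyl|grc}} [[word]]"))
```

A scan like this only finds candidate entries; deciding which of the seven codes is correct for each one would still need a human editor.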

Re: your first two sentences: The relevant edit was part of Autoformat's misguided mass-conversion, more than a year ago, of {{Gr.}} to {{etyl|el}}. Before the conversion, use of {{Gr.}} for Ancient Greek was very common, but after the conversion, I'm not sure that any human editors have been using {{etyl|el}} where they should be using {{etyl|grc}}. —RuakhTALK 23:35, 18 December 2009 (UTC)
Hmmm, OK. Nevertheless, IMO, the scenario set out in 1–5 seems pretty plausible to me. Furthermore, I think moving from the catch-all, imprecise ISO 639-1 code el to the seven ISO 639-3 codes would be a definite improvement that we should institute regardless; what do you reckon?  (u):Raifʻhār (t):Doremítzwr﴿ 23:41, 18 December 2009 (UTC)
Well, according to the standards, el means Modern Greek; it's synonymous with ell. It's not really a catch-all or imprecise code, it just became one here as a result of an Autoformat mishap. I don't think we should start trying to distinguish el from ell; but the last time I raised this issue, I suggested using el-GR (Modern Greek as spoken in Greece), which amounts to the same thing. Alternatively, we could take the opposite approach: bot-convert all the {{etyl|el}} uses to {{etyl|Greek}} for cleanup, then change {{etyl|el}} to be explicit about meaning Modern Greek.
But either way, I don't think further BP discussion is going to get us anywhere. The current approach is to use {{etyl|el}} both for real Modern Greek derivations and as a cleanup template for the auto-converted entries; and the editors who are actually working on cleaning out seem to be happy with that approach. Personally, I don't like it, but we can't really force them to change. (But if they change their minds, then we can certainly offer suggestions, technical help, etc., towards a better approach.)
—RuakhTALK 23:58, 18 December 2009 (UTC)

Wiktionary:Votes/2009-12/Proposed CFI exception for SI Units

Pursuant to the unresolved discussion at Wiktionary:Beer parlour archive/2009/September#SI units and abbreviations, and several pending RfV/RfD nominations, I have initiated Wiktionary:Votes/2009-12/Proposed CFI exception for SI Units. Cheers! bd2412 T 21:15, 18 December 2009 (UTC)

AWB request

Even though I already have AWB access, I would like to ask the community if I can use my AWB access to make changes like this and this to Ido nouns to get all of them standardized. The edits will be made manually because of how complex the changes are. Thank you, Razorflame 00:14, 20 December 2009 (UTC)

Seems fine to standardize on the Ido inflection line template. --Bequw → ¢ • τ 01:28, 22 December 2009 (UTC)

Proposed pages

I would like to propose two new pages, one for handling unresolved CFI and ELE issues, and the other for suggested templates, bots, and scripts.

The first of these would hold (in subpages) links to old archived (or not yet archived) discussions about ELE and CFI, and would provide centralized discussion areas for working on issues that have not been finished yet. Often, these discussions would have to take place partly in the BP to have a wider audience, but these could be moved afterward and continued. I think it would best look something like this (the draft does not actually contain all unresolved CFI/ELE issues, as I don't know what all of them are).

The second of these would be for suggested templates, bots, and scripts that there is no immediate need for, but would be useful to have. I notice that many users do have good ideas for bots, but do not have the necessary programming skills to build their own. Suggestions could remain on the page until they are either built, or have been proven to be unnecessary or impossible. Maybe something like this.

This leads to a third possible page, which could be for suggested suggestions pages, but I don't think we need that now :D.

So, what does everyone think? --Yair rand 08:25, 21 December 2009 (UTC)

The bots and scripts can stay on WT:GP, I have no objections if you want to create the page for old CFI and ELE issues, it'd be nice to have some proper thematic archives and that seems a good way to start. Conrad.Irwin 12:41, 21 December 2009 (UTC)
I think a single placement for CFI and ELE discussions would be helpful, and I support this idea. I basically agree with Conrad, continued Grease Pit discussion of bots, etc., is fine. bd2412 T 20:22, 22 December 2009 (UTC)
Wiktionary:Unresolved CFI and ELE issues now started, although it really needs work. --Yair rand 03:02, 30 December 2009 (UTC)

Wiktionary:List of protologisms/large numbers

Can I have the content of this page before it was deleted pasted to my userpage? I am the originator of the content. Shoof 13:57, 21 December 2009 (UTC)

Not sure who did so, but looks like it's there now. DAVilla 06:16, 23 December 2009 (UTC)
Why does list of protologisms actually exist? Isn't it just a green light to create nonsense in a Wiktionary appendix? Mglovesfun (talk) 06:51, 23 December 2009 (UTC)
Apparently, just like the /more pages in Wikisaurus. --Yair rand 16:52, 25 December 2009 (UTC)

Toneless pinyin proposal

[Images with captions: "At top, bei, jing, lu; bottom left, yan, zhong, lu; bottom right, guang, wei, lu." / "dong, da, qiao, lu." / "min, zhu, lu." / "Top: li, jiang; bottom: de, qin." / "fung and shing."]

I am preparing a vote at Wiktionary:Votes/pl-2009-12/Treatment of toneless pinyin syllables, regarding our options for the presentation of toneless pinyin syllables, discussed above at #Toneless pinyin.

A fundamental feature of spoken Chinese is the use of tones - four specific variations in pronunciation which can impart different meanings to any of the ~410 basic syllable combinations which make up the language. Chinese characters are romanised using pinyin, in which these tones are usually represented as either accents over the affected vowel, or numbers next to the syllable (for example mā and ma1). There are numerous instances of "toneless pinyin" out there in the wild - that is, instances where someone has used words that should have a tone, but left the tone out. This is particularly prevalent in the names of Chinese cities (like Beijing, which is bei and jing), with Chinese street signs, on Chinese currency, in certain official documents, in Chinese restaurant names, and in certain common menu items (e.g. kung pao chicken and wontons), as well as in some books and bibliographies discussed earlier. I made entries for all 410 or so toneless pinyin syllables in 2007. We have four basic options on how to treat these, with examples set forth below.

The first is to continue using the format that I used in making the initial entries. From gang:

Mandarin

Pinyin syllable

gang

  1. A transliteration of any of a number of Chinese characters properly represented as having one of three tones, gāng, gǎng, or gàng.
Usage notes

English transcriptions of Chinese speech often fail to distinguish between the critical tonal differences employed in the Chinese language, using words such as this one without the appropriate indication of tone.


The second option is to treat them as misspellings. From chan:

Mandarin

Pinyin syllable

chan

  1. {{misspelling of|chān|lang=cmn}}
  2. {{misspelling of|chán|lang=cmn}}
  3. {{misspelling of|chǎn|lang=cmn}}
  4. {{misspelling of|chàn|lang=cmn}}
Usage notes

English transcriptions of Chinese speech often fail to distinguish between the critical tonal differences employed in the Chinese language, using words such as this one without the appropriate indication of tone.


The third option is to treat them as alternative spellings. From zhen:

Mandarin

Pinyin syllable

zhen

  1. {{alternative spelling of|[[zhēn]]|lang=cmn}}
  2. {{alternative spelling of|[[zhěn]]|lang=cmn}}
  3. {{alternative spelling of|[[zhèn]]|lang=cmn}}
Usage notes

English transcriptions of Chinese speech often fail to distinguish between the critical tonal differences employed in the Chinese language, using words such as this one without the appropriate indication of tone.


The fourth option is to exclude them altogether. A side issue is, if they are retained in any form, whether to also include the usage note which I added when I made the initial entries.

Cheers! bd2412 T 02:10, 22 December 2009 (UTC)
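For illustration, the relationship between a toneless syllable and its toned forms can be sketched in a few lines of Python. This is a minimal sketch, not how the entries were actually generated; the function names are hypothetical. It removes only the four tone diacritics (macron, acute, caron, grave), so the diaeresis in syllables such as lü is preserved.

```python
import unicodedata
from collections import defaultdict

# Combining marks used for the four tones: macron, acute, caron, grave.
TONE_MARKS = {"\u0304", "\u0301", "\u030c", "\u0300"}


def strip_tones(syllable):
    """Return the toneless form of a pinyin syllable written with accents."""
    # Decompose so each tone mark becomes a separate combining character.
    decomposed = unicodedata.normalize("NFD", syllable)
    kept = "".join(ch for ch in decomposed if ch not in TONE_MARKS)
    # Recompose so that e.g. u + diaeresis becomes a single character again.
    return unicodedata.normalize("NFC", kept)


def group_by_toneless(syllables):
    """Group toned syllables under their shared toneless spelling."""
    groups = defaultdict(list)
    for syllable in syllables:
        groups[strip_tones(syllable)].append(syllable)
    return dict(groups)
```

Calling `group_by_toneless` on the three toned forms listed for zhen above would place them all under the single key `zhen`, which mirrors the structure of the second and third options.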

Wouldn't it make sense to put the Usage notes content inside a template? --Yair rand 06:33, 23 December 2009 (UTC)
Actually, it is in a template for chan, but not for zhen or gang. --Yair rand 06:37, 23 December 2009 (UTC)
It used to be in a template for all of them, but I long ago subst'ed them. The wording is unlikely to change, but I've raised the question of whether we ought to have the usage note at all in the proposed vote. I'd like to add to the above that whichever option is ultimately selected, I can make all the changes with AWB in the space of a few days. bd2412 T 21:05, 24 December 2009 (UTC)

Braille

We seem to have no specific policy (or even a precedent) on how to format Braille entries. In , we have English, Chinese, Cyrillic and Japanese Braille information under the ==Translingual== header, and information about Hebrew and Korean Braille is under ==Hebrew== and ==Korean==, respectively. The same is true for all of the other Braille entries. It seems like we should standardize these by either placing all information under the Translingual header, or by putting each piece in its respective language header. --Yair rand 05:36, 22 December 2009 (UTC)

This is an odd consequence of the fact that Braille symbols encode for letters in multiple scripts and that by our Translingual policy characters in some scripts are considered Translingual and others are not. Since the Latin, Cyrillic, and Hiragana scripts are used by multiple languages, characters in those scripts can have "Translingual" entries (see Latin a, Cyrillic а, and ). This is not the case for Hebrew and Korean (though "technically" Hangul is also used now in Cia-Cia) where the characters still can only have individual language entries. So the Translingual entries for the three Translingual scripts are together and the other two are separate. I think there is room for improvement somehow though. --Bequw → ¢ • τ 06:24, 22 December 2009 (UTC)
Fyi, Hebrew characters are also used in other languages (though I don't know whether Hebrew Braille is).​—msh210 20:11, 22 December 2009 (UTC)
In any event, I think certain Braille characters need an English section: those that code for words. For example, codes for child in English, per this font of information on Braille.​—msh210 20:11, 22 December 2009 (UTC)

Proposal to pick a language for our next focus

I'd like to think at this point that we have coverage of almost every word in the English language for which people are likely to turn to a dictionary seeking a definition, synonyms, or similar guidance. I'd like to propose that one of our main value-added propositions is our utility as a translating dictionary. To that end, I'd like to propose that we as a community pick one specific language, and agree to spend the next month expanding our coverage of that language (both in the addition of entries in that language, and in the inclusion of translations into that language in English entries), through all the tools at our disposal. I feel that, given that month (and our existing decent coverage of widely spoken languages), we will be able to surpass the coverage of the typical translation dictionary. I would also suggest that either Spanish or French would be a good place to kick off such a focus, given our existing strong coverage of each. Thoughts? bd2412 T 03:49, 24 December 2009 (UTC)

While I think that this is a good idea, I just don't think that we have the userbase to be able to do so. There aren't enough users that would be able to spend a whole month helping out in our coverage with certain languages. I believe that instead of focusing on particular languages, we should instead let people continue making pages for the languages that they like to make the pages for as that will help us expand our coverage of more than one language at a time, which would be better than spending one month working on just one language. Razorflame 04:05, 24 December 2009 (UTC)
I certainly respect the fact that users will work on what they are interested in working on, but I think if we come together on this, enough people will put in enough time to make it worth doing. bd2412 T 04:24, 24 December 2009 (UTC)
It is certainly worth doing, but I just don't know if enough users will be able to come together on this. We could always try, though :) Razorflame 04:36, 24 December 2009 (UTC)
If others are for it, I say why not try. bd2412 T 04:42, 24 December 2009 (UTC)
I disagree with the premise that we have sufficient English coverage. We sorely lack many compound terms. In the past few weeks, we've added mobile home and electron cloud. Rather than picking a new language, I think we need to seek out compound terms in English that we lack.
In addition, we very badly need to clear up missing and incomplete definitions of existing English entries. Having an entry is not the same as having all the basic information. Consider the entry fluidly, whose entire entry consists of a PoS header and the *ahem* definition "in a fluid manner". So, if I hear that someone "spoke fluidly", does that mean he drooled a lot while talking? And what about the etymology, the pronunciation, the synonyms, the translations tables, etc.? There may be an entry for fluidly, but it's hardly an entry that would enlighten anyone. There is still a LOT of work to be done on our coverage of English words. --EncycloPetey 05:14, 24 December 2009 (UTC)
At least this would get us started on translation tables for words having none. I see your point, however. Perhaps a future project then, once English is more fully accomplished. bd2412 T 05:21, 24 December 2009 (UTC)
I would like to see a process of advancing our English definitions to a state where they were worthy of the effort of, 1., initial translation and, 2., translation review. To focus translation effort initially, we might consider the, 3., use of {{trans-see}}, which, in turn, requires the, 4., identification of synonyms.
  1. One class of terms that need an initial translation effort will be adverbs. In almost all monolingual dictionaries most adverbs are not defined. We would be differentiating ourselves somewhat by providing good definitions and translating them. Until the adverb definitions are checked for adequacy, I am not sure the translation effort is worth it.
    Another class of terms are English idiomatic predicates and English idioms in general. The rationale for having them is that they cannot be understood or translated word-for-word. Yet a large portion of them have no translations. Massive insertion of trans tables and {{trreq}} would be a constructive step for terms that seem to have adequate definitions.
  2. Many basic terms have benefited from massive translation effort. Unfortunately much of that effort was applied to entries that were not renovated after import from Webster 1913. Visviva has compiled a list of terms on the GSL list of basic words that show signs of needing updating. It would help if we had some marker for a sense being sufficiently up to date to be translation-worthy.
  3. There is little value to copying translations to truly close synonyms. There is good reason to have translation effort concentrated. That is the purpose of {{trans-see}}. For English terms that have particularly close synonyms, it is good to mark the term most worthy of translation effort with trans-see. To help translators and users it would be very desirable for the template to direct them to the specific Etymology-PoS. If trans-see directs users to our more massive basic-word entries, it may be necessary to remove it.
  4. To apply trans-see and to help with translations, it is helpful to have synonymous entries to refer to. Some entries do not have synonyms marked or wikilinked, or the in-line synonym links are lost in a sea of blue links.
While this is going on, there is no reason not to have massive addition of words in other languages. DCDuring TALK * Holiday Greetings! 11:48, 24 December 2009 (UTC)
If that is what you want to do, then that would be very useful. But experience has shown that you can't tell (or even ask) people to cooperate on a project. We all do what we want. Cheers. SemperBlotto 11:56, 24 December 2009 (UTC)
It is part of what I have been doing. They are just suggestions for others. They fit into a program that couldn't possibly be completed without others working on it or similar programs too. The idea of a program in a volunteer effort is to solicit interest to get efforts that add up to more than the individual steps. I gather that I don't have your interest, which I regret.
My objective is to improve English definition quality enough so that translation effort is not wasted. I am, in part, responding to complaints about an excessive number of {{ttbc}}. To avoid excessive ttbcs (or divergence of trans glosses and definitions) translation effort shouldn't be wasted on poor entries, such as those still with obsolete 1913 wording. I would think it very cool if by 2013 we had honored Webster's 1913 by updating its language in our entries. DCDuring TALK * Holiday Greetings! 13:19, 24 December 2009 (UTC)
We have a lot of missing English entries. I worked through a little bit of the first S page of User:Brian0918/Hotlist. Before we get anything big started with foreign-language entries, I think it'd be better to perhaps work on English quite a bit. There are lots of 'missing lists'... L☺g☺maniac ☃ 15:13, 24 December 2009 (UTC)
After some further thought, I'm wondering if it would be a good thing to have an organized Wiktionary:Sign up sheet or something with the major tasks to be done for each language listed so that users (especially newcomers) can have some idea of what needs to be done and who's working on what. Any thoughts on that? L☺g☺maniac ☃ 16:11, 24 December 2009 (UTC)
As with everything, it needs a dedicated user to devote time and energy into keeping it up to date, WT:DW (the previous attempt at something like this) fell into disuse; I think the WT:CDPR is a better idea, where people can create lists of things and mark them off as they get done. Conrad.Irwin 16:43, 24 December 2009 (UTC)
Wiktionary Community portal has a section one or two page-downs from the top that directs users to some particular pages. Perhaps that could be made more prominent. Perhaps, also, we could identify and mark appropriately some tasks that are well suited for new users and suggest what skills are required (eg, basic knowledge of WT:ELE, knowing how to bold words in a usage example or quote, advanced knowledge of English grammar, knowing how to add or remove wikilinks appropriately, ability to add synonyms, ability to add transitive, intransitive tags, ability to detect and mark hard-to-understand definitions).
This would require:
  1. Some specific rfc-tags,
  2. Corresponding automated cleanup list pages with instructions,
  3. A page to help users find the tasks, and
  4. Some means of advertising some or all the tasks to new users (possibly a "featured task").
The first step might be to get consensus on some specific tasks that fit this kind of framework and were better done by new users than by bots. It might be necessary to make sure that bots were likely to catch the kind of errors a new user might make in working such lists. DCDuring TALK * Holiday Greetings! 16:42, 24 December 2009 (UTC)
I've started User:Logomaniac/Sign up sheet, if other editors would please add helpful links for each language. L☺g☺maniac ☃ 20:06, 24 December 2009 (UTC)
Question: Is that list specifically for missing words, or can any tasks be placed there? --Yair rand 20:20, 24 December 2009 (UTC)
It would be better to keep tasks at WT:CDPR, where we could do with taking some of DCDuring's advice. Conrad.Irwin 20:26, 24 December 2009 (UTC)
(e/c) It is for any (semi- to large) tasks that need to be done. I understand that CDPR is for smaller things. L☺g☺maniac ☃ 20:57, 24 December 2009 (UTC)

There was a certain meta-logic to my thinking on this. Outpacing print translation dictionaries in a particular language brings us more utility than other resources; which brings more users; some of whom will stay and become editors. bd2412 T 21:09, 24 December 2009 (UTC)

The meta-logic is well worth some thought, too. I'm interested in tasks that improve en def quality where new users can help and learn something about en.wikt as they do it. Some of them might become long-term contributors too. Because this is en.wikt, English has a special role. I have focused on English definitions because they are important to both monolingual users and translation seekers and have quality problems which seem to have been hard to solve. Not everyone has the skills or desire to work in that direction, but we can reduce the skill requirement a bit by decomposing the tasks and serving them up as clean-up lists, a format that seems to direct effort fairly effectively in many cases.
I assume that the many non-en-N contributors here would like to be able to count on en.wikt to have good English definitions, usage examples, synonyms, glosses, and grammar information to facilitate their translation work and answer any questions about English. The quality of the English sections or the ambition to improve them must play some role in their being here.
One thing that I would really appreciate would be some feedback from those who are not en-N, en-5, or en-4 about the intelligibility of definitions, especially at the sense level. Perhaps some new sense-level clean-up templates like {{rfc-hunh}}, {{rfc-saywhat}}, or, more prosaically {{rfc-def}} would help. DCDuring TALK * Holiday Greetings! 23:15, 24 December 2009 (UTC)
I don't understand this premise at all. If we haven't got a good number of contributors for most languages as it is now, how are we supposed to "focus" on one language? People either contribute in their language of proficiency or not - some kind of "focus" proposal won't change that - that is, unless you mean getting people who don't speak the language to contribute too which will only lead to errors. Tooironic 22:40, 26 December 2009 (UTC)
Clearly there are choices in how folks spend their time. With negligible language skills outside of English I can still choose to facilitate work of translators by inserting {{trans}} (with glosses) and {{trans-see}} and clean up messes in the trans tables. I can clean up WT:ELE errors in entries outside of English.

With en skills of 2-3 someone could still evaluate English definitions for intelligibility and opine on the adequacy or relevance of usage examples.
At the level of shared resources, policies, and cleanup lists there are lots of choices that determine what is made easier or harder; encouraged or discouraged.
People ultimately do what they want and what they can. Most people want to do things that further a larger objective. Their view of the larger objectives, of how Wiktionary as a whole might get there, and of how they can best contribute is subject to discussion and change. In addition, what people can do changes as a result of what they learn, often by explicit choice. For example, I didn't own CGEL or Garner's before coming to en.wikt and didn't have much conscious knowledge about grammar. I chose to learn more to help more in those areas. Some of our more technically adept contributors have produced very helpful cleanup lists which facilitate correcting certain kinds of deficiencies. If a particular kind of deficiency doesn't capture enough of their interest, it will get corrected much more slowly. DCDuring TALK * Holiday Greetings! 23:32, 26 December 2009 (UTC)
<tongue-in-cheek> Why don't we go through all the [[Category:User lang]]s, pick out all the users that are anything-3, 4, 5 or N, and leave a message on their talkpage, You list yourself as proficient in this language, would you please care to contribute in it? </tongue-in-cheek> L☺g☺maniac ☃ 23:33, 26 December 2009 (UTC)
The bottom line is this: what is our value-added proposition? Why should people in need of a dictionary turn to us instead of a print dictionary, or one of the countless websites offering dictionary services? Granted, most of those websites are either electronic versions of existing print dictionaries, or regurgitations of public-domain dictionaries that we have already absorbed and improved upon. Still, what is it that we offer that will make us the go-to source of lexical information? bd2412 T 23:50, 26 December 2009 (UTC)
why we are better than other dictionaries: 1) We are always being improved instead of once every several years like print dictionaries 2) We are a wiki, which means it's easy for someone to fix a mistake or add needed information 3) we rock. :)
Just FTR: I think it would be a good thing to have editors working together on specific language(s). But it would probably be better as a smaller project, not a community-wide one, as there are many editors who won't know the language(s) in the spotlight. So if we started something in, say, French or Spanish, all the French or Spanish-speaking editors (es or fr-2 (3?) or above, probably) would be invited to participate. L☺g☺maniac ☃ 00:12, 27 December 2009 (UTC)

I think there actually is a way we could make this work, and have a temporary specific-language focus:

  1. Don't do it often. Have a specific month for working on a language once every six months or so, or maybe once a year.
  2. Make it an 'event', as in "February is French month on the English Wiktionary! You can help!" and try to get users (especially newbies and IP's) enthusiastic about the idea. Throw in a sitenotice and we've got a huge load of new users interested in helping.
  3. Advertise off-Wiktionary. Wikimedia has very many users who would be glad to help with an event related to their language. For example, in the case of Spanish, we could post a message to w:wt:WikiProject Spain, and wherever the Spanish translators would see it, and try to get the word around Wikimedia. Again, try to make it a Wikimedian event, make it "exciting".
  4. Big, colorful, and active. I think everyone knows what I mean.

Still, it would be difficult. On the other hand, if it's a huge success, that would be a seriously major victory for Wiktionary. --Yair rand 04:51, 27 December 2009 (UTC)

Sorry for taking so long to get back to this - I think that is a great strategy, and I think we should aim to try it out in June. If we can get the other Wiktionaries on board, and have that indeed be a Spanish month, in which several different Wiktionary projects work on adding Spanish content, we might be able to get a synergistic benefit of cross-posting from one Wiktionary to the next (by which I mean, someone posts a new Spanish word on Italian wiktionary, and then posts that same new word here so that we have a quick interwiki connection established). bd2412 T 03:40, 3 January 2010 (UTC)

How about a "month of the obscure language focus"? Several interested editors pick a rather obscure language (living or extinct), decide to learn it and add quality entries to Wiktionary (complete entries with pronunciations & inflections). And I'm not talking about regional or minority variants of "big" languages with (semi-)official status (which are rather easy to pick up once you learn the standard idiom), but of indigenous languages, languages with little or no literary tradition, or e.g. ancient extinct languages attested in a relatively small corpus of texts (Hmm, that would be pretty much all except Greek, Latin and Sanskrit). There should only be one condition: that there is at least one online reference work of it available (preferably a comprehensive grammar). And that a language is, of course, "obscure" (e.g. spoken by < 1 million people). There are a lot of high-quality research works (usually in PDF format) available on the Web, including audio recordings made during field trips as well as texts (songs, folk tales etc.), that can be utilized as learning material. There are countless indigenous languages of America, Africa, Australia, Caucasus, Siberia etc. to choose from, most of which are extremely interesting. One or two months should be enough to cover the basic lexicon (a few thousand basic words). If there is anyone interested in this kind of teamwork, please let me know (either here or on my talkpage), and I'll start a project page! --Ivan Štambuk 06:29, 3 January 2010 (UTC)

That is a noble idea, but what does more to increase the utility of Wiktionary to the world at large? I suggest that we become a resource for translations into common languages first as a means of drawing in more participants, and once we have done that, use the additional manpower to plow through more obscure tongues. bd2412 T 16:55, 3 January 2010 (UTC)

Name appendices

Perhaps Appendix:Names male-A, Appendix:Names male-B, etc. could be renamed to Appendix:Masculine given names/A, Appendix:Masculine given names/B, etc. --Daniel. 13:15, 24 December 2009 (UTC)

Support renaming. --Yair rand 19:55, 24 December 2009 (UTC)
Support, definitely a clearer and more intuitive naming scheme. bd2412 T 20:38, 24 December 2009 (UTC)
Support. —AugPi (t) 21:03, 24 December 2009 (UTC) (see below)
I prefer Appendix:Male given names/A which would be consistent with all uses of {{given name|male}} and its categories. (male: 5800, female: 5500, for interest) Conrad.Irwin 21:23, 24 December 2009 (UTC)
Good point: Support this alternative. —AugPi (t) 21:38, 24 December 2009 (UTC)(see below)
Support Appendix:Male given names. --Dan Polansky 12:58, 25 December 2009 (UTC)
Support original proposal and changing "male names" everywhere to "masculine names". Names are grammatically masculine. Names are not male, as they have no genitalia and do not reproduce; only the bearers of such names can be male. --EncycloPetey 21:42, 24 December 2009 (UTC)
But then, the categories are wrong! —AugPi (t) 21:53, 24 December 2009 (UTC)
I agree. The category names are wrong, and should be corrected. They were named incorrectly the last time we went through name cleanup, because the change was hurried through without adequate discussion. --EncycloPetey 22:01, 24 December 2009 (UTC)
Names in English have no gender, male names are those given to males. While I can quite happily agree that a masculine name is one given to a masculine person, it seems fairly pointless to change the tens of thousands of pages just to "correct" a perceived mistake. Conrad.Irwin 22:38, 24 December 2009 (UTC)
The mistake is more than merely perceived if in fact there are women who have these names. Which, in some cases, there are. bd2412 T 23:16, 24 December 2009 (UTC)
What do you mean? We called a girl at school Harry Potter (because she happened to look `exactly` the same as Daniel Radcliffe or something) it didn't make the name less male or the person less female. Conrad.Irwin 02:14, 25 December 2009 (UTC)
If a given name is used for both males and females, it is both a male name and a female name; I see no problem here. --Dan Polansky 12:48, 25 December 2009 (UTC)
I support Conrad's "Appendix:Male given names/A":
The term "male name" can be read in several ways, one of them being "name given to males" on the model of "tree name" or "person name", meaning that "male" in "male name" is read as a noun used attributively. And English has no gender, so "masculine name" as applied to, say, English "John" seems technically incorrect. --Dan Polansky 10:12, 25 December 2009 (UTC)
According to Wiktionary, one definition of masculine is "pertaining to male humans", so masculine given names is not technically incorrect when applied to English. --Daniel. 12:36, 25 December 2009 (UTC)
As far as "masculine" means "pertaining to male humans", "masculine name" is synonymous to "male name", while the term "male name" is less ambiguous than "masculine name" for the purpose of denoting "name given to males". In "masculine word", "masculine" clearly refers to grammatical gender, so "masculine name" seems per default to be read as a "name with masculine grammatical gender". For instance, "Praha" is a Czech feminine name, albeit a place name.
To apply "masculine name" in the sense "name with masculine grammatical gender" to "John" seems technically incorrect. --Dan Polansky 12:48, 25 December 2009 (UTC)
An afterthought, to expand on the point with "Praha": the point of dividing given names into male ones and female ones is to divide them by the gender of people to which they apply, not by the grammatical gender of the name. We have neither categories nor appendices on "Feminine place names" and "Masculine place names", not because these names can have no grammatical gender but because places have no gender. Yes, the gender of the person to which the name applies and the grammatical gender of the given name almost always coincide, hence this discussion in the first place. --Dan Polansky 13:05, 25 December 2009 (UTC)
Sorry, no, I have to disagree about "male name" and "masculine name" meaning the same thing. They do not. One is grammatically correct in English, the other is a misuse of "male". Further, per my comments below, John does have grammatical gender, and saying so is actually correct. --EncycloPetey 15:41, 25 December 2009 (UTC)
I am not saying that "male name" and "masculine name" mean, in one of their senses, the same thing; it is Daniel who implied this assumption in his argument above. I have only picked up his assumption, and, working from that assumption, shown what makes me think that even under that assumption "male name" seems preferable to "masculine name".
The claim that "male name" is grammatically incorrect seems implausible: from what I can see, it is an attributive use of the noun "male", on the model of "tree name" and "person name", a model that I have already mentioned. The claim that the term "male name" is grammatically incorrect is as yet lacking any proof, other than an attempt to interpret "male" as an adjective applied to biological entities, which "name" admittedly is not. --Dan Polansky 09:16, 26 December 2009 (UTC)
Let us see some Google results:
While mere web hit counts cannot indisputably demonstrate that something is correct grammar, they at least indicate that the users of the language do not consider it an issue. This seems relevant in the absence of an indisputable theoretical proof that the thing in question is not correct grammar. --Dan Polansky 09:37, 26 December 2009 (UTC)
Thank you for throwing out so many meaningless numbers. Now search "Male Arabic names" vs "Masculine Arabic names", or better still, look at what was actually returned in the first list of links you listed above. On the very first page are some hits for "masculine names" that lack the combination "male names" that you were supposedly searching for. Google numbers cannot be used the way you have tried to, and we've gone through the problems of using such numbers as support many, many times before. The Google search is not as strict as people naively believe, but searches also for things you might want, but didn't actually search for. It's also weighted in favor of sites with more pages, since it returns page links rather than sites, and does not weed out duplicates.
Now look at google hits for "he be sick", which has more than one million returns, but which every grammarian will agree is a problematic construction in English. Your arguments from Google numbers are irrelevant. --EncycloPetey 16:16, 26 December 2009 (UTC)
Support original proposal, per EP. —AugPi (t) 22:30, 24 December 2009 (UTC)
No, stay with male as English given names (and lots of others) don't have a gender. Male can be used to mean "for a male person" so to say that Peter is a male given name is correct, but not masculine. Not in English anyway. But yes move these somewhere better. Mglovesfun (talk) 14:31, 25 December 2009 (UTC)
Debatable as it is, I don't think French given names have genders, they just take the gender of the person they are referring to. While Martine is a female given name, if someone did call their son Martine people would say un Martine not une. 14:35, 25 December 2009 (UTC)
Given names do have gender, even in English. This is not to say that they have morphological endings specific to the gender, but that each name falls into an expected category of gender value, and takes corresponding pronouns that match this expected gender. These expectations affect understanding. If you saw the sentence, "Why can't Anna open her locker?" you would assume Anna was trying to open her own locker. On the other hand, if you saw the sentence, "Why can't Anna open his locker?" then you would assume Anna was attempting to open some other person's locker. This occurs because we expect that Anna is a feminine name. The expected gender of names in English affects understanding of the language, affects choice of pronoun, and so I maintain that English given names have gender. --EncycloPetey 15:36, 25 December 2009 (UTC)
The example that you have given only proves that the gender of the person called "Anna" can be securely inferred from "Anna", not that the term "Anna" has a grammatical gender. Yes, given the knowledge that Anna is a female name and not a male name, the inference can be made that in "Why can't Anna open his locker?" it is not the locker of Anna but of a male person. By contrast, in Czech, "osoba"--person--is a term of feminine grammatical gender, yet no conclusion can be drawn from it on the gender of the person denoted by "osoba". It is not the supposed grammatical gender of "Anna" but its being exclusively a female name that makes the inference on the gender of the person possible. --Dan Polansky 09:16, 26 December 2009 (UTC)
An afterthought: are you saying that "woman", "girl", "fiancée", "baroness", and "actress" have grammatical gender in English? Each of them can be substituted into the argument that you have built above; each is sufficient for an unambiguous determination of the human gender or sex of the referent: "Why can't the actress open his locker?" --Dan Polansky 09:45, 26 December 2009 (UTC)
You've misunderstood my argument. Nowhere did I say that Anna referred to a woman; the gender of the individual is not at issue. I said that assumptions about gender of the name affect understanding of the language and use of pronouns. That is what grammatical gender means. --EncycloPetey 16:07, 26 December 2009 (UTC)
I could have misunderstood your argument, but tell me: Are you saying that "woman", "girl", "fiancée", "baroness", and "actress" have grammatical gender in English? If they do not have grammatical gender, what makes them different from "Anna" for the sake of determination of the presence or absence of grammatical gender? Put differently, what feature, property, or quality that "Anna" has and "actress" does not have makes you think "Anna" has a grammatical gender? --Dan Polansky 10:48, 27 December 2009 (UTC)
EncycloPetey, I think your assertion is incorrect, and propose that people assume that "John" is male for the same reason they assume in "Alfred drank qwenkjqbn with his bakbjw" that "qwenkjqbn" is a liquid and that "bakbjw" is a consumable (food or drink, I'm not sure -- maybe we should include Klingon?). The same reason that I assumed Sandy was a male name, until I met a female Sandy. There is no grammatical magic here, it's merely our worldly experience; it is incidentally the reason that computers, lacking a large store of worldly experience, have enormous difficulty in reading text. I do agree with comments above that "male name" could be misconstrued, and that "masculine name" could be misconstrued, and would request the authors of such comments read a little more English, this will learn them how to use context to disambiguate multiple possible interpretations of terms. Both are arguably correct, and arguably confusing, "male" is simpler and shorter and requires less work to implement consistently, thus I prefer it. Conrad.Irwin 21:56, 26 December 2009 (UTC)
In your example "Alfred drank qwenkjqbn with his bakbjw", I could not assume that bakbjw was a consumable, because it could mean "mouth" or "straw". However, through the magic of grammar, I can assume that it is a noun. This is not a question of prior experience, but of grammatical context. If I say to you "That Gryldis sure likes his dog", then you assume "Gryldis" is masculine because of the grammar, and not because of any prior experience with a person named "Gryldis". This is what grammatical gender means. As a parallel real example, consider the Spanish noun teorema. From previous experience with Spanish (or related languages), you might assume from the terminal -a that the word is of feminine gender, but it isn't. The reason we say it's masculine is that it takes masculine articles, adjectives and pronouns, as in El teorema es correcto. It is grammatical context in relation to other words that identifies the gender to us.
Since "male name" is more likely to be misconstrued, the problem should be corrected, and not passed on simply because it will take work to fix. It took a lot of work to put the categories into their current shape, and we did it successfully. Now it's time to improve them. --EncycloPetey 22:40, 26 December 2009 (UTC)
But you agree, surely, that "qwenkjqbn" is a liquid? Is that grammar too? I can easily insert "tasty" before "bakbjw" to make it more clearly a food (I would not assume an "implement" on first reading because of the "his", normally it would be with "a" fork, though the "mouth" interpretation works fine, and I suppose if Alfred was edible he could have a tasty mouth, but I certainly would not assume such when reading). Sure, I can use the "his" to tell me that Gryldis is probably (but not certainly) a male; I strongly disagree with the assertion that it's a grammatical property of the name. As above, even when "Harry" referred to the girl at school (and this was certainly the male name Harry, short for Harry Potter), "Harry liked her dog" still uses "her" to agree with "Harry". It is a property of the referent and not the word. For the purposes of category naming, neither are hugely advantageous, and I will happily go with the flow, but I would like to get to the bottom of the "do names have gender" issue now we've started on it. I disagree that "male names" is more misinterpretable than "masculine names" because I don't accept your parse of the sentence. Conrad.Irwin 23:01, 26 December 2009 (UTC)
Being a liquid is not a grammatical property, but being a noun is. Adding "tasty" (an adjective) further supports the interpretation that it's a noun. The issues you have with referent/word are not as easily separable for proper names as for common nouns. Proper names have the characteristic of being applicable only to a particular referent. However, here's an example in English of a proper name having a grammatical gender, even though the referent does not: "Britannia's finest hour was her perseverance in the war." The proper noun Britannia takes the feminine pronoun her, but does not refer to a female entity. English names of countries and vessels are routinely treated grammatically as feminine in English, despite having no inherent biological gender. --EncycloPetey 23:12, 26 December 2009 (UTC)
Indeed, and I contest that, for English names at least, being "female" is not a grammatical property. I find the notion that "Harry" when used to refer to a girl has different grammatical properties to "Harry" used for a boy quite absurd - we were deliberately giving her a boy's name. Mglovesfun seems to indicate that he understands the same to hold for French names, the gender stays with the referent. I am quite happy to agree that poetically, ships, countries (and indeed almost anything else an author would like) can be referred to as female, I don't think that makes their names, or the words that describe them female ("The Thames gently lapping her banks", "The Thames broke its banks" - it's up to the author, not the language). Our usage notes agree with my (possibly naive) interpretation of the situation "# Ships are traditionally regarded as feminine and the pronouns her and she are normally used instead of it." Conrad.Irwin 23:43, 26 December 2009 (UTC)
And part of my point in this discussion is that being female is a biological property, never a grammatical one. So I agree that "being "female" is not a grammatical property". However, being feminine is a culturally and grammatically determined property. This is why I object to "male name" over "masculine name" &co. You do realize that the reason English treats names of countries as feminine is a holdover from a time when English did have complete grammatical distinction, and parallels Latin tendencies to name places with feminine names? Modern English has largely lost the grammatical gender distinction, but does retain it vestigially in its pronouns, in many proper nouns, and a few common nouns that refer to animals. --EncycloPetey 23:55, 26 December 2009 (UTC)
The problem is that neither "male" nor "masculine" is completely appropriate, "male" because it normally denotes sex, "masculine" because in a dictionary it normally denotes grammatical gender. Given names correspond to social gender, which is another thing entirely. You seem to think that, because English used to have grammatical gender, we can assume that socially gendered given names are a vestigial reflex of that; but I just don't think that's true. —RuakhTALK 00:01, 27 December 2009 (UTC)
Pairing with certain pronouns is a feature of grammar, not of social gender. Your statements about what I seem to think have missed most of what I've said to this point, and draw an incorrect conclusion about my reasoning, which is from demonstrated grammatical context, not from historics. History implies cause, not current status. --EncycloPetey 00:31, 27 December 2009 (UTC)
Ruakh is, to my mind, correct; and I thank him for stating what I feel so clearly. I can quite happily acknowledge that pronouns change their form depending on their referent, and can quite happily acknowledge that this is due to grammar; I simply have a huge problem with saying that because this Harry is a male, this "Harry" is a masculine word, and that that process is grammar. This may well be due to a lack of linguistic training on my part, in which case thank you for your attempts to explain. Conrad.Irwin 01:07, 27 December 2009 (UTC)
At least do me one courtesy (Conrad and Ruakh): What would you look for in a word (in any language) to determine (1) whether it had gender, (2) which gender it was, and (3) does your answer apply to both inflected and uninflected languages? For your convenience, please note that Wikipedia defines grammatical gender as "classes of nouns reflected in the behavior of associated words." --EncycloPetey 04:29, 27 December 2009 (UTC)
When I was learning French, it seems that the article used would give the gender, from my memories of Latin, the set of endings used dictated the gender - though I can see that this is probably a backwards answer. I suppose you could thus retroactively say that because "he" is used to refer to Harry, and "Harry" must agree with "he", "Harry" is masculine, but that sounds a bit convoluted to me. Conrad.Irwin 22:24, 27 December 2009 (UTC)
Name appendices — AEL
I reject the supposition that we can view gender as a property of "a word (in any language)". Some languages have noun classes, and of these languages, some have grammatical traditions whereby the noun classes are called "genders". In most of the latter group, a noun's gender is usually arbitrary, but frequently correlates with semantics (e.g., there may be one gender containing most words for men and another gender containing most words for women; if so, the former will likely be called "masculine" and the latter "feminine"), with phonological properties (e.g., Hebrew words ending in /-a/ are usually feminine, including new loanwords), and so on. But these concepts can't be transferred to languages like English that lack grammatical gender. English has a few gendered pronouns, but it doesn't have gendered nouns: a gender-marked pronoun's gender is not determined by its antecedent noun, but rather by its real-world referent. (Really, I'm not sure "gender" is even the right word to apply to these pronouns, but it's close enough, and I don't know of a better one.) —RuakhTALK 02:32, 28 December 2009 (UTC)
Ruakh, your argument is tautological. You are saying that names in English lack gender because English lacks grammatical gender. This is akin to a botanical discussion I was party to, where a botanist insisted structures on a certain species couldn't be stipules. When asked why, the botanist stated that members of that plant family don't have stipules, so those couldn't be stipules. He refused to accept that the axiomatic declaration was incorrect, despite evidence to the contrary. It is true that people say English doesn't have gendered nouns, but the evidence is that it does have them, as I have presented evidence that supports that view. True, most English nouns are ungendered, but there are some that are, and these take gendered pronouns of the same gender. This pattern is not limited by referent, as I have provided examples where the referent of the noun has neither sex nor gender.
Again, could you do me the courtesy of answering the question I've asked: "What would you look for in a word (in any language) to determine whether it had gender?" The answer you gave of "there may be one gender containing most words for men and another gender containing most words for women" seems to point to a referent-based argument, but you then rejected that idea. So, what would you look for in a language to determine whether a noun had gender? --EncycloPetey 03:00, 28 December 2009 (UTC)
Re: "You are saying that names in English lack gender because English lacks grammatical gender": Not quite. I am saying that English lacks grammatical gender, so it's meaningless to try to assign genders to names. We can just as well say that "Bob" is a feminine proper noun because it always takes feminine adjectives (which in English are identical to masculine ones). My basis for claiming that English lacks grammatical gender is that I am not aware of any feature of English that is best explained that way.
Re: "True, most English nouns are ungendered, but there are some that are, and these take gendered pronouns of the same gender": No, I don't think so. English nouns are ungendered; most correlate very strongly with a specific "gender" of referent, and therefore tend to co-refer with pronouns of that "gender", but the nouns themselves are without gender. Desk always (presumably?) has an inanimate referent, so goes with it; mother and Anna usually have female human referents, so go with she/her (though they sometimes have female non-human referents, in which case it is also possible); and so on. Not a grammatical property, but a semantic property with secondary grammatical consequences.
Re: "This pattern is not limited by referent, as I have provided examples where the referent of the noun has neither sex nor gender": Sorry, please re-provide them. I can't figure out which of your above comments provides these.
Re: "the answer you gave of 'there may be one gender containing most words for men and another gender containing most words for women' seems to point to a referent-based argument, but you then rejected that idea": What I was saying is that when noun classes exist, the referents of each class of nouns are typically a factor in choosing the terms "gender", "masculine", "feminine", and so on (or in not choosing them). In a language like English, where noun classes don't exist, we can still apply terms like "masculine" and "feminine" and such, but they no longer indicate grammatical gender — and in a multilingual dictionary, that's a confusing thing to do, because usually we use those terms only in reference to grammatical gender (e.g., we call French professeur "masculine", even though it can refer to a woman just as well as to a man).
By the way, even if you wish to maintain that English has a grammatical distinction between masculine and feminine, surely you must admit that there are some languages that do not. Do you suggest that we simply remove "male" and "female" from descriptions of Finnish, Hungarian, Persian, etc., given names, and leave our readers in the dark about the sex of those names?
—RuakhTALK 04:10, 28 December 2009 (UTC)
I begin to get the feeling that you aren't going to answer my question. You've posted a long reply, but still have not answered my question that has twice been directed to you: "What would you look for in a word (in any language) to determine whether it had gender?" --EncycloPetey 04:31, 28 December 2009 (UTC)
What would I look for in a word? I'd look for its language. If its language is a language with gender, then it has gender; otherwise, it doesn't. There, a simple answer. Happy? —RuakhTALK 13:23, 28 December 2009 (UTC)
Simple and circular. You still have presented no criteria for actually making a determination, but have merely deferred the question to "How do you know if a language has gender?" Stacking turtles does not solve the problem. --EncycloPetey 16:54, 28 December 2009 (UTC)
Not circular at all; I've merely replaced what I consider to be the wrong question (yours) with what I consider to be the right question (which you've now correctly identified). It is meaningless to consider a word in isolation and say that it has gender; gender is a property of an entire language, not of an individual word. Anyway, the answer to the right question is that a language has gender if it has a small number of noun classes, and it is reasonable to label these classes with terms like "masculine" and "feminine" (and sometimes "neuter"), or "common" and "neuter", or the like. (There can be borderline cases where it's not clear if a language's noun classes are best considered "genders" — in those cases, we just have to do our best to reflect tradition and linguistic consensus — but the entire question is moot if the language doesn't have noun classes to begin with.) Next question: "How do you know if a language has noun classes?": A language has noun classes if other words (such as adjectives, determiners, verbs, etc.) change form to agree with the choice of noun. For example, in French, if I refer to something as une chose ("a thing"), then any associated adjectives will be inflected for feminine gender, whereas if I refer to it as un objet ("an object") or un truc ("a thingy"), then any associated adjectives will be inflected for masculine gender. Note that this never happens in English; no words inflect for gender, and while we have a few gendered pronouns (he/him/his/himself, she/her/hers/herself, and it/it/its/itself), the choice of which pronoun to use is not determined by its antecedent noun, but rather by its real-world referent. —RuakhTALK 18:29, 28 December 2009 (UTC)
RE: "Note that this never happens in English": the experts at Oxford and Cambridge disagree with you on that point. They state that gender is not a matter of inflection, and that English pronouns and some English nouns do indeed have gender. See my lengthy reply below. --EncycloPetey 18:39, 28 December 2009 (UTC)
Thanks for that information; it's very helpful. (To be sure, they do agree with me on the specific point that you quote, but I definitely see what you're saying.) It looks like the CGEL applies "gender" to English in a few different ways, at times using terms like "personal" and "non-personal" (in reference to who, what, which, etc.) and at other times terms like "masculine", "feminine", and "neuter". However, and I think this is important, it doesn't look like the CGEL ever describes any noun as having a specific gender; it seems to apply those terms only to the pronouns. (We can certainly infer gendered nouns from its examples — e.g., it has a King/himself example with the word "masculine" off to the right — but it itself never seems to make that hop.) But please correct me if I'm wrong; I'm one of those lame-os who hasn't bought a copy yet, so am going from the preview on Amazon.com. —RuakhTALK 19:24, 28 December 2009 (UTC)
Actually, they do in the section entitled "common noun gender classes" beginning on page 489. They also treat the issue of nouns which can associate with either a masculine or feminine pronoun (tutor is used as an example), and they say that because the association with gender of the pronoun is not consistent, the noun tutor has no encoded gender. They classify these nouns as "dual-gender masculine/feminine". However, the discussion you're really looking for begins on page 490 with the section on "single-gender masculine nouns". They say of this class (in part): "This class contains man [] ; various kinship or similar terms involving marriage relations; a good number of occupational terms compounded from man; and names of various social ranks such as duke, count, squire." In all, they list seven noun classes in English. --EncycloPetey 21:51, 28 December 2009 (UTC)
Butting in, unindenting and reiterating my question that likely got lost above, to EncycloPetey:
  • Q1: Are you saying that "woman", "girl", "fiancée", "baroness", and "actress" have grammatical gender in English?
  • Q2: If they do not have grammatical gender, what makes them different from "Anna" for the sake of determination of the presence or absence of grammatical gender? Put differently, what feature, property, or quality that "Anna" has and "actress" does not have makes you think "Anna" has a grammatical gender? --Dan Polansky 09:03, 28 December 2009 (UTC)
The question wasn't lost; I'm just waiting for someone to answer my question in which I'm trying to get at an operational definition of grammatical gender. So far, only Conrad has made a real attempt to answer the question. I'm now waiting for a serious answer from Ruakh, but you are welcome to reply as well. "What would you look for in a word (in any language) to determine whether it had gender?" Once I have an answer to that, then a discussion can continue. Otherwise, we'll keep working past each other. --EncycloPetey 16:54, 28 December 2009 (UTC)
For me, it would depend on the language. In Spanish, I'd look at the article used with a noun and possibly the ending of the word to determine the gender of the noun. (And I don't know any other languages to give examples in them..... and English just uses 'the' for everything) L☺g☺maniac ☃ 17:07, 28 December 2009 (UTC)
OK, but what if there were no article used, and the noun ended in -z? (both are very real possibilities in Spanish, and I am trying to go somewhere with this question; it relates to Conrad's answer above) --EncycloPetey 17:13, 28 December 2009 (UTC)
I would look the word up in a dictionary. And if I couldn't find it in a dictionary I wouldn't use the word. :p L☺g☺maniac ☃ 17:30, 28 December 2009 (UTC)
So the gender of a word is determined by proclamation from a published authority? Really? But we are a dictionary, so we need to know the criteria that a dictionary writer would use in making the determination, and to employ those criteria ourselves. This is what we do when we write definitions based on collected citations. Even if a published dictionary lacks the sense we want to add, if we have amassed evidence for that sense, then we include that sense. My question then is: what evidence do we look for to justify assignment of gender to a noun? Yes, an article pairing can help, and sometimes the ending gives a clue (but can be uninformative or misleading, even in Spanish). So, are there any other things we could look for, and what are they? The other things I would look for are pairings with certain adjectival forms and certain pronoun forms. This is essentially what the Wikipedia definition of "grammatical gender" says, and it's correct. Grammatical gender is determined by the behavior of other associated words.
The Oxford Companion to the English Language says that in languages with gender "these parts of speech when used together must agree in gender". It is the consistent pairings of different parts of speech that mark gender groups. The OCEL also says: "In English, grammatical distinctions of gender are mainly confined to the third-person singular pronouns," and "Some natural-gender distinctions between pairs of nouns show a derivational relationship [] but most have no morphological connection. Some feminine endings are criticized as pejorative and sexist [] In recent years, conscious attempts have been made to use the unmarked or masculine term for both sexes." The OCEL explicitly acknowledges the existence of gender in English, but notes it is largely confined to third-person pronouns, and to a few noun pairs associated with natural-gender distinctions.
The Cambridge Grammar of the English Language discusses gender on pages 484-499. It notes that "Gender is not an inflectional category in English. Gender classes can be differentiated only on the basis of relations with pronouns [] " They do point out that " [] some linguists argue that English simply has no gender system, that the category of gender is irrelevant to English. That is not the view we take here: we regard the differences between English and French or German as a difference in the degree to which gender is grammaticalised in these languages, not in whether or not they have a category of gender." (link provided solely for entry creation purposes) The CGEL then proceeds to describe and differentiate the gender of pronouns, and the common noun gender classes in English. They do not, unfortunately, address the issue of gender for proper nouns, except for the (long-since) aforementioned application of the feminine gender to nations and ships, but their treatment of what constitutes a proper noun (or what they call a proper name) is lacking in other regards as well and sticks principally to descriptions of usage rather than delimitation of defining universal properties.
Thus, linguistics experts in English for both Oxford and Cambridge agree that there is gender distinction in English, and the CGEL unambiguously states that some English nouns have grammatical gender. --EncycloPetey 18:30, 28 December 2009 (UTC)
(e/c) I only told you what I would do. I am not a linguistic expert and I don't expect to know what I'm supposed to do to figure out the gender of a word. I'm a teenager and I know that dictionaries usually know what they're talking about. Frankly, I don't care much how they know that, I'm just glad they do. L☺g☺maniac ☃ 19:39, 28 December 2009 (UTC)
Sorry if the reply sounded as if it were directed specifically at you and only you. This is the line of reasoning I was hoping someone would point to, and you just happened to be one of the first to get there. This was a reply to several questions that had been posed above by several individuals, and it seemed best to express the whole shebang in one go, rather than do it piecemeal or repetitively. --EncycloPetey 21:51, 28 December 2009 (UTC)
If I want to know the gender of a Portuguese noun, I'd look for how it may be substituted or accompanied by pronouns, articles or most adjectives. (The pronouns o, ele, os, eles, seus, esse, aquele, among others, indicate masculine context, so when they refer correctly to any noun, this noun is masculine; and while most adjectives have regular inflectional suffixes that indicate gender, the other adjectives are clearly unclear - that is, they have suffixes which don't indicate any gender.) --Daniel. 18:05, 28 December 2009 (UTC)
Exactly. Please also read the reply to Logomaniac and Dan Polansky, immediately above your post (at the moment). It agrees with you and adds information from expert sources about the nature of grammatical gender in English. --EncycloPetey 18:30, 28 December 2009 (UTC)
Okay, so the question was not lost, but you are not yet giving any reply to it.
I only speak Czech, German, and English, so I cannot be sure that an answer that I provide on how to recognize gender applies to other languages.
I see the following effects that the grammatical gender of a gender-bearing noun has on its surroundings:
  • (a) choice of a gender-specific article (German: "die Sonne" - feminine, "der Hund" - masculine)
  • (b) choice of a gender-specific modifying adjective (Czech: "čisté nebe" - neuter, "čistá studánka" - feminine)
  • (c) choice of a gender-specific verb (Czech: "žena běžela" - feminine, "čas běžel" - masculine)
  • (d) choice of a gender-specific pronoun
The gender of a gender-bearing noun can be determined from the following context-free markers:
  • (e) the ending of the nominative form; this alone often does not suffice ("kočka" f, "předseda" m)
  • (f) the inflection pattern of the noun; the set of all its inflected forms (singular datives "kočce" and "předsedovi" reveal the gender of both)
From the listed effects and markers, English nouns can be speculated to have (d) and in part (e). But for this effect, seen in the sentence "Why can't the actress open his locker?", there is an alternative explanation: it is the gender of all the potential referents of "actress" that guides the choice of the pronoun, not the grammatical gender of "actress". I see no evidence in the given sentence to the contrary. Given that most of the gender markers and effects are absent in English nouns, the explanation through the gender of the referents seems more plausible to me than the claim that "actress" is an English noun of feminine grammatical gender.
A conspicuous feature that distinguishes effects of grammatical gender from object gender or referent gender is the disagreement between the two, as in Czech feminine noun "osoba" when applied to a male, or Czech neuter "děvče" and German neuter "Mädchen", both meaning "girl". I am unaware of any such disagreement in English.
Can you now please indicate whether "actress" has a grammatical gender by your account, so we may start "Category:English feminine nouns" or "Appendix:English feminine nouns"? I think there are going to be quite a few. --Dan Polansky 09:42, 29 December 2009 (UTC)
I have answered that question in the three lengthy paragraphs above, where I support my position on gender of English common nouns with information from the OCEL and CGEL. Although many languages do have disagreement between the biological gender of the referent and the grammatical gender of the word, this is not a linguistic requirement. General agreement in this matter determines the choice of category label only, and not the existence or absence of the category. For example, Dutch nouns (since their 20th-century language reforms) are now all considered "gendered" (common) or "neuter", although some traces of masculine/feminine distinction do still exist, such as among the third-person singular pronouns.
As both the OCEL and CGEL indicate, gender in English is heavily reduced compared to other Germanic and IE languages, but it does exist. The CGEL offers seven gender classes of English nouns, although this is a result of simultaneously considering all possible genders a word may have. So their classes distinguish between words that are "feminine (only)" and "masculine/feminine" and "feminine/neuter", etc. I don't think categories based on that level of distinction are needed for our purposes, although an appendix might cover it. A Category:English feminine nouns could be a good idea, though, as there are some English nouns that regularly take feminine pronouns. --EncycloPetey 20:01, 30 December 2009 (UTC)
The word "actress" does not appear in your answer above, so I assume that your answer to the specific question is implied, and that it is that, yes, "actress" is a feminine noun, along with "woman", "girl", "fiancée" and "baroness".
Other than the authority of OCEL and CGEL, I still see no evidence that the fact that the gender of the pronoun is governed by the noun is best explained through grammatical gender rather than referent gender. And even CGEL say, quoting from above "some linguists argue that English simply has no gender system, that the category of gender is irrelevant to English. That is not the view we take here: ...", so this is at least a disputed issue among linguists. I have presented some considerations that lead me to side with those linguists who argue that English simply has no gender system, that the category of gender is irrelevant to English. CGEL takes a different view, but they do just that: take a view, rather than delivering a faultless and indubitable proof.
On another note, I doubt that the social gender rather than biological sex is at stake. Given names are allocated to newborns, who have sex but not yet any social gender; the gender-specific socialization into a group of same-sex people showing gender-specific expectations about the behavior of the group's members is yet to happen to the newborn.
If I get it correctly from the web search, "male gender" refers to social gender as often as "masculine gender" does. So it seems untrue that "male" automatically selects the context of biology and biological reproduction rather than social gender and its expected gender role. Searches: google:"male gender role", google:"masculine gender roles". This is unlike in Czech, where the sex of a human person is labeled as "mužský" and "ženský", while the sex of animals is "samčí" (male, of animals and plants) and "samičí" (female, of animals and plants). --Dan Polansky 08:53, 31 December 2009 (UTC)
  • Support renaming (with "male" wording, preferably). Ƿidsiþ 00:10, 27 December 2009 (UTC)
  • Support renaming Appendix:Male given names/A. Oppose renaming Appendix:Masculine given names/A. By the way, these appendices would be much more useful, and worth the tremendous effort User:Alasdair has put into them, if somebody could create bots:
1. To check all the given names and surnames in given name/surname categories and add the missing ones into the appendices, making the appendix work as an index;
2. To make all the hidden remarks, like [[Aach]] <!-- Frisian, NL, dim. of Agatha --> into [[Aach]] <small> Frisian, NL, dim. of Agatha </small> . The names are defined by country, not language, and there are references to TV personages etc, but I've understood that the CFI for appendices isn't as strict as for actual entries. A preface could be added. In any case nobody can check these remarks as long as they are invisible.--Makaokalani 13:16, 28 December 2009 (UTC)
  • Support renaming but keeping the male wording. It's one syllable and easier to say. I don't really care about the grammatical whatnot...... L☺g☺maniac ☃ 17:07, 28 December 2009 (UTC)
  • Support male wording per Logomaniac. --Yair rand 18:32, 28 December 2009 (UTC)
  • I support renaming to either of the proposed titles.​—msh210 20:20, 28 December 2009 (UTC)
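A minimal sketch of the second bot task suggested above (making the hidden remarks visible), assuming the appendix text is already available as a string and that each remark is a single-line HTML comment directly after a wikilink. The function name reveal_remarks is hypothetical, and no real bot framework is used here; fetching and saving pages is left out.

```python
import re

# Matches a wikilink followed by a hidden one-line HTML comment, e.g.
#   [[Aach]] <!-- Frisian, NL, dim. of Agatha -->
# Group 1 is the link, group 2 is the remark with surrounding spaces trimmed.
HIDDEN_REMARK = re.compile(r"(\[\[[^\[\]]+\]\])\s*<!--\s*(.*?)\s*-->")


def reveal_remarks(wikitext: str) -> str:
    """Rewrite hidden remarks after a link as visible <small> text.

    Hypothetical helper: comments that do not directly follow a link,
    or that span several lines, are left untouched.
    """
    return HIDDEN_REMARK.sub(r"\1 <small>\2</small>", wikitext)


line = "* [[Aach]] <!-- Frisian, NL, dim. of Agatha -->"
print(reveal_remarks(line))
```

Restricting the pattern to comments that follow a link is a deliberate choice in this sketch: it avoids exposing unrelated editorial comments elsewhere on the page.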

Daniel., EncycloPetey, AugPi and BD4212 (and supposedly Yair rand initially, but not currently) prefer Appendix:Masculine given names/A while Conrad.Irwin, Dan Polansky, Mglovesfun, Ƿidsiþ, L☺g☺maniac and Yair rand (and AugPi at some point, but not currently) prefer Appendix:Male given names/A. Apparently, Ruakh stated that neither is appropriate, and msh210, that both are appropriate. Up to this date, this informal vote results in consensus for renaming, as everybody supports either the male or the masculine version. But on the choice of wording the 6-4-2 tally gives no consensus, since there are six "votes" for male, four for masculine, and two abstentions. Personally, I think EncycloPetey presented very good reasons based on grammatical evidence, so I'll take the liberty to rename appendices to the masculine version. Furthermore, please see Wiktionary:Votes/2009-12/Masculine and feminine given names. --Daniel. 23:13, 28 December 2009 (UTC)

Most of that paragraph reads like an explanation of why you're about to take no action — and then I get to "Personally, I think [] , so I'll take the liberty to rename appendices to the masculine version". Huh? Please refrain. And if you're going to rename them, why start a vote about it?​—msh210 23:19, 28 December 2009 (UTC)
The vote is intended to reach a decision about the treatment of given names in the whole Wiktionary, not only the 131 appendices discussed here. In fact, no one expressed in this discussion reasons to keep the previous scheme Names male-A, so I concluded that any renaming would be suitable until the vote reaches a specific conclusion. Since you and Yair rand objected to my recent action, I've restored the appendices to the Names male-A scheme. The exception is the surname appendices, as Appendix:Surnames/A doesn't seem objectionable yet. --Daniel. 02:39, 29 December 2009 (UTC)
The renaming that has the majority support in the voting so far, by a narrow margin, is "Appendix:Names male-A" --> "Appendix:Male given names/A". This is a renaming that the proponents of the option with "masculine" should have no problem with, as it does not make things worse from their point of view, only equally bad. --Dan Polansky 09:56, 29 December 2009 (UTC)
Your opinion about my opinion is incorrect. Any edit that propagates an error makes things worse. I object strongly on the grounds that male is primarily a descriptor of biological factors pertaining to sex, while masculine is primarily a descriptor of cultural/grammatical factors pertaining to gender. Names are cultural and/or grammatical, not biological. --EncycloPetey 18:49, 30 December 2009 (UTC)
I did not mean to misrepresent you. I do not see how the renaming propagates the error: before the renaming, "female" is there, while after the renaming, "female" is still there, so the number of page titles in which this error-would-be occurs remains constant. I don't see how this can be called a propagation. Be that as it may, I now accept that you prefer no renaming to renaming that uses "female", or that is what I understand from your reply. --Dan Polansky 08:35, 31 December 2009 (UTC)

Misspellings

See Wiktionary talk:Spellings#Misspellings. Mglovesfun (talk) 15:00, 25 December 2009 (UTC)

etait

Isn't this a bad redirect (to était), just like Raised et al? AFAICT our search will allow users to find words with diacritics they can't type. That's what I used to do before I learnt a large number of alt codes. Mglovesfun (talk) 20:54, 25 December 2009 (UTC)

It's a good example of a time to use {{DEFAULTSORT:etait}}, Mglovesfun (talk) 10:23, 26 December 2009 (UTC)

Mandarin, zh

Unfortunately, the zh as Mandarin looks bad at WT:LANGTREAT. This language code is officially ISO 639-1 for Chinese, which would make more sense in that list. Since the community seems to not want a Chinese language in any context, my work is done. --Daniel. 21:48, 25 December 2009 (UTC)

Because Wikimedia used "zh" for the Chinese Wikipedia (which happens to be written in Mandarin), we have a huge mess: Firstly, "zh" is widely used (here) to denote both Mandarin and Chinese, secondly (perhaps understandably) Chinese is used to mean Mandarin. The only sensible way to proceed is to create a new template ({{zho}} perhaps), to represent Chinese, and then, using {{cmn}} we could (manually) remove all instances of the horribly broken, ambiguous and misused {{zh}}. I do not foresee there being many things for which {{zho}} can be correctly used, as you note at WT:LANGTREAT we treat "Chinese" as a separate language, (it should certainly not be labelled as Mandarin there). Conrad.Irwin 22:19, 26 December 2009 (UTC)
In my opinion, ideally {{zh}} would be our code for Chinese. According to WT:LANGCODE, {{zhx-zho}} seems a good alternative. I've moved the code {{chinese-language}}, which you created, to {{zhx-zho}} and updated WT:LANGCODE and WT:LANGTREAT. Thanks. --Daniel. 04:29, 27 December 2009 (UTC)

Editable policy

I'd like to move Wiktionary:Editable CFI to Wiktionary:Criteria for inclusion/Editable and create the shortcut WT:CFIE for it. In addition, the same for WT:ELEE. --Daniel. 14:07, 26 December 2009 (UTC)

Seems fine. Wiktionary:Editable ELE was recently created. --Bequw → ¢ • τ 23:36, 26 December 2009 (UTC)
Done. Perhaps non-editable WT:CFI should link to WT:CFIE and non-editable WT:ELE should link to WT:ELEE. Unless a vote is required, since they're non-editable. --Daniel. 04:50, 27 December 2009 (UTC)

I've edited the header of WT:CFIE to link to WT:CFI. May I do the same at WT:CFI to link it to WT:CFIE? --Daniel. 14:29, 31 December 2009 (UTC)

WT: redirect to Wiktionary:, WS: redirect to Wikisaurus:

I propose that the WT: namespace redirect to the Wiktionary: namespace and that WS: (not yet a namespace) redirect to the Wikisaurus: namespace in the same way that WP: redirects to Wikipedia in the English Wikipedia. I suggested this a while ago in the Grease Pit and I can see no problems with this. The Simple English Wiktionary recently decided to redirect the WT: namespace to Wiktionary: there. --Yair rand 00:18, 27 December 2009 (UTC)

Sounds sensible to me. --EncycloPetey 00:28, 27 December 2009 (UTC)
Yes please. Conrad.Irwin 22:10, 27 December 2009 (UTC)
I've started the vote here. --Yair rand 23:55, 28 December 2009 (UTC)
I like this idea, except that WS: used to be the shortcut fakespace for Wiktionary: and some links still exist (e.g. to WS:BP). I doubt any links exist to WS:foo where foo is something that we're likely to have in Wikisaurus: space, since the WS: pages were fullcaps. But just in case some do, WS: for Wikisaurus: may not be the best idea. Is it possible to analyze a dump for current links to WS: pages? And do we want to worry about old revisions' links?​—msh210 18:14, 29 December 2009 (UTC)
The only old WS: links that I can see that could possibly end up being Wikisaurus entries are WS:NEWS, WS:WIN, WS:cuts, and WS:CAT, all of which are only linked to from User:Dcljr/Sandbox. --Yair rand 20:46, 29 December 2009 (UTC)
Thanks, that eases my mind. How'd you get a list of all linked-to WS: pages?​—msh210 18:36, 30 December 2009 (UTC)
I think Special:PrefixIndex/WS: might do it. Mglovesfun (talk) 18:43, 30 December 2009 (UTC)
There should be no, or few, links to WS: pages left. I removed them all in January 2008. (except to WS:OP which was kept as it is linked to in block summaries). Conrad.Irwin 19:36, 30 December 2009 (UTC)
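On the question above of whether current links to WS: pages can be found by analysis, here is a minimal sketch. It assumes the page titles and wikitext have already been extracted from a dump into (title, text) pairs; reading the dump itself is not shown. The function name find_ws_links is hypothetical.

```python
import re
from collections import defaultdict

# Matches the target of [[WS:TITLE]] or [[WS:TITLE|label]].
# Group 1 is the link target up to a pipe, section anchor or closing bracket.
WS_LINK = re.compile(r"\[\[\s*(WS:[^\[\]|#]+)")


def find_ws_links(pages):
    """Map each linked-to WS: target to the sorted titles that link to it.

    `pages` is any iterable of (title, wikitext) pairs; how those pairs
    are obtained from a dump is outside this sketch.
    """
    targets = defaultdict(set)
    for title, text in pages:
        for match in WS_LINK.finditer(text):
            targets[match.group(1).strip()].add(title)
    return {target: sorted(titles) for target, titles in targets.items()}


sample = [
    ("User:Dcljr/Sandbox", "See [[WS:NEWS]] and [[WS:CAT|cats]]."),
    ("Some other page", "Nothing relevant: [[Wiktionary:Beer parlour]]."),
]
print(find_ws_links(sample))
```

Old revisions would need the same scan over a full-history dump, which this sketch does not attempt.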

Harry Potter terms

The small list Appendix:Glossary of Harry Potter terms includes Veritaserum, which I don't think meets CFI. Should this appendix stick to CFI and list only verifiable terms such as muggle and Voldemort or include universe-only terms such as Aguamenti (spell for creating water), Chaser (attacker position in the sport Quidditch) and Pensieve (artifact for keeping and organizing thoughts and memories)? --Daniel. 04:34, 27 December 2009 (UTC)

If veritaserum is mentioned in three works, then it meets CFI for inclusion in a fictional universe appendix. There are a host of books and articles written about the Harry Potter universe, and published reviews and summaries of the films. I'd be genuinely surprised if veritaserum were not mentioned in three works. --EncycloPetey 04:52, 27 December 2009 (UTC)
I see. Then, virtually all spells, creatures, places and items from Harry Potter universe deserve a Wiktionary entry and a place at that appendix. --Daniel. 05:40, 27 December 2009 (UTC)
No, see Wiktionary:Criteria for inclusion/Fictional universes. The criteria for a place in an appendix are not the same as for an entry in the main namespace. --Yair rand 05:42, 27 December 2009 (UTC)
Understood. Then my statement about "all spells, creatures, places and items from Harry Potter universe" is true only for appendices; and in a more restrictive manner, as places such as Azkaban are not permitted as solely in-universe context. I did not find further restrictions, so perhaps an appendix with 493 Pokémon names multiplied by four official languages would also be possible. Interesting. Thank you. --Daniel. 06:05, 27 December 2009 (UTC)
Appendix:Pokémon done, with basic information. References shall be added soon. --Daniel. 09:23, 27 December 2009 (UTC)
You might consider adding links to any WP articles about specific Pokémon. --EncycloPetey 19:50, 30 December 2009 (UTC)
Not a good idea. WP has 14 pokemon-specific articles at the moment, a number which changes ridiculously frequently due to disputes about whether there's enough "real-world information" to be an article. --Yair rand 19:59, 30 December 2009 (UTC)
While Wikipedia doesn't currently contain a high or stable quantity of individual articles about specific Pokémon, it does contain valuable information on the subject. As you can see at, for instance, Appendix:Pokémon/E, Wikipedia links were added where appropriate. --Daniel. 15:51, 7 January 2010 (UTC)

Change Ido plural category to Ido noun form category

Hello there all. I would like to ask the community what you think about changing the category where the Ido plurals go from Ido plurals to an Ido noun forms category, to standardize it with what the other languages are doing. Is this a good idea? I await your feedback! Razorflame 19:27, 28 December 2009 (UTC)

I think the only noun forms that Ido has are plurals, right? Noun forms is used for highly inflected languages like Greek, Russian and Latin. Mglovesfun (talk) 19:29, 28 December 2009 (UTC)
Oh, ok. I didn't know that. Thanks for the feedback! Cheers, Razorflame 19:38, 28 December 2009 (UTC)
To expand a bit: if only nouns have plurals, nouns have only singulars and plurals (no cases or states or anything, and no other numbers), and the singular forms are the lemmata, then there's no problem with just "<langname> plurals". —RuakhTALK 19:43, 28 December 2009 (UTC)

English plurals

[[Category:English plurals]] is supposed to contain nouns only - this statement is not easily affirmed through intuitiveness, its current talk page, current description nor through analyzing its 67,248 members. This statement may even be incorrect, as there are English plural verb forms and pronouns. --Daniel. 20:17, 28 December 2009 (UTC)

Where does it say "[[Category:English plurals]] is supposed to contain nouns only"?
There are only a few plural pronouns--"they", "them", "their"--and a few verb forms--"are" of "to be". What have I forgotten to list?
The talk page Category_talk:English_plurals says nothing of interest.
The decision for a category to contain something rests with the creator of the category; if the creator decides to exclude plural pronouns from [[Category:English plurals]], I see no problem with that.
What is the problem that you are trying to address, and what solution do you propose? --Dan Polansky 10:42, 29 December 2009 (UTC)

Standardizing Finnish noun templates

Hello there all. It has come to my attention that some of the Finnish noun entries use {{infl|fi|noun}} and some of them use {{fi-noun}}. I believe that we need to pick one or the other and stop using the other one to standardize and make uniform our entries. This process could easily be done by bot, using AWB with a preset list of the Finnish nouns and enabling the find and replace function to either find and replace {{infl|fi|noun}} with {{fi-noun}} or the other way around. I thought that I might get others' thoughts on this matter first? Razorflame 10:42, 29 December 2009 (UTC)

I know nothing about the Finnish noun entries, and I hope nothing is done along the lines you suggest without input from those who work in Finnish. But I can speak from experience that sometimes I use {{infl}} for an English or Hebrew noun — even though specialized templates exist — because the templates are not quite suited for the particular noun. For example, English pluralia tantum don't accommodate {{en-noun}} well (or, if they do, I don't know how), so I use {{infl}}.​—msh210 18:09, 29 December 2009 (UTC)
I don't plan on doing anything until both Jyril and Hekaheka have said their piece here. I understand that there are particular nouns that don't work with the general template, and we could maybe make a list of those nouns and they would be put on the exception list so that they don't get changed. The whole idea of this was to help centralize and standardize the Finnish entries so that they all are the same without much variation (in terms of layout) so that people don't get confused with why one of the entries is different than the others. Razorflame 18:12, 29 December 2009 (UTC)
Again, I don't know Finnish, but I'd assume that if some class of nouns can't use {{fi-noun}}, then that class is open in some sense, and there may be nouns that belong on such a list that no one will think to put on it.​—msh210 19:54, 29 December 2009 (UTC)
So maybe the better thing to do is change all of the Finnish nouns from {{fi-noun}} to {{infl|fi|noun}} then? Razorflame 19:56, 29 December 2009 (UTC)
No. {{fi-noun}} provides conveniences and standardized layout. You can make a horribly un-standard inflection line with {{infl}} as it allows for so many (unnamed) parameters. Language-specific inflection line templates evolve, gradually being able to encompass more and more classes of words. Those that can be appropriately converted from {{infl|fi}} to {{fi-noun}} should be converted and {{fi-noun}} should be updated to encompass more and more classes of nouns. There is no utility in standardizing on the most base template. --Bequw → ¢ • τ 20:48, 29 December 2009 (UTC)
I would agree with you there, but I would have no idea about where to begin trying to update {{fi-noun}} to include more and more classes of Finnish nouns as I am not a template person. Razorflame 20:52, 29 December 2009 (UTC)

Based on some earlier discussions (probably with Jyril but I'm not sure) I have regarded <infl> -template as the standard to be strived towards, and systematically changed all <fi-noun>'s that I have encountered to it. Now I understand that Razorflame would like to do the other way round. I do not understand the finesses of the template business and I have used both in their basic form only adding an occasional <|p> in plurale tantum -entries, which by the way only works with <infl>. I'm willing to go either way, but let's start by writing down the pros and cons of each. --Hekaheka 05:50, 30 December 2009 (UTC)

Currently the main reason why I prefer <infl|fi|noun> is that the robot (or whatever) that creates this page: [[9]] automatically interprets <fi-noun> -entries as not having a declension table. I guess some template-wizard could fix this in no time, but I really would want to get Jyril's opinion on this, because we are talking of something that is pretty much built by him. --Hekaheka 00:25, 6 January 2010 (UTC)
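The thread above does not settle which template should win. Purely as an illustration of the find-and-replace step, here is a minimal sketch of the {{infl|fi|noun}} to {{fi-noun}} direction with the exception list that was suggested. It assumes entries are handled as plain wikitext strings; the function name convert_entry and the exception titles are hypothetical, and this is not AWB or any real bot framework.

```python
import re

# Only the bare call {{infl|fi|noun}} is matched. Calls with extra
# parameters (such as the plurale tantum marker mentioned above) are
# deliberately left alone, since {{fi-noun}} may not support them.
BARE_INFL = re.compile(r"\{\{\s*infl\s*\|\s*fi\s*\|\s*noun\s*\}\}")


def convert_entry(title, wikitext, exceptions=frozenset()):
    """Return wikitext with bare {{infl|fi|noun}} replaced by {{fi-noun}}.

    Entries whose title is in `exceptions` are returned unchanged.
    """
    if title in exceptions:
        return wikitext
    return BARE_INFL.sub("{{fi-noun}}", wikitext)


print(convert_entry("example", "===Noun===\n{{infl|fi|noun}}\n"))
```

Swapping the pattern and the replacement string would give the opposite direction, should the Finnish editors prefer {{infl}}.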

My bot

My bot was blocked by EncycloPetey for using it when the consensus was against it being used about two weeks ago. He has told me that if the community thinks that I am a capable enough bot operator or that there is consensus for me to be able to use a bot, that he would unblock it. Therefore, I would like to ask you this: Am I a capable enough editor that you can trust me with the operation of a bot here on the English Wiktionary?

A few things to keep in mind: I've been running Darkicebot since January 2008 as an interwiki bot and over the years, he has gained the bot flag on more than 50 Wikipedias, as well as gaining the global bot flag about half a year ago. I've been working with Darkicebot since January 2008, and I believe that I am a capable enough bot operator.

My question to you is this: Can my bot be unblocked? I've already told EncycloPetey that I have no intentions of running it unless I am ready to test something out (and by test, I mean 5-10 edits) while I get it ready to possibly use in the future. Do you think I am a capable enough bot operator to handle this responsibility? Thanks, Razorflame 19:17, 29 December 2009 (UTC)

I agree with EP that since your bot is blocked, your few test edits should be done through your Razorflame account. The username change should only require a find&replace in your bot code (+ storing new login credentials). --Bequw → ¢ • τ 20:54, 29 December 2009 (UTC)
Ok. I will see what I can do about it. Thanks for the help and opinion. Other opinions are still welcome on this subject. Razorflame 15:53, 7 January 2010 (UTC)

Definition layout.

I've created Wiktionary:Votes/pl-2009-12/Definition layout. The underlying proposal is lemma-to-lemma translation; the vote page is an attempt to formulate that in a way we can vote on. Right this instant, I'm not really interested in objections to the underlying proposal; the last discussion was about that, and I expect we'll have another one before the vote takes place (I haven't given it a start-time yet, as I don't think there's a need to rush this). Rather, what I'm looking for right now is help in getting the vote page to really reflect the underlying proposal: is it modifying the right part of the right policy, does the wording convey what it's supposed to, and so on. (I'm also wondering if I'm trying to put too many changes in one vote. The stuff about "a" and "to" feels logically connected, but maybe it's not connected enough to go in the same vote.) Please comment at Wiktionary talk:Votes/pl-2009-12/Definition layout. —RuakhTALK 03:44, 30 December 2009 (UTC)

Translingual categories

I find it a little disconcerting that a Translingual entry (eg +) shows up only in Translingual categories and not in the language-specific categories for which that translingual definition is valid (eg Category:Mathematics). When viewing categories like Category:English proper nouns and Category:Mathematics, I'd want some inkling of the content in Category:Translingual proper nouns and Category:mul:Mathematics. Sure, the PoS categories are siblings and the Translingual topical category is a child of the English one, but I think we should do better than this. Here's what I was thinking:

  1. Working at merely the category level we could edit the category page templates (eg {{poscatboiler}} and {{topic cat}}) to prominently display links in the category description to related extant Translingual categories. This is easy, but might wrongfully imply that all entries in the Translingual category are valid in a particular language.
  2. Working at the article level, we could formalize the Translingual entry structure for noting the languages for which a Translingual sense is valid. We could, for instance, do something like:
===Usage notes===
* {{sense|the plus sign}} {{mul-range|pos1=symbol|topic1=mathematics|en|fr|ja|cmn}}
* {{sense|symbol for and}} {{mul-range|pos1=symbol|topic1=informal|en|fr}}
This would both display for the reader the languages a specific sense is valid in, as well as put the entry into the appropriate language-specific PoS and topical categories. This is more specific than #1 and our coverage would therefore be low for the foreseeable future.

Do either of these ideas, or a mixture of the two, seem appropriate? Does someone else have a better way of relating Translingual categories to others? --Bequw → ¢ • τ 22:36, 30 December 2009 (UTC)
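To make the second idea concrete, here is a minimal sketch of the categories that a {{mul-range}} call might produce. Everything in it is an assumption for illustration: the template does not exist yet, the language-name table covers only the codes from the example, the part-of-speech category is formed by naively adding "s", and the topical naming follows the pattern seen above (Category:Mathematics for English, a code prefix such as Category:mul:Mathematics otherwise).

```python
# Hypothetical code-to-name table; only the codes used in the example above.
LANGUAGE_NAMES = {
    "en": "English",
    "fr": "French",
    "ja": "Japanese",
    "cmn": "Mandarin",
}


def mul_range_categories(pos, topic, lang_codes):
    """List the categories a {{mul-range}}-style call might add.

    Assumed conventions (not confirmed policy): PoS categories are
    "<Language> <pos>s"; English topical categories carry no prefix,
    while other languages use "<code>:<Topic>".
    """
    categories = []
    for code in lang_codes:
        categories.append(f"Category:{LANGUAGE_NAMES[code]} {pos}s")
        prefix = "" if code == "en" else f"{code}:"
        categories.append(f"Category:{prefix}{topic.capitalize()}")
    return categories


for name in mul_range_categories("symbol", "mathematics", ["en", "fr"]):
    print(name)
```

A sketch like this also shows the coverage concern raised above: an entry lands in a language's categories only when someone has listed that language code by hand.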

Ambiguous category names

Unfortunately Wikimedia hasn't figured out how to allow categories to be renamed, which is really irritating. For example, the categories Occupations, Construction and Drugs are all unusually ambiguous. I'd like to see [[Category:Occupations]] become [[Category:Professions]], [[Category:Construction]], well I don't know but it has a lot of meanings (grammatical, phonological, etc.) and [[Category:Drugs]] is currently at WT:RFDO because the content has to be split, although nobody really knows how yet. Mglovesfun (talk) 12:53, 31 December 2009 (UTC)

Why change Occupations to Professions? They are quite different things. I.e. an occupation can be a profession but not necessarily the other way around. They're supposed to be generic, aren't they, as they function as hypernyms/umbrella terms. Tooironic 01:51, 1 January 2010 (UTC)

January 2010

Wiktionary logo vote

The first round is complete, and the top two logos are #59, with 289 votes, and #1 (the "tiles" logo) with 253 votes. These logos will now proceed to the second round, which will last until January 31st. --Yair rand 00:22, 1 January 2010 (UTC)

i do not want to come across as contumelious but please consider (re)casting your vote for the tile logo in m:Wiktionary/logo/refresh/voting as--besides using English--the book logo has a clear directionality of horizontal left-to-right, starkly contrasting with Arabic and Chinese, two of the six official UN languages. As such, the tile logo is the only translingual choice left and it was also elected in m:Wiktionary/logo/archive-vote-4. Beyond this, several Wiktionaries, especially Dutch, have commented that this vote is being forced on the rest of us by the English Wiktionary. A tile logo outcome would bring back harmony between the Wiktionaries in a way that book logo would not because many of us boycotting the vote see the tile logo as the only legitimate choice.
To help explain the relevant context, i borrowed the following table from m:Wiktionary/logo/refresh#nl.wiktionary_vote:
Classic (16 wikis): ca cs cy de en es ga hi id is ja no pl pt ru vo
Tiles (10 wikis): fr it ko lt ms nl oc sv vi zh
Other (1 wiki): gl
Warmest Regards, :)--thecurran Speak your mind my past 05:56, 2 January 2010 (UTC)
a) tile logo is ugly as hell b) it has a big red Latin letter W in the middle. Why not Cyrillic, Arabic or Armenian? According to you it too must be considered Anglo-centric. c) no one is going to use a microscope to zoom in and try to read stuff on the book logo, to see if it's English, left, right or whatever. d) you should not canvass votes for your preferred variant: it's unethical. --Vahagn Petrosyan 06:15, 2 January 2010 (UTC)
Well, that big red Latin letter W in the middle is actually a variable. It displays differently depending on the language of the Wiktionary. See for example ko, lt, and zh. Armenian Wiktionary will display a big red Վ. --Stephen 06:35, 2 January 2010 (UTC)
Hmm, indeed. On the other hand, the English wording "Wiktionary, the free dictionary" of the book logo too will be variable by language. I guess, we should drop the Anglo-centric argument and choose by design, each one according to his taste. --Vahagn Petrosyan 07:12, 2 January 2010 (UTC)
How is it unethical, Vahagn Petrosyan, when well-informed users like yourself are completely unaware that Chinese texts bear no gaps on the left-hand or right-hand sides but, rather, on the bottom? Without any magnification it is abundantly clear that the text on the book logo has gaps on the right. It would have to be at least reflected horizontally for Arabic but it simply cannot work with Chinese. Besides, why are we ignoring the original vote and pushing our views on to the rest of the Wiktionaries in the first place? Warmest Regards, :)--thecurran Speak your mind my past 06:59, 2 January 2010 (UTC)
Re the second part: "Following [the second round], each language Wiktionary will hold their own vote on whether to accept the winning logo. In the event that less than 60% of the Wiktionaries approve of the logo, none of the Wiktionaries will use the logo." So, no one is pushing anyone. The Dutch can reject the book logo in their local vote. Re Chinese and Arabic: you must surely understand that for languages writing in the wrong direction the logo can be slightly redesigned in minutes. And yes, I said wrong direction :D --Vahagn Petrosyan 07:12, 2 January 2010 (UTC)
If it can be done so easily, please do so within 24 hours for each of the six official UN languages. That gives you four hours apiece, which should be plenty if it takes minutes and since English is already done. While U are at it, can U or anyone else please explain why we are throwing away the original widely-accepted vote?
BTW, aside from yourself (~Armenian?) and Bulgarian w:User:Марио Николов, it seems the book side is almost completely English. Warmest Regards, :)--thecurran Speak your mind my past 07:33, 2 January 2010 (UTC)
Voila, the Arabic logo: the white gaps are on the left. As for the original vote, I don't know why it wasn't accepted. Hopefully someone else will remember. --Vahagn Petrosyan 07:51, 2 January 2010 (UTC)
JackPotte from French wiktionary has also indorsed the book logo, to my dismay, but as a whole you are right. The uſer hight Bogorm converſation 11:57, 2 January 2010 (UTC)
The spacing is non-cursive. Are you sure you completely re-created it instead of doing the minimalist horizontal reflection i alluded to above? Warmest Regards, :)--thecurran Speak your mind my past 08:17, 2 January 2010 (UTC)
Of course I did only a horizontal flip, nothing else. With a little more time and professionalism the text on the page can be made to look like Arabic or Chinese. And even if we leave it as it is, it's still more multi-national than the tile-logo: why are there only 9 tiles? Why are Cyrillic, Latin, Arabic, Hebrew, Greek, Chinese, Korean, that other hieroglyph and that stupid face more important than other scripts? I want Armenian, Georgian, Syriac, Cuneiform, Avestan, Pahlavi and Runes there. But I don't write to others agitating for my favorite logo: you should stop too. This is my main point. --Vahagn Petrosyan 08:43, 2 January 2010 (UTC)
That "stupid face" is the Japanese Katakana character for si, シ, and that "other hieroglyph" is Devanagari for śa, श, as is used by several languages in South Asia. The central character is meant to be Armenian on the Armenian Wiktionary, etc. This is clearly stated on m:Wiktionary/logo/archive-vote-4. Until there is notice of such caveats for the book logo, it is most decidedly not translingual and it is important that Wiktionary admins know this. i have already stopped alerting them on their talk pages but i see no reason not to continue. i would sincerely like at least one person to explain why we have restarted the old vote. Warmest Regards, :)--thecurran Speak your mind my past 09:17, 2 January 2010 (UTC)
The whole point of the logo vote is to find out which logo people prefer, not which one they were forced to vote for. Stop canvassing, please. L☺g☺maniac ☃ 15:21, 2 January 2010 (UTC)
I have no objection to "canvassing" of this sort, since the editor has no personal gain to obtain by raising the issue. On the other hand, I don't particularly care which logo we use. bd2412 T 03:34, 3 January 2010 (UTC)
The table you used is quite out of date. I compiled some statistics on its usage about a couple weeks ago. – Minh Nguyễn (talk, contribs) 21:06, 2 January 2010 (UTC)
L☺g☺maniac, one of my main points in m:Talk:Wiktionary/logo/refresh/voting#Boycott is that this is a vote that many of the users of other Wiktionaries feel is forced upon us. Excuse me Minh Nguyễn, does your red URL mean m:Wiktionary/logo? It states:
The "tiles" logo is currently used by the Arabic, Chinese, Corsican, Dutch, Estonian, French, Greek, Italian, Korean, Limburgish, Lithuanian, Malay, Occitan, Persian, Sicilian, Swedish, Turkish, Ukrainian, Vietnamese, Wolof, and Yiddish Wiktionaries. Additionally, the Simple English and Albanian Wiktionaries use variants of this logo. Together, these 23 wikis represent some 3.3 million entries, or 49.4% of all of Wiktionary, as of 22 December 2009. Please join the discussion at m:discussion on the logo votes on how to proceed from here on.
See also a request to change the favicon for all Wiktionaries.
Warmest Regards, :)--thecurran Speak your mind my past 00:39, 3 January 2010 (UTC)
I don't see anywhere that says "YOU MUST VOTE FOR A NEW LOGO OR ELSE" or anything suchly demanding. I'm sorry that the other Wiktionaries feel the vote has been forced upon them. L☺g☺maniac ☃ 00:57, 3 January 2010 (UTC)
The problem is that we already had a vote and it was accepted. Then some upstarts from en WT decided to start it all over again. It seems like a careless, monumental waste of resources. They did not even phrase it as a re-vote. They completely omitted references to the original. If we permit this, what is there to prevent another group from doing the same thing to this vote in another three years? Warmest Regards, :)--thecurran Speak your mind my past 01:23, 3 January 2010 (UTC)
What do you think? This vote was started while two logos are being used about equally, one that basically nobody likes, and the other that there is significant opposition to. Under these circumstances, with 71 supporters for starting a new vote, a new vote was begun. If the situation was that there was a logo used almost universally among Wiktionaries, and there was no significant support for starting a new vote, of course the vote would not go through. Does that answer your question? --Yair rand 01:43, 3 January 2010 (UTC)

I'd like to point out that the canvassing done by User:Thecurran was in fact most annoying overall, and has if anything prompted me to vote in the other direction. Even if I had had any respect for the tile logo to begin with, which I didn't, because my main opinion of it the first time was that it made the entire project look like a dubious, childish game. --Neskaya contribs talk? 19:39, 3 January 2010 (UTC)

Thank U for pointing that out. i have responded on your page. Warmest Regards, :)—thecurran Speak your mind my past 14:37, 10 January 2010 (UTC)

Vote: Renaming CFI section on genealogic names

I've created Vote: Renaming CFI section on genealogic names, to test how hard it is to get a straightforward cosmetic change to CFI passed.

The proposed change has no effect on inclusion of entries.

I have created the vote to last only two weeks, for the vote's being merely cosmetic and not touching the substance of CFI in any way.

I realize there was no discussion before the vote, but went for it anyway, estimating the change should be wholly uncontroversial. Let's see whether I was right in this.

The vote starts on 8 January 2010, and the wording may be still adjusted. However, let's aim at good enough rather than perfect. --Dan Polansky 15:45, 1 January 2010 (UTC)

Vote: Renaming given name appendixes

I have taken my courage and started another vote: Wiktionary:Votes/pl-2010-01/Renaming given name appendixes.

The vote starts on 8 January 2010 and lasts one month.

The vote is based on an ongoing discussion that has largely proceeded.

While there is at least one person who disagrees with the proposal, I deem the proposal rather uncontroversial, as it keeps the terms "male" and "female" in place in the names of the appendixes. Those who do not like the use of "male" and "female", insisting that we should deal with grammatical gender rather than object or referent gender, should still, I estimate, accept that the result of the proposed renaming is no worse than the current situation. --Dan Polansky 22:49, 1 January 2010 (UTC)

If there is still time, can I propose altering this vote to run along the lines of
Please vote on which pair of terms you would prefer to describe given names throughout wiktionary.
  1. 'masculine name' and 'feminine name'
  2. 'male name' and 'female name'
This is then more understandable, and provides a more useful answer. Conrad.Irwin 18:08, 5 January 2010 (UTC)
I'd rather leave the vote phrased the way it is, being specifically about the names of appendixes and not about the names of categories, and being about approval rather than preference. I admit that it is implied that the names of categories are also at stake, but that is not what the vote is about. It is an approval vote, meaning that a voter who gives support in the vote gives an approval to the renaming without saying that he actually prefers the renaming to the alternative with masculine and feminine; the voter thus says that the proposed renaming is good enough, is acceptable. Put differently, it implements the idea that a person who finds the renaming acceptable (can live with it) while he finds masculine and feminine the best possible options approves the renaming anyway, so that we avoid the block or deadlock resulting from the use of a 70-75% majority voting scheme for non-constitutional changes in policy, in which a plain majority-50%, or a stable majority-60% would be more appropriate. The renaming option that I propose for approval is the one that has so far gained a plain majority of supporters. The voter can approve in the "I can live with it" way by abstaining, that is, by avoiding the oppose option. --Dan Polansky 11:29, 6 January 2010 (UTC)
In this case, as my vote is merely a preferential one, it is not trying to change existing policy (of which we have none on this matter), merely clarify opinion. I see no reason why even the preference that wins by one vote would not be acceptable, so long as that is made clear from the outset (this is all complete bikeshedding anyway). The vote is currently tangential, there is no opposition to renaming the appendices, there is only disagreement as to the preferred title. I think it sets a bad precedent if we are to vote every time we rename a group of appendices, and would prefer that the underlying issue is cleaned up instead of ignored. That said, I could just open the other vote on a similar topic which seems to be more what I want, and then we could let this vote run to completion (and presumably pass, but if it fails, then what?). Conrad.Irwin 11:43, 6 January 2010 (UTC)
I admit that it is not really a policy vote but an executive vote; it concerns the executive branch of Wiktionary government rather than the legislative branch. But the choice of the name of the appendixes has generated a lot of discussion and disagreement, and is controversial enough that no one up to now dared or bothered to rename the appendixes to the name that was so far informally preferred by the majority.
The preferences of the voters should be clear from the vote even with the current phrasing, as each voter will (a) enthusiastically support the vote by voting "support", or (b) vote "support" with a comment to the effect of "I prefer 'masculine' but I can live with 'male'", (c) vote "oppose", (d) vote "abstain" with a comment to the effect of "I prefer 'masculine' but I can live with 'male'", or (e) vote "abstain" without a comment.
It seems that a vote is needed because an informal consensus has not been achieved in the previous discussions. Votes do stand in contradistinction to an informal consensus, and there is no way around this fact, not even by raising the bar to a 70-75% majority voting scheme. The idea that every decision concerning a wiki should be made by consensus is flawed, I think. --Dan Polansky 12:35, 6 January 2010 (UTC)

Wiktionary as a mathematical lexicon

Well, maybe there was already a discussion here about "Wiktionary as a lexicon", in general; in print, it is common to distinguish between those two.

Though, I have not found even a single source in the net that translates well between concepts, terms and phrases of mathematical languages in different spoken languages; trying to read a math article in Deutsch, for example, I find myself going to Wikipedia (DE) again and again to see what a term means in English (same with Hebrew).

Wiktionary can solve this, if we allow entries for mathematical terms and phrases. I'll give here a few examples:

  • degree. This example is more or less already done well in Wiktionary: degree can be translated (arguably) to דרגה in Hebrew and to Grad or to Valenz in German, when in Graph Theory context; it can be translated to מעלה in Hebrew (and still to Grad in German, but not to Valenz) when in Polynomials context.
  • More ambiguous are the terms regarding different types of convergence or continuity. Here are a few examples:
    • uniform means אחיד in Hebrew, but uniform convergence is התכנסות במידה שווה (which does not include the word "אחיד" in it). In German, uniform might be gleichförmig, while uniform convergence is Gleichmäßige Konvergenz.
    • So while אחיד means uniform, רציפות במידה אחידה which is based on "אחיד", is not uniform continuity, but equicontinuity; the former (uniform continuity) is Gleichmäßige Stetigkeit in German, and the latter (equicontinuity) is Gleichgradige Stetigkeit.
  • a ring in common language is translated to טבעת in Hebrew, but to חוג in an algebraic context; while "חוג" itself is translated back to English as class or group in everyday context. group is translated in everyday context to קבוצה, which in mathematical context is translated back to a set; but in mathematical context a group would be translated to חבורה, which in everyday context will probably be translated to a gang, or a clique, which also has a completely different mathematical meaning (which will be translated in that context to קליקה in Hebrew, and so on). A class in a mathematical context would be מחלקה in Hebrew, which will be translated in everyday context back to department or division (which - again - means something completely different).
  • Even general fields of interest have really different names in different languages, that cannot always be literally translated; a few examples:
  • One might also find a lot of theorems which have different names in different languages, leaving no way of literal translation. For example:
    • The Squeeze theorem is in Hebrew כלל הסנדביץ' (which means: the law of the sandwich), and Einschnürungssatz in German (which means more or less the constriction theorem).
  • And, at last, there are some common expressions and phrases used in math, which are not literally translated from language to another. I can think of a few examples in Hebrew and in English, but in some cases I don't even know the proper way of saying this in German. Examples:
    • to prove by contradiction, which is sometimes called Reductio ad Absurdum, is להוכיח בשלילה. It gets complicated when you try to translate conjugations of this (for example, "נניח בשלילה" - something like "let us negatively assume", which is not a correct usage)

and so on.
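The context-dependent translations listed above could be sketched as a small lookup keyed by term and context. This is only an illustrative sketch: the `LEXICON` structure, the context labels, and the `translate` helper are assumptions made for this example, not an existing Wiktionary format; the translations themselves are the ones given in the list.

```python
# Hypothetical sketch: translations keyed by (English term, context).
# The structure, context labels, and helper name are assumptions for illustration only.
LEXICON = {
    ("degree", "graph theory"): {"he": ["דרגה"], "de": ["Grad", "Valenz"]},
    ("degree", "polynomials"): {"he": ["מעלה"], "de": ["Grad"]},
    ("ring", "everyday"): {"he": ["טבעת"]},
    ("ring", "algebra"): {"he": ["חוג"]},
    ("group", "everyday"): {"he": ["קבוצה"]},
    ("group", "mathematics"): {"he": ["חבורה"]},
}


def translate(term, context, lang):
    """Return the recorded translations of `term` in `context`, or an empty list."""
    return LEXICON.get((term, context), {}).get(lang, [])
```

For instance, `translate("degree", "polynomials", "de")` gives only Grad, while the graph theory context also gives Valenz. In normal entries the same information would presumably be carried by context labels on each sense rather than by a separate table.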

The list can be really wrong, but the important thing here is my general question: do you think that Wiktionary should do that job? If the answer is yes, I am ready to make the effort, and start a "Wiktionary Project" dealing with this. If the answer is no, I believe I should establish a new platform that will handle this, since I find it crucial.


Thanks for reading, waiting for your replies. Peleg 13:09, 2 January 2010 (UTC)

Yes, Wiktionary functions as a lexicon for all areas of knowledge in all languages. Feel free to add missing mathematical senses of words (marked with appropriate context labels) and a properly glossed translation. It would be best to focus on the normal entries in the main namespace rather than on a specialized appendix (although the approach of a giant comparison table has its own merits) because that's where most of the people will look it up. Starting up a wikiproject might be overkill since this project is seriously undermanned, but that might even work given that there are several mathematically inclined regulars around here. --Ivan Štambuk 16:13, 2 January 2010 (UTC)
Thanks for your reply. If so, I'll start doing it here in Wiktionary, in an organized way. The project page will be here: User:Peleg/Mathematical Multilingual Lexicon.
Thanks for working on this! bd2412 T 01:07, 3 January 2010 (UTC)
It is my pleasure :) You can help by letting people you know know about this project, and where it is located at the moment. In the meanwhile, I'll work slowly on it. Peleg 19:45, 6 January 2010 (UTC)

Hidden categories

Is there any sort of logic to which categories are hidden or not? For example, requests for deletion. IMO categories that are related to maintenance and are not lexical or topical should be hidden. For example, we are very inconsistent on [[Category:<langname> words needing attention]] if they are hidden or not. Mglovesfun (talk) 08:49, 5 January 2010 (UTC)

Support the proposal categories that are related to maintenance and are not lexical or topical should be hidden, including [[Category:<langname> words needing attention]]. --Daniel. 11:16, 5 January 2010 (UTC)
Of course, only regular editors will see hidden categories (and only some regular editors). So the idea i think is to hide any category that we don't want readers (who are not usually editors) to see in an entry. The "Requests for" categories (except deletion, verification, autoformat, and perhaps a coupla others) I think should be visible, so that even such readers can see that something should be added, and maybe they'll add it. No harm in it, and it can certainly help the entries, especially (but not only) for languages in which no regular editors are proficient. Some of the other categories perhaps should be hidden, though, including "Languagename words needing attention" and "Translations to be checked (Languagename)". Just by way of comparison, enWP hides maintenance categories except for its stub categories and its "uncategorized" categories.​—msh210 17:01, 5 January 2010 (UTC)

Numeral categories

I'd like to create [[Category:Portuguese cardinal numerals]] and [[Category:Portuguese ordinal numerals]]. --Daniel. 10:58, 5 January 2010 (UTC)

I was hoping someone would open this can of worms. Would their parent be [[Category:Portuguese numerals]] inside [[Category:Portuguese parts of speech]]? --Vahagn Petrosyan 11:04, 5 January 2010 (UTC)
Exactly. --Daniel. 11:13, 5 January 2010 (UTC)
This would need quite a bit of discussion. Currently, "cardinal number" is considered a topical category, not a lexical one. -- Prince Kassad 11:18, 5 January 2010 (UTC)
I'm for deleting [[Category:pt:Cardinal numbers]] in favor of [[Category:Portuguese cardinal numerals]]. Numerals are considered a part of speech in all languages I work with. --Vahagn Petrosyan 11:20, 5 January 2010 (UTC)
Note the important distinction: number ≠ numeral... there are words which are one but not the other, I've been told. -- Prince Kassad 11:25, 5 January 2010 (UTC)
Using this search for "numeral" I have found the votes and discussions; and more:
  • Wiktionary:Votes/pl-2006-10/Number versus Numeral.
  • Wiktionary_talk:Entry_layout_explained/POS_headers#Number_versus_Numeral, October 2006
  • Wiktionary:Beer_parlour_archive/2007/April#Numerals_and_their_categories
--Dan Polansky 12:29, 5 January 2010 (UTC)

From what I see on current categorization scheme, number is a numerical symbol or group of such symbols (for instance, the contents of [[Category:Khmer numbers]] and [[Category:Japanese numbers]]), numeral is a numerical word, when not considered of other part of speech (such as [[Category:English numerals]] which doesn't contain the adjective second). Finally, ordinal number and cardinal number are topical categories that specify types of numerical words. However, this scheme is not exactly suitable for Portuguese, because "cardinal" and "ordinal" are distinct grammatical classes in this language — that is, both ordinal numerals and cardinal numerals have a set of different characteristics from each other and from other parts of speech; these characteristics include position in sentences, presence or absence of a distinct feminine form and pluralization. --Daniel. 16:19, 7 January 2010 (UTC)

Number is polysemic and refers to (a) the number of pieces in a group (or elements of a set) such as the number of * in "*****", (b) a numeral such as "345", "two hundred" or "fifth". Numeral is polysemic too, and includes "345", "two hundred", "1st" and "fifth". The decadic numeral "15", the numeral "fifteen" and the binary numeral "1111" all denote the same number in the sense (a). A numeral is a syntactic object, while a number is a semantic object, a meaning of a numeral.
Not only Portuguese but also English cardinal numerals and ordinal numerals are grammatically or syntactically distinct: two (cardinal), second (ordinal); three, third; four, fourth; etc.
The term "ordinal number" is ambiguous and refers to (i) ordinal numeral such as "fifth", and (ii) the set-theoretic or order-theoretic concept that includes such individuals as omega, omega + 1, and epsilon zero.
This account is still simplified in that it omits further meanings of "number" and "numeral".
In any case, the term "ordinal number" means in one of its senses the same as the term "ordinal numeral", but for this purpose, "ordinal numeral" is less ambiguous. However, "ordinal number" is possibly more commonly used than "ordinal numeral" in English grammar.
--Dan Polansky 18:48, 7 January 2010 (UTC)
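The distinction drawn above between a numeral (a syntactic object) and the number it denotes (a semantic object) can be illustrated with a short sketch. The `WORD_VALUES` table is a hand-written assumption for this example only; the three numerals are the ones named in the comment.

```python
# Three numerals (syntactic objects) that denote one number (semantic object).
decimal_numeral = "15"
binary_numeral = "1111"
word_numeral = "fifteen"

# Tiny hand-written table for the word form; an assumption for this sketch only.
WORD_VALUES = {"fifteen": 15}

# Interpreting each numeral yields its denoted number; a set keeps distinct values only.
values = {
    int(decimal_numeral, 10),
    int(binary_numeral, 2),
    WORD_VALUES[word_numeral],
}
```

The three strings differ, but `values` ends up with a single element, which is the sense (a) number they share.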
Addressing several comments above: The Category:Portuguese numerals (and the like for other languages) is intended to be the POS category for all the numerals. The problem with categorizing Cardinals and Ordinals as parts of speech is that, in many languages, not all the cardinal numbers function as numerals grammatically. In English, for example, hundred is a cardinal number, but it is not a numeral; it is a noun. Similar situations occur in Spanish and Latin, and presumably in other related languages. In most modern romance languages, the ordinal numbers are functional adjectives, not numerals, despite being numeric and relating to a cardinal. So, the compromise we currently use is to have the Category:pt:Cardinal numbers and Category:pt:Ordinal numbers, so that we can list all those words in one place regardless of the part of speech it actually is. These categories also allow us to include nonfinite cardinals, such as aleph-null within a subcategory of mathematics, even though the word does not function grammatically as a numeral. To summarize, I see no advantage to the proposed change, but see several logistical headaches, such as having no category for a language that includes all the cardinal numbers because they would be sorted by grammatical function instead of by topic. --EncycloPetey 03:45, 8 January 2010 (UTC)

Seeking examples of types of flawed definitions

I am interested in accumulating examples of types of flaws in definitions for purposes of Wiktionary:Definitions, Wiktionary:Improving definitions, or some associated page(s). Examples that come to mind are: unsatisfactory technical definitions, unnecessary hyperspecialization, and "An X is when..." definitions (of "X").

The associated talk pages or the body of the Wiktionary pages would be good places for such things.

These Wiktionary pages are intended to provide something more comprehensive than Help:Writing definitions. They would allow that page to be more explicitly simplified for the benefit of new users. Ideas about and contributions to these pages are desired. At this stage, additions are preferred to deletions. Restructuring and massive revision are likely if the effort does not peter out. DCDuring TALK * Holiday Greetings! 16:07, 5 January 2010 (UTC)

  • I have a sort of essay at User:Msh210/specificity, but see also its talkpage. —msh210 16:38, 5 January 2010 (UTC)
Much needed, but spreading this info out across a bunch of different pages is not a recipe for success IMO. Given our limited editorial resources, could we concentrate this at WT:STYLE until/unless the volume of data becomes unmanageable? -- Visviva 17:18, 5 January 2010 (UTC)
I didn't realize we had such a thing. It looks good. I have been reading Landau and realized that all dictionaries have a style manual of some considerable length. I doubt that we could get a copy of a modern one, but perhaps someone has some headings that would be a useful partial model. I will attempt to not duplicate or be inconsistent with WT:STYLE.
I suppose I view Wiktionary:Definitions as a comprehensive and verbose repository of every generalization about definitions from which we can extract the important for WT:STYLE, the essential-for-newbies for Help:Writing definitions, and certain items for Wiktionary:Improving definitions.
I was intending "Improving" to be narrow in focus. It is to provide the rationale and structure for checklist-type tasks that were less format-oriented than many of our current cleanup lists and more aimed at laying the groundwork for revising definitions. I got started writing it and realized that I was not close to ready. That is when I turned to the more general "Definitions".
Help:Writing definitions needs to be simplified so that it would actually be read by a newbie while waiting for his/her block to expire. Simplification and updating might be good for all of our Help. We need more contributors who can help us update and expand Wiktionary, including English definitions. DCDuring TALK * Holiday Greetings! 22:31, 5 January 2010 (UTC)
  • For a good example of a bad definition, how about our former one for apricot: "A stone fruit"? Circeus 13:51, 11 January 2010 (UTC)
  • I would say that definitions that consist only or primarily of a list of synonyms are bad definitions also. —Internoob (Disc.•Cont.) 23:37, 11 January 2010 (UTC)
    • In some cases we do that (using a single synonym) to avoid recopying the first definition, esp. when one term is more common than the others. Mglovesfun (talk) 11:16, 13 January 2010 (UTC)
Single-synonym definitions sometimes seem appropriate for words that are obsolete. But if the defining synonym is polysemous, it would seem necessary to say which sense applies, in which case a full definition seems necessary. Multiple (>3) synonym definitions are especially suspect to me, especially when the defining words are polysemous. It may be possible for a native speaker to locate the common sense among the synonyms, but I thought we were trying to serve other types of users too. DCDuring TALK 12:21, 13 January 2010 (UTC)

Proposed Wikisaurus style changes

Wikisaurus doesn't really put up a good appearance, not having a small logo like most Wikiprojects on WP, or even a real main page like other Wikimedia sub-projects, Wikijunior and Wikimedia cookbook. Also, the Wikisaurus doesn't have much of a style in its entries. Thus, I propose the following:

  • Using the top section of this as the Wikisaurus main page, with the current Wiktionary:Wikisaurus being moved to Wiktionary:Wikisaurus project page or similar.
  • Using the bookglobe logo designed by commons:User:Ephemeronium as the Wikisaurus logo, to be used in Template:Wikisaurus-link and external interwiki links to Wikisaurus. (To the best of my knowledge, no other Wiktionary has a logo for Wikisaurus, so this doesn't have to be a huge multiple-Wiktionary issue.)
  • Changing the templates {{ws header}} and {{ws}} to the designs shown, again, on this page
  • Adding the header shown on the same page to Wiktionary talk:Wikisaurus, which could hopefully become a proper discussion room, perhaps with a link from Wiktionary:Discussion rooms.

If there is consensus for these changes, what are the chances this could be done without the usual mess of bureaucracy (read: votes) which wastes a huge amount of time? --Yair rand 19:25, 5 January 2010 (UTC)

Distinguishing the portal aspects from the project aspects is useful. Your changes seem good. --Bequw → ¢ • τ 08:50, 6 January 2010 (UTC)
Separating a portal page from a project page for Wikisaurus could be okay, but I really do not see a need for doing so. I don't believe creating a flashy portal page will help us (or help me :p) build Wikisaurus. The project page "Wiktionary:Wikisaurus" contains the essential information for starting contributing to Wikisaurus, all in one page.
I disagree with the proposed style: User:Yair_rand/WS-main. While I admit that it looks superficially nice, it sacrifices the function of Wiktionary:Wikisaurus to flashiness. Instead of organizing information by sections, it uses a tabular layout, which makes it harder to find information, for me anyway. It distracts the attention by excessive flashiness.
Wikisaurus entries: I disagree with making the changes to appearance of Wikisaurus entries that are shown in User:Yair_rand/WS-main, including the putting of a small logo at every bullet, which I deem wholly non-functional and unduly flashy.
Logo: I still think that Wikisaurus does not need any logo, being a namespace and subproject of Wiktionary rather than a standalone project. The problems of building Wikisaurus are not tied to a logo or appearance; someone has to do the real work of finding the best headwords and term and concept structures, and identifying the semantic relations between words—the one that does not consist of setting up flashy CSS-styled boxes.
Discussions: I disagree that discussions about Wikisaurus should be channelled to a talk page rather than to Beer Parlour, based on my experience with trying to find out information in
  1. Wiktionary:Thesaurus considerations -- starting in 2002 and 2003, getting more traffic in 2004, with most discussion ended by the end of 2006
  2. Wiktionary:Wikisaurus/Improvements 1 -- created in February 2005, and stopped immediately; a surge of activity appeared in July 2008
  3. Wiktionary:Wikisaurus/Improvements 2
Searching in Beer Parlour for Wikisaurus-related discussions is fairly easy, using the keyword "Wikisaurus" in the namespace "Wiktionary:".
Wikisaurus-link: I oppose the use of {{Wikisaurus-link}}; I encourage to link to Wikisaurus through "Synonyms" section, as (a) Wikisaurus is not a standalone project like Wikipedia or Commons, and (b) "Synonyms" section is the one where people come looking for synonyms. --Dan Polansky 10:55, 6 January 2010 (UTC)
I was not in any way advocating the use of {{Wikisaurus-link}} (though I do use it myself sometimes). The main page design on User:Yair rand/WS-main is, well, not really supposed to be useful for finding information itself; the current Wiktionary:Wikisaurus is far better for that, which is why it's linked to from my design. What a main page is supposed to do is really show the reader a nice design, give a bit of a "visual identity", show a short outline and a couple of useful links, and showcase either a "featured" piece of content or some new content so that the reader can tell what it's supposed to look like, and show a bit of what people can do to help. As for the bullet, maybe that was a bad idea, but I still like having the header be more than just a block of text and a search box. --Yair rand 18:15, 6 January 2010 (UTC)

More Integration with Wiktionary

I think that Wikisaurus should be more integrated with Wiktionary, not less. In my opinion, the best scheme would be to make the regular Wiktionary entries act as an index to a "keyless" Wikisaurus entry, like in Roget's International Thesaurus by Harper-Collins. (RIT uses numbers for their thesaurus entries but we can hide those numbers within the software.)

For example, the regular Wiktionary entry for immigrant would have a Wikisaurus index section like:

  • WIKISAURUS FOR Immigrant
    • migrant
    • incomer
    • citizen
    • settler
    • newcomer

Clicking on "migrant" would bring up this Wikisaurus list:

Noun

  • migrant
  • migrator
  • trekker
  • immigrant
  • wetback (informal)
  • etc.

See Also

  • Traveler

Clicking on "immigrant" would bring you back to the regular Wiktionary entry for immigrant. Clicking on "trekker" would bring you to the Wiktionary entry for trekker, which would have a WIKISAURUS FOR Trekker index (which, in turn, would include the entry for "migrant", among possibly others) and so forth.

Clicking on "incomer" in the original WIKISAURUS FOR Immigrant entry would bring up:

Noun

  • incomer
  • entrant
  • visitor
  • immigrant
  • etc.

Clicking on "Traveler" in See Also would bring up a super-index of:

Nouns

  • traveler
  • wanderer
  • vagabond
  • nomad
  • migrant
  • etc

See Also

  • Places

Clicking on any of these would bring up the appropriate lowest-level index. Clicking on "Places" in See Also would bring up a super-super-index of:

  • Space
  • Location
  • Displacement
  • Traveler
  • etc.

Software (probably a template) would have to be developed so that, when a Wikisaurus index is added to or removed from a word entry in Wiktionary, the software would automatically change the Wikisaurus page. We would need editor software to change the order of the Wikisaurus word entries. The first entry would be the index tag (in the first case above, "migrant") of the regular Wiktionary page. We would also have to edit the "See Also" entries and the usage notes (like "informal").
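The synchronisation described above could be sketched roughly as follows. This is a minimal in-memory illustration, not MediaWiki template code: the `Wikisaurus` class, its two dictionaries, and the method names are all assumptions made for this example.

```python
from collections import defaultdict


class Wikisaurus:
    """Hypothetical model of the two-way link between entries and Wikisaurus lists."""

    def __init__(self):
        # entry word -> index tags shown under "WIKISAURUS FOR <word>"
        self.index = defaultdict(list)
        # index tag -> words on that tag's Wikisaurus list
        self.lists = defaultdict(list)

    def add_index(self, word, tag):
        """Add `tag` to the entry for `word` and update the tag's list to match."""
        if tag not in self.index[word]:
            self.index[word].append(tag)
        members = self.lists[tag]
        if not members:
            members.append(tag)  # the first entry is the index tag itself
        if word not in members:
            members.append(word)

    def remove_index(self, word, tag):
        """Remove `tag` from the entry for `word` and drop `word` from the tag's list."""
        if tag in self.index[word]:
            self.index[word].remove(tag)
        if word != tag and word in self.lists[tag]:
            self.lists[tag].remove(word)
```

With this sketch, calling `add_index("immigrant", "migrant")` and then `add_index("trekker", "migrant")` leaves the "migrant" list as migrant, immigrant, trekker, and removing the index from trekker takes it off the list again. Reordering, "See Also" links, and usage notes such as "informal" are left out.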

This is just an idea and I apologize if it's been proposed and slammed before. It is a big change but it does seem to be a method for including virtually all words in Wiktionary in one or multiple Wikisaurus entries. I put it into a separate sub-section because I think it relates to the Proposed Wikisaurus style changes but it's a somewhat different line of thought. --RoyGoldsmith 14:44, 9 January 2010 (UTC)

I am very confused. How does this relate at all to the proposal of having a portal-like page and logo for Wikisaurus? --Yair rand 23:29, 9 January 2010 (UTC)
It doesn't, per se. But after you reach the Thesaurus (by whatever means), the contents of the thesaurus would be (I hope) more organized. Rather than having an alphabetic list of terms (some of which overlap -- see copulate, sexual activity and sexual intercourse, for example) you would be presented with a number of highly abstract concepts: The Body, The Senses, Feelings, Places, etc.
Click on, say, The Body, and you would be presented with another list of more-refined conceptual terms: Birth, The Human Body, Hair, Clothing, Nutrition, etc. Click on Birth and you would get: NOUNS birth; VERBS be born, give birth; ADJECTIVES born. Click on (say) give birth and you would be presented with the actual words that relate to the concept of giving birth: give birth, bear, have young, have a baby, bear a child, whelp, litter, labor, etc. Click on one of these and you would be redirected to the actual Wiktionary entry. This entry would then have a reverse link to the lowest level of Wikisaurus.
Thus there would now be two main entry points for Wikisaurus. The first is from the top down, through the current way or your portal or something else entirely. The second would be from the bottom up, through the entry in Wiktionary (via the new "WIKISAURUS FOR xxx" section or something akin to it) up through a list of highly related terms (give birth, have young, etc.) and from there up to even more abstract concepts: for example, Birth in general and from there to The Body.
That's all I'm saying. You gave a new method for accessing Wikisaurus from the top; I'm giving a new method of accessing Wikisaurus from the bottom. (Of course, my method changes the entire structure of Wikisaurus and is therefore much harder to implement.) If you would prefer to have my proposal treated totally separate from yours, simply change the header "=== More Integration with Wiktionary ===" to "== More Integration with Wiktionary ==" (two equal signs on either side instead of three) at the top of this subsection and, magically, it will be transformed into two, independent proposals. --RoyGoldsmith 15:08, 10 January 2010 (UTC)

Preposition forms and prepositional phrases.

A number of languages have what are called "inflected prepositions". The details vary from language to language, but the general idea is that certain preposition+pronoun combinations are expressed by inflecting the preposition to incorporate the pronoun. There are a few minor examples of this in Romance languages (such as Spanish conmigo (with me), which historically results from roughly con + mí + con, but nowadays amounts to a special form of the preposition con that incorporates the first-person singular personal pronoun mí), but in some languages, such as certain Celtic and Semitic languages, it's very pervasive: for example, all Hebrew prepositions are inflected in this way, though some prepositions are defective (meaning they can't take certain objects) and/or suppletive (meaning that not all of their forms have descended from the same ancestral preposition). Incidentally, Hebrew doesn't even have stand-alone object forms for the animate personal pronouns, since they're always expressed via an inflected form of whatever they're the object of.

From one perspective, these are forms of prepositions; but from another perspective, they are complete prepositional phrases (since the preposition's object is built-in), so they don't behave quite like English's transitive prepositions. Header-wise, this makes for a bit of a conundrum. As I see it, we have a few options:

  1. ===Preposition===. This is what I've been using.
  2. ===Adjective=== and ===Adverb=== (with identical or nearly-identical information under each). This accords with how we've been handling prepositional phrases in English.
  3. ===Idiom=== or ===Phrase===. Either of these would be misleading, since these are one-word forms; even in cases where the pronoun was a separate word in some ancestor language, it's not one now.
  4. ===Contraction===. A bit better, but still misleading, since conmigo is longer than *con mí, and since in Hebrew it's not clear what they're contractions of (since the personal pronouns simply don't have standalone object forms).
  5. ===Preposition form=== (currently not sanctioned by ELE). This is the most explicit and the most precise, but I'm not sure how much clearer it actually is for an English-speaker, even one who speaks an affected language. And we avoid headers like "Noun form" and "Verb form", so this would introduce some inconsistency.
  6. ===Prepositional phrase=== (currently not sanctioned by ELE). This is still a bit misleading, what with the word "phrase" in there, but overall, I think it would work pretty well. It could also be used for other kinds of prepositional phrases, which would be a bit clearer and more accurate than our current ===Adjective===/===Adverb=== approach (especially for languages where true adjectives inflect for number and such). Admittedly, there's still a bit of inconsistency, in that we avoid "Noun phrase" and so on, but I think it's clear that a noun phrase is noun-like in a way that a prepositional phrase is not preposition-like, so hopefully that's O.K.

All in all, I think my preferred option is to start allowing ===Prepositional phrase===, and to use it for object-including preposition forms as well as for regular prepositional phrases; but what do y'all think?

—RuakhTALK 21:55, 5 January 2010 (UTC)

I certainly see the advantage of Prepositional phrase as a PoS-type header for English: eliminating low-value duplication of senses in Adjective and Adverb sections at no additional (possibly even less) user confusion. Category:English prepositional phrases would make it relatively easy to rapidly transition to such a header. The simple nature of English prepositional phrases would make it easy to find omissions, too. DCDuring TALK * Holiday Greetings! 22:48, 5 January 2010 (UTC)
I, too, like the idea of "Prepositional phrase" for English. I'll keep out of the conversation w.r.t. Hebrew though, at least for now: having thought about it and discussed it (elsewhere), I'm really not sure what's best. Ruakh, if you have the time and inclination, how do Even Shoshan and his counterparts list לי?​—msh210 17:24, 6 January 2010 (UTC)
Aside from Wiktionaries and Wiktionary mirrors, I know of no dictionary, online or in print, in any language or between any pair of languages, that has entries for individual word-forms. Online dictionaries will sometimes just redirect you to the main entry for the lexeme, and print ones will sometimes do the print equivalent for forms that are extremely irregular or appear very far away in alphabetical order (like, between the entries for sommer and sommet, you might find "sommes see être"), but so far as I know, our informative soft redirects are without precedent. So I think we're on our own for this. —RuakhTALK 20:54, 6 January 2010 (UTC)
At least "prepositional phrase" is a term that seems to be used in English fairly consistently in virtually every vintage and level of grammar book. We should try to anticipate problems and make sure that we have supporting material (glossary definition, entry, category text, Appendix) of good quality to support the idea. We might discover problems as we try to prepare such material. One small problem I can anticipate would be with the "postpositions" (eg, "notwithstanding", "apart", "aside") if they head any CFI-meeting phrases. It may also lead us to face the question of whether to analyze some terms (eg, with regard to) as "compound prepositions" (Quirk et al, en.wikt) vs. elements of layered prepositional phrases (per CGEL) and also such odd cases as to do with. DCDuring TALK * Holiday Greetings! 22:11, 6 January 2010 (UTC)
Postposition is an ELE-approved POS header, so I don't see any problem with having Postpositional phrase if we have Prepositional phrase. The only issue would be where to put such an (English) phrase, and we can, I think, cross that bridge when we come to it.​—msh210 17:52, 7 January 2010 (UTC)
We don't really want the Postposition header in English, whatever value it has for other languages. Few normal people notice or are bothered by "preposition" being a misnomer etymologically when applied to the few terms that are atypical in this regard. Stranded prepositions are always postpositive anyway, without the name being what upsets most of those who complain about such things. In any event, the position of a preposition relative to the rest of the phrase it is in is not the important grammatical feature of a preposition, nor is it unique to prepositions ("The" is prepositive.).
My point was only that, given that we don't want the Postposition header, any includable prepositional phrase that uses one of these oxymoronic normally-postpositioned prepositions may look funny and cause comment. DCDuring TALK 19:03, 7 January 2010 (UTC)
O.K., strike my earlier comment. I totally should have checked before writing it. I'm sorry. I was totally wrong. It turns out that Even-Shoshan does include inflected forms for some of the really basic prepositions. In general, it doesn't give any definition at all for these, or only a definition like "רְאֵה מִן" ("see from"), but for לִי (li, "to me") it actually goes all out for some reason, defining it as "אֵלַי, לְעַצְמִי וְכוּ׳" ("toward me, to myself &c."). For most of the preposition forms I looked at, it gives the POS as מ״י (M.Y., prep.); the only one for which it gives a different POS is לוֹ (lo), for which it gives מ״ג (M.G., pron.). I'm inclined to regard that inconsistency as a mistake, though. —RuakhTALK 02:05, 7 January 2010 (UTC)
Thanks.​—msh210 17:52, 7 January 2010 (UTC)

Update: since no one seems to have any objections, I've set up a vote at Wiktionary:Votes/pl-2010-01/Allow "Prepositional phrase" as a POS header, to start in one week. If anyone does object, please speak up before then! —RuakhTALK 21:09, 7 January 2010 (UTC)

So, I don't understand why you object to using ===Contraction=== and then labelling the function on the inflection line or in the usage notes. The problem with ===Prepositional phrase=== is that it isn't a full description of the term, nor is it the functional part of speech. Prepositional phrases are prepositions plus an object, so a label that emphasizes only the preposition part is misleading and focusses on what is usually the minor particle in the combination. Also, prepositional phrases tend to function as adverbs or adjectives, and not a prepositions, so the label doesn't describe the word's function either. For the aforementioned Romance languages, I prefer ===Contraction=== for these situations, since that at least avoids giving the impression that the term somehow is a preposition. That doesn't mean this would necessarily be ideal for Hebrew or for other languages, but I haven't seen a convincing reason for ===Prepositional phrase===, and that was a header we tried to deprecate before. --EncycloPetey 03:35, 8 January 2010 (UTC)
I thought I explained why I object to using "contraction": it's not accurate. Spanish conmigo is not exactly a "contraction" of *con mí, because it's actually longer than said; and Hebrew לי is not a "contraction" of anything, any more than English babies is.
"Prepositional phrase" focuses on the preposition part because that's the part that determines the grammar, no? I don't understand your statement that "prepositional phrases tend to function as adverbs or adjectives, and not a prepositions, so the label doesn't describe the word's function either". I agree that "preposition" doesn't describe the function of a prepositional phrase, but surely "prepositional phrase" does?
Re: "For the aforementioned Romance languages, I prefer ===Contraction=== for these situations, since that at least avoids giving the impression that the term somehow is a preposition": I don't understand. Are you saying that the header "prepositional phrase" gives the impression that the term somehow is a preposition?
—RuakhTALK 04:17, 8 January 2010 (UTC)
BTW, I should say that while I'm not a big fan of the ===Contraction=== header for terms like conmigo, I do at least consider it to be an O.K. approach for those. It's not ideal, but it's not the end of the world. —RuakhTALK 04:33, 8 January 2010 (UTC)
If I understand correctly, then the Hebrew term you're concerned with is not actually a prepositional phrase, since it's written as a single word. Phrases are inherently multi-word, so a label of "X phrase" would be inaccurate on that count alone. And no, "prepositional phrase" does not describe the function because prepositional phrases can have many different functions. Most often, they function as an adjective or adverb, but they can appear in other functions as well, such as an interjection or even as the subject of a sentence. --EncycloPetey 17:22, 9 January 2010 (UTC)
Re: "Phrases are inherently multi-word, so a label of 'X phrase' would be inaccurate on that count alone": Yes, as I said.
Re: "And no, 'prepositional phrase' does not describe the function because prepositional phrases can have many different functions. Most often, they function as an adjective or adverb, but they can appear in other functions as well, such as an interjection or even as the subject of a sentence": Yes. That's why it's problematic to try to cover them in "Adjective" or "Adverb" sections: they're not really adjectives or adverbs, and their range of functions is not the same as that of adjectives or adverbs. Hence my suggestion that we give them their own POS header.
—RuakhTALK 22:28, 9 January 2010 (UTC)
So, you want to use a POS tag that describes neither the structure nor the function? I don't understand how that could be a good idea. --EncycloPetey 23:56, 9 January 2010 (UTC)
If by "neither the structure nor the function" you mean "both the structure and the function", then yes. ;-)
It may not describe the structure perfectly, but it describes it better than anything else besides "preposition form", which you don't seem to be advocating. And it does describe the function perfectly. You seem to be contradicting yourself, simultaneously arguing (1) that prepositional phrases have their own set of functions, not exactly the same as those of adjectives and adverbs and (2) that "prepositional phrase" doesn't describe their functions.
—RuakhTALK 03:49, 10 January 2010 (UTC)
You are misunderstanding, so let me try to explain it this way: Why not use the POS "Word", which would "perfectly" describe the function? The answer is that word is too broad a description encompassing too many possible functions. Likewise, a "prepositional phrase" has many possible functions, so that such a label does nothing to distinguish which possible functions an entry might have. This is why we use Adverb, Adjective, etc. as POS headers for prepositional phrases, since such phrases as a group could have any of quite a number of functions, but individually they have one or two of those functions only. So when I said that the header you are proposing describes neither the structure nor the function, I meant exactly that, and was not using the "opposite-speak" you have credited me with. --EncycloPetey 05:03, 10 January 2010 (UTC)
Thanks; if that's true, then it makes sense as a reason. But do you have any evidence for your statement that any given prepositional phrase is only used in a few ways? That's not obvious to me. (In Wikipedia terms, [citation needed]. :-)   —RuakhTALK 05:11, 10 January 2010 (UTC)
Perhaps I should clarify that those which are used in more than one way have a different meaning when so used, so there would be separate senses functioning in different ways (as different POSes) and with different meanings. Do you have evidence that individual prepositional phrases can regularly be used in both an adjectival and adverbial sense? I have yet to see more than a tiny handful that can. --EncycloPetey 07:42, 10 January 2010 (UTC)
The use of Adjective and Adverb headers, jointly or separately, for prepositional phrases is not perfectly accurate and requires duplication of essentially the same meanings.
The duplication of content argument has been used as part of the justification for excluding PoS sections for attributive use of nouns and common noun use of proper nouns. I would extend it in English to exclude some similar duplication in the case of -ing forms in English.
In English the accuracy issue arises from prepositional phrases not normally meeting tests for the adjective and adverb PoSs. For example, they normally do not accept modification by "too" and "very", sometimes accept modification by "right" and "straight", and cannot serve as the complement of "become".
One departure from the pattern is instructive. A prepositional phrase modified by "very" or "too" prefers to be distinguished by its stress pattern or orthographically by quotes or hyphens. "It was very in-the-moment". "He was speaking in-the-moment." "It was a very in-the-moment speech." DCDuring TALK 12:00, 10 January 2010 (UTC)

International Wikisaurus

I'm not really sure about the state of Wikisaurus, but I'm sensing that it's not in any way interconnected across language versions... which is too bad, because synonyms have to be in the same language as the given term, and therefore the words should be exactly the same way under all the wikis. There has to be some way to take advantage of that -- otherwise, you end up doing the same work for the gazillion language version there are, or some people are missing out. By the way, I'd really like other language synonyms in the wikisaurus... —This unsigned comment was added by MirekDve (talk • contribs).

Wikisaurus is not multilingual, meaning that the English language Wikisaurus only includes entries for English words. The reason for this is, as you pointed out, we would end up doing the same work for the gazillion language version there are, with no benefits, because anyone could easily look up the word in the Wikisaurus of that language. --Yair rand 02:17, 7 January 2010 (UTC)
I assume that the subject of this thread is "Multilingual Wikisaurus", meaning whether Wikisaurus in English Wiktionary should contain non-English entries.
It has not yet been decided whether Wikisaurus should be multilingual. I support its being multilingual, but have so far not pushed the idea very far, as there has been some opposition, and I am trying to build up the English section of Wikisaurus first.
A model non-English page in Wikisaurus is Wikisaurus:příbuzný.
See also:
  • Wiktionary:Wikisaurus#Multilingualism
  • Wikisaurus - non-English entries - Mar 2009
  • Wikisaurus_talk:juoppo - result of a RFDO - request for deletion of other - on a non-English Wikisaurus entry.
--Dan Polansky 11:56, 7 January 2010 (UTC)

To be most useful, each Wikisaurus entry should include comments (in English) when needed, e.g. about how senses are slightly different between words. Also remember that a thesaurus is not only about synonyms, but should include all words readers are likely to need when reading the entry, all words which come to mind when addressing the subject (e.g. kennel in the entry for dog, look at Roget's thesaurus for other such examples). Therefore, the entry should be organized into several sections (with section titles in English). This means that a good thesaurus cannot be common to all wiktionaries, and that there is no reason to limit Wikisaurus to the English language. But the titles should always be in English: Wikisaurus entries are not about a specific word (whatever its language), but about words of some language relating to something. Good titles could be something like Wikisaurus:dog (English) or Wikisaurus:dog (Dutch). Lmaltier 22:47, 9 January 2010 (UTC)

Wikisaurus entries stand for senses AKA meanings rather than terms; right.
Non-English entry titles in Wikisaurus can be designed using various approaches. The current approach is (a) that the headwords should be in the given language: "Wikisaurus:příbuzný". One alternative proposed was (b) that the language should be indicated using its code before the term: "Wikisaurus:cs:příbuzný". If the codes are disliked, (c) the disambiguation can proceed in natural language: "Wikisaurus:příbuzný (Czech)". And there is also the option of (d) using English headwords: "Wikisaurus:relative (Czech)".
I tend to favor the approach (a), given the current state of discussion. There are level-2 headings for "English" in Wikisaurus entries already in place for the cases where more languages occupy one headword. The approach (d) would cause some complications in linking between Wikisaurus entries using {{ws}}: in a list of hyponyms, the template links a hyponym not only to mainspace but also to Wikisaurus. So if there is the hyponym {{ws|dítě}} of "příbuzný", the template is automatically looking for a Wikisaurus page entitled "dítě" rather than "child (Czech)". It would be necessary to tell the template explicitly where to look for the Wikisaurus entry, like {{ws|dítě|child|ws=child (Czech)}} or {{ws|dítě|child|ws=child|lang=cs}}. Alternatively, it could be assumed that the gloss (the second parameter of ws) matches exactly an entry headword, so it would suffice to enter {{ws|dítě|child|lang=cs}}, but this assumption would be made only if the language is not English. In any case, (a) can be tried immediately without further technical adjustments. --Dan Polansky 10:37, 10 January 2010 (UTC)
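To make the four title schemes (a) to (d) easier to compare, here is a minimal Python sketch of how a link target could be worked out under each one. It is not how {{ws}} is actually implemented; the function name `wikisaurus_title`, the `scheme` argument, and the small `LANGUAGE_NAMES` table are all hypothetical, and the parameters `gloss`, `lang` and `ws` only loosely mirror the template parameters mentioned above.

```python
# Illustrative only: a tiny, hypothetical code-to-name table.
LANGUAGE_NAMES = {"en": "English", "cs": "Czech"}


def wikisaurus_title(term, gloss=None, lang="en", ws=None, scheme="a"):
    """Return the Wikisaurus page title a link would target (sketch)."""
    if ws is not None:
        # Explicit override, like ws=child (Czech) in the discussion.
        return f"Wikisaurus:{ws}"
    if scheme == "a" or lang == "en":
        # (a): headword in the given language; English is never decorated.
        return f"Wikisaurus:{term}"
    if scheme == "b":
        # (b): language code before the term.
        return f"Wikisaurus:{lang}:{term}"
    name = LANGUAGE_NAMES[lang]
    if scheme == "c":
        # (c): native headword disambiguated in natural language.
        return f"Wikisaurus:{term} ({name})"
    if scheme == "d":
        # (d): English headword, assumed here to equal the gloss.
        if gloss is None:
            raise ValueError("scheme (d) needs a gloss or an explicit ws")
        return f"Wikisaurus:{gloss} ({name})"
    raise ValueError(f"unknown scheme: {scheme}")


print(wikisaurus_title("dítě", "child", lang="cs", scheme="a"))
print(wikisaurus_title("dítě", "child", lang="cs", scheme="d"))
```

Under (a) the target follows from the term alone, which is why it needs no extra parameters; under (d) the sketch has to rely on either the gloss or an explicit `ws` value, which is the complication described above.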

Delinking {{io}}

Hi there all. I would like to ask the community what they think about me requesting for {{io}} to be delinked due to the size/amount of entries that have been made in Ido so far. I believe that it is starting to become more used on here, and that having it linked isn't necessary any longer because of the popularity of the language. Ido is very similar to Esperanto, and Esperanto is not linked, so I think that on that basis alone, {{io}} should be delinked. Thanks, Razorflame 16:49, 7 January 2010 (UTC)

From the old discussions at Wiktionary_talk:Translations/Wikification, it seems as though such changes have been made by any interested editor after a warning at that talkpage, allowing objections to be voiced, assuming none were.​—msh210 17:44, 7 January 2010 (UTC)
I see no advantage in doing it, at all. Mglovesfun (talk) 01:19, 8 January 2010 (UTC)
Usually, headers aren't supposed to be wikilinked, although that might not hold true here. Anyways, I'd like it to not be wikilinked, as that is what I am most used to. There probably isn't any disadvantage with doing it, and there doesn't seem to be any advantages to not doing it, so we might as well do it :) Razorflame 01:22, 8 January 2010 (UTC)
Then don't link the headers. However, please do not remove the link from {{io}}, as that template is primarily linked there for Tbot in dealing with Translations sections. We decided long ago which languages should and should not be linked within the Translations sections, and our ISO templates have been set up accordingly. If people are using the ISO templates to insert headers, then that is fine, but that's not their primary function. Frankly, the commonness of a language on Wiktionary is not a consideration in deciding which language templates are linked. It's based on recognition of the language name by the general population and whether the name of the language sufficiently resembles the name of the country where the language originated or is principally spoken. Ido is a language I would never have heard of if it weren't for working on Wiktionary. --EncycloPetey 03:28, 8 January 2010 (UTC)
I should point out that WT:ACCEL uses those templates for headers, and that's probably the reason Razorflame requested de-linking it. --Yair rand 03:33, 8 January 2010 (UTC)
Thanks for explaining. I've now fixed WT:ACCEL not to linkify language headers. —RuakhTALK 04:30, 8 January 2010 (UTC)

Audio requests

This month, I'm going to be working on clearing the backlog at Category:Requests for audio pronunciation (US). However, I cannot record British English. I need the help of a person who is able to record UK English. Thanks, The New Mikemoral ♪♫WT:APR 03:41, 8 January 2010 (UTC)

Numbers (year?) preceding definition

What are the numbers preceding the noun definitions of boob? __meco 16:43, 8 January 2010 (UTC)

There was a reference to them in the Etymology section. I've cleaned it up. WT:RFC may be better, incidentally.​—msh210 18:22, 8 January 2010 (UTC)
Wouldn't that be request for comments? I'm not sure I understand the delineation between these pages. __meco 18:40, 8 January 2010 (UTC)
This page is for matters of "policy" that span some class of entries. We have at least two places for similar matters relating to individual entries. If you were asking an informational question you could have taken this to WT:TR, inserting {{rft}} in the entry. If the question was really a request for action, you could take it to WT:RFC (cleanup), using {{rfc}}. HTH. DCDuring TALK 18:58, 8 January 2010 (UTC)
I was considering that this was in adherence with guidelines, and if that was so, the guidelines needed to be changed since it was confusing. I guess I didn't know how to check whether it was complying (i.e. with which guideline). __meco 19:21, 8 January 2010 (UTC)

Bing

Have you tried Bing, the new Microsoft search engine (http://www.bing.com)? It's really incredible. With default settings (for France), I tried a few French and English common words. Here are results I get for the Wiktionnaire:

  • manger: 1st page (1st hit)
  • pain: 1st page (2nd hit)
  • eat: 1st page (10th hit)
  • bread: 1st page (3rd hit)
  • football: 1st page (8th hit)
Do you get the same kind of results in other countries? Lmaltier 09:28, 9 January 2010 (UTC)
No.
  • manger: 16th hit
  • pain: not in first 6 pages
  • eat: not in first 6 pages
  • bread: somewhere on the 5th page
  • football: not in first 6 pages --Vahagn Petrosyan 12:51, 9 January 2010 (UTC)
Thanks for the info. This must have something to do with the country... But I would have expected similar results in other countries (with the local language dictionary instead of fr.wiktionary, of course). Lmaltier 16:38, 9 January 2010 (UTC)
My country's local language wiktionary is brain-dead. Maybe that has something to do with it. --Vahagn Petrosyan 17:44, 9 January 2010 (UTC)

Clarification or revision of WT:QUOTE#How to format a quotation

Formatting note 1 states “The year is always in bold face, the title of the work in italics or, for shorter works, quotation marks.” (my emboldenment). Really? I never see the use of quotation marks for this purpose, and it would be more consistent to allow only italics, per the de facto standard (note that the rest of the page uses only italics). Or is this meant to apply to works published as a part of a compilation (e.g., an essay in a multi-authored book or an article in a journal)? Could we get some clarification or revision of this point, please?  (u):Raifʻhār (t):Doremítzwr﴿ 19:39, 9 January 2010 (UTC)

My understanding is that this clause, if you will, exists because that is the standard form used for citing poetry in essays, as shown by the MLA format (outlined here and in many other websites found in a typical Google search). Cdhaptomos 19:43, 9 January 2010 (UTC)
That looks like it’s talking about poems printed in anthologies; i.e., it’s “meant to apply to works published as a part of a compilation”.  (u):Raifʻhār (t):Doremítzwr﴿ 19:52, 9 January 2010 (UTC)
That doesn't look like the best example, having read it thoroughly. This is better. Cdhaptomos 20:13, 9 January 2010 (UTC)
That talks about quoting poetry in an essay, not citing it in a dictionary; lexicographical practice is likely to be very different. Does anyone know the OED’s practice in citing poetry? Or can someone offer a rationale for departing from our usual standard? I don’t see why we should have different formatting standards for quotations from poetry from those that hold for those from prose.  (u):Raifʻhār (t):Doremítzwr﴿ 20:41, 9 January 2010 (UTC)
This is the norm for citing just about anywhere in English. Shorter works such as scientific articles, short poems, and newspaper articles should have their titles in quotes. Longer works, such as epic poems, journals, novels, and other books should have their titles italicized. This is the norm here, as well as on Wikipedia, because it's standard in most style and publication guides. --EncycloPetey 21:17, 9 January 2010 (UTC)
This was totally unknown to me. Could someone please update the policy page to state this fact more explicitly? Cdhaptomos, do you care to do the honours?  (u):Raifʻhār (t):Doremítzwr﴿ 22:41, 9 January 2010 (UTC)
I have tried, as you can see. I don't know how well I've done at wording it, though. Cdhaptomos 23:58, 9 January 2010 (UTC)
I'm not sure this is accurate. Shorter works do usually appear in quotes, but that's because they are part of larger works, which appear in italics. "Article", Newspaper. "Scholarly article", Journal. "Poem", Anthology. Length doesn't factor into it directly AFAICT. Consider for example Frankfurter's On Bullshit. It is considerably shorter than many journal articles, yet its title is always italicized, because it is published as a freestanding book.
I would tend to limit quotes-only to special cases where a very small work appears in isolation, such as broadside ballads, or poems that are known to have been written down long before they appeared in print. Usenet posts would also qualify, the de-facto-standard "Post title", Usenet format notwithstanding. -- Visviva 03:34, 10 January 2010 (UTC)
This is my understanding also. Except that when citing poems, it's fairly common to do so in isolation, since they were usually first published in a magazine or elsewhere. Ƿidsiþ 06:21, 10 January 2010 (UTC)
Most shorter poems are able to be recognized as separate works, regardless of publication in an anthology or whatever. Fair enough, you could put "Poem", as published in Anthology, but at the end of the day, a poem is a poem. Cdhaptomos 18:56, 10 January 2010 (UTC)
My belief was pretty much as Visviva explains. I regard the citation information we give as the means of verifying the existence of the quoted extract, ideally in the place it first occurred; if the work in question does not occur independently, but rather only as part of a larger work, then that justifies the use of quotation marks, whereas italics should be used otherwise. Presumably, no one publishes individual poëms or scholarly articles, hence they always appear as parts of larger works only, hence they are always cited using quotation marks, not italics; the fact that the stubby On Bullshit and the epic Iliad are cited with italics, and not quotation marks, supports this. In no way would I recommend that we depart from established citational practice, only that we work out properly what the rational distinction being made is.  (u):Raifʻhār (t):Doremítzwr﴿ 00:43, 11 January 2010 (UTC)
I've been doing some investigating, but unfortunately do not have access to a copy of the book from which I first learned this convention. The books I have been able to examine either give no complete set of rules (the Chicago Manual of Style is frustrating in this respect), or do tend to support the idea of quotes used for portions of a larger work (this is noted in Peyton Hurt's book Bibliography and Footnotes: A Style Manual for Students). --EncycloPetey 01:13, 11 January 2010 (UTC)
Do you know if/when you will gain access to a copy of that book? (What is its title, BTW?) Do you want to wait until you’ve checked that, or shall we take Peyton Hurt’s lead on this matter?  (u):Raifʻhār (t):Doremítzwr﴿ 03:59, 11 January 2010 (UTC)

Titles for rhymes pages

Are there any norms for the titles of rhymes pages? For example Rhymes: -e (French) could be enormous, isn't it better to use that as an index page for Rhymes: -be and the other consonants we can put ahead of it? What about more syllables than just one? Mglovesfun (talk) 19:31, 10 January 2010 (UTC)

The rhymes are all named according to the IPA symbols beginning from the vowel (or diphthong) in the stressed syllable through to the end of the word. So, if there are a lot of French words ending in /e/, and with the stress on that final syllable, then yes there could be an enormous page as a result. You could conceivably choose to arrange the contents of the page differently for French (in English we group according to the number of syllables), but I wouldn't recommend using a different page naming scheme. --EncycloPetey 01:10, 11 January 2010 (UTC)
Technically, rhymes in French are not stress-based: all meaningful words (i.e. minus prepositions, pronouns...) in -/e/ are considered to rhyme! The French tradition is different and "length"-based. A rhyme based on only a final vowel is "poor"; one based on a final vowel and the preceding sound, or on a vowel+consonant coda, is "sufficient"; and one with three or more phonemes is "rich". Some schools of poetry have at times held that poor rhymes were not valid (so that marée and allée are not considered to rhyme)! Circeus 14:01, 11 January 2010 (UTC)
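The poor/sufficient/rich distinction described above can be sketched in a few lines of Python, which may help when deciding how to split a large French rhymes page. This is only an illustration under simplifying assumptions: words are given as hand-segmented lists of phonemes, `FRENCH_VOWELS` is a rough inventory supplied for the example, and the function names are hypothetical.

```python
# Rough, illustrative vowel inventory (nasal vowels as single list items).
FRENCH_VOWELS = {
    "a", "ɑ", "e", "ɛ", "ə", "i", "o", "ɔ", "u", "y", "ø", "œ",
    "ɑ̃", "ɛ̃", "ɔ̃", "œ̃",
}


def shared_tail(a, b):
    """Return the longest run of identical phonemes at the end of both words."""
    n = 0
    while n < len(a) and n < len(b) and a[-1 - n] == b[-1 - n]:
        n += 1
    return a[len(a) - n:]


def rhyme_richness(a, b):
    """Classify a pair of phoneme lists as none/poor/sufficient/rich."""
    tail = shared_tail(a, b)
    if not any(p in FRENCH_VOWELS for p in tail):
        return "none"  # no shared vowel, so no rhyme at all
    if len(tail) == 1:
        return "poor"
    if len(tail) == 2:
        return "sufficient"
    return "rich"


maree = ["m", "a", "ʁ", "e"]
allee = ["a", "l", "e"]
duree = ["d", "y", "ʁ", "e"]
print(rhyme_richness(maree, allee))  # only the final vowel is shared
print(rhyme_richness(maree, duree))  # final vowel plus the preceding sound
```

A naming scheme built on this would group marée with allée only at the single-vowel level, while marée and durée would also share a two-phoneme page.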
I think it should be language-dependent. Different languages have different traditions of what's considered to rhyme. See, e.g., oldish discussion at Wiktionary talk:About Hebrew#Rhymes.​—msh210 17:28, 11 January 2010 (UTC)
French (according to WT:AFR) isn't stressed, so I think Circeus is right. Mglovesfun (talk) 10:50, 13 January 2010 (UTC)
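
As a rough illustration of the stress-based naming convention described above (IPA from the vowel of the stressed syllable through to the end of the word), here is a minimal Python sketch. The function name `rhyme_suffix`, the vowel list, and the treatment of unmarked words are all assumptions made for the example, not anything Wiktionary actually runs. As noted in the replies, this stress-based rule may not suit French.

```python
# Hypothetical helper: derive a rhyme suffix such as "-e" from an IPA transcription.
# The vowel inventory below is a partial, illustrative list, not a complete one.
IPA_VOWELS = set("aeiouyɑɒæɐəɛɜɪɔʊʌøœɨʉɯɤɵɶ")


def rhyme_suffix(ipa: str) -> str:
    """Return '-' plus the IPA from the stressed vowel to the end of the word."""
    body = ipa.strip("/[]")
    # Start after the primary stress mark; if none is written, assume the
    # whole word is the stressed syllable (an assumption for monosyllables).
    stress = body.rfind("ˈ")
    stressed = body[stress + 1:] if stress != -1 else body
    # Skip the onset consonants of the stressed syllable.
    for i, ch in enumerate(stressed):
        if ch in IPA_VOWELS:
            rhyme = stressed[i:]
            break
    else:
        raise ValueError("no vowel found in transcription: " + ipa)
    # Syllable breaks and secondary stress marks are not part of the title.
    rhyme = rhyme.replace(".", "").replace("ˌ", "")
    return "-" + rhyme


print(rhyme_suffix("/ma.ˈʁe/"))
print(rhyme_suffix("/ˈsɛv.ɹəl/"))
```

With the transcriptions shown, marée would land on the same "-e" page as every other word stressed on final /e/, which is exactly why that page could become enormous.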

RFD and RFV request

I'm trying to archive these pages. It would greatly facilitate the process if, when listing multiple terms for one discussion, you used separate level-three headings (even if all discussion happens only under one of them) instead of the comma-separated level-two heading format. Additionally, when closing such discussions (striking the heading, and removing the {{rfv}} or {{rfd}} from the page), if you say one of the following "magic" words, it will be picked up so that I don't have to intervene manually to tell it which template to use.

  • '''passed''', '''kept''', '''cited''' {{rfv-passed}} {{rfd-passed}}
  • '''failed''', '''deleted''' {{rfd-failed}} {{rfv-failed}}
  • '''striking''' {{rfd-archived}} {{rfv-archived}}

Common variations on the above are also matched, provided they are emboldened ('''RFVFailed''', '''rfd passed''', etc.). This is really a request for things to "stay as they are", as the code for this detection was written based on what normally happens on those pages. Certainly, if it would be a hassle to do, don't bother; I can do it for you. Conrad.Irwin 19:15, 11 January 2010 (UTC)
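
For anyone curious how such "magic word" detection might work, here is a minimal Python sketch. It is not the actual archiving code; the function name `closing_template`, the normalisation rules, and the `kind` parameter are assumptions made for illustration. It only looks at emboldened text and maps the listed words (and variations such as '''RFVFailed''' or '''rfd passed''') to the matching template.

```python
import re

# Magic words from the list above, mapped to the template suffix they select.
OUTCOMES = {
    "passed": "passed", "kept": "passed", "cited": "passed",
    "failed": "failed", "deleted": "failed",
    "striking": "archived",
}

# Wiki bold markup: three apostrophes on each side.
BOLD = re.compile(r"'''(.+?)'''")


def closing_template(comment, kind):
    """Return e.g. '{{rfv-failed}}' for a closing comment, or None.

    kind is 'rfv' or 'rfd', depending on which page is being archived.
    """
    for bold in BOLD.findall(comment):
        # Normalise: lower-case, keep letters only, drop a leading rfv/rfd.
        word = re.sub(r"[^a-z]", "", bold.lower())
        word = re.sub(r"^rf[vd]", "", word)
        outcome = OUTCOMES.get(word)
        if outcome:
            return "{{%s-%s}}" % (kind, outcome)
    return None


print(closing_template("'''RFVFailed''', no citations given.", "rfv"))
print(closing_template("'''rfd passed''' per consensus.", "rfd"))
print(closing_template("I think this failed.", "rfv"))
```

The last call returns None because the word is not emboldened, which is the case that would still need manual intervention.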

Reverse index

Would it make sense to create a reverse index by language? A reverse index (I don't know the proper English expression) is similar to the current index except it's sorted by a reverse order of the letters, starting with the final letter. This index may also serve as a rhyme dictionary for some of the languages. --Panda10 23:02, 11 January 2010 (UTC)
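
To make the idea concrete, here is a minimal Python sketch of a reverse index: the words are sorted by their letters read from the end. The function name `reverse_index` is hypothetical, and the sketch assumes plain code-point ordering; a real per-language index would presumably need that language's own collation rules.

```python
def reverse_index(words):
    """Sort words by their reversed spelling, ignoring case."""
    return sorted(words, key=lambda w: w.casefold()[::-1])


entries = ["nation", "station", "cat", "bat", "ration"]
for word in reverse_index(entries):
    # Right-align so the shared endings line up, as in a printed reverse index.
    print(word.rjust(10))
```

Words with the same ending come out next to each other (nation, ration, station; then bat, cat), which is why such an index can double as a rough rhyme dictionary for languages whose spelling tracks pronunciation closely.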

Is "idiom" still a valid header?

Noting our to do list, if ===Idiom=== is no longer a valid header for English, why should it be for other languages? Idiom is not a part of speech in the same way that verb and noun are. Shouldn't we keep the categories <langname> idioms but remove the header? Mglovesfun (talk) 17:22, 12 January 2010 (UTC)

  1. Each language could come to its own conclusion until such time as there is a clear benefit from PoS standardization. Is there any clear benefit from amending WT:ELE to remove it? I would like to hear from those active in other languages why it was useful so as to know whether it would be something to discourage or deprecate or whether it should be explicitly "legislated" by each language.
  2. Several other permitted PoS headers are also not "parts of speech" (Proverb, Phrase, the 3 abbreviations headers, Contraction, Symbol, Letter, Number). Note that they are approximately equal in number to the number that are actual parts of speech. DCDuring TALK 18:16, 12 January 2010 (UTC)
Oh I agree, I think "phrase" and "contraction" are only to be used when nothing else fits. Personally I'd always advocate replacing "Idiom" with "Phrase" at worst. Mglovesfun (talk) 10:26, 13 January 2010 (UTC)

euphemisms

Interesting that Category:Euphemisms and euphemism define them as terms used to indirectly refer to something unpleasant, controversial or vulgar. Considering entries like I am thirsty, blessed event and disabled person are tagged as euphemisms perhaps we should rethink this definition. After all, is it really unpleasant, controversial or vulgar to be thirsty, born or have a disability? Tooironic 05:00, 13 January 2010 (UTC)

I think all three are unpleasant in some way. But yes, point taken. Mglovesfun (talk) 13:16, 14 January 2010 (UTC)

My bot (continued)

Earlier in the year (December 2009), I posted a request for Darkicebot to be unblocked. I was told that I should do the few test edits for my bot through my main account, which is what I have done. I have made all the forms of the verb frapar and batar using my bot code through my main account, and I was wondering if maybe my bot could run for the bot flag again now? Thanks, Razorflame 08:19, 14 January 2010 (UTC)

I'd imagine you can run for bot flag as often as you like; the question is, if there's no chance of you getting it, why bother? It will only hurt your feelings. Mglovesfun (talk) 12:59, 14 January 2010 (UTC)
Why do you say that there isn't a chance of me getting it if I've proven that I know the language? I updated the Ido verb conjugation charts to include all of the possible form-ofs for verbs, and I've written the program to be exactly like the one I used for Esperanto, so I honestly don't see what the big deal is? Razorflame 13:02, 14 January 2010 (UTC)

Italian plurals

After going about my business, editing Ido and Esperanto, I've noticed quite a few Italian plural entries for both adjectives and nouns that do not include the |p in them. My question is this: Why aren't they included, and shouldn't we include them? All opinions are welcome :). Cheers, Razorflame 12:50, 14 January 2010 (UTC)

It doesn't matter a great deal as the definition says [] Plural form of [] Note that on the French Wiktionary a decision was made not to put {{m|p}} (etc.) as it duplicates the definition line. Mglovesfun (talk) 12:58, 14 January 2010 (UTC)
All of our Spanish entries already automatically add the p, so I don't see why Italian shouldn't if we already use it for Spanish, yeah? Razorflame 13:00, 14 January 2010 (UTC)

ſ (long s) typographic variants

Below is a discussion at Wiktionary:Requests_for_deletion#.C5.BFeveral:

This is defined as an obsolete spelling of "several". In fact, it is nothing of the sort. That is simply what s looked like before the nineteenth century except at the ends of words. Unless hundreds of thousands of other such "spelling variants" that use the non-terminal s are to be admitted, it makes little sense to allow this one. At the moment ſeveral is the only word listed among English obsolete spellings. Also allowing this entry would by implication disqualify any entries that have only been recorded with the ſ. --82.0.9.23 17:06, 14 January 2010 (UTC)
I broadly agree; we should probably have a policy against these, in the same way that we have equus, but not EQUUS, EQVVS, or even eqvvs. Possibly move this to the Beer Parlour, then delete it when it is officially a 'bad entry title', unless we already have such a policy. Mglovesfun (talk) 17:11, 14 January 2010 (UTC)

I suggest this should indeed be regarded as a bad title. --82.0.9.134 17:29, 14 January 2010 (UTC)

What do we do about words in e.g. Arabic that have a different spelling when they are "final forms" (if that makes any sense — I don't know much about it)? This seems possibly analogous: it's a typographical variant rather than a different word. Equinox ◑ 17:33, 14 January 2010 (UTC)
  • My feeling is that typographical variants should be hard redirects. (At least in the case of long esses; conceivably other cases might be different.) Ƿidsiþ 17:44, 14 January 2010 (UTC)

Pre-English section stuff

When most people search Wiktionary for a word, they're probably just looking for a bit of information on the English word. Right now, we have quite a few things that are before that section:

  1. The "Table of Contents" box. Nobody uses this, and it gets new users really confused. People arrive at a page, and all they see is a huge block of text on the left, sometimes a WP link, nothing to help them find the information that they're looking for. I know that users can click "hide" to shrink it, but that's not clearly visible, and no one bothers to click it anyway. I think that either the Table of Contents box should be collapsed by default, or it should be removed entirely. (I don't know technically how either of these things could be done, but there's probably a way.)
  2. The ==Translingual== header. Generally, nobody's looking for this. I really think that this should be placed below the ==English== section, before the other languages. (English > Translingual > Aari > Aasax, etc.)
  3. Interwiki boxes, images, and other random pieces that are really language-specific and should be placed in one of the sections. I seem to recall someone proposing a bot to place these correctly, but I don't remember what came of that.

Should these be fixed, the information on the English word will be visible as soon as the page loads, right near the top, which is generally where people would expect it to be. --Yair rand 22:41, 14 January 2010 (UTC)

