User talk:Byrial
Welcome to Wikidata, Byrial!
Wikidata is a free knowledge base that you can edit! It can be read and edited by humans and machines alike and you can go to any item page now and add to this ever-growing database!
Need some help getting started? Here are some pages you can familiarize yourself with:
- Introduction – An introduction to the project.
- Wikidata tours – Interactive tutorials to show you how Wikidata works.
- Community portal – The portal for community members.
- User options – including the 'Babel' extension, to set your language preferences.
- Contents – The main help page for editing and using the site.
- Project chat – Discussions about the project.
- Tools – A collection of user-developed tools to allow for easier completion of some tasks.
Please remember to sign your messages on talk pages by typing four tildes (~~~~); this will automatically insert your username and the date.
If you have any questions, don't hesitate to ask on Project chat. If you want to try out editing, you can use the sandbox to try. Once again, welcome, and I hope you quickly feel comfortable here, and become an active editor for Wikidata.
Best regards!
--Ymblanter (talk) 07:46, 16 March 2013 (UTC)
RFTA
[edit]The new policy is approved now. If you reopen your request, I can promote you. Regards, Vogone talk 21:37, 14 April 2013 (UTC)
- Permission granted. Thanks for volunteering. Regards, Vogone talk 21:58, 14 April 2013 (UTC)
- Thank you, Byrial (talk) 22:01, 14 April 2013 (UTC)
"No reason to invalidate existing translations by spliting the translation unit"
[edit]Hi I think that even for exsiting translations it is acceptable to split a translation unit into several parts. But in this case, a translation admin should correct the translations, as an act of goodwill. Do you agree? --Michgrig (talk) 11:32, 16 April 2013 (UTC)
- Hi! Yes, it is certainly acceptable – and even desirable – in many cases, e.g. when the translation unit is a list with many or long list items. (But when you do it, please don't mark the edit as minor). If the translation unit is small, there generally is no reason to bother the translators. In the actual case there was no problem at all, because the edit was in newly added text which was not yet marked for translation, and I later undid my own correction and apologized to the editor. Byrial (talk) 11:49, 16 April 2013 (UTC)
- "In the actual case there was no problem at all" - I've seen that, just wanted to discuss the general rule. BTW, if I'm not mistaken, you wanted to split long lists and tables into several translation units. Are you planning to do this? --Michgrig (talk) 12:07, 16 April 2013 (UTC)
- Yes, I plan to do that when I find cases where I consider it will help the translators. It is a nuisance trying to translate large chunks of text which cannot be seen in total in the text windows in the translation tool, and to try to find out what is changed after the original text have been edited. Byrial (talk) 19:53, 16 April 2013 (UTC)
- Yeah, it's so true... --Michgrig (talk) 20:26, 16 April 2013 (UTC)
- Yes, I plan to do that when I find cases where I consider it will help the translators. It is a nuisance trying to translate large chunks of text which cannot be seen in total in the text windows in the translation tool, and to try to find out what is changed after the original text have been edited. Byrial (talk) 19:53, 16 April 2013 (UTC)
- "In the actual case there was no problem at all" - I've seen that, just wanted to discuss the general rule. BTW, if I'm not mistaken, you wanted to split long lists and tables into several translation units. Are you planning to do this? --Michgrig (talk) 12:07, 16 April 2013 (UTC)
Re: Minor edit which was not so minor
[edit]Nothing... I am worried about this edit (not marked for translation yet). --β16 - (talk) 12:32, 16 April 2013 (UTC)
Property:P463 "member of"
[edit]Hi,
The above property is now available and can be used on items. I noticed you participated in its discussion. -- Docu at 16:15, 25 April 2013 (UTC)
List of bureaucrats
[edit]Hi
You changed the page Wikidata:Bureaucrats by adding translatable text. But there is another page Wikidata:List of bureaucrats with a list of 'crats that contains LangSwitches. How do you think: maybe it's better to mark it to translation and then to TNT it on the page about bureaucrats? --Michgrig (talk) 08:26, 2 May 2013 (UTC)
The same is about the list of admins. --Michgrig (talk) 08:28, 2 May 2013 (UTC)
- I also made the same change to Wikidata:Bots and Wikidata:List of bots earlier.
- My concern was that the list pages should both be able to function both as indepedent pages (I guessed that the creators want that because these list are in the project namespace) and be able to be transcluded into other pages. I don't think that is possible with TNT, and that the TNT template works with pages outside the template namespace as it is now. Byrial (talk) 10:52, 2 May 2013 (UTC)
- PS. I just saw your Russian translations of the new messages at Wikidata:Bureaucrats and Wikidata:Administrators, and noticed that you only used one plural form. I thought that the Russian language uses different plural forms dependent of the last digit of the number, so I inserted the PLURAL switch to allow that. Byrial (talk) 11:07, 2 May 2013 (UTC)
- The TNT template works in other namespaces. An example of this is Wikidata:List of properties where subpages are transcluded into the main one.
- P.S. Yes, generally the Russian language has more than two plural forms. However in that very phrases there are only two: for numbers ending with 1 (except for 11) and for all other numbers. --Michgrig (talk) 12:36, 2 May 2013 (UTC)
- Yes, I see that it works, so now I think it is better to use TNT. Thank you for telling me about it. I will not have time to change anything today, but I will do it tomorrow evening, if you don't do it earlier. -- Byrial (talk) 13:50, 2 May 2013 (UTC)
Flooders
[edit]Hejsan, vad kallar ni "Flooders" på Dansk? -- Lavallen (block) 18:50, 7 May 2013 (UTC)
- Jeg kender ikke noget ord for det. "Flood" hedder "oversvømmelse", så man kan måske bruge:
- a flooder = en oversvømmer
- flood flag = oversvømmelsesflag
- men det er ikke ord som findes i forvejen. Hvad bruges på svensk? Byrial (talk) 19:29, 7 May 2013 (UTC)
- Det står "Botanvändare" i svenska loggen, vilket kan förväxlas med bot/robot varför jag hellre skulle se ett annat ord. "Flood" kan översättas "Översvämning", men "Flod" om man syftar på tidvatten eller på Bibelns Noa. -- Lavallen (block) 19:35, 7 May 2013 (UTC)
- Det er også "flod" (modsat "ebbe") på dansk om tidevand, men det synes jeg ikke kan bruges her. Byrial (talk) 19:56, 7 May 2013 (UTC)
- Det står "Botanvändare" i svenska loggen, vilket kan förväxlas med bot/robot varför jag hellre skulle se ett annat ord. "Flood" kan översättas "Översvämning", men "Flod" om man syftar på tidvatten eller på Bibelns Noa. -- Lavallen (block) 19:35, 7 May 2013 (UTC)
Empty items
[edit]Hey, would you mind taking another database dump of empty items? Thanks, FrigidNinja 02:34, 14 May 2013 (UTC)
- Not sure where toplace them, but I created Wikidata:Requests for deletions/Bulk/3 and Wikidata:Requests for deletions/Bulk/4. Byrial (talk) 07:04, 14 May 2013 (UTC)
Adminship
[edit]Hello, I want to nominate you to adminship. I just made Wikidata:Requests for permissions/Administrator/Byrial. If you be interested to be an admin in this project please add your comment there after Candidate acceptance. Regards--DangSunM (talk) 20:18, 14 May 2013 (UTC)
- Thank you for acceptance. Good luck. Cheers--DangSunM (talk) 22:50, 14 May 2013 (UTC)
- Thank you, also for the nomination. Byrial (talk) 23:07, 14 May 2013 (UTC)
Help for small translation
[edit]Just go here [1] and translate just one description line in languages you know. It will take few seconds and help us a lot. Thanks.--Nizil Shah (talk) 22:53, 14 May 2013 (UTC)
- Sure, I added for Danish and Esperanto. Byrial (talk) 23:05, 14 May 2013 (UTC)
Teknikaliteter
[edit]Ursäkta en svagt tekniskt begåvad man!
"A PropertyIntervalSnak describes that an Entity has a certain Property with all values that are within a given range of Values. At present, it is intended to support this for intervals of time, which are extremely common in Wikipedia, but it could also be supported for other intervals or (more generally) sets of Values such as numbers or geographic locations." meta:Wikidata/Data model#PropertyIntervalSnak.
Betyder det att man kan använda det här för att beskriva att Frederik IX of Denmark (Q151312) var kung av Danmark, 1947–1972 i ett enda claim? -- Lavallen (block) 18:04, 17 May 2013 (UTC)
- Interessant, den tekst har jeg ikke bemærket før. Ja, det ser ud til at man vil kunne angive Frederiks regeringstid med en påstand med kun en kvalifikator som har et datointerval som værdi, hvis det bliver som beskrevet. (Der er forbeholdet: This document is a draft, and should not be assumed to represent the ultimate structure.). Byrial (talk) 18:34, 17 May 2013 (UTC)
Congratulations, Dear Administrator!
[edit]English | español | français | العربية | Nederlands | русский | +/−
Byrial, congratulations! You now have the rights of administrator on Wikidata. Please take a moment to read the Wikidata:Administrators page and watchlist related pages (in particular Wikidata:Project chat and Wikidata:Administrators' noticeboard), before launching yourself into page deletions, page protections, account blockings, or modifications of protected pages.
Please feel free to join us on IRC: #wikidata-admin @ irc.freenode.net. If you need access, you can flag someone down at #wikimedia-wikidata @ irc.freenode.net. You may find Commons:Guide to adminship to be useful reading, although it doesn't always completely apply here at Wikidata. You may also want to consider adding yourself to meta:Template:Wikidata/Ambassadors, and to any similar page on your home wiki if one exists. (WT:Wikidata/Wikidatans on En-WP.)
Please also add/update the languages you speak to your listing at Wikidata:List of administrators. Again, welcome to the admin corps!
Legoktm (talk) 02:39, 22 May 2013 (UTC)
- Congrats! --Michgrig (talk) 06:47, 22 May 2013 (UTC)
- Thank you to you all! Byrial (talk) 06:49, 22 May 2013 (UTC)
- As a newly appointed admin, you may add yourself to the Markadmins gadjet :) --Michgrig (talk) 06:57, 22 May 2013 (UTC)
- Done Byrial (talk) 07:04, 22 May 2013 (UTC)
- As a newly appointed admin, you may add yourself to the Markadmins gadjet :) --Michgrig (talk) 06:57, 22 May 2013 (UTC)
- Thank you to you all! Byrial (talk) 06:49, 22 May 2013 (UTC)
- Welcome to our admin group!!!!!--DangSunM (talk) 10:50, 22 May 2013 (UTC)
- What a stroke of luck! I didn't even see that cake on my talk page! :-) --Ricordisamoa 11:12, 22 May 2013 (UTC)
- Hoppsan, missade den omröstningen. Har lagt sidan på bevakning om ytterligare befordran är på gång!!! :) -- Lavallen (block) 11:42, 22 May 2013 (UTC)
- What a stroke of luck! I didn't even see that cake on my talk page! :-) --Ricordisamoa 11:12, 22 May 2013 (UTC)
Jurors 1–12
[edit]Hey Byrial,
I was just wondering what the consensus ended up with regarding items such as Q12052419. I know it's in use, but it will only ever be used in 1 item, and I don't think its notability can be justified. I think that is why Vogone deleted it earlier. Making an item for every answer to every statement on Wikidata would be like creating an article for every non-bluelink on Wikipedia, in my opinion. Del♉sion23 (talk) 23:52, 30 May 2013 (UTC)
- I am not sure there was consensus for neither keeping them nor deleting them. Personally I agree with you about the notability, but I restored them because I think they were deleted for the wrong reason: Which is that they were on my bulk deletion request where I wrongly claimed that they were not used as values in any statements. (See Wikidata:Administrators' noticeboard#Very large bulk deletion request) I am now removing items which are used as values in statements in other items from the bulk deletion request, and restoring those of them which already have been deleted – like the juror items and some others. If it is decided to delete them due to missing notability, that should be stated as the deletion reason, not just that they are empty. Byrial (talk) 00:22, 31 May 2013 (UTC)
Request
[edit]You can remove this notice at any time by removing the {{Talkback}} or {{Tb}} template.
--Ricordisamoa 12:10, 4 June 2013 (UTC)
- Thanks for the list! I'm going to do something of it... --Ricordisamoa 19:09, 6 June 2013 (UTC)
- Anyway, most of items with Polish description "włoskie obec" should have already been checked by the bot. --Ricordisamoa 19:11, 6 June 2013 (UTC)
- Could you please generate User:SamoaBot/włoskie obec and User:SamoaBot/szablon Wikipedia again? --Ricordisamoa 12:39, 9 June 2013 (UTC)
- With pleasure - when I have access to new data. I hope that a new database dump for Wikidata will available in less than a week, and when I have it I will update the lists. Byrial (talk) 13:14, 9 June 2013 (UTC)
- Thanks in advance --Ricordisamoa 14:10, 9 June 2013 (UTC)
- Done with data from 2013-06-10. Byrial (talk) 10:25, 12 June 2013 (UTC)
- Thanks a lot! Maybe we could make it a regular database report? --Ricordisamoa 17:57, 13 June 2013 (UTC)
- Is there a need for that? I thought that these incorrect descriptions was added in the past by a mistake that is now corrected. If the mistake is still being made, can it not be stopped? Byrial (talk) 18:20, 13 June 2013 (UTC)
- Thanks a lot! Maybe we could make it a regular database report? --Ricordisamoa 17:57, 13 June 2013 (UTC)
- Done with data from 2013-06-10. Byrial (talk) 10:25, 12 June 2013 (UTC)
- Thanks in advance --Ricordisamoa 14:10, 9 June 2013 (UTC)
- With pleasure - when I have access to new data. I hope that a new database dump for Wikidata will available in less than a week, and when I have it I will update the lists. Byrial (talk) 13:14, 9 June 2013 (UTC)
- Could you please generate User:SamoaBot/włoskie obec and User:SamoaBot/szablon Wikipedia again? --Ricordisamoa 12:39, 9 June 2013 (UTC)
Deletions
[edit]Please delete http://www.wikidata.org/wiki/Q12720187?uselang=ro and http://www.wikidata.org/wiki/Q12720187?uselang=ro. They were duplicates. 79.112.11.215 04:43, 16 June 2013 (UTC)
- You gave the same link twice, but I deleted Q12721728 and Q12720187 which I found by looking in your edit history. You can also use Wikidata:Requests for deletions (or the shortcut WD:RFD) to request deletions. Byrial (talk) 04:57, 16 June 2013 (UTC)
- Sorry. You proceeded right. Thank you. I searched the shortcut to Requests for deletions in the left side of the page like on Commons (Nominate for deletion), but I didn't find it. 79.112.11.215 05:06, 16 June 2013 (UTC)
- That's OK. Byrial (talk) 05:08, 16 June 2013 (UTC)
- Sorry. You proceeded right. Thank you. I searched the shortcut to Requests for deletions in the left side of the page like on Commons (Nominate for deletion), but I didn't find it. 79.112.11.215 05:06, 16 June 2013 (UTC)
P107 statistics
[edit]I've been curious for a while about how many P107 claims there were by each of the GND main types, so your comment in the current P107 about the 1,221,158 items with GND type person intrigued me. I've posted a question in Project chat that might interest you and the rest of the Wikidata community: Wikidata:Project_chat#P107:_are_statistics_on_the_use_of_each_GND_main_type_available.3F. Best, Emw (talk) 01:40, 22 June 2013 (UTC)
- Done. Please see User:Byrial/P107 with more statistics than you asked about. Byrial (talk) 10:53, 22 June 2013 (UTC)
list/search articles from a language(ml.wikipedia.org)
[edit]How to list/search articles from ml.wikipedia.org which has no other interwiki link connected; reason I wanted it because, malayalam article spark plug(Q13115194) and english article of the same(Q193340) has different wikidata connection. There are lots in this type!, to clear those I need a list of that.Thank you.--117.213.27.239 08:13, 23 June 2013 (UTC)
- You want a list like this: User:Byrial/Items with only Malayalam link?
- Thank you. Think this is what I searched for, how did you get this result , is there any link to get this search? Anyway I copied the result you gave.--117.213.27.239 09:36, 23 June 2013 (UTC)
- And how to list category and template(skipped in above list) , almost all of those doesn't have interwiki links.-- 117.213.27.239 09:38, 23 June 2013 (UTC)
- Thank you. Think this is what I searched for, how did you get this result , is there any link to get this search? Anyway I copied the result you gave.--117.213.27.239 09:36, 23 June 2013 (UTC)
- Oh I just read it, these "
lists are made from database dumps". I will look at it; and if there is any easy link for listing please let know, Thank you. -- 117.213.27.239 09:54, 23 June 2013 (UTC)- Yes, I made the list from the public database dumps (see Wikidata:Database download), in this case the data is extracted from the file wikidatawiki-20130610-pages-articles.xml.bz2. I have parsed it and put most of the content in a local database at my computer ready for use. I use programs written in C for the analysis, and you can have a copy under GPL license if you want.
- I skipped the category namespace because the list was already very long. A list for the category links is now at User:Byrial/Items with only Malayalam category link. It seems that there is no items which links to the template namespace. I do not understand why but I can find no links starting with either "Template:" or "ഫലകം:". Maybe they were never imported to Wikidata by any bot, or maybe it is some kind of bug. Byrial (talk) 10:15, 23 June 2013 (UTC)
- Thank you for taking this effort. Some Template/ഫലകം are still in wikidata (Q6192836 for Template:thumbs up/ഫലകം:കൊള്ളാം, Q6171351 for ഫലകം:Infobox film). It may be a bug.-- 117.213.27.239 10:30, 23 June 2013 (UTC)
- These 2 items are not in my local database. I will investigate my programs for bugs. Thank you for bringing my attention to this problem. Byrial (talk) 10:49, 23 June 2013 (UTC)
- I must have been confused. Both Q6192836 and Q6171351 are in my local database. I know 2790 items which links to Malayalam templates, but all expect Q10987689 (ml:ഫലകം:User Animator 1) also have links to other languages, and Q10987689 also had a link to English which is removed now when it was imported to Wikidata. I guess that no bot imported items directly from the Malayalam template namespace, so only those templates linked to from other Wikipedias have been imported. Byrial (talk) 14:18, 23 June 2013 (UTC)
- Yes I agree , malayalam templates interwiki-ed to english wikipedia is now in wikidata others still are not connected. --117.213.28.114 17:14, 23 June 2013 (UTC)
- I must have been confused. Both Q6192836 and Q6171351 are in my local database. I know 2790 items which links to Malayalam templates, but all expect Q10987689 (ml:ഫലകം:User Animator 1) also have links to other languages, and Q10987689 also had a link to English which is removed now when it was imported to Wikidata. I guess that no bot imported items directly from the Malayalam template namespace, so only those templates linked to from other Wikipedias have been imported. Byrial (talk) 14:18, 23 June 2013 (UTC)
- These 2 items are not in my local database. I will investigate my programs for bugs. Thank you for bringing my attention to this problem. Byrial (talk) 10:49, 23 June 2013 (UTC)
- Thank you for taking this effort. Some Template/ഫലകം are still in wikidata (Q6192836 for Template:thumbs up/ഫലകം:കൊള്ളാം, Q6171351 for ഫലകം:Infobox film). It may be a bug.-- 117.213.27.239 10:30, 23 June 2013 (UTC)
You may find User:Byrial/numbermerge/ml-en useful. It is generated by a new program I made to find possible merge candidates for items with numbers in the page title. The idea is that if a page in language 1 with a number in the title links to a page in language 2 with the same number in the title, then other pages in language 1 with the same text, but other numbers are candidates to link to pages in language 2 with the same text as before and matching numbers. Byrial (talk) 14:01, 24 June 2013 (UTC)
- Result is perfect , all need to be merged.--117.203.64.232 16:36, 24 June 2013 (UTC)
- But not complete,ml:വർഗ്ഗം:13-ആം നൂറ്റാണ്ടിൽ അസ്തമിച്ച സാമ്രാജ്യങ്ങൾ en:Category:13th-century disestablishments are same category but not included in this result.-- 117.203.64.232 16:46, 24 June 2013 (UTC)
- No, for a pair of items to be found, you have to have at least one case where the same texts already are linked together with other numbers. Otherwise the program cannot know that "-ആം നൂറ്റാണ്ടിൽ അസ്തമിച്ച സാമ്രാജ്യങ്ങൾ" are releated to "th-century disestablishments" as it have no idea about the meanings of the texts. (The pattern count in each line indicates how many pages in the two languages are already linked using the same texts but other numbers). Byrial (talk) 16:54, 24 June 2013 (UTC)
- Oh got it, 'at least one connection of a pattern of text' + number.-- 117.203.64.232 17:32, 24 June 2013 (UTC)
- No, for a pair of items to be found, you have to have at least one case where the same texts already are linked together with other numbers. Otherwise the program cannot know that "-ആം നൂറ്റാണ്ടിൽ അസ്തമിച്ച സാമ്രാജ്യങ്ങൾ" are releated to "th-century disestablishments" as it have no idea about the meanings of the texts. (The pattern count in each line indicates how many pages in the two languages are already linked using the same texts but other numbers). Byrial (talk) 16:54, 24 June 2013 (UTC)
- But not complete,ml:വർഗ്ഗം:13-ആം നൂറ്റാണ്ടിൽ അസ്തമിച്ച സാമ്രാജ്യങ്ങൾ en:Category:13th-century disestablishments are same category but not included in this result.-- 117.203.64.232 16:46, 24 June 2013 (UTC)
Your amazing lists
[edit]Firstly I would just like to let you know that the lists that you keep creating are amazing :) Any chance you could also take a look at Wikidata:Project_chat#Items_with_no_... ? :) Keep up the amazing work (I really should look at your source code soon) ·addshore· talk to me! 10:40, 26 June 2013 (UTC)
- Also I am just wondering! Do these lists have to be generated via dumps? Can then not be generated via the database replicas? If not, what is not included in the replicas that is needed for the list generation? ·addshore· talk to me! 22:52, 26 June 2013 (UTC)
- The replicated database at the Toolserver does not contain the actual page content, and the statements about items are only available by parsing the pages of the items. Byrial (talk) 23:00, 26 June 2013 (UTC)
- also another quick question, how automated are your lists in being generated? Do you have to manually run the generator for each after each new dump? ·addshore· talk to me! 08:53, 30 June 2013 (UTC)
- Nothing is automated. I manually run the list generation programs and upload the lists. I know it is possible to let a bot do it, but I like to see the contents before publication and make sure that it looks OK. I also often consider if the content is still relevant and/or if it can be improved. Byrial (talk) 14:10, 1 July 2013 (UTC)
- also another quick question, how automated are your lists in being generated? Do you have to manually run the generator for each after each new dump? ·addshore· talk to me! 08:53, 30 June 2013 (UTC)
- The replicated database at the Toolserver does not contain the actual page content, and the statements about items are only available by parsing the pages of the items. Byrial (talk) 23:00, 26 June 2013 (UTC)
RfC on Wikidata's primary sorting property
[edit]You recently participated in a deletion discussion for P107 - main type (GND). The discussion has been closed, as it is clear that a resolution won't come from PfD, and an RfC has been opened on the matter at Wikidata:Requests for comment/Primary sorting property. You are invited to participate there. Please note that this is a mass delivered message, and that I will not see any replies you leave on this page.
Yours, Sven Manguard Wha? 18:23, 30 June 2013 (UTC)
Another list request
[edit]Hi, again thanks for your amazing lists. I am interested in list Wikidata:Database reports/Missing links/cswiki, which will exclude items with template in title (as cswiki is using different language userbox system, thus babel-templates are occuping most of the list now) and maybe with 200 items maximum. In past i was creating (and updating) the list from another prewikidata tool, and that list helped me a lot with solving interwiki issues and encourage me (and other editors) to create new articles. --Jklamo (talk) 15:58, 2 July 2013 (UTC)
- Would User:Byrial/cs do? It is a list of the items with most links but no link to Czech. Items with links to English or Slovak which are not to articles are excluded. If there are no links to English or Slovak, I cannot tell if they are articles (but most are properly not). Byrial (talk) 17:01, 2 July 2013 (UTC)
- For sure, thanks a lot (adding enwiki and cswiki links was great idea). I will process anyway the list before publishing it on cswiki, so i will delete "unwanted" items. --Jklamo (talk) 14:24, 3 July 2013 (UTC)
I think we can make a list of items which have only link to Talk, User, User talk, Wikipedia talk, File, MediaWiki, Template talk, Category talk, and "Unknown" (Should be special pages) namespace, which do not meet Wikidata:Notability, and if possible, delete them.--GZWDer (talk) 05:32, 4 July 2013 (UTC)
- I also think we can make a list of items with no links, no use in statements, and only statements for P21 (sex) or P107 (GND type) or source, and if possible, delete them.--GZWDer (talk) 05:36, 4 July 2013 (UTC)
- User:Byrial/namespace list is a list of items which only have sitelinks to talk, user, file, MediaWiki and special pages. List of unused items with no links will come later. Byrial (talk) 20:25, 5 July 2013 (UTC)
Numbers merge
[edit]I think you probably missed my message on Project chat about new list requests. Never mind I am posting it here. I will be through with ml-en list shortly. I want to request you for ta-en and hi-en lists. You can make them one at a time in your spare times. Thanks.--Vyom25 (talk) 18:35, 4 July 2013 (UTC)
- Done: ta-en and hi-en. The Tamil Wikipedia doesn't seem to use Tamil numbers (೦௧௨௩௪௫௬௭௮௯), so I didn't learn the program about those. But the "hi-en" list have matches for both European and Devanagari numbers in the Hindi Wikipedia. Byrial (talk) 23:26, 4 July 2013 (UTC)
- Thanks a lot. I will start working on it soon.--Vyom25 (talk) 04:52, 5 July 2013 (UTC)
- I went through the hi-en page. 99% were merged and around 25 items (with en link) related to Wimbledon championship were merged with other duplicates of other language so I couldn't merge them. Please update the list as soon as you get fresh data. I don't know about additional pages but these items will definitely show up. Thanks.--Vyom25 (talk) 12:34, 8 July 2013 (UTC)
- I think you missed this message.--Vyom25 (talk) 09:06, 11 July 2013 (UTC)
- Hi, I didn't miss it, but I have no fresh data yet. The 2013-07-10 database dump (or rather the file in it that I use – the dump is still in progress) became available today, and I am currently importing data to my local database. This will take some time, but I will start to generate new reports with the new data today or tomorrow. Byrial (talk) 09:36, 11 July 2013 (UTC)
- Ooops... sorry for pressing you this hard; you are doing an amazing job creating all this lists. Thank you for that. And sorry again. Take your time.:)--Vyom25 (talk) 06:17, 12 July 2013 (UTC)
- Hi, Can you please create mr-en and gu-en for numbers merge? and please update ta-en in your next run. Thanks.--Vyom25 (talk) 12:10, 18 July 2013 (UTC)
- Done numbermerge mr-en and gu-en. As a special bonus also Done countrymerge mr-en, gu-en, ta-en, hi-en. :) Byrial (talk) 12:57, 18 July 2013 (UTC)
- Well your superb work has left my stocks of thanks in shortfall. But will not hesitate to deplete it further. Thank you very much. ;)--Vyom25 (talk) 05:27, 19 July 2013 (UTC)
- Please update numbersmerge of hi-en, mr-en and countrymerge of mr-en. Also create te-en for both numbersmerge and countrymerge and ml-en for countrymerge. Thank you.--Vyom25 (talk) 06:48, 24 July 2013 (UTC)
- Done the new lists. The database dump from 2013-07-10 is still the newest, so I cannot update yet. Byrial (talk) 15:29, 24 July 2013 (UTC)
- Please create sa-en list for numbersmerge and countrymerge. Thank you.--Vyom25 (talk) 12:09, 24 August 2013 (UTC)
- Done the new lists. The database dump from 2013-07-10 is still the newest, so I cannot update yet. Byrial (talk) 15:29, 24 July 2013 (UTC)
- Please update numbersmerge of hi-en, mr-en and countrymerge of mr-en. Also create te-en for both numbersmerge and countrymerge and ml-en for countrymerge. Thank you.--Vyom25 (talk) 06:48, 24 July 2013 (UTC)
- Well your superb work has left my stocks of thanks in shortfall. But will not hesitate to deplete it further. Thank you very much. ;)--Vyom25 (talk) 05:27, 19 July 2013 (UTC)
- Done numbermerge mr-en and gu-en. As a special bonus also Done countrymerge mr-en, gu-en, ta-en, hi-en. :) Byrial (talk) 12:57, 18 July 2013 (UTC)
- Hi, Can you please create mr-en and gu-en for numbers merge? and please update ta-en in your next run. Thanks.--Vyom25 (talk) 12:10, 18 July 2013 (UTC)
- Ooops... sorry for pressing you this hard; you are doing an amazing job creating all this lists. Thank you for that. And sorry again. Take your time.:)--Vyom25 (talk) 06:17, 12 July 2013 (UTC)
- Hi, I didn't miss it, but I have no fresh data yet. The 2013-07-10 database dump (or rather the file in it that I use – the dump is still in progress) became available today, and I am currently importing data to my local database. This will take some time, but I will start to generate new reports with the new data today or tomorrow. Byrial (talk) 09:36, 11 July 2013 (UTC)
- Thanks a lot. I will start working on it soon.--Vyom25 (talk) 04:52, 5 July 2013 (UTC)
- Done at User:Byrial/numbermerge/sa-en and User:Byrial/countrymerge/sa-en Byrial (talk) 18:17, 24 August 2013 (UTC)
- Thanks, another request for hi-mr and kn-en in both numbersmerge and countrymerge.--Vyom25 (talk) 05:24, 27 August 2013 (UTC)
Re: Duplicates
[edit]You can remove this notice at any time by removing the {{Talkback}} or {{Tb}} template.
--Ricordisamoa 10:33, 5 July 2013 (UTC)
MeSH code table
[edit]Hi Byrial! I was wondering if you could make a table (if you have time) that lists all the diseases on Wikidata. It could list all items that have the Mesh ID set (MeSH descriptor ID (P486)). The table could look like this:
Label | MeSH ID |
---|
Thank you! --Tobias1984 (talk) 09:23, 5 July 2013 (UTC)
- Is a list like User:Byrial/Mesh ID OK? Byrial (talk) 09:53, 5 July 2013 (UTC)
- Look perfect! Thanks again! --Tobias1984 (talk) 10:02, 5 July 2013 (UTC)
Numbermerge
[edit]Hi Byrial, thanks for your Numbermerge, it's a great idea. Can you update it-en and add it-fr, it-es and it-de? --ValterVB (talk) 11:10, 6 July 2013 (UTC)
- Thank you. I will update them when I have new data. I expect the next Wikidata database dump to be ready in 4-5 days. Byrial (talk) 11:24, 6 July 2013 (UTC)
- Great --ValterVB (talk) 11:33, 6 July 2013 (UTC)
- I overlooked that you also asked for new ones. it-fr, it-es and it-de are ready (but contains from the start lots of red links due to old data). I will also update these when new data is ready. Byrial (talk) 12:48, 6 July 2013 (UTC)
- A lot of dirty job :) Thanks. --ValterVB (talk) 13:00, 6 July 2013 (UTC)
- I overlooked that you also asked for new ones. it-fr, it-es and it-de are ready (but contains from the start lots of red links due to old data). I will also update these when new data is ready. Byrial (talk) 12:48, 6 July 2013 (UTC)
- Great --ValterVB (talk) 11:33, 6 July 2013 (UTC)
Would you be so kind (I know you are) to make numbermerge lists for es-en, es-fr and es-de. Thanks!!! Andreasm háblame / just talk to me 01:25, 7 August 2013 (UTC)
- Done es-en, es-fr, es-de. es-en turned out to be very long (548,860 bytes). If that is a problem for you, I can split the page to smaller subpages. It will hopefully be smaller next time if the items on the list are merged. Byrial (talk) 07:08, 7 August 2013 (UTC)
Merging
[edit]Hi Byrial,
I saw your list User:Byrial/numbermerge/ml-en. Many of these lonely ml items can be merged with their respective en item. Do you have any bot program to do this? If not, I am thinking about writing a script for this purpose.
With regards --Vssun (talk) 17:45, 6 July 2013 (UTC)
- Sorry, for jumping in but it would be great because I have been merging them manually from time to time.--Vyom25 (talk) 17:52, 6 July 2013 (UTC)
- There is a gadget called Merge which you can enable in the your preferences. It will add a point "Merge it with ..." to the top menu in the pages of items. Byrial (talk) 18:27, 6 July 2013 (UTC)
I am using that gadget. But question is about merging the entire list at once using a bot script. --Vssun (talk) 02:01, 7 July 2013 (UTC)
- Please see the recent edits of user:VsBot and my request for botflag here --Vssun (talk) 07:02, 7 July 2013 (UTC)
Do you think numbermerge is fit for bots to merge en bloc?! Littledogboy (talk) 12:00, 7 July 2013 (UTC)
- I think that each item on these lists should be checked by a human (preferably understanding the used languages and knowing the customs of the Wikipedias) to see if the items really should be merged. When the merge is approved by a human, I see no problem if it actually is done by a bot (semiautomatic, or automatic using an edited list with only approved cases). The bot operator should be aware that the items may be used as values in statements in other items, and if necessary also update these other items. Byrial (talk) 12:14, 7 July 2013 (UTC)
- The items listed in ml-en list are of same subjects so it is visibly apparent that they should be merge. In this case semi automated or automated work is done only to complete the merge.--Vyom25 (talk) 12:23, 7 July 2013 (UTC)
- In the pages I have seen (cs, sk...), there was over 90% line with items good to merge, which is brilliant – a practical benefit for Wikipedias alredy yielded from Wikidata. But certainly not all of it. Littledogboy (talk) 12:44, 7 July 2013 (UTC)
Weird
[edit]How can Q9724950 and Q13512416 link to same page of ta wiki?--Vyom25 (talk) 13:45, 8 July 2013 (UTC)
- It is a bug. See bugzilla:48260 and bugzilla:42325, and a list of similar cases at User:Byrial/Duplicates (this case is not on the list because Q13512416 was created after the list was made). Sometime when an item is created, the sitelinks given at the creation time is not inserted into Wikidata's internal table of sitelinks, and therefore Wikidata will not discover when it later is also used in another item. I deleted Q9724950. Byrial (talk) 14:08, 8 July 2013 (UTC)
- Oh, okay, went through both the bugzilla links and this is a new thing I learned today. Thanks.--Vyom25 (talk) 14:18, 8 July 2013 (UTC)
You can remove this notice at any time by removing the {{Talkback}} or {{Tb}} template.
Most aliases
[edit]Hi, is it possible to make a list of items where is most aliases with Finnish language? Because when this project was started and bots imported pages, they added Wikipedia-redirects to aliases here, so here is many wrong aliases on items. --Stryn (talk) 18:45, 10 July 2013 (UTC)
- Like this or do you want more items? Byrial (talk) 21:09, 10 July 2013 (UTC)
- This is good. Thanks! --Stryn (talk) 21:17, 10 July 2013 (UTC)
Categories in Malayalam
[edit]can you please update user:Byrial/Items_with_only_Malayalam_category_link --Vssun (talk) 10:05, 11 July 2013 (UTC)
- I will then I have fresh data from the next database dump. Byrial (talk) 10:08, 11 July 2013 (UTC)
I have one number merge suggestion. ml:വർഗ്ഗം:1891-ൽ പുറത്തിറങ്ങിയ ചലച്ചിത്രങ്ങൾ = en:Category:1891 films --Vssun (talk) 10:27, 11 July 2013 (UTC)
- If any item links to a page "വർഗ്ഗം:#-ൽ പുറത്തിറങ്ങിയ ചലച്ചിത്രങ്ങൾ" i mlwiki and a page "Category:# films" where # is the same number in both pages, my program will automatically find the pattern and suggest the merge. Byrial (talk) 10:35, 11 July 2013 (UTC)
Thank you. Please add this too ml:വർഗ്ഗം:1350-കളിൽ മരിച്ചവർ = en:Category:1350s deaths --Vssun (talk) 10:45, 11 July 2013 (UTC)
and ml:വർഗ്ഗം:14-ആം നൂറ്റാണ്ടിൽ ജനിച്ചവർ = en:Category:14th-century births --Vssun (talk) 10:50, 11 July 2013 (UTC)
- I do not manually add pages to the lists. If any item links to pages in mlwiki and enwiki which have the same number in their titles, the used pattern will be found. So do at least one merge to make sure that the pattern can be found. Byrial (talk) 10:54, 11 July 2013 (UTC)
OK. I will merge such samples, so that I can expect them in your future lists. Thank you. --Vssun (talk) 08:06, 12 July 2013 (UTC)
Statistic of top editors
[edit]Hi Byrial
Could you also include a statistic of top editors? If possible filtered by label, description and statements. I'm really wondering how many edits in statements was made by a top contributor (without bots). Kind regards --Nightwish62 (talk) 18:28, 12 July 2013 (UTC)
- Sorry, I do not work with history files. I can only report the current state of items and properties, not who did what and when. Byrial (talk) 18:32, 12 July 2013 (UTC)
New report
[edit]If is possible I think can be useful a list of item with different sitelink and label (search substring label in sitelink) for specific language. Can you try for italian? --ValterVB (talk) 07:32, 14 July 2013 (UTC)
- I did make the list, but didn't upload because it is very long. There is 1036415 links to namespace 0 (articles) in itwiki, and I sorted them in these groups:
- No label: 9690 (listed)
- Same link and label: 898814
- Same link and label, except that the label starts with a lowercase character: 6461
- Different link and label, label is a prefix of the link: 112020
- Different link and label, label is a prefix of the link, except that the label starts with a lowercase character: 1056
- Different link and label, same length (in number of bytes): 2661
- Different link and label, link shorter: 3199
- Different link and label, link longer: 2514
- How much do you want to see? Byrial (talk) 12:18, 14 July 2013 (UTC)
- I have numbered the options:
- 1) Already fixed
- 2) It's OK
- 3) Not useful
- 4) Disambiguation so not useful
- 5) Same as 4
- 6) Have you an example?
- 7) Have you an example?
- 8) Perfect if are excluded case 4
- Thanks a lot. --ValterVB (talk) 13:04, 14 July 2013 (UTC)
- There is 500 examples of each the categories 6, 7 and 8 at User talk:Byrial/link-label-compare/it Byrial (talk) 13:22, 14 July 2013 (UTC)
- 8 is perfect, I can correct the label with my BOT. Can you create the complete list (only q number)? Thanks --ValterVB (talk) 14:01, 14 July 2013 (UTC)
- You never saw the examples of type 6, as I by mistake instead listed the example of type 7 twice.
- But now all items of type 8 is on the list. (I hope I got it right this time, you should check!) Byrial (talk) 14:57, 14 July 2013 (UTC)
- 8 is perfect, I can correct the label with my BOT. Can you create the complete list (only q number)? Thanks --ValterVB (talk) 14:01, 14 July 2013 (UTC)
- There is 500 examples of each the categories 6, 7 and 8 at User talk:Byrial/link-label-compare/it Byrial (talk) 13:22, 14 July 2013 (UTC)
another merge idea
[edit]The number-merge is working great (at least for the fr-en report) and it gave me an idea for a similar criterion for merge candidates. Your report used Morocco national under-17 football team (Q3590713) to merge two items that now form Morocco national under-20 football team (Q3590714) because they only differed on the age-class (under-20 and under-17) but these items also suggest that any items whose French article is "Équipe Country1 de football des moins de 17 ans" should be merged with the item whose English article is "Country1 national under-17 football team". I guess the problem is that this is much more annoying to code because you need to use a dictionary telling you that "du Maroc" is the French translation of "Morocco" in this situation... So maybe not such a great idea. :-) Pichpich (talk) 05:59, 16 July 2013 (UTC)
- Thank you for the good idea. I may not need a dictionary besides Wikidata itself because all country names can (at least in principle) be found by looking at the sitelinks for items with a claim instance of (P31) sovereign state (Q3624078) (there is 212 items with that claim). The program doesn't have to know the meaning of the words, just that the text of the French link to Morocco (Q1028) is a substring of the French link to Morocco national under-20 football team (Q3590714), and same for the English links. It would depend on the grammar of the languages (use of grammatical case etc.) if it will work, but I think it will for many languages. It can be coded. But I am not sure if I can get a reasonable execution time when I have to look for over 200 different substrings in each link text for all links to a Wikipedia, so I will have to think about how to implement it. But again thank you for the idea, it may be possible to do. Byrial (talk) 09:40, 16 July 2013 (UTC)
- Done! It was easier than I thought to modify the numbermerge program. I found an efficient and easy to use free library to do the substring searches. Please check out User:Byrial/countrymerge/fr-en. Byrial (talk) 16:15, 16 July 2013 (UTC)
- Hi Byrial. Can you refresh User:Byrial/countrymerge/fr-en? Also I wouldn't mind looking at User:Byrial/countrymerge/fr-de or User:Byrial/countrymerge/de-en if they are doable. Thanks! Pichpich (talk) 21:28, 17 July 2013 (UTC)
- I made fr-de and de-en. I cannot refresh fr-en before I get new data from the next database dump. It may take 1-2 weeks from now. Byrial (talk) 21:38, 17 July 2013 (UTC)
- Hi Byrial. Can you refresh User:Byrial/countrymerge/fr-en? Also I wouldn't mind looking at User:Byrial/countrymerge/fr-de or User:Byrial/countrymerge/de-en if they are doable. Thanks! Pichpich (talk) 21:28, 17 July 2013 (UTC)
- Done! It was easier than I thought to modify the numbermerge program. I found an efficient and easy to use free library to do the substring searches. Please check out User:Byrial/countrymerge/fr-en. Byrial (talk) 16:15, 16 July 2013 (UTC)
Thanks
[edit]Hey Byrial, just wanted to say thanks for the new reports, you've excelled yourself again! I'm finding them really useful. Del♉sion23 (talk) 18:18, 17 July 2013 (UTC)
- You are welcome. Programming is fun, and when people like the result it is even funner. :) Byrial (talk) 21:21, 17 July 2013 (UTC)
- I was just thinking that what worked with the country names in English and French may also work with the names of bands and musicians. They often have many categories named after them, such as "Category:Foo albums" and "Category:Foo songs". Just an idea. Del♉sion23 (talk) 22:54, 17 July 2013 (UTC)
- Thank you very much for the idea. I have thought about states in federations like USA and Germany, and provinces in other some countries like Australia, Canada and Spain. And I have thought about the 12 month names. But I did not think about bands and musicians. A way to find them would be items with discography (P358), but maybe there are too many (over 3000) – I will have to find out if it works. Byrial (talk) 07:46, 18 July 2013 (UTC)
- Please take a look at User:Byrial/bandmerge/fr-en. I am not satisfied with the result as many bands have names which are common words, giving a lot of false results. But there are some good results among the bad ones, and I am considering how to increase the ratio between good and bad results. Any input would be welcome. Byrial (talk) 13:43, 19 July 2013 (UTC)
- Looks good, it has found quite a few merges that need performing. However, as you say, it has brought up quite a few false positives. Could it perhaps be narrowed down by concentrating on specific patterns to do with music, such as (album) or Categroy:"Band" songs? Thanks again for the great work on these lists. I believe they should have their own dedicated Wikidata page :D. Del♉sion23 (talk) 16:50, 19 July 2013 (UTC)
- Also, from my own work on merging music related items, Italian categories were common mistakes, so perhaps an en-it version may throw up more matches. Del♉sion23 (talk) 16:55, 19 July 2013 (UTC)
- I made User:Byrial/bandmerge/it-en with the same code, and is still not happy with the result. My plans right now is trying to detect patterns with two variables: The artist and the title of an album/single/video/song/whatever, like:
- "en:$title ($artist album)" vs. "fr:$title (album de $artist)"
- "en:$title ($artist song)" vs. "fr:$title (chanson de $artist)"
- The problem is that I am not sure how to do that, but I suppose that I will get some inspiration sooner or later. Byrial (talk) 17:25, 19 July 2013 (UTC)
- I made User:Byrial/bandmerge/it-en with the same code, and is still not happy with the result. My plans right now is trying to detect patterns with two variables: The artist and the title of an album/single/video/song/whatever, like:
- Please take a look at User:Byrial/bandmerge/fr-en. I am not satisfied with the result as many bands have names which are common words, giving a lot of false results. But there are some good results among the bad ones, and I am considering how to increase the ratio between good and bad results. Any input would be welcome. Byrial (talk) 13:43, 19 July 2013 (UTC)
- Thank you very much for the idea. I have thought about states in federations like USA and Germany, and provinces in other some countries like Australia, Canada and Spain. And I have thought about the 12 month names. But I did not think about bands and musicians. A way to find them would be items with discography (P358), but maybe there are too many (over 3000) – I will have to find out if it works. Byrial (talk) 07:46, 18 July 2013 (UTC)
- I was just thinking that what worked with the country names in English and French may also work with the names of bands and musicians. They often have many categories named after them, such as "Category:Foo albums" and "Category:Foo songs". Just an idea. Del♉sion23 (talk) 22:54, 17 July 2013 (UTC)
countrymerge
[edit]Please can you add it-en it-fr it-es it-de thanks Rippitippi (talk) 19:51, 18 July 2013 (UTC)
splitting football players
[edit]Hi Byrial. I'm splitting the item association football player (Q937857) into more specific items for American/Canadian football and Australian rules football. This means that many statements need to be made more precise so could you compute the following lists?
- Every item that links to Q937857 and has an English description that contains one of the following words "American", "Canadian", "NFL", "CFL".
- Every item that links to Q937857 and has an English description that contains the word "rules"
Thanks! Pichpich (talk) 19:52, 20 July 2013 (UTC)
- What a task! There is 137070 claims with association football player (Q937857) as value. You cannot use the word "American" in the description to decide the kind of football. Many of the descriptions says "American soccer player" while other say something like "American footballer" with many variations. It is the same with "Canadian" - it most often describes the nationality of the player, not the kind the of football.
- Counts:
- "American" is used in 405 descriptions.
- "Canadian" is used in 57 descriptions.
- "NFL" is used in 9 descriptions.
- "CFL" is used in 0 (zero) descriptions.
- "rules" is used in 35 descriptions. All in the combination "Australian rules football(er)"
- This will account for about 500 out of 137070, and most of them cannot even be used directly.
- I think a better approach would be to look at the values of member of sports team (P54) for each player, and then league or competition (P118) for each team. There is 889 different ligas in Wikidata. If all of them had a statement for the type of sports, then this could be used to sort most of the players. So I think you should start with that.
- Another way to go is to go to a big Wikipedia and look for membership in categories for all players. Byrial (talk) 21:41, 20 July 2013 (UTC)
- There are very few notable Canadian or American soccer players so I would assume that the vast majority of these would need to be changed. Maybe if you can tweak it slightly to have an occurrence of "American" but no occurrence of "soccer" (or "Canadian" but not "soccer") I think the error ratio will be very small. In the long term, using sports teams and Wikipedia categories is the solution but for now I'd just like to correct the obvious ones. Pichpich (talk) 21:58, 20 July 2013 (UTC)
- I still don't understand why you will work with a few hundreds out of 137000 footballers. You could get many more by looking at the ligas of their teams. But I made the requested lists. They are at User:Byrial/Footballer. Byrial (talk) 07:28, 21 July 2013 (UTC)
- If you can do it for footballers then can you do it for Cricketers? Is it possible?--Vyom25 (talk) 06:52, 22 July 2013 (UTC)
- Sorry, wrong thread, I was talking about merge list.--Vyom25 (talk) 06:55, 22 July 2013 (UTC)
- I still don't understand why you will work with a few hundreds out of 137000 footballers. You could get many more by looking at the ligas of their teams. But I made the requested lists. They are at User:Byrial/Footballer. Byrial (talk) 07:28, 21 July 2013 (UTC)
- There are very few notable Canadian or American soccer players so I would assume that the vast majority of these would need to be changed. Maybe if you can tweak it slightly to have an occurrence of "American" but no occurrence of "soccer" (or "Canadian" but not "soccer") I think the error ratio will be very small. In the long term, using sports teams and Wikipedia categories is the solution but for now I'd just like to correct the obvious ones. Pichpich (talk) 21:58, 20 July 2013 (UTC)
Disambiguation page duplicates
[edit]Hi, is it possible to make a page like User:Byrial/Merge_candidates#Same_links_in_different_disambiguation_items, but only for items where is fi-wiki link. I don't know does this help, but I've made a list of all disambiguation pages of fi-wiki (though not all of them are yet on Wikidata). --Stryn (talk) 12:36, 21 July 2013 (UTC)
- I have been busy programming the two substring merge lists today, but I will this tomorrow. It was also been on my personal to-do list for a while. Byrial (talk) 19:24, 21 July 2013 (UTC)
- Great to hear. Thanks a lot. --Stryn (talk) 19:47, 21 July 2013 (UTC)
- Which format would you prefer? A list of all links like it is now at User:Byrial/Merge candidates#Same links in different disambiguation items or just the items with maybe a link count or something else? Byrial (talk) 10:09, 22 July 2013 (UTC)
- Just something simple, like: Abigail (Q1605677 ← → Q2769157). First link always where is the fi-wiki link. --Stryn (talk) 10:20, 22 July 2013 (UTC)
- Is User:Byrial/disambigmerge/fi simple enough? I like to have some information about how simple/easy the case is, so I added link count and use in statements count, and warned about language conflicts and different links to some languages. If you don't like it, I can change it. Byrial (talk) 14:16, 22 July 2013 (UTC)
- It looks perfect, thanks again! --Stryn (talk) 14:26, 22 July 2013 (UTC)
- Is User:Byrial/disambigmerge/fi simple enough? I like to have some information about how simple/easy the case is, so I added link count and use in statements count, and warned about language conflicts and different links to some languages. If you don't like it, I can change it. Byrial (talk) 14:16, 22 July 2013 (UTC)
- Just something simple, like: Abigail (Q1605677 ← → Q2769157). First link always where is the fi-wiki link. --Stryn (talk) 10:20, 22 July 2013 (UTC)
- Which format would you prefer? A list of all links like it is now at User:Byrial/Merge candidates#Same links in different disambiguation items or just the items with maybe a link count or something else? Byrial (talk) 10:09, 22 July 2013 (UTC)
- Great to hear. Thanks a lot. --Stryn (talk) 19:47, 21 July 2013 (UTC)
Duplicates through shared ID in other databases
[edit]Hi Byrial. Normally, each film/actor/director has its own IMDb identifier. Have you tried finding merge candidates on the basis of a shared value in IMDb ID (P345)? Obviously, the same idea should work in many similar properties such as all authority control properties or ATP player ID (P536). Pichpich (talk) 20:50, 24 July 2013 (UTC)
- Thank you for the suggestion. I have considered doing it, but not had the time to do it yet. Right now I am busy with changes to support Wikivoyage links in my reports, but after that I may return to this. Byrial (talk) 10:55, 25 July 2013 (UTC)
- Isn't that what's already done in Wikidata:Database reports/Constraint violations/P345#"Unique value" violations & Co? --YMS (talk) 12:20, 25 July 2013 (UTC)
- Yes, but it is not easy to see from these list if the items should be merged, or if the problem is something else. I can test if the items have links to the same project or not. But of course if someone would remove all constraint violations, then the merge cases would also be fixed. Byrial (talk) 12:33, 25 July 2013 (UTC)
- I made User:Byrial/unique-property-merge while waiting for the next database dump. It is like constraint violation list, but have only items with links to distinct projects. Byrial (talk) 17:27, 25 July 2013 (UTC)
- Yes, but it is not easy to see from these list if the items should be merged, or if the problem is something else. I can test if the items have links to the same project or not. But of course if someone would remove all constraint violations, then the merge cases would also be fixed. Byrial (talk) 12:33, 25 July 2013 (UTC)
- Isn't that what's already done in Wikidata:Database reports/Constraint violations/P345#"Unique value" violations & Co? --YMS (talk) 12:20, 25 July 2013 (UTC)
Mega batch of deletion
[edit]With the next dump can you recheck for items with no links, no statements, and no use in statements? Thanks. --ValterVB (talk) 16:25, 27 July 2013 (UTC)
- I will do that. The next dump is started today, but the files that I use are not yet ready. It may be Monday. Byrial (talk) 16:28, 27 July 2013 (UTC)
You can remove this notice at any time by removing the {{Talkback}} or {{Tb}} template.
ActiveStats
[edit]All should be fixed now. (Bet you heard that one already. :p) The page is meant to be transcluded into the template space, and documentation attached to it. There it should be linked to or transcluded in other places.Cyberpower678 (talk) 07:15, 29 July 2013 (UTC)
You can remove this notice at any time by removing the {{Talkback}} or {{Tb}} template.
You can remove this notice at any time by removing the {{Talkback}} or {{Tb}} template.
Precision
[edit]this looks better? :) http://www.wikidata.org/w/api.php?action=wbgetclaims&format=xml&entity=q1003873&property=P625 ·addshore· talk to me! 17:52, 31 July 2013 (UTC)
- I have responded again, I would really love to try and get the precision fixed before I sleep :) Otherwise I will be thinking about it all night. ·addshore· talk to me! 19:38, 31 July 2013 (UTC)
Size of merge lists
[edit]- Discussion moved from en:User talk:Byrial:
Hi Byrial, your excellent merge lists are so successful that many of the pages they are stored on are so big that my computer struggles with them, which slows down work that can be done on them. Would it be possible to break them down into smaller pages of about 100,000 to 150,000? Right now if I right click an item, it takes several seconds to respond. Del♉sion23 (talk) 10:05, 1 August 2013 (UTC)
- Hi. I know that some of the countrymerge lists are very long, but I was hoping that it is a temporary problem and that the next versions will be much shorter without me doing anything special because of the merges that is happening. Already now when I look at the pages, I see many red item links where merges is done. Do you think that I should split the long pages now instead of wait for the next database dump? Byrial (talk) 13:23, 1 August 2013 (UTC)
- If there are red links appearing then maybe others have a faster computer than me :D To be honest, I'm working on another part of Wikidata merging right now anyway (the thousands of items than WYImporterBot brought over from Wikivoyage) and so won't have the time to do many other mergers anyway. As you say, hopefully most of these backlogs are only big because Wikidata is new and they will be smaller over time. Cheers, Del♉sion23 (talk) 08:22, 3 August 2013 (UTC)
- OK, I will wait to see if the next versions of the long lists will be shorter due to the ongoing merging. Byrial (talk) 08:46, 3 August 2013 (UTC)
- It's true that the problem is somewhat transitory but would it be a lot of work to write a little script to remove the entries that have turned red? (maybe it is...) @Byrial: two more unrelated remarks on those excellent merge lists.
- Yes, there is a big mess concerning categories for people because some wikis separate male/female categories and some don't. It's made a lot worse by the fact that en.wiki changed its practices quite significantly over the last year (most notably by splitting Category:Actors) whereas the interwikis were set before the split occurred. And if that weren't complicated enough, some languages (such as French) tend to use the masculine grammatical gender to refer to a set that contains both male and female individuals. This problem involves thousands of categories across multiple languages and will be absolutely impossible to solve by hand piecemeal! It's the ultimate Wikidata:Interwiki conflicts issue and a lot of discussion there needs to take place... So what does that mean for your lists? I don't know. It's really hard for you to distinguish these cases so I'm tempted to suggest that you ignore all this. On the other hand, the very fact that these show up in the list encourages editors to make a lot of merges that will probably have to be undone in the future.
- Do you plan to update or expand the lists of the "two substring" type? I hope so: they were shorter but they had an excellent success rate and picked up items that were mostly linked to Wikipedia articles (high priority imo) rather than Wikipedia categories (low priority imo). If you update them, note that I removed all merged items of User:Byrial/two substring merge/albums so what you still see in that list should give you a good idea of the parts of the regexs that don't work as well as hoped. Cheers, Pichpich (talk) 03:37, 5 August 2013 (UTC)
- It would be a lot of work to remove red links, because the lists are generated offline from a local database which is made from the latest Wikidata database dump. You would have to query the server about deletions to know which items are deleted since the last database dump. I will consider doing it, but will not promise anything.
- Ad remark 1) I cannot see how I can do anything about male/female categories in the lists.
- Ad remark 2) New "two substring merge" lists at this time would mostly have the wrong matches because all the successful matches have already been merged. So I need to find some improvements to find more good cases before new lists will be useful. Byrial (talk) 09:30, 5 August 2013 (UTC)
- It's true that the problem is somewhat transitory but would it be a lot of work to write a little script to remove the entries that have turned red? (maybe it is...) @Byrial: two more unrelated remarks on those excellent merge lists.
- OK, I will wait to see if the next versions of the long lists will be shorter due to the ongoing merging. Byrial (talk) 08:46, 3 August 2013 (UTC)
- If there are red links appearing then maybe others have a faster computer than me :D To be honest, I'm working on another part of Wikidata merging right now anyway (the thousands of items than WYImporterBot brought over from Wikivoyage) and so won't have the time to do many other mergers anyway. As you say, hopefully most of these backlogs are only big because Wikidata is new and they will be smaller over time. Cheers, Del♉sion23 (talk) 08:22, 3 August 2013 (UTC)
wikivoyage
[edit]Hi Bryal can be your script userd for search wikivoyage item which can be merged with wikipedia items, Thanks Rippitippi (talk) 12:05, 2 August 2013 (UTC)
- Yes (although some of the programs will need adjustments before being used for Wikivoyage). Which kind of mergelist do you have in mind? Byrial (talk) 08:44, 3 August 2013 (UTC)
- example Q14207824 and Q52437 are duplicated Rippitippi (talk) 16:11, 3 August 2013 (UTC)
another 'countrymerge' please,
[edit]Could you do a 'countrymerge' for ga-en please?
I used to be fluent in Irish (ga) many years ago and i would like to practice it some more. Sorting matches would be just the thing. Filceolaire (talk) 00:52, 3 August 2013 (UTC)
- Countrymerge for ru-en pair would be appreciated too. --4th-otaku (talk) 03:12, 3 August 2013 (UTC)
- Done User:Byrial/countrymerge/ga-en and User:Byrial/countrymerge/ru-en. The Russian list is long and have many conflicts. I suspect that the program that finds adjectives/alternative formes for country names do not do well for Russian. I will try to look into it, but it may be hard because I do not understand Russian. Byrial (talk) 07:13, 3 August 2013 (UTC)
- I did some adjustments to the program to improve how it recognizes country names in more than one word like for example New Zealand, and remade ru-en. Byrial (talk) 08:41, 3 August 2013 (UTC)
- Thanks. Thats great. Filceolaire (talk) 15:25, 3 August 2013 (UTC)
- Done. The only false positives were for ga pages for countries (lots of links per page) matched with en pages for languages (1 link per page). Filceolaire (talk) 13:03, 5 August 2013 (UTC)
- Thanks. Thats great. Filceolaire (talk) 15:25, 3 August 2013 (UTC)
- I did some adjustments to the program to improve how it recognizes country names in more than one word like for example New Zealand, and remade ru-en. Byrial (talk) 08:41, 3 August 2013 (UTC)
- Done User:Byrial/countrymerge/ga-en and User:Byrial/countrymerge/ru-en. The Russian list is long and have many conflicts. I suspect that the program that finds adjectives/alternative formes for country names do not do well for Russian. I will try to look into it, but it may be hard because I do not understand Russian. Byrial (talk) 07:13, 3 August 2013 (UTC)
True duplicates
[edit]You can remove this notice at any time by removing the {{Talkback}} or {{Tb}} template.
PS: thanks for the list! --Ricordisamoa 10:37, 4 August 2013 (UTC)
Query wikidata database
[edit]Hi Byrial,
Could i ask you some help?
I would extract from wikidata the information below. I would obtain a list of people who were born in a country and who later have moved (emigrated) in another country.
Information is coded in wikidata using P551 (residence) in this way:
Item: Person XYZ
Property: residence Value: country A (say, this is the place the person was born and lived for the first years) qualifier: start date Value: XX.YY.ZZZZ qualifier: end date Value: XX.YY.ZZZZ
Property: residence Value: country B (say, he makes here a longer stay for several years, e.g. for education) qualifier: start date Value: XX.YY.ZZZZ qualifier: end date Value: XX.YY.ZZZZ
Property: residence Value: country A (say, he came back to the country he has born) qualifier: start date Value: XX.YY.ZZZZ qualifier: end date Value: XX.YY.ZZZZ
Property: residence Value: country B (say, this is the country he has emigrated to) qualifier: start date Value: XX.YY.ZZZZ qualifier: end date Value: XX.YY.ZZZZ qualifier: as value: emigrant
I done this example on Q5563216:
P551 (residence) (1) Firenze
P580 (data di inizio / from ) febbraio 9 1893 P582 (data di fine / to ) gennaio 24 1908
(2) Unites States
P580 (data di inizio / from ) gennaio 24 1908 P582 (data di fine / to ) dicembre 23 1982
So i need (for example) to obtain a list of people born in Italy and then emigrated (moved) to United States including the date and ordered by the date.
The result should be for example:
1) Gino Corrado , gennaio 24 1908 2) Rodolfo Valentiano, 1911 3) etc etc etc ....
you can do a thing like this in Wikidata? Exists a tool in wikidata to perform a query like this?
thanks LucaBiondi (talk) 14:30, 6 August 2013 (UTC)
- Hi. Last time my property statistics was updated residence (P551) was only used for 78 items, so I doubt that there will be any persons on the list, but I try to make it. Right now I am preparing to read a new database dump and remake my regular lists, so it will have to wait to after that is done. Regards, Byrial (talk) 16:06, 6 August 2013 (UTC)
- Thanks Bryial! I will wait your list!
- and for the future I would try to see if I can fill P551 starting from the category "Immigrants to the United States"
- Just a technical question? How do you extract your statistic? do you perform a query directly on the database dump?
- Do you know how to query wikidata online? i mean something like "http://www.wikidata.org/wiki/Wikidata:Tools#WikiData_query"
- p.s. i have try this: http://toolserver.org/~magnus/ts2/php/wd_query.php?q=[%22P551%22,%22Q215627%22] you obtain a list of people that have P551
- No, I do not directly query the database dump. I use it to build a local database at my own computer, which I then query. My programs are available (see link at my user page), but may be hard to use if you are not programmer yourself. (You have to be able to compile and build the programs from the sources without instructions). The Wikidata server cannot (yet) be queried like the "WikiData query" tool can which is why the tool is useful. Byrial (talk) 21:45, 6 August 2013 (UTC)
- Now you can see a list with all items with more than one claim with P551 at User:Byrial/Residence. Byrial (talk) 14:19, 8 August 2013 (UTC)
- Hi Byrial, Thank you for the list! great job! I found 1 item i expected!
- Could you also do this, if you can and if you have time?
- all item with one o more claim with P551 where the country of the value of P551 is not equal to the contry of value of P19 (place of birth)
- What i would obtain is all peole emigrated from a country to another
- Example:
- Q5041458 born in Pisa (country: Italy)
- exists a P511 equal to "New York" (country: United States)
- so i will add Q5041458 to my list because he is emigrated from italy to Unites States
- you think you can do?
- thank you !!!! LucaBiondi (talk) 13:14, 9 August 2013 (UTC)
- I can do that, but it will be no proof of emigration, because the person may have been born while the mother has traveling or otherwise temporary away from home. Byrial (talk) 13:19, 9 August 2013 (UTC)
- I had proposed a new property "emigrated to" but this property was rejected.So i'm inserting in wikidata P551 about people who emigrated.
- For the future when the "AS" qualifiers will be avaiable will help you identify emigrants
- Property: residence
- Value: country B (say, this is the country he has emigrated to)
- qualifier: start date
- Value: XX.YY.ZZZZ
- qualifier: end date
- Value: XX.YY.ZZZZ
- qualifier: as
- value: emigrant
- ...but if you have a better idea to identify who is emigrated i'm very interested ... :-) LucaBiondi (talk) 14:28, 9 August 2013 (UTC)
Countrymerge
[edit]And you can make same for the East Slavic languages (be, be-x-old, ru, uk)? --Чаховіч Уладзіслаў (talk) 18:45, 7 August 2013 (UTC)
- Yes, I can make it for all languages with a Wikipedia. But it will depend on the grammar of languages how well it works. I made for all combinations: be-be-x-old, be-ru, be-uk, be-x-old-ru, be-x-old-uk, ru-uk. Byrial (talk) 10:59, 8 August 2013 (UTC)
Simple links only
[edit]Hi Byrial, would you be able to make a list like this for simple wiki? I would have thought any of them would be relatively easy to link to English wiki articles. Thanks for your help. Del♉sion23 (talk) 19:28, 8 August 2013 (UTC)
- Done at User:Byrial/Items with link only to simplewiki. Byrial (talk) 07:09, 9 August 2013 (UTC)
- See also User:Byrial/countrymerge/simple-en. Byrial (talk) 07:23, 9 August 2013 (UTC)
- Thanks very much again, Byrial. Great work! :) Del♉sion23 (talk) 10:23, 11 August 2013 (UTC)
- See also User:Byrial/countrymerge/simple-en. Byrial (talk) 07:23, 9 August 2013 (UTC)
Swedish municipality code (P525) är inte nödvändigtvis en sammanslagningskandidat bara för att två svenska kommuner har samma kod. Det kan dock vara bra att ha koll på vilka som använder samma kod. Normalt är det två olika kommuner vid olika tidpunkter. Tex en kommun -1970 och en annan kommun 1971-. -- Lavallentalk(block) 12:45, 10 August 2013 (UTC)
- Hej! Hvis to Wikidata-objekter med samme værdi for Swedish municipality code (P525) länkar til olika sidor på samme Wikipedia (for eksempel svensk Wikipedia), da vil de ikke komme på User:Byrial/unique-property-merge. Der listes kun objekter som länkar til olika Wikipediaer eller Wikivoyager. Byrial (talk) 06:00, 11 August 2013 (UTC)
A place to list exceptions
[edit]Hi Byrial. Thank you for the very useful merge lists. I think you should expect a problem in the not-so-far future with these lists: as time goes by, most of the items that should be merged will be merged but the false positives will stay there forever. For instance User:Byrial/numbermerge/fr-en suggests merging
- Q3284894 (fr:Maman (film, 1990), 1 link), Q29816 (en:Mother (1990 film), 4 links).
It makes sense so it's not a mistake to pick that up but ultimately it's wrong. The problem is that I have no way of telling you to keep this particular suggestion out of your future reports and even if I delete that line, it will reappear as soon as the next database dump arrives. This means that if 50 editors work on that report in the next year, 50 editors will have to redo the work of checking that suggestion. Right now, it's not that big a deal because there are only few people working on these lists and the lists are gigantic anyways but it could become more problematic. Best, Mergehappy (talk) 04:28, 11 August 2013 (UTC)
- Hi! Thank you for bringing this up. You are absolutely right. I have briefly considered this before, but not until now worked on it. I will think about how to best implement an exception list, and hopefully have something before too long. Byrial (talk) 06:08, 11 August 2013 (UTC)
- Maybe you can create a page for exclusion like User:Byrial/numbermerge/fr-en/exclusion. When someone find two items that mustn't be merged add them in exclusion page, so you can check it:
--ValterVB (talk) 06:58, 11 August 2013 (UTC)
- Often the same false positives will appear on more than one merge list, so I will rather make global exclusion list that applies to all of the merge lists at the same time. Byrial (talk) 07:06, 11 August 2013 (UTC)
- Please place false positives from the merge lists at User:Byrial/merge exclusion. I will exclude items listed on the page in future updates of the merge lists. Byrial (talk) 09:40, 14 August 2013 (UTC)
- I've added a few. Note that I also listed single items which your number-merge lists suggest splitting but are in fact ok (for example: 1944 ≠ 1943: Q1217369 links to fr:Jane Eyre (film, 1944) and en:Jane Eyre (1943 film)). Mergehappy (talk) 16:31, 14 August 2013 (UTC)
- Please place false positives from the merge lists at User:Byrial/merge exclusion. I will exclude items listed on the page in future updates of the merge lists. Byrial (talk) 09:40, 14 August 2013 (UTC)
- Often the same false positives will appear on more than one merge list, so I will rather make global exclusion list that applies to all of the merge lists at the same time. Byrial (talk) 07:06, 11 August 2013 (UTC)
Statistics
[edit]Hello,
Thank you for your statistics pages. Could you please make one about items by number of labels? I.e. how much items have no label, how much have a label just in one language, how much have a label in two languages, etc. I would also be interested in the evolution of these figures throughout time. Ljubinka (discussion) 07:12, 12 August 2013 (UTC)
Could you please tell me how much is the total number of items in the last two database dumps? Ljubinka (discuter) 19:27, 29 August 2013 (UTC)
- Hi, for the first question (which I had forgotten) I made a quick query in my local database. You can see the result below. You can always see the total number of items (13,685,369) in the first section of the page User:Byrial/Statement statistics. For older numbers look for older versions of the page in the history of the page. Regards, Byrial (talk) 20:49, 29 August 2013 (UTC)
mysql> select count(*), (select count(*) from label where l_id = i_id) as labels from item group by labels; +----------+--------+ | count(*) | labels | +----------+--------+ | 243282 | 0 | | 8915673 | 1 | | 1504683 | 2 | | 760995 | 3 | | 427378 | 4 | | 310774 | 5 | | 330689 | 6 | | 246435 | 7 | | 144843 | 8 | | 152759 | 9 | | 104023 | 10 | | 63092 | 11 | | 50704 | 12 | | 43479 | 13 | | 33597 | 14 | | 28137 | 15 | | 26633 | 16 | | 22655 | 17 | | 19848 | 18 | | 17659 | 19 | | 15162 | 20 | | 14646 | 21 | | 14357 | 22 | | 13029 | 23 | | 12942 | 24 | | 12433 | 25 | | 16409 | 26 | | 12183 | 27 | | 9669 | 28 | | 8446 | 29 | | 7524 | 30 | | 6721 | 31 | | 6363 | 32 | | 6332 | 33 | | 5466 | 34 | | 5078 | 35 | | 4332 | 36 | | 5030 | 37 | | 3887 | 38 | | 3198 | 39 | | 2818 | 40 | | 2654 | 41 | | 3265 | 42 | | 2380 | 43 | | 2077 | 44 | | 1808 | 45 | | 1731 | 46 | | 1617 | 47 | | 1526 | 48 | | 1388 | 49 | | 1475 | 50 | | 1388 | 51 | | 1336 | 52 | | 1325 | 53 | | 1255 | 54 | | 1155 | 55 | | 1103 | 56 | | 1023 | 57 | | 974 | 58 | | 931 | 59 | | 844 | 60 | | 803 | 61 | | 817 | 62 | | 794 | 63 | | 747 | 64 | | 743 | 65 | | 758 | 66 | | 671 | 67 | | 666 | 68 | | 589 | 69 | | 575 | 70 | | 535 | 71 | | 505 | 72 | | 485 | 73 | | 443 | 74 | | 426 | 75 | | 359 | 76 | | 335 | 77 | | 271 | 78 | | 256 | 79 | | 264 | 80 | | 225 | 81 | | 211 | 82 | | 211 | 83 | | 176 | 84 | | 171 | 85 | | 149 | 86 | | 162 | 87 | | 165 | 88 | | 146 | 89 | | 153 | 90 | | 134 | 91 | | 1168 | 92 | | 604 | 93 | | 196 | 94 | | 152 | 95 | | 141 | 96 | | 110 | 97 | | 124 | 98 | | 109 | 99 | | 112 | 100 | | 86 | 101 | | 119 | 102 | | 83 | 103 | | 85 | 104 | | 104 | 105 | | 142 | 106 | | 177 | 107 | | 237 | 108 | | 203 | 109 | | 225 | 110 | | 264 | 111 | | 173 | 112 | | 177 | 113 | | 158 | 114 | | 141 | 115 | | 128 | 116 | | 111 | 117 | | 107 | 118 | | 75 | 119 | | 80 | 120 | | 71 | 121 | | 65 | 122 | | 65 | 123 | | 69 | 124 | | 59 | 125 | | 57 | 126 | | 33 | 127 | | 35 | 128 | | 42 | 129 | | 41 | 130 | | 43 | 131 | | 37 | 132 | | 25 | 133 | | 22 | 134 | | 27 | 135 | | 25 | 136 | | 22 | 137 | | 22 | 138 | | 18 | 139 | | 22 | 140 | | 19 | 141 | | 22 | 142 | | 24 | 143 | | 23 | 144 | | 33 | 145 | | 25 | 146 | | 43 | 147 | | 56 | 148 | | 65 | 149 | | 64 | 150 | | 63 | 151 | | 62 | 152 | | 50 | 153 | | 39 | 154 | | 34 | 155 | | 32 | 156 | | 35 | 157 | | 28 | 158 | | 14 | 159 | | 16 | 160 | | 17 | 161 | | 26 | 162 | | 20 | 163 | | 18 | 164 | | 14 | 165 | | 8 | 166 | | 22 | 167 | | 21 | 168 | | 16 | 169 | | 17 | 170 | | 16 | 171 | | 16 | 172 | | 10 | 173 | | 19 | 174 | | 15 | 175 | | 13 | 176 | | 9 | 177 | | 13 | 178 | | 6 | 179 | | 7 | 180 | | 11 | 181 | | 11 | 182 | | 14 | 183 | | 7 | 184 | | 13 | 185 | | 2 | 186 | | 6 | 187 | | 4 | 188 | | 13 | 189 | | 11 | 190 | | 7 | 191 | | 10 | 192 | | 12 | 193 | | 6 | 194 | | 5 | 195 | | 9 | 196 | | 5 | 197 | | 3 | 198 | | 6 | 199 | | 6 | 200 | | 10 | 201 | | 9 | 202 | | 6 | 203 | | 4 | 204 | | 5 | 205 | | 4 | 206 | | 4 | 207 | | 4 | 208 | | 6 | 209 | | 5 | 210 | | 2 | 211 | | 2 | 212 | | 3 | 213 | | 5 | 214 | | 9 | 215 | | 2 | 216 | | 7 | 217 | | 2 | 218 | | 8 | 219 | | 5 | 220 | | 4 | 222 | | 1 | 223 | | 7 | 224 | | 4 | 225 | | 5 | 226 | | 3 | 227 | | 2 | 228 | | 1 | 229 | | 2 | 230 | | 4 | 231 | | 4 | 232 | | 2 | 233 | | 4 | 234 | | 2 | 235 | | 2 | 236 | | 2 | 237 | | 2 | 239 | | 2 | 240 | | 2 | 241 | | 2 | 242 | | 3 | 243 | | 3 | 244 | | 1 | 245 | | 1 | 246 | | 3 | 247 | | 2 | 248 | | 2 | 249 | | 1 | 250 | | 1 | 254 | | 2 | 255 | | 2 | 256 | | 2 | 258 | | 2 | 259 | | 2 | 261 | | 2 | 263 | | 1 | 265 | | 2 | 267 | | 1 | 269 | | 1 | 270 | | 1 | 271 | | 1 | 276 | | 1 | 277 | | 1 | 280 | | 1 | 281 | | 2 | 283 | | 1 | 284 | | 1 | 285 | | 1 | 294 | | 1 | 298 | +----------+--------+ 270 rows in set (1 min 36.62 sec) mysql>
- Thanks a lot. Ljubinka (discuter) 21:38, 29 August 2013 (UTC)
The property significant event (P793) is available now. I saw that you participated in the discussion. --Tobias1984 (talk) 17:06, 13 August 2013 (UTC)
Wikivoyage orphans
[edit]Hi Byrial. After the success of the simple wiki orphan page, Addshore and I were wondering if the same could be done for en.wikivoyage. This would help in reducing the number of Wikivoyage orphans we currently have. Also, although it may sound unusal, I was wondering if you could run a numbermerge for bs–vi and ro-vi as I believe there may be a few duplicates for "Category:YEAR BC deaths/births". Thanks again for your merge pages, they really help! Del♉sion23 (talk) 19:10, 15 August 2013 (UTC)
- Looks like you were just ahead of me on the first but in creating User:Byrial/projectmerge!. Del♉sion23 (talk) 23:18, 15 August 2013 (UTC)
- 1) Projectmerge is not exactly what you asked for. There is a big overlap, but not all orphans are on it, and it also have non-orphans. But I guess it is in order to handle projectmerge first before making a new one because of the overlap.
- 2) I made User:Byrial/numbermerge/bs-vi and User:Byrial/numbermerge/bs-vi. I don't speak Bosnian, Romanian or Vietnamese, but can guess a little from Romanian as it is a Romance language. It looks like there is categories for death years in common between these 3 languages, but not connected to the big Western languages like en, de or fr. Byrial (talk) 05:36, 16 August 2013 (UTC)
- As you say, it would be best to concentrate on projectmerge before moving on to the Wikivoyage orphans. I'll be copying sections of the page into my own user space so that it is easier to handle. Thanks for making the unusual language mixes :D It seems that I was right in thinking that there would be quite a few death and birth categories needing merges. Del♉sion23 (talk) 17:44, 16 August 2013 (UTC)
commonsmerge
[edit]Please, can you make cs-de commonsmerge file from next dump? JAn Dudík (talk) 20:39, 15 August 2013 (UTC)
- I forgot to say that it is done at placed at User:Byrial/commonsmerge/cs-de. Byrial (talk) 16:13, 22 August 2013 (UTC)
All those subpages you make
[edit]Maybe it would be a good idea to a) move them to WD:Database reports subpage, b) and sort them out slightly better. For example, all of the merge ones could go to WD:Database reports/Merge/*name* -- example: User:Byrial/numbermerge -> WD:Database reports/Merge/Number. One language links might go to WD:Database reports/One language link; e.g. User:Byrial/Items with only Gujarati link -> WD:Database reports/One language link/Gujarati. Just some thoughts. --Izno (talk) 14:50, 18 August 2013 (UTC)
- You are right. The number of the reports have just grown from a few to now well other 100. I am right now reading a new database dump dated 2013-08-17 into my local database, and it will be ready to make updated reports from in a few hours. I will not delay the updates by starting a move of the reports now. But after this update and before the next one, I will take care of moving and reorganising the reports. Byrial (talk) 16:05, 18 August 2013 (UTC)
- Cool. :D --Izno (talk) 21:14, 18 August 2013 (UTC)
Same linked article
[edit]I don't know if is feseable, but can you create al list of page with same Linked article? Example:
- Create a list with it link but without en link
- Create a list with en link but without it link
- Report all the page with the same link.
--ValterVB (talk) 14:43, 22 August 2013 (UTC)
- Sorry, I do not understand what you mean by list of page with same linked article. Can you please try to explain with more words and maybe give an example? Regards, Byrial (talk) 15:56, 22 August 2013 (UTC)
- Sorry maybe "Linked article" isn't clear but is used in page for "interlink". Example (not real): If on en.wiki there is a page called "Nicklas Utgren" but without interlink to it.wiki and on it.wiki there is a page called "Nicklas Utgren" probably is a candidate to be merge. --ValterVB (talk) 16:14, 22 August 2013 (UTC)
- I will look at it. Byrial (talk) 19:28, 23 August 2013 (UTC)
- Sorry maybe "Linked article" isn't clear but is used in page for "interlink". Example (not real): If on en.wiki there is a page called "Nicklas Utgren" but without interlink to it.wiki and on it.wiki there is a page called "Nicklas Utgren" probably is a candidate to be merge. --ValterVB (talk) 16:14, 22 August 2013 (UTC)
update
[edit]Please con you update this page [2] with last dump? thanks Rippitippi (talk) 16:36, 23 August 2013 (UTC)
shares border with (P47) mellan Själland och Skåne
[edit]Jag har ställt en fråga på sv:WP:FF om gränsen mellan Skåne och Själland. Hittills har jag inte fått några svar från skåningarna på svwp, kanske jag kan få hjälp av en dansk? -- Lavallentalk(block) 17:25, 23 August 2013 (UTC)
- Jeg ved ikke med sikkerhed om havområder tilhører danske kommuner, men jeg tror det ikke. Jeg er i øvrigt ikke særlig stedkendt i Øresundsregionen (jeg er jyde). Byrial (talk) 19:27, 23 August 2013 (UTC)
- Och jag är norrlänning (just nu). Nej, det är möjligt vattnet inte tillhör kommunerna, men P47 borde kunna användas ändå... -- Lavallentalk(block) 06:46, 24 August 2013 (UTC)
Indonesian places
[edit]Hey Byrial, I've stumbled upon a large number of items that need merging. I was wondering if you could work your magic to form a list. The pattern appears to be Indonesian (id) and Dutch (nl) language links (e.g. Q12497851) and another item with a Sunda (su) language link (e.g. Q13199826). There is a lot of similarity between the id and su links. The only difference is that the su links have accents on the letter "e". Is there a way of generating a list of these that need merging? Thanks. Del♉sion23 (talk) 20:12, 26 August 2013 (UTC)
- I've found a list of them at the toolserver. May post it onto Project chat to ask for more help. It is quite a big list of merges. Del♉sion23 (talk) 22:32, 27 August 2013 (UTC)
List of False positives
[edit]Hope you don't mind but I edited your user page to add a section about the Merge and Unmerge exclusion pages as I was finding it hard to find them. Filceolaire (talk) 12:54, 27 August 2013 (UTC)
- That is fine. Thank you, Byrial (talk) 06:07, 29 August 2013 (UTC)
More ga-en merge lists
[edit]I've cleared the ga-en Countrymerge list and all the remaining items on that list have been added to the exclusion list.
Could you do a run of all the other types of merge list for these two languages?
Thanks Filceolaire (talk) 13:14, 27 August 2013 (UTC)
Can you help ?
[edit]Hi ! I'm new to Wikidata and don't know how to merge articles Neonatal phototherapy and Q14720649 . Can you merge them ? Thanks --Alborz Fallah (talk) 06:03, 28 August 2013 (UTC)
- (talk page stalker) I Merged them. If you want to find out all about merging, see Help:Merge. The Anonymouse (talk) 06:13, 28 August 2013 (UTC)
- Thank you . I will read it .--Alborz Fallah (talk) 14:05, 30 August 2013 (UTC)
List of items with only links to subtemplate
[edit]See Wikidata:Project_chat#subtemplate. Can you make this list?--GZWDer (talk) 14:43, 28 August 2013 (UTC)
- Any update?--GZWDer (talk) 15:02, 19 September 2013 (UTC)
countrymerge
[edit]Can you id-ms please. Thanks. Aurora (talk) 15:16, 30 August 2013 (UTC)
Roman numbers
[edit]Hi Byrial. Maybe it would be an idea to extend your number merge reports by Roman numbers (I, II, III, IV, ... as equivalents for 1, 2, 3, 4). This could catch some movie sequels and stuff, but especially some Spanish by-century-categories, as (at least) the Spanish Wikipedia numbers the centuries in Roman numerals, and possibly several more things (electoral districts, etc.). I don't think your script would have to be able to correctly count in Roman numbers, it should be sufficient to hardcode the numbers 1 to like 20. --YMS (talk) 19:57, 2 September 2013 (UTC)
- Good idea, thank you. I had a look and see that it also much used for numbers of kings, queens, popes, emperors etc. I will do that. Byrial (talk) 06:35, 3 September 2013 (UTC)
Polish merges
[edit]Hey Byrial, would you be able to create a country merge and number merge list for en-pl? Polish is probably the largest Wikipedia that's not had the treatment yet. I expect there to be quite a few Sydney 2000 Olympics articles in there. Thanks very much. Del♉sion23 (talk) 00:02, 4 September 2013 (UTC)
Mixed namespaces
[edit]Hello, Could you create list of items, where are mixed link to categories with other namespaces? And maybe templates and Portals too. (I am not sure about Wikipedia namespace, because there are some items like Finnish Wikipedia (Q175482) ) JAn Dudík (talk) 07:36, 5 September 2013 (UTC)
P21
[edit]Har du någon statistik över användandet av sex or gender (P21)? Hur många män går det på en kvinna här på WD? 4/1 på svwp enligt senaste tråden på sv:WP:BB. -- Lavallentalk(block) 08:19, 7 September 2013 (UTC)
List of chemical compounds
[edit]Hello, I am trying to macth wikidata items about chemicals with external identifiers. But I need first to extract the list of chemicals present in wikidata. From your dump tool can you extract the list of Q number for items defined as chemical ? The small problem is the definition of a chemical: at the beginning all chemicals from fr and en were tagged as instance of (P31): chemical compound (Q11173). But since some month different approaches were applied: replacement of instance of (P31) by subclass of and intermediate classification. So I think we can get the list of chemicals merging these different searches:
- every item which has instance of (P31): chemical compound (Q11173).
- every item which has instance of (P31): alcohols (Q156).
- every item which has instance of (P31): amino acid (Q8066).
- every item which has instance of (P31): carbohydrate (Q11358).
- every item which has instance of (P31): furfuryl alcohol (Q27335).
- every item which has instance of (P31): vitamin (Q34956).
- every item which has instance of (P31): lipid (Q11367).
- every item which has instance of (P31): hydrocarbon (Q43648).
- every item which has instance of (P31): organohalogen compound (Q387914).
- every item which has instance of (P31): organic acid (Q421948).
If you can do these searches and merge the different lists into one list of Q numbers, you can just copy-paste the list in user:Snipre/Test1 using comma as delimiter. Thanks Snipre (talk) 18:19, 7 September 2013 (UTC)
- Sorry for the demand, I found this tool about query. Snipre (talk) 22:43, 8 September 2013 (UTC)
re-run /projectmerge
[edit]Can you please start your bot again for the /projectmerge (Wikipedia & Wikivoyage)? --A.Bernhard (talk) 17:35, 8 September 2013 (UTC)
A few new lists
[edit]Hello, thanks for your lists. Could you please create new:
- numbermerge lists for cs-de, cs-fr, cs-es, sk-de, sk-fr, sk-es,
- countrymerge lists for cs-sk, cs-de, cs-fr, cs-es, sk-en, sk-de, sk-fr, sk-es,
- commonsmerge lists for sk-es, sk-de, sk-pl (like here)
when fresh dump comes? If one of these lists is too short, simply skip it. Glad you are member of the Wikidata community! Matěj Suchánek (talk) 15:22, 11 September 2013 (UTC)
- Hi. May I also ask for ru-uk, ru-be, ru-be-x-old pairs for numbermerge and en-ru for commonsmerge? --putnik 19:19, 11 September 2013 (UTC)
Request merge two pages
[edit]Q553459 and Q553201 is the same topic with Hatsune Miku: Project DIVA , please merge two pages--Lkt1126 (talk) 09:34, 12 September 2013 (UTC)
- No it isn't. First is about the game released in 2009, second about the game series. So don't merge. --A.Bernhard (talk) 10:39, 12 September 2013 (UTC)
Merge request
[edit]Could you please merge these two [3] and [4], thanks. --HistoryofIran (talk) 17:12, 21 October 2013 (UTC)
Could you also do the same with these two [5] and [6], thanks. --HistoryofIran (talk) 14:21, 25 October 2013 (UTC)
- not sure about this: the life descriptions doesn't quite match and there are two articles in Portuguese language. --Robot Monk (talk) 19:14, 25 October 2013 (UTC)
And these [7] and [8], thank you very much. --HistoryofIran (talk) 18:31, 25 October 2013 (UTC)
- I merged them --Robot Monk (talk) 19:14, 25 October 2013 (UTC)
These two [9] and [10] are the same but with different versions of the name, could you please merge them, and also these [11] and [12], thanks. --HistoryofIran (talk) 15:45, 7 November 2013 (UTC)
- I merged the last two, but the first two can't be merged as long as both have a Portuguese sitelink. Are those two correct? Maybe you can fix this, otherwise it's probably a case for Wikidata:Interwiki conflicts. Generally, merging is something anyone can do, see Help:Merge for some information about it (especially the gadget is a handy thing). Byrial, sadly, hasn't been seen here for almost two months now. I hope he's well (he likely is, his last edit just sounded like he's busy in real life). --YMS (talk) 16:23, 7 November 2013 (UTC)
- Merge request 2014-01-04
Q7915431→Q429160 or viceversa (same topic). Thanks in advance. Pjoef (talk) 15:57, 4 January 2014 (UTC)
Esperanto language communty at Wikidata
[edit]Saluton! I found you at project:Administrators by language. Please see project:Diskutejo#kie_komenci. I will post some invitations (at a new section project:Diskutejo#invitoj) there. Please feel free to add @לערי_ריינהארט at topics which might be interesting for me / related to Esperanto issues. Thanks in advance! לערי ריינהארט (talk) 10:52, 12 November 2013 (UTC)
Statistics
[edit]Hello, any chance that you get the time to update you statistics page ? They are dearly missed. --Zolo (talk) 08:14, 21 November 2013 (UTC)
- (talk page stalker) : After Rippitippi's thread on the Project chat has been archived, I repeat and update the information given there here: I sent a wikimail to Byrial on December 23, asking if he's okay, and if he's going to return some time soon, or if he has updated his report scripts. I did not receive an answer yet. He also has not made a single edit on any Wikimedia project since this one in September (neither did his bot), where he stated that he hopes "to soon resume a more normal level of activity". He obviously was not. Let's not hope so, but it looks like he has some more serious problems than running some scripts on Wikidata. I will announce here in case he will reply my mail rather than posting something here himself. Or does someone have a more personal connection to him and can try to contact him in another way?
- In the mean time, someone else may try to keep these reports running again. Byrial released all his scripts under the GPL on his toolserver account, though I don't know if those are the most recent versions, and that last posting I linked implied that they require some updates. They are written in C and Byrial has given some instructions on his user page. If someone tries to get along with them and faces some problems, I would try to help him, though not being really experienced with C and the Wikidata dumps and of course not having any particular knowledge about Byrial's code so far. --YMS (talk) 16:43, 4 January 2014 (UTC)
- I do not know Byrial personally, but (s)he has a history of editing at Wikimedia sometimes with longer breaks. The user was a frequent contributor to svwiki until September 2009 and disappeared until May 2013 as an example. I'm not worried, life is life, you know... -- Lavallen (talk) 18:02, 4 January 2014 (UTC)
- I tried to run some of his code, but for some reason I could not get it to work on my machine. I might try again sometime when I get the chance, though. The Anonymouse [talk] 05:42, 5 January 2014 (UTC)
- My copy of code is on https://github.com/steenth/byrial-wikidata-programs. But not all work. --Steenth (talk) 00:52, 6 January 2014 (UTC)
- @The Anonymouse, YMS: It is work now! See - User:Steenth/Sandkasse. Sourcecode is in github... Some will update reports? --Steenth (talk) 12:42, 5 August 2014 (UTC)
- Great, thank you very much! The need for those reports has decreased, as we User:Pasleim's lists (e.g. User:Pasleim/projectmerge) as well as User:Magnus Manske's Wikidata:The Game now, but I surely will have a look on your reports later. --YMS (talk) 14:37, 5 August 2014 (UTC) PS: Forgot User:Ivan A. Krestinin's lists, e.g. User:Ivan A. Krestinin/To merge. --YMS (talk) 14:39, 5 August 2014 (UTC)
- Made a page that lists all merge candidates from "The Game". Letter A. Whole list available, but >60K items pairs, takes a few minutes to load. --Magnus Manske (talk) 22:06, 5 August 2014 (UTC)
- Great, thank you very much! The need for those reports has decreased, as we User:Pasleim's lists (e.g. User:Pasleim/projectmerge) as well as User:Magnus Manske's Wikidata:The Game now, but I surely will have a look on your reports later. --YMS (talk) 14:37, 5 August 2014 (UTC) PS: Forgot User:Ivan A. Krestinin's lists, e.g. User:Ivan A. Krestinin/To merge. --YMS (talk) 14:39, 5 August 2014 (UTC)
- @The Anonymouse, YMS: It is work now! See - User:Steenth/Sandkasse. Sourcecode is in github... Some will update reports? --Steenth (talk) 12:42, 5 August 2014 (UTC)
- My copy of code is on https://github.com/steenth/byrial-wikidata-programs. But not all work. --Steenth (talk) 00:52, 6 January 2014 (UTC)
- User:Byrial/countrymerge is now updated. --Steenth (talk) 14:04, 6 August 2014 (UTC)
Bonvenon al Wikidata!
[edit]Rilate al user:Byrial/common.js. Bv. noti helpeton cxe User:Rotsaert8000. Antauxdankon! Kun amikaj salutoj el Munkeno לערי ריינהארט (talk) 19:35, 26 February 2014 (UTC)
Inactivity
[edit]Hello. I'm sorry to inform you that, since you have made no admin actions in the past six months, per Wikidata:Administrators#Losing adminship, your administrator access has been removed. If you wish to regain adminship, you may file a new RfP. Thanks.--Jasper Deng (talk) 06:27, 1 March 2014 (UTC)
Byrialbot
[edit]Your bot has been listed at Wikidata:Requests for permissions/Removal/Inactive bot accounts as being inactive for over two years. As a housekeeping measure it's proposed to remove the bot flag from inactive bot accounts, unless you expect the bot will be operated again in the near future. If you consent to the removal of the bot flag (or do not reply on the deflag page) you can rerequest the bot flag at Wikidata:Requests for permissions/Bot should you need it again. Of course, You may request retaining your bot flag here if you need the bot flag. Regards--GZWDer (talk) 11:53, 26 June 2017 (UTC)
- The bot flag has been removed. Lymantria (talk) 05:59, 8 July 2017 (UTC)