Wikipedia:Bots/Requests for approval/SieBot
- The following discussion is an archived debate. Please do not modify it. Subsequent comments should be made in a new section. The result of the discussion was Approved.
Operator: Siebrand (main account on nl.wp)
Automatic or Manually Assisted:
- Automatic: for autonomous interwiki changes
- Manually assisted: for manually assisted interwiki changes on problematic entries (rarely)
- Manually assisted: NowCommons tagging for images that have been copied to Wikimedia Commons
Programming Language(s): Pywikipediabot framework
Function Summary: interwiki, NowCommons tagging
Edit period(s) (e.g. Continuous, daily, one time run): Continuous for autonomous interwiki changes, occasional for NowCommons image tagging and manually assisted interwiki.
Edit rate requested: 5 edits per minute for NowCommons tagging
Edit rate requested: 4 edits per minute for autonomous interwiki changes
Edit rate requested: 10 edits per hour for manually assisted interwiki changes
Already has a bot flag (Y/N): N (although experienced running bots on nl.wp, Commons and few Wikipedias)
Function Details: As described above the bot has three roles:
- Autonomous and manually assisted interwiki.py are pretty well known and require no explanation, I guess.
- NowCommons tagging: with the help of Orgullomoore I have created a bot using the pywikipediabot framework that is capable of copying images from a local wiki to Wikimedia Commons, so that not only one, but all Wikimedia wikis can make use of images. It is manually assisted and a push of a button will copy the image to Commons using CommonsHelper and CommonSense and tag the source image once the process was completed succesfully. This bot is currently experimental and running on nl.wp. The bot has now copied almost 10.000 images there, all images tagged own work (cc-by, GFDL and PD types), identified by the license tags and the categories they end up in because of having that tag. Because of the large image store en.wp has, this wiki is an interesting target once the work on nl.wp is done.
Discussion
editAs far as the interwikis, everything looks great. I'm really curious to find out about the image copying bot, though. Can you point us to some diffs from nl.wiki or make a few edits here as a trial? Thanks. Approved for trial. Please provide a link to the relevant contributions and/or diffs when the trial is complete. Also, would you be interested in releasing the source code for the bot? (You don't have to, although it would be nice.) —METS501 (talk) 03:26, 5 April 2007 (UTC)[reply]
- Thanks for the trust regarding the interwikis. As I'm admin on nl.wp, I remove the images rather quickly after they have been copied, also removing the edit the bot made (check out the new version of nowcommons.py with the switch -autonomous :P). I've done a test run with 10 images here on en.wp, so you can check out SieBot's edits. Based on how things end up on Commons, I do have some sorting out to do with the CommonsHelper coder ({{GFDL-user|username}} is not correctly translated back to the copyright holder's user page), but that'll be just fine is my experience. The source code will most probably be released somewhere in the near future in the pywikipediabot framework, but I also have to discuss that with User:Orgullomoore, who has written most of the actual code (I just created specifications). The code currently also contains a few annoying bugs that I'd like to have taken out before making the code available. Experienced bot users can drop me a mail if they want the code for their use only, at the moment. Siebrand 08:44, 5 April 2007 (UTC)[reply]
- I see, so on the English Wikipedia the bot only puts the {{nowcommons}} tag after uploading the image to commons. That makes sense. Make another 50-75 edits on trial, and then we can approve the bot. —METS501 (talk) 18:05, 5 April 2007 (UTC)[reply]
- Yep, if the name does not change, it's a subst:ncd, if it changes it's subst:ncd|Image:NewName.ext. The edit summaries also differ slightly in both cases. I have made a request on Template talk:GFDL-self for a categorisation change that will help identify images that can be copied to Commons more easily. Please take a look at it if you have the time. I'll find some time to make the larger test run once Magnus has changed Commons helper, most probably this weekend. I'll post here once done. Cheers! Siebrand 18:56, 5 April 2007 (UTC)[reply]
- Test run completed. 75 images copied. Siebrand 22:03, 8 April 2007 (UTC)[reply]
- Yep, if the name does not change, it's a subst:ncd, if it changes it's subst:ncd|Image:NewName.ext. The edit summaries also differ slightly in both cases. I have made a request on Template talk:GFDL-self for a categorisation change that will help identify images that can be copied to Commons more easily. Please take a look at it if you have the time. I'll find some time to make the larger test run once Magnus has changed Commons helper, most probably this weekend. I'll post here once done. Cheers! Siebrand 18:56, 5 April 2007 (UTC)[reply]
- I see, so on the English Wikipedia the bot only puts the {{nowcommons}} tag after uploading the image to commons. That makes sense. Make another 50-75 edits on trial, and then we can approve the bot. —METS501 (talk) 18:05, 5 April 2007 (UTC)[reply]
I am confused by the bot's dealphabetization of interwiki links (see this, this, this). I'm sure there are many other instances – the bot has made a lot of edits in the last few days. These were in the first few contributions I looked at. Dekimasuよ! 05:54, 9 April 2007 (UTC)[reply]
- The second one, I notice, isn't really an error, because they were already out of order... it just failed to fix anything by moving the links around. I'm more confused by the first and third diffs. Dekimasuよ! 05:58, 9 April 2007 (UTC)[reply]
- Immediately after reading your comments and looking at the diffs I may have been as confused as you were. I run an unmodified interwiki.py (currently from pt.wp) with no special switches but autonomous. Interwiki sort order is defined, as you can see below, in the pywikipediabot framework[1]. As far as I know, every interwiki bot will apply the sort order as you observe on every wiki that defines self.alphabetic as the interwiki sort order. I do not know who designed the sort order and when, but it can be changed. I do know that my bot sorted them consistently according to the rules all other interwiki bots adhere to. Does someone else have a clue why the sort order is as it is with the perceived inconsistencies?
- More detailed explanation: (1) Español (es) comes before Esperanto (eo). (2) Српски / Srpski (sr) comes before Suomi (fi). (3) I have no idea how to pronounce Korean and Japanese in their respective languages, but for Japanese, from the sort order, I guess there's a Nippon-like ring to it. For Korean I imagine the same. I hope this explains things. If you think all interwiki bots are using an interwiki sort that is fundamentally wrong, please create an issue report on SourceForge with how you think it should be. Cheers! Siebrand 08:37, 9 April 2007 (UTC)[reply]
./families/wikipedia_family.py, lines 476-497 (version 1.169): # A revised sorting order worked out on http://meta.wikimedia.org/wiki/Interwiki_sorting_order self.alphabetic_revised = ['aa','af','ak','als','am','ang','ab','ar','an', 'arc','roa-rup','frp','as','ast','gn','av','ay','az','id','ms','bm', 'bn','zh-min-nan','map-bms','jv','su','ban','ba','be','bh', 'bi','bo','bs','br','bug','bg','bxr','ca','ceb','cv','cs','ch', 'ny','sn', 'tum','cho','co','za','cy','da','pdc','de','dv','nv','dz','mh','et', 'na','el','eml','en','es','eo','eu','ee','to','fab','fa','fo','fr','fy','ff', 'fur','ga','gv','sm','gd','gl','gay','ki','glk','gu','got','zh-classical','xal','ko','ha','haw', 'hy','hi','ho','hsb','hr','io','ig','ilo','bpy','ia','ie','iu','ik','os','xh','zu', 'is','it','he','kl','pam','kn','kr','ka','ks','csb','kk','kk-cn','kk-kz','kw','rw','ky', 'rn','sw','kv','kg','ht','kj','ku','lad','lbe','lo','ltg','la','lv','lb','lij','lt', 'li','ln','jbo','lg','lmo','hu','mk','mg','ml','mt','mi','mr','mzn','chm','cdo','mo', 'mn','mus','my','nah','fj','nl','nds-nl','cr','ne','new','ja','nap','ce', 'pih','no','nn','nrm','nov','oc','or','om','ng','hz','ug','uz','pa', 'pi','pag','pap','ps','km','pms','nds','pl','pt','kk-tr','ty','ksh','ro', 'rmy','rm','qu','ru','se','sa','sg','sc','sco','st','tn','sq','ru-sib','scn', 'si','simple','sd','ss','sk','sl','cu','so','sr','sh','fi','sv','tl', 'ta','roa-tara','tt','te','tet','th','vi','ti','tlh','tg','tpi','chr','chy', 've','tr','tk','tw','udm','uk','ur','vec','vo','fiu-vro','wa', 'vls','war','wo','wuu','ts','ii','yi','yo','zh-yue','cbk-zam','diq','zea','bat-smg', 'zh','zh-tw','zh-cn'] ./families/wikipedia_family.py, lines 541 (version 1.169): 'en': self.alphabetic,
You are completely right; I withdraw my objection and sincerely apologize. User:Ynhockey tells me that: "There was a straw poll about it a while ago and the result was about 50/50. Basically the bot ordered the languages by their native name, which is easier to navigate for reader (and harder for editors, but you learn after a while). Some notable languages which are not as their appear are Hebrew (Ivrit), Japanese (Nihongo), Chinese (Zhongwen), Korean (Hangugeo) and Finnish (Suomi)." Thank you for your answer as well. Dekimasuよ! 09:09, 9 April 2007 (UTC)[reply]
- No problem. You gave me the opportunity to familiarise myself a bit with what is where in the code for the bot :-) Siebrand 10:04, 9 April 2007 (UTC)[reply]
Approved. Mets already approved the interwiki; the NowCommons work looks great. The only request I have is that you update the bot's user page and have a forum for feedback and complaints on this wiki. Currently you're directing people to your user talk on the NL wiki. --kingboyk 14:10, 13 April 2007 (UTC)[reply]
- OK. I'll do that. For interwiki related stuff I assume it's OK to have people redirected to my nl.wp talk page, right? Once I start working on images, I'll ensure to check my user pages more often (done now abot once/day for en.wp and the wiki I have configured as my interwiki home wiki - at this point in time that is pt.wp). Does that sound OK?
- Thanks again for the smooth procedure. Siebrand 17:16, 13 April 2007 (UTC)[reply]
- You're welcome, and thank you for the positive feedback :)
- With regards to your talk page, as long as people can make any complaints/report problems on this wiki and as long as you check your talk page here every so often, that's all we ask. We like to keep such business on site. Whether you field general queries on another Wikimedia site is entirely up to you. Cheers! --kingboyk 12:27, 17 April 2007 (UTC)[reply]
- The above discussion is preserved as an archive of the debate. Please do not modify it. Subsequent comments should be made in a new section.