Wikipedia:Bots/Requests for approval/KiloBot
- The following discussion is an archived debate. Please do not modify it. Subsequent comments should be made in a new section. The result of the discussion was Approved.
Operator: Wikihermit
Automatic or Manually Assisted: Automatic - unsupervised
Programming Language(s): pywikipedia
Function Summary: Search new pages for copy right violates
Edit period(s) (e.g. Continuous, daily, one time run): Most likely continuously
Edit rate requested: read request: 5 edits per minute
Already has a bot flag (Y/N): N (probably won't need one)
Function Details: KiloBot will search Special:newpages for copyright violations. If it finds one, it will save it to a text file on my computer with the website it found it on. I can then add the log to wikipedia for users to deal with the copyvios. I would have the bot upload it to a page on wikipedia, but I'm having problems with pagefromfile.py If anyone could help, (either by helping me with getting pagefromfile.py to work, to by rewriting copyright.py to upload to a user page) that would be great. ~ Wikihermit 04:17, 6 July 2007 (UTC)[reply]
Discussion
edit- How will it search for copyright violations? Google? Daniel 05:13, 6 July 2007 (UTC)[reply]
- If this bot will only be reading, it doesn't need much approval here, as it obviously doesn't need flagging. To be official we can still go through this though in which case:
- How often will this bot run (24/7?)
- When (if ever) will the bot edit?
- Thanks, — xaosflux Talk 05:31, 6 July 2007 (UTC)[reply]
- The bot will use yahoo to search for copyright violations, since yahoo allows use of its search engine with an ID. The bot will run 24/7 to check newpages. It will edit as soon as I can figure out the command to enter for pageforfile.py. If I can't figure this out shortly, we can close the BRFA until I figure it out.(if anyone can help with this: this is what I've been entering in: (I need help with the parts after -file ))
c:\pywikipedia2 c:\python25\python pageforfile.py -file:c:\pywikipedia2\copyright\output.txt -appendtop -summary:logging
- I need the bot to edit/add logs to User:KiloBot/CopyVioLogs... I'm not sure what to do with the -end:xxx and -start:xxx (it hasn't been working :-/)
- So, in short, the bot can detect and report copy vios to a text file. I can then put that into a wikipedia page. The (optional) part that doesn't work is getting the bot to automatically upload it. Thanks! ~ Wikihermit 05:50, 6 July 2007 (UTC)[reply]
- Yeah, as long as it's only editing one page in it's own userspace and not going crazy (read: 50+ writes a minute) I think it's a speedy approve. It's going to be putting less stress on the servers than the anti vandalism bots and most of it's action will be reads which is fairly server light. -- Tawker 17:48, 6 July 2007 (UTC)[reply]
- Yup. I'll keep it down to five reads/edits per minute. I'm trying to use C# now along with pywikipedia, but my laptop doesn't have internet access right now, and it has Microsoft Visual Basics on it, so I'll have to work on it in two weeks or so. And it will only edit one page (its own userspace, unless there is a demand for a bot that reports possible copyright vios in the wikipedia space). ~ Wikihermit 02:34, 10 July 2007 (UTC)[reply]
- I wouldn't rush to a speedy approval just yet. I'd very much like to see some sample work first. The idea sounds good, but I'd very much like to see how well the execution actually goes before granted rushed blanket approval. --Cyde Weys 03:32, 12 July 2007 (UTC)[reply]
Can we hold the speedy approval bit a moment? Please see User talk:Eagle_101#Python where a request was made to me by this bot operator to write some code for him, I'm presuming that this code would be used in conjunction with this bot. as its specifically stated in this request that there is no way to upload the results. Who is writing the code for this? How much of the code is actually working? Could you go step by step and explain how the bot knows which pages to check, is it using IRC, the rss feed, ect? Also how does it identify a copyright violation? What are the criteria? Also note that google has a 2,500 search (query) limit per IP that accesses its servers (this is assuming that google is being used here to find the copyright violations). Any and all replies to these questions would be welcome. Thanks. —— Eagle101Need help? 04:45, 15 July 2007 (UTC)[reply]
- Almost all of your questions can be answered above. They have the main code, just not the uploading code. They are not using google, they are using Yahoo, with their API. Matt/TheFearow (Talk) (Contribs) (Bot) 04:49, 15 July 2007 (UTC)[reply]
- Ok, thats fine, I missed the yahoo bit above, in that case may I ask what exactly is the criteria that pywiki is using? —— Eagle101Need help? 04:59, 15 July 2007 (UTC)[reply]
- Yea, I went ahead and wrote a script (for upload) and am going to try it out. The bot uses pywikipedia: copyright.py. As far as what to check, its an argument: -new:50. I hope this answers your questions. ~ Wikihermit 01:04, 16 July 2007 (UTC)[reply]
Comment May I point out that a similar bot already exists? It is Wherebot (talk · contribs) and has been operating on Wikipedia:Suspected copyright violations for quite a while now. Perhaps some communication with Where would be in order so as not to duplicate functionality? Миша13 09:03, 16 July 2007 (UTC)[reply]
- Duplicate bots are great, it stops things from not being done if one is offline. Also, this scans newpages, not a prepared list. That means it will find it a lot easier. Matt/TheFearow (Talk) (Contribs) (Bot) 00:43, 17 July 2007 (UTC)[reply]
- Just to clarify, Wherebot scans the new pages feed on IRC; it does not operate on a prepared list. It creates a list. Please see the source here. — Madman bum and angel (talk – desk) 02:48, 19 July 2007 (UTC)[reply]
- Done I've written a quick C# script to upload the .txt file to Wikipedia. I published the source: User:KiloBot/source. The bot will still edit in its userspace only, and I can use {{User:KiloBot/copyrightlogs}} to put it on Wikipedia:Suspected copyright violations ~ Wikihermit 17:45, 26 July 2007 (UTC)[reply]
- Example. ~ Wikihermit 17:50, 26 July 2007 (UTC)[reply]
- Done I've written a quick C# script to upload the .txt file to Wikipedia. I published the source: User:KiloBot/source. The bot will still edit in its userspace only, and I can use {{User:KiloBot/copyrightlogs}} to put it on Wikipedia:Suspected copyright violations ~ Wikihermit 17:45, 26 July 2007 (UTC)[reply]
Approved. Make sure the edit rate isn't excessive, and you are free to continue. --ST47Talk·Desk 18:36, 26 July 2007 (UTC)[reply]
- The above discussion is preserved as an archive of the debate. Please do not modify it. Subsequent comments should be made in a new section.