User talk:GreenC/2022
Wishing you a happy 2022!
edit-
MMXXII Lunar Calendar
Have a great 2022 and thanks for your continued contributions to Wikipedia.
– Background color is Very Peri (#6868ab), Pantone's 2022 Color of the year
IABot edits on dawiki
editHi! I noticed edits like da:Special:Diff/10981352 and came to your page to read more. I noticed that you have a bot task for "fixes known problems with Internet Archive Wayback Machine links" etc. I wonder what those problems are and if you could also fix them on dawiki if relevant? --MGA73 (talk) 21:13, 4 January 2022 (UTC)
- In theory, it depends what the problem is due to technical limits of working with templates in other languages, how complex the the problem is. Like following the footsteps of IABot and fixing formatting errors on wiki, but not fixing IABot itself. -- GreenC 05:38, 5 January 2022 (UTC)
- Thank you. Yes you are right that language could be a problem. As far as I know all English templates and parameters work on dawiki too. I checked out User:GreenC/WaybackMedic 2.5. Is there any where else I can look to know more about the edits? --MGA73 (talk) 12:58, 5 January 2022 (UTC)
- I do not mean that it is not good enough. I just wanted to be sure I'm reading the right place. It looks very cool so far. --MGA73 (talk) 13:02, 5 January 2022 (UTC)
- Yes that's generally what it does, also a lot more it's continually under development. The program is so large to make it work in another language would be a major project. I'd like to, it won't be any time soon. For example there are many requests at WP:BOTREQ that currently can only be done in Enwiki, that should also be done in other languages. -- GreenC 15:18, 5 January 2022 (UTC)
- Thank you. Well if you ever want to try you could perhaps just run the bot on dawiki (like 50 test edits) without any changes of the code. Then we could see what happens. Hopefully it will just skip templates in Danish but still fix those in English. If it breaks stuff I can fix that manually. --MGA73 (talk) 19:58, 5 January 2022 (UTC)
- MGA73 - can you choose a few articles good for testing? Not 50, like 4 or 5. -- GreenC 15:02, 6 January 2022 (UTC)
- Thank you. Well if you ever want to try you could perhaps just run the bot on dawiki (like 50 test edits) without any changes of the code. Then we could see what happens. Hopefully it will just skip templates in Danish but still fix those in English. If it breaks stuff I can fix that manually. --MGA73 (talk) 19:58, 5 January 2022 (UTC)
- Yes that's generally what it does, also a lot more it's continually under development. The program is so large to make it work in another language would be a major project. I'd like to, it won't be any time soon. For example there are many requests at WP:BOTREQ that currently can only be done in Enwiki, that should also be done in other languages. -- GreenC 15:18, 5 January 2022 (UTC)
- I do not mean that it is not good enough. I just wanted to be sure I'm reading the right place. It looks very cool so far. --MGA73 (talk) 13:02, 5 January 2022 (UTC)
- Thank you. Yes you are right that language could be a problem. As far as I know all English templates and parameters work on dawiki too. I checked out User:GreenC/WaybackMedic 2.5. Is there any where else I can look to know more about the edits? --MGA73 (talk) 12:58, 5 January 2022 (UTC)
Hi! Thank you. I picked a few articles IABot edited recently that also have many links (is that a good criteria?):
Please only test if it is not too much work :-) And in case there are any errors or problems I can clean up. --MGA73 (talk) 15:20, 6 January 2022 (UTC)
- OK. I need to finish another project first, the bot is currently tooled for that project, than can run some tests on these articles. The bot separates processing from upload so I can process the articles, look at proposed diffs and logs to see what it would do, before upload. -- GreenC 16:11, 6 January 2022 (UTC)
- Thanks a million! --MGA73 (talk) 18:37, 6 January 2022 (UTC)
- I tried it (no upload). Unfortunately the dates are a problem it is seeing
|archive-date=5. januar 2007
and switching to|archive-date=January 5, 2007
. There are a lot of functions related to dates it would not be quick or easy to add support for other forms. GreenC 21:55, 9 January 2022 (UTC)
- I tried it (no upload). Unfortunately the dates are a problem it is seeing
- Thanks a million! --MGA73 (talk) 18:37, 6 January 2022 (UTC)
Another question: At some point dawiki had a bad code in the CS1-module so a mix of bad things made some urls being marked as dead instead of live. Does the bot check and fix that? Or can we ask the bot to fix that? --MGA73 (talk) 11:02, 9 January 2022 (UTC)
- Not that I am aware of. It would probably require a separate bot to do header checks for status 200. If there is a redirect URL it is more difficult due to soft-404s, those could be skipped initially. -- GreenC 22:03, 9 January 2022 (UTC)
- Thanks. Too bad. Well if everything else fails we can always remove "dead" and have the bot check them again. --MGA73 (talk) 06:41, 10 January 2022 (UTC)
- Someone had an old database-dump so I got a list of 419 articles that I think may have a wrong url-status. So I will just remove "dead" from all links in those articles and hopefully IABot will be so kind to check the links again :-) --MGA73 (talk) 14:57, 10 January 2022 (UTC)
- It seems that if I remove "dead" or "url-status=dead" then the bot will still ignore the links. Do I have to remove "archive-url" and/or "archive-date" to get the bot to check the link? Or should I change to "live"? --MGA73 (talk) 16:03, 10 January 2022 (UTC)
- You could try but I am not sure it will work as IABot might either do nothing or set it back to dead since a missing url-status is the same as url-status=dead for CS1. Another option that's available on enwiki (not sure dawiki) is
|url-status=bot: unknown
which is a flag saying the status was set by a bot, but the bot doesn't know the true status. It's a flag for humans or other bots so they know it needs help. -- GreenC 16:48, 10 January 2022 (UTC)- Thank you. I tried to remove the url and that seems to work. But according to https://iabot.toolforge.org/index.php?page=runbotqueue I can't ask the bot to check all the pages. I have to take them 1 page at a time :-( --MGA73 (talk) 18:27, 10 January 2022 (UTC)
- Oh I see, good idea. Yes, when doing archive all links option, it allows one request/page at a time. -- GreenC
- It seems that bot jobs are completely disabled. --MGA73 (talk) 20:50, 10 January 2022 (UTC)
- I think Cyberpower is trying to fix a problem today unrelated. -- GreenC 21:03, 10 January 2022 (UTC)
- Oh I see, good idea. Yes, when doing archive all links option, it allows one request/page at a time. -- GreenC
- Thank you. I tried to remove the url and that seems to work. But according to https://iabot.toolforge.org/index.php?page=runbotqueue I can't ask the bot to check all the pages. I have to take them 1 page at a time :-( --MGA73 (talk) 18:27, 10 January 2022 (UTC)
- You could try but I am not sure it will work as IABot might either do nothing or set it back to dead since a missing url-status is the same as url-status=dead for CS1. Another option that's available on enwiki (not sure dawiki) is
- It seems that if I remove "dead" or "url-status=dead" then the bot will still ignore the links. Do I have to remove "archive-url" and/or "archive-date" to get the bot to check the link? Or should I change to "live"? --MGA73 (talk) 16:03, 10 January 2022 (UTC)
- Someone had an old database-dump so I got a list of 419 articles that I think may have a wrong url-status. So I will just remove "dead" from all links in those articles and hopefully IABot will be so kind to check the links again :-) --MGA73 (talk) 14:57, 10 January 2022 (UTC)
Hi! On Help_talk:Citation_Style_1#CS1_maint:_url-status we talked about "url-status=dead" and the use of "Dead link". On da.wiki I noticed da:Special:Diff/11055454 where IABot set status to dead leading to an "CS1 maint: url-status"-error. The bot does now about "Dead link"-template (in Danish "Dødt link") per da:Special:Diff/11055568. So I wonder why this happens. Should IABot be able to fix it or is it an example of what your bot would fix? --MGA73 (talk) 09:58, 13 February 2022 (UTC)
- It is a bug in IABot, or incorrect site cfg, not sure which. I left a bug report. My bot would simply remove an orphan
|url-status=dead
since it has no other members|archive-url=
and|archive-date=
which are required. . -- GreenC 15:11, 13 February 2022 (UTC)
Edit conflict
editSorry we overlapped edits. Thanks for fixing this. GA-RT-22 (talk) 05:04, 5 January 2022 (UTC)
- GA-RT-22 no problem, it was my fault to begin with for deleting that source, earlier today, I missed that factoid needed that source. -- GreenC 05:10, 5 January 2022 (UTC)
archive.org and cloudflare captcha
editIs there any way to save a page/site which has cloudflare captcha enabled? i.e. this page as reference for Citizen News. I have been trying the different archival sites, but they can't get past the captcha. – robertsky (talk) 05:51, 5 January 2022 (UTC)
- That's a good question. Archive.today often is ahead of the curve but it's hit or miss. Another option create a free account with Conifer (old webrecorder.io) and it will allow you to interact with the page as it records the save real-time. Thus you can save archives of infinite scroll, multi-click slideshows and presumably captcha, but I never tried captcha. Another to try is ghostarchive.org .. let me know if you find a solution. -- GreenC 07:36, 5 January 2022 (UTC)
- conifer proxies the cloudflare hcaptcha network requests, and thus fed me into a loop of the page being reloaded constantly. tried ghostarchive and archive.is, but they also faced the same issue. – robertsky (talk) 10:33, 5 January 2022 (UTC)
- I got it! I utilised webrecorder's complementary tool, archiveweb.page to record the page in browser. I also have Cloudflare's Privacy Pass extension enabled to skip their captcha with pre-loaded challenge token. After that it is a matter of exporting the warc file and upload it on to conifer. – robertsky (talk) 16:27, 5 January 2022 (UTC)
- Hey, nifty! I did this because IABot will delete the archive URL as unrecognized otherhwise. Webrecorder has cool stuff. Wikipedia users could generate warcs that can be moved, copied, saved and hosted from anywhere: https://replayweb.page/docs/ -- GreenC 17:16, 5 January 2022 (UTC)
- I have extracted the list of urls that are currently on the enwiki, and archived them all. just in case we wanna manually/semi-automate inserting of archive-url on the existing refs. https://conifer.rhizome.org/robertsky/citizen-news-wp-ext-links – robertsky (talk) 23:55, 5 January 2022 (UTC)
- – robertsky: are the youtube links working for you? They do not for me /embed/5wAppSO66_w. Wonder if the "embed" is the problem. A good service for youtube on-demand is ghostarchive: /embed/5wAppSO66_w. Wayback also supports but may be slower to save via a queue: /embed/5wAppSO66_w. BTW Ghost uses webrecorder software on the back-end. There are a couple things about Conifer to be aware of. It is a security risk since anyone can create a warc, modify the content to support a conspiracy theory misinformation; I could imagine a day when enwiki decides to ban the site for that reason. I'm looking into any solutions such as checksums. Also there is limited storage space per account and youtube uses a lot. --- GreenC 14:40, 6 January 2022 (UTC)
- I didn't test the youtube links. those are generated incidentally when trying archive webpages with youtube videos embedded. I wasn't too much concerned with youtube as there isn't much of a running cost to have the videos hosted on youtube, unlike the website. Agreed on conifer being a possible security risk. I could think of multiple ways to conduct such nefarious activities when I was doing the archiving. – robertsky (talk) 16:33, 6 January 2022 (UTC)
- Relevant. I know one of the authors. Basically there are no accepted standards, but people have been looking at it for years. Hard problem it seems. -- GreenC 19:31, 6 January 2022 (UTC)
- reminds me of Quis custodiet ipsos custodes? – robertsky (talk) 06:32, 9 January 2022 (UTC)
- Relevant. I know one of the authors. Basically there are no accepted standards, but people have been looking at it for years. Hard problem it seems. -- GreenC 19:31, 6 January 2022 (UTC)
- I didn't test the youtube links. those are generated incidentally when trying archive webpages with youtube videos embedded. I wasn't too much concerned with youtube as there isn't much of a running cost to have the videos hosted on youtube, unlike the website. Agreed on conifer being a possible security risk. I could think of multiple ways to conduct such nefarious activities when I was doing the archiving. – robertsky (talk) 16:33, 6 January 2022 (UTC)
- – robertsky: are the youtube links working for you? They do not for me /embed/5wAppSO66_w. Wonder if the "embed" is the problem. A good service for youtube on-demand is ghostarchive: /embed/5wAppSO66_w. Wayback also supports but may be slower to save via a queue: /embed/5wAppSO66_w. BTW Ghost uses webrecorder software on the back-end. There are a couple things about Conifer to be aware of. It is a security risk since anyone can create a warc, modify the content to support a conspiracy theory misinformation; I could imagine a day when enwiki decides to ban the site for that reason. I'm looking into any solutions such as checksums. Also there is limited storage space per account and youtube uses a lot. --- GreenC 14:40, 6 January 2022 (UTC)
- I have extracted the list of urls that are currently on the enwiki, and archived them all. just in case we wanna manually/semi-automate inserting of archive-url on the existing refs. https://conifer.rhizome.org/robertsky/citizen-news-wp-ext-links – robertsky (talk) 23:55, 5 January 2022 (UTC)
- Hey, nifty! I did this because IABot will delete the archive URL as unrecognized otherhwise. Webrecorder has cool stuff. Wikipedia users could generate warcs that can be moved, copied, saved and hosted from anywhere: https://replayweb.page/docs/ -- GreenC 17:16, 5 January 2022 (UTC)
- I got it! I utilised webrecorder's complementary tool, archiveweb.page to record the page in browser. I also have Cloudflare's Privacy Pass extension enabled to skip their captcha with pre-loaded challenge token. After that it is a matter of exporting the warc file and upload it on to conifer. – robertsky (talk) 16:27, 5 January 2022 (UTC)
List of Pokémon Go-related injuries and deaths
editI am currently making the page, as well as the 2 deaths caused by that Burger King pokeball. Draft:List of deaths caused by the Pokemon franchise — Preceding unsigned comment added by 98.148.167.84 (talk) 06:45, 6 February 2022 (UTC)
Now that WebCite archives are no longer accessible (they might have been destroyed, who knows) is anyone/any bot doing anything to replace them with working archives? Kailash29792 (talk) 05:34, 18 February 2022 (UTC)
- @Kailash29792: There was consensus to replace them with Wayback links, and I think GreenC had an approved bot to do so. The problem, according to him (or at least what I think he said) is with content drift - sometimes, the webcite archived page is different than the one on web.archive.org. In addition, maybe there were pages that worked with webcite but not web.archive.org. And also apperently, there are some talks going on with WebCite and archive.org to transfer the database (not sure if that's true or the talks are still ongoing).
- It's my personal belief that if we know Webcite is never coming back, then we should just replace all the links with web.archive.org. Rlink2 (talk) 14:49, 18 February 2022 (UTC)
- That's right. There is still some thread of hope, but what he's attempting to do, will take time and money he has to raise. From what I know of the owner, I don't particularly trust him to do the right thing, for Wikipedia purposes, even if he gets it back online, we'd be better without it long term.
- Also, it might be that archive.today could be better than Wayback in this case. Because WebCite has been around since 1997, it was the only 'save page now' option until Archive.today arrived in 2012 (Wayback SPN started around 2014, I think). At the time, archive.today did a sweep of all links on Wikipedia including saving the WebCite archive pages themselves (double archive). Don't bother checking Wayback due to how WebCite structures its archives, the Wayback Machine to this day in incapable of reliably double archiving WebCite. There is an opportunity to find WebCite archives pages hosted at Archive.today that would match the dates we need. No idea how many there might be. -- GreenC 05:39, 20 February 2022 (UTC)
Following up from external links discussion
editHi, you were pinged in that discussion as someone who knows the answer to my question: Museum Folkwang recently (don’t know how long ago) restructured their site index, leaving many links on Wikipedia broken. How do I go about getting a bot to check, update, and fix these links? Viriditas (talk) 21:44, 19 February 2022 (UTC)
- Hi User:Viriditas, yes you found the right person. Normally these are reported at WP:URLREQ. Each case is different. The more known the better, like, are the pages still alive but at new URLs (site migration). If so, can the new URLs be deciphered from the old, or are they completely different. Do the old URLs have redirects or just plain dead that need archives. Anything that can discovered would be helpful for determine how to configure the bot. -- GreenC 05:09, 20 February 2022 (UTC)
- Good to know for the future! Luckily, Special:Linksearch shows it can be fixed manually since there’s so few errors. Thank you! Viriditas (talk) 06:53, 20 February 2022 (UTC)
- Great. Those are the best kinds :) -- GreenC 16:26, 20 February 2022 (UTC)
- I fixed the broken links on the English Wikipedia, but there are still issues with wikis in other languages and sister projects like Commons, which continue to have broken links. To fix them, all I did was add "eMP” to the url, like this. Is that enough info for you to take a look at links to Museum Folkwang on the other projects? Viriditas (talk) 21:42, 20 February 2022 (UTC)
- I’ll make a request at URLREQ. Viriditas (talk) 21:47, 20 February 2022 (UTC)
- I fixed the broken links on the English Wikipedia, but there are still issues with wikis in other languages and sister projects like Commons, which continue to have broken links. To fix them, all I did was add "eMP” to the url, like this. Is that enough info for you to take a look at links to Museum Folkwang on the other projects? Viriditas (talk) 21:42, 20 February 2022 (UTC)
- Great. Those are the best kinds :) -- GreenC 16:26, 20 February 2022 (UTC)
- Good to know for the future! Luckily, Special:Linksearch shows it can be fixed manually since there’s so few errors. Thank you! Viriditas (talk) 06:53, 20 February 2022 (UTC)
NUMBEROF and Wikidata
editQuick bug report: {{NUMBEROF}} doesn't seem to work with counts for Wikidata? Eg {{NUMBEROF|activeusers|wikidata}}
returns 24631. (Thanks for the great template/module/bot, by the way :) ) --Yair rand (talk) 23:24, 22 February 2022 (UTC)
- List of sites tracked. Not a bug, just not tracked. (thank you). I think Wikidata is so different from everything else it doesn't fit the model. Have not looked at it too closely before. API:Siteinfo, which it uses to get stats, I don't think works with Wikidata? -- GreenC 02:17, 23 February 2022 (UTC)
- Siteinfo seems to work on Wikidata. But thanks anyway. --Yair rand (talk) 08:13, 23 February 2022 (UTC)
- So it does. I guess that would require a new Lua module and update to the bot. -- GreenC 14:52, 23 February 2022 (UTC)
- It now works:
{{NUMBEROF|activeusers|www.wikidata}}
-> 24631 .. let's follow up at the other thread on the template talk page. -- GreenC 15:55, 23 February 2022 (UTC)
- It now works:
- So it does. I guess that would require a new Lua module and update to the bot. -- GreenC 14:52, 23 February 2022 (UTC)
- Siteinfo seems to work on Wikidata. But thanks anyway. --Yair rand (talk) 08:13, 23 February 2022 (UTC)
web.archive.org can archive ghostarchive videos
editExample: https://web.archive.org/web/20220206195653/https://ghostarchive.org/varchive/-CIOhY4ysRE
Could be useful as a "second level backup" for if ghost goes down (Not that I think it would). Thought I would let you know. Rlink2 (talk) 14:01, 25 February 2022 (UTC)
- Wayback has or plans to save every YouTube on Wikipedia so presumably in the case Ghost went down there might be a wayback version available: https://web.archive.org/web/20220217052941/https://www.youtube.com/watch?v=-CIOhY4ysRE .. good to know Wayback can archive Ghost, and Ghost allows itself to be archived. It's not being done automatically, though it might be a good idea. -- GreenC 14:34, 25 February 2022 (UTC)
Unclear on autonomy of IABot
editHi, following up with our last discussion, I was wondering about the autonomy of IABot. Is it trawling all the wikis and checking for dead links? My understanding is that it’s not. If, as I assume, it’s not, can I request that it trawl all the articles linked to the WikiProject Visual Arts template on the English Wikipedia? Viriditas (talk) 21:38, 25 February 2022 (UTC)
- It does auto scan all pages in theory, enwiki is so large it might take a very long time. Template:WikiProject Visual arts has a transclusion count of 75.6k .. could get the list of pages, break into 5 or 10 parts, with each as a user-submitted job via iabot.org -- GreenC 05:05, 26 February 2022 (UTC)
- Thanks. Maybe I should just start small and focus on one category for now. How do I ask the bot to fix all articles listed in Category:Pierre-Auguste Renoir, including subcategories? Viriditas (talk) 09:23, 26 February 2022 (UTC)
IA Bot - Books and languages
editHello! I wanted to ask 2 questions in regard to IA Bot mostly out of curiosity.
- It has come to my attention lately that IABot "has been merged" with GreenCBot in some tasks about books. Can you tell me more about this whole thing?
- In SqWiki (my homewiki) we ask for all our citations to have their languages specified for statistical purposes. We have this category that has around 12k articles with missing language values in their citations. Considering that IA Bot is very powerful and has access to a lot of information regarding references, could it be possible so that it also determined and put the language value in some of our citations? Anything that could lower that number somehow would be appreciated. - Klein Muçi (talk) 23:12, 27 February 2022 (UTC)
- @Klein Muçi Nice to see you again. I am sorry you lost the Steward elections but I think you may have a chance next year or two. Keep up the good work and I will be voting for you if you decide to go at it again.
- Regarding Number 2, it doesn't seem to me like IABot would have the ability to detect the language for any arbitary source. At best, maybe it could detect the language parameter for limited and whitelisted sources. GreenC would know more about this. (I don't know anything about question 1, GreenC can answer)
- In the case IABot does not handle this, it is sort of possible to develop a tool like this for your wiki to detect the language used on any given source. There are some offline tools (don't depend on any external service) that do this, I have used them and they are pretty good at langauge detection (not translation; just detection and identification). Rlink2 (talk) 00:03, 28 February 2022 (UTC)
- @Rlink2, hello! :) Thank you very much for your support! Can you tell me more about such tools? That's exactly what I'm looking for. As for the IA capabilities, that's, again, what I was hoping for. Maybe given its vast information it has about citations in general, it can deduct the language for some links taken from certain websites or other mediums which are known to only produce content in one specific language. - Klein Muçi (talk) 00:12, 28 February 2022 (UTC)
- @Klein Muçi
- Sure, I've used such tools (as in "programming libraries") before. Sometimes the website in the code will declare the language of the content, so you can just use what they give you.
- For the sites that don't (99% of them most likely) there are libaries for most popular programming languages that will detect the language of any given text. At the very least, it is really great for determining if something is English or not English (not sure about its accuracy rate for actually telling the right language, but its a start)
- The hardest part of making such a tool would probably making sure the text extracted from the website is correct. As GreenC will know, there are a whole bunch of strange sites with strange layouts. Some sites like Youtube, Facebook, Twitter, and Instagram may not work (but those sites have always been special cases, IABot can't even handle any of those sites correctly, see phab:T294880).
- I just tried it on a couple random sites and it seemed to work just fine though. Could be an interesting idea to explore, who knows.
- If IABot has language detection it would probably just be for
mediums which are known to only produce content in one specific language.
, as you said. Rlink2 (talk) 00:34, 28 February 2022 (UTC)- @Rlink2, just being able to automatically put |language=en in articles that use English citations would be an immense help. I'm sure tens of thousand of entries would be immediately removed from that category making the remaining list manageable.
- That idea has been suggested to me 2 years ago. Take a look here. But ever since I haven't been able to find further help on it. - Klein Muçi (talk) 00:44, 28 February 2022 (UTC)
- @Klein Muçi
- Regarding Majavah's comment, the attribute is one way of determing the language (as I stated above as well). But not the only way, most sites do not use that attribute. For the ones that do, it would usually be more reliable than the language detector. For the ones that don't, that's where the language detector libraries come in.
- You can set the tool to only mark sources with English if it is only certain (at a confidence percentage you can set) it is actually English.
- Do you have a list of these backlogged citations that are in need of a lang parameter? Rlink2 (talk) 01:08, 28 February 2022 (UTC)
- @Rlink2, I'm not sure I understand your question correctly. I've already provided above the category (list) of the said citations you're asking for. Do you mean something else other than that? - Klein Muçi (talk) 01:10, 28 February 2022 (UTC)
- @Klein Muçi
- Oh yes, I see it now. Didn't read. Silly me.... Rlink2 (talk) 01:13, 28 February 2022 (UTC)
- @Rlink2, no problem at all. I was thinking that maybe you wanted something more specific. - Klein Muçi (talk) 01:15, 28 February 2022 (UTC)
- @Klein Muçi
- I cooked up a quick script using the tool, and copy and pasted the result into the edit window. It seems to be working ok (note it only works for cite web templates). See https://sq.wikipedia.org/wiki/Speciale:Kontributet/Rlink2 for diffs.
- For cite books and journals obviously it does not have access to the cited material (maybe this is where IAbot can come in) but it could detect if the title in the citation is in English. If the title of the book is in English, it is highly highly probable the actual content is actually in english. Rlink2 (talk) 01:39, 28 February 2022 (UTC)
- @Rlink2, yes, the results do indeed look good. If the work can be automatized, we can do a full run for the web sources and see how much are left and after that decide how to act with the remaining ones. Even though I believe the title's language can be good enough to determine the content's language. - Klein Muçi (talk) 01:42, 28 February 2022 (UTC)
- @Klein Muçi
If the work can be automatized
yes it can be automatized if that is your wish. It would be good to see a larger sample of diffs before running it fully unsupervised, to clean up and prevent any bugs and false markings.- Thanks for bringing this matter to us, it is very much appreciated. I am always happy to see people appreciate stuff I do. Rlink2 (talk) 02:01, 28 February 2022 (UTC)
- @Rlink2, thank you. It has been more than 2 years I look for ways to solve that problem and I've even asked at WP:User scripts for help but so far this is the only time we're talking about something concrete about this.
- Will you be running the script by your account? If so, I can give you autopatrolled rights now so everyone has an easier time. Does it work on any language or only English for the moment? Also, can we set it up with the parameter marked in its long form for standardization reasons? (|language= instead of |lang=) I believe we can continue the conversation further on SqWiki, either at your talk page or mine, so GreenC doesn't get a notification for each message that we send. - Klein Muçi (talk) 02:10, 28 February 2022 (UTC)
- @Rlink2, yes, the results do indeed look good. If the work can be automatized, we can do a full run for the web sources and see how much are left and after that decide how to act with the remaining ones. Even though I believe the title's language can be good enough to determine the content's language. - Klein Muçi (talk) 01:42, 28 February 2022 (UTC)
- @Rlink2, no problem at all. I was thinking that maybe you wanted something more specific. - Klein Muçi (talk) 01:15, 28 February 2022 (UTC)
- @Rlink2, I'm not sure I understand your question correctly. I've already provided above the category (list) of the said citations you're asking for. Do you mean something else other than that? - Klein Muçi (talk) 01:10, 28 February 2022 (UTC)
- @Rlink2, hello! :) Thank you very much for your support! Can you tell me more about such tools? That's exactly what I'm looking for. As for the IA capabilities, that's, again, what I was hoping for. Maybe given its vast information it has about citations in general, it can deduct the language for some links taken from certain websites or other mediums which are known to only produce content in one specific language. - Klein Muçi (talk) 00:12, 28 February 2022 (UTC)
Phil Yates
editHi there! Do you have anything that might help Draft:Phil Yates get back into article space? BOZ (talk) 19:00, 1 March 2022 (UTC)
Happy April 1
editDon't open this!
|
---|
|
Wargames drafts
editDo you know of any sources to help me get any of these drafts published?: Ancients (3W, 1986), MBT (Avalon Hill, 1989), Tomorrow the World (3W, 1989), 5th Fleet (Victory Games, 1989), Rise and Fall (Engelmann, 1989), and Shell Shock! (Victory Games, 1990). BOZ (talk) 22:10, 2 April 2022 (UTC)
- @BOZ (talk page watcher) Have you tried looking at Wikipedia:WikiProject Board and table games/Sources? GoingBatty (talk) 16:03, 6 April 2022 (UTC)
GreenC bot hasn't run today
editHi GreenC! I see that GreenC bot hasn't run today to create the backlinks reports for Certes and me. Is that something you'd be able to fix today? Thanks! GoingBatty (talk) 14:48, 6 April 2022 (UTC)
- All my jobs on Toolforge are being dropped. Hrmph! GreenC 15:23, 6 April 2022 (UTC)
- Traffic jam, too many jobs created gridlock. Should be cleared out for now. I'll need to move some tools to different accounts. -- GreenC 16:08, 6 April 2022 (UTC)
- Now run, just 11 links to fix in my batch today. Thanks! Certes (talk) 17:55, 6 April 2022 (UTC)
- 43 links for me to review. Thanks GreenC! GoingBatty (talk) 20:25, 6 April 2022 (UTC)
- @GoingBatty: What proportion of your reviews result in an edit? I review 100–150 links a day, of which about 10% need fixing. I skip 80–90% of them with little effort, either by title (British Fantasy Award is clearly linked correctly to Birmingham and not a mistake for Birmingham, Alabama) or with a quick hover for popups (Henrique Galvão
was a Portuguese military officer
, so obviously a captain rather than a captain (sports) etc.). The other 10% need a proper check, of which most are genuine errors. Does this match your experience, or should I cut out the targets with higher false positive rates? Certes (talk) 20:56, 6 April 2022 (UTC)- @Certes Some almost always need fixing (e.g. Billboard), while some are almost always correct (e.g. [{Country]]). I'm probably closer to 40% need fixing, but I can go through them quickly. Without the daily report, I'd make a lot less fixes. GoingBatty (talk) 21:02, 6 April 2022 (UTC)
- Yes, mine are variable too. Model is almost always Model (person). Madonna is usually right but the religious references are easy to spot. Perhaps it's time for a prune. Certes (talk) 21:26, 6 April 2022 (UTC)
- @Certes Some almost always need fixing (e.g. Billboard), while some are almost always correct (e.g. [{Country]]). I'm probably closer to 40% need fixing, but I can go through them quickly. Without the daily report, I'd make a lot less fixes. GoingBatty (talk) 21:02, 6 April 2022 (UTC)
- @GoingBatty: What proportion of your reviews result in an edit? I review 100–150 links a day, of which about 10% need fixing. I skip 80–90% of them with little effort, either by title (British Fantasy Award is clearly linked correctly to Birmingham and not a mistake for Birmingham, Alabama) or with a quick hover for popups (Henrique Galvão
- Traffic jam, too many jobs created gridlock. Should be cleared out for now. I'll need to move some tools to different accounts. -- GreenC 16:08, 6 April 2022 (UTC)
Potential dead domain coming up end June 2022
editHello there. I was just randomly reading through some articles, and encountered a overlay notice on this site https://joins.com. It says something about Joins Prime being discontinued and service removed by end of June 2022. I am iffy in my translations, but I am assuming that the entire website may just simply vanish by then. There are currently 3,000 external links to that site and its subdomains at the moment here (and probably way more in kowiki). This is probably an advance notice, but maybe you can start your bot on checking through each link for archived versions first, and set the domain to dead if it really turns dead by then. – robertsky (talk) 15:49, 20 April 2022 (UTC)
- @Robertsky: Joins Prime appears to be a pass which provides access to 100s of subscription magazines online. The joins.com site looks like it might do other things as well. Such as news, tv. Hard to say which links will stop working in June, IABot has recorded nearly 20 thousand unique links across all wikis. Wayback Machine has decent coverage of the site. -- GreenC 13:55, 22 April 2022 (UTC)
BRFA input
editCould use your expertise at Wikipedia:Bots/Requests for approval/ScannerBot, as you might have some ideas about the task that I don't think of. Primefac (talk) 16:04, 14 May 2022 (UTC)
How is it we've never met?
editThanks for your kind words over at User:Carletteyt. It seems we hang out on the same street corners for years but I never learned your name. From reading your user page it seems you and I share a great deal in appreciation of irony and Wikipedia culture. Nice to hear from you and hope we stay in touch. BusterD (talk) 10:59, 21 May 2022 (UTC)
- Well I was impressed with your patience and determination to teach this user given how difficult the case is so maybe it didn't work out but hats off to you for trying. We often spend the most time on Wikipedia with people we disagree with so it's good to balance that with people we appreciate. See you around! -- GreenC 14:56, 21 May 2022 (UTC)
Mail!
editIt may take a few minutes from the time the email is sent for it to show up in your inbox. You can {{You've got mail}} or {{ygm}} template. at any time by removing the
I assume you can guess the subject matter. Hobomok (talk) 14:21, 25 May 2022 (UTC)
Given the recent context of the 2 editors engaged above. Yes we all can assume, the "subject" matter.
Given the context and your 2 recent, tag-team style of edit conduct, when outwardly presenting yourselves as independent.
Receiving any communications as above, that involves private discusssion on how 2 ostensibly acclaimed independent editors, will edit articles or are to engage, in an attempted effort to make "subject"s of another editor.
Is what is termed...being engaged in canvassing and being meat-puppetted and should the subject be another editor. WP:HOUNDING of that another editor.
Assuming the "subject matter"...
...really seeing more of your deepening relationship. That you would not of brought to an open forum..
ITN recognition for John M. Merriman
editOn 29 May 2022, In the news was updated with an item that involved the article John M. Merriman, which you updated. If you know of another recently created or updated article suitable for inclusion in ITN, please suggest it on the candidates page. Black Kite (talk) 17:47, 29 May 2022 (UTC)
Category:E-book awards has been nominated for discussion
editCategory:E-book awards has been nominated for possible deletion, merging, or renaming. A discussion is taking place to decide whether this proposal complies with the categorization guidelines. If you would like to participate in the discussion, you are invited to add your comments at the category's entry on the categories for discussion page. Thank you. * Pppery * it has begun... 16:21, 2 June 2022 (UTC)
Wednesday June 8, 11am-5pm: New York Botanical Garden - Environment of the Bronx - Editing Wikipedia for Beginners | |
---|---|
Hello GreenC! The LuEsther T. Mertz Library of the New York Botanical Garden and the Environment of New York City Task Force invite you and the general public of all experience levels to come to the Mertz Library in person and learn how to use Wikipedia and write about the environment of the Bronx! All skill levels welcome at the event! Experienced Wikipedia editors from the Wikimedia New York City chapter will be in attendance and available to help. A one hour training session will be offered at the start of this event covering introductory topics. Attendees familiar with editing Wikipedia can edit off of a worklist focused on the environment of New York City; as well as, a sub-list focused on the environment of the Bronx. The Mertz Library will pull topical media from their collection to assist the editing. --Wikimedia New York City Team via Wil540 art (talk) 02:23, 7 June 2022 (UTC) |
Conduct in deletion-related editing
editHi GreenC. I have collapsed part of your evidence, as this case is not focused on ARS it is focused on "Conduct in deletion-related editing, with a specific focus on named parties." Obviously much of your evidence was about a named party and so that remains uncollapsed. I am also discussing this decision with the other drafting arbitrators. Additionally, you should know that in this case we have added an expectation that if you name a non-party in substantial ways that they be notified. I have done this for you. Please let me know if you have any questions. Barkeep49 (talk) 21:26, 22 June 2022 (UTC)
- User:Barkeep49: My question is why this is acceptable Wikipedia:Arbitration/Requests/Case/Conduct_in_deletion-related_editing/Evidence#The_Article_Rescue_Squadron_(ARS)_has_long_been_an_inclusionist_haven_to_canvass_for_AfD_votes when "this case is not focused on ARS". -- GreenC 21:45, 22 June 2022 (UTC)
- Also I refactored the paragraph about Wikipediocracy as 7&6 is mentioned throughout that thread concerning this very ArbCom it should be in evidence. If I am not supposed to refactor let me know what the alternative is. -- GreenC 21:54, 22 June 2022 (UTC)
- I have reversed my collapsing for now as we talk about it, but absent that you shouldn't have refactored. In the future if you have a concern with an action that a clerk or arb does, ask about it on the talk page. Arb space is obviously a little different than other places about such things. While I'm here two other notes. First your original evidence also contained OUTING and so I have removed and oversighted that information. This is an example where I think our policy can lead to semi-absurd outcomes but it remains our policy and as an oversighter I feel some obligation to uphold it. Second, you say at the end
I picked two users, as examples
. Are those two users 7&6 and Eeng? Barkeep49 (talk) 22:08, 22 June 2022 (UTC)- I didn't think about it because they have the same name and they don't hide it but I suppose you are right when you think about it technically is outing. That's why you are in charge and I am not :) Can I just edit again to clarify the users because it is EEng and Marshal. -- GreenC 22:15, 22 June 2022 (UTC)
- Yes you can certainly edit it again to do that or other changes. Barkeep49 (talk) 01:27, 23 June 2022 (UTC)
- I didn't think about it because they have the same name and they don't hide it but I suppose you are right when you think about it technically is outing. That's why you are in charge and I am not :) Can I just edit again to clarify the users because it is EEng and Marshal. -- GreenC 22:15, 22 June 2022 (UTC)
- I have reversed my collapsing for now as we talk about it, but absent that you shouldn't have refactored. In the future if you have a concern with an action that a clerk or arb does, ask about it on the talk page. Arb space is obviously a little different than other places about such things. While I'm here two other notes. First your original evidence also contained OUTING and so I have removed and oversighted that information. This is an example where I think our policy can lead to semi-absurd outcomes but it remains our policy and as an oversighter I feel some obligation to uphold it. Second, you say at the end
Arno Tausch deletion all over again
editMight interest you: https://en.wikipedia.org/wiki/Wikipedia:Articles_for_deletion/Arno_Tausch_(4th_nomination) Austrian political observer (talk) 05:41, 8 July 2022 (UTC)
Kalki on Wayback Machine
editThis link was created today, but it says, "This URL has been excluded from the Wayback Machine", like various Kalki archives accumulated over the past year. What to do? Is this a glitch or has Wayback started negating all Kalki archives? Kailash29792 (talk) 07:37, 21 July 2022 (UTC)
- I guess Kalki requested Wayback to take down the archives usually what that means. 518 pages have archive URLs. -- GreenC 16:11, 21 July 2022 (UTC)
- I have sent an email to Wayback, expecting a reply. But thankfully, archive.is [1] and ghostarchive.org [2] can still archive Kalki... for the time being. Hope GreenC_bot starts replacing them with alternate archive links (if not, just remove the dead archives), or Rlink2 does so the way he did numerous times with ghostarchive. Kailash29792 (talk) 03:54, 22 July 2022 (UTC)
- It's a three step process. First generate a list of all URLs. Second save each at archive.today. Third replace them in wiki. I can do step 1 and 3. Only for archive.today. Maybe Rlink2 is better setup for step 2? -- GreenC 04:07, 22 July 2022 (UTC)
- I am manually archiving each Kalki link at archive.today over the past few days. Hope Rlink2 can help add them to each page. Kailash29792 (talk) 11:09, 25 July 2022 (UTC)
- Great! I can add archive.today let me know when you are ready. -- GreenC 15:37, 25 July 2022 (UTC)
- Almost two months later, now I say I am. Kailash29792 (talk) 12:28, 16 September 2022 (UTC)
- User:Kailash29792 OK great. I'll work on it give me a few days. -- GreenC 22:56, 18 September 2022 (UTC)
- Almost two months later, now I say I am. Kailash29792 (talk) 12:28, 16 September 2022 (UTC)
- Great! I can add archive.today let me know when you are ready. -- GreenC 15:37, 25 July 2022 (UTC)
- I am manually archiving each Kalki link at archive.today over the past few days. Hope Rlink2 can help add them to each page. Kailash29792 (talk) 11:09, 25 July 2022 (UTC)
Zola translations
editHi there -- I was going based on the actual scans of the translations on archive.org that show the correct publication details -- the secondary source appears to be incorrect on all of these. Any thoughts on the best approach here? MichelCastagne (talk)
- MichelCastagne: I think you are right, it is confirmed here and similar advertizements and I can't find any that say "edited by". Same for Love Episode here. Some others say edited by, but not these two. These are commercial adds. Will revert, thanks ! -- GreenC 20:02, 28 July 2022 (UTC)
- Thanks for confirming! MichelCastagne (talk) 21:38, 28 July 2022 (UTC)
Question: self-referential vebug bot exclusion
editI boldly made this edit to the your vebug task description, but I wasn't sure whether the exclusion I set up was okay, in case it would interfere with automated editing of the task description (I checked the page history, and it doesn't look like you use the bot to edit task descriptions, but just wanted to double-check). Retro (talk | contribs) 13:46, 31 July 2022 (UTC)
Precious anniversary
editSix years! |
---|
Astoria (book) additional information request.
editHi GreenC. I was wondering if you are the person who added the following, final sentence to the Astoria (book) WP article: « For a synopsis of the accuracy of Irving's work, see the Edgeley W. Todd edition (1964). »
If so, would it be possible to summarize the synopsis in the WP article? I would do it myself, I have not been able to locate it. Dr Dobeaucoup (talk) 17:06, 20 August 2022 (UTC)
- User:Dr Dobeaucoup, This was back in 2014 which I barely remember now and should have cited. I do not have this book, but learned about the Edgeley W. Todd edition from the JSTOR article cited in the same section book review which I no longer have access the full article, but it should be possible to obtain through WP:REX or some other method. I suspect it has a synopsis of the synopsis. -- GreenC 19:20, 20 August 2022 (UTC)
- Yes. Thanks. I have just gone through JSTOR and it has provided some useful sources. Have added a bit more information on the accuracy of Irving's account. Will continue to work on it. Thanks again. Dr Dobeaucoup (talk) 19:25, 20 August 2022 (UTC)
- Much better! I have not read Irving's account but did read a more mdern take Astoria: John Jacob Astor and Thomas Jefferson's Lost Pacific Empire (2014) which was very good. -- GreenC 20:16, 20 August 2022 (UTC)
- Yes. Thanks. I have just gone through JSTOR and it has provided some useful sources. Have added a bit more information on the accuracy of Irving's account. Will continue to work on it. Thanks again. Dr Dobeaucoup (talk) 19:25, 20 August 2022 (UTC)
Nomination of National Republican Army (Russia) for deletion
editThe article will be discussed at Wikipedia:Articles for deletion/National Republican Army (Russia) until a consensus is reached, and anyone, including you, is welcome to contribute to the discussion. The nomination will explain the policies and guidelines which are of concern. The discussion focuses on high-quality evidence and our policies and guidelines.
Users may edit the article during the discussion, including to improve the article to address concerns raised in the discussion. However, do not remove the article-for-deletion notice from the top of the article until the discussion has finished.
Orphaned non-free image File:James Foley in 2011.jpg
editThanks for uploading File:James Foley in 2011.jpg. The image description page currently specifies that the image is non-free and may only be used on Wikipedia under a claim of fair use. However, the image is currently not used in any articles on Wikipedia. If the image was previously in an article, please go to the article and see why it was removed. You may add it back if you think that that will be useful. However, please note that images for which a replacement could be created are not acceptable for use on Wikipedia (see our policy for non-free media).
Note that any non-free images not used in any articles will be deleted after seven days, as described in section F5 of the criteria for speedy deletion. Thank you. — Red-tailed hawk (nest) 02:11, 25 August 2022 (UTC)
Replaceable fair use File:James Foley in 2011.jpg
editThanks for uploading File:James Foley in 2011.jpg. I noticed that this file is being used under a claim of fair use. However, I think that the way it is being used fails the first non-free content criterion. This criterion states that files used under claims of fair use may have no free equivalent; in other words, if the file could be adequately covered by a freely-licensed file or by text alone, then it may not be used on Wikipedia. If you believe this file is not replaceable, please:
- Go to the file description page and add the text
{{Di-replaceable fair use disputed|<your reason>}}
below the original replaceable fair use template, replacing<your reason>
with a short explanation of why the file is not replaceable. - On the file discussion page, write a full explanation of why you believe the file is not replaceable.
Alternatively, you can also choose to replace this non-free media item by finding freely licensed media of the same subject, requesting that the copyright holder release this (or similar) media under a free license, or by creating new media yourself (for example, by taking your own photograph of the subject).
If you have uploaded other non-free media, consider checking that you have specified how these media fully satisfy our non-free content criteria. You can find a list of description pages you have edited by clicking on this link. Note that even if you follow steps 1 and 2 above, non-free media which could be replaced by freely licensed alternatives will be deleted 2 days after this notification, per the non-free content policy. If you have any questions, please ask them at the Media copyright questions page. Thank you. — Red-tailed hawk (nest) 02:13, 25 August 2022 (UTC)
- User:Red-tailed hawk: Understood, a free version was unknown at the time. The replacement is a video screencap and he looks terrible. Maybe I can figure out how to get a better looking frame. -- GreenC 02:29, 25 August 2022 (UTC)
- I agree that it's not exactly a great photo. I have no opposition to getting a frame in which he looks better, though I've not really been able to get anything better where there isn't some sort of text overlay over him. — Red-tailed hawk (nest) 02:36, 25 August 2022 (UTC)
Soft 404
editFrom Simple English wiki and Citation Bot. List from English Wiki will be much bigger, but this does take some work by hand. https://web.archive.org/web/20160303221101/http://ww.whereinmanila.com/philippine-stock-exchange-ayala-tower-1 https://web.archive.org/web/20121104095000/http://usagym.org/pages/home/college/nissenemery.html https://web.archive.org/web/20150924013209/http://www.findmypast.co.uk/post84BMDSearchStart.action https://web.archive.org/web/20190308081704/http://www.beritasore.com/cgi-sys/suspendedpage.cgi?option=com_content&task=view&id=2435&Itemid=36 https://web.archive.org/web/20200421172734/http://esa.un.org/unpp/p2k0data.asp https://web.archive.org/web/20140202101715/http://www.disneychannelmedianet.com/DNR/2011/A_N_T%20_Farm_Season_2_FINAL.doc https://web.archive.org/web/20131012062138/http://www.seattleweekly.com/music/reverb/915082-129/lissssssssts/ https://web.archive.org/web/20181226144351/http://www.khyberpakhtunkhwa.gov.pk/Gov/index.php https://web.archive.org/web/20181226144351/http://www.khyberpakhtunkhwa.gov.pk/Gov/index.php%20 https://web.archive.org/web/20200309015903/http://www.aguascalientes.gob.mx/404.html https://web.archive.org/web/20191130034747/https://www.washingtonpost.com/world/national-security/ap-source-tom-donilon-resigns-as-obama-national-security-adviser-susan-rice-to-take-over/2013/06/05/78286c10-cdd1-11e2-8573-3baeea6a2647_story.html https://web.archive.org/web/20180212173246/http://www.islamabadthecapital.com/info-desk/government-offices https://web.archive.org/web/20201002163830/https://holbergprisen.no/en/julia-kristeva/french-order/ https://web.archive.org/web/20200612024604/https://www.goldenglobes.com/2015_73rd_Golden_Globes_Nominees/ https://web.archive.org/web/20120420032141/http://www.aggielambdachi.org/page.php?page_id=9692 https://web.archive.org/web/20200315004239/https://www.northwestgeorgianews.com/catoosa_walker_news/view/full_story/20727887/article-general-to-speak-during-veteran%e2%80%99s-day-recognition/?instance=home_local_news https://archive.today/20120722045115/www.billboard.com/column/chartbeat/chart-highlights-adult-contemporary-alternative-1005073232.story#/column/chartbeat/chart-highlights-adult-contemporary-alternative-1005073232.story
AManWithNoPlan (talk) 15:25, 26 August 2022 (UTC)
- @AManWithNoPlan: Thanks. I'll need to develop processes and code to deal with these on-wiki, and in the IABot database. It will make sense when there are many to process. -- GreenC 02:18, 29 August 2022 (UTC)
A barnstar for you!
editThe Original Barnstar | |
Because someone who's been around as long as you and done as much good work as you have, probably doesn't have enough barnstars. Andre🚐 16:21, 2 September 2022 (UTC) |
- Andre, thanks! As a fish fan you might enjoy this video, it's a white fish similar to Haddock. -- GreenC 19:30, 3 September 2022 (UTC)
- Ah very cool! Andre🚐 19:32, 3 September 2022 (UTC)
Board of Trustees election
editThank you for supporting the NPP initiative to improve WMF support of the Page Curation tools. Another way you can help is by voting in the Board of Trustees election. The next Board composition might be giving attention to software development. The election closes on 6 September at 23:59 UTC. View candidate statement videos and Vote Here. MB 03:27, 5 September 2022 (UTC)
A barnstar for you!
editThe Tireless Contributor Barnstar | |
For fixing thousands of external links to an informative website that went down, but was archived. BD2412 T 16:50, 20 September 2022 (UTC) |
Speedy deletion nomination of Wikipedia:Link rot/cases/kalkionline.com
editA tag has been placed on Wikipedia:Link rot/cases/kalkionline.com requesting that it be speedily deleted from Wikipedia. This has been done for the following reason:
All the issues have been sorted out
Under the criteria for speedy deletion, pages that meet certain criteria may be deleted at any time.
If you think this page should not be deleted for this reason, you may contest the nomination by visiting the page and clicking the button labelled "Contest this speedy deletion". This will give you the opportunity to explain why you believe the page should not be deleted. However, be aware that once a page is tagged for speedy deletion, it may be deleted without delay. Please do not remove the speedy deletion tag from the page yourself, but do not hesitate to add information in line with Wikipedia's policies and guidelines. If the page is deleted, and you wish to retrieve the deleted material for future reference or improvement, then please contact the deleting administrator, or if you have already done so, you can place a request here. Kailash29792 (talk) 04:33, 22 October 2022 (UTC)
You were recommended as someone who has a bot that could, reasonably quickly, go through the site and pick out every cited tweet and make sure that they've been archived (a task which has suddenly become much more urgent).
Is this something you could manage? Thanks. DS (talk) 18:08, 18 November 2022 (UTC)
{{Cite tweet}}
has about 40k. insource:/twitter.com/ has another 55k. So around 100k links. I suspect most of them are already archived. I did some manual spot checks and every one had an archive available. Wayback has a number of jobs that focus on finding and archiving Twitter links from Wikipedia and elsewhere. The other thing is if there was imminent danger of lost Tweets the archive team would be all over it really fast it's so high profile. Basically all one needs to do is extract every twitter.com URL and then issue a simple GET with /save/ in the URL could be done by anyone with a script. -- GreenC 20:12, 18 November 2022 (UTC)
DS, I have a list of every Twitter URL on enwiki about 250k. Checking each to see if there is a Wayback available, will take about a week. When that's done will will issue a save on those missing. -- GreenC 15:48, 20 November 2022 (UTC)
ArbCom 2022 Elections voter message
editHello! Voting in the 2022 Arbitration Committee elections is now open until 23:59 (UTC) on Monday, 12 December 2022. All eligible users are allowed to vote. Users with alternate accounts may only vote once.
The Arbitration Committee is the panel of editors responsible for conducting the Wikipedia arbitration process. It has the authority to impose binding solutions to disputes between editors, primarily for serious conduct disputes the community has been unable to resolve. This includes the authority to impose site bans, topic bans, editing restrictions, and other measures needed to maintain our editing environment. The arbitration policy describes the Committee's roles and responsibilities in greater detail.
If you wish to participate in the 2022 election, please review the candidates and submit your choices on the voting page. If you no longer wish to receive these messages, you may add {{NoACEMM}}
to your user talk page. MediaWiki message delivery (talk) 00:39, 29 November 2022 (UTC)
Removal of valid source
editHi GreenC,
GreenCbot removed a reference to a US Navy report which was originally linked to a usurped site (Rubicon Research Reository), then you manually removed the ref name from the text, leaving an uncited claim in a featured article. The source is not nullified by not being conveniently online, it remains as a printed document, probably available on request from the USN, possibly elsewhere too. Was this accidental? If not, what is the rationale? There were several other refs used in underwater diving, linked to Rubicon, that were not removed.
Cheers, · · · Peter Southwood (talk): 18:59, 14 December 2022 (UTC)
- it remains as a printed document It's been a while but I guess that was not clear to me at the time. There were several other refs used in underwater diving, linked to Rubicon, that were not removed. They have archive URLs available (except one had a typo in the template and now fixed) -- GreenC 20:09, 14 December 2022 (UTC)
- So accidental, probably. Or at least collateral damage not intended as part of a systematic program. Thanks, · · · Peter Southwood (talk): 05:10, 15 December 2022 (UTC)
- Is there a policy or guideline or consensus agreement you are following regarding which broken refs should be removed completely? Are you checking whether the source is likely to exist elsewhere other than on IA? (please ping with reply).· · · Peter Southwood (talk): 05:22, 15 December 2022 (UTC)
- @Pbsouthwood: the procedure is outlined at WP:USURPURL and it's been discussed many times at Help talk:Citation Style 1 about removing cites under conditions like this, though I can't point to a specific thread right now. (and this time was human error as there is a source offline). My program checks multiple archive providers not just IA. -- GreenC 14:54, 15 December 2022 (UTC)
- Thanks, I will read it up. And thanks for the work you and your bot do cleaning these things up. Sometimes shit happens, and has to be fixed. Sometimes the fix does not go as intended. That's life. My interest is largely because I like to be able to fix this sort of thing when I see it, but first must find out the accepted procedures, which I assume you are following. Is it complicated to check the archive providers? Cheers, · · · Peter Southwood (talk): 13:55, 16 December 2022 (UTC)
- @Pbsouthwood: There are about 20 I search though most come from IA and archive.today since they actively archive links found on Wikipedia. It's complicated because each provider have their own methods and quirks to code for. -- GreenC 15:06, 16 December 2022 (UTC)
- Looks like specialist work with a steep learning curve. Probably not an efficient use of my time if I only do it occasionally. Thanks again for doing it for all the rest of us. Cheers, · · · Peter Southwood (talk): 10:57, 17 December 2022 (UTC)
- @Pbsouthwood: There are about 20 I search though most come from IA and archive.today since they actively archive links found on Wikipedia. It's complicated because each provider have their own methods and quirks to code for. -- GreenC 15:06, 16 December 2022 (UTC)
- Thanks, I will read it up. And thanks for the work you and your bot do cleaning these things up. Sometimes shit happens, and has to be fixed. Sometimes the fix does not go as intended. That's life. My interest is largely because I like to be able to fix this sort of thing when I see it, but first must find out the accepted procedures, which I assume you are following. Is it complicated to check the archive providers? Cheers, · · · Peter Southwood (talk): 13:55, 16 December 2022 (UTC)
- @Pbsouthwood: the procedure is outlined at WP:USURPURL and it's been discussed many times at Help talk:Citation Style 1 about removing cites under conditions like this, though I can't point to a specific thread right now. (and this time was human error as there is a source offline). My program checks multiple archive providers not just IA. -- GreenC 14:54, 15 December 2022 (UTC)
Thanks
editHi GreenC. Thanks for sorting out the citation to the Zilberg paper on Sylvester Mubayi and elsewhere. I had tried to "rescue" a certainly-dead version at this oid on Henry Munyaradzi and found via Google scholar a new URL to Academia which gave access to the .pdf but only via a strange cloudfront.net redirect it invoked. I was unaware of WP:AWSURL and successfully created a Wayback machine archive version (as it was within the valid time window) which did work but was clumsy. I'll know better in future! Regards. Mikedt10 (talk) 12:24, 16 December 2022 (UTC)
- It's OK the way you did it was not wrong it worked but since there is a live URL avail might as well use that because those Cloudfront URLs are not ideal. You're right they usually arrive via Google Scholar about once a day I get notified of a new one which I then convert to the original academia.edu (found via Google search on title of paper and site:academia.edu)-- GreenC 15:01, 16 December 2022 (UTC)
- In fact I had found the correct academia.edu link, which was different from the one we previously had but for some reason (probably my incompetence) I couldn't make it work without being redirected to the Cloudfront page, so that's when I decided to archive that instead. All a useful learning experience! Mikedt10 (talk) 17:22, 16 December 2022 (UTC)
Merry Christmas!
editBOZ (talk) is wishing you a Merry Christmas! This greeting (and season) promotes WikiLove and hopefully this note has made your day a little better. Spread the WikiLove by wishing another user a Merry Christmas, whether it be someone you have had disagreements with in the past, a good friend, or just some random person. Don't eat yellow snow!
Spread the holiday cheer by adding {{subst:User:Flaming/MC2008}} to their talk page with a friendly message.
I'm wishing you a Merry Christmas, because that is what I celebrate. Feel free to take a "Happy Holidays" or "Season's Greetings" if you prefer. :) BOZ (talk) 23:07, 22 December 2022 (UTC)
- BOZ, thank you, bud! A Christmas Peace to you. -- GreenC 00:25, 23 December 2022 (UTC)
Happy Holidays
editI wish you and your loved ones a Merry Christmas and a prosperous New Year. Best regards RV (talk) 12:05, 23 December 2022 (UTC)
- Thank you, RV -- GreenC 04:20, 26 December 2022 (UTC)
Merry Christmas!
editRlink2 (talk) is wishing you a Merry Christmas!
This greeting (and season) promotes WikiLove and hopefully this note has made your day a little better. Spread the WikiLove by wishing another user a Merry Christmas, whether it be someone you have had disagreements with in the past, a good friend, or just some random person. Happy New Year! Spread the Christmas cheer by adding {{subst:Xmas3}} to their talk page with a friendly message. |
Rlink2 (talk) 04:52, 24 December 2022 (UTC)
- Thanks, Rlink2. -- GreenC 04:18, 26 December 2022 (UTC)
Happy Holidays
editHappy Holidays | ||
Hello, I wish you the very best during the holidays. And I hope you have a very happy 2023! Bruxton (talk) 01:54, 26 December 2022 (UTC) |
- Thank you, Bruxton! -- GreenC 04:19, 26 December 2022 (UTC)