User talk:HBC NameWatcherBot/Blacklist/Archive 1
This is an archive of past discussions about User:HBC NameWatcherBot. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Archive 1 | Archive 2 | Archive 3 |
bollox
Is "bollox" really profanity to the point that it violates WP:U? I am not English so I cannot tell how vulgar it is, but around here it is essentially inert. HighInBC(Need help? Ask me) 02:59, 11 May 2007 (UTC)
- Hmm well it is more vulgar here, but I agree that it isn't really as much profanity as many of the other strings. It would more come down to context - my thinking was that most of its use would probably come under "usernames that are defamatory or insulting to other people". Feel free to get rid of it if you think it will throw up too many inert names though. Will (aka Wimt) 15:03, 11 May 2007 (UTC)
I know, I can make a sub-page can act as a "keyword probation area". The bot can post to a special page and after a week or so, I can look over them and see how many false positives it gets. HighInBC(Need help? Ask me) 16:00, 11 May 2007 (UTC)
Beans?
The advantage of having this list online is that it can be adjusted real time in accordance with consensus. Keeping the list offline will not stop people from flooding Wikipedia with bad names, people already know what dirty words are, and even if they did not they could just watch what the bot reports. I have brought back the list for now and will keep an eye on it. HighInBC(Need help? Ask me) 02:26, 14 May 2007 (UTC)
Housethe
I've been vandalized quite a bit by a user who always begins his sock names with "Housethe." See Category:Suspected Wikipedia sockpuppets of Housethe. He hasn't made a ton yet, but he hasn't shown an indication that he's worried about people recognizing the name. Could we add it to the suspected sock name section of the blacklist? Does anyone foresee possible legitimate uses of the string? Leebo T/C 12:57, 18 May 2007 (UTC)
- ";housethe:SOCK_PUPPET(Housethe)" has been added to the list. Any false positives will be reported in the context of a suspected sockpuppet. (H) 18:11, 21 May 2007 (UTC)
Protection
Hey guys, could I ask if this could be downgraded to semi-protection? This list does not appear to be critical to the operation of Wikipedia, and similar lists (e.g., [[1]]) are only semi-protected, and appear to have done much better as a result (and were never vandalized). What's more, if for some unknown reason a vandal wanted to vandalize this list, it is watchlisted by enough people it could be undone. If there are technical reasons not to allow this, I would understand though (e.g., adding information that would crazy up the bot). The Evil Spartan 18:07, 21 May 2007 (UTC)
- If the list was vandalized it could be made to report every name created. A clever vandal could produce other undesired effects. I keep an eye on the page though and will fulfill requests. (H) 18:09, 21 May 2007 (UTC)
Arse
Would it be alright if I added variations of all the "ass..." blacklisted names, except using "arse" e.g. arsehole, as it doesn't seem to be covered. SGGH speak! 14:19, 22 May 2007 (UTC)
- Sure, but make sure you whitelist words like "arsenal". (H) 14:27, 22 May 2007 (UTC)
- Have done so, cheers SGGH speak! 15:04, 22 May 2007 (UTC)
- Excellent use of the LOW_CONFIDENCE flag. Question, do you find the blacklist interface simple enough to understand? I made it more complicated than I originally planned. (H) 15:32, 22 May 2007 (UTC)
- Though I can't edit it, I think it's simple enough. And if you don't understand it, just copy can paste a word that you want the same settings for, and change the word. --R ParlateContribs@ (Red Sux!) 22:16, 22 May 2007 (UTC)
More words
"Block" (e.g., User:You blocked me), "username", "suck", "sucks", "Yourmom", "Your mom", ".com", ".org", anything with "@", and anything with 12 or more consecutive numbers (e.g., User:Jfish871345687346578). Take the suggestions as you will. The Evil Spartan 18:17, 23 May 2007 (UTC)
- Well, block and username and suck will provide too many false positives I think. The @ sign and the top level domains are a great idea. As for the consecutive numbers I will need to write that right into the bot, which I will in time. The (your|yur|yer) (mom|mum)'s should be added too. Thanks. (H) 19:24, 23 May 2007 (UTC)
- Done, except for the consecutive numbers. I may have gone overboard on the top level domains, we will see what happens. (H) 19:35, 23 May 2007 (UTC)
- I am going to add "sucks" to my keyword probation area and see if it gets many false positives before I add it to the main reporting page. (H) 19:36, 23 May 2007 (UTC)
- Oops, adding those top level domains caused the bot to interpret the "." as a wildcard, I stopped and reverted it, then fixed the code to do a literal search. I will re-add regular expression ability later only to strings with the REGEX flag. That will allow me to shorten the list a bit. (H) 19:52, 23 May 2007 (UTC)
- A minor false positive as of recently that may be fixable: The "ass" regex matches "Top brass H01". Oh, and I believe I didn't congratulate you yet on this excellent bot. Thanks! Sandstein 20:33, 24 May 2007 (UTC)
- Lol, well since h01 is another way of spelling hole, I will just have to whitelist "brass". (H) 00:30, 25 May 2007 (UTC)
I notice "isafag" and "is a fag" aren't on the blacklist. I suggest we add them. Leebo T/C 12:59, 12 June 2007 (UTC)
- Added. (H) 13:45, 12 June 2007 (UTC)
Mistake
{{editprotected}} The line:
(Check to see if this name is being used to impersonate or insult User:Jimbo)
should be
(Check to see if this name is being used to impersonate or insult User:Jimbo Wales)
shouldn't it? Until(1 == 2) 04:00, 28 June 2007 (UTC)
- Done. Cheers. --MZMcBride 13:21, 28 June 2007 (UTC)
"porn"?
Mightn't it be a good idea to list the string "porn" (and variants pron, pr0n, p0rn etc.)? I understand that "porn" itself would have to be listed as low confidence, but noting that User talk:JesusLovesPorn (indefblocked by me) was only listed by BetacommandBot as containing the "jesus" string, it seems it might be worthwhile. Nihiltres(t.l) 13:39, 15 July 2007 (UTC)
- Good idea, and don't you mean HBC, not Betacommand? --(Review Me) R ParlateContribs@ (Let's Go Yankees!) 23:28, 15 July 2007 (UTC)
- It was BetacommandBot for some reason (beats me): see the diff. Nihiltres(t.l) 01:24, 16 July 2007 (UTC)
;p([o0]r|r[0o])n:REGEX,LABEL(Porn),LOW_CONFIDENCE
should do it. Until(1 == 2) 21:21, 21 July 2007 (UTC)
Expand for dotnet/org/biz
{{editprotected}}
I have seen "dotnet" and "dotorg" names go by unreported, so I suggest the following change:
;dotcom:NOTE(Please check if this is a web address - web page addresses are generally considered likely to be promotional)
into
;dot(com|net|org|biz):REGEX,LABEL(dotfoo),NOTE(Please check if this is a web address - web page addresses are generally considered likely to be promotional)
This change will match "dotcom","dotnet","dotorg" and "dotbiz". The "REGEX" flag is to tell it to read the string as a regular expression, and the LABEL tag calls the string "dotfoo" instead of "dot(com|net|org|biz)". Until(1 == 2) 21:19, 21 July 2007 (UTC)
Adding
;p([o0]r|r[0o])n:REGEX,LABEL(Porn),LOW_CONFIDENCE
would be helpful too. Until(1 == 2) 21:23, 21 July 2007 (UTC)
- Done for both, thanks for coming up with the syntax for the "porn" one I suggested. Nihiltres(t.l) 02:38, 22 July 2007 (UTC)
"Names from India" sentence
{{editprotected}}
The warning phrase for the word "shit" warning that names from India may contain this phrase is currently not working. I believe this to be because of this edit which added a comma to the sentence; the instructions at the top of the page state that flag paramateres cannot contain commas. Could anyone rewrite this sentence so that it does not contain a comma? Gandoman 09:14, 9 August 2007 (UTC)
- Done, good catch. Until(1 == 2) 13:36, 9 August 2007 (UTC)
Deadbot
Deadbot has stopped reporting names due to a discussion that decided to move the job to this bot. Here is a link to the blacklist Deadbot used when shut down: [2]. Some of these may need to be moved over here. Until(1 == 2) 18:18, 10 August 2007 (UTC)
Promotional
{{editprotected}}
Please add the words "marketing", "service" and "management" to the Promotional-section. I've reported a lot of promotional usernames which included these words. Melsaran 14:51, 14 August 2007 (UTC)
- (edit conflict)
Done- note that I added the note attached to the other Promotional-section entry to each of the new ones: NOTE(Check to see if the username is promotional in nature). I also added LOW_CONFIDENCE to the "service" entry, I think it's liable to produce false positives. Due to the edit conflict, I have not yet added "management". Nihiltres(t.l) 15:13, 14 August 2007 (UTC)- I've disabled the editprotected request. --MZMcBride 03:23, 15 August 2007 (UTC)
Block
Maybe the word "block" should be added to the blacklist. I often see people creating names such as "Block me now!" and such. --(Review Me) R ParlateContribs@ (Let's Go Yankees!) 17:26, 18 August 2007 (UTC)
- I think that would produce way to many false positives. ((1 == 2) ? (('Stop') : ('Go')) 17:28, 18 August 2007 (UTC)
- Hmm...what words contain "block" in it, and when would "block" be used in a good way as a username? --(Review Me) R ParlateContribs@ (Let's Go Yankees!) 17:37, 18 August 2007 (UTC)
- I share Until(1 == 2)'s concern about "block" being too likely to produce false positives, though "unblock" might not be bad. Nihiltres(t.l) 18:00, 18 August 2007 (UTC)
- Hmm...what words contain "block" in it, and when would "block" be used in a good way as a username? --(Review Me) R ParlateContribs@ (Let's Go Yankees!) 17:37, 18 August 2007 (UTC)
- lots of legitimate names have "block" in them. ((1 == 2) ? (('Stop') : ('Go')) 18:44, 18 August 2007 (UTC)
- Yes, but I think we should stick to keywords that give a reasonably low level of false positives. I suggest a regex describing the different ways people spell "block me", that would work. ((1 == 2) ? (('Stop') : ('Go')) 18:54, 18 August 2007 (UTC)
- I have zero experience with regular expressions, but mightn't something along the lines of
;((un|un )b[1l][o0]ck( )?(m[3e]|th[i1][5s])?|(un|un )?b[1l][o0]ck( )?(m[3e]|th[i1][5s])):REGEX,LABEL(BlockFoo),NOTE(Most usernames referencing blocking or unblocking are considered inappropriate)
work, assuming I've guessed my way through the regular expression syntax appropriately? (in other words, "this might work, but please check the syntax, I'm a regex noob")... Nihiltres(t.l) 19:45, 18 August 2007 (UTC)
- I have zero experience with regular expressions, but mightn't something along the lines of
- Hmm I will try to simplify that.
- (un)?\s?b[1l][o0]ck\s?(m[3e]|th[i1][5s])
- REGEX,LABEL(BlockFoo),NOTE(Usernames referencing blocking or unblocking may be considered inappropriate)
- So the "un" is optional, as is a space after it, the word "block"(with number substitions) followed by and optional space and "me" or "this" will trigger it. I did not make the "me"/"this" pattern optional because it avoids false positives. Look good? ((1 == 2) ? (('Stop') : ('Go')) 22:03, 19 August 2007 (UTC)
- OK, \s is an escaped space, that helps. (I'll probably be fine for future regex creation with some practice :D ) I tentatively added your version to the blacklist, but part of the reason for the overcomplexity of my version was to ensure that a name of the form "User:Unblock Vandal123" (which would seem to be a reasonably common form) would be included while "User:BlockDude123" would be fine. In that case, do nested parenthesis arguments of the form (foo|z(bar|omg)) work? Nihiltres(t.l) 00:01, 20 August 2007 (UTC)
- Nested parenthetical ors(|) do work, I just think "(Un)?block <any word>" is too wide an expression, and will find too many false positives. ((1 == 2) ? (('Stop') : ('Go')) 00:14, 20 August 2007 (UTC)
- Coming back on this by chance, I wonder if you misunderstood what I meant - I meant to flag names with the form of either "un\s?block <anything>" or, if "un" was absent, "block (me|this)". A regex setup (refined from last time :) ) that would flag as such would be (given that nested ors work properly)
- (^un\s?b[1l][o0]ck|b[1l][o0]ck\s?(m[3e]|th[i1][5s]))
- REGEX,LABEL(BlockFoo),NOTE(Usernames referencing blocking or unblocking may be considered inappropriate)
- I just want to confirm that "unblock X" is not too wide. Perhaps I'm being too wierd about this. :p Nihiltres(t.l) 19:14, 2 September 2007 (UTC)
- I did misunderstand. I am not sure about unblock X, but I don't see the harm in trying. If it does not work then it can be removed. The bot also supports reporting to different pages if we wanted to have a place to check keywords for effectiveness. ((1 == 2) ? (('Stop') : ('Go')) 17:45, 7 September 2007 (UTC)
British English
I haven't checked whether anyone's actually tried to use this, but perhaps the "paedoph" substring should be blocked along with "pedoph" to catch the British English spelling? Colin Watson 16:46, 19 August 2007 (UTC)
{{editprotected}}
- Done. Cheers. --MZMcBride 19:58, 19 August 2007 (UTC)
Pimp
Often used, such as User:Arab pimp User:PIMPZ and User:Mister pimp. Cheers, JetLover (talk) 02:17, 20 August 2007 (UTC)
- I don't think that there is a consensus calling oneself a "pimp" is a violation of the username policy. ((1 == 2) ? (('Stop') : ('Go')) 04:33, 20 August 2007 (UTC)
Alternative account
Would it be possible to add terms like "alternative account", "another account", "2nd account", "secondaccount" and so on with low confidence comments? Have come across a couple this morning. SGGH speak! 14:42, 25 August 2007 (UTC)
- Alternate accounts are allowed if you follow the WP:SOCK policy, so I don't see why these should be reported. ((1 == 2) ? (('Stop') : ('Go')) 18:07, 1 September 2007 (UTC)
Foreign nasties
It occurs to me that there are several non-English words that should probably be on the blacklist. For instance cazzo, Italian slang for penis, which was in a username the other day. More foreign words that may need to be excluded are in the top of unrelated ANI thread. Thoughts? Flyguy649 talk contribs 15:31, 31 August 2007 (UTC)
Another one for the list
Came across this tonight. Spooge is a slang term for semen/ejaculate. See user:Tluvz2spooge. Flyguy649 talk contribs 04:15, 1 September 2007 (UTC)
- Good idea, added it . I cannot see any false positives coming up. ((1 == 2) ? (('Stop') : ('Go')) 04:50, 2 September 2007 (UTC)
Edited domain name rules
I've seen that some administrators block names that happen to coincidentally match parked domains, because the blocking administrator isn't familiar with what parked domains are for (they're not real websites, they're placeholders to catch accidental traffic, like what you might get when you type people's usernames into your URL bar). So I've reworded the messages to warn about parked domains. I've also put WAIT_TILL_EDIT on the rule about country-code URLs, because it's been creating so many false positives.
There is nothing wrong, for example, with someone indicating that they are from France with a ".fr" on the end of their username, even though it is likely to coincidentally be a parked domain (basically every name of the form [short name].[country code] is going to be parked by someone). rspeer / ɹəədsɹ 16:49, 7 September 2007 (UTC)
How about this one
Another one for the list? "isback"/"is back", e.g. user:Braceyisback. It is often used by vandals/sockpuppets. Probably reasonably low confidence. Flyguy649 talk contribs 17:32, 7 September 2007 (UTC)
- This is one I'd suggest be tested in a probation area, because of the low-confidence issue. Nihiltres(t.l) 19:56, 15 September 2007 (UTC)
How about blacklisting Facebook, since it is a social network, just like Myspace, which is already being blacklisted? Later, can the blacklist include names of other social networking services. Johnny Au 19:13, 15 September 2007 (UTC)
Show/hide
Why do we have the list embedded in this ridiculous show-and-hide box? For when "obscenities bother you"? I thought Wikipedia was not censored. This is not necessary at all. Melsaran (talk) 17:56, 3 October 2007 (UTC)
- It is not censorship, it is an editorial decision. It is not really harmful, and I see no reason to display the very things we are trying to keep off Wikipedia. Basically it is in the spirit of WP:DENY. WP:CENSOR says that we may contain offensive material, not that we must, it is also referring to NPOV in articles, not bot configuration pages. If we are going to show offensive content we need a good reason. ((1 == 2) ? (('Stop') : ('Go')) 18:05, 3 October 2007 (UTC)
- On a more technical note, the list is rather long, and wide, causing both scroll bars to appear on the browser. This is not needed unless someone decides to look. ((1 == 2) ? (('Stop') : ('Go')) 18:06, 3 October 2007 (UTC)
- That's true, but I still see no urgent need to have it behind some stupid box. It makes it look like we have something to hide. It's at the bottom of the page anyway, and someone reading a section titled "List" on the blacklist expects to see the obscenities anyway. Melsaran (talk) 18:24, 3 October 2007 (UTC)
- Something to hide? The text clearly tells you how to find it. I don't see any harm in it. Perhaps we should wait for others to give their opinions on this matter. I like it because I can go to this page in mixed company without fear of racial slurs and crude references to body parts popping up. This page is a technical one, not an exploration of the words contained within, the content is for a bot to read, not humans. ((1 == 2) ? (('Stop') : ('Go')) 18:40, 3 October 2007 (UTC)
WAIT_TILL_EDIT
twat, cunt, fuck, shit, whore, and all these[3]. I cannot imagine why we would wait for such names to edit before wanting to know about them. So I reverted this edit. ((1 == 2) ? (('Stop') : ('Go')) 02:14, 7 October 2007 (UTC)
- Join the discussion at WT:UAA, then, where I asked for comments before going ahead and doing this. The reason to wait until an edit is to stop rewarding username trolls. They don't care when they get blocked, and they're quite happy when they get everyone on WP:UAA to read their dumb name. rspeer / ɹəədsɹ 02:26, 7 October 2007 (UTC)
Poof
Poof is a British derogatory term for gay man, more or less a UK equivalent of the American "fag". Perhaps this, along with the derived "poofter", should go on the list? Lurker (said · done) 14:51, 12 October 2007 (UTC)
- I don't think most names with "poof" should be blocked on sight. "Poof" is also a common onomatopoeia for disappearance; in fact, I hadn't even heard of the slang term. I think such names would have to be discussed on WP:RFCN instead. Melsaran (talk) 14:56, 12 October 2007 (UTC)
- I can mean something appearing or disappearing suddenly, it it can describe a cloud of dust. It has too many innocent meanings, and is only offensive when used in very specific contexts. I think we can leave this type of names for humans to find. 1 != 2 15:52, 12 October 2007 (UTC)
Myspace
Why is this on the blacklist? Is it to catch names like "GorillaBoyzonMyspace" or something? Because I looked at Special:Listusers for names that start with "Myspace" and all of them were fine. Those editors may be trying to link to some myspace page or other but the username seems to be rarely a violation. Mangojuicetalk 12:53, 16 October 2007 (UTC)
- For reference, the names that matched this pattern: User:MyspaceBoyz, User:Tommyspace2099, User:Jehovah's Witness Cunt Myspace, User:Jehovah's Witness Myspace Cunt, User:Hookupmyspace, User:Whiteybulger&myspace, User:Myspace127,User:Chris.from.myspace, User:X myspacefreak X, User:Matthewjmyspace. Of those, only the two with "cunt" are clear violations, and Whiteybulger is pretty borderline. Mangojuicetalk 14:23, 16 October 2007 (UTC)
- I agree it should be removed, I really don't know why it was added. 1 != 2 00:05, 17 October 2007 (UTC)
- I added it. At the time there was some very bored person creating usernames like "Wikipédia is myspace for aspergers losers" and stuff like that. Let's also remove "Wikipedia" and variations. Grandmasterka 01:06, 17 October 2007 (UTC)
- No, not as long WP:U still states that "Usernames that confusingly refer to a Wikipedia process, namespace, or toolbar item" and "Usernames that imply the user is an administrator or official figure on Wikipedia" are disallowed. Melsaran (talk) 10:28, 17 October 2007 (UTC)
- Let's not worry about it, "Wikipedia" is on the username blacklist anyway. Mangojuicetalk 11:37, 17 October 2007 (UTC)
- No, not as long WP:U still states that "Usernames that confusingly refer to a Wikipedia process, namespace, or toolbar item" and "Usernames that imply the user is an administrator or official figure on Wikipedia" are disallowed. Melsaran (talk) 10:28, 17 October 2007 (UTC)
- I added it. At the time there was some very bored person creating usernames like "Wikipédia is myspace for aspergers losers" and stuff like that. Let's also remove "Wikipedia" and variations. Grandmasterka 01:06, 17 October 2007 (UTC)
- I agree it should be removed, I really don't know why it was added. 1 != 2 00:05, 17 October 2007 (UTC)
root
suggest we add root as an extension of admin, on the grounds that it is a name that wanabe skript kiddees will pick on. 21:11, 22 October 2007 (UTC) —Preceding unsigned comment added by Mayalld (talk • contribs) 21:11, October 22, 2007 (UTC)
- No, I don't think that would be a good addition, as root will produce far too many false positives. Besides, the only really good target for that, User:Root, is taken (check the link). ;) Nihiltres(t.l) 21:39, 22 October 2007 (UTC)
REGEX:?
The last entry, www(\.|dot)
, has REGEX:LABEL(WWW)
(with a colon instead of a comma). Is this correct? --cesarb 22:39, 24 October 2007 (UTC)
- Good catch, I have fixed that. 1 != 2 01:22, 8 December 2007 (UTC)
KKK
I've added "kkk" to the "violence" section of the list, for its association with lynchings in the U.S. -- I've manually blocked names like this a number of times. The Anome (talk) 11:11, 5 January 2008 (UTC)
Your mum...
I've also added a regexp to match the ever-popular "yo' momma..." variation on "your mom". -- The Anome (talk) 11:41, 5 January 2008 (UTC)
Yet more entries
I've added yet more entries, based on a review of my own block log. I've also added "lolol" and "hahaha": in my experience, simulated laughter in usernames is generally associated with vandalism-only accounts. -- The Anome (talk) 13:27, 10 January 2008 (UTC)
666
Maybe 666 should be added, as that is used often with bad faith? It is very unlikely that there will be good faith person with that name. Soxred93 | talk count bot 02:09, 22 January 2008 (UTC)
- There has been a long standing tendency at WP:RFCN to not block based on number combinations. Too many numbers are against some form of superstition. (1 == 2)Until 18:50, 21 February 2008 (UTC)
Bardeep
Maybe adding this to the name filter might help, as all of this vandal's sock puppets begin with this (see here). VivioFateFan (Talk, Sandbox) 15:17, 29 January 2008 (UTC)