From Wikipedia, the free encyclopedia

What should we do for cites from punesite dot com?

moved from User talk:GreenC

Hi. The domain has expired and is parked on GoDaddy. When I tried to access it, Chrome gave a "Deceptive site ahead" warning in full page red background. Either way, what should we do about the citations from these sites? Should we add usurped or remove the citations? I remember you told me something, but forgot already. Special:LinkSearch What about all domains which are expired and parked, just like this one? — DaxServer ( talk to me) 18:19, 26 August 2021 (UTC)

Normally add archives and mark usurped. This method has a problem if the link is bare/square and dead, or bare/square followed by {{ webarchive}}, but depending how many they might be resolved manually. -- Green C 19:06, 26 August 2021 (UTC)
@ DaxServer: ahh only 7 articles they should be done manually can you do it? I'll update IABot for other wikis. -- Green C 19:12, 26 August 2021 (UTC)
@ GreenC Yes, I can add them. Also engineeringwatch dot in — DaxServer ( talk to me) 19:36, 26 August 2021 (UTC)
4 articles. Will update IABot same. -- Green C 19:40, 26 August 2021 (UTC)
New websites:
— DaxServer ( talk to me) 13:44, 27 August 2021 (UTC)
@ DaxServer: - ran all 7 domains through the bot and it edited 34 pages, adding 31 archive URLs and toggling a number existing from |url-status=dead to |url-status=usurped. It also removed a number of {{ webarchive}} replacing with a square link archive URL. Set all to Blacklisted in the IABot database. Can I ask how did you find them? -- Green C 02:55, 29 August 2021 (UTC)
@ GreenC Please usurp indiaenews.com 70 articles. — DaxServer ( talk to me) 09:34, 5 September 2021 (UTC)
indiaenews.com is done. -- Green C 02:57, 7 September 2021 (UTC)
To your question (forgot to reply) on how I found them, I couldn't remember now :/ All I could recollect is that I found the punesite first, since I put it in the header, and started removing it in the articles from the Special:LinkSearch, and found the remaining sites in the articles that followed in the link search results. As to the indiaenews.com, I was updating the Prayagraj-->Allahabad and found it, perhaps because the citation is in the same line where I'm editing. — DaxServer ( talk to me) 09:41, 5 September 2021 (UTC)
Ok thanks, I wasn't sure if you had an automated method of discovery. -- Green C 03:18, 10 September 2021 (UTC)
Please usurp sulekha.com 309 results — DaxServer ( talk to me) 09:08, 6 September 2021 (UTC)
This domain appears to have a mix of live and dead, will need to run it through the soft-404 method to pick out which URLs are dead. -- Green C 03:18, 10 September 2021 (UTC)

Drowned in Sound

I noticed that Drowned in Sound Archived 2021-06-02 at the Wayback Machine has a notice that it's been archived/closed. While the links are still working, I was wondering if it was possible to archive all of the links that are currently used in Wikipedia from Drowned in Sound just in case this website fully closes. The page also links to Internet Archive as well. Thanks! -- MrLinkinPark333 ( talk) 18:10, 4 June 2021 (UTC)

Started an IABot job, I'll see how it goes. EpicPupper (he/him | talk, FAQ, contribs) 21:53, 9 June 2021 (UTC)
(note, since I replied to MrLinkinPark333 offwiki: I finished this task) 🐶 EpicPupper (he/him | talk) 18:05, 16 September 2021 (UTC)

theonlinecitizen.com

GreenC The Online Citizen has been taken offline by its editor due to 'disagreements' (my word) with local authorities, and should remain so unless the editor has miraculously exited Singapore and revive it in another form. Please set the domain https://www.theonlinecitizen.com to dead. – robertsky ( talk) 15:01, 16 September 2021 (UTC)

Now set dead in IABot and queued to process enwiki (11 articles). -- Green C 01:24, 18 September 2021 (UTC)

lviv-orthodox.net.ua

lviv-orthodox.net.ua has been usurped, and should be marked dead. There's only a few usages across enwiki and ukwiki, but if they could be auto-archived, that'd be fantastic. See also m:Talk:Spam_blacklist#lviv-orthodox.net.ua. Perryprog ( talk) 00:59, 18 September 2021 (UTC)

It's now set permadead in the IABot database, which contains 12 unique URLs for all sites. -- Green C 01:21, 18 September 2021 (UTC)

economicdevelopmentboardsa.com.au now used for SEO purposes

this website has been hijacked for SEO purposes. the links on en.wiki have been nullified. See: Special:Contributions/49.128.61.210. 17:36, 26 August 2021 (UTC)

Now permadead in IABot nothing to do on enwiki. -- Green C 02:29, 21 September 2021 (UTC)

Ainews.com no longer Adult Industry News

Referencing this discussion: Wikipedia:Help desk#AiNews.com - Wrongly Indexed

Ainews.com was formerly "Adult Industry News", a news site for the porn industry, which has a lot of citations.

The domain now belongs to "Artifical Intelligence News". Needless to say, the new owner doesn't want its domain linked in porn-related articles.

Adult Industry News is now ainews.xxx.

Experimenting with some of the ~130 links from https://en.wikipedia.org/?target=*.ainews.com&title=Special%3ALinkSearch it seems that one cannot simply substitute .com with .xxx. The pages must be found on archive.org. ~ Anachronist ( talk) 15:36, 27 August 2021 (UTC)

That's new. Usually a domain is usurped by porn not from porn (we even have a parameter |unfit= sort of for that reason). Looks like 60 articles on Enwiki. I guess a strategy would be to manually determine if an article topic is porn, and treat ainews.com on that page as a dead link, leaving non-porn pages alone. Luckily porn star names are easy to spot (Aurora Snow, India Summer, etc) vs. AI topics. In fact this would be a good application for AI. In terms of updating the IABot database, that's more difficult and will need to think about it. -- Green C 16:25, 27 August 2021 (UTC)
I've gone through a dozen or so pages to manually run the IABot. It's keeping the sources as ainews.com rather than ainews.xxx, is that alright? –– FormalDude talk 22:36, 27 August 2021 (UTC)
Please don't run IABot, thanks. -- Green C 23:31, 27 August 2021 (UTC)
GreenC, can I ask why not? –– FormalDude talk 02:54, 28 August 2021 (UTC)
FormalDude, it is a usurped domain, not merely dead. IABot does not support usurpation. The source URL should be hidden from the user via |url-status=usurped (or |url-status=unfit). Same with bare and square links, not use {{ webarchive}} but combine into a single URL. There are other tools for usurpation. -- Green C 03:27, 28 August 2021 (UTC)
@ FormalDude: As I said above, you can't just simply change ainews.com to ainews.xxx. The links aren't the same on the xxx domain. The ainews.com links must be converted to archive.org links. ~ Anachronist ( talk) 02:59, 28 August 2021 (UTC)
@ GreenC: the vast majority of the porn-related ainews.com links have the form ainews.com/Archives/Story#####.phtml where ##### are numeric digits. If those were fixed, then the handful of remaining ones could be fixed manually. ~ Anachronist ( talk) 03:15, 28 August 2021 (UTC)
Yeah that might be the way just match on common patterns. -- Green C 03:27, 28 August 2021 (UTC)

I believe this is done. Let me know if you see any problems. Diffs Special:Contributions/GreenC_bot from last hour. -- Green C 05:15, 28 August 2021 (UTC)

Results

  • Edit 57 pages
  • Add 36 archive URLs
  • Switch 51 |url-status=usurped
  • Convert 3 {{ webarchive}} to square link archive URL
  • Set permadead in IABot

asiarooms.com now a redirect site

The links are all now redirects to laterooms.com rootpage. Probably time that we cull those links, and look to stop new additions, which I have seen in the last few days. Uncertain yet whether new additions are spammy or simply copies from other WPs, still investigating. — billinghurst sDrewth 23:58, 9 September 2021 (UTC)

OK. Bot ran for usurpation, 90 pages edited. Example diffs: [1] [2] [3] [4]. Will also Blacklist in IABot db for 120+ other wikis, though it takes a while to propagate, and it will only set dead not usurp. -- Green C 03:14, 10 September 2021 (UTC)

Results

  • Edit 90 pages
  • Add 68 archive URLs
  • Switch 60 |url-status=usurped
  • Convert 2 {{ webarchive}} to square link archive URL
  • Set permadead in IABot

worldaffairsjournal.org usurped

Hi. The domain worldaffairsjournal.org is now a gambling site and has many many links that need cleansing [5]billinghurst sDrewth 08:05, 19 September 2021 (UTC)

Results

  • Edit 140 pages
  • Add 87 archive URLs
  • Switch 125 |url-status=usurped
  • Convert 13 {{ webarchive}} to square link archive URL
  • Set permadead in IABot

@ Billinghurst: thanks. -- Green C 02:17, 21 September 2021 (UTC)

adherents.com possibly usurped

adherents.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

It's a bit hard to research into it at the moment due to a power outage at the Internet Archive, but it unfortunately seems like a major site—I count just below 5,000 usages. I'm also not sure how long ago this usurpation occurred as the whois data gives and updated date from early 2020 and a creation date of 1998. I could just be misinterpreting what it's saying, though. Due to the sheer size of how much this domain was used it may be worth further action beyond an IABot usurpation run, but I'll leave that decision up to someone more experienced with these. Perryprog ( talk) 17:30, 19 September 2021 (UTC)

Now that archive.org is back up, it does very much look like this was a usurpation from early 2020. This is the last archived version before it got taken over, I believe. Perryprog ( talk) 23:41, 19 September 2021 (UTC)

Results

  • Edit 442 pages
  • Add 452 archive URLs
  • Switch 381 |url-status=usurped
  • Convert 24 {{ webarchive}} to square link archive URL
  • Set permadead in IABot

@ Perryprog: thanks. -- Green C 02:16, 21 September 2021 (UTC)

GreenC wow, thanks yet again! Is this including xwiki usages? Perryprog ( talk) 12:01, 21 September 2021 (UTC)
Perryprog, thanks for shining a light that can be the hardest part discovery. Right now WaybackMedic can only edit Enwiki. For xwiki there is IABot permadead status for the domain, which means when the bot sees a link, on 120+ wikis, it will act as if the link is dead without checking if live/dead. It only adds an archive URL or {{ dead link}} (if no archive) it can't do the usurped process. Better than nothing, but not ideal, wish it could do more. Should the day ever arrive we can do usurped xwiki this forum has a record of domains. Permadead is an IABot admin action which can be requested here (I am admin) or meta:User_talk:InternetArchiveBot .. it's included in the package when making requests here. -- Green C 18:11, 21 September 2021 (UTC)
Thank you for the background! That's all very helpful. I'll be rooting for you all to someday get XWiki usurpation, for sure—that could make a tremendous impact on the project in general, in my opinion. Perryprog ( talk) 00:39, 22 September 2021 (UTC)

2011 NAB Cup

/info/en/?search=2011_NAB_Cup

Inline match-report links are all dead. Rather than manually updating them, can I get the bot to update all the links to waybackmachine archives? Thanks! — Preceding unsigned comment added by Electricmaster ( talkcontribs)

Can do this. Takes time the entire domain needs to be processed over 4,000 pages have links. It checks each link to see if dead. -- Green C 04:32, 25 September 2021 (UTC)

Results

  • Edit 1,075 pages
  • Add 5,661 archive URLs
  • Add 235 {{ dead link}}
  • Update about 4,000 links in IABot for xWiki

The Week in Chess

The Week in Chess was hosted on chesscenter.com and chess.co.uk before it moved to theweekinchess.com. Hundreds of dead links of the form http://www.chesscenter.com/twic/twicXXX[.html] or http://www.chess.co.uk/twic/twicXXX.html can be fixed by substituting them with https://theweekinchess.com/html/twicXXX.html, e.g. old1new1 and old2new2. Other dead links to The Week in Chess cannot be replaced that easily. I suggest marking chesscenter.com and chess.co.uk as dead for IABot once the easy replacements are done.

See also

Thanks — Dexxor ( talk) 18:05, 10 September 2021 (UTC)

 Doing... can have a shot at this. I do not have access to the IABot admin interface, though, so someone else will have to do that. 🐶 EpicPupper (he/him | talk) 20:38, 15 September 2021 (UTC)
Also, I haven't edited anything with template parameters, so |status=usurped might have to be added to some of the links. 🐶 EpicPupper (he/him | talk) 21:36, 15 September 2021 (UTC)
Just a caution that URL work can be more difficult than it appears. For example in the above, do all of the pages at the new URLs work? During migrations admins will often forget or intentionally drop some pages. There are archive URLs, {{ webarchive}} templates, soft-404s, |url-status= changes, removal or addition of {{ dead-link}}. -- Green C 18:18, 16 September 2021 (UTC)
I've decided to modify the archive URLs as well as I think that makes more sense. As for url-status and {{ dead-link}}, I'm not sure how to proceed.
I've also noticed that some publication/title fields might need to be changed, but would like some feedback; the new URL is the same publication, but perhaps with a different title? Would the publication fields need to be changed if it is something like "chess.co.uk"? 🐶 EpicPupper (he/him | talk) 17:17, 28 September 2021 (UTC)

Small Arms Survey

I found many broken links to the Small Arms Survey website, but I can't fix all of them manually. Can they be repaired automatically? Jarble ( talk) 15:35, 28 September 2021 (UTC)

Results

  • Edit 247 pages
  • Add 279 archive URLs
  • Toggle 171 |url-status=dead for existing archives
  • Add 1 {{ dead link}}
  • Update 204 unique links in IABot for xWiki, remove global live state = "Subscription"

@ Jarble: -- Green C 17:55, 28 September 2021 (UTC)