Wikipedia:Link rot/URL change requests
This page is for requesting modifications to URLs, such as marking dead or changing to a new domain. Some bots are designed to fix link rot; they can be notified here. These bots include InternetArchiveBot and WaybackMedic. This page can be monitored by bot operators from other language wikis since URL changes are universally applicable.
US agencies
[edit]| This section is pinned and will not be automatically archived. |
90+ agencies identified as having web pages deleted during the Trump admin: https://asia.nikkei.com/static/vdata/infographics/deleted-website/
Ļ Awaiting further developments and time to go through them -- GreenC 16:46, 1 April 2025 (UTC)
White HouseDepartment of Health and Human ServicesDepartment of AgricultureUSAIDNational Park Serviceworker.govDepartment of LaborU.S. Agency for Global Media - usagm.govFederal Mediation and Conciliation Service (United States) - fmcs.govWoodrow Wilson International Center for Scholars - wilsoncenter.orgInstitute of Museum and Library Services - imls.govCommunity Development Financial Institutions Fund - cdfifund.govMinority Business Development Agency - mbda.govDepartment of Transportation - dot.gov- includes 11 agencies: FAA, FHWA, FMCSA, FRA, FTA, GLS, MARAD, NHTSA, OIG, OST, PHMSAEnvironmental Protection Agency - epa.govDepartment of Housing and Urban Development - hud.gov- Centers for Disease Control and Prevention
- Federal Emergency Management Agency
- National Institutes of Health
- General Services Administration
- Department of Homeland Security
- Department of Commerce
- employer.gov
- Office of the Assistant Secretary for Health
- sftool.gov
- Department of Energy
- Department of the Interior
- Department of Education
- NOAA
- Substance Abuse and Mental Health Services Administration
climate.gov- Department of Defense
- Health Resources & Services Administration
- AbilityOne Commission
- Department of State
- United States Patent and Trademark Office
- BOEM
- The Census Bureau
- CISA
- HUD User
- MILLENNIUM CHALLENGE CORPORATION
- performance.gov
- National Archives and Records Administration
- Bureau of Safety and Environmental Enforcement
- Federal Aviation Administration
Food and Drug Administration- House of Representatives
- Department of Justice
National Endowment for the Humanities- Department of the Treasury
- youth.gov
- American Climate Corps
- Federal Trade Commission
- Global Change Research Program
- NASA
- Administration for Community Living
- National Endowment for the Arts
- ATF
- Bureau of Indian Affairs
- Customs and Border Protection
- Consumer Financial Protection Bureau
- Consumer Product Safety Commission
- Office of the Director of National Intelligence
- Economic Development Administration
- Equal Employment Opportunity Commission
- Export-Import Bank of the United States
- FBI
- Federal Committee on Statistical Methodology
- Federal Housing Finance Agency
- geoplatform.gov
- Assistant Secretary for Technology Policy
- IRS
- National Labor Relations Board
- Office of Personnel Management
- Department of Veterans Affairs
- American Battle Monuments Commission
- Agency for Healthcare Research and Quality
americorps.gov- Advanced Research Projects Agency for Health
- Bonneville Power Administration
- cms.gov
- congress.gov
- digital.gov
- ENERGY STAR
- ej.gov
- farmers.gov
- medicalcountermeasures.gov
- peacecorps.gov
- Securities and Exchange Commission
- Social Security Administration
- stopbullying.gov
- Citizenship and Immigration Services
United States Interagency Council on Homelessness- workcenter.gov
Air ForceArmyNavy- Marine Corps
Australian Dictionary of Biography
[edit]The Australian Dictionary of Biography blocks many calls from "http://" so can any strings in articles like "http://adb.anu.edu.au/biography", "http://adbonline.anu.edu.au/biogs" or "http://www.adb.online.anu.edu.au/biogs" be altered to https:// please? (the template {{Cite Australian Dictionary of Biography}} was recently modified to add the "s" to https://) DivermanAU (talk) 02:23, 2 September 2025 (UTC)
- Anyone able to assist here? By not having "https" the links to the old site do not redirect to the new site, so user will see a "Page not found" message. Or, as an editor for over 10 years, can I make the changes myself? (I just need a few instructions). DivermanAU (talk) 19:16, 8 September 2025 (UTC)
- I do these requests chronologically and you are next in line (see the "Done" tag above this request). -- GreenC 19:54, 8 September 2025 (UTC)
- @GreenC ā Thanks so much for doing this! It really makes a difference to users reading these articles. DivermanAU (talk) 12:52, 10 September 2025 (UTC)
- You are welcome. -- GreenC 03:00, 12 September 2025 (UTC)
adb.anu.edu.au
[edit]- Enwiki
- Checked 7,561 pages and edited 4,600 pages. Moved 5,929 links to a new URL: 115 normal redirects, 5,806 ruled mapped redirects, 8 ghost mapped redirects, Removed 3
{{dead link}}. Added 26{{dead link}}. Switched 127|url-status=deadto live. Added 15 archive URLs (14 Wayback).
- Checked 7,561 pages and edited 4,600 pages. Moved 5,929 links to a new URL: 115 normal redirects, 5,806 ruled mapped redirects, 8 ghost mapped redirects, Removed 3
adb.online.anu.edu.au
[edit]- Enwiki
- Checked 1,533 pages and edited 1,502 pages. Moved 2,038 links to a new URL: 32 normal redirects, 2,006 ruled mapped redirects, Removed 1
{{dead link}}. Added 1{{dead link}}. Switched 16|url-status=deadto live. Added 1 archive URLs (1 Wayback).
- Checked 1,533 pages and edited 1,502 pages. Moved 2,038 links to a new URL: 32 normal redirects, 2,006 ruled mapped redirects, Removed 1
- Thanks again for these fixes, I can see the entire old url has been changed to the new one, which is great. But I can still find 1,519 articles if I search
insource:"http://www.adb.online.anu.edu.au/biogs"including January 4, Type 26 frigate and Protectionist Party (the two External links) which result in "page not found" when the ADB link is clicked. ā DivermanAU (talk) 03:23, 12 September 2025 (UTC)
- Thanks again for these fixes, I can see the entire old url has been changed to the new one, which is great. But I can still find 1,519 articles if I search
- I think you are seeing a backend delay. Looking at Type 26 frigate, the URL in the wikitext is correct. -- GreenC 03:55, 12 September 2025 (UTC)
- Yes, looks like you were right - it's just a delay. I can see those articles are fixed now! Thanks again! ā DivermanAU (talk) 05:57, 12 September 2025 (UTC)
- Great. I'll make it done again but if you see anything else, let me know. -- GreenC 14:45, 12 September 2025 (UTC)
- Yes, looks like you were right - it's just a delay. I can see those articles are fixed now! Thanks again! ā DivermanAU (talk) 05:57, 12 September 2025 (UTC)
Done -- GreenC 14:45, 12 September 2025 (UTC)
billboard.com/articles/columns
[edit]These links are redirecting to new URLs with various subdomains. Please note the following:
- This need the apostrophe removed to make a redirect to the new URL for Justin Timberlake discography.
- Sometimes the number ID will stay. For example, this goes here for United Kingdom.
If any of these don't redirect, please let me know. I can see if the URL needs adjusting in order to make redirects. Thanks! MrLinkinPark333 (talk) 18:59, 2 September 2025 (UTC)
@MrLinkinPark333: I processed 5,000 pages (not uploaded) and the stats are:
- Checked 5,000 pages and edited 4,983 pages. Moved 10,007 links to a new URL: 6,381 normal redirects, 3,623 ruled mapped redirects, 3 ghost mapped redirects, Resolved 29 soft-404s. Removed 1
{{dead link}}. Added 16{{dead link}}. Switched 291|url-status=deadto live. Switched 54|url-status=liveto dead. Added 108 archive URLs (108 Wayback).
You asked "If any of these don't redirect, please let me know": 108 archives, 54 live to dead, and 16 dead link templates. -- GreenC 04:32, 10 September 2025 (UTC)
- Do you see a pattern in the ones that didn't redirect? If you could post a few examples, I'll see if they got moved to new URLS. MrLinkinPark333 (talk) 17:16, 10 September 2025 (UTC)
- Wikipedia:Link rot/Cases/Billboard 50 sample wayback links. - GreenC 19:40, 10 September 2025 (UTC)
- I found that adding a ending slash to make this redirects to that for Falling Down (Selena Gomez & the Scene song) and Hit the Lights (Selena Gomez & the Scene song). This doesn't work for others. MrLinkinPark333 (talk) 20:05, 10 September 2025 (UTC)
- OK great, I'll add that rule and retry these that it missed. -- GreenC 20:52, 10 September 2025 (UTC)
- To be fair, the URL didn't have a slash at the time. In any case, I didn't find a lot of the other sample links. MrLinkinPark333 (talk) 21:49, 10 September 2025 (UTC)
- I see a problem. In Falling Down (Selena Gomez & the Scene song) [1]. Note the trailing query string "?page=0%2C1" which apparently is causing the redirect to fail. Removed and it works. I've never seen the query cause a redirect fail. Some variation of this query is in 66 pages. I just added a rule to remove it, reprocessed those 66, and only 2 URLs were fixed: the same two. Strange. -- GreenC 02:49, 11 September 2025 (UTC)
- Found that removing that string at Gabe McDonough makes a working redirect. Since it's a bare external link, I'm guessing the bot doesn't pick it up? MrLinkinPark333 (talk) 18:45, 11 September 2025 (UTC)
- I see a problem. In Falling Down (Selena Gomez & the Scene song) [1]. Note the trailing query string "?page=0%2C1" which apparently is causing the redirect to fail. Removed and it works. I've never seen the query cause a redirect fail. Some variation of this query is in 66 pages. I just added a rule to remove it, reprocessed those 66, and only 2 URLs were fixed: the same two. Strange. -- GreenC 02:49, 11 September 2025 (UTC)
- To be fair, the URL didn't have a slash at the time. In any case, I didn't find a lot of the other sample links. MrLinkinPark333 (talk) 21:49, 10 September 2025 (UTC)
- OK great, I'll add that rule and retry these that it missed. -- GreenC 20:52, 10 September 2025 (UTC)
- I found that adding a ending slash to make this redirects to that for Falling Down (Selena Gomez & the Scene song) and Hit the Lights (Selena Gomez & the Scene song). This doesn't work for others. MrLinkinPark333 (talk) 20:05, 10 September 2025 (UTC)
- Wikipedia:Link rot/Cases/Billboard 50 sample wayback links. - GreenC 19:40, 10 September 2025 (UTC)
- That page was not in the list because the URL starts with billboard.com/biz/articles of which there are 1,800 -- GreenC 00:59, 12 September 2025 (UTC)
- Ah. I misread the url. I was going to request billboard.biz in the future. I'll make a request soon as billboard ones have been fixed. MrLinkinPark333 (talk) 05:41, 12 September 2025 (UTC)
- Ok. Billboard is like the BBC a monster domain. -- GreenC 14:44, 12 September 2025 (UTC)
- That's why I'm only focusing on only parts of them :) MrLinkinPark333 (talk) 15:03, 12 September 2025 (UTC)
- Ok. Billboard is like the BBC a monster domain. -- GreenC 14:44, 12 September 2025 (UTC)
- Ah. I misread the url. I was going to request billboard.biz in the future. I'll make a request soon as billboard ones have been fixed. MrLinkinPark333 (talk) 05:41, 12 September 2025 (UTC)
- That page was not in the list because the URL starts with billboard.com/biz/articles of which there are 1,800 -- GreenC 00:59, 12 September 2025 (UTC)
Enwiki
- Batch 1 (00001-05000): Checked 5,000 pages and edited 4,983 pages. Moved 10,009 links to a new URL: 6,381 normal redirects, 3,625 ruled mapped redirects, 3 ghost mapped redirects, Resolved 30 soft-404s. Removed 1
{{dead link}}. Added 16{{dead link}}. Switched 291|url-status=deadto live. Switched 54|url-status=liveto dead. Added 106 archive URLs (104 Wayback).
- Batch 2 (05001-15099): Checked 10,100 pages and edited 10,089 pages. Moved 21,436 links to a new URL: 14,386 normal redirects, 7,034 ruled mapped redirects, 16 ghost mapped redirects, Resolved 52 soft-404s. Removed 1
{{dead link}}. Added 26{{dead link}}. Switched 502|url-status=deadto live. Switched 111|url-status=liveto dead. Added 229 archive URLs (224 Wayback).
Done -- GreenC 14:44, 12 September 2025 (UTC)
independent.ie
[edit]The following domains redirect to various regional pages at independent.ie:
- argus.ie
- corkman.ie
- drogheda-independent.ie
- fingal-independent.ie
- herald.ie
- kerryman.ie
- newrossstandard.ie
- sligochampion.ie
- wexfordpeople.ie
- wicklowpeople.ie
Some examples:
- On Probation of Offenders Act 1907, [2] redirects here, the original content is at [3].
- Sometimes the numbers at the end change. On Great Island Power Station, [4] redirects here, the original content is at [5].
- Sometimes the path segments are kept. On Kevin O'Connor (footballer, born 1995), [6] redirects here, the original content is at [7].
- On Rockchapel, [8] redirects here, the original content is at [9].
There's probably more patterns, I only checked like 20 links. This is my first time here, sorry for any formatting issues. ClumsyOwlet (talk) 18:51, 5 September 2025 (UTC)
- ClumsyOwlet: I would normally say this is impossible. There is no way to map this to that. However there is logic: given the last field "probation-act-for-ducie-after-donation-made" is common to both URLs, make a Google search of the site independent.ie for this common string. It correctly returns a match for the new URL. However, I can't automate Google searches without being blocked. But I can run Google Gemini (AI) queries, and ask it to run Google searches. This loophole works. It's not free, but Google seems OK with it so long a there is payment involved. Whose paying? My boss, The Internet Archive. I did some cost analysis, if I run the query 2,000 times it will cost $3.67 US total. I think we can afford it to repair all these URLs is cheap. This would be a new AI approach never done before. -- GreenC 05:08, 10 September 2025 (UTC)
- AI is not working it is hallucinating too much. I came up with a different solution using "Ruled mapped inferred redirects" (last section) - basically it searches the WaybackMachine index for the common string. It misses some because the URLs are not in the WaybackMachine. I am out of tricks to find those, they will be converted to archive URLs. -- GreenC 20:50, 10 September 2025 (UTC)
- ClumsyOwlet: I would normally say this is impossible. There is no way to map this to that. However there is logic: given the last field "probation-act-for-ducie-after-donation-made" is common to both URLs, make a Google search of the site independent.ie for this common string. It correctly returns a match for the new URL. However, I can't automate Google searches without being blocked. But I can run Google Gemini (AI) queries, and ask it to run Google searches. This loophole works. It's not free, but Google seems OK with it so long a there is payment involved. Whose paying? My boss, The Internet Archive. I did some cost analysis, if I run the query 2,000 times it will cost $3.67 US total. I think we can afford it to repair all these URLs is cheap. This would be a new AI approach never done before. -- GreenC 05:08, 10 September 2025 (UTC)
Enwiki
- Batch 1 (0001-0200): Checked 200 pages and edited 188 pages. Moved 161 links to a new URL: 161 ruled mapped inferred redirects, Switched 31
|url-status=deadto live. Switched 7|url-status=liveto dead. Added 104 archive URLs (73 Wayback).
- Batch 2 (0201-1508): Checked 1,308 pages and edited 1,227 pages. Moved 988 links to a new URL: 988 ruled mapped inferred redirects, Resolved 1 soft-404s. Added 2
{{dead link}}. Switched 192|url-status=deadto live. Switched 75|url-status=liveto dead. Added 701 archive URLs (518 Wayback).
Done -- GreenC 01:51, 11 September 2025 (UTC)
bbc.co.uk misc
[edit]Thank you for finding so many URL replacements for bbc.co.uk. There are 11k left, but not all of them will need fixing:
- URLs that end in .shtml tend to be working, with no changes needed. These pages will primary say that BBC archived the page. However, I found a broken link at Chordate. ~5k
- URLs that are not sport, news, or shtml tend to be working or redirect like this one. ~6k
The main things I see are either changing HTTP to HTTPS or archive fixes. As some of these links already have archived links in the article, this should hopefully be resolved quickly. Thank you again! MrLinkinPark333 (talk) 22:22, 5 September 2025 (UTC)
- Using a different method of searching (SQL query), for #1 it returns over 29,000 pages, and for #2 is 134,000 pages. I think CirrusSearch can't accurately search in this case because if there is /news anywhere in the page it will not be reported. For example a page has two URLs - one with /news and the other not - it will skip the entire page since it contains /news. SQL shows every URL, you can filter and see which pages contain a URL pattern. -- GreenC 15:26, 12 September 2025 (UTC)
- Does it work better if you search with the website name, like this? It's giving me a lot more than I thought. MrLinkinPark333 (talk) 16:09, 12 September 2025 (UTC)
- Better, but same issue:
-insource:"bbc.co.uk/news/"means if this string appears anywhere on the page, don't list the page, even though there might be other URLs on the page that should be included. According to SQL, the number of pages containing a BBC url is about 150,000. There might be some /news or /sport in that 150,000 but those pages also contain other BBC links. It excludes pages that only contain /news or /sport. Since there is no real difference in how the URLs are processed, I suggest we consider the 150k as the primary set, then break down into smaller batches. It could be a lot of batches. If it runs as well as last time, very large batches are possible then it won't be many. I can start slow with small batches to see what problems come up. -- GreenC 16:41, 12 September 2025 (UTC)- Hmm. I'm not sure which batch to focus on. A lot of them look to be working with no issues. I've found some with various issues:
- Maybe you misunderstood what I wrote. There is no sense separating based on URL path, because every BBC urls needs to be processed. /teach, /sound/ etc.. all of them need to be checked: *.bbc.co.uk/* -- GreenC 20:11, 12 September 2025 (UTC)
- Ah okay. I didn't want you to waste your time. MrLinkinPark333 (talk) 21:05, 12 September 2025 (UTC)
- It's alright so far over 80% of the pages have a change. The results are similar in quality but more in quantity than /news and /sport .. The work is on the computer. It's actually more work to do separate projects because it requires creating a new project, updating configurations, downloading a list of target articles. By keeping it under the same project I only need to start a new batch ("Batch 1", "Batch 2" etc) which is fairly easy. If the projects require different configurations they need separate, but this project it's looking all the same. -- GreenC 00:02, 13 September 2025 (UTC)
- Hmm. I'm not sure which batch to focus on. A lot of them look to be working with no issues. I've found some with various issues:
- Better, but same issue:
- Does it work better if you search with the website name, like this? It's giving me a lot more than I thought. MrLinkinPark333 (talk) 16:09, 12 September 2025 (UTC)
Rolled into whole set
|
|---|
=== /teach/ === ~60 redirects === /sounds/ === For some reason, the links works then redirects to a broken page. ~1000. === /dna/ === === /cult/ === Mixture of working and broken. ~700. If you could extract a list of sections to go through, that'd be great. I'd only need the section names, not the URLs. I don't think all of them will need checking. I can then check, and post batches in later requests. I'll just leave the 4 above here, so you can work on other requests. --MrLinkinPark333 (talk) 18:32, 12 September 2025 (UTC) |
- @GreenC: This seems to be changing lots of news.bbc.co.uk links to https when the https actually redirects back to http. Examples: [10] [11] [12] [13] In addition it's changing between news.bbc.co.uk/1/* and news.bbc.co.uk/2/* which seem to randomly redirect to each other. According to
metatags the former is theUKFS_URLand the latter is theIFS_URLwhich apparently stand for "UK facing site" and "international facing site", but something is seemingly misconfigured as they now redirect randomly from the same IP. EvenTwist41 (talk) 02:17, 17 September 2025 (UTC)- Thanks. It's on hold. I need to think about how to proceed. -- GreenC 05:11, 17 September 2025 (UTC)
- There are two issues isolated to news.bbc.co.uk : A) https redirects to http B) /1/ redirects to /2/ and other way randomly .. there are also two piles of links: X) URLs already modified listed below. Y) URLs yet to be modified.
- I think for A+X, it is best to leave them alone, it causes no harm, and maybe one day they will properly support https anyway. For A-Y, there is no compelling reason to switch to https. For B-X, this is harmless best left alone. For B-Y, same, best left alone not make any more changes.
- End result: do nothing, except add code to skip processing news.bbc.co.uk going forward, at least when they only change is of type A or B. -- GreenC 20:19, 19 September 2025 (UTC)
- Thanks. It's on hold. I need to think about how to proceed. -- GreenC 05:11, 17 September 2025 (UTC)
- Just saw this on a page I frequently edit. What did your bot do? It changed it from http to https. What was that for? Are you saying that whoever copied the url is wrong? There was nothing wrong with it in the first place! RandomEditorofWiki (talk) 14:01, 27 September 2025 (UTC)
- Right, discussed immediately above.. -- GreenC 17:49, 27 September 2025 (UTC)
- Yes, so why did it matter? Whatās the difference between it being http and https? RandomEditorofWiki (talk) 20:24, 27 September 2025 (UTC)
- Right, discussed immediately above.. -- GreenC 17:49, 27 September 2025 (UTC)
Enwiki
- Batch 1 (000001-005000): Checked 5,001 pages and edited 4,130 pages. Moved 9,039 links to a new URL: 673 normal redirects, 8,366 ruled mapped redirects, Resolved 8 soft-404s. Removed 2
{{dead link}}. Added 72{{dead link}}. Switched 134|url-status=deadto live. Switched 95|url-status=liveto dead. Added 660 archive URLs (592 Wayback).
- Batch 2 (005001-041000): Checked 36,002 pages and edited 29,514 pages. Moved 64,641 links to a new URL: 5,324 normal redirects, 59,317 ruled mapped redirects, Resolved 17 soft-404s. Removed 27
{{dead link}}. Added 637{{dead link}}. Switched 825|url-status=deadto live. Switched 440|url-status=liveto dead. Added 5,347 archive URLs (4,619 Wayback).
- Batch 3 (041001-071000): Checked 30,000 pages and edited 24,711 pages. Moved 52,790 links to a new URL: 4,473 normal redirects, 48,317 ruled mapped redirects, Resolved 274 soft-404s. Removed 17
{{dead link}}. Added 476{{dead link}}. Switched 784|url-status=deadto live. Switched 364|url-status=liveto dead. Added 4,792 archive URLs (4,148 Wayback).
- Batch 4 (071001-104000): Checked 33,006 pages and edited 27,145 pages. Moved 57,766 links to a new URL: 5,299 normal redirects, 52,467 ruled mapped redirects, Resolved 462 soft-404s. Removed 8
{{dead link}}. Added 520{{dead link}}. Switched 747|url-status=deadto live. Switched 513|url-status=liveto dead. Added 6,152 archive URLs (5,731 Wayback).
- Batch 5 (104001-134000):
On hold per above -- GreenC 05:11, 17 September 2025 (UTC)
granitehighworld.com
[edit]An HTTP era domain found in Rudolph G. Wilson that is now usurped; I assume it most likely was a high-school newspaper for Granite City, Illinois, but now it seems the type of site that'd go on the spam blacklist, with the Chinese text and the markedly not-high-school-friendly content of the site. Departureā (talk) 20:28, 6 September 2025 (UTC)
- For what it's worth, I don't know if it's cited in any other articles and I'm going to bring the article I found it on to AFD momentarily, but it doesn't hurt to check. Departureā (talk) 20:34, 6 September 2025 (UTC)
- It's only the one article. I did this Special:Diff/1309543267/1309946672 for the record, and this Special:Diff/1309940301/1309946556, that should take care of it. -- GreenC 21:20, 6 September 2025 (UTC)
- p.s. I was able to confirm that Granite High World is on the Granite City High School page as the listed newspaper, so my initial suspicions as to the original source were correct (uncited, but right enough in my book). Departureā (talk) 21:48, 6 September 2025 (UTC)
- It's only the one article. I did this Special:Diff/1309543267/1309946672 for the record, and this Special:Diff/1309940301/1309946556, that should take care of it. -- GreenC 21:20, 6 September 2025 (UTC)
Done seems like -- GreenC 01:06, 19 September 2025 (UTC)
Billboard biz
[edit]These ones are mainly for billboard.biz. I added a related one as well:
- billboard.biz tend to soft 404 redirect to the main page of https://www.billboard.com/pro/ ~3600
- billboard.com/bbbiz/ 7
I found a different domain with bbbiz in the URL, but I'll make it a separate request. Thanks again :) MrLinkinPark333 (talk) 18:46, 12 September 2025 (UTC)
billboard.biz
[edit]Enwiki
- Checked 3,202 pages and edited 1,421 pages. Resolved 6,293 soft-404s. Added 1,142
{{dead link}}. Switched 1,031|url-status=liveto dead. Added 1,056 archive URLs (854 Wayback).
IABot DB
- MrLinkinPark333: Apparently I already permadead'd many of the .biz links in May 2021 (example). This time through I found another ~200 archive.today links missed. Possibly I wasn't checking for archive.today in 2021. It's interesting that so many more links in 2025 needed updating: 1,142
{{dead link}}. Switched 1,031|url-status=liveto dead. Added 1,056 archive URLs (854 Wayback). Maybe these links have since died but were active in 2021, maybe my methods in 2021 were inaccurate, or maybe IABot was unable to parse/fix them on-wiki. Anyway, things continue to move in the right direction. -- GreenC 02:19, 21 September 2025 (UTC)
billboard.com/bbbiz/
[edit]I'm going to skip these 7 because it's only 7 easily fixed manually I am falling behind on requests thanks. -- GreenC 02:21, 21 September 2025 (UTC)
Done -- GreenC 03:38, 22 September 2025 (UTC)
yjc.news
[edit]Usurped. Old site of the Young Journalists Club, the new one is yjc.ir.
Examples:
- On Iranian handicrafts, http://www.yjc.news/fa/news/6228037 is now at https://www.yjc.ir/fa/news/6228037/ŲŖŁŲŖŁ-ŁŲ§ŪŁ-ŪŚ©Ū-Ų§Ų²-ŲµŁŲ§ŪŲ¹-ŲÆŲ³ŲŖŪ-Ų³ŪŲ³ŲŖŲ§Ł-ŲÆŲ±ŪŲ§ŚŁ-ŁŲ§Ł ŁŁ-ŚŲ“Ł -Ų§ŁŲŖŲøŲ§Ų±-ŲŪŲ§ŲŖ-ŲÆŁŲØŲ§Ų±Ł-Ų¢Ł-Ų§Ų³ŲŖ (I just changed .news to .ir in the original link and it turned into the correct link).
- On The Accused Escaped, https://www.yjc.news/fa/news/6397790/%D8%B1%D8%A7%D8%B2-%D8%A7%D8%B5%D8%BA%D8%B1-%D9%81%D8%B1%D9%87%D8%A7%D8%AF%DB%8C-%D9%BE%D8%B3-%D8%A7%D8%B2-%D8%B3%D8%A7%D9%84-%D9%87%D8%A7-%D9%81%D8%A7%D8%B4-%D8%B4%D8%AF-%D9%81%DB%8C%D9%84%D9%85 is now at https://www.yjc.ir/fa/news/6397790/Ų±Ų§Ų²-Ų§ŲµŲŗŲ±-ŁŲ±ŁŲ§ŲÆŪ-پس-Ų§Ų²-Ų³Ų§Ł%E2%80%8CŁŲ§-ŁŲ§Ų“-Ų“ŲÆ-ŁŪŁŁ (Just changed .news to .ir. Also works if you take the "https://www.yjc.news/fa/news/6397790" part of the original link and change .news to .ir.).
- On Heshmatollah Falahatpishe, https://www.yjc.news/en/news/38193 is now at https://www.yjc.ir/en/news/38193/iran-to-claim-compensation-from-us-for-chemical-weapons-victims-mp (Same thing for English).
- On 2022 Hormozgan earthquakes, https://www.yjc.news/fa/amp/news/8162021 is now at https://www.yjc.ir/fa/news/8162021/Ų²ŁŲ²ŁŁ-ŁŲ§Ū-پŪ-ŲÆŲ±-پŪ-ŲÆŲ±-ŲŗŲ±ŲØ-ŁŲ±Ł زگاŁ-Ų²ŁŲ²ŁŁ-ŪµŪ²-Ų±ŪŲ“ŲŖŲ±Ū-ŚŲ§Ų±Ś©-Ų±Ų§-ŁŲ±Ų²Ų§ŁŲÆ-ŁŪŁŁ -Ł-ŲŖŲµŲ§ŁŪŲ± (Removed /amp and changed .news to .ir).
- On List of Esteghlal F.C. managers, https://www.yjc.news/00U4Iq is now at https://www.yjc.ir/fa/news/7166384/ŁŲ±ŪŲ§-ŲŗŁŁŲ±Ū-سر٠ربŪ-Ł ŁŁŲŖ-Ų§Ų³ŲŖŁŁŲ§Ł (URL Shortening. .news to .ir works.)
ClumsyOwlet (talk) 02:53, 13 September 2025 (UTC)
- This will be multi-step. Because the domain name has changed from http://www.yjc.news/fa/news/6228037 --> https://www.yjc.ir/fa/news/6228037 I can do a domain move on existing URLs. It will be configured so any that can't be moved will be set as
|url-status=deadand archive URL added (or dead link tag). After that is complete, I will add the old domain to WP:JUDI, so those remaining links in the old domain get the usurpation treatment, as part of a future JUDI batch run. That should cover both moving the domain where possible, and the usurpation where a move was not possible. Unfortunately I can't easily do both at the same time as moving and usurpation are different types of processes. -- GreenC 02:31, 21 September 2025 (UTC)
Enwiki
- Checked 136 pages and edited 135 pages. Moved 153 links to a new URL: 153 ruled mapped redirects, Resolved 71 soft-404s. Removed 1
{{dead link}}. Switched 8|url-status=deadto live. Added 4 archive URLs (4 Wayback).
IABot DB
- Set domain permadead (IABot does not support URL moves)
Done and updated WP:JUDI -- GreenC 04:14, 21 September 2025 (UTC)
coa.inducks.org
[edit]Hi Can you please change all web links with the domain name "coa.inducks.org" into the domain "inducks.org". There are hundreds if not thousands of them in Wikipedia. Here is an example of what should be done: https://en.wikipedia.org/w/index.php?title=Junior_Woodchucks&diff=1311246207&oldid=1305819590 You can safely change any https URL with domain coa.inducks.org into inducks.org, except in archive.org URLs of course. Lerichard (talk) 08:20, 14 September 2025 (UTC)
- Lerichard: Website reports:
- "Due to a high number of AI bots scrawling our website we've had to take the decision to ask visitors to please log-in or register before browsing this website. We apologize for the inconvenience and hope we will find a better solution in the future."
- Since I don't have a login, it requires a "blind move" ie. switch the URL without verifying. Blind moves are risky, there are usually some links that don't work, but since there are only about 120 pages containing coa links, it is a better option than nothing. If it breaks things let me know I can try to repair. -- GreenC 00:48, 22 September 2025 (UTC)
Done -- GreenC 03:38, 22 September 2025 (UTC)
- Great, thanks! Lerichard (talk) 20:19, 22 September 2025 (UTC)
ted.com
[edit]The old TED video URL format was "http://www.ted.com/talks/talk_name_here.html", which now return 404. The current TED video URL format is: "https://www.ted.com/talks/talk_name_here" with the trailing ".html" removed (and HTTPS). A quick search suggests there could be about 1,000 affected links. UnlikelyEvent (talk) 07:02, 15 September 2025 (UTC)
Enwiki
- Checked 3,195 pages and edited 1,449 pages. Moved 1,786 links to a new URL: 270 normal redirects, 1,482 ruled mapped redirects, 34 ghost mapped redirects, Resolved 43 soft-404s. Added 15
{{dead link}}. Switched 75|url-status=deadto live. Switched 3|url-status=liveto dead. Added 72 archive URLs (70 Wayback).
Done -- GreenC 03:37, 22 September 2025 (UTC)
whitehousemuseum.org
[edit]Looks like the domain expired and the site moved to tysto.com according to this blog post.
http://www.whitehousemuseum.org/Something ā http://www.tysto.com/Something -- Nintendofan885T&Cs apply 20:52, 15 September 2025 (UTC)
Enwiki
- Checked 69 pages and edited 67 pages. Moved 101 links to a new URL: 101 ruled mapped redirects. Switched 6
|url-status=deadto live.
IABot DB
- Set permadead (IABot does not support URL moves)
Done -- GreenC 01:32, 23 September 2025 (UTC)
artinfo.com
[edit]572 pages. This domain was usurped. Cherry Cotton Candy 12:14, 16 September 2025 (UTC)
Done via WP:JUDI batch #29 Special:Diff/1324524612/1324542613 -- GreenC 05:05, 28 November 2025 (UTC)
bostonmetroopera.com
[edit]A couple of pages. This domain was usurped, and at least 2 pages have links to the currently active malicious site. 2601:19E:8000:A4F0:E9F4:6B28:A0A1:A249 (talk) 17:48, 18 September 2025 (UTC)
Done via WP:JUDI batch #29 Special:Diff/1324524612/1324542613 -- GreenC 05:05, 28 November 2025 (UTC)
consequenceofsound.net
[edit]Website moved to https://consequence.net/ with their old links redirecting. Came across this one with ?new=true that still redirects. I think ?new=true should be removed as it still works without it. ~4800 articles. Thanks! MrLinkinPark333 (talk) 01:10, 21 September 2025 (UTC)
Enwiki
- Checked 4,789 pages and edited 4,560 pages. Moved 5,250 links to a new URL: 2,769 normal redirects, 2,362 ruled mapped redirects, 119 ghost mapped redirects, Resolved 5 soft-404s. Added 1
{{dead link}}. Switched 140|url-status=deadto live. Switched 34|url-status=liveto dead. Added 45 archive URLs (43 Wayback).
IABot
- IABot does not have support for URL moves
Done -- GreenC 01:06, 24 September 2025 (UTC)
vnuemedia.com
[edit]Some of these are Billboard biz links. Unfortunately, they can't be converted like this to that because of the different number ID. ~290. Thank you! MrLinkinPark333 (talk) 01:22, 21 September 2025 (UTC)
Enwiki
- Checked 293 pages and edited 129 pages. Added 106
{{dead link}}. Switched 45|url-status=liveto dead. Added 117 archive URLs (15 Wayback).
IABot DB
- Updated 385 links and set domain permadead
Done -- GreenC 16:38, 7 October 2025 (UTC)
stannenj.com
[edit]This used to be a Catholic school, now it's an online gambling blog. Crywalt (talk) 14:02, 25 September 2025 (UTC)
- It's at St._Anne_School_(Fair_Lawn,_New_Jersey) Crywalt (talk) 14:05, 25 September 2025 (UTC)
- Added to WP:JUDI, which is the correct place for this Big Blue Cray(fish) Twins (talk) 07:44, 4 October 2025 (UTC)
- Thanks! Crywalt (talk) 23:42, 4 October 2025 (UTC)
- Added to WP:JUDI, which is the correct place for this Big Blue Cray(fish) Twins (talk) 07:44, 4 October 2025 (UTC)
Done via WP:JUDI batch #29 Special:Diff/1324524612/1324542613 -- GreenC 05:05, 28 November 2025 (UTC)
fimi.it
[edit]Fimi.com, the provider of the Italian official albums and singles charts, recently renewed the website.
Links to the albums charts archives changed from https://www.fimi.it/top-of-the-music/classifiche.kl#/charts/1/2023/8 to https://www.fimi.it/top-of-the-music/archivio-classifiche-settimanali/archivio-classifiche-per-settimana/?tipo=2&anno=2023&settimana=8.
Links to the singles charts archives changed from https://www.fimi.it/top-of-the-music/classifiche.kl#/charts/3/2023/8 to https://www.fimi.it/top-of-the-music/archivio-classifiche-settimanali/archivio-classifiche-per-settimana/?tipo=2&anno=2023&settimana=8#tabs-1b (the difference with album chart being the suffix #tabs-1b).
~ 1,900 pages. --Cavarrone 07:29, 28 September 2025 (UTC)
Enwiki
- Checked 1,925 pages and edited 1,327 pages. Moved 2,311 links to a new URL: 2,311 ruled mapped redirects, Switched 10
|url-status=deadto live.
Done -- GreenC 21:21, 7 October 2025 (UTC)
fishesofaustralia.net.au
[edit]Fishes of Australia
- museumsvictoria.com.au
Bad: (Hard 404) : https://museumsvictoria.com.au/home/species/3305
Works: https://fishesofaustralia.net.au/home/species/3305
NB: https://collections.museumsvictoria.com.au/ seems to be unaffected, so it'll just be:
https://museumsvictoria.com.au/home/species/# -> https://fishesofaustralia.net.au/home/species/#
I can't seem to find a search to give you a rough idea of how many there are, including various fixes such as archive refs, but I think there'll be a few hundreds.
Please could you get your bot to replace these? Thanks Big Blue Cray(fish) Twins (talk) 19:08, 2 October 2025 (UTC)
- Big Blue Cray(fish) Twins: I can't find any: [14] -- GreenC 18:41, 7 October 2025 (UTC)
- fishesofaustralia.net.au
- Thanks for looking, guess that's why my search found no results. Thought it might be my search syntax.
- If I may beg your indulgence, could you look at
- http://www.fishesofaustralia.net.au/home/species/# -> https://fishesofaustralia.net.au/home/species/# instead ??
- As per my recent change to Neosilurus brevidorsalis (web id 2763)
- Many Thanks,
- Big Blue Cray(fish) Twins (talk) 18:57, 7 October 2025 (UTC)
- OK. 103 pages. -- GreenC 21:25, 7 October 2025 (UTC)
- Thank you very much Big Blue Cray(fish) Twins (talk) 06:17, 8 October 2025 (UTC)
- OK. 103 pages. -- GreenC 21:25, 7 October 2025 (UTC)
Enwiki
- Checked 104 pages and edited 100 pages. Moved 100 links to a new URL: 100 ruled mapped redirects, Switched 31
|url-status=deadto live.
Done -- GreenC 05:13, 8 October 2025 (UTC)
flycmi.com
[edit]Found on University of Illinois Willard Airport and has a big fat 404 with a banner containing "judi" up top. Departureā (talk) 00:37, 3 October 2025 (UTC)
- Added to WP:JUDI, which is the correct place for this Big Blue Cray(fish) Twins (talk) 07:44, 4 October 2025 (UTC)
Done via WP:JUDI batch #29 Special:Diff/1324524612/1324542613 -- GreenC 05:05, 28 November 2025 (UTC)
villagevoice.com
[edit]Many of these links need to be converted to new URLs like this. This will have to be in batches because it's not all the same method
Non PHP links
[edit]These parts needs removed to create the new URLs. There URLS may need more than one of these points removed:
- Dates: These are /YYYY-MM-DD/ or /YYYY/MM/DD/.
- Sections: These are usually after the date before the article name. They may be after .com like below. [15]
- Sections with Numerical IDs: These are after the end of the URL. [16]
- Apostrophe: [17] is now [18]
- /number/ at the end like this and that
Alternatively:
- Any URLs missing an ending slash at the end needs one like Gaga's link above.
- Commas: These links will most likely need ghost redirects as this is now here.
PHP links
[edit]Majority of the links will redirects. They follow the same rules above with some exception.
- Underscores: These need to change to hyphens while removing the majority of the URL: Converting to that redirects to here
- /issues/: Although this is now here, no luck in converting.
- /specials/: No luck converting articles like this which is already archived.
If you want to go through the entire website, including the working links, ~7k Thank you very much! MrLinkinPark333 (talk) 23:56, 3 October 2025 (UTC)
- Thank you for the rules set. -- GreenC 04:01, 10 October 2025 (UTC)
- You're welcome! It was a complicated request. MrLinkinPark333 (talk) 04:02, 10 October 2025 (UTC)
- Thank you for the rules set. -- GreenC 04:01, 10 October 2025 (UTC)
objection
[edit]Is there a way of excluding a domain name? There are hundreds (probably thousands) of citations using villagevoice.com sourcing Scientology topics. Although VV copied the text of their articles from a prior website style, the new copies are shit (images and formatting gone). The latest run was on article List of Scientologists which screwed up. By marking newer VV URLs as live, we are losing formatting and images of the Wayback Machine archived articles... well, not exactly losing, but deferring to the newer shit copy. Can you just make villagevoice.com an exclusion of GreenC Bot? There's no documentation at User:GreenC bot to explain what this bot does or how to exclude it, etc., and there are about 800 articles in the Wikiproject:Scientology; likely every one of them linking to a VV article. Fixing this by hand is not an option. ā¶ I am Grorp ā 08:13, 10 October 2025 (UTC)
- User:Grorp this was a job request made on a noticeboard initiated another editor to process the entire domain. The link to this location was in the edit summary Special:Diff/1314954569/1316049965 .. User:MrLinkinPark333 I have put the other half of the domain on hold until it is clear how to proceed because right now there is no longer consensus. I have not looked too closely at the issue raised by Grop yet. Also Grop, you are looking for documentation about the bot, that is also in the edit summary. -- GreenC 16:52, 10 October 2025 (UTC)
- @GreenC: That link (i.e., this thread) doesn't have documentation about what GreenC bot is doing... it's just some request about villagevoice.com. I would have expected documentation at User:GreenC bot that explains what the bot is intended to do -- keeping in mind that I (and most editors) were not part of any request to invent the bot. Are you saying GreenC bot only does villagevoice.com? Even if so, I would expect some sort of documentation (or pointer to it) on its page. I'm not sure what other article types cite to villagevoice.com articles. Keeping in mind that the previous editor of the paper, Tony Ortega, has been covering the Scientology topic for a very long time. As editor and reporter, there are many articles on the topic, and many are cited within WP:WikiProject Scientology-tagged articles, which were duly copied into the Wayback Machine. The Wayback Machine copies are much more desirable than the 'new' VV URLs, which I swear were created by an automated process when they updated their website to a new coding format. ā¶ I am Grorp ā 20:22, 10 October 2025 (UTC)
- The bot is actually WaybackMedic, indicated in the edit summary. The account that runs the bot is GreenC bot, which also runs many others bots with various names. That's how bots work on Wikipedia: 1 user account, many bots attached to it. Otherwise I'd need 10 different accounts for each bot program which is not practical. What WaybackMedic does is address link rot issues - it's been in development for over 10 years and has thousands of features due to the complexity of link rot. When you ask "what is it doing", for this page it is for user's to request modifications to URLs so that dead links become live again, or add an archive URL Special:Diff/1315452952/1316054727.
- Back to VV: Compare archive vs. live. The formatting doesn't look too bad, but I agree the image link is broken. Sometimes Wayback Machine copies are superior to live pages, this might one such case. -- GreenC 00:16, 11 October 2025 (UTC)
- GreenC: Village Voice redesigned/reformatted/reprogrammed their website in 2017. Anything prior to then had its text copied from the original article -- minus images, formatting, bolding, etc. Of course a Wayback archive of the new design looks just like the new (current) design, like in your example above which is a 2017 article. But take older articles and you'll see the difference. Here is one example of a 2008 article (used in 11 Wikipedia articles): current article, archived original. Hardly the same. If the Wayback Machine's archived copy is better than VV's archive copy, I don't see why we need to primarily point to VV's archived copy, especially not to mark the newer url as live even if there is already a valid older url-archive listed. Now here's the rub... in the edit which alerted me to GreenC bot's failures, the very first change it made is blatantly incorrect. The bot took this June 20, 2011 article and changed it to this June 28, 2011 article. They are not even the same article. I don't know what the bot does to attempt to find the same article, but this one was indeed a failure, and an editor unfamiliar with looking up old edits and correcting the bot's error might simply check the June 28 article and realize it doesn't verify the text which precedes it, and might delete the "unverifiable" content from the article. Second example: The second change in that edit illustrates the loss of images: Wayback archive versus current VV version; especially noticeable is the loss of the last image which is referred to in the text of the article. I don't know how bots get programmed, but maybe for villagevoice.com it can (a) skip over articles that are pre-2017, and (b) skip over any citations that already contains an archive-url parameter. A better solution for villagevoice.com/blog urls is to just find the Wayback Machine archived copy. I just don't understand what problem GreenC bot is trying to solve that isn't better solved by User:InternetArchiveBot. ā¶ I am Grorp ā 10:50, 11 October 2025 (UTC)
- Seems like it was an error on the site. Using this redirect instead points to the right article. For some reason, had to remove the _t at the end, even though it was already there in the url per your revision. If IA copies are needed instead, then that works for me. MrLinkinPark333 (talk) 15:57, 11 October 2025 (UTC)
- This has become more complicated, the details were not understood before and kindly brought to attention by Grorp. The first thing is to stop more changes, the bot was halted two days ago. The second is to undo the controversial changes made in Batch 1, namely "Moved 2,043 links to a new URL" and "Switched 400 |url-status=dead to live". There are logs so it should be possible with programming work. It might be possible to convert to archive URLs in the same pass, or possibly two passes (first revert, second archive). -- GreenC 21:45, 11 October 2025 (UTC)
- The site is too much trouble and holding up work with other domain requests, moving on for now. -- GreenC 23:48, 17 October 2025 (UTC)
- This has become more complicated, the details were not understood before and kindly brought to attention by Grorp. The first thing is to stop more changes, the bot was halted two days ago. The second is to undo the controversial changes made in Batch 1, namely "Moved 2,043 links to a new URL" and "Switched 400 |url-status=dead to live". There are logs so it should be possible with programming work. It might be possible to convert to archive URLs in the same pass, or possibly two passes (first revert, second archive). -- GreenC 21:45, 11 October 2025 (UTC)
- Seems like it was an error on the site. Using this redirect instead points to the right article. For some reason, had to remove the _t at the end, even though it was already there in the url per your revision. If IA copies are needed instead, then that works for me. MrLinkinPark333 (talk) 15:57, 11 October 2025 (UTC)
- GreenC: Village Voice redesigned/reformatted/reprogrammed their website in 2017. Anything prior to then had its text copied from the original article -- minus images, formatting, bolding, etc. Of course a Wayback archive of the new design looks just like the new (current) design, like in your example above which is a 2017 article. But take older articles and you'll see the difference. Here is one example of a 2008 article (used in 11 Wikipedia articles): current article, archived original. Hardly the same. If the Wayback Machine's archived copy is better than VV's archive copy, I don't see why we need to primarily point to VV's archived copy, especially not to mark the newer url as live even if there is already a valid older url-archive listed. Now here's the rub... in the edit which alerted me to GreenC bot's failures, the very first change it made is blatantly incorrect. The bot took this June 20, 2011 article and changed it to this June 28, 2011 article. They are not even the same article. I don't know what the bot does to attempt to find the same article, but this one was indeed a failure, and an editor unfamiliar with looking up old edits and correcting the bot's error might simply check the June 28 article and realize it doesn't verify the text which precedes it, and might delete the "unverifiable" content from the article. Second example: The second change in that edit illustrates the loss of images: Wayback archive versus current VV version; especially noticeable is the loss of the last image which is referred to in the text of the article. I don't know how bots get programmed, but maybe for villagevoice.com it can (a) skip over articles that are pre-2017, and (b) skip over any citations that already contains an archive-url parameter. A better solution for villagevoice.com/blog urls is to just find the Wayback Machine archived copy. I just don't understand what problem GreenC bot is trying to solve that isn't better solved by User:InternetArchiveBot. ā¶ I am Grorp ā 10:50, 11 October 2025 (UTC)
- @GreenC: That link (i.e., this thread) doesn't have documentation about what GreenC bot is doing... it's just some request about villagevoice.com. I would have expected documentation at User:GreenC bot that explains what the bot is intended to do -- keeping in mind that I (and most editors) were not part of any request to invent the bot. Are you saying GreenC bot only does villagevoice.com? Even if so, I would expect some sort of documentation (or pointer to it) on its page. I'm not sure what other article types cite to villagevoice.com articles. Keeping in mind that the previous editor of the paper, Tony Ortega, has been covering the Scientology topic for a very long time. As editor and reporter, there are many articles on the topic, and many are cited within WP:WikiProject Scientology-tagged articles, which were duly copied into the Wayback Machine. The Wayback Machine copies are much more desirable than the 'new' VV URLs, which I swear were created by an automated process when they updated their website to a new coding format. ā¶ I am Grorp ā 20:22, 10 October 2025 (UTC)
- Rolled back changes in about 1,800 pages as first step. Example Special:Diff/1316044173/1318059180 -- GreenC 17:59, 21 October 2025 (UTC)
Processing results
[edit]Enwiki
- Pass 1: Batch 1 (0001-3000): Checked 3,000 pages and edited 2,329 pages. Moved 2,043 links to a new URL: 7 normal redirects, 2,036 ruled mapped redirects, Resolved 417 soft-404s. Removed 10
{{dead link}}. Added 9{{dead link}}. Switched 400|url-status=deadto live. Switched 119|url-status=liveto dead. Added 446 archive URLs (415 Wayback). - Pass 2: Batch 1 (0001-3000): Undo 1,984 moved links from Pass 1
- Pass 3: Batch 1 (0001-3000): Checked 3,000 pages and edited 1,993 pages. Resolved 3 soft-404s. Added 28
{{dead link}}. Switched 697|url-status=liveto dead. Added 1,504 archive URLs (1,470 Wayback).
- Batch 2 (3001-7025): Checked 4,026 pages and edited 2,775 pages. Moved 6 links to a new URL: 6 ruled mapped redirects, Resolved 6 soft-404s. Added 40
{{dead link}}. Switched 590|url-status=liveto dead. Added 2,498 archive URLs (2,370 Wayback).
IABot DB
- Updated about 9,500 URLs.
Done (I hope!) -- GreenC 04:14, 23 October 2025 (UTC)
Discussions
[edit]- Of the archived urls, was any of them non-php? I'm wondering to see if any more can be converted. MrLinkinPark333 (talk) 04:54, 10 October 2025 (UTC)
- @GreenC Why is this marking as dead and adding archives to sources that are still alive [19]? the only one I checked. PARAKANYAA (talk) 06:22, 22 October 2025 (UTC)
- This is still alive too, the only other one on my watchlist [20]. I think there are some issues. PARAKANYAA (talk) 06:24, 22 October 2025 (UTC)
- @PARAKANYAA: After a discussion (see above) GreenC decided to reverse their earlier edits on villagevoice.com citations. You'll notice that these new edits are the reverse of ones the GreenCBot did earlier. ā¶ I am Grorp ā 12:52, 22 October 2025 (UTC)
- @Grorp That's fine, but here it's not reversing anything. It hadn't edited the page before in either case. PARAKANYAA (talk) 12:59, 22 October 2025 (UTC)
- Yeah unfortunately this is a complex site with a lot of dead links that were not being repaired by IABot. I was left with a bad choice, or a very bad choice. So I went in the direction of the bad choice: adding archives in some cases where the link is still live. The alternative, the very bad choice, was to not add archives when links are dead. This later category is much larger than the former, so I went with the former because it did the least amount of harm for the most good, relative to the other option. Blame the website owners for having such a messed up website. -- GreenC 15:18, 22 October 2025 (UTC)
- @Grorp That's fine, but here it's not reversing anything. It hadn't edited the page before in either case. PARAKANYAA (talk) 12:59, 22 October 2025 (UTC)
nyti.ms
[edit]972 pages. Expand web short URL for nytimes.com -- GreenC 18:30, 5 October 2025 (UTC)
Enwiki
- Checked 972 pages and edited 951 pages. Moved 1,498 links to a new URL: 936 normal redirects, 355 ruled mapped redirects, 207 ghost mapped redirects, Removed 1
{{dead link}}. Added 6{{dead link}}. Switched 6|url-status=deadto live. Added 5 archive URLs (2 Wayback).
Done -- GreenC 05:41, 19 October 2025 (UTC)
- This change broke one example used in the artcile for .ms. 2001:464D:9529:4:AD70:1530:19AE:5B7C (talk) 13:13, 30 October 2025 (UTC)
- Fixed. And added code for parsing
{{URL}}with{{cbignore}}, I think it will skip next time. -- GreenC 20:28, 30 October 2025 (UTC)
- Fixed. And added code for parsing
Perhaps a slightly odd request, but I'd figured I'd bring this one to attention of users on this board. The template was deprecated and now is WP:TFDHd, but needs its uses changed to {{Cite POWO}}. There is not an automatic translation of URLs in the one that can be used with the other apparently. This board has the right background to do something about that perhaps. Izno (talk) 23:39, 6 October 2025 (UTC)
- Izno: There is no map between old and new URLs they are using different schemes. The best I can do is convert the ~1,000 instances of
{{WCSP}}to{{cite web}}and add archive URLs (or{{dead link}}). -- GreenC 04:48, 18 October 2025 (UTC)
- Izno, I think the template is now gone from main, file and template space. There are two other template pages transcluding it somehow, but I can't figure out where. -- GreenC 17:33, 19 October 2025 (UTC)
Enwiki
- Checked 1,092 pages and edited 1,054 pages. Converted 1,139 templates. Added 63
{{dead link}}. Added 1,077 archive URLs (1,077 Wayback).
Done -- GreenC 17:33, 19 October 2025 (UTC)
nationalpost.com
[edit]The domain hostname in the FQDN needs to be removed to create redirects to the new URL. Example: Changing this to that redirects here. These are in two categories:
Working redirects:
- news.nationalpost.com ~1900
- sports.nationalpost.com ~310
- arts.nationalpost.com ~480
- life.nationalpost.com ~60
- fullcomment.nationalpost.com ~180
Not working redirects
- network.nationalpost.com ~320 It looks like national post is excluded from IA.
This means only ~3k of the 7k articles need adjusting/archives. Thank you! MrLinkinPark333 (talk) 21:31, 7 October 2025 (UTC)
- Checking whole 7k, it has other problems the bot can repair eg. this to that. Seeing that WaybackMachine has excluded nationalpost.com a few Archive.today are available. -- GreenC 19:37, 19 October 2025 (UTC)
- MrLinkinPark333: Almost 5,000 links from dead to live (3,956 + 788). The 1,000 dead links are unfortunate, nothing to be done, but only about 1 in 6 of the total. This process moving dead links live is a particularly good idea when WaybackMachine excludes the domain. Good catch and an important domain. -- GreenC 01:18, 20 October 2025 (UTC)
- Nice find with the extra fixed links! MrLinkinPark333 (talk) 02:13, 20 October 2025 (UTC)
- MrLinkinPark333: Almost 5,000 links from dead to live (3,956 + 788). The 1,000 dead links are unfortunate, nothing to be done, but only about 1 in 6 of the total. This process moving dead links live is a particularly good idea when WaybackMachine excludes the domain. Good catch and an important domain. -- GreenC 01:18, 20 October 2025 (UTC)
Enwiki
- Checked 7,831 pages and edited 3,871 pages. Moved 3,956 links to a new URL: 311 normal redirects, 3,645 ruled mapped redirects, Resolved 317 soft-404s. Removed 21
{{dead link}}. Added 1,017{{dead link}}. Switched 788|url-status=deadto live. Switched 27|url-status=liveto dead. Added 30 archive URLs (0 Wayback).
IABot DB
- Checked about 14,000 URLs and updated about 9,000 which propagate to 300+ wikis -- GreenC 18:30, 20 October 2025 (UTC)
Done -- GreenC 18:30, 20 October 2025 (UTC)
architectlaunceston.com.au
[edit]Website leads to a spammy blog. Saw that it was used in some articles relating to Tasmania. EatingCarBatteries (contributions, talk) 02:35, 8 October 2025 (UTC)
Done via WP:JUDI batch #29 Special:Diff/1324524612/1324542613 -- GreenC 05:05, 28 November 2025 (UTC)
blockmrecords.org
[edit]Website http://www.blockmrecords.org/bach/ has been moved to https://smtd.umich.edu/bach-organ-works/ There is no easy way to link to individual works as they seem to have gone to a js based search system. NightWolf1223 <Howl at meā¢My hunts> 03:05, 9 October 2025 (UTC)
- The entire website blockmrecords.org redirects. It is in 31 pages. I tried to find a way to move the existing URLs to the new domain without success. I'll add archive URLs. -- GreenC 01:38, 20 October 2025 (UTC)
Enwiki
- Checked 31 pages and edited 31 pages. Switched 1
|url-status=liveto dead. Added 51 archive URLs (50 Wayback).
IABot DB
- Checked and updated 121 links and set to permadead
Done -- GreenC 19:14, 20 October 2025 (UTC)
ift.org.mx and cofece.mx
[edit]| This section is pinned and will not be automatically archived. |
These Mexican government agencies will likely be dissolved this month, and I'm not sure what will happen to references and other materials used within. Sammi Brie (she/her Ā· t Ā· c) 19:00, 10 October 2025 (UTC)
On hold pending dissolution. -- GreenC 04:21, 18 October 2025 (UTC)
- It took place on October 17 for the former and presumably a similar time for the latter. Sites are still up for now (but with the replacing agency's logo). Unclear whether they will use it or another page for their own business. I suspect the domain rpc.ift.org.mx (full of PDFs containing broadcasting technical information) will be retained intact at some other domain at some point. Sammi Brie (she/her Ā· t Ā· c) 06:32, 21 October 2025 (UTC)
- If it's a new agency at the same domain.. what does this mean we should do in terms of archiving URLs? Options are do nothing. Or treat all URLs as dead and add archive URLs. -- GreenC 16:06, 21 October 2025 (UTC)
- It took place on October 17 for the former and presumably a similar time for the latter. Sites are still up for now (but with the replacing agency's logo). Unclear whether they will use it or another page for their own business. I suspect the domain rpc.ift.org.mx (full of PDFs containing broadcasting technical information) will be retained intact at some other domain at some point. Sammi Brie (she/her Ā· t Ā· c) 06:32, 21 October 2025 (UTC)
fiu.edu/~mirandes
[edit]The Cardinals of the Holy Roman Church by Salvador Miranda
Seems the location was changed some time ago. The old URL was http://www2.fiu.edu/~mirandas and has been replaced with https://cardinals.fiu.edu, see example of update[21]. Currently used roughly 530 times[22]. -- LCU ActivelyDisinterested Ā«@Ā» °āt° 14:34, 14 October 2025 (UTC)
Enwiki
- Checked 1,226 pages and edited 1,225 pages. Moved 1,589 links to a new URL: 1,589 ruled mapped redirects, Resolved 1 soft-404s. Removed 8
{{dead link}}. Added 3{{dead link}}. Switched 286|url-status=deadto live. Added 8 archive URLs (8 Wayback).
IABot DB
- Checked about 7,100 and updated about 4,800 links
Done -- GreenC 05:05, 21 October 2025 (UTC)
- Thanks GreenC. -- LCU ActivelyDisinterested Ā«@Ā» °āt° 09:39, 21 October 2025 (UTC)
- I understand the need to update the URLs to reflect the updated location, but am not sure of the changes made in this edit where two URLs (one valid/live; the other incorrect/dead) were changed to SKIPDEADURL. ā Archer1234 (tĀ·c) 12:43, 22 October 2025 (UTC)
- User:Archer1234: Shoot. That is a typo in my code. It ended up in 352 pages. Thanks for the report. Working to repair. -- GreenC 15:26, 22 October 2025 (UTC)
- Repaired. -- GreenC 22:28, 22 October 2025 (UTC)
- User:Archer1234: Shoot. That is a typo in my code. It ended up in 352 pages. Thanks for the report. Working to repair. -- GreenC 15:26, 22 October 2025 (UTC)
alwaystouchout.com
[edit]This website appears to be dead/takes too long to respond. Where do we go next? There are 107 links to this and various sub-pages http://alwaystouchout.com/ Difficultly north (talk) Time, department skies 22:11, 17 October 2025 (UTC)
- Appears to have been down since May 2022 [23]. I'll archive it. 56 pages. -- GreenC 21:45, 20 October 2025 (UTC)
Enwiki
- Checked 56 pages and edited 31 pages. Switched 11
|url-status=liveto dead. Added 22 archive URLs (22 Wayback).
IABot DB
- Set domain to permadead
Done -- GreenC 16:02, 21 October 2025 (UTC)
pasadenastarnews.com
[edit]This website seems to have a mixture of working and broken links. I think it'd be easier to look through all of them. A few notes:
- Any links with ci_ in the URL are broken and don't have any new URLs like this.
- this link is a working redirect to here
Thanks! MrLinkinPark333 (talk) 20:08, 21 October 2025 (UTC)
Enwiki
- Checked 471 pages and edited 193 pages. Moved 188 links to a new URL: 54 normal redirects, 129 ruled mapped redirects, 5 ghost mapped redirects, Resolved 4 soft-404s. Added 6
{{dead link}}. Switched 6|url-status=deadto live. Added 11 archive URLs (6 Wayback).
IABot DB
- Checked 557 URLs and updated 147
Done -- GreenC 23:38, 23 October 2025 (UTC)
sacbee.com
[edit]Similarly, this website has working and broken links. Generally, the ones that are broken are in these formats:
However, it'd be easier to check all of them instead. ~2600
Thanks again! MrLinkinPark333 (talk) 20:15, 21 October 2025 (UTC)
Enwiki
- Checked 2,733 pages and edited 1,147 pages. Moved 950 links to a new URL: 46 normal redirects, 883 ruled mapped redirects, 21 ghost mapped redirects, Resolved 7 soft-404s. Added 230
{{dead link}}. Switched 29|url-status=deadto live. Switched 33|url-status=liveto dead. Added 301 archive URLs (283 Wayback).
IABot DB
- Checked 4,104 links and updated 1,908
Done -- GreenC 15:46, 24 October 2025 (UTC)
timesoftunbridgewells.co.uk
[edit]Has since been replaced by https://www.timeslocalnews.co.uk/ . The C of E God Save the King! (talk) 17:45, 22 October 2025 (UTC)
- User:The C of E, there is no obvious way to automate replacement, for example this does not go here. I will add archive URLs via bot. Even better, there are so few links, recommend if you can manually search for the new URLs and replace them. -- GreenC 18:54, 22 October 2025 (UTC)
- @GreenC: That particular one links to here. I hope that can assist. The C of E God Save the King! (talk) 16:27, 23 October 2025 (UTC)
- That works. They have bot blocking that I can not surmount ("Are you human?" from CloudFlare), I am unable to verify the new link works, so I made a "blind move" ie. simple search-replace without verification. I did manually verify 8 pages so I assume the other 4 are OK. The problem is some "/local-news/" redirect to "/lifestyle/" which is fine, except it might be a problem later with the Wayback Machine archives, which might not be able to follow the redirects. Ideally if you can go through and fix those redirects it would be best long term, I can't do it automatically because of the bot blocking. -- GreenC 16:18, 24 October 2025 (UTC)
- @GreenC: That particular one links to here. I hope that can assist. The C of E God Save the King! (talk) 16:27, 23 October 2025 (UTC)
Enwiki
- Checked 14 pages and edited 14 pages. Removed 2
{{dead link}}. Switched 8|url-status=deadto live.
Done -- GreenC 16:18, 24 October 2025 (UTC)
postandcourier.com
[edit]Old links for The Post and Courier unfortunately are not redirecting to their new URLs. For example, this is now here. Unless ghost redirects are found, I think archives will be needed. This is because the new URLs are not easily converted. Some of these links are already archived. 450. Thank you! MrLinkinPark333 (talk) 23:19, 23 October 2025 (UTC)
- MrLinkinPark333: I'll do the whole domain for example this (Danny Verdin), this (Newspaper endorsements in the 2012 United States presidential primaries), this (South Carolina Stingrays) -- in case you see any move rules. -- GreenC 16:29, 24 October 2025 (UTC)
- Of the three you posted, no luck with new URLs. MrLinkinPark333 (talk) 22:22, 24 October 2025 (UTC)
Enwiki
- Checked 2,458 pages and edited 759 pages. Moved 296 links to a new URL: 6 normal redirects, 279 ruled mapped redirects, 11 ghost mapped redirects, Resolved 75 soft-404s. Added 59
{{dead link}}. Switched 5|url-status=deadto live. Switched 63|url-status=liveto dead. Added 726 archive URLs (619 Wayback).
IABot DB
- Checked about 3,300 links and updated about 1,200
Done -- GreenC 16:27, 25 October 2025 (UTC)
National Library of Australia
[edit]Per Special:PermanentLink/1318509429#URL has changed, nla.gov.au has moved to library.gov.au. However it looks like Trove and Pandora/webarchive are still hanging off the old domain. ClaudineChionh (she/her Ā· talk Ā· email Ā· global) 09:27, 24 October 2025 (UTC)
- This domain exists in 62,000 pages -- GreenC 16:39, 24 October 2025 (UTC)
- I can't find examples where changing the domain name alone makes sense, except for the home page: https://nla.gov.au --> https://library.gov.au .. everything else is either the old Trove, Catalog and Pandora/Webarchive links .. or the links at the new domain have new paths, it's basically an entirely new website. Mapping the old links to new, without redirects, may or may not be possible, and it would take serious investigative work. Assuming the new website content is even the same as the old. It's like they abandoned the old site and started a new one. -- GreenC 17:07, 24 October 2025 (UTC)
- Thanks for checking, I suspected this might be the case (it was much too late in the day for me to investigate properly). ClaudineChionh (she/her Ā· talk Ā· email Ā· global) 00:49, 25 October 2025 (UTC)
- I found a match in Helena Blavatsky: this [dead link] is now available here. In Robert Louis Stevenson, this matches here. The matches are imperfect (eg. "german-colonies-in-the-pacific" vs. "german-colonies-pacific") it would require AI or fuzzy matching. Unfortunately there are only 37 pages. There is also "nla.gov.au/research-guides/" (19 pages) available here. I'm hesitant because of the low count and bespoke coding. Will think about how it might be done. -- GreenC 02:22, 25 October 2025 (UTC)
- Thanks for checking, I suspected this might be the case (it was much too late in the day for me to investigate properly). ClaudineChionh (she/her Ā· talk Ā· email Ā· global) 00:49, 25 October 2025 (UTC)
guardian.co.uk
[edit]This is a big request. This needs to be split up into two sections:
observer.guardian.co.uk
[edit]- These ones needs to have URL changes to create redirects. observer.guardian.co.uk needs to be changed to theguardian.com/observer. Therefore, changing this to that makes a redirect to here.
- Any URLs with %2C needs to be converted to commas for them to work as redirects.
- Enwiki
- Checked 1,515 pages and edited 1,447 pages. Moved 1,545 links to a new URL: 1,545 ruled mapped redirects, Resolved 77 soft-404s. Added 2
{{dead link}}. Switched 80|url-status=deadto live. Switched 3|url-status=liveto dead. Added 26 archive URLs (0 Wayback).
- Checked 1,515 pages and edited 1,447 pages. Moved 1,545 links to a new URL: 1,545 ruled mapped redirects, Resolved 77 soft-404s. Added 2
- IABot DB
- Checked and updated 3,436 links
Done -- GreenC 03:48, 28 October 2025 (UTC)
guardian.co.uk
[edit]- Majority of these ones are redirecting with some exceptions. This one is broken for Ice hockey at the 2007 Asian Winter Games. For others, the only change will be Http -> Https like this one for Piano Sonata No. 1 (Beethoven).
- Any URLs with %2C needs to be converted to commas for them to work as redirects. Changing this to that redirects here.
Observer might be easier to do first as it's ~1500 out of ~14000.
In case you are wondering, I requested theguardian.com in February. Thanks again! MrLinkinPark333 (talk) 01:27, 27 October 2025 (UTC)
- It's 9,500 for *.guardian.co.uk which includes the observer.guardian.co.uk - I'll skip over those during the bot's parsing step. Hard to get an accurate reading with Cirrus or SQL alone. -- GreenC 05:38, 29 October 2025 (UTC)
- Enwiki
- Checked 9,813 pages and edited 6,934 pages. Moved 7,725 links to a new URL: 90 normal redirects, 7,635 ruled mapped redirects, Resolved 1,070 soft-404s. Removed 5
{{dead link}}. Added 54{{dead link}}. Switched 231|url-status=deadto live. Switched 83|url-status=liveto dead. Added 948 archive URLs (726 Wayback).
- Checked 9,813 pages and edited 6,934 pages. Moved 7,725 links to a new URL: 90 normal redirects, 7,635 ruled mapped redirects, Resolved 1,070 soft-404s. Removed 5
- IABot DB
- Checked 68,000 links and updated about 20,000
Done -- GreenC 17:17, 31 October 2025 (UTC)
foxsportsasia.com
[edit]foxsportsasia.com is no longer active following the shutdown of Fox Sports Asia, the same reason as foxsports.ph. MarcusAbacus (talk) 07:31, 31 October 2025 (UTC)
Enwiki
- Checked 683 pages and edited 298 pages. Added 20
{{dead link}}. Switched 94|url-status=liveto dead. Added 316 archive URLs (314 Wayback).
IABOt DB
- Checked and updated about 1,000 links set domain permadead
Done -- GreenC 21:06, 31 October 2025 (UTC)
rivals.ph
[edit]rivals.ph is still active, but is now just a promotion for a gambling website. Any links that previously led to news such as this, just lead to a 404 page. MarcusAbacus (talk) 07:35, 31 October 2025 (UTC)
Done via WP:JUDI batch #29 Special:Diff/1324524612/1324542613 -- GreenC 05:05, 28 November 2025 (UTC)
londondatastore-upload.s3.amazonaws.com/docs
[edit]It looks like the archived local council election results from the London Datastore site have stopped working. I have replaced the reference for the 1982 local election results in the Karen Buck article. (See second paragraph in Career section.) The archived page was at this page and the original was here. The reference can be seen at this old revision of the article. The correct links to these results can be found here. I think I had to fix another one of these broken references recently. If anyone can help fix them elsewhere, that would be appreciated. I imagine these election results are used elsewhere. TrottieTrue (talk) 14:21, 6 November 2025 (UTC)
- This is a messy problem the URLs are inconsistent between the old amazonaws and new links at data.london.gov.uk. The easiest solution is treat the URLs as dead links, and replace with archive.org URLs. 574 pages. -- GreenC 21:23, 17 November 2025 (UTC)
Enwiki
- Checked 565 pages and edited 498 pages. Added 1
{{dead link}}. Switched 35|url-status=liveto dead. Added 1,394 archive URLs (1,394 Wayback).
IABot DB
- Updated about 30 URLs
Done -- GreenC 03:26, 18 November 2025 (UTC)
- Thank you. TrottieTrue (talk) 13:05, 18 November 2025 (UTC)
www.parliament.vic.gov.au
[edit]Hansard from the Parliament of Victoria has been at a new location since c. 2022, forcing edits like [24] to remain active. This is a simple fix, but I have no proof that all such fixes would require replacing www with hansard. Is there a good way to do this with a bot, or would bot work be limited to listing broken links to parliament.vic.gov.au that include the string hansard? I have no idea how many articles are involved, or how to ascertain this. Thanks, Nyttend (talk) 20:28, 10 November 2025 (UTC)
PS, because this is a citation to a printed resource, an acceptable (although suboptimal) action would be to remove the URL entirely. Nyttend (talk) 20:31, 10 November 2025 (UTC)
- Nyttend: Yes the bot can check every URL for 'www' -> 'hansard' and if that works change the citation to live status. If it's truly dead add an archive URL. This is boilerplate. You discovered a rule ('www' -> 'hansard'). If you find other rules that would be great. Sometimes domains have multiple rules. I program the rules into the bot and away it goes. -- GreenC 21:15, 10 November 2025 (UTC)
- But can you check to verify that the potentially archived document is the same file as the document that results when you change www to hansard? Or is this so unlikely to be a problem that it's safe to tweak the URLs without checking? Nyttend (talk) 07:12, 11 November 2025 (UTC)
- If it successfully changes www to hansard it won't add an archive URL. I have soft-404 checkers that would pick up most problems. If there is actual "content drift" ie. a few words were changed between revisions of an article, I can't pick that up, but that is a rare case. I suggest you do spot checks and if you see it we can figure out what to do based on severity, worse case I can unwind the changes. -- GreenC 16:16, 11 November 2025 (UTC)
- The below changes are done. It was able to convert around 250, the rest are dead links. -- GreenC 00:41, 20 November 2025 (UTC)
- If it successfully changes www to hansard it won't add an archive URL. I have soft-404 checkers that would pick up most problems. If there is actual "content drift" ie. a few words were changed between revisions of an article, I can't pick that up, but that is a rare case. I suggest you do spot checks and if you see it we can figure out what to do based on severity, worse case I can unwind the changes. -- GreenC 16:16, 11 November 2025 (UTC)
- But can you check to verify that the potentially archived document is the same file as the document that results when you change www to hansard? Or is this so unlikely to be a problem that it's safe to tweak the URLs without checking? Nyttend (talk) 07:12, 11 November 2025 (UTC)
Enwiki
- Checked 1,737 pages and edited 930 pages. Moved 244 links to a new URL: 27 normal redirects, 200 ruled mapped redirects, 17 ghost mapped redirects, Resolved 78 soft-404s. Added 30
{{dead link}}. Switched 14|url-status=deadto live. Switched 73|url-status=liveto dead. Added 863 archive URLs (833 Wayback).
Done -- GreenC 00:41, 20 November 2025 (UTC)
vonnegutlibrary.org
[edit]This is a bit of a weird one, and I'm unsure if this is even the right place to report this. The site appears normal and is serving up its usual content (about the Kurt Vonnegut Museum and Library). However, the page quickly redirected me to a malware download disguised as a browser update. How do we handle hijacked links like this on WP? As an aside, I put the URL through VirusTotal and only one malware vendor detected it (!) wizzito | say hello! 19:35, 14 November 2025 (UTC)
- When I go to the site, it works fine. No redirect to malware. Maybe a temporary problem? -- GreenC 19:53, 16 November 2025 (UTC)
Not done - User:Wizzito I'll close this request but if you still see a problem or it happens again let me know it can be reopened. -- GreenC 19:10, 3 December 2025 (UTC)
- Yeah, reached out to the organization over FB and seems to be fixed now. wizzito | say hello! 19:24, 3 December 2025 (UTC)
ejf.org.uk
[edit]The Edward Johnston Foundation appears to no longer exist, and the domain now advertises web design services. 17 usages.
Done via WP:JUDI batch #29 Special:Diff/1324524612/1324542613 -- GreenC 05:05, 28 November 2025 (UTC)
economist.com (nov 2025)
[edit]Various links are redirecting with the following rules:
- URLs with /node/ tend to redirect such as this. I previously asked this URL change a few months ago, but I see ~600 more of these.
- Any capitals in the URL need to be swapped to lowercase to make a working redirect. Changing this to that redirects here.
- dot cfm needs to be removed at the end of the URL to make a working redirect. This does not apply to story_id URLs. Changing this to that redirects here.
If you want to check the entire site, there's 15k of articles. Thanks! MrLinkinPark333 (talk) 00:25, 15 November 2025 (UTC)
- For /node/ they were repaired; the original archive URL still showing in search eg. Special:Diff/1302829156/1303801147 -- GreenC 21:29, 16 November 2025 (UTC)
- Ah okay. Thank you for the clarification. MrLinkinPark333 (talk) 21:35, 16 November 2025 (UTC)
- MrLinkinPark333 : This is a complex site. Test runs are showing a ton of redirects, for everything. Example this to here. The live page is a truncated paywall, the original content and user-comments are available in archive here. Making links live thus could have the unintended effect of less verifiable vs. treating as dead with an archive URL. However 1. the Wayback Machine does not have archives of everything, 2. I am unable to create new archives (probably some kind of block), and 3. it's possible if archives exist they could be truncated paywall versions as well. Possibly if we can establish when the paywall went up, any URL with a date prior to that could be converted to an archive URL and expected to work. The post-paywall links probably none of them are going to work no matter the solution. -- GreenC 17:47, 21 November 2025 (UTC)
- Hmm. I don't know which would be the more easier option. MrLinkinPark333 (talk) 20:37, 21 November 2025 (UTC)
- Not sure. Paywalls have been going up increasingly due to AI scrapers stealing content, the old open web is kind of shutting down and the new web (AI-based browsers) has yet to emerge. I'm trying to find the cut off date for the paywall, the Economist fortunately puts dates in most URLs. Thus any URL from pre-firewall will be treated as dead and likely work with the archive version. -- GreenC 22:09, 21 November 2025 (UTC)
- If that's easier, feel free to use that method instead. I didn't realize this would be so complicated. MrLinkinPark333 (talk) 22:35, 21 November 2025 (UTC)
- Not sure. Paywalls have been going up increasingly due to AI scrapers stealing content, the old open web is kind of shutting down and the new web (AI-based browsers) has yet to emerge. I'm trying to find the cut off date for the paywall, the Economist fortunately puts dates in most URLs. Thus any URL from pre-firewall will be treated as dead and likely work with the archive version. -- GreenC 22:09, 21 November 2025 (UTC)
- Hmm. I don't know which would be the more easier option. MrLinkinPark333 (talk) 20:37, 21 November 2025 (UTC)
Not done: MrLinkinPark333, there is too much complexity and inconsistency. If there is a sub-path you want to work on like the previous /node/ request that is probably the best approach in smaller parts. Many websites can be done as a whole, which is usually ideal, but this one I keep getting different results hard to make sense of patterns. -- GreenC 04:39, 4 December 2025 (UTC)
- In that case, perhaps these two URL formats in the examples above would be more useful to focus on instead of the entire website:
- If you run into issues with these two URL formats, please let me know. MrLinkinPark333 (talk) 20:07, 5 December 2025 (UTC)
- I think these are some of the ones I was seeing problems with. The issue is that Economist really does not want people stealing their content. They have sophisticated blocking technology that works on WaybackMachine archives, as well we live pages. I can't detect it, so I don't know when its being blocked or not. And it's variable, you find a pattern or system, only to discover there is none, it's inconsistent. For these reasons I am hesitant to do bot work without understanding the technology they are using and how to deal with it, if it is even possible. As I mentioned above, the premier media sites are cracking down due to AI scrapers stealing content. They likely have rate limiting as well even if I signed up for an account. -- GreenC 20:32, 6 December 2025 (UTC)
- If you've been having problems with these URLs, then it's okay to close the request. MrLinkinPark333 (talk) 21:14, 6 December 2025 (UTC)
- I think these are some of the ones I was seeing problems with. The issue is that Economist really does not want people stealing their content. They have sophisticated blocking technology that works on WaybackMachine archives, as well we live pages. I can't detect it, so I don't know when its being blocked or not. And it's variable, you find a pattern or system, only to discover there is none, it's inconsistent. For these reasons I am hesitant to do bot work without understanding the technology they are using and how to deal with it, if it is even possible. As I mentioned above, the premier media sites are cracking down due to AI scrapers stealing content. They likely have rate limiting as well even if I signed up for an account. -- GreenC 20:32, 6 December 2025 (UTC)
indiatoday.intoday.in
[edit]These ones are redirecting to new URLs at indiatoday.in like this going to that..4900 ~ Thank you! MrLinkinPark333 (talk) 23:21, 15 November 2025 (UTC)
Enwiki
- Checked 4,986 pages and edited 4,903 pages. Moved 6,599 links to a new URL: 278 normal redirects, 6,149 ruled mapped redirects, 172 ghost mapped redirects, Resolved 141 soft-404s. Removed 6
{{dead link}}. Added 17{{dead link}}. Switched 320|url-status=deadto live. Switched 50|url-status=liveto dead. Added 125 archive URLs (101 Wayback).
Done -- GreenC 05:14, 25 November 2025 (UTC)
pastemagazine.com (games coverage)
[edit]Not sure if this needs to be addressed but Paste (magazine) in July 2025 spun-off their games & other related topics to a new dedicated outlet called Endless Mode. At the time, any old Paste article that fell under the umbrella of topics covered by Endless Mode was moved to the new url. Now Paste has just announced that they're shuttering Endless Mode and shifting all of that to The A.V. Club (which they purchased in March 2024). So now all the old Paste articles & Endless Mode articles have been moved to the A.V. Club url. My issue with the way they've done the redirect is that it is not clear these old articles were originally published by a different outlet before the A.V. Club acquisition. For example:
- Storm King's Thunder review - 2019 archive vs today
- Endless Mode announcement - July 2025 archive vs today
But I'm also not seeing anything in how they structured the URLs that would make it easy to find the articles that been moved from the Paste header to the A.V. Club header. Just wanted to check to see if these old Paste articles should be marked as dead so it is clear when used that Paste was the original source & not the A.V. Club. Sariel Xilo (talk) 00:09, 18 November 2025 (UTC)
- If the content at the new site is the same as the old site - other than site branding - it's probably not worthwhile changing a live working links to an archived link. It's sort of a pro/con situation either way. -- GreenC 05:43, 25 November 2025 (UTC)
leedsunited.com
[edit]IABOT treats this domain as permalive but links before about 2024 to this site no longer work. 688 pages
links between around 2019-2024 of format https://www.leedsunited.com/news/[team-news|academy]/[five digit number]/[title] are generally still live at https://www.leedsunited.com/en/news/[title] (e.g. [25] is alive at [26] and [27] is live at [28]) - these links should be easy to move. There are older URLs of this format that appear to be dead though (e.g. [29] does not seem to be live anywhere on leedsunited.com). Hence these should me moved where live at a different url and tagged as dead otherwise.
There are many older links of other formats that are dead and not necessarily marked as such - this includes "http://www.leedsunited.com/news/article" such as [30], "http://www.leedsunited.com/news/[date]" such as [31] and "http://www.leedsunited.com/page/LatestNewsDetail" such as [32] - a lot of these give a 522 error page rather than a 404 like the later URL format though, so im not sure if this will create a problem for running a bot over these links. It seems that every live link on leedsunited.com is at leedsunited.com/en though so it might work to mark anything that isn't and cant be moved as dead, if the 522 errors are problem. Microwave Anarchist (talk) 22:11, 18 November 2025 (UTC)
Enwiki
- Checked 691 pages and edited 544 pages. Moved 387 links to a new URL: 8 normal redirects, 376 ruled mapped redirects, 3 ghost mapped redirects, Resolved 14 soft-404s. Added 28
{{dead link}}. Switched 7|url-status=deadto live. Switched 133|url-status=liveto dead. Added 1,853 archive URLs (1,770 Wayback).
IABot DB
- Checked and updated about 3,700 links
Done -- GreenC 04:21, 26 November 2025 (UTC)
timetravel.mementoweb.org
[edit]Archive provider shut down September 2025.[33]
100 pages plus archive links in IABot DB need to be cleared. -- GreenC 05:27, 21 November 2025 (UTC)
Enwiki
- Checked 99 pages and edited 91 pages. Switched 1
|url-status=liveto dead. Added 139 archive URLs (102 Wayback).
IABot DB
- Checked and updated 323 urls
Done -- GreenC 18:31, 26 November 2025 (UTC)
www.b14643.de
[edit]The domain has been usurped, and now redirects to a gambling site. I checked a few of the affected pages and there seems to be complete archives on the Wayback Machine. 119 articles are affected. IrisPersephone (talk) 05:14, 23 November 2025 (UTC)
Done via WP:JUDI batch #29 Special:Diff/1324524612/1324542613 -- GreenC 05:05, 28 November 2025 (UTC)
readabilityofwikipedia.com
[edit]The URL Special:LinkSearch/*.readabilityofwikipedia.com has been hijacked by a pornography site. The current URL is https://readability.nl/. Toukouyori Mimoto (talk) 14:08, 23 November 2025 (UTC)
- See also Wikipedia:External_links/Noticeboard#Replace hijacked site Readability of Wikipedia. Toukouyori Mimoto (talk) 21:18, 24 November 2025 (UTC)
- Toukouyori Mimoto: Thanks. The bot can only update namespace 0, 6 and 10. There are none in those namespace: [34] .. what to do in other namespaces is a general problem because it would require editing pages with esoteric formatting and user posts. If the URLs can be replaced 1:1 it might work. For example this to this .. but it doesn't work. There are only 93 pages total. Maybe I could try replacing with archive URLs.. -- GreenC 05:31, 25 November 2025 (UTC)
- Thank you for you response, that was a good catch. It seems like the new website accepts only exact titles as they appear in the URL. I replaced all spaces with underscores and it seems to work: https://www.readability.nl/check/Wikipedia:Large_language_models. Toukouyori Mimoto (talk) 11:20, 25 November 2025 (UTC)
- OK will try the space->underscore you discovered -- GreenC 15:09, 25 November 2025 (UTC)
- It's done.
A few still show up in search that I don't understand.-- GreenC 16:59, 25 November 2025 (UTC)
- It's done.
- OK will try the space->underscore you discovered -- GreenC 15:09, 25 November 2025 (UTC)
- Thank you for you response, that was a good catch. It seems like the new website accepts only exact titles as they appear in the URL. I replaced all spaces with underscores and it seems to work: https://www.readability.nl/check/Wikipedia:Large_language_models. Toukouyori Mimoto (talk) 11:20, 25 November 2025 (UTC)
- Toukouyori Mimoto: Thanks. The bot can only update namespace 0, 6 and 10. There are none in those namespace: [34] .. what to do in other namespaces is a general problem because it would require editing pages with esoteric formatting and user posts. If the URLs can be replaced 1:1 it might work. For example this to this .. but it doesn't work. There are only 93 pages total. Maybe I could try replacing with archive URLs.. -- GreenC 05:31, 25 November 2025 (UTC)
Done -- GreenC 16:59, 25 November 2025 (UTC)
nyc-architecture.com usurped for gambling
[edit]Domain usurped and now redirects all links to a main page with the same domain name advertizing online gambling. I get 338 hits through link search. Found it tagged as live link in a ref of an article so I assume it is not tagged as usurped in most others (117 articles from what I saw). Choucas0 š¦āā¬ā š¬ā š 14:24, 24 November 2025 (UTC)
Done via WP:JUDI batch #29 Special:Diff/1324524612/1324542613 -- GreenC 05:05, 28 November 2025 (UTC)
worldfootball.net
[edit]All links to this page are redirecting to new URLs ([35] to [36] and [37] to [38] for example). Slight complication is that some of these links are crunchy 404 - [39] redirects to [40], rather than [41], where it should redirect (similar to kicker.de) - not sure whether there's a way of sending these links to the correct place or how they should be handled if not. 18,335 pages. Microwave Anarchist (talk) 00:40, 28 November 2025 (UTC)
- Microwave Anarchist: I wrote code to convert /player_summary/ URLs because they usually have a trailing number like "/2/" which is the key, in that case it indicates for Club-Matches, then I scrape the HTML and find the URL for the Club-Matches tab. Most of the URLs are player_summery, the rest I'll simply follow the redirects or add an archive URL. -- GreenC 19:00, 28 November 2025 (UTC)
- Sounds good, thank you :) Microwave Anarchist (talk) 19:24, 28 November 2025 (UTC)
- I also wrote code for worldfootball.net/teams/.. As far as I can tell, these two types (teams and players) have crunchy-404 problems, the rest seem to redirect alright. -- GreenC 15:50, 29 November 2025 (UTC)
- Sounds good, thank you :) Microwave Anarchist (talk) 19:24, 28 November 2025 (UTC)
Enwiki
- Batch 1 (00001-01000): Checked 1,000 pages and edited 418 pages. Moved 548 links to a new URL: 137 normal redirects, 408 ruled mapped redirects, 3 ghost mapped redirects, Resolved 9 soft-404s. Switched 3
|url-status=deadto live. Switched 4|url-status=liveto dead. Added 30 archive URLs (29 Wayback).
- Batch 1 (00001-01000): Checked 1,000 pages and edited 418 pages. Moved 548 links to a new URL: 137 normal redirects, 408 ruled mapped redirects, 3 ghost mapped redirects, Resolved 9 soft-404s. Switched 3
- Batch 2 (01001-10000): Checked 9,000 pages and edited 3,912 pages. Moved 6,194 links to a new URL: 1,718 normal redirects, 4,330 ruled mapped redirects, 146 ghost mapped redirects, Resolved 51 soft-404s. Added 32
{{dead link}}. Switched 17|url-status=deadto live. Switched 18|url-status=liveto dead. Added 299 archive URLs (280 Wayback).
- Batch 2 (01001-10000): Checked 9,000 pages and edited 3,912 pages. Moved 6,194 links to a new URL: 1,718 normal redirects, 4,330 ruled mapped redirects, 146 ghost mapped redirects, Resolved 51 soft-404s. Added 32
- Batch 3 (10001-26000): Checked 16,001 pages and edited 6,783 pages. Moved 10,288 links to a new URL: 3,133 normal redirects, 6,986 ruled mapped redirects, 169 ghost mapped redirects, Resolved 145 soft-404s. Added 42
{{dead link}}. Switched 44|url-status=deadto live. Switched 20|url-status=liveto dead. Added 506 archive URLs (472 Wayback).
- Batch 3 (10001-26000): Checked 16,001 pages and edited 6,783 pages. Moved 10,288 links to a new URL: 3,133 normal redirects, 6,986 ruled mapped redirects, 169 ghost mapped redirects, Resolved 145 soft-404s. Added 42
- Batch 4 (26001-42529): Checked 16,529 pages and edited 6,995 pages. Moved 11,330 links to a new URL: 3,294 normal redirects, 7,804 ruled mapped redirects, 232 ghost mapped redirects, Resolved 140 soft-404s. Added 60
{{dead link}}. Switched 44|url-status=deadto live. Switched 27|url-status=liveto dead. Added 598 archive URLs (570 Wayback).
- Batch 4 (26001-42529): Checked 16,529 pages and edited 6,995 pages. Moved 11,330 links to a new URL: 3,294 normal redirects, 7,804 ruled mapped redirects, 232 ghost mapped redirects, Resolved 140 soft-404s. Added 60
IABot DB
- Checked and updated 70,000 URLs
Templates
- Checked 5,528 pages and edited 5,446 pages. Converted 5,704
{{WorldFootball.net}}templates. Example: Special:Diff/1324142802/1325457149
Done -- GreenC 05:20, 3 December 2025 (UTC)
b14643.de
[edit]Please reconfigure the bot. Site www.b14643.de is at domain www.b14643.eu now. Thank you. Flanagancz (talk) 16:30, 28 November 2025 (UTC)
- Flanagancz, OK thanks for the info, I'll get to it and make it live again. The original request that it was usurped is Wikipedia:Link_rot/URL_change_requests#www.b14643.de -- GreenC 16:37, 28 November 2025 (UTC)
- OK, thanks too. Flanagancz (talk) 17:00, 28 November 2025 (UTC)
Enwiki
- Checked 121 pages and edited 97 pages. Moved 143 links to a new URL: 143 ruled mapped redirects, Resolved 48 soft-404s. Added 2
{{dead link}}. Switched 125|url-status=deadto live. Changed 94 citation metadata.
Done -- GreenC 16:56, 3 December 2025 (UTC)
gilbertandsullivanarchive.org
[edit]Moved to gsarchive.net per Special:Diff/1325531943/1325532158 -- GreenC 16:21, 3 December 2025 (UTC)
Enwiki
- Checked 19 pages and edited 17 pages. Moved 22 links to a new URL: 22 ruled mapped redirects, Switched 2
|url-status=deadto live.
Done
thecairopost.com
[edit]Hi,
The mentioned site seems to be dead now.
Here's an example URL:
Thanks, David O. Johnson (talk) 20:19, 3 December 2025 (UTC)
Enwiki
- Checked 132 pages and edited 114 pages. Added 3
{{dead link}}. Switched 18|url-status=liveto dead. Added 172 archive URLs (172 Wayback).
IABot DB
- Checked and updated 212 URLs
Done -- GreenC 03:27, 4 December 2025 (UTC)
sfsite.com
[edit]The SF Site went permanently offline on, or before, 22 October 2023. I checked to see how much Wikipedia uses it, and found that over 1,000 articles make use of it. āSusmuffin Talk 12:12, 6 December 2025 (UTC)
Enwiki
- Checked 1,104 pages and edited 1,002 pages. Added 3
{{dead link}}. Switched 86|url-status=liveto dead. Added 1,113 archive URLs (1,113 Wayback).
IABot DB
- Checked and updated 1,529 links
Done -- GreenC 04:55, 7 December 2025 (UTC)
defense.gov
[edit]Domain now redirects to WAR.gov and old links no longer appear to work, mostly soft-404'd. 6,500 pages. Previous run April 2025 prior to name change. -- GreenC 01:52, 7 December 2025 (UTC)
Enwiki
WaybackMedic in progress[status] -- GreenC 04:56, 7 December 2025 (UTC)