![]() | This is an archive of past discussions on Wikipedia:Link rot. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current main page. |
Old URLs for The Times don't work. While some of these have new URLs at thetimes.com, they can't be easily converted . For example, this is now here for Adele. Unfortunately, I think all of these links and the subdomains (entertainment.timesonline.co.uk, business.timesonline.co.uk, etc.) will need archives. It might be easier to do the subdomains first. Some articles already have archived links added like at Premier League. 15,000+ articles altogether. Thank you! MrLinkinPark333 (talk) 19:34, 12 September 2024 (UTC)
This is a difficult project due to a large number of soft-404s within archives:
soft404 rules for archives
|
---|
if url ~ "(the)?(sunday)?(times(plus|online)?)[.]co[.]uk": if url ~ "login=false": return "Check 6.131" if url ~ "(the)?(sunday)?(times(plus|online)?)[.]co[.]uk/[st][to][lo]/[?]CMP=": return "Check 6.132" if url ~ "(the)?(sunday)?(times(plus|online)?)[.]co[.]uk/[st][to][lo]/news/?([?](token=null|id=[a-zA-Z0-9]{2,10}$))?": return "Check 6.137" if url ~ "(the)?(sunday)?(times(plus|online)?)[.]co[.]uk/[st][to][lo]/(news|news/world|tv-radio|business|travel|arts|arts/(film/reviews|tv-radio))/?$": return "Check 6.135" if url ~ "the-tls[.]co[.]uk/tls/?$": return "Check 6.136" gsubs("://", "__T__", url) if url ~ "//": return "Check 6.133" gsubs("__T__", "://", url) if url ~ "obituaries/?$": return "Check 6.134" |
..where "url" is the redirected URL the page was saved from, as indicated on the archive page ie. not the URL on wiki or the live redirect (if any).
Enwiki
{{dead link}}
. Added 6,721 {{dead link}}
. Switched 28 |url-status=dead
to live. Switched 1,736 |url-status=live
to dead. Added 8,624 archive URLs (7,156 Wayback). Changed 593 citation metadata.{{dead link}}
is unfortunate, it represents the problem noted above of archives containing soft-404. -- GreenC 19:21, 26 September 2024 (UTC)IABot DB
Done -- GreenC 16:25, 30 September 2024 (UTC)
@User:GreenC, In Tewiki, we have more than 10,400 pages in the category CS1 errors: archive-url. Almost 99% of these are "timestamp mismatch" errors. Can you plesase run WaybackMedic_2.5 to correct the error in these pages? Thank you. __ Chaduvari (talk) 15:59, 31 July 2024 (UTC)
{{cite web}}
that use English-language parameters like |archive-url=
. GreenC 19:25, 31 July 2024 (UTC)
January | జనవరి |
February | ఫిబ్రవరి |
March | మార్చి |
April | ఏప్రిల్ |
May | మే |
June | జూన్ |
July | జూలై |
August | ఆగస్టు |
September | సెప్టెంబరు |
October | అక్టోబరు |
November | నవంబరు |
December | డిసెంబరు |
|archive-date=2024-09-24
. Most of the problems will probably be archive.today and webcitation.org (if any) so I would check every citation template with one of these archives and then reset the archive-date to ISO format, based on the value in the URL. -- GreenC 16:56, 26 September 2024 (UTC)User:Chaduvari, the tracking category was reduced from 10,400 to 664 for a 94% reduction. The bot I wrote only fixes mismatches in dates. There are other types of errors tracked in that category that bot does not fix. For example citations with an |archive-date=
but no |archive-url=
(or other way around). Or citations with |archive-url=
but no |url=
. These are more complex to automatically fix. -- GreenC 04:03, 2 October 2024 (UTC)
Old URLs for foxnews.com with numeric IDs either redirect to new URLs, redirect to the wrong page or are broken. Working URLs are mainly at www.foxnews.com/story/article-name
~3,200 articles.
Thank you! MrLinkinPark333 (talk) 20:48, 12 September 2024 (UTC)
Enwiki
{{dead link}}
. Added 6 {{dead link}}
. Switched 900 |url-status=dead
to live. Switched 10 |url-status=live
to dead. Added 240 archive URLs (198 Wayback). Changed 175 citation metadata.IABot DB
Done -- GreenC 04:25, 2 October 2024 (UTC)
https://dnd.wizards.com
now mostly redirects to https://www.dndbeyond.com
; website was used as a primary source for various D&D articles. It looks like links that start with https://dnd.wizards.com/news/, https://dnd.wizards.com/articles/, https://dnd.wizards.com/dndstudioblog, https://dnd.wizards.com/dungeons-and-dragons, etc
redirect to the D&D Beyond home page or change log. Some (like https://dnd.wizards.com/products/
) redirect to similar pages on D&D Beyond but the D&D Beyond page often contains less information (such as not having the ISBN, author credits or other production info) so I think the whole lot should be marked as dead. Thanks! Sariel Xilo (talk) 22:29, 20 September 2024 (UTC)
159 pages -- GreenC 04:01, 21 September 2024 (UTC)
Enwiki
{{dead link}}
. Switched 65 |url-status=live
to dead. Added 169 archive URLs (159 Wayback). Changed 413 citation metadata.IABot DB
Done -- GreenC 01:37, 7 October 2024 (UTC)
Each of the 30 MLB teams has a dead subdomain of the form <location>.<teamname>.mlb.com that should be archived, for example losangeles.angels.mlb.com. These now redirect to sites of the form mlb.com/<teamname>, and all content in the subdomains seems to be dead.
I combined the searches into 6 batches of 5 teams each, as combining all teams into one regex expression timed out the search and I didn't want to individually list the results for all 30 teams. I hope it isn't too difficult to process 30 different subdomains?
(Also, for some reason the searches counted a few pages where the text happened to contain <teamname>|mlb.com instead of <teamname>.mlb.com.)
diamondbacks, braves, orioles, redsox, cubs: 1,305 pages.
whitesox, reds, indians, rockies, tigers: 1,181 pages.
astros, royals, angels, dodgers, marlins: 1,134 pages.
brewers, twins, mets, yankees, athletics: 1,118 pages.
phillies, pirates, padres, giants, mariners: 1,304 pages.
cardinals, rays, devilrays (both are subdomains for the same team), rangers, bluejays, nationals: 1,260 pages. Helpful Raccoon (talk) 05:16, 14 September 2024 (UTC)
Enwiki
{{dead link}}
. Switched 1,160 |url-status=live
to dead. Added 5,495 archive URLs (5,431 Wayback). Changed 721 citation metadata.{{dead link}}
-- GreenC 21:27, 3 October 2024 (UTC)
{{dead link}}
. I am beginning to reprocessing those at a slower pace. -- GreenC 15:35, 5 October 2024 (UTC){{dead link}}
" from above, due to Wayback Machine timeouts. Converted 2,388 {{dead link}}
to archive URLs. -- GreenC 17:59, 6 October 2024 (UTC)IABot DB
Done -- GreenC 14:14, 8 October 2024 (UTC)
RFI Vietnamese, VTC News and Zing News changed their domain names:
Billboard Vietnam website (billboardvn.vn) has been closed. Cherry Cotton Candy (talk) 09:05, 22 September 2024 (UTC)
12 pages — Preceding unsigned comment added by GreenC (talk • contribs)
197 pages — Preceding unsigned comment added by GreenC (talk • contribs)
{{dead link}}
. Switched 3 |url-status=dead
to live. Switched 2 |url-status=live
to dead. Added 15 archive URLs (11 Wayback). -- GreenC 04:23, 7 October 2024 (UTC)246 pages — Preceding unsigned comment added by GreenC (talk • contribs)
{{dead link}}
. Switched 113 |url-status=dead
to live. Added 9 archive URLs (4 Wayback). -- GreenC 21:08, 7 October 2024 (UTC)Billboard 130 pages — Preceding unsigned comment added by GreenC (talk • contribs)
Thanhniennews 261 pages. These websites have been closed. Cherry Cotton Candy (talk) 03:59, 23 September 2024 (UTC)
{{dead link}}
. Switched 92 |url-status=live
to dead. Added 178 archive URLs (139 Wayback). -- GreenC 16:25, 8 October 2024 (UTC)41 pages. Some articles can be found manually on tuoitre.vn, for example:
Cherry Cotton Candy (talk) 03:59, 23 September 2024 (UTC)
124 pages. Some articles can be found manually on thanhnien.vn, for example:
Cherry Cotton Candy (talk) 03:59, 23 September 2024 (UTC)
49 pages. Few articles can be found manually on laodong.vn, for example:
Cherry Cotton Candy (talk) 03:59, 23 September 2024 (UTC)
Done -- GreenC 18:19, 8 October 2024 (UTC)
These (currently) 299 results ought to have "/operator/airline.php?var=" replaced by "/operators/". Updating the redirected domain "aviation-safety.net" to "asn.flightsafety.org" could be done along the way as well. 1234qwer1234qwer4 16:02, 24 September 2024 (UTC)
Enwiki
|url-status=dead
to live. Switched 2 |url-status=live
to dead. Added 22 archive URLs (21 Wayback).IABot DB
Done -- GreenC 22:52, 8 October 2024 (UTC)
260 pages that should have "planespotters.net/Airline/" changed to "planespotters.net/airline/". 1234qwer1234qwer4 17:16, 24 September 2024 (UTC)
{{dead link}}
. Added 1 {{dead link}}
. Switched 99 |url-status=dead
to live. Added 22 archive URLs (13 Wayback). Done -- GreenC 23:13, 8 October 2024 (UTC)