This page lists gene articles which are not linked from the base name. For example, there is no obvious route from ACR to ACR (gene). FOO is used as a placeholder to denote the base name such as ACR.
FOO is (or redirects to) a dab which does not list the gene.
Done Section completed: add an entry for FOO (gene) to existing dab FOO.
FOO is (or redirects to) an article about an unrelated primary topic. FOO (disambiguation) is (or redirects to) a dab which does not list the gene.
Done: Section completed: add an entry for FOO (gene) to existing dab FOO (disambiguation).
FOO is (or redirects to) an article about an unrelated topic. FOO (disambiguation) does not exist.
Fix: If the incumbent article is not primary, move it to FOO (topic) and list it along with the gene on a new dab FOO. Check for incoming links to FOO and update these. If the topic is primary but the initials also denote other topics, create FOO (disambiguation). Otherwise, the primary topic article needs a hatnote to the gene.
Done Section complete except for CTU2, which is the actual name of the C16orf84 gene: requesting a second opinion from PamD or Seppi333.
FOO describes an enzyme or protein related to FOO (gene) but does not link to the gene.
Fix: Expert advice is needed.
{{
Infobox gene}}
template when this happens (i.e., the duplicate article has a gene infobox but the primary article does not). Done
Seppi333 (
Insert 2¢) 23:23, 29 November 2019 (UTC)See individual entries for a description of each anomaly.
Fix: Expert advice is needed.
Merged the wikidata sitelinks for NFATC2IP, KCTD9, and NFAM1 and the corresponding (gene) pages. Will deal with the rest a bit later. Seppi333 ( Insert 2¢) 00:10, 30 November 2019 (UTC)
Re- ALG2 (gene): I think it may be worth recoding and rerunning my User:Seppi333/GeneListNLP script to detect/write a list of target pages that are wikilinked from the gene lists and that contain all 5 of the words "Set", "index" "page", "lists", and "articles" on them in order to identify links to set index articles, unless you can locate those with an SQL query. The last time I ran that script, it took 1:33:45 (1.5 hrs) to download and process all the pages, so if it's possible to locate them using another method, it'd probably best to do that instead. Seppi333 ( Insert 2¢) 01:23, 30 November 2019 (UTC)
FOO links to FOO (gene) (or the target of that redirect) in a complex way not spotted by the Quarry queries.
Fix: probably no action but we may consider a more direct link.
Here are some other link issues raised by the gene lists. They need an expert to fix them because the suggested fix may be wrong, they may indicate wider problems, or the initialism redirect might merit conversion into a dab.
The gene lists link directly to a page which is not in gene categories. These fall into two sections.
1. The target page appears not to be a gene. The link needs to be corrected. In each case, incoming links suggest that the non-gene article is the primary topic, but we could consider moving that article and creating a dab.
2. The target page appears to be a gene or closely related topic. Links may be correct but the gene page could be added to appropriate gene categories.
The gene lists link to a redirect to a page which is not in gene categories.
Ahh. I was wondering why my NLP script didn’t locate those... it’s the hatnotes. I should probably reprogram it to fix that bug. Will fix these pages later tonight and (nothing to fix, exception maybe conversion to DABs; I think you guys are better judges of when/how to disambiguate than I though, so I'll leave it to you) revise the wikitables once we locate all these pages.
Seppi333 (
Insert 2¢) 02:02, 1 December 2019 (UTC)
@ Seppi333: I've fixed incoming links apart from the gene lists which should link to CHML (gene) rather than CHML, AAMP (gene) rather than AAMP, etc. I see that some of these have been done manually in the lists (though a piped link might be better) but not in the Python. Also, do you have any thoughts about AKNA, CD96 and WRAP53? Certes ( talk) 00:25, 16 December 2019 (UTC)
Note: immediately after each bulleted entry below, there are two index values listed: My original script detected links to articles where none of 4 gene-related terms (i.e., "gene", "genes", "protein", "proteins") were found anywhere in the article's source code (NB: these links would be marked with The updated algorithm also listed all articles that included specific gene-related multi-word expressions (i.e., the following phrases: "the gene", "the genes", "the protein", "the proteins", "the enzyme", "the enzymes", "(gene)", "(enzyme)", and "(protein)") in the parameters of certain lead hatnotes if any were present – specifically, the
Entries in this list are articles where none of these 5 single-word tokens –
Entries in this list are articles where one or more of these 5 single-word tokens –
|
I went through all the links and fixed problems that I found. In addition to the 4 you identified ( CHML, DR1, HPX, and PIM2), it looks like only DDT is new. I'll fix these links in the lists shortly. Seppi333 ( Insert 2¢) 15:58, 24 December 2019 (UTC)
Then they synthesized the 24-mcr (MIF1RPNVGAMSNFYHYPNIIIII:) designed to form a four-stranded 13-sheet and to bind the insecticide DDT. It did indeed...). Working on recoding the python script for the list pages right now. Seppi333 ( Insert 2¢) 17:23, 24 December 2019 (UTC)