Chromosome 12 open reading frame 71: Difference between revisions
Wikiuser2101 (talk | contribs) No edit summary |
|||
(39 intermediate revisions by 15 users not shown) | |||
Line 1: | Line 1: | ||
{{Short description|Protein encoded in humans by c12orf71 gene}} |
|||
{{User sandbox}}<!-- EDIT BELOW THIS LINE --> |
|||
{{Infobox gene}} |
|||
= Chromosome 12 open reading frame 71 = |
|||
'''Chromosome 12 open reading frame 71 ''(c12orf71)''''' is a protein which in humans is encoded by ''c12orf71'' gene. |
|||
'''Chromosome 12 open reading frame 71 ''(c12orf71)''''' is a [[protein]] which in humans is encoded by ''c12orf71'' [[gene]]. The protein is also known by the alias LOC728858.<ref>{{Cite web |title=C12orf71 Gene - GeneCards {{!}} CL071 Protein {{!}} CL071 Antibody |url=https://www.genecards.org/cgi-bin/carddisp.pl?gene=C12orf71 |access-date=2022-12-16 |website=www.genecards.org}}</ref> |
|||
== Gene == |
== Gene == |
||
The gene is located on the minus strand of chromosome 12 (12p11.23).<ref name=":0">{{Cite journal |date=2022-06-09 |title=Homo sapiens chromosome 12 open reading frame 71 (C12orf71), transcript variant 1, mRNA |url=http://www.ncbi.nlm.nih.gov/nuccore/NM_001080406.2 |language=en-US}}</ref><ref name=":1">{{Cite web |title=AceView: Gene:C12orf71, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView. |url=https://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/av.cgi?db=human&term=c12orf71&submit=Go |access-date=2022-09-30 |website=www.ncbi.nlm.nih.gov}}</ref> The DNA sequence of the ''c12orf71'' gene is 3071 base pairs long and 8 structural variations have been identified including deletions, duplications, gain- and loss-of-function mutations.<ref name=":2">{{Cite web |title=C12orf71 Gene - GeneCards {{!}} CL071 Protein {{!}} CL071 Antibody |url=https://www.genecards.org/cgi-bin/carddisp.pl?gene=C12orf71&keywords=c12orf71 |access-date=2022-09-30 |website=www.genecards.org}}</ref> ''c12orf71'' gene was determined to be altered (gain of 21 Mb) in the chromosomal region 12p11.21-p13.3 of a male patient with chromosomal aberrations and in a duplication (gain of 411 kb) at chromosome 12p11.23 along with ''c12orf70'', the coding regions of ''STK38L'' and ''ARNTL2'' and a portion of ''PPFIBP1.''<ref>{{Cite thesis |title=Caracterização citogenômica de aberrações cromossômicas |url=http://www.teses.usp.br/teses/disponiveis/17/17135/tde-13052020-115149/ |publisher=Universidade de São Paulo |date=2014-06-26 |degree=text |language=pt-br |first=Alexandra Galvão |last=Gomes}}</ref><ref> |
The gene is located on the minus strand of chromosome 12 (12p11.23).<ref name=":0">{{Cite journal |date=2022-06-09 |title=Homo sapiens chromosome 12 open reading frame 71 (C12orf71), transcript variant 1, mRNA |url=http://www.ncbi.nlm.nih.gov/nuccore/NM_001080406.2 |language=en-US|journal=Nucleotide}}</ref><ref name=":1">{{Cite web |title=AceView: Gene:C12orf71, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView. |url=https://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/av.cgi?db=human&term=c12orf71&submit=Go |access-date=2022-09-30 |website=www.ncbi.nlm.nih.gov}}</ref> The DNA sequence of the ''c12orf71'' gene is 3071 base pairs long and 8 significant structural variations have been identified including deletions, duplications, gain- and loss-of-function mutations.<ref name=":2">{{Cite web |title=C12orf71 Gene - GeneCards {{!}} CL071 Protein {{!}} CL071 Antibody |url=https://www.genecards.org/cgi-bin/carddisp.pl?gene=C12orf71&keywords=c12orf71 |access-date=2022-09-30 |website=www.genecards.org}}</ref> ''c12orf71'' gene was determined to be altered (gain of 21 Mb) in the chromosomal region 12p11.21-p13.3 of a male patient with chromosomal aberrations and in a duplication (gain of 411 kb) at chromosome 12p11.23 along with ''c12orf70'', the coding regions of ''[[STK38L]]'' and ''[[ARNTL2]]'' and a portion of ''[[PPFIBP1]].''<ref>{{Cite thesis |title=Caracterização citogenômica de aberrações cromossômicas |url=http://www.teses.usp.br/teses/disponiveis/17/17135/tde-13052020-115149/ |publisher=Universidade de São Paulo |date=2014-06-26 |degree=text |language=pt-br |first=Alexandra Galvão |last=Gomes}}</ref><ref>{{cite journal | vauthors = Pyatt RE, Astbury C | title = Interpretation of copy number alterations identified through clinical microarray-comparative genomic hybridization | journal = Clinics in Laboratory Medicine | volume = 31 | issue = 4 | pages = 565–80, viii | date = December 2011 | pmid = 22118737 | doi = 10.1016/j.cll.2011.08.007 }}</ref> Manual inspection of alignments, has determined that ''c12orf71'' gene is mammalian specific.<ref>{{Cite thesis |title=Insights into mammalian adaptive evolution through genomics data |hdl=10803/397756 |publisher=Universitat Pompeu Fabra |date=2015-11-20 |degree=Ph.D. Thesis |first=Villanueva |last=Cañas}}</ref> Furthermore, genome-wide screening has identified ''c12orf71'' as one of 1000 disrupted genes that are positively selected by [[cisplatin]], a [[chemotherapy]] drug.<ref>{{cite thesis |id={{ProQuest|2665131757}} |last1=Ko |first1=Tengyu |year=2018 |title=Genome-Wide Screening Identifies Genes and Biological Processes Implicated in Chemoresistance and Oncogene-Induced Apoptosis }}</ref> |
||
== RNA == |
== RNA == |
||
[[File: |
[[File:Updated conceptual translation.jpg|thumb|'''Figure 1. Conceptual translation of human c12orf71 with labeled domains, motifs, post-translational modifications and conserved amino acids.''']] |
||
c12orf71 transcript variant 1 mRNA is 1022 nucleotides long and consists of 2 exons. There is one more, slightly longer transcript variant of c12orf71. The mRNA sequence of c12orf71 consists of a coding sequence that spans over |
c12orf71 transcript variant 1 mRNA is 1022 nucleotides long and consists of 2 exons.<ref name=":0" /> There is one more, slightly longer transcript variant of c12orf71, with a length of 1087 nucleotides.<ref>{{Cite journal |date=2022-08-19 |title=Homo sapiens chromosome 12 open reading frame 71 (C12orf71), transcript variant 2, mRNA |url=http://www.ncbi.nlm.nih.gov/nuccore/NM_001384983.1 |language=en-US|journal=Nucleotide}}</ref> The mRNA sequence of c12orf71 transcript variant 1 consists of a coding sequence that spans over two exons and 2 poly-A signal sequences. <ref name=":0" /> |
||
=== Expression === |
=== Expression === |
||
In humans c12orf71 has shown |
In humans c12orf71 has shown an intermediate expression level in [[Testicle|testis]] and low expression in the [[bone marrow]], [[skin]], [[spleen]], [[lymph node]] and [[liver]]. Human c12orf71 is expressed after the fetal-development stage.<ref>{{Cite web |title=C12orf71 chromosome 12 open reading frame 71 [Homo sapiens (human)] - Gene - NCBI |url=https://www.ncbi.nlm.nih.gov/gene/728858 |access-date=2022-09-30 |website=www.ncbi.nlm.nih.gov}}</ref> RNA-sequencing analysis has revealed that c12orf71 was expressed at a very low level or not expressed at all in [[osteoarthritis]] and non-osteoarthritis hip [[cartilage]]. <ref>{{Cite thesis |title=Functional analysis of the osteoarthritis susceptibility loci marked by the polymorphisms rs10492367 and rs9350591 |hdl=10443/3247 |publisher=Newcastle University |date=2016 |degree=Thesis |language=en |first=Katherine |last=Johnson}}</ref> A genome engineering study that studied mice knock-outs has found that c12orf71 has a decreased expression in humans compared to mouse testis, however the absence of the c12orf71 had no effect on mouse [[Fertilisation|fertilization]].<ref>{{cite journal | vauthors = Miyata H, Castaneda JM, Fujihara Y, Yu Z, Archambeault DR, Isotani A, Kiyozumi D, Kriseman ML, Mashiko D, Matsumura T, Matzuk RM, Mori M, Noda T, Oji A, Okabe M, Prunskaite-Hyyrylainen R, Ramirez-Solis R, Satouh Y, Zhang Q, Ikawa M, Matzuk MM | display-authors = 6 | title = Genome engineering uncovers 54 evolutionarily conserved and testis-enriched genes that are not required for male fertility in mice | journal = Proceedings of the National Academy of Sciences of the United States of America | volume = 113 | issue = 28 | pages = 7704–7710 | date = July 2016 | pmid = 27357688 | pmc = 4948324 | doi = 10.1073/pnas.1608458113 | doi-access = free | bibcode = 2016PNAS..113.7704M }}</ref> |
||
== Protein == |
== Protein == |
||
c12orf71 protein is 269 amino acids long and the unmodified precursor protein has a predicted molecular weight of 30.4 kDa and a theoretical [[isoelectric point]] of 5.21.<ref>{{Cite web |title=Compute pI/MW - SIB Swiss Institute of Bioinformatics {{!}} Expasy |url=https://www.expasy.org/resources/compute-pi-mw |access-date=2022-12-08 |website=www.expasy.org}}</ref> Additionally, the protein is rich in [[Serine]] and [[Aspartic acid|Aspartic Acid]] and has a relatively low amount of [[Valine]] and [[Tyrosine]]. <ref>{{Cite web |title=SAPS < Sequence Statistics < EMBL-EBI |url=https://www.ebi.ac.uk/Tools/seqstats/saps/ |access-date=2022-12-08 |website=www.ebi.ac.uk}}</ref> |
|||
=== |
=== Cellular localization === |
||
Cellular localization analysis showed that human c12orf71 protein is found in the [[cytoplasm]] of the cell. All of the orthologs of the protein were also localized to the cytoplasm.<ref name="auto">{{Cite web |title=Services |url=https://services.healthtech.dtu.dk/ |access-date=2022-12-08 |website=DTU Health Tech |language=en}}</ref> [[Immunohistochemistry]] with polyclonal [[antibody]] for c12orf71 localized the protein in the [[cytosol]] of the cell.<ref>{{Cite web |title=C12orf71 protein expression summary - The Human Protein Atlas |url=https://www.proteinatlas.org/ENSG00000214700-C12orf71 |access-date=2022-12-08 |website=www.proteinatlas.org}}</ref> |
|||
=== |
=== Domains === |
||
The first 21 amino acids of the [[Coding region|coding sequence]] are comprising a disordered region, followed by a domain of unknown function (DUF4640) which spans almost the whole coding sequence.<ref>{{Cite web |title=uncharacterized protein C12orf71 [Homo sapiens] - Protein - NCBI |url=https://www.ncbi.nlm.nih.gov/protein/NP_001371912.1 |access-date=2022-09-30 |website=www.ncbi.nlm.nih.gov}}</ref> Additionally, the human protein also contains a vacuolar domain, which is mammal specific and may be modulated by phosphorylation.<ref name=":1" /> |
|||
=== |
=== Post-translation modifications === |
||
[[File:Protein-20221204153600.png|thumb|344x344px|'''Figure 2. c12orf71 Protein diagram with labeled domains and post-translational modifications.''']] |
|||
c12orf71 protein has multiple predicted [[phosphorylation]] sites,<ref name="auto"/> which can have an impact on the protein interactions and sub-cellular localization as well as affect the protein's stability and activity. The protein has one predicted [[SUMO protein|SUMOylation]]<ref>{{Cite web |title=GPS-SUMO: Prediction of SUMOylation Sites & SUMO-interaction Motifs |url=http://sumosp.biocuckoo.org/ |access-date=2022-12-08 |website=sumosp.biocuckoo.org |archive-date=2013-05-10 |archive-url=https://web.archive.org/web/20130510131129/http://sumosp.biocuckoo.org/ |url-status=dead }}</ref> and one [[ubiquitin]]ation predicted site, which can influence many biological functions of the protein, such as cellular response to stress and degradation, respectively. Five different Lysine [[acetylation]]<ref>{{Cite web |title=GPS-PAIL 2.0 - Prediction of Acetylation on Internal Lysines |url=http://pail.biocuckoo.org/online.php |access-date=2022-12-08 |website=pail.biocuckoo.org}}</ref> sites were predicted, which can neutralize the positive charge on the Lysine, but at the same time the transfer of acetyl group can increase the expression of the protein. 2 [[N-linked glycosylation|N-glycosylation]],<ref name="auto"/> multiple O-glycosylation<ref name="auto"/> and O-linked-N-acetylglucosaminylation<ref name="auto"/> sites were predicted, which could potentially affect the protein stability. There is a competition for Lysine-acetylation and ubiquitination at K130, suggesting that a [[Protein deacetylase|deacetylase]] enzyme is acting at this site. |
|||
=== |
=== Interacting proteins === |
||
There is a direct interaction between c12orf71 and [[AP2B1]], with a moderate confidence level. Adaptor related protein complex 2 subunit beta (AP2B1) helps establish a link between [[clathrin]] and receptors in coated vesicles.<ref>{{Cite web |title=PSICQUIC View |url=http://www.ebi.ac.uk/Tools/webservices/psicquic/view/results.xhtml?conversationContext=1 |access-date=2022-12-08 |website=www.ebi.ac.uk}}</ref> c12orf71 protein has been found to be present in a [[Protein–protein interaction|protein-protein interaction]] (PPI) network of the [[Carboxypeptidase M]] (''CPM'') gene, along with nine more genes.<ref name=":3">{{cite journal | vauthors = Asghari Alashti F, Goliaei B, Minuchehr Z | title = Analyzing large scale gene expression data in colorectal cancer reveals important clues; CLCA1 and SELENBP1 downregulated in CRC not in normal and not in adenoma | journal = American Journal of Cancer Research | volume = 12 | issue = 1 | pages = 371–380 | date = 2022-01-15 | pmid = 35141024 | pmc = 8822279 }}</ref> |
|||
[[File:Orthologs table.jpg|thumb|598x598px|'''Table 1. List of c12orf71 Orthologs and Related Properties.''' ]] |
|||
{| class="wikitable" |
|||
|+[https://string-db.org/cgi/network?taskId=bK3IVDmlf4L9&sessionId=boRx46mprqnT Human c12orf71 interacting proteins] |
|||
|'''Protein''' |
|||
|'''Function''' |
|||
|- |
|||
|'''YEATS4''' |
|||
|YEATS domain-containing protein 4; Component of the NuA4 histone acetyltransferase (HAT) complex which is involved in transcriptional activation of select genes principally by acetylation of nucleosomal histones H4 and H2A. |
|||
|- |
|||
|'''[[CPM (gene)|CPM]]''' |
|||
|Carboxypeptidase M; Specifically removes C-terminal basic residues (Arg or Lys) from peptides and proteins. It is believed to play important roles in the control of peptide hormone and growth factor activity at the cell surface, and in the membrane-localized degradation of extracellular proteins; Belongs to the peptidase M14 family |
|||
|- |
|||
|'''ZNF215''' |
|||
|Zinc finger protein 215; May be involved in transcriptional regulation; SCAN domain containing |
|||
|- |
|||
|'''SPACA5B''' |
|||
|Sperm acrosome associated 5B; Enable lysozyme activity |
|||
|- |
|||
|'''SPACA5''' |
|||
|Sperm acrosome-associated protein 5; Belongs to the glycosyl hydrolase 22 family |
|||
|- |
|||
|'''LYZL4''' |
|||
|Lysozyme-like protein 4; May be involved in fertilization (By similarity). Has no detectable [[bacteriolytic]] and lysozyme activities in vitro (By similarity); Belongs to the glycosyl hydrolase 22 family |
|||
|- |
|||
|'''LYZL6''' |
|||
|Lysozyme-like protein 6; May be involved sperm-egg plasma membrane adhesion and fusion during fertilization. Exhibits bacteriolytic activity in vitro against Micrococcus luteus and Staphylococcus aureus. Shows weak bacteriolytic activity against Gram-positive bacteria at physiological pH. Bacteriolytic activity is pH- dependent, with a maximum at around pH 5.6; Lysozymes, c-type |
|||
|- |
|||
|'''NUP107''' |
|||
|Nuclear pore complex protein Nup107; Plays a role in the nuclear pore complex (NPC) assembly and/or maintenance. Required for the assembly of peripheral proteins into the NPC. May anchor NUP62 to the NPC; Belongs to the nucleoporin Nup84/Nup107 family |
|||
|- |
|||
|'''[[SPRYD7|SPRYD4]]''' |
|||
|Spry domain-containing protein 4; SPRY domain containing 4 |
|||
|- |
|||
|'''[[CPSF6]]''' |
|||
|Cleavage and polyadenylation specificity factor subunit 6; Component of the cleavage factor Im complex (CFIm) that plays a key role in pre-mRNA 3'-processing. Involved in association with NUDT21/CPSF5 in pre-MRNA 3'-end poly(A) site cleavage and poly(A) addition. CPSF6 binds to cleavage and polyadenylation RNA substrates and promotes RNA looping; RNA binding motif containing |
|||
|} |
|||
== |
=== Structure === |
||
''Orthologs of the c12orf71'' gene have been found only in mammals, in particular ''Theria'' (marsupials and placentals) including ''Mus muculus'' (mouse), ''Pan troglodytes'' (chimpanzee) and ''Canis lupus familiaris'' (dog) and no paralogs of the gene have been reported.<ref>{{Cite web |title=C12orf71 orthologs |url=https://www.ncbi.nlm.nih.gov/gene/728858/ortholog/ |access-date=2022-09-30 |website=NCBI |language=en}}</ref> The ''Canis lupus'' ortholog of ''c12orf71'', located on chromosome 10, has been localized in proximity to the lysozyme (''Lyz'') gene.<ref>{{Cite journal |last=Irwin |first=David M. |last2=Biegel |first2=Jason M. |last3=Stewart |first3=Caro-Beth |date=2011-06-15 |title=Evolution of the mammalian lysozyme gene family |url=https://doi.org/10.1186/1471-2148-11-166 |journal=BMC Evolutionary Biology |language=en |volume=11 |issue=1 |pages=166 |doi=10.1186/1471-2148-11-166 |issn=1471-2148 |pmc=PMC3141428 |pmid=21676251}}</ref> |
|||
[[File:Alpha fold.jpg|415x415px|'''Figure 3. Predicted tertiary structure of human c12orf71 from [https://alphafold.ebi.ac.uk/entry/A8MTZ7 AlphaFold]''' |alt=Figure 3. Predicted tertiary structure of human c12orf71.|thumb|center]] |
|||
=== Evolutionary History === |
|||
[[File:Phylogenetic tree of c12orf71.jpg|thumb|400x400px|'''Figure 2. ''c12orf71'' Time-calibrated Unrooted Phylogenetic Tree.''' The colored circles correspond to the species classification. Common names of the species were used. The phylogenetic tree was crated using the One-Click Phylogeny Tool <ref name=":3">{{Cite journal |last=Asghari Alashti |first=Fariborz |last2=Goliaei |first2=Bahram |last3=Minuchehr |first3=Zarrin |date=2022-01-15 |title=Analyzing large scale gene expression data in colorectal cancer reveals important clues; CLCA1 and SELENBP1 downregulated in CRC not in normal and not in adenoma |url=https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8822279/ |journal=American Journal of Cancer Research |volume=12 |issue=1 |pages=371–380 |issn=2156-6976 |pmc=8822279 |pmid=35141024}}</ref>.]]It has been estimated that ''c12orf71'' gene first appeared in marsupials approximately 160 million years ago. Among the marsupial species, based on the sequence similarity, the gene has first appeared in species from the Microbotheria taxonomic group, represented by the ''Dromiciops Gliroides'' (Colocolo opossum) species. Only one isoform of the c12orf71 protein has been found in this species. |
|||
== |
== Homology and evolution == |
||
[[Sequence homology|Orthologs]] of the ''c12orf71'' gene have been found only in mammals, in particular ''[[Theria]]'' (marsupials and placentals). No orthologs in [[monotreme]]s, [[bird]]s or [[reptile]]s, [[amphibian]]s, [[fish]], [[invertebrate]]s, [[Fungus|fungi]], [[plant]]s, [[bacteria]], and [[virus]]es<ref>{{Cite web |title=C12orf71 orthologs |url=https://www.ncbi.nlm.nih.gov/gene/728858/ortholog/ |access-date=2022-09-30 |website=NCBI |language=en}}</ref> |
|||
It has been found that c12orf71 protein is one of fifteen proteins that are overexpressed in mesenchymal cells of the olfactory epithelium due to a mutation in ''PSEN1'' gene that causes Familial Alzheimer’s disease, however no function has been indicated.<ref>Hernández, R., & Jhenifer, L. (2019). ''La mutación A431E en PSEN1 causante de enfermedad de Alzheimer Familiar altera el perfil proteómico de las células mesenquimales del epitelio olfatorio'' (Master's thesis, Tesis (MC)--Centro de Investigación y de Estudios Avanzados del IPN Departamento de Biomedicina Molecular).</ref> Additionally, the protein has been listed as a unique protein that is present in humans but absent in Neanderthals and is a modern human protein with most distinct regions against Neanderthals proteins (265 out of 269 amino acids have been determined as relatively unique).<ref>{{Cite journal |last=Hosseini |first=Morteza |last2=Pratas |first2=Diogo |last3=Pinho |first3=Armando J. |date=2019-09 |title=A Probabilistic Method to Find and Visualize Distinct Regions in Protein Sequences |url=https://ieeexplore.ieee.org/abstract/document/8902695 |journal=2019 27th European Signal Processing Conference (EUSIPCO) |pages=1–5 |doi=10.23919/EUSIPCO.2019.8902695}}</ref> |
|||
{| class="wikitable" |
|||
|+Human ''c12orf71'' gene orthologs |
|||
! |
|||
!Species |
|||
!Common name |
|||
!Order |
|||
!Date of divergence (MYA) |
|||
!Percent identity (%) |
|||
!Percent similarity (%) |
|||
!Length (amino acids) |
|||
!Accession number |
|||
|- |
|||
|rowspan= "5"|'''Primate mammals''' |
|||
|''Homo sapiens'' |
|||
|Human |
|||
|Primates |
|||
|0 |
|||
|100.0 |
|||
|100.0 |
|||
|269 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/NP_001073875.1 NP_001073875.1] |
|||
|- |
|||
| |
|||
|''Pan paniscus'' |
|||
|Pygmy chimpanzee |
|||
|Primates |
|||
|6.4 |
|||
|99.3 |
|||
|100.0 |
|||
|269 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_003828900.1 XP_003828900.1] |
|||
|- |
|||
| |
|||
|''Gorilla gorilla gorilla'' |
|||
|Gorilla |
|||
|Primates |
|||
|8.6 |
|||
|96.3 |
|||
|98.1 |
|||
|269 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_004052946.1 XP_004052946.1] |
|||
|- |
|||
| |
|||
|''Pongo abelii'' |
|||
|Sumatran orangutan |
|||
|Primates |
|||
|15.2 |
|||
|95.9 |
|||
|97.0 |
|||
|269 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_002823096.1 XP_002823096.1] |
|||
|- |
|||
| |
|||
|''Nomascus leucogenys'' |
|||
|Gibbon |
|||
|Primates |
|||
|19.6 |
|||
|93.7 |
|||
|95.5 |
|||
|269 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_003265679.1 XP_003265679.1] |
|||
|- |
|||
|rowspan="10"|'''Placental mammals''' |
|||
|''Equus quagga'' |
|||
|Zebra |
|||
|Perissodactyla |
|||
|87 |
|||
|59.9 |
|||
|72.0 |
|||
|279 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_046519988.1 XP_046519988.1] |
|||
|- |
|||
| |
|||
|''Loxodonta africana'' |
|||
|African savannah elephant |
|||
|Proboscidea |
|||
|87 |
|||
|61.3 |
|||
|73.5 |
|||
|279 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_023410800.1 XP_023410800.1] |
|||
|- |
|||
| |
|||
|''Mus musculus'' |
|||
|Mouse |
|||
|Rhodentia |
|||
|94 |
|||
|38.2 |
|||
|51.0 |
|||
|300 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_023410800.1 NP_001157708.1] |
|||
|- |
|||
| |
|||
|''Bos taurus'' |
|||
|Cattle |
|||
|Artiodactyla |
|||
|94 |
|||
|45.6 |
|||
|56.4 |
|||
|347 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_010803693.1 XP_010803693.1] |
|||
|- |
|||
| |
|||
|''Panthera uncia'' |
|||
|Snow leopard |
|||
|Carnivora |
|||
|94 |
|||
|49.7 |
|||
|62.9 |
|||
|325 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_049483174.1 XP_049483174.1] |
|||
|- |
|||
| |
|||
|''Vulpes vulpes'' |
|||
|Red fox |
|||
|Carnivora |
|||
|94 |
|||
|55.0 |
|||
|69.1 |
|||
|281 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_025851313.1 XP_025851313.1] |
|||
|- |
|||
| |
|||
|''Leptonychotes weddellii'' |
|||
|Weddell seal |
|||
|Carnivora |
|||
|94 |
|||
|55.0 |
|||
|70.9 |
|||
|281 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_006740901.1 XP_006740901.1] |
|||
|- |
|||
| |
|||
|''Orycteropus afer afer'' |
|||
|Aardvark |
|||
|Tubulidentata |
|||
|94 |
|||
|55.6 |
|||
|69.9 |
|||
|277 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_007947992.1 XP_007947992.1] |
|||
|- |
|||
| |
|||
|''Pteropus alecto'' |
|||
|Black flying fox |
|||
|Chiroptera |
|||
|94 |
|||
|55.9 |
|||
|72.2 |
|||
|278 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_024903180.1 XP_024903180.1] |
|||
|- |
|||
| |
|||
|''Canis lupus familiaris'' |
|||
|Dog |
|||
|Carnivora |
|||
|99 |
|||
|49.4 |
|||
|62.2 |
|||
|319 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_005637089.1 XP_005637089.1] |
|||
|- |
|||
|rowspan="5"|'''Marsupials''' |
|||
|''Gracilinanus agilis'' |
|||
|Agile gracile opossum |
|||
|Didelphimorphia |
|||
|160 |
|||
|26.4 |
|||
|43.0 |
|||
|341 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_044534595.1 XP_044534595.1] |
|||
|- |
|||
| |
|||
|''Trichosurus vulpecula'' |
|||
|Common brushtail possum |
|||
|Diprotodontia |
|||
|160 |
|||
|30.1 |
|||
|44.2 |
|||
|320 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_036616685.1 XP_036616685.1] |
|||
|- |
|||
| |
|||
|''Sarcophilus harrisii'' |
|||
|Tasmanian devil |
|||
|Dasyuromorphia |
|||
|160 |
|||
|30.2 |
|||
|45.9 |
|||
|318 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_003772759.2 XP_003772759.2] |
|||
|- |
|||
| |
|||
|''Vombatus ursinus'' |
|||
|Common wombat |
|||
|Diprotodontia |
|||
|160 |
|||
|31.7 |
|||
|44.6 |
|||
|321 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_027700114.1 XP_027700114.1] |
|||
|- |
|||
| |
|||
|''Dromiciops gliroides'' |
|||
|Colocolo opossum |
|||
|Microbiotheria |
|||
|160 |
|||
|32.3 |
|||
|45.9 |
|||
|317 |
|||
|[https://www.ncbi.nlm.nih.gov/protein/XP_043823568.1 XP_043823568.1] |
|||
|} |
|||
== |
=== Evolutionary history === |
||
There is only one disease associated with ''c12orf71'' gene, common warts.<ref name=":1" /> A study of global gene methylation of common warts caused by HPV infection found that ''c12orf71'' gene is differentially methylated in Arab male patients with common warts. In particular, ''c12orf71'' is hypomethylated in skin infected with common warts compared to normal skin.<ref>{{Cite journal |last=Alghamdi |first=Mansour A. |last2=AL-Eitan |first2=Laith N. |last3=Tarkhan |first3=Amneh H. |last4=Al-Qarqaz |first4=Firas A. |date=2021-01-01 |title=Global gene methylation profiling of common warts caused by human papillomaviruses infection |url=https://www.sciencedirect.com/science/article/pii/S1319562X20305313 |journal=Saudi Journal of Biological Sciences |language=en |volume=28 |issue=1 |pages=612–622 |doi=10.1016/j.sjbs.2020.10.050 |issn=1319-562X}}</ref> 10 SNPs from the GWAS catalog associate ''c12orf71'' gene with obsolete and androgenic alopecia, healing of bone mineral density and educational attainment.<ref name=":2" /> |
|||
It has been estimated that ''c12orf71'' gene first appeared in [[marsupial]]s approximately 160 million years ago. Among the marsupial species, based on the sequence similarity, the gene has first appeared in species from the [[Microbiotheria|Microbotheria]] taxonomic group, represented by the ''[[Monito del monte|Dromiciops Gliroides]]'' (Colocolo opossum) species. Only one [[Protein isoform|isoform]] of the c12orf71 protein has been found in this species. |
|||
=== SNPs === |
|||
[[File:Phylogenetic tree of c12orf71.jpg|thumb|345x345px|'''Figure 4. ''c12orf71'' Time-calibrated Unrooted Phylogenetic Tree.''' The colored circles correspond to the species classification. Common names of the species were used. The phylogenetic tree was created using the One-Click Phylogeny Tool <ref name=":3"/>.|center]] |
|||
== Clinical association == |
|||
There is only one disease associated with ''c12orf71'' gene, [[Wart|common warts]].<ref name=":1" /> A study of global gene methylation of common warts caused by [[Human papillomavirus infection|HPV infection]] found that ''c12orf71'' gene is differentially methylated in Arab male patients with common warts. In particular, ''c12orf71'' is hypomethylated in skin infected with common warts compared to normal skin.<ref>{{cite journal | vauthors = Alghamdi MA, Al-Eitan LN, Tarkhan AH, Al-Qarqaz FA | title = Global gene methylation profiling of common warts caused by human papillomaviruses infection | journal = Saudi Journal of Biological Sciences | volume = 28 | issue = 1 | pages = 612–622 | date = January 2021 | pmid = 33424347 | pmc = 7783806 | doi = 10.1016/j.sjbs.2020.10.050 | s2cid = 228813393 }}</ref> 10 [[Single-nucleotide polymorphism|SNPs]] from the [[GWAS catalog]] associate ''c12orf71'' gene with obsolete and [[Pattern hair loss|androgenic alopecia]], healing of bone mineral density and educational attainment.<ref name=":2" /> |
|||
== References == |
== References == |
||
{{Reflist}} |
|||
<references group="" responsive="1"></references> |
Latest revision as of 18:02, 27 March 2024
C12orf71 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C12orf71, chromosome 12 open reading frame 71 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1920594; HomoloGene: 53485; GeneCards: C12orf71; OMA:C12orf71 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Chromosome 12 open reading frame 71 (c12orf71) is a protein which in humans is encoded by c12orf71 gene. The protein is also known by the alias LOC728858.[5]
Gene
[edit]The gene is located on the minus strand of chromosome 12 (12p11.23).[6][7] The DNA sequence of the c12orf71 gene is 3071 base pairs long and 8 significant structural variations have been identified including deletions, duplications, gain- and loss-of-function mutations.[8] c12orf71 gene was determined to be altered (gain of 21 Mb) in the chromosomal region 12p11.21-p13.3 of a male patient with chromosomal aberrations and in a duplication (gain of 411 kb) at chromosome 12p11.23 along with c12orf70, the coding regions of STK38L and ARNTL2 and a portion of PPFIBP1.[9][10] Manual inspection of alignments, has determined that c12orf71 gene is mammalian specific.[11] Furthermore, genome-wide screening has identified c12orf71 as one of 1000 disrupted genes that are positively selected by cisplatin, a chemotherapy drug.[12]
RNA
[edit]c12orf71 transcript variant 1 mRNA is 1022 nucleotides long and consists of 2 exons.[6] There is one more, slightly longer transcript variant of c12orf71, with a length of 1087 nucleotides.[13] The mRNA sequence of c12orf71 transcript variant 1 consists of a coding sequence that spans over two exons and 2 poly-A signal sequences. [6]
Expression
[edit]In humans c12orf71 has shown an intermediate expression level in testis and low expression in the bone marrow, skin, spleen, lymph node and liver. Human c12orf71 is expressed after the fetal-development stage.[14] RNA-sequencing analysis has revealed that c12orf71 was expressed at a very low level or not expressed at all in osteoarthritis and non-osteoarthritis hip cartilage. [15] A genome engineering study that studied mice knock-outs has found that c12orf71 has a decreased expression in humans compared to mouse testis, however the absence of the c12orf71 had no effect on mouse fertilization.[16]
Protein
[edit]c12orf71 protein is 269 amino acids long and the unmodified precursor protein has a predicted molecular weight of 30.4 kDa and a theoretical isoelectric point of 5.21.[17] Additionally, the protein is rich in Serine and Aspartic Acid and has a relatively low amount of Valine and Tyrosine. [18]
Cellular localization
[edit]Cellular localization analysis showed that human c12orf71 protein is found in the cytoplasm of the cell. All of the orthologs of the protein were also localized to the cytoplasm.[19] Immunohistochemistry with polyclonal antibody for c12orf71 localized the protein in the cytosol of the cell.[20]
Domains
[edit]The first 21 amino acids of the coding sequence are comprising a disordered region, followed by a domain of unknown function (DUF4640) which spans almost the whole coding sequence.[21] Additionally, the human protein also contains a vacuolar domain, which is mammal specific and may be modulated by phosphorylation.[7]
Post-translation modifications
[edit]c12orf71 protein has multiple predicted phosphorylation sites,[19] which can have an impact on the protein interactions and sub-cellular localization as well as affect the protein's stability and activity. The protein has one predicted SUMOylation[22] and one ubiquitination predicted site, which can influence many biological functions of the protein, such as cellular response to stress and degradation, respectively. Five different Lysine acetylation[23] sites were predicted, which can neutralize the positive charge on the Lysine, but at the same time the transfer of acetyl group can increase the expression of the protein. 2 N-glycosylation,[19] multiple O-glycosylation[19] and O-linked-N-acetylglucosaminylation[19] sites were predicted, which could potentially affect the protein stability. There is a competition for Lysine-acetylation and ubiquitination at K130, suggesting that a deacetylase enzyme is acting at this site.
Interacting proteins
[edit]There is a direct interaction between c12orf71 and AP2B1, with a moderate confidence level. Adaptor related protein complex 2 subunit beta (AP2B1) helps establish a link between clathrin and receptors in coated vesicles.[24] c12orf71 protein has been found to be present in a protein-protein interaction (PPI) network of the Carboxypeptidase M (CPM) gene, along with nine more genes.[25]
Protein | Function |
YEATS4 | YEATS domain-containing protein 4; Component of the NuA4 histone acetyltransferase (HAT) complex which is involved in transcriptional activation of select genes principally by acetylation of nucleosomal histones H4 and H2A. |
CPM | Carboxypeptidase M; Specifically removes C-terminal basic residues (Arg or Lys) from peptides and proteins. It is believed to play important roles in the control of peptide hormone and growth factor activity at the cell surface, and in the membrane-localized degradation of extracellular proteins; Belongs to the peptidase M14 family |
ZNF215 | Zinc finger protein 215; May be involved in transcriptional regulation; SCAN domain containing |
SPACA5B | Sperm acrosome associated 5B; Enable lysozyme activity |
SPACA5 | Sperm acrosome-associated protein 5; Belongs to the glycosyl hydrolase 22 family |
LYZL4 | Lysozyme-like protein 4; May be involved in fertilization (By similarity). Has no detectable bacteriolytic and lysozyme activities in vitro (By similarity); Belongs to the glycosyl hydrolase 22 family |
LYZL6 | Lysozyme-like protein 6; May be involved sperm-egg plasma membrane adhesion and fusion during fertilization. Exhibits bacteriolytic activity in vitro against Micrococcus luteus and Staphylococcus aureus. Shows weak bacteriolytic activity against Gram-positive bacteria at physiological pH. Bacteriolytic activity is pH- dependent, with a maximum at around pH 5.6; Lysozymes, c-type |
NUP107 | Nuclear pore complex protein Nup107; Plays a role in the nuclear pore complex (NPC) assembly and/or maintenance. Required for the assembly of peripheral proteins into the NPC. May anchor NUP62 to the NPC; Belongs to the nucleoporin Nup84/Nup107 family |
SPRYD4 | Spry domain-containing protein 4; SPRY domain containing 4 |
CPSF6 | Cleavage and polyadenylation specificity factor subunit 6; Component of the cleavage factor Im complex (CFIm) that plays a key role in pre-mRNA 3'-processing. Involved in association with NUDT21/CPSF5 in pre-MRNA 3'-end poly(A) site cleavage and poly(A) addition. CPSF6 binds to cleavage and polyadenylation RNA substrates and promotes RNA looping; RNA binding motif containing |
Structure
[edit]Homology and evolution
[edit]Orthologs of the c12orf71 gene have been found only in mammals, in particular Theria (marsupials and placentals). No orthologs in monotremes, birds or reptiles, amphibians, fish, invertebrates, fungi, plants, bacteria, and viruses[26]
Species | Common name | Order | Date of divergence (MYA) | Percent identity (%) | Percent similarity (%) | Length (amino acids) | Accession number | |
---|---|---|---|---|---|---|---|---|
Primate mammals | Homo sapiens | Human | Primates | 0 | 100.0 | 100.0 | 269 | NP_001073875.1 |
Pan paniscus | Pygmy chimpanzee | Primates | 6.4 | 99.3 | 100.0 | 269 | XP_003828900.1 | |
Gorilla gorilla gorilla | Gorilla | Primates | 8.6 | 96.3 | 98.1 | 269 | XP_004052946.1 | |
Pongo abelii | Sumatran orangutan | Primates | 15.2 | 95.9 | 97.0 | 269 | XP_002823096.1 | |
Nomascus leucogenys | Gibbon | Primates | 19.6 | 93.7 | 95.5 | 269 | XP_003265679.1 | |
Placental mammals | Equus quagga | Zebra | Perissodactyla | 87 | 59.9 | 72.0 | 279 | XP_046519988.1 |
Loxodonta africana | African savannah elephant | Proboscidea | 87 | 61.3 | 73.5 | 279 | XP_023410800.1 | |
Mus musculus | Mouse | Rhodentia | 94 | 38.2 | 51.0 | 300 | NP_001157708.1 | |
Bos taurus | Cattle | Artiodactyla | 94 | 45.6 | 56.4 | 347 | XP_010803693.1 | |
Panthera uncia | Snow leopard | Carnivora | 94 | 49.7 | 62.9 | 325 | XP_049483174.1 | |
Vulpes vulpes | Red fox | Carnivora | 94 | 55.0 | 69.1 | 281 | XP_025851313.1 | |
Leptonychotes weddellii | Weddell seal | Carnivora | 94 | 55.0 | 70.9 | 281 | XP_006740901.1 | |
Orycteropus afer afer | Aardvark | Tubulidentata | 94 | 55.6 | 69.9 | 277 | XP_007947992.1 | |
Pteropus alecto | Black flying fox | Chiroptera | 94 | 55.9 | 72.2 | 278 | XP_024903180.1 | |
Canis lupus familiaris | Dog | Carnivora | 99 | 49.4 | 62.2 | 319 | XP_005637089.1 | |
Marsupials | Gracilinanus agilis | Agile gracile opossum | Didelphimorphia | 160 | 26.4 | 43.0 | 341 | XP_044534595.1 |
Trichosurus vulpecula | Common brushtail possum | Diprotodontia | 160 | 30.1 | 44.2 | 320 | XP_036616685.1 | |
Sarcophilus harrisii | Tasmanian devil | Dasyuromorphia | 160 | 30.2 | 45.9 | 318 | XP_003772759.2 | |
Vombatus ursinus | Common wombat | Diprotodontia | 160 | 31.7 | 44.6 | 321 | XP_027700114.1 | |
Dromiciops gliroides | Colocolo opossum | Microbiotheria | 160 | 32.3 | 45.9 | 317 | XP_043823568.1 |
Evolutionary history
[edit]It has been estimated that c12orf71 gene first appeared in marsupials approximately 160 million years ago. Among the marsupial species, based on the sequence similarity, the gene has first appeared in species from the Microbotheria taxonomic group, represented by the Dromiciops Gliroides (Colocolo opossum) species. Only one isoform of the c12orf71 protein has been found in this species.
Clinical association
[edit]There is only one disease associated with c12orf71 gene, common warts.[7] A study of global gene methylation of common warts caused by HPV infection found that c12orf71 gene is differentially methylated in Arab male patients with common warts. In particular, c12orf71 is hypomethylated in skin infected with common warts compared to normal skin.[27] 10 SNPs from the GWAS catalog associate c12orf71 gene with obsolete and androgenic alopecia, healing of bone mineral density and educational attainment.[8]
References
[edit]- ^ a b c GRCh38: Ensembl release 89: ENSG00000214700 – Ensembl, May 2017
- ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000040163 – Ensembl, May 2017
- ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ "C12orf71 Gene - GeneCards | CL071 Protein | CL071 Antibody". www.genecards.org. Retrieved 2022-12-16.
- ^ a b c "Homo sapiens chromosome 12 open reading frame 71 (C12orf71), transcript variant 1, mRNA". Nucleotide. 2022-06-09.
- ^ a b c "AceView: Gene:C12orf71, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2022-09-30.
- ^ a b "C12orf71 Gene - GeneCards | CL071 Protein | CL071 Antibody". www.genecards.org. Retrieved 2022-09-30.
- ^ Gomes, Alexandra Galvão (2014-06-26). Caracterização citogenômica de aberrações cromossômicas (text thesis) (in Brazilian Portuguese). Universidade de São Paulo.
- ^ Pyatt RE, Astbury C (December 2011). "Interpretation of copy number alterations identified through clinical microarray-comparative genomic hybridization". Clinics in Laboratory Medicine. 31 (4): 565–80, viii. doi:10.1016/j.cll.2011.08.007. PMID 22118737.
- ^ Cañas, Villanueva (2015-11-20). Insights into mammalian adaptive evolution through genomics data (Ph.D. Thesis thesis). Universitat Pompeu Fabra. hdl:10803/397756.
- ^ Ko, Tengyu (2018). Genome-Wide Screening Identifies Genes and Biological Processes Implicated in Chemoresistance and Oncogene-Induced Apoptosis (Thesis). ProQuest 2665131757.
- ^ "Homo sapiens chromosome 12 open reading frame 71 (C12orf71), transcript variant 2, mRNA". Nucleotide. 2022-08-19.
- ^ "C12orf71 chromosome 12 open reading frame 71 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-09-30.
- ^ Johnson, Katherine (2016). Functional analysis of the osteoarthritis susceptibility loci marked by the polymorphisms rs10492367 and rs9350591 (Thesis thesis). Newcastle University. hdl:10443/3247.
- ^ Miyata H, Castaneda JM, Fujihara Y, Yu Z, Archambeault DR, Isotani A, et al. (July 2016). "Genome engineering uncovers 54 evolutionarily conserved and testis-enriched genes that are not required for male fertility in mice". Proceedings of the National Academy of Sciences of the United States of America. 113 (28): 7704–7710. Bibcode:2016PNAS..113.7704M. doi:10.1073/pnas.1608458113. PMC 4948324. PMID 27357688.
- ^ "Compute pI/MW - SIB Swiss Institute of Bioinformatics | Expasy". www.expasy.org. Retrieved 2022-12-08.
- ^ "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2022-12-08.
- ^ a b c d e "Services". DTU Health Tech. Retrieved 2022-12-08.
- ^ "C12orf71 protein expression summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2022-12-08.
- ^ "uncharacterized protein C12orf71 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-09-30.
- ^ "GPS-SUMO: Prediction of SUMOylation Sites & SUMO-interaction Motifs". sumosp.biocuckoo.org. Archived from the original on 2013-05-10. Retrieved 2022-12-08.
- ^ "GPS-PAIL 2.0 - Prediction of Acetylation on Internal Lysines". pail.biocuckoo.org. Retrieved 2022-12-08.
- ^ "PSICQUIC View". www.ebi.ac.uk. Retrieved 2022-12-08.
- ^ a b Asghari Alashti F, Goliaei B, Minuchehr Z (2022-01-15). "Analyzing large scale gene expression data in colorectal cancer reveals important clues; CLCA1 and SELENBP1 downregulated in CRC not in normal and not in adenoma". American Journal of Cancer Research. 12 (1): 371–380. PMC 8822279. PMID 35141024.
- ^ "C12orf71 orthologs". NCBI. Retrieved 2022-09-30.
- ^ Alghamdi MA, Al-Eitan LN, Tarkhan AH, Al-Qarqaz FA (January 2021). "Global gene methylation profiling of common warts caused by human papillomaviruses infection". Saudi Journal of Biological Sciences. 28 (1): 612–622. doi:10.1016/j.sjbs.2020.10.050. PMC 7783806. PMID 33424347. S2CID 228813393.