Whale Shark Positively selected genes
Contents:
- Anti-clotting: FGG, FGB, SEPRINE2
- Cell division: HAUS1, CENPI, CENPK, MCM3AP, CHAF1A, HP1BP3, C1orF35
- Cancer associated: RET, GSS, SERPINF1, NFKBIE
- Programmed cell death: CASP7, PDCD2, PDCD6IP
- DNA repair genes: MSH6, RAD54B, MRE11A, XRCC3, MCM8, DCLRE1B
- Other proteins: KIAA1143, GNL2, CTDP1, MTIF2, DECR1, ATE1, XXYLT1, PLEKHM3
- Brain and neuron related: NCDN
(Anti-clotting: FGG, FGB, SEPRINE2)
FGG
FGG 143 N E probably damaging deleterious fibrinogen gamma chain
FGG 167 D G probably damaging deleterious fibrinogen gamma chain
FGG 171 I A probably damaging deleterious fibrinogen gamma chain
FGG 176 G A probably damaging deleterious fibrinogen gamma chain
FGG 204 F M probably damaging deleterious fibrinogen gamma chain
FGG 254 L I probably damaging neutral fibrinogen gamma chain
FGG 335 G F probably damaging deleterious fibrinogen gamma chain
FGG 375 Y H possibly damaging deleterious fibrinogen gamma chain
FGB 284 N S possibly damaging deleterious fibrinogen beta chain
FGB 352 V L probably damaging neutral fibrinogen beta chain
FGB 409 D N probably damaging deleterious fibrinogen beta chain
FGB 470 W A probably damaging deleterious fibrinogen beta chain
Gene FGG together with fibrinogen alpha (FGA) and fibrinogen beta (FGB), polymerizes to form an insoluble fibrin matrix. Has a major function in hemostasis as one of the primary components of blood clots. In addition, functions during the early stages of wound repair to stabilize the lesion and guide cell migration during re-epithelialization.
The FGG is positively selected genes in the whale shark genome. In my opinion above listed function altering amino acid changes in FGG makes shark blood anticlotting. This may prevent blot clots from forming. Blood clots and coagulation to stop bleeding. However, if colts get into blood vessels, they can cause a heart attack or stroke.
Contents
Below amino acid changes that in human FGG, which involve in disease
Congenital afibrinogenemia (CAFBN)
Feature key |
Position(s) |
DescriptionActions |
Natural variantiVAR_072726 |
T → P in CAFBN; hypofibrinogenemia; heterozygous; no effect on fibrinogen complex assembly; impaired fibrinogen complex secretion. |
|
Natural variantiVAR_072727 |
D → H in CAFBN; hypofibrinogenemia; heterozygous; no effect on fibrinogen complex assembly; no effect on fibrinogen complex secretion. |
|
Natural variantiVAR_072728 |
N → D in CAFBN; hypofibrinogenemia; heterozygous; no effect on fibrinogen complex assembly; decreased fibrinogen complex secretion. |
|
Natural variantiVAR_072729 |
R → W in CAFBN; hypofibrinogenemia; heterozygous. |
Dysfibrinogenemia, congenital (DYSFIBRIN)
Feature key |
Position(s) |
DescriptionActions |
Natural variantiVAR_002410 |
R → H in DYSFIBRIN; fibrinogen Bergamo-2/Essen/Haifa/Osaka-3/Perugia/Saga/Barcelona-3/Barcelona-4. |
|
Natural variantiVAR_002411 |
G → V in DYSFIBRIN; fibrinogen Baltimore-1; impaired polymerization. |
|
Natural variantiVAR_002418 |
D → V in DYSFIBRIN; fibrinogen Milano-1; impaired polymerization. |
|
Natural variantiVAR_072621 |
S → P in DYSFIBRIN; fibrinogen Philadelphia. |
However, in whale shark genome, amino acid changes not in the same spots as in mutations. Since one amino acid mutation lead to dysfunction and to disease, functional changes -> in whale shark gene might lead to similar defects.
In website human FGG described in detail: http://www.uniprot.org/uniprot/P02679
FGG structure:
Type |
Positions |
Description |
Chain |
Fibrinogen gamma chain |
|
Domain |
Fibrinogen C-terminal |
|
Metal binding |
Calcium |
|
Metal binding |
Calcium |
|
Metal binding |
Calcium; via carbonyl oxygen |
|
Metal binding |
Calcium; via carbonyl oxygen |
|
Glycosylation |
N-linked (GlcNAc...); |
|
|
|
|
|
|
|
http://web.expasy.org/variant_pages/VAR_015853.html
Variant position: 335
Residue change: From Glycine (G) to Aspartate (D) at position 335 (G335D, p.Gly335Asp). Physico-chemical properties: Change from glycine (G) to medium size and acidic (D) Variant description: In Hillsborough; prolonged thrombin clotting time.
|
From Glycine (G) to Phenylalanine (F) at position 335
Change from glycine to large size aromatic and hydrophobic (F) |
In common Pathway of Fibrin Clot formation
The common pathway consists of the cascade of activation events leading from the formation of activated factor X to the formation of active thrombin, the cleavage of fibrinogen by thrombin, and the formation of cleaved fibrin into a stable multimeric, cross-linked complex. Thrombin also efficiently catalyzes the activation of several factors required earlier in the clotting cascade, thus acting in effect as a positive regulator of clotting. At the same time, thrombin activates protein C, which in turn catalyzes the inactivation of several of these upstream factors, thereby limiting the clotting process. Thrombin can be trapped in stable, inactive complexes with: antithrombin-III (SERPINC1), a circulating blood protein; heparin cofactor II (SERPIND1) which inhibits thrombin in a dermatan sulfate dependent manner in the arterial vasculature; protein C inhibitor (SERPINA5) that inhibits thrombin in complex with thrombomodulin; and Protease nexin-1 (SERPINE2) that inhibits thrombin at the vessel wall and platelet surface. The quantitative interplay among these positive and negative modulators is critical to the normal regulation of clotting, facilitating the rapid formation of a protective clot at the site of injury, while limiting and physically confining the process.
From: http://www.reactome.org/PathwayBrowser/#R-HSA-140875
According to excel file SERPINE2 -> is positively selected gene in shark common ancestors:
SERPINE2 146 F à Y probably damaging deleterious
This gene encodes a member of the serpin family of proteins, a group of proteins that inhibit serine proteases. Thrombin, urokinase, plasmin and trypsin are among the proteases that this family member can inhibit. This gene is a susceptibility gene for chronic obstructive pulmonary disease and for emphysema. .
The possible disease information based on polymorphism and mutation in SERPINE2:
Most of diseases are related to cancers. However, there is no disease, which have same alteration as in whale shark gene. (SERINE2). Consequently, it means that that functional change is not damaging. Beneficial for shark blood clotting.
Recent studies observed that SERPINE2 might prevent cartilage catabolism by inhibiting expression of MMP-13. The functional amino acid changes might me in order to prevent SERPINE2 suppression, because whole shark body consist from cartilage.
“SERPINE2 Inhibits IL-1α-Induced MMP-13 Expression in Human Chondrocytes: Involvement of ERK/NF-κB/AP-1 Pathway”
https://www.ncbi.nlm.nih.gov/pubmed/26305372
(Cell division : HAUS1, CENPI, CENPK, MCM3AP, CHAF1A, HP1BP3, PARP11 )
HAUS1
HAUS1 is 1 of 8 subunits of the 390-kD human augmin complex, or HAUS complex. The augmin complex is a microtubule-binding complex involved in microtubule generation within the mitotic spindle and is vital to mitotic spindle assembly.
- Important in cell replication
Whale shark specific genes:
HAUS1 76 L I probably damaging HAUS augmin like complex subunit 1
Whale shark and elephant shark common:
HAUS1 53 L Q probably damaging HAUS augmin like complex subunit 1
CENPI
Component of the CENPA-CAD (nucleosome distal) complex, a complex recruited to centromere which is involved in assembly of kinetochore proteins, mitotic progression and chromosome segregation. May be involved in incorporation of newly synthesized CENPA into centromeres via its interaction with the CENPA-NAC complex. Required for the localization of CENPF, MAD1L1 and MAD2 (MAD2L1 or MAD2L2) to kinetochores. Involved in the response of gonadal tissues to follicle-stimulating hormone
CENPI 113 L V possibly damaging deleterious centromere protein I
CENPI 262 F L probably damaging deleterious centromere protein I
CENPI 446 W Y probably damaging deleterious centromere protein I
Estimated protein expression in Liver the highest ß shark liver is the biggest organ in their body, it means liver cells proliferate many times
CENPK
Component of the CENPA-CAD (nucleosome distal) complex, a complex recruited to centromeres which is involved in assembly of kinetochore proteins, mitotic progression and chromosome segregation. May be involved in incorporation of newly synthesized CENPA into centromeres via its interaction with the CENPA-NAC complex. Acts in coordination with CASC5/KNL1 to recruit the NDC80 complex to the outer kinetochore.
<a href="https://www.ncbi.nlm.nih.gov/protein/NP_068562.1?from=47&to=306&sat=4&sat_key=135313005">Region</a> 47..306
/region_name="CENP-K"
/note="Centromere-associated protein K; pfam11802"
The functional amino-acid changes in centrosome associated region à important for binding
CENPK 252 A V probably damaging deleterious centromere protein K
CENPK 257 E D probably damaging neutral centromere protein K
MCM3AP
The minichromosome maintenance protein 3 (MCM3) is one of the MCM proteins essential for the initiation of DNA replication. The protein encoded by this gene is an MCM3 binding protein. It was reported to have phosphorylation-dependent DNA-primase activity, which was up-regulated in antigen immunization induced germinal center. This protein was demonstrated to be an acetyltransferase that acetylates MCM3 and plays a role in DNA replication. The mutagenesis of a nuclear localization signal of MCM3 affects the binding of this protein with MCM3, suggesting that this protein may also facilitate MCM3 nuclear localization
MCM3AP 730 I V MCM component 3 associated protein
CHAF1A
Core component of the CAF-1 complex, a complex thought to mediate chromatin assembly in DNA replication and DNA repair. Assembles histone octamers onto replicating DNA in vitro. CAF-1 performs the first step of the nucleosome assembly process, bringing newly synthesized histones H3 and H4 to replicating DNA; histones H2A/H2B can bind to this chromatin precursor subsequent to DNA replication to complete the histone octamer. CHAF1A binds to histones H3 and H4. It may play a role in heterochromatin maintenance in proliferating cells by bringing newly synthesized cbx proteins to heterochromatic DNA replication foci
CHAF1A 423 R K unknown
CHAF1A 509 L I probably damaging
CHAF1A 555 G C probably damaging
http://www.uniprot.org/uniprot/Q13111 according to the website information à above amino acids spots not functionally important changes.
HP1BP3
Component of heterochromatin that maintains heterochromatin integrity during G1/S progression and regulates the duration of G1 phase to critically influence cell proliferative capacity. Mediates chromatin condensation during hypoxia, leading to increased tumor cell viability, radio-resistance, chemo-resistance and self-renewal. HP1BP3 is a key mediator of tumor progression and cancer cell acquisition of therapy-resistant traits, and thus might represent a novel therapeutic target in a range of human malignancies.
In bowhead whale this protein has extended amino acids chain from 171 - > 240.
I would like to look to the whole peptide chain of the shark’s protein and compare with bowhead. Because, this protein gives condition to cancer, if some changes present like in bowhead whale protein seq., then it means they alter its function to suppress tumor.
PARP11
Plays a role in nuclear envelope stability and nuclear remodeling during spermiogenesis.
PARP11 328 Q H probably damaging deleterious
328TH position in PARP catalytic domain.
(Cancer associated: RET, GSS, SERPINF1, NFKBIE)
RET
This gene, a member of the cadherin superfamily, encodes one of the receptor tyrosine kinases, which are cell-surface molecules that transduce signals for cell growth and differentiation. This gene plays a crucial role in neural crest development, and it can undergo oncogenic activation in vivo and in vitro by cytogenetic rearrangement. Mutations in this gene are associated with the disorders multiple endocrine neoplasia, type IIA, multiple endocrine neoplasia, type IIB, Hirschsprung disease, and medullary thyroid carcinoma.
The RET gene provides instructions for producing a protein that is involved in signaling within cells. This protein appears to be essential for the normal development of several kinds of nerve cells, including nerves in the intestine (enteric neurons) and the portion of the nervous system that controls involuntary body functions such as heart rate (the autonomic nervous system). The RET protein is also necessary for normal kidney development and the production of sperm (spermatogenesis).
The RET protein spans the cell membrane, so that one end of the protein remains inside the cell and the other end projects from the outer surface of the cell. This positioning of the protein allows it to interact with specific factors outside the cell and to receive signals that help the cell respond to its environment. When molecules that stimulate growth and development (growth factors) attach to the RET protein, a complex cascade of chemical reactions inside the cell is triggered. These reactions instruct the cell to undergo certain changes, such as dividing or maturing to take on specialized functions.
https://ghr.nlm.nih.gov/gene/RET#conditions
RET 657 T S probably damaging deleterious ret proto-oncogene
Whale shark and elephant shark common:
RET 697 A G probably damaging deleterious ret proto-oncogene
RET 714 E Q probably damaging neutral ret proto-oncogene
GSS
(During hypoxia condition a lot oxidants form -> protection from free radicals *)
Glutathione is important for a variety of biological functions, including protection of cells from oxidative damage by free radicals, detoxification of xenobiotics, and membrane transport. The protein encoded by this gene functions as a homodimer to catalyze the second step of glutathione biosynthesis, which is the ATP-dependent conversion of gamma-L-glutamyl-L-cysteine to glutathione. Defects in this gene are a cause of glutathione synthetase deficiency.
ATP + gamma-L-glutamyl-L-cysteine + glycine = ADP + phosphate + glutathione
Whale shark:
GSS 152 F Y probably damaging deleterious prion protein
Whale shark and elephant shark common:
GSS 51 T S benign deleterious glutathione synthetase
GSS 456 H F probably damaging deleterious glutathione synthetase
SERPINF1
Pigment epithelium-derived factor (PEDF) also known as serpin F1 (SERPINF1), has a variety of functions including antiangiogenic, antitumorigenic, and neurotrophic properties. Endothelial cell migration is inhibited by PEDF. PEDF suppresses retinal neovascularization and endothelial cell proliferation. The antiangiogenic residues 24-57 were shown to be sufficient at inhibiting angiogenesis. PEDF is also responsible for apoptosis of endothelial cells either through the p38 MAPK pathway or through the FAS/FASL pathway. Antiangiogenic function is also conferred by PEDF through inhibition of both VEGFR-1 and VEGFR-2.
The antitumorigenic effects of PEDF are not only due to inhibition of supporting vasculature, but also due to effects on the cancer cells themselves. PEDF was shown to inhibit cancer cell proliferation and increase apoptosis via the FAS/FASL pathway. VEGF expression by cancer cells is inhibited by PEDF.
PEDF also displays neurotrophic functions. Retinoblastoma cells differentiate into neurons due to the presence of PEDF.
As it does not undergo the S (stressed) to R (relaxed) conformational transition characteristic of active serpins, it exhibits no serine protease inhibitory activity.
SERPINF1 106 R G probably damaging deleterious serpin family F member 1
SERPINF1 408 F H probably damaging deleterious serpin family F member 1
http://bioinf.umbc.edu/dmdm/generatelogo.php?domain=SERPIN&accession=smart00093&prot=313104314
In above website mutation spots are indicated, however whale shark amino acid changed spots not included there. In my opinion, those positively selected behavior is improved SERPINF1 function, giving anti-angiogenic, anti-tumorigenic specificity. Residue change in 408th amino acid from Phe to His, would not lead big changes, due to these two side chains are similar in their structure.
NFKBIE
The protein encoded by this gene binds to components of NF-kappa-B, trapping the complex in the cytoplasm and preventing it from activating genes in the nucleus. Phosphorylation of the encoded protein targets it for destruction by the ubiquitin pathway, which activates NF-kappa-B by making it available to translocate to the nucleus.
In tumor cells, NF-κB is active either due to mutations, some tumor cells secrete factors that cause NF-κB to become active. Blocking NF-κB can cause tumor cells to stop proliferating, to die, or to become more sensitive to the action of anti-tumor agents.
Active NF-κB turns on the expression of genes that keep the cell proliferating and protect the cell from conditions that would otherwise cause it to die via apoptosis.
Whale shark:
NFKBIE 261 T S possibly damaging deleterious
NFKBIE 411 A S probably damaging deleterious
Whale shark and elephant shark:
NFKBIE 322 L Q probably damaging deleterious
NFKBIE contains 6 ANK repeats.
The ankyrin repeat is a 33-residue motif in proteins consisting of two alpha helices separated by loops. Domains consisting of ankyrin repeats mediate protein–protein interactions and are among the most common structural motifs in known proteins.
In following ANK repeats whale shark’s mutation occurs:
Repeat |
258-291 |
ANK 1 |
Repeat |
293-322 |
ANK 2 |
Repeat |
403-432 |
ANK 5 |
If those amino acid changes improve NFKBIE function, then that would be one of the agent that suppress tumor cells. However, due to whale shark body size big, so need increased cell replication rate. Maybe that’s why ANK repeat had some alterations à to decrease protein-protein interactions. Low interaction with NF-kB, increase cell proliferations.
(Programmed cell death: CASP7, PDCD2, PDCD6IP,MYCBP2)
CASP7
catalytic activity: Strict requirement for an Asp residue at position P1 and has a preferred cleavage sequence of Asp-Glu-Val-Asp-|-.
function: Involved in the activation cascade of caspases responsible for apoptosis execution. Cleaves and activates sterol regulatory element binding proteins (SREBPs). Proteolytically cleaves poly(ADP-ribose) polymerase (PARP) at a '216-Asp-|-Gly-217' bond.
Overexpression promotes programmed cell death.
PTM:Cleavages by granzyme B or caspase-10 generate the two active subunits. Propeptide domains can also be cleaved efficiently by caspase-3. Active heterodimers between the small subunit of caspase-7 and the large subunit of caspase-3, and vice versa, also occur.
similarity: Belongs to the peptidase C14A family.
subunit: Heterotetramer that consists of two anti-parallel arranged heterodimers, each one formed by a 20 kDa (p20) and a 11 kDa (p11) subunit.,
tissue specificity: Highly expressed in lung, skeletal muscle, liver, kidney, spleen and heart, and moderately in testis. No expression in the brain.
CASP7 168 G E probably damaging deleterious
PDCD2
^ DNA-binding protein with a regulatory function. May play an important role in cell death and/or in regulation of cell proliferation.
^Programmed cell death 2 (PDCD2) is a highly conserved nuclear protein, and aberrant PDCD2 expression alters cell apoptosis. The data from the present study demonstrated that loss of PDCD2 expression could contribute to gastric cancer development and progression and that PDCD2-induced gastric cancer cell growth arrest at the early S phase of the cell cycle and apoptosis are p53-dependent.
Expression of this gene has been shown to be repressed by B-cell CLL/lymphoma 6 (BCL6), a transcriptional repressor required for lymph node germinal center development, suggesting that BCL6 regulates apoptosis by its effects on this protein.
* transfection of a VP16-BCL6 zinc fingers fusion protein, which competes with the binding of endogenous BCL6 in a Burkitt lymphoma cell line =>
increases PDCD2 protein expression and apoptosis
knockdown of the PDCD2 protein in this cell line by PDCD2-specific small interfering RNA duplexes inhibits apoptosis.
These studies indicate that one function of PDCD2 is to promote apoptosis in several human and mammalian cell lines and tissues, including lymphoma.
It is plausible that repression of PDCD2 expression by BCL6, which, in turn, leads to downregulation of apoptosis, is one mechanism involved in BCL6-associated lymphomatous transformation. The usefulness of increasing PDCD2 expression in the treatment of certain lymphomas merits further investigation.
<a href="https://www.ncbi.nlm.nih.gov/protein/NP_001186390.1?from=135&to=172&sat=4&sat_key=175005973">Region</a> 135-172
/region_name="zf-MYND"
/note="MYND finger; pfam01753"
Zync finger binding site amino acids changed à
PDCD2 150 H R benign deleterious programmed cell death 2
PDCD2 169 K R possibly damaging deleterious programmed cell death 2
PDCD6IP
overexpression of this protein can block apoptosis. In addition, the product of this gene binds to the product of the PDCD6 gene, a protein required for apoptosis, in a calcium-dependent manner. This gene product also binds to endophilins, proteins that regulate membrane shape during endocytosis. Overexpression of this gene product and endophilins results in cytoplasmic vacuolization, which may be partly responsible for the protection against cell death
PDCD6IP 118 Y F programmed cell death 6 interacting protein
PDCD6IP 171 L M neutral programmed cell death 6 interacting protein
PDCD6IP 175 P L deleterious programmed cell death 6 interacting protein
PDCD6IP 179 I T deleterious programmed cell death 6 interacting protein
PDCD6IP 361 E V deleterious programmed cell death 6 interacting protein
MYCBP2
protein associates with myc
PAM contains a N-terminal leucine zipper, central MYC-binding, and C-terminal histone-binding protein homology domains
Myc (c-Myc) is a regulator gene that codes for a transcription factor. The protein encoded by this gene is a multifunctional, nuclear phosphoprotein that plays a role in cell cycle progression, apoptosis and cellular transformation
PAM 285 A T probably damaging deleterious MYC binding protein 2, E3 ubiquitin protein ligase
PAM 602 V I probably damaging neutral
PAM 623 P S probably damaging deleterious
PAM 738 P A benign deleterious
PAM 768 I L probably damaging neutral
(DNA repair genes: MSH6,RAD54B, MRE11A,XRCC3, MCM8, DCLRE1B )
MSH6
Component of the post-replicative DNA mismatch repair system (MMR). Heterodimerizes with MSH2 to form MutS alpha, which binds to DNA mismatches thereby initiating DNA repair. When bound, MutS alpha bends the DNA helix and shields approximately 20 base pairs, and recognizes single base mismatches and dinucleotide insertion-deletion loops (IDL) in the DNA. After mismatch binding, forms a ternary complex with the MutL alpha heterodimer, which is thought to be responsible for directing the downstream MMR events, including strand discrimination, excision, and resynthesis. ATP binding and hydrolysis play a pivotal role in mismatch repair functions. The ATPase activity associated with MutS alpha regulates binding similar to a molecular switch: mismatched DNA provokes ADP-->ATP exchange, resulting in a discernible conformational transition that converts MutS alpha into a sliding clamp capable of hydrolysis-independent diffusion along the DNA backbone. This transition is crucial for mismatch repair. MutS alpha may also play a role in DNA homologous recombination repair. Recruited on chromatin in G1 and early S phase via its PWWP domain that specifically binds trimethylated 'Lys-36' of histone H3 (H3K36me3): early recruitment to chromatin to be replicated allowing a quick identification of mismatch repair to initiate the DNA mismatch repair reaction.
Whale shark specific genes having function altering amino acid changes
MSH6 199 P L benign deleterious
MSH6 451 F Y possibly damaging neutral
MSH6 733 M L benign deleterious
MSH6 1207 H Y possibly damaging deleterious
MSH6 1330 L F probably damaging deleterious
RAD54B
Involved in DNA repair and mitotic recombination. May play an active role in recombination processes in concert with other members of the RAD52 epistasis group
Abundantly expressed in testis and spleen.
RAD54B 194 M T possibly damaging deleterious
RAD54B 482 L I probably damaging neutral
RAD54B 598 L F probably damaging deleterious
Whale shark and elephant shark common genes having function altering amino acid changes:
RAD54B 383 I L possibly damaging neutral
RAD54B 533 E D possibly damaging neutral
MRE11A
Component of the MRN complex, which plays a central role in double-strand break (DSB) repair, DNA recombination, maintenance of telomere integrity and meiosis. The complex possesses single-strand endonuclease activity and double-strand-specific 3'-5' exonuclease activity, which are provided by MRE11A. RAD50 may be required to bind DNA ends and hold them in close proximity. This could facilitate searches for short or long regions of sequence homology in the recombining DNA templates, and may also stimulate the activity of DNA ligases and/or restrict the nuclease activity of MRE11A to prevent nucleolytic degradation past a given point. The complex may also be required for DNA damage signaling via activation of the ATM kinase.
MRE11A 304 V I probably damaging neutral
MRE11A 403 E D possibly damaging neutral
<em> Region: </em>10..402
/region_name="mre11"
/note="DNA repair protein (mre11); TIGR00583"
/db_xref="CDD:<a href="https://www.ncbi.nlm.nih.gov/Structure/cdd/cddsrv.cgi?uid=273153">273153</a>"
Defects in MRE11A can be a cause of nephronophthisis-related ciliopathies (NPHP-RC), a group of recessive diseases that affect kidney, retina and brain. A homozygous truncating mutation MRE11A has been found in patients with cerebellar vermis hypoplasia, ataxia and dysarthria.
XRCC3
Involved in the homologous recombination repair (HRR) pathway of double-stranded DNA, thought to repair chromosomal fragmentation, translocations and deletions. Part of the RAD21 paralog protein complex CX3 which acts in the BRCA1-BRCA2-dependent HR pathway. Upon DNA damage, CX3 acts downstream of RAD51 recruitment; the complex binds predominantly to the intersection of the four duplex arms of the Holliday junction (HJ) and to junctions of replication forks. Plays a role in regulating mitochondrial DNA copy number under conditions of oxidative stress in the presence of RAD51 and RAD51C.
XRCC3 259 V I probably damaging neutral
Whale shark and elephant shark common genes having function altering amino acid changes:
XRCC3 133 L Q probably damaging deleterious
<a href="https://www.ncbi.nlm.nih.gov/protein/NP_001093588.1?from=2&to=337&sat=4&sat_key=176719491">Region</a> 2..337
/region_name="recomb_DMC1"
/note="meiotic recombinase Dmc1; TIGR02238"
/db_xref="CDD:<a href="https://www.ncbi.nlm.nih.gov/Structure/cdd/cddsrv.cgi?uid=131292">131292</a>"
This model describes DMC1, a subfamily of a larger family of DNA repair and recombination proteins. It is eukaryotic only and most closely related to eukaryotic RAD51.
In this region function altering changes occurred in whale shark and elephant shark.
MCM8
Component of the MCM8-MCM9 complex, a complex involved in homologous recombination repair following DNA interstrand cross-links and plays a key role during gametogenesis. The MCM8-MCM9 complex probably acts as a hexameric helicase downstream of the Fanconi anemia proteins BRCA2 and RAD51 and is required to process aberrant forks into homologous recombination substrates and to orchestrate homologous recombination with resection, fork stabilization and fork restart. May also play a non-essential for DNA replication: may be involved in the activation of the prereplicative complex (pre-RC) during G1 phase by recruiting CDC6 to the origin recognition complex (ORC). Binds chromatin throughout the cell cycle
*Highest levels in placenta, lung and pancreas. Low levels in skeletal muscle and kidney. Expressed in various tumors with highest levels in colon and lung cancers
Whale shark specific genes having function altering amino acid changes
MCM8 608 N S probably damaging deleterious
Domain part: 402-609
In 608th amino acid change might bring conformational changes.
DCLRE1B
5'-3' exonuclease that plays a central role in telomere maintenance and protection during S-phase. Participates in the protection of telomeres against non-homologous end-joining (NHEJ)-mediated repair, thereby ensuring that telomeres do not fuse. Plays a key role in telomeric loop (T loop) formation by being recruited by TERF2 at the leading end telomeres and by processing leading-end telomeres immediately after their replication via its exonuclease activity: generates 3' single-stranded overhang at the leading end telomeres avoiding blunt leading-end telomeres that are vulnerable to end-joining reactions and expose the telomere end in a manner that activates the DNA repair pathways. Together with TERF2, required to protect telomeres from replicative damage during replication by controlling the amount of DNA topoisomerase (TOP1, TOP2A and TOP2B) needed for telomere replication during fork passage and prevent aberrant telomere topology. Also involved in response to DNA damage: plays a role in response to DNA interstrand cross-links (ICLs) by facilitating double-strand break formation. In case of spindle stress, involved in prophase checkpoint.
DNA interstrand cross-links prevent strand separation, thereby physically blocking transcription, replication, and segregation of DNA. DCLRE1B is one of several evolutionarily conserved genes involved in repair of interstrand cross-links.
DCLRE1B 209 E Q possibly damaging deleterious
Whale shark and elephant shark common genes having function altering amino acid changes
DCLRE1B 119 G A probably damaging deleterious DNA cross-link repair 1B
<a href="https://www.ncbi.nlm.nih.gov/protein/NP_001306875.1?from=90&to=173&sat=4&sat_key=157385479">Region</a> 90..173
/region_name="DRMBL"
/note="DNA repair metallo-beta-lactamase; pfam07522"
/db_xref="CDD:<a href="https://www.ncbi.nlm.nih.gov/Structure/cdd/cddsrv.cgi?uid=284854">284854</a>"
(Other proteins: KIAA1143, GNL2, CTDP1, MTIF2, DECR1, ATE1, XXYLT1, PLEKHM3)
KIAA1143
KIAA1143 29 E S probably damaging deleterious KIAA1143
KIAA1143 95 I H probably damaging deleterious KIAA1143
KIAA1143 108 K T benign deleterious KIAA1143
On the uniprot website :
Modified residue |
108 |
N6-acetyllysine |
A lysine on 108th amino acid had post translational modification, however in shark’s case lysine changed to the Threonine. Thereby, acetylation won’t happen and changes structure of the protein.
GNL2
GNL2 2 V S possibly damaging neutral G protein nucleolar 2
GNL2 7 K Q possibly damaging neutral G protein nucleolar 2
GNL2 12 I E possibly damaging deleterious G protein nucleolar 2
GNL2 13 N R benign deleterious G protein nucleolar 2
GNL2 15 S E possibly damaging deleterious G protein nucleolar 2
GNL2 18 S M probably damaging deleterious G protein nucleolar 2
GNL2 20 N I probably damaging deleterious G protein nucleolar 2
GNL2 21 P Q probably damaging deleterious G protein nucleolar 2
GNL2 32 M T possibly damaging deleterious G protein nucleolar 2
GNL2 71 V T possibly damaging deleterious G protein nucleolar 2
GNL2 308 K R possibly damaging deleterious G protein nucleolar 2
function: GTPase that associates with pre-60S ribosomal subunits in the nucleolus and is required for their nuclear export and maturation.
similarity: Belongs to the MMR1/HSR1 GTP-binding protein family, NOG2 subfamily.
The brginning amino acids mostly changed -> possibly to improve signal sequence while exporting from nucleous.
CTDP1
This gene encodes a protein which interacts with the carboxy-terminus of transcription initiation factor TFIIF, a transcription factor which regulates elongation as well as initiation by RNA polymerase II. The protein may also represent a component of an RNA polymerase II holoenzyme complex.
Mutations in this gene are associated with congenital cataracts, facial dysmorphism and neuropathy syndrome.
Whale Shark and Elephant shark:
CTDP1 153 V I probably damaging neutral
CTDP1 176 R W probably damaging deleterious
CTDP1 179 R K probably damaging deleterious
DNAJB14
Acts as a co-chaperone with HSPA8 and promotes the degradation of misfolded transmembrane proteins in the ER-associated degradation (ERAD) pathway.
DNAJB14 346 A S possibly damaging deleterious
MTIF2
During the initiation of protein biosynthesis, initiation factor-2 (IF-2) promotes the binding of the initiator tRNA to the small subunit of the ribosome in a GTP-dependent manner.
MTIF2 297 P H probably damaging deleterious
MTIF2 380 T S probably damaging deleterious
MTIF2 572 Y F benign deleterious
DECR1
Auxiliary enzyme of beta-oxidation. It participates in the metabolism of unsaturated fatty enoyl-CoA esters having double bonds in both even- and odd-numbered positions. Catalyzes the NADP-dependent reduction of 2,4-dienoyl-CoA to yield trans-3-enoyl-CoA.
DECR1 44 F L probably damaging deleterious
Tissue specificity: Heart = liver = pancreas > kidney >> skeletal muscle = lung
44th residue not a binding site or active site, so this change not involve big changes in the function of the protein.
ATE1
This gene encodes an arginyltransferase, an enzyme that is involved in posttranslational conjugation of arginine to N-terminal aspartate or glutamate residues. Conjugation of arginine to the N-terminal aspartate or glutamate targets proteins for ubiquitin-dependent degradation.
ATE1 60 T S possibly damaging deleterious
Changes from Threonine to serine cause not big change in structure and function, because they are similar amino acids.
XXYLT1
XXYLT1 352 W N probably damaging deleterious
XXYLT1 367 D E possibly damaging deleterious
Region |
UDP-alpha-D-xylose binding |
|
Regioni |
Interaction with target protein |
Above protein domains and function described, and shark’s mutation spots not included.
PLEKHM3
Whale shark only:
PLEKHM3 305 E A probably damaging deleterious
PLEKHM3 413 V I probably damaging neutral
PLEKHM3 444 A T probably damaging deleterious
Domains:
PH1 domain: 211-308 positions
PH2 domain: 362- 456 positions
This domain can bind phosphatidylinositol lipids within biological membranes, and proteins such as the βγ-subunits of heterotrimeric G proteins, and protein kinase C. Through these interactions, PH domains play a role in recruiting proteins to different membranes, thus targeting them to appropriate cellular compartments or enabling them to interact with other components of the signal transduction pathways.
In whale shark’s PLEKHM3 two domains have functional altering amino acid changes.
Whale shark and elephant shark:
PLEKHM3 380 F Y probably damaging neutral
PLEKHM3 501 L I probably damaging neutral
(Brain and neuron related: NCDN)
NCDN
This gene encodes a leucine-rich cytoplasmic protein, which is highly similar to a mouse protein norbin that negatively regulates Ca/calmodulin-dependent protein kinase II phosphorylation and may be essential for spatial learning processes.
Norbin can modulate signaling activity and expression of metabotropic glutamate receptor 5; modulating mice with targeted deletion of NCDN in the brain have phenotypic traits usually found in the rodent models of schizophrenia, including disruptions in prepulse inhibition. Furthermore, norbin protein expression is altered in the schizophrenia brain.
NCDN 250 P A possibly damaging deleterious neurochondrin
NCDN 379 L F probably damaging deleterious neurochondrin