Difference between revisions of "Summary class Geromics 2024 HyoungJinChoi"
(24 intermediate revisions by the same user not shown) | |||
Line 62: | Line 62: | ||
| | ||
− | + | | |
== 2024.03.29 == | == 2024.03.29 == | ||
Line 70: | Line 70: | ||
<img style="width: 484px; height: 454px;" src=http://Biolecture.org/upload/20240329131251_image.png><br/> full text link : [https://www.hani.co.kr/arti/society/rights/471412.html https://www.hani.co.kr/arti/society/rights/471412.html]<br/> <br/> <br/> | <img style="width: 484px; height: 454px;" src=http://Biolecture.org/upload/20240329131251_image.png><br/> full text link : [https://www.hani.co.kr/arti/society/rights/471412.html https://www.hani.co.kr/arti/society/rights/471412.html]<br/> <br/> <br/> | ||
+ | | ||
== 2024.04.05 == | == 2024.04.05 == | ||
− | |||
=== DNA === | === DNA === | ||
Line 83: | Line 83: | ||
Within eukaryotic cells, DNA is organized into long structures called [https://en.wikipedia.org/wiki/Chromosome chromosomes]. Before typical [https://en.wikipedia.org/wiki/Cell_division cell division], these chromosomes are duplicated in the process of DNA replication, providing a complete set of chromosomes for each daughter cell. [https://en.wikipedia.org/wiki/Eukaryote Eukaryotic organisms] ([https://en.wikipedia.org/wiki/Animal animals], [https://en.wikipedia.org/wiki/Plant plants], [https://en.wikipedia.org/wiki/Fungus fungi] and [https://en.wikipedia.org/wiki/Protist protists]) store most of their DNA inside the [https://en.wikipedia.org/wiki/Cell_nucleus cell nucleus] as [https://en.wikipedia.org/wiki/Nuclear_DNA nuclear DNA], and some in the [https://en.wikipedia.org/wiki/Mitochondrion mitochondria] as [https://en.wikipedia.org/wiki/Mitochondrial_DNA mitochondrial DNA] or in [https://en.wikipedia.org/wiki/Chloroplast chloroplasts] as [https://en.wikipedia.org/wiki/Chloroplast_DNA chloroplast DNA].[https://en.wikipedia.org/wiki/DNA#cite_note-5 <sup>[5</sup>]] In contrast, [https://en.wikipedia.org/wiki/Prokaryote prokaryotes] ([https://en.wikipedia.org/wiki/Bacteria bacteria] and [https://en.wikipedia.org/wiki/Archaea archaea]) store their DNA only in the [https://en.wikipedia.org/wiki/Cytoplasm cytoplasm], in [https://en.wikipedia.org/wiki/Circular_chromosome circular chromosomes]. Within eukaryotic chromosomes, [https://en.wikipedia.org/wiki/Chromatin chromatin] proteins, such as [https://en.wikipedia.org/wiki/Histone histones], compact and organize DNA. These compacting structures guide the interactions between DNA and other proteins, helping control which parts of the DNA are transcribed.<br/> <br/> full text link : [https://en.wikipedia.org/wiki/DNA https://en.wikipedia.org/wiki/DNA] | Within eukaryotic cells, DNA is organized into long structures called [https://en.wikipedia.org/wiki/Chromosome chromosomes]. 
Before typical [https://en.wikipedia.org/wiki/Cell_division cell division], these chromosomes are duplicated in the process of DNA replication, providing a complete set of chromosomes for each daughter cell. [https://en.wikipedia.org/wiki/Eukaryote Eukaryotic organisms] ([https://en.wikipedia.org/wiki/Animal animals], [https://en.wikipedia.org/wiki/Plant plants], [https://en.wikipedia.org/wiki/Fungus fungi] and [https://en.wikipedia.org/wiki/Protist protists]) store most of their DNA inside the [https://en.wikipedia.org/wiki/Cell_nucleus cell nucleus] as [https://en.wikipedia.org/wiki/Nuclear_DNA nuclear DNA], and some in the [https://en.wikipedia.org/wiki/Mitochondrion mitochondria] as [https://en.wikipedia.org/wiki/Mitochondrial_DNA mitochondrial DNA] or in [https://en.wikipedia.org/wiki/Chloroplast chloroplasts] as [https://en.wikipedia.org/wiki/Chloroplast_DNA chloroplast DNA].[https://en.wikipedia.org/wiki/DNA#cite_note-5 <sup>[5</sup>]] In contrast, [https://en.wikipedia.org/wiki/Prokaryote prokaryotes] ([https://en.wikipedia.org/wiki/Bacteria bacteria] and [https://en.wikipedia.org/wiki/Archaea archaea]) store their DNA only in the [https://en.wikipedia.org/wiki/Cytoplasm cytoplasm], in [https://en.wikipedia.org/wiki/Circular_chromosome circular chromosomes]. Within eukaryotic chromosomes, [https://en.wikipedia.org/wiki/Chromatin chromatin] proteins, such as [https://en.wikipedia.org/wiki/Histone histones], compact and organize DNA. These compacting structures guide the interactions between DNA and other proteins, helping control which parts of the DNA are transcribed.<br/> <br/> full text link : [https://en.wikipedia.org/wiki/DNA https://en.wikipedia.org/wiki/DNA] | ||
+ | |||
+ | | ||
=== RNA === | === RNA === | ||
+ | |||
+ | '''Ribonucleic acid''' ('''RNA''') is a [https://en.wikipedia.org/wiki/Polymer polymeric] molecule that is essential for most biological functions, either by performing the function itself ([https://en.wikipedia.org/wiki/Non-coding_RNA non-coding RNA]) or by forming a template for the production of proteins ([https://en.wikipedia.org/wiki/Messenger_RNA messenger RNA]). RNA and [https://en.wikipedia.org/wiki/Deoxyribonucleic_acid deoxyribonucleic acid] (DNA) are [https://en.wikipedia.org/wiki/Nucleic_acid nucleic acids]. The nucleic acids constitute one of the four major [https://en.wikipedia.org/wiki/Macromolecule macromolecules] essential for all known forms of [https://en.wikipedia.org/wiki/Life life]. RNA is assembled as a chain of [https://en.wikipedia.org/wiki/Nucleotide nucleotides]. Cellular organisms use [https://en.wikipedia.org/wiki/Messenger_RNA messenger RNA] ('''''mRNA''''') to convey genetic information (using the [https://en.wikipedia.org/wiki/Nucleobase nitrogenous bases] of [https://en.wikipedia.org/wiki/Guanine guanine], [https://en.wikipedia.org/wiki/Uracil uracil], [https://en.wikipedia.org/wiki/Adenine adenine], and [https://en.wikipedia.org/wiki/Cytosine cytosine], denoted by the letters G, U, A, and C) that directs synthesis of specific proteins. Many [https://en.wikipedia.org/wiki/Virus viruses] encode their genetic information using an RNA [https://en.wikipedia.org/wiki/Genome genome]. | ||
+ | |||
+ | Some RNA molecules play an active role within cells by catalyzing biological reactions, controlling [https://en.wikipedia.org/wiki/Gene_expression gene expression], or sensing and communicating responses to cellular signals. One of these active processes is [https://en.wikipedia.org/wiki/Protein_biosynthesis protein synthesis], a universal function in which RNA molecules direct the synthesis of proteins on [https://en.wikipedia.org/wiki/Ribosome ribosomes]. This process uses [https://en.wikipedia.org/wiki/Transfer_RNA transfer RNA] ('''''tRNA''''') molecules to deliver [https://en.wikipedia.org/wiki/Amino_acid amino acids] to the [https://en.wikipedia.org/wiki/Ribosome ribosome], where [https://en.wikipedia.org/wiki/Ribosomal_RNA ribosomal RNA] ('''''rRNA''''') then links amino acids together to form coded proteins. | ||
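As a minimal sketch of the mRNA-to-protein step described above, the following Python snippet transcribes a DNA coding strand into mRNA and translates it with the standard genetic code. The function names <code>transcribe</code> and <code>translate</code> and the example sequence are illustrative assumptions, not part of the source text, and the biological machinery (tRNA, ribosomes, start-site selection) is deliberately ignored.

```python
# Standard genetic code, listed in U, C, A, G order for each codon position.
# "*" marks a stop codon.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def transcribe(coding_strand: str) -> str:
    """Return the mRNA matching a DNA coding strand (T replaced by U)."""
    return coding_strand.upper().replace("T", "U")


def translate(mrna: str) -> str:
    """Translate mRNA codon by codon from position 0 until a stop codon."""
    protein = []
    for start in range(0, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[start:start + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# Hypothetical example sequence: ATG GCC TGG TAA
mrna = transcribe("ATGGCCTGGTAA")
protein = translate(mrna)
print(mrna, protein)
```

The codon table is built from a single 64-letter string so that the whole code fits in a few lines; a real analysis would more likely rely on an established library.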
+ | |||
+ | It has become widely accepted in science[https://en.wikipedia.org/wiki/RNA#cite_note-1 <sup>[1</sup>]] that early in the [https://en.wikipedia.org/wiki/History_of_life_on_Earth history of life on Earth], prior to the evolution of DNA and possibly of protein-based [https://en.wikipedia.org/wiki/Enzyme enzymes] as well, an "[https://en.wikipedia.org/wiki/RNA_world RNA world]" existed in which RNA served as both living organisms' storage method for [https://en.wikipedia.org/wiki/Genetic_information genetic information]—a role fulfilled today by DNA, except in the case of [https://en.wikipedia.org/wiki/RNA_virus RNA viruses]—and potentially performed catalytic functions in cells—a function performed today by protein enzymes, with the notable and important exception of the ribosome, which is a [https://en.wikipedia.org/wiki/Ribozyme ribozyme].<br/> <br/> Full text link : [https://en.wikipedia.org/wiki/RNA https://en.wikipedia.org/wiki/RNA] | ||
| | ||
Line 90: | Line 98: | ||
=== eQTL === | === eQTL === | ||
+ | '''Distant and local, trans- and cis-eQTLs, respectively''' | ||
+ | |||
+ | An expression quantitative trait is an amount of an [https://en.wikipedia.org/wiki/MRNA mRNA] transcript or a [https://en.wikipedia.org/wiki/Protein protein]. These are usually the product of a single [https://en.wikipedia.org/wiki/Gene gene] with a specific chromosomal location. This distinguishes expression quantitative traits from most [https://en.wikipedia.org/wiki/Complex_traits complex traits], which are not the product of the expression of a single gene. Chromosomal loci that explain variance in expression traits are called eQTLs. eQTLs located near the gene-of-origin (gene which produces the transcript or protein) are referred to as '''local eQTLs''' or '''cis-eQTLs.''' By contrast, those located distant from their gene of origin, often on different chromosomes, are referred to as '''distant eQTLs''' or '''trans-eQTLs'''.[https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-Fairfax2012-3 <sup>[3</sup>]] [https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-4 <sup>[4</sup>]] The first genome-wide study of gene expression was carried out in yeast and published in 2002.[https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-Brem2002-5 <sup>[5</sup>]] The initial wave of eQTL studies employed microarrays to measure genome-wide gene expression; more recent studies have employed massively parallel [https://en.wikipedia.org/wiki/RNA-Seq RNA sequencing]. Many [https://en.wikipedia.org/wiki/Gene_expression expression] QTL studies were performed in plants and animals, including humans,[https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-6 <sup>[6</sup>]] non-human primates[https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-7 <sup>[7</sup>]][https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-8 <sup>[8</sup>]] and mice.[https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-9 <sup>[9</sup>]] | ||
+ | |||
+ | Some cis eQTLs are detected in many [https://en.wikipedia.org/wiki/Tissue_(biology) tissue] types but the majority of trans eQTLs are tissue-dependent (dynamic).[https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-10 <sup>[10</sup>]] eQTLs may act in [https://en.wikipedia.org/wiki/Cis-acting cis] (locally) or [https://en.wikipedia.org/wiki/Trans-acting trans] (at a distance) to a [https://en.wikipedia.org/wiki/Gene gene].[https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-11 <sup>[11</sup>]] The abundance of a gene [https://en.wikipedia.org/wiki/RNA transcript] is directly modified by [https://en.wikipedia.org/wiki/Polymorphism_(biology) polymorphism] in [https://en.wikipedia.org/wiki/Regulatory_elements regulatory elements]. Consequently, transcript abundance might be considered as a quantitative trait that can be mapped with considerable power. These have been named expression [https://en.wikipedia.org/wiki/QTL QTLs] (eQTLs).[https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-12 <sup>[12</sup>]] The combination of [https://en.wikipedia.org/wiki/Genome-wide_association_study whole-genome genetic association studies] and the measurement of global [https://en.wikipedia.org/wiki/Gene_expression gene expression] allows the systematic identification of eQTLs. 
By assaying gene expression and [https://en.wikipedia.org/wiki/Genetic_variation genetic variation] simultaneously on a genome-wide basis in a large number of individuals, statistical genetic methods can be used to map the genetic factors that underpin individual differences in quantitative levels of expression of many thousands of transcripts.[https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-13 <sup>[13</sup>]] Studies have shown that [https://en.wikipedia.org/wiki/Single_nucleotide_polymorphism single nucleotide polymorphisms] (SNPs) reproducibly associated with complex disorders [https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-14 <sup>[14</sup>]] as well as certain pharmacologic phenotypes [https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-15 <sup>[15</sup>]] are found to be significantly enriched for eQTLs, relative to frequency-matched control SNPs. The integration of eQTLs with [https://en.wikipedia.org/wiki/Genome-wide_association_study GWAS] has led to development of the [https://en.wikipedia.org/wiki/Transcriptome-wide_association_study transcriptome-wide association study] (TWAS) methodology.[https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-16 <sup>[16</sup>]][https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-17 <sup>[17</sup>]] | ||
+ | |||
+ | '''Detecting eQTLs''' | ||
+ | |||
+ | Mapping eQTLs is done using standard [https://en.wikipedia.org/wiki/QTL QTL] mapping methods that test the linkage between variation in expression and genetic polymorphisms. The only considerable difference is that eQTL studies can involve a million or more expression microtraits. Standard gene mapping software packages can be used, although it is often faster to use custom code such as QTL Reaper or the web-based eQTL mapping system [https://en.wikipedia.org/wiki/GeneNetwork GeneNetwork]. GeneNetwork hosts many large eQTL mapping data sets and provides access to fast algorithms to map single loci and [https://en.wikipedia.org/wiki/Epistasis epistatic] interactions. As is true in all QTL mapping studies, the final steps in defining DNA variants that cause variation in traits are usually difficult and require a second round of experimentation. This is especially the case for trans eQTLs that do not benefit from the strong prior probability that relevant variants are in the immediate vicinity of the parent gene. Statistical, graphical, and bioinformatic methods are used to evaluate positional candidate genes and entire systems of interactions.[https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-Kulp_2006-18 <sup>[18</sup>]][https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-Lee_2009-19 <sup>[19</sup>]] The development of single-cell technologies and parallel advances in statistical methods have made it possible to define even subtle changes in eQTLs as cell-states change.[https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-20 <sup>[20</sup>]][https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci#cite_note-21 <sup>[21</sup>]] | ||
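To make the idea of testing the association between expression and a polymorphism concrete, here is a minimal sketch for a single SNP and a single transcript. It assumes an additive model (genotype coded as 0, 1, or 2 copies of the alternate allele) and uses a permutation test for significance; this is one common simplification, not the algorithm used by QTL Reaper or GeneNetwork. The data values and function names are invented for illustration.

```python
import random


def slope_and_r(genotypes, expression):
    """Least-squares slope and Pearson correlation of expression on genotype."""
    n = len(genotypes)
    mean_g = sum(genotypes) / n
    mean_e = sum(expression) / n
    sxy = sum((g - mean_g) * (e - mean_e) for g, e in zip(genotypes, expression))
    sxx = sum((g - mean_g) ** 2 for g in genotypes)
    syy = sum((e - mean_e) ** 2 for e in expression)
    return sxy / sxx, sxy / (sxx * syy) ** 0.5


def permutation_p_value(genotypes, expression, n_perm=2000, seed=0):
    """Share of shuffled data sets whose |r| is at least the observed |r|."""
    rng = random.Random(seed)
    _, r_obs = slope_and_r(genotypes, expression)
    shuffled = list(expression)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break any genotype-expression link
        _, r_perm = slope_and_r(genotypes, shuffled)
        if abs(r_perm) >= abs(r_obs):
            hits += 1
    return (hits + 1) / (n_perm + 1)  # add-one correction avoids p = 0


# Made-up example: nine individuals, expression rises with allele count.
genotypes = [0, 0, 0, 1, 1, 1, 2, 2, 2]
expression = [1.0, 1.2, 0.9, 2.1, 1.9, 2.0, 3.1, 2.9, 3.0]

slope, r = slope_and_r(genotypes, expression)
p = permutation_p_value(genotypes, expression)
print(f"slope={slope:.3f}, r={r:.3f}, permutation p={p:.4f}")
```

A genome-wide study repeats a test like this for every SNP-transcript pair, which is why multiple-testing correction and fast implementations matter in practice.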
+ | |||
+ | Full text link : [https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci]<br/> | ||
+ | |||
+ | | ||
+ | |||
+ | | ||
+ | |||
+ | | ||
+ | |||
+ | == 2024.04.12 == | ||
+ | |||
+ | === Proteomics === | ||
+ | |||
+ | '''Proteomics''' is the large-scale study of [https://en.wikipedia.org/wiki/Protein proteins].<sup id="cite_ref-pmid9740045_1-0">[https://en.wikipedia.org/wiki/Proteomics#cite_note-pmid9740045-1 [1]]</sup><sup id="cite_ref-pmid10189717_2-0">[https://en.wikipedia.org/wiki/Proteomics#cite_note-pmid10189717-2 [2]]</sup> Proteins are vital parts of living organisms, with many functions such as the formation of structural fibers of [https://en.wikipedia.org/wiki/Muscle_tissue muscle tissue], enzymatic digestion of food, or synthesis and replication of [https://en.wikipedia.org/wiki/DNA DNA]. In addition, other kinds of proteins include [https://en.wikipedia.org/wiki/Antibodies antibodies] that protect an organism from infection, and [https://en.wikipedia.org/wiki/Hormones hormones] that send important signals throughout the body. | ||
+ | |||
+ | The [https://en.wikipedia.org/wiki/Proteome proteome] is the entire set of proteins produced or modified by an organism or system. Proteomics enables the identification of ever-increasing numbers of proteins. The proteome varies with time and with the distinct requirements, or stresses, that a cell or organism undergoes.<sup id="cite_ref-3">[https://en.wikipedia.org/wiki/Proteomics#cite_note-3 [3]]</sup> | ||
+ | |||
+ | Proteomics is an interdisciplinary domain that has benefited greatly from the genetic information of various genome projects, including the [https://en.wikipedia.org/wiki/Human_Genome_Project Human Genome Project].<sup id="cite_ref-4">[https://en.wikipedia.org/wiki/Proteomics#cite_note-4 [4]]</sup> It covers the exploration of proteomes from the overall level of protein composition, structure, and activity, and is an important component of [https://en.wikipedia.org/wiki/Functional_genomics functional genomics]. | ||
+ | |||
+ | ''Proteomics'' generally denotes the large-scale experimental analysis of proteins and proteomes, but often refers specifically to [https://en.wikipedia.org/wiki/Protein_purification protein purification] and [https://en.wikipedia.org/wiki/Mass_spectrometry mass spectrometry]. Indeed, mass spectrometry is the most powerful method for analysis of proteomes, both in large samples composed of millions of cells<sup id="cite_ref-5">[https://en.wikipedia.org/wiki/Proteomics#cite_note-5 [5]]</sup> and in single cells.<sup id="cite_ref-6">[https://en.wikipedia.org/wiki/Proteomics#cite_note-6 [6]]</sup><sup id="cite_ref-7">[https://en.wikipedia.org/wiki/Proteomics#cite_note-7 [7]]</sup><br/> <br/> Full text link : [https://en.wikipedia.org/wiki/Proteomics https://en.wikipedia.org/wiki/Proteomics] | ||
+ | |||
+ | === Omics === | ||
+ | |||
+ | The branches of [https://en.wikipedia.org/wiki/Science science] known informally as '''omics''' are various disciplines in [https://en.wikipedia.org/wiki/Biology biology] whose names end in the suffix ''[https://en.wiktionary.org/wiki/-omics -omics]'', such as [https://en.wikipedia.org/wiki/Genomics genomics], [https://en.wikipedia.org/wiki/Proteomics proteomics], [https://en.wikipedia.org/wiki/Metabolomics metabolomics], [https://en.wikipedia.org/wiki/Metagenomics metagenomics], [https://en.wikipedia.org/wiki/Phenomics phenomics] and [https://en.wikipedia.org/wiki/Transcriptomics transcriptomics]. Omics aims at the collective characterization and quantification of pools of biological molecules that translate into the structure, function, and dynamics of an organism or organisms.<sup id="cite_ref-1">[https://en.wikipedia.org/wiki/Omics#cite_note-1 [1]]</sup> | ||
+ | |||
+ | The related suffix '''-ome''' is used to address the objects of study of such fields, such as the [https://en.wikipedia.org/wiki/Genome genome], [https://en.wikipedia.org/wiki/Proteome proteome] or [https://en.wikipedia.org/wiki/Metabolome metabolome] respectively. The suffix ''-ome'' as used in molecular biology refers to a ''totality'' of some sort; it is an example of a "neo-suffix" formed by abstraction from various Greek terms in -ωμα, a sequence that does not form an identifiable suffix in Greek. | ||
+ | |||
+ | [https://en.wikipedia.org/wiki/Functional_genomics Functional genomics] aims at identifying the functions of as many genes as possible of a given organism. It combines different -omics techniques such as transcriptomics and proteomics with saturated mutant collections.<sup id="cite_ref-2">[https://en.wikipedia.org/wiki/Omics#cite_note-2 [2]]</sup><br/> <br/> Full text link : [https://en.wikipedia.org/wiki/Omics https://en.wikipedia.org/wiki/Omics]<br/> <br/> | ||
+ | |||
+ | | ||
+ | |||
+ | === -ology === | ||
+ | |||
+ | An '''ology''' or '''[https://en.wikipedia.org/wiki/-logy -''logy'']''' is a scientific discipline.<br/> | ||
+ | |||
+ | === Protein === | ||
+ | |||
+ | '''Proteins''' are large [https://en.wikipedia.org/wiki/Biomolecule biomolecules] and [https://en.wikipedia.org/wiki/Macromolecule macromolecules] that comprise one or more long chains of [https://en.wikipedia.org/wiki/Amino_acid amino acid] [https://en.wikipedia.org/wiki/Residue_(biochemistry) residues]. Proteins perform a vast array of functions within organisms, including [https://en.wikipedia.org/wiki/Enzyme_catalysis catalysing metabolic reactions], [https://en.wikipedia.org/wiki/DNA_replication DNA replication], [https://en.wikipedia.org/wiki/Cell_signaling responding to stimuli], providing [https://en.wikipedia.org/wiki/Cytoskeleton structure to cells] and [https://en.wikipedia.org/wiki/Fibrous_protein organisms], and [https://en.wikipedia.org/wiki/Intracellular_transport transporting molecules] from one location to another. Proteins differ from one another primarily in their sequence of amino acids, which is dictated by the [https://en.wikipedia.org/wiki/Nucleic_acid_sequence nucleotide sequence] of their [https://en.wikipedia.org/wiki/Gene genes], and which usually results in [https://en.wikipedia.org/wiki/Protein_folding protein folding] into a specific [https://en.wikipedia.org/wiki/Protein_structure 3D structure] that determines its activity. | ||
+ | |||
+ | A linear chain of amino acid residues is called a [https://en.wikipedia.org/wiki/Polypeptide polypeptide]. A protein contains at least one long polypeptide. Short polypeptides, containing fewer than 20–30 residues, are rarely considered to be proteins and are commonly called [https://en.wikipedia.org/wiki/Peptide peptides]. Each amino acid residue is joined to the adjacent residues by [https://en.wikipedia.org/wiki/Peptide_bond peptide bonds]. The [https://en.wikipedia.org/wiki/Protein_primary_structure sequence] of amino acid residues in a protein is defined by the [https://en.wikipedia.org/wiki/DNA_sequencing sequence] of a gene, which is encoded in the [https://en.wikipedia.org/wiki/Genetic_code genetic code]. In general, the genetic code specifies 20 standard amino acids; but in certain organisms the genetic code can include [https://en.wikipedia.org/wiki/Selenocysteine selenocysteine] and, in certain [https://en.wikipedia.org/wiki/Archaea archaea], [https://en.wikipedia.org/wiki/Pyrrolysine pyrrolysine]. Shortly after or even during synthesis, the residues in a protein are often chemically modified by [https://en.wikipedia.org/wiki/Post-translational_modification post-translational modification], which alters the physical and chemical properties, folding, stability, activity, and ultimately, the function of the proteins. Some proteins have non-peptide groups attached, which can be called [https://en.wikipedia.org/wiki/Prosthetic_group prosthetic groups] or [https://en.wikipedia.org/wiki/Cofactor_(biochemistry) cofactors]. Proteins can also work together to achieve a particular function, and they often associate to form stable [https://en.wikipedia.org/wiki/Protein_complex protein complexes]. | ||
+ | |||
+ | Once formed, proteins only exist for a certain period and are then [https://en.wikipedia.org/wiki/Proteolysis#Protein_degradation degraded] and recycled by the cell's machinery through the process of [https://en.wikipedia.org/wiki/Protein_turnover protein turnover]. A protein's lifespan is measured in terms of its [https://en.wikipedia.org/wiki/Half-life half-life] and covers a wide range. They can exist for minutes or years with an average lifespan of 1–2 days in mammalian cells. Abnormal or misfolded proteins are degraded more rapidly either due to being targeted for destruction or due to being unstable. | ||
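The half-life mentioned above can be turned into a quick calculation. The sketch below assumes simple first-order (exponential) decay, which is a modeling assumption rather than something stated in the text; the function name and the 24-hour half-life are illustrative only.

```python
def fraction_remaining(hours_elapsed: float, half_life_hours: float) -> float:
    """Fraction of a protein pool left after a given time.

    Assumes first-order decay: the pool halves every half_life_hours.
    """
    if half_life_hours <= 0:
        raise ValueError("half-life must be positive")
    return 0.5 ** (hours_elapsed / half_life_hours)


# Hypothetical protein with a 24-hour half-life, observed after 48 hours.
remaining = fraction_remaining(48, 24)
print(f"{remaining:.0%} of the original pool remains")
```

After two half-lives a quarter of the pool is left, which is why even a protein with a one-day half-life is largely replaced within a few days.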
+ | |||
+ | Like other biological macromolecules such as [https://en.wikipedia.org/wiki/Polysaccharide polysaccharides] and [https://en.wikipedia.org/wiki/Nucleic_acid nucleic acids], proteins are essential parts of organisms and participate in virtually every process within [https://en.wikipedia.org/wiki/Cell_(biology) cells]. Many proteins are [https://en.wikipedia.org/wiki/Enzyme enzymes] that [https://en.wikipedia.org/wiki/Catalysis catalyse] biochemical reactions and are vital to [https://en.wikipedia.org/wiki/Metabolism metabolism]. Proteins also have structural or mechanical functions, such as [https://en.wikipedia.org/wiki/Actin actin] and [https://en.wikipedia.org/wiki/Myosin myosin] in muscle and the proteins in the [https://en.wikipedia.org/wiki/Cytoskeleton cytoskeleton], which form a system of [https://en.wikipedia.org/wiki/Scaffolding scaffolding] that maintains cell shape. Other proteins are important in cell signaling, [https://en.wikipedia.org/wiki/Antibody immune responses], [https://en.wikipedia.org/wiki/Cell_adhesion cell adhesion], and the [https://en.wikipedia.org/wiki/Cell_cycle cell cycle]. In animals, proteins are needed in the [https://en.wikipedia.org/wiki/Diet_(nutrition) diet] to provide the [https://en.wikipedia.org/wiki/Essential_amino_acid essential amino acids] that cannot be [https://en.wikipedia.org/wiki/Amino_acid_synthesis synthesized]. [https://en.wikipedia.org/wiki/Digestion Digestion] breaks the proteins down for metabolic use. | ||
+ | |||
+ | Proteins may be [https://en.wikipedia.org/wiki/Protein_purification purified] from other cellular components using a variety of techniques such as [https://en.wikipedia.org/wiki/Ultracentrifugation ultracentrifugation], [https://en.wikipedia.org/wiki/Precipitation_(chemistry) precipitation], [https://en.wikipedia.org/wiki/Electrophoresis electrophoresis], and [https://en.wikipedia.org/wiki/Chromatography chromatography]; the advent of [https://en.wikipedia.org/wiki/Genetic_engineering genetic engineering] has made possible a number of methods to facilitate purification. Methods commonly used to study protein structure and function include [https://en.wikipedia.org/wiki/Immunohistochemistry immunohistochemistry], [https://en.wikipedia.org/wiki/Site-directed_mutagenesis site-directed mutagenesis], [https://en.wikipedia.org/wiki/X-ray_crystallography X-ray crystallography], [https://en.wikipedia.org/wiki/Nuclear_magnetic_resonance nuclear magnetic resonance] and [https://en.wikipedia.org/wiki/Mass_spectrometry mass spectrometry].<br/> | ||
+ | |||
+ | === PPI (Protein-Protein interaction) === | ||
+ | |||
+ | '''Protein–protein interactions''' ('''PPIs''') are physical contacts of high specificity established between two or more [https://en.wikipedia.org/wiki/Protein protein] molecules as a result of biochemical events steered by interactions that include [https://en.wikipedia.org/wiki/Electrostatic_forces electrostatic forces], [https://en.wikipedia.org/wiki/Hydrogen_bond hydrogen bonding] and the [https://en.wikipedia.org/wiki/Hydrophobic_effect hydrophobic effect]. Many are physical contacts with molecular associations between chains that occur in a cell or in a living organism in a specific biomolecular context. | ||
+ | |||
+ | Proteins rarely act alone as their functions tend to be regulated. Many molecular processes within a cell are carried out by [https://en.wikipedia.org/wiki/Molecular_machine molecular machines] that are built from numerous protein components organized by their PPIs. These physiological interactions make up the so-called [https://en.wikipedia.org/wiki/Interactome interactomics] of the organism, while aberrant PPIs are the basis of multiple aggregation-related diseases, such as [https://en.wikipedia.org/wiki/Creutzfeldt–Jakob_disease Creutzfeldt–Jakob] and [https://en.wikipedia.org/wiki/Alzheimer's_disease Alzheimer's diseases]. | ||
+ | |||
+ | PPIs have been studied with [https://en.wikipedia.org/wiki/Methods_to_investigate_protein–protein_interactions many methods] and from different perspectives: [https://en.wikipedia.org/wiki/Biochemistry biochemistry], [https://en.wikipedia.org/wiki/Quantum_chemistry quantum chemistry], [https://en.wikipedia.org/wiki/Molecular_dynamics molecular dynamics], [https://en.wikipedia.org/wiki/Signal_transduction signal transduction], among others.<sup id="cite_ref-Titeca_2019_1-0">[https://en.wikipedia.org/wiki/Protein–protein_interaction#cite_note-Titeca_2019-1 [1]]</sup><sup id="cite_ref-2">[https://en.wikipedia.org/wiki/Protein–protein_interaction#cite_note-2 [2]]</sup><sup id="cite_ref-3">[https://en.wikipedia.org/wiki/Protein–protein_interaction#cite_note-3 [3]]</sup> All this information enables the creation of large protein interaction networks<sup id="cite_ref-Mashaghi.2C_A._2004_113.E2.80.93121_4-0">[https://en.wikipedia.org/wiki/Protein–protein_interaction#cite_note-Mashaghi,_A._2004_113–121-4 [4]]</sup> – similar to [https://en.wikipedia.org/wiki/Metabolic_network metabolic] or [https://en.wikipedia.org/wiki/Genetic_networks genetic/epigenetic networks] – that empower the current knowledge on [https://en.wikipedia.org/wiki/Biochemical_cascade biochemical cascades] and molecular etiology of disease, as well as the discovery of putative protein targets of therapeutic interest.<br/> <br/> full text link : [https://en.wikipedia.org/wiki/Protein–protein_interaction https://en.wikipedia.org/wiki/Protein%E2%80%93protein_interaction] | ||
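A protein interaction network of the kind mentioned above is usually handled as an undirected graph. The following sketch stores a handful of made-up interactions as an adjacency map and finds the most connected protein; the identifiers <code>P1</code> to <code>P5</code> and the helper <code>build_network</code> are hypothetical placeholders, not real proteins or a real database API.

```python
from collections import defaultdict

# Hypothetical pairwise interactions (placeholder identifiers, not real data).
interactions = [
    ("P1", "P2"),
    ("P1", "P3"),
    ("P1", "P4"),
    ("P2", "P3"),
    ("P5", "P4"),
]


def build_network(pairs):
    """Build an undirected adjacency map: protein -> set of partners."""
    network = defaultdict(set)
    for a, b in pairs:
        network[a].add(b)
        network[b].add(a)
    return network


network = build_network(interactions)

# Degree = number of distinct interaction partners of each protein.
degree = {protein: len(partners) for protein, partners in network.items()}
hub = max(degree, key=degree.get)
print(degree, "most connected:", hub)
```

Sets are used for the partner lists so that an interaction reported twice, or in both directions, is only counted once.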
+ | |||
+ | | ||
+ | |||
+ | === String === | ||
+ | |||
+ | '''in computer science''' | ||
+ | |||
+ | In [https://en.wikipedia.org/wiki/Computer_programming computer programming], a '''string''' is traditionally a [https://en.wikipedia.org/wiki/Sequence sequence] of [https://en.wikipedia.org/wiki/Character_(computing) characters], either as a [https://en.wikipedia.org/wiki/Literal_(computer_programming) literal constant] or as some kind of [https://en.wikipedia.org/wiki/Variable_(computer_science) variable]. The latter may allow its elements to be [https://en.wikipedia.org/wiki/Immutable_object mutated] and the length changed, or it may be fixed (after creation). A string is generally considered as a [https://en.wikipedia.org/wiki/Data_type data type] and is often implemented as an [https://en.wikipedia.org/wiki/Array_data_structure array data structure] of [https://en.wikipedia.org/wiki/Byte bytes] (or [https://en.wikipedia.org/wiki/Word_(computer_architecture) words]) that stores a sequence of elements, typically characters, using some [https://en.wikipedia.org/wiki/Character_encoding character encoding]. ''String'' may also denote more general [https://en.wikipedia.org/wiki/Array_data_type arrays] or other sequence (or [https://en.wikipedia.org/wiki/List_(abstract_data_type) list]) data types and structures. | ||
+ | |||
+ | Depending on the programming language and precise data type used, a [https://en.wikipedia.org/wiki/Variable_(programming) variable] declared to be a string may either cause storage in memory to be statically allocated for a predetermined maximum length or employ [https://en.wikipedia.org/wiki/Dynamic_allocation dynamic allocation] to allow it to hold a variable number of elements. | ||
+ | |||
+ | When a string appears literally in [https://en.wikipedia.org/wiki/Source_code source code], it is known as a [https://en.wikipedia.org/wiki/String_literal string literal] or an anonymous string.<sup id="cite_ref-1">[https://en.wikipedia.org/wiki/String_(computer_science)#cite_note-1 [1]]</sup> | ||
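The points above about literals, mutability, and character encoding can be seen in a few lines of Python, chosen here only as an example language. Python strings are fixed after creation, so the sketch also shows a mutable byte array for contrast; the variable names are arbitrary.

```python
# A string literal bound to a variable.
sequence = "AUGGCC"

# Strings are sequences: slicing returns a new string.
first_codon = sequence[:3]

# Python strings are immutable, so in-place assignment raises TypeError.
try:
    sequence[0] = "C"
    mutated_in_place = True
except TypeError:
    mutated_in_place = False

# The stored bytes depend on the character encoding:
# one character can take more than one byte in UTF-8.
encoded = "é".encode("utf-8")

# A bytearray is a mutable sequence of bytes.
buffer = bytearray(b"AUG")
buffer[0] = ord("C")

print(first_codon, mutated_in_place, len(encoded), bytes(buffer))
```

Languages differ on this point: some expose strings as mutable character arrays, so the behavior shown here is specific to Python.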
+ | |||
+ | In [https://en.wikipedia.org/wiki/Formal_language formal languages], which are used in [https://en.wikipedia.org/wiki/Mathematical_logic mathematical logic] and [https://en.wikipedia.org/wiki/Theoretical_computer_science theoretical computer science], a string is a finite sequence of [https://en.wikipedia.org/wiki/Symbol_(formal) symbols] that are chosen from a [https://en.wikipedia.org/wiki/Set_(mathematics) set] called an [https://en.wikipedia.org/wiki/Alphabet_(computer_science) alphabet].<br/> <br/> full text link : [https://en.wikipedia.org/wiki/String_(computer_science) https://en.wikipedia.org/wiki/String_(computer_science)]<br/> <br/> '''in structure'''<br/> '''String''' is a long flexible [https://en.wikipedia.org/wiki/Structure structure] made from [https://en.wikipedia.org/wiki/Fiber fibers] twisted together into a single strand, or from multiple such strands which are in turn twisted together. String is used to tie, bind, or hang other objects. It is also used as a material to make things, such as textiles, and in arts and crafts. String is a simple [https://en.wikipedia.org/wiki/Tool tool], and its use by humans is known to have been developed tens of thousands of years ago.<sup id="cite_ref-Evans_and_Webster_1-0">[https://en.wikipedia.org/wiki/String_(structure)#cite_note-Evans_and_Webster-1 [1]]</sup> In [https://en.wikipedia.org/wiki/Mesoamerica Mesoamerica], for example, string was invented some 20,000 to 30,000 years ago, and was made by twisting plant fibers together.<sup id="cite_ref-Evans_and_Webster_1-1">[https://en.wikipedia.org/wiki/String_(structure)#cite_note-Evans_and_Webster-1 [1]]</sup> String may also be a component in other tools, and in devices as diverse as weapons, musical instruments, and toys.<br/> <br/> full text link : [https://en.wikipedia.org/wiki/String_(structure) https://en.wikipedia.org/wiki/String_(structure)] | ||
+ | |||
+ | <br/> <br/> <br/> <br/> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | == 2024.04.19 == | ||
+ | |||
+ | === P-value === | ||
+ | |||
+ | In [https://en.wikipedia.org/wiki/Statistical_hypothesis_testing null-hypothesis significance testing], the '''𝑝<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/81eac1e205430d1f40810df36a0edffdc367af36>-value'''<sup id="cite_ref-2">[https://en.wikipedia.org/wiki/P-value#cite_note-2 [note 1]]</sup> is the probability of obtaining test results at least as extreme as the [https://en.wikipedia.org/wiki/Realization_(probability) result actually observed], under the assumption that the [https://en.wikipedia.org/wiki/Null_hypothesis null hypothesis] is correct.<sup id="cite_ref-3">[https://en.wikipedia.org/wiki/P-value#cite_note-3 [2]]</sup><sup id="cite_ref-ASA_4-0">[https://en.wikipedia.org/wiki/P-value#cite_note-ASA-4 [3]]</sup> A very small ''p''-value means that such an extreme observed [https://en.wikipedia.org/wiki/Outcome_(probability) outcome] would be very unlikely under the null hypothesis. Even though reporting ''p''-values of statistical tests is common practice in [https://en.wikipedia.org/wiki/Academic_publishing academic publications] of many quantitative fields, misinterpretation and [https://en.wikipedia.org/wiki/Misuse_of_p-values misuse of p-values] are widespread and have been a major topic in mathematics and [https://en.wikipedia.org/wiki/Metascience metascience].<sup id="cite_ref-5">[https://en.wikipedia.org/wiki/P-value#cite_note-5 [4]]</sup><sup id="cite_ref-6">[https://en.wikipedia.org/wiki/P-value#cite_note-6 [5]]</sup> In 2016, the American Statistical Association (ASA) made a formal statement that "''p''-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone" and that "a ''p''-value, or statistical significance, does not measure the size of an effect or the importance of a result" or "evidence regarding a model or hypothesis".<sup id="cite_ref-7">[https://en.wikipedia.org/wiki/P-value#cite_note-7 [6]]</sup> That said, a 2019 ASA task force issued a statement on statistical significance and replicability, concluding with: "''p''-values and significance tests, when properly applied and interpreted, increase the rigor of the conclusions drawn from data".<sup id="cite_ref-ASA2019_8-0">[https://en.wikipedia.org/wiki/P-value#cite_note-ASA2019-8 [7]]</sup><br/> <br/> | ||
+ | |||
+ | In statistics, every conjecture concerning the unknown [https://en.wikipedia.org/wiki/Probability_distribution probability distribution] of a collection of random variables representing the observed data 𝑋<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/68baa052181f707c662844a465bfeeb135e82bab> in some study is called a ''statistical hypothesis''. If we state one hypothesis only and the aim of the statistical test is to see whether this hypothesis is tenable, but not to investigate other specific hypotheses, then such a test is called a [https://en.wikipedia.org/wiki/Statistical_hypothesis_testing null hypothesis test]. | ||
+ | |||
+ | As our statistical hypothesis will, by definition, state some property of the distribution, the [https://en.wikipedia.org/wiki/Null_hypothesis null hypothesis] is the default hypothesis under which that property does not exist. The null hypothesis is typically that some parameter (such as a correlation or a difference between means) in the populations of interest is zero. Our hypothesis might specify the probability distribution of 𝑋<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/68baa052181f707c662844a465bfeeb135e82bab> precisely, or it might only specify that it belongs to some class of distributions. Often, we reduce the data to a single numerical statistic, e.g., 𝑇<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/ec7200acd984a1d3a3d7dc455e262fbe54f7f6e0>, whose marginal probability distribution is closely connected to a main question of interest in the study. | ||
+ | |||
+ | The ''p''-value is used in the context of null hypothesis testing in order to quantify the [https://en.wikipedia.org/wiki/Statistical_significance statistical significance] of a result, the result being the observed value of the chosen statistic 𝑇<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/ec7200acd984a1d3a3d7dc455e262fbe54f7f6e0>.<sup id="cite_ref-9">[https://en.wikipedia.org/wiki/P-value#cite_note-9 [note 2]]</sup> The lower the ''p''-value is, the lower the probability of getting that result if the null hypothesis were true. A result is said to be ''statistically significant'' if it allows us to reject the null hypothesis. All other things being equal, smaller ''p''-values are taken as stronger evidence against the null hypothesis. | ||
+ | |||
+ | Loosely speaking, rejection of the null hypothesis implies that there is sufficient evidence against it. | ||
+ | |||
+ | As a particular example, if a null hypothesis states that a certain summary statistic 𝑇<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/ec7200acd984a1d3a3d7dc455e262fbe54f7f6e0> follows the standard [https://en.wikipedia.org/wiki/Normal_distribution normal distribution] 𝑁(0,1),<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/56a1569ab3f65b7d30a222662aa537e7e4965344> then the rejection of this null hypothesis could mean that (i) the mean of 𝑇<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/ec7200acd984a1d3a3d7dc455e262fbe54f7f6e0> is not 0, or (ii) the [https://en.wikipedia.org/wiki/Variance variance] of 𝑇<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/ec7200acd984a1d3a3d7dc455e262fbe54f7f6e0> is not 1, or (iii) 𝑇<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/ec7200acd984a1d3a3d7dc455e262fbe54f7f6e0> is not normally distributed. Different tests of the same null hypothesis would be more or less sensitive to different alternatives. However, even if we do manage to reject the null hypothesis for all 3 alternatives, and even if we know that the distribution is normal and variance is 1, the null hypothesis test does not tell us which non-zero values of the mean are now most plausible. The more independent observations from the same probability distribution one has, the more accurate the test will be, and the higher the precision with which one will be able to determine the mean value and show that it is not equal to zero; but this will also increase the importance of evaluating the real-world or scientific relevance of this deviation.<br/> <br/> full text link : [https://en.wikipedia.org/wiki/P-value https://en.wikipedia.org/wiki/P-value] | ||
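The standard-normal example above can be made concrete with a short Python sketch. It assumes a two-sided test (both tails count as "at least as extreme"), which is a convention chosen here for illustration; the function name <code>two_sided_p_value</code> is hypothetical and not taken from the source.

```python
import math


def two_sided_p_value(t_obs: float) -> float:
    """Return P(|T| >= |t_obs|) for T ~ N(0, 1) under the null hypothesis.

    Assumption: a two-sided test, so both tails are counted.
    For a standard normal, P(|T| >= a) = erfc(a / sqrt(2)).
    """
    return math.erfc(abs(t_obs) / math.sqrt(2.0))


if __name__ == "__main__":
    # Larger |t| means a more extreme result and therefore a smaller p-value.
    for t in (0.0, 1.0, 1.96, 3.0):
        print(f"t = {t:4.2f}  p = {two_sided_p_value(t):.4f}")
```

A small value returned here only says that the observed statistic would be unusual if the null hypothesis held; as the text stresses, it does not say which alternative value of the mean is most plausible.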
+ | |||
+ | | ||
+ | |||
+ | === Log === | ||
+ | |||
+ | In [https://en.wikipedia.org/wiki/Mathematics mathematics], the '''logarithm''' is the [https://en.wikipedia.org/wiki/Inverse_function inverse function] to [https://en.wikipedia.org/wiki/Exponentiation exponentiation]. That means that the logarithm of a number x to the '''base''' b is the [https://en.wikipedia.org/wiki/Exponent exponent] to which b must be raised to produce x. For example, since 1000 = 10<sup>3</sup>, the ''logarithm base'' 10<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/4ec811eb07dcac7ea67b413c5665390a1671ecb0> of 1000 is 3, or log<sub>10</sub> (1000) = 3. The logarithm of x to ''base'' b is denoted as log<sub>''b''</sub> (''x''), or without parentheses, log<sub>''b''</sub> ''x''. When the base is clear from the context or is irrelevant, such as in [https://en.wikipedia.org/wiki/Big_O_notation big O notation], it is sometimes written log ''x''. | ||
+ | |||
+ | The logarithm base 10 is called the ''decimal'' or [https://en.wikipedia.org/wiki/Common_logarithm ''common'' logarithm] and is commonly used in science and engineering. The [https://en.wikipedia.org/wiki/Natural_logarithm ''natural'' logarithm] has the number [https://en.wikipedia.org/wiki/E_(mathematical_constant) ''e'' ≈ 2.718] as its base; its use is widespread in mathematics and [https://en.wikipedia.org/wiki/Physics physics], because of its very simple [https://en.wikipedia.org/wiki/Derivative derivative]. The [https://en.wikipedia.org/wiki/Binary_logarithm ''binary'' logarithm] uses base 2 and is frequently used in [https://en.wikipedia.org/wiki/Computer_science computer science]. | ||
+ | |||
+ | Logarithms were introduced by [https://en.wikipedia.org/wiki/John_Napier John Napier] in 1614 as a means of simplifying calculations.<sup id="cite_ref-1">[https://en.wikipedia.org/wiki/Logarithm#cite_note-1 [1]]</sup> They were rapidly adopted by navigators, scientists, engineers, [https://en.wikipedia.org/wiki/Surveying surveyors], and others to perform high-accuracy computations more easily. Using [https://en.wikipedia.org/wiki/Mathematical_table#Tables_of_logarithms logarithm tables], tedious multi-digit multiplication steps can be replaced by table look-ups and simpler addition. This is possible because the logarithm of a [https://en.wikipedia.org/wiki/Product_(mathematics) product] is the [https://en.wikipedia.org/wiki/Summation sum] of the logarithms of the factors: log<sub>''b''</sub> (''xy'') = log<sub>''b''</sub> ''x'' + log<sub>''b''</sub> ''y'',<br/> <img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/72599165912508b07108f2a840898022ed126148><br/> provided that ''b'', ''x'' and ''y'' are all positive and ''b'' ≠ 1. The [https://en.wikipedia.org/wiki/Slide_rule slide rule], also based on logarithms, allows quick calculations without tables, but at lower precision. The present-day notion of logarithms comes from [https://en.wikipedia.org/wiki/Leonhard_Euler Leonhard Euler], who connected them to the [https://en.wikipedia.org/wiki/Exponential_function exponential function] in the 18th century, and who also introduced the letter ''e'' as the base of natural logarithms.<sup id="cite_ref-2">[https://en.wikipedia.org/wiki/Logarithm#cite_note-2 [2]]</sup> | ||
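The product rule described above is what made logarithm tables useful: a multiplication becomes an addition followed by one inverse look-up. The following Python sketch illustrates the idea; the helper names <code>log_base</code> and <code>multiply_via_logs</code> are made up for this example, and floating-point arithmetic stands in for the printed tables.

```python
import math


def log_base(x: float, b: float) -> float:
    """Logarithm of x to base b, defined for x > 0, b > 0 and b != 1."""
    if x <= 0 or b <= 0 or b == 1:
        raise ValueError("need x > 0, b > 0 and b != 1")
    return math.log(x) / math.log(b)


def multiply_via_logs(x: float, y: float, b: float = 10.0) -> float:
    """Multiply x and y by adding their logarithms, as with a log table.

    Uses log_b(x * y) = log_b(x) + log_b(y), then raises b to that sum.
    """
    return b ** (log_base(x, b) + log_base(y, b))


if __name__ == "__main__":
    print(log_base(1000, 10))             # close to 3, up to rounding
    print(multiply_via_logs(123.0, 456.0))  # close to 123 * 456
```

Results agree with direct multiplication only up to floating-point rounding, which mirrors the limited precision of tables and slide rules mentioned in the text.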
+ | |||
+ | [https://en.wikipedia.org/wiki/Logarithmic_scale Logarithmic scales] reduce wide-ranging quantities to smaller scopes. For example, the [https://en.wikipedia.org/wiki/Decibel decibel] (dB) is a [https://en.wikipedia.org/wiki/Units_of_measurement unit] used to express [https://en.wikipedia.org/wiki/Level_(logarithmic_quantity) ratio as logarithms], mostly for signal power and amplitude (of which [https://en.wikipedia.org/wiki/Sound_pressure sound pressure] is a common example). In chemistry, [https://en.wikipedia.org/wiki/PH pH] is a logarithmic measure for the [https://en.wikipedia.org/wiki/Acid acidity] of an [https://en.wikipedia.org/wiki/Aqueous_solution aqueous solution]. Logarithms are commonplace in scientific [https://en.wikipedia.org/wiki/Formula formulae], and in measurements of the [https://en.wikipedia.org/wiki/Computational_complexity_theory complexity of algorithms] and of geometric objects called [https://en.wikipedia.org/wiki/Fractal fractals]. They help to describe [https://en.wikipedia.org/wiki/Frequency frequency] ratios of [https://en.wikipedia.org/wiki/Interval_(music) musical intervals], appear in formulas counting [https://en.wikipedia.org/wiki/Prime_number prime numbers] or [https://en.wikipedia.org/wiki/Stirling's_approximation approximating] [https://en.wikipedia.org/wiki/Factorial factorials], inform some models in [https://en.wikipedia.org/wiki/Psychophysics psychophysics], and can aid in [https://en.wikipedia.org/wiki/Forensic_accounting forensic accounting]. | ||
+ | |||
+ | The concept of logarithm as the inverse of exponentiation extends to other mathematical structures as well. However, in general settings, the logarithm tends to be a multi-valued function. For example, the [https://en.wikipedia.org/wiki/Complex_logarithm complex logarithm] is the multi-valued [https://en.wikipedia.org/wiki/Inverse_function inverse] of the complex exponential function. Similarly, the [https://en.wikipedia.org/wiki/Discrete_logarithm discrete logarithm] is the multi-valued inverse of the exponential function in finite groups; it has uses in [https://en.wikipedia.org/wiki/Public-key_cryptography public-key cryptography].<br/> <br/> full text link : [https://en.wikipedia.org/wiki/Logarithm https://en.wikipedia.org/wiki/Logarithm]<br/> | ||
+ | |||
+ | |||
+ | === Likelihood === | ||
+ | |||
+ | The '''likelihood function''' (often simply called the '''likelihood''') is the [https://en.wikipedia.org/wiki/Joint_probability_distribution joint] [https://en.wikipedia.org/wiki/Probability_mass_function probability mass] (or [https://en.wikipedia.org/wiki/Probability_density_function probability density]) of [https://en.wikipedia.org/wiki/Sample_(statistics) observed data] viewed as a function of the [https://en.wikipedia.org/wiki/Statistical_parameter parameters] of a [https://en.wikipedia.org/wiki/Statistical_model statistical model].<sup id="cite_ref-1">[https://en.wikipedia.org/wiki/Likelihood_function#cite_note-1 [1]]</sup><sup id="cite_ref-2">[https://en.wikipedia.org/wiki/Likelihood_function#cite_note-2 [2]]</sup><sup id="cite_ref-3">[https://en.wikipedia.org/wiki/Likelihood_function#cite_note-3 [3]]</sup> Intuitively, the likelihood function 𝐿(𝜃∣𝑥)<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/24a053912e70a2d35f7037375a39f9f7c3ea72d4> is the probability of observing data 𝑥<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/87f9e315fd7e2ba406057a97300593c4802b53e4> assuming 𝜃<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/6e5ab2664b422d53eb0c7df3b87e1360d75ad9af> is the actual parameter. | ||
+ | |||
+ | In [https://en.wikipedia.org/wiki/Maximum_likelihood_estimation maximum likelihood estimation], the [https://en.wikipedia.org/wiki/Arg_max arg max] (over the parameter 𝜃<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/6e5ab2664b422d53eb0c7df3b87e1360d75ad9af>) of the likelihood function serves as a [https://en.wikipedia.org/wiki/Point_estimation point estimate] for 𝜃<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/6e5ab2664b422d53eb0c7df3b87e1360d75ad9af>, while the [https://en.wikipedia.org/wiki/Fisher_information Fisher information] (often approximated by the likelihood's [https://en.wikipedia.org/wiki/Hessian_matrix Hessian matrix]) indicates the estimate's [https://en.wikipedia.org/wiki/Precision_(statistics) precision]. | ||
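As a minimal illustration of the arg max idea, the Python sketch below evaluates a likelihood on a grid and picks the best parameter. The coin-flip (Bernoulli) model, the data (7 heads in 10 flips) and the function names are assumptions chosen for this example; real analyses usually maximize the log-likelihood with a numerical optimizer instead of a grid.

```python
def bernoulli_likelihood(theta: float, heads: int, flips: int) -> float:
    """L(theta | x): probability of one specific sequence with `heads`
    heads in `flips` independent flips when P(heads) = theta."""
    return theta ** heads * (1.0 - theta) ** (flips - heads)


def grid_mle(heads: int, flips: int, steps: int = 1000) -> float:
    """Approximate the maximum likelihood estimate by a grid search
    over theta in [0, 1] (a crude stand-in for a proper optimizer)."""
    grid = [i / steps for i in range(steps + 1)]
    return max(grid, key=lambda t: bernoulli_likelihood(t, heads, flips))


if __name__ == "__main__":
    # The data are held fixed; only theta varies.
    print(grid_mle(heads=7, flips=10))
```

For this model the maximizer coincides with the observed fraction of heads. Note that the likelihood values over the grid do not sum to one: the function is not a probability distribution over the parameter.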
+ | |||
+ | In contrast, in [https://en.wikipedia.org/wiki/Bayesian_statistics Bayesian statistics], parameter estimates are derived from the ''converse'' of the likelihood, the so-called [https://en.wikipedia.org/wiki/Posterior_probability posterior probability], which is calculated via [https://en.wikipedia.org/wiki/Bayes'_theorem Bayes' rule].<sup id="cite_ref-4">[https://en.wikipedia.org/wiki/Likelihood_function#cite_note-4 [4]]</sup><br/> | ||
+ | |||
+ | The likelihood function, parameterized by a (possibly multivariate) parameter 𝜃<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/6e5ab2664b422d53eb0c7df3b87e1360d75ad9af>, is usually defined differently for [https://en.wikipedia.org/wiki/Continuous_or_discrete_variable discrete and continuous] [https://en.wikipedia.org/wiki/Probability_distribution probability distributions] (a more general definition is discussed below). Given a probability density or mass function | ||
+ | |||
+ | 𝑥↦𝑓(𝑥∣𝜃),<br/> <img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/442aa0f5b4796ef4a698a7e60aeb5006c8f020f2> | ||
+ | |||
+ | where 𝑥<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/87f9e315fd7e2ba406057a97300593c4802b53e4> is a realization of the random variable 𝑋<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/68baa052181f707c662844a465bfeeb135e82bab>, the likelihood function is 𝜃↦𝑓(𝑥∣𝜃),<br/> <img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/e161b494cb41c7ddbb8d496ece959b776baba128><br/> often written<br/> 𝐿(𝜃∣𝑥).<br/> <img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/487868b15b5aaccd5bf67e86c197d68f37fadc8f> | ||
+ | |||
+ | In other words, when 𝑓(𝑥∣𝜃)<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/0f01a2e70b1a8595be545c42562f00820bbff06d> is viewed as a function of 𝑥<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/87f9e315fd7e2ba406057a97300593c4802b53e4> with 𝜃<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/6e5ab2664b422d53eb0c7df3b87e1360d75ad9af> fixed, it is a probability density function, and when viewed as a function of 𝜃<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/6e5ab2664b422d53eb0c7df3b87e1360d75ad9af> with 𝑥<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/87f9e315fd7e2ba406057a97300593c4802b53e4> fixed, it is a likelihood function. In the [https://en.wikipedia.org/wiki/Frequentist_probability frequentist paradigm], the notation 𝑓(𝑥∣𝜃)<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/0f01a2e70b1a8595be545c42562f00820bbff06d> is often avoided and instead 𝑓(𝑥;𝜃)<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/79480c3540803bdda2613d69277692e1061ad7d5> or 𝑓(𝑥,𝜃)<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/3e3b8aafdf0be69fcd09cdb756b9c5aa2fd8c777> are used to indicate that 𝜃<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/6e5ab2664b422d53eb0c7df3b87e1360d75ad9af> is regarded as a fixed unknown quantity rather than as a [https://en.wikipedia.org/wiki/Random_variable random variable] being conditioned on. | ||
+ | |||
+ | The likelihood function does ''not'' specify the probability that 𝜃<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/6e5ab2664b422d53eb0c7df3b87e1360d75ad9af> is the truth, given the observed sample 𝑋=𝑥<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/0661396d873679039ffe8e908a39f02402d4912d>. Such an interpretation is a common error, with potentially disastrous consequences (see [https://en.wikipedia.org/wiki/Prosecutor's_fallacy prosecutor's fallacy]).<br/> <br/> full text link : [https://en.wikipedia.org/wiki/Likelihood_function https://en.wikipedia.org/wiki/Likelihood_function] | ||
+ | |||
+ | | ||
+ | |||
+ | |||
+ | |||
+ | === E-value === | ||
+ | |||
+ | In [https://en.wikipedia.org/wiki/Statistical_hypothesis_testing statistical hypothesis testing], '''e-values''' quantify the evidence in the data against a [https://en.wikipedia.org/wiki/Null_hypothesis null hypothesis] (e.g., "the coin is fair", or, in a medical context, "this new treatment has no effect"). They serve as a more robust alternative to [https://en.wikipedia.org/wiki/P-value p-values], addressing some shortcomings of the latter. | ||
+ | |||
+ | In contrast to p-values, e-values can deal with optional continuation: e-values of subsequent experiments (e.g. clinical trials concerning the same treatment) may simply be multiplied to provide a new, "product" e-value that represents the evidence in the joint experiment. This works even if, as often happens in practice, the decision to perform later experiments may depend in vague, unknown ways on the data observed in earlier experiments, and it is not known beforehand how many trials will be conducted: the product e-value remains a meaningful quantity, leading to tests with [https://en.wikipedia.org/wiki/Type_I_and_type_II_errors Type-I error control]. For this reason, e-values and their sequential extension, the ''e-process'', are the fundamental building blocks for anytime-valid statistical methods (e.g. confidence sequences). Another advantage over p-values is that any weighted average of e-values remains an e-value, even if the individual e-values are arbitrarily dependent. This is one of the reasons why e-values have also turned out to be useful tools in [https://en.wikipedia.org/wiki/Multiple_comparisons_problem multiple testing].<sup id="cite_ref-1">[https://en.wikipedia.org/wiki/E-values#cite_note-1 [1]]</sup> | ||
+ | |||
+ | E-values can be interpreted in a number of different ways: first, the reciprocal of any e-value is itself a p-value, but a special, conservative one, quite different from p-values used in practice. Second, they are broad generalizations of [https://en.wikipedia.org/wiki/Likelihood_function likelihood ratios] and are also related to, yet distinct from, [https://en.wikipedia.org/wiki/Bayes_factors Bayes factors]. Third, they have an interpretation as bets. Finally, in a sequential context, they can also be interpreted as increments of nonnegative [https://en.wikipedia.org/wiki/Martingale_(probability_theory) supermartingales]. Interest in e-values has exploded since 2019, when the term 'e-value' was coined and a number of breakthrough results were achieved by several research groups. The first overview article appeared in 2023.<sup id="cite_ref-:2_2-0">[https://en.wikipedia.org/wiki/E-values#cite_note-:2-2 [2]]</sup><br/> <br/> | ||
+ | |||
+ | Let the [https://en.wikipedia.org/wiki/Statistical_hypothesis_testing null hypothesis] 𝐻<sub>0</sub><img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/43910602a221b7a4c373791f94793e3008622070> be given as a set of distributions for data 𝑌<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/961d67d6b454b4df2301ac571808a3538b3a6d3f>. Usually 𝑌=(𝑋<sub>1</sub>,…,𝑋<sub>𝜏</sub>)<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/232436e83a6551876d9ea98a759d4e9681e80975> with each 𝑋<sub>𝑖</sub><img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/af4a0955af42beb5f85aa05fb8c07abedc13990d> a single outcome and 𝜏<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/38a7dcde9730ef0853809fefc18d88771f95206c> a fixed sample size or some stopping time. We shall refer to such 𝑌<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/961d67d6b454b4df2301ac571808a3538b3a6d3f>, which represent the full sequence of outcomes of a statistical experiment, as a ''sample'' or ''batch of outcomes.'' But in some cases 𝑌<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/961d67d6b454b4df2301ac571808a3538b3a6d3f> may also be an unordered bag of outcomes or a single outcome. | ||
+ | |||
+ | An '''e-variable''' or '''e-statistic''' is a ''nonnegative'' random variable 𝐸=𝐸(𝑌)<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/2dff3e1e32d9281280efa425fc381f4348711c47> such that under all 𝑃∈𝐻<sub>0</sub><img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/975a55140fa8854be1a02c263d0dfdb74edff76a>, its expected value is bounded by 1: | ||
+ | |||
+ | 𝔼<sub>𝑃</sub>[𝐸]≤1<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/1b5decf85768b772fff63e687c162ea50d8c3445>. | ||
+ | |||
+ | The value taken by the e-variable 𝐸<img style="null" src=https://wikimedia.org/api/rest_v1/media/math/render/svg/4232c9de2ee3eec0a9c0a19b15ab92daa6223f9b> is called the '''e-value'''. In practice, the term ''e-value'' (a number) is often used when one is really referring to the underlying e-variable (a random variable, that is, a measurable function of the data).<br/> <br/> full text link : [https://en.wikipedia.org/wiki/E-values https://en.wikipedia.org/wiki/E-values]<br/> | ||
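One simple way to see the definition at work is a likelihood-ratio e-variable for the "the coin is fair" null mentioned earlier. The Python sketch below is an illustration under stated assumptions: the alternative heads probability <code>q = 0.75</code> and the function names are hypothetical choices, not part of the source. It checks by brute-force enumeration that the expected value under the null is 1, and that e-values of separate batches multiply.

```python
from itertools import product


def lr_e_value(flips, q: float = 0.75, p0: float = 0.5) -> float:
    """Likelihood-ratio e-value for coin flips (1 = heads, 0 = tails).

    Null hypothesis: P(heads) = p0. The alternative q is an arbitrary
    choice made for this sketch. Each flip multiplies the running value
    by (probability under q) / (probability under p0).
    """
    e = 1.0
    for x in flips:
        e *= (q if x == 1 else 1.0 - q) / (p0 if x == 1 else 1.0 - p0)
    return e


def expected_e_under_null(n: int, q: float = 0.75, p0: float = 0.5) -> float:
    """Exact expectation of the e-variable under the null, obtained by
    enumerating all 2**n flip sequences (feasible only for small n)."""
    total = 0.0
    for flips in product((0, 1), repeat=n):
        prob = 1.0
        for x in flips:
            prob *= p0 if x == 1 else 1.0 - p0
        total += prob * lr_e_value(flips, q, p0)
    return total


if __name__ == "__main__":
    print(expected_e_under_null(5))        # expectation under the null
    first, second = [1, 1, 0], [1, 1, 1]
    # The e-value of the joint experiment equals the product of the parts.
    print(lr_e_value(first) * lr_e_value(second), lr_e_value(first + second))
```

Because the expectation is at most 1 under the null, Markov's inequality gives a test: rejecting when the e-value is at least 1/α keeps the Type-I error at or below α.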
+ | |||
+ | |||
+ | == 2024.05.03 == | ||
+ | |||
+ | |||
+ | === Tetrahymena === | ||
+ | |||
+ | As a ciliated [https://en.wikipedia.org/wiki/Protozoan protozoan], '''''Tetrahymena thermophila''''' exhibits [https://en.wikipedia.org/wiki/Nuclear_dimorphism nuclear dimorphism]: two types of cell [https://en.wikipedia.org/wiki/Cell_nucleus nuclei]. Each cell has a larger, [https://en.wikipedia.org/wiki/Somatic_cell non-germline] [https://en.wikipedia.org/wiki/Macronucleus macronucleus] and a small, [https://en.wikipedia.org/wiki/Germline germline] [https://en.wikipedia.org/wiki/Micronucleus micronucleus] at the same time, and the two carry out different functions with distinct cytological and biological properties. This unique versatility allows scientists to use ''Tetrahymena'' to identify several key factors regarding [https://en.wikipedia.org/wiki/Gene_expression gene expression] and genome integrity. In addition, ''Tetrahymena'' possesses hundreds of [https://en.wikipedia.org/wiki/Cilia cilia] and has complicated [https://en.wikipedia.org/wiki/Microtubule microtubule] structures, making it an optimal model to illustrate the diversity and functions of microtubule arrays. | ||
+ | |||
+ | Because ''Tetrahymena'' can easily be grown in large quantities in the laboratory, it has been a great source for biochemical analysis for years, specifically for [https://en.wikipedia.org/wiki/Enzyme enzymatic] activities and purification of [https://en.wikipedia.org/wiki/Cell_(biology)#Subcellular_components sub-cellular components]. In addition, with the advancement of genetic techniques it has become an excellent model for studying gene function ''in vivo''. The recent sequencing of the macronucleus genome should ensure that ''Tetrahymena'' will continue to be used as a model system. | ||
+ | |||
+ | ''Tetrahymena thermophila'' exists in 7 different sexes ([https://en.wikipedia.org/wiki/Mating_type mating types]) that can reproduce in 21 different combinations, and a single tetrahymena cannot reproduce sexually with itself. Each organism "decides" which sex it will become during mating, through a [https://en.wikipedia.org/wiki/Stochastic stochastic] process.<sup id="cite_ref-PLOS2013_5-0">[https://en.wikipedia.org/wiki/Tetrahymena#cite_note-PLOS2013-5 [5]]</sup><sup id="cite_ref-6">[https://en.wikipedia.org/wiki/Tetrahymena#cite_note-6 [6]]</sup> | ||
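The figure of 21 combinations follows from counting unordered pairs of distinct mating types, since a cell cannot mate with its own type: 7 × 6 / 2 = 21. A short Python check, with the labels I to VII used here only as placeholders:

```python
from itertools import combinations

# Placeholder labels for the seven mating types.
MATING_TYPES = ["I", "II", "III", "IV", "V", "VI", "VII"]


def compatible_pairs(types):
    """All unordered pairs of two different mating types.

    Pairing a type with itself is excluded, matching the statement
    that a cell cannot reproduce sexually with its own type.
    """
    return list(combinations(types, 2))


if __name__ == "__main__":
    pairs = compatible_pairs(MATING_TYPES)
    print(len(pairs))   # number of possible pairings
    print(pairs[:3])    # a few example pairings
```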
+ | |||
+ | Studies on ''Tetrahymena'' have contributed to several scientific milestones including: | ||
+ | |||
+ | #First cell which showed synchronized division, which led to the first insights into the existence of mechanisms which control the [https://en.wikipedia.org/wiki/Cell_cycle cell cycle].<sup id="cite_ref-whitepaper_7-0">[https://en.wikipedia.org/wiki/Tetrahymena#cite_note-whitepaper-7 [7]]</sup> | ||
+ | #Identification and purification of the first [https://en.wikipedia.org/wiki/Cytoskeleton cytoskeleton]-based [https://en.wikipedia.org/wiki/Motor_protein motor protein], ''[https://en.wikipedia.org/wiki/Dynein dynein]''.<sup id="cite_ref-whitepaper_7-1">[https://en.wikipedia.org/wiki/Tetrahymena#cite_note-whitepaper-7 [7]]</sup> | ||
+ | #Aid in the discovery of ''[https://en.wikipedia.org/wiki/Lysosomes lysosomes]'' and ''[https://en.wikipedia.org/wiki/Peroxisomes peroxisomes]''.<sup id="cite_ref-whitepaper_7-2">[https://en.wikipedia.org/wiki/Tetrahymena#cite_note-whitepaper-7 [7]]</sup> | ||
+ | #Early molecular identification of somatic genome rearrangement.<sup id="cite_ref-whitepaper_7-3">[https://en.wikipedia.org/wiki/Tetrahymena#cite_note-whitepaper-7 [7]]</sup> | ||
+ | #Discovery of the molecular structure of ''[https://en.wikipedia.org/wiki/Telomeres telomeres]'', ''[https://en.wikipedia.org/wiki/Telomerase telomerase]'' enzyme, the templating role of telomerase RNA and their roles in cellular senescence and chromosome healing (for which a Nobel Prize was won).<sup id="cite_ref-whitepaper_7-4">[https://en.wikipedia.org/wiki/Tetrahymena#cite_note-whitepaper-7 [7]]</sup> | ||
+ | #Nobel Prize–winning co-discovery (1989, in Chemistry) of catalytic [https://en.wikipedia.org/wiki/RNA RNA] (''[https://en.wikipedia.org/wiki/Ribozymes ribozyme]'').<sup id="cite_ref-whitepaper_7-5">[https://en.wikipedia.org/wiki/Tetrahymena#cite_note-whitepaper-7 [7]]</sup><sup id="cite_ref-8">[https://en.wikipedia.org/wiki/Tetrahymena#cite_note-8 [8]]</sup> | ||
+ | #Discovery of the function of [https://en.wikipedia.org/wiki/Histone histone] [https://en.wikipedia.org/wiki/Acetylation acetylation].<sup id="cite_ref-whitepaper_7-6">[https://en.wikipedia.org/wiki/Tetrahymena#cite_note-whitepaper-7 [7]]</sup> | ||
+ | #Demonstration of the roles of [https://en.wikipedia.org/wiki/Posttranslational_modification posttranslational modification] such as acetylation and glycylation on [https://en.wikipedia.org/wiki/Tubulins tubulins] and discovery of the enzymes responsible for some of these modifications (glutamylation) | ||
+ | #Crystal structure of 40S ribosome in complex with its initiation factor eIF1 | ||
+ | #First demonstration that two of the "universal" [https://en.wikipedia.org/wiki/Stop_codon stop codons], UAA and UAG, will code for the amino acid [https://en.wikipedia.org/wiki/Glutamine glutamine] in some eukaryotes, leaving UGA as the only termination codon in these organisms.<sup id="cite_ref-9">[https://en.wikipedia.org/wiki/Tetrahymena#cite_note-9 [9]]</sup> | ||
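The last milestone in the list above, the reassignment of two stop codons, can be sketched in Python. The codon tables below are deliberately partial (only the codons needed for the demonstration), the RNA string is invented, and the names <code>STANDARD</code>, <code>CILIATE</code> and <code>translate</code> are hypothetical.

```python
# Partial codon tables: "*" marks a stop codon. Only the codons used in
# this demonstration are listed; a real table has 64 entries.
STANDARD = {
    "AUG": "M",  # methionine (start)
    "CAA": "Q",  # glutamine
    "CAG": "Q",  # glutamine
    "UAA": "*",
    "UAG": "*",
    "UGA": "*",
}

# In the ciliate code, UAA and UAG are read as glutamine and UGA is the
# only remaining termination codon.
CILIATE = dict(STANDARD, UAA="Q", UAG="Q")


def translate(rna: str, table: dict) -> str:
    """Translate an RNA string codon by codon until a stop codon.

    Raises KeyError for codons missing from the partial table.
    """
    protein = []
    for i in range(0, len(rna) - 2, 3):
        amino_acid = table[rna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    message = "AUGUAACAGUGA"  # made-up sequence for illustration
    print(translate(message, STANDARD))  # stops at UAA
    print(translate(message, CILIATE))   # reads through UAA, stops at UGA
```

The same message therefore yields a longer product under the ciliate code, because UAA no longer terminates translation.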
+ | |||
+ | full text link : [https://en.wikipedia.org/wiki/Tetrahymena https://en.wikipedia.org/wiki/Tetrahymena]<br/> | ||
+ | |||
+ | |||
+ | |||
+ | === Telomere === | ||
+ | |||
+ | A '''telomere''' ([https://en.wikipedia.org/wiki/Help:IPA/English /ˈtɛləmɪər, ˈtiːlə-/]; from [https://en.wikipedia.org/wiki/Ancient_Greek_language Ancient Greek] [https://en.wiktionary.org/wiki/τέλος#Ancient_Greek τέλος]'' (''télos'')'' 'end', and [https://en.wiktionary.org/wiki/μέρος#Ancient_Greek μέρος]'' (''méros'')'' 'part') is a region of repetitive [https://en.wikipedia.org/wiki/Nucleotide nucleotide] sequences associated with specialized proteins at the ends of linear [https://en.wikipedia.org/wiki/Chromosome chromosomes] (see [https://en.wikipedia.org/wiki/Telomere#Sequences Sequences]). Telomeres are a widespread genetic feature most commonly found in [https://en.wikipedia.org/wiki/Eukaryote eukaryotes]. In most, if not all species possessing them, they protect the terminal regions of [https://en.wikipedia.org/wiki/DNA chromosomal DNA] from progressive degradation and ensure the integrity of linear chromosomes by preventing [https://en.wikipedia.org/wiki/DNA_repair DNA repair] systems from mistaking the very ends of the DNA strand for a [https://en.wikipedia.org/wiki/Double-strand_break double-strand break].<br/> | ||
+ | |||
+ | The existence of a special structure at the ends of chromosomes was independently proposed in 1938 by [https://en.wikipedia.org/wiki/Hermann_Joseph_Muller Hermann Joseph Muller], studying the fruit fly ''[https://en.wikipedia.org/wiki/Drosophila_melanogaster Drosophila melanogaster]'', and in 1939 by [https://en.wikipedia.org/wiki/Barbara_McClintock Barbara McClintock], working with maize.<sup id="cite_ref-1">[https://en.wikipedia.org/wiki/Telomere#cite_note-1 [1]]</sup> Muller observed that the ends of irradiated fruit fly chromosomes did not present alterations such as deletions or inversions. He hypothesized the presence of a protective cap, which he coined "telomeres", from the Greek ''telos'' (end) and ''meros'' (part).<sup id="cite_ref-2">[https://en.wikipedia.org/wiki/Telomere#cite_note-2 [2]]</sup> | ||
+ | |||
+ | In the early 1970s, Soviet theorist [https://en.wikipedia.org/wiki/Alexei_Olovnikov Alexei Olovnikov] first recognized that chromosomes could not completely replicate their ends; this is known as the "end replication problem". Building on this, and accommodating [https://en.wikipedia.org/wiki/Leonard_Hayflick Leonard Hayflick]'s idea of limited [https://en.wikipedia.org/wiki/Somatic_cell somatic cell] division, Olovnikov suggested that DNA sequences are lost every time a cell replicates until the loss reaches a critical level, at which point cell division ends.<sup id="cite_ref-3">[https://en.wikipedia.org/wiki/Telomere#cite_note-3 [3]]</sup><sup id="cite_ref-4">[https://en.wikipedia.org/wiki/Telomere#cite_note-4 [4]]</sup><sup id="cite_ref-5">[https://en.wikipedia.org/wiki/Telomere#cite_note-5 [5]]</sup> According to his theory of marginotomy, DNA sequences at the ends of telomeres are represented by tandem repeats, which create a buffer that determines the number of divisions that a certain cell clone can undergo. Furthermore, it was predicted that a specialized DNA polymerase (originally called a tandem-DNA-polymerase) could extend telomeres in immortal tissues such as the germ line, cancer cells and stem cells. It also followed from this hypothesis that organisms with a circular genome, such as bacteria, do not have the end replication problem and therefore do not age. | ||
+ | |||
+ | In 1975–1977, [https://en.wikipedia.org/wiki/Elizabeth_Blackburn Elizabeth Blackburn], working as a postdoctoral fellow at [https://en.wikipedia.org/wiki/Yale_University Yale University] with [https://en.wikipedia.org/wiki/Joseph_G._Gall Joseph G. Gall], discovered the unusual nature of telomeres, with their simple repeated DNA sequences composing chromosome ends.<sup id="cite_ref-:0_6-0">[https://en.wikipedia.org/wiki/Telomere#cite_note-:0-6 [6]]</sup> Blackburn, [https://en.wikipedia.org/wiki/Carol_Greider Carol Greider], and [https://en.wikipedia.org/wiki/Jack_Szostak Jack Szostak] were awarded the [https://en.wikipedia.org/wiki/List_of_Nobel_laureates_in_Physiology_or_Medicine#2001–current 2009] [https://en.wikipedia.org/wiki/Nobel_Prize_in_Physiology_or_Medicine Nobel Prize in Physiology or Medicine] for the discovery of how chromosomes are protected by telomeres and the [https://en.wikipedia.org/wiki/Enzyme enzyme] [https://en.wikipedia.org/wiki/Telomerase telomerase].<sup id="cite_ref-7">[https://en.wikipedia.org/wiki/Telomere#cite_note-7 [7]]</sup><br/> <br/> | ||
+ | |||
+ | During DNA replication, [https://en.wikipedia.org/wiki/DNA_polymerase DNA polymerase] cannot replicate the sequences present at the [https://en.wikipedia.org/wiki/Directionality_(molecular_biology) 3' ends] of the parent strands. This is a consequence of its unidirectional mode of DNA synthesis: it can only attach new nucleotides to an existing 3'-end (that is, synthesis progresses 5'-3') and thus it requires a [https://en.wikipedia.org/wiki/Primer_(molecular_biology) primer] to initiate replication. On the leading strand (oriented 5'-3' within the replication fork), DNA-polymerase continuously replicates from the point of initiation all the way to the strand's end with the primer (made of [https://en.wikipedia.org/wiki/RNA RNA]) then being excised and substituted by DNA. The lagging strand, however, is oriented 3'-5' with respect to the replication fork so continuous replication by DNA-polymerase is impossible, which necessitates discontinuous replication involving the repeated synthesis of primers further 5' of the site of initiation (see [https://en.wikipedia.org/wiki/Lagging_strand lagging strand replication]). The last primer to be involved in lagging-strand replication sits near the 3'-end of the template (corresponding to the potential 5'-end of the lagging-strand). 
Originally it was believed that the last primer would sit at the very end of the template; thus, once removed, the DNA-polymerase that substitutes primers with DNA (DNA-Pol δ in eukaryotes)<sup id="cite_ref-note1_8-0">[https://en.wikipedia.org/wiki/Telomere#cite_note-note1-8 [note 1]]</sup> would be unable to synthesize the "replacement DNA" from the 5'-end of the lagging strand, so that the template nucleotides previously paired to the last primer would not be replicated.<sup id="cite_ref-9">[https://en.wikipedia.org/wiki/Telomere#cite_note-9 [8]]</sup> It has since been questioned whether the last lagging strand primer is placed exactly at the 3'-end of the template, and it was demonstrated that it is instead synthesized at a distance of about 70–100 nucleotides from the end, which is consistent with the finding that DNA in cultured human cells is shortened by 50–100 [https://en.wikipedia.org/wiki/Base_pair base pairs] per [https://en.wikipedia.org/wiki/Cell_division cell division].<sup id="cite_ref-10">[https://en.wikipedia.org/wiki/Telomere#cite_note-10 [9]]</sup> | ||
+ | |||
+ | If coding sequences are degraded in this process, potentially vital genetic code would be lost. Telomeres are non-coding, repetitive sequences located at the termini of linear chromosomes to act as buffers for those coding sequences further behind. They "cap" the end-sequences and are progressively degraded in the process of DNA replication. | ||
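The buffering role described above can be illustrated with a small calculation. The following is only a sketch: the 50–100 base pairs lost per division come from the text, while the starting telomere length, the critical length, and the function name are hypothetical values chosen for illustration.

```python
def divisions_until_critical(initial_bp, critical_bp, loss_per_division_bp):
    """Count how many divisions fit before the telomere would drop below critical_bp.

    All three arguments are in base pairs. The model is deliberately simple:
    a fixed loss per division and no telomerase activity.
    """
    if loss_per_division_bp <= 0:
        raise ValueError("loss_per_division_bp must be positive")
    length = initial_bp
    divisions = 0
    # Keep dividing while one more division still leaves at least critical_bp.
    while length - loss_per_division_bp >= critical_bp:
        length -= loss_per_division_bp
        divisions += 1
    return divisions


# Hypothetical lengths (not from the source): 10,000 bp at start, 5,000 bp critical.
slow_loss = divisions_until_critical(10_000, 5_000, 50)    # 50 bp lost per division
fast_loss = divisions_until_critical(10_000, 5_000, 100)   # 100 bp lost per division
print(slow_loss, fast_loss)
```

Under these assumed lengths, halving the loss per division doubles the number of divisions the buffer can absorb, which is the sense in which telomere length limits how often a cell clone can divide.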
+ | |||
+ | The "end replication problem" is exclusive to linear chromosomes as circular chromosomes do not have ends lying without reach of DNA-polymerases. Most [https://en.wikipedia.org/wiki/Prokaryote prokaryotes], relying on circular chromosomes, accordingly do not possess telomeres.<sup id="cite_ref-11">[https://en.wikipedia.org/wiki/Telomere#cite_note-11 [10]]</sup> A small fraction of [https://en.wikipedia.org/wiki/Bacteria bacterial] chromosomes (such as those in ''[https://en.wikipedia.org/wiki/Streptomyces Streptomyces]'', ''[https://en.wikipedia.org/wiki/Agrobacterium Agrobacterium]'', and ''[https://en.wikipedia.org/wiki/Borrelia Borrelia]''), however, are linear and possess telomeres, which are very different from those of the eukaryotic chromosomes in structure and function. The known structures of bacterial telomeres take the form of [https://en.wikipedia.org/wiki/Proteins proteins] bound to the ends of linear chromosomes, or hairpin loops of single-stranded DNA at the ends of the linear chromosomes.<sup id="cite_ref-12">[https://en.wikipedia.org/wiki/Telomere#cite_note-12 [11]]</sup><br/> | ||
+ | |||
+ | === Telomerase === | ||
+ | |||
+ | '''Telomerase''', also called '''terminal transferase''',<sup id="cite_ref-1">[https://en.wikipedia.org/wiki/Telomerase#cite_note-1 [1]]</sup> is a [https://en.wikipedia.org/wiki/Ribonucleoprotein ribonucleoprotein] that adds a species-dependent [https://en.wikipedia.org/wiki/Telomere#Sequences telomere repeat sequence] to the [https://en.wikipedia.org/wiki/Directionality_(molecular_biology)#3.27-end 3'] end of [https://en.wikipedia.org/wiki/Telomere telomeres]. A telomere is a region of repetitive [https://en.wikipedia.org/wiki/Sequence_(biology) sequences] at each end of the [https://en.wikipedia.org/wiki/Chromosome chromosomes] of most [https://en.wikipedia.org/wiki/Eukaryote eukaryotes]. Telomeres protect the end of the chromosome from [https://en.wikipedia.org/wiki/DNA_damage_(naturally_occurring) DNA damage] or from fusion with neighbouring chromosomes. The fruit fly ''[https://en.wikipedia.org/wiki/Drosophila_melanogaster Drosophila melanogaster]'' lacks telomerase, but instead uses [https://en.wikipedia.org/wiki/Retrotransposon retrotransposons] to maintain telomeres.<sup id="cite_ref-pmid21821789_2-0">[https://en.wikipedia.org/wiki/Telomerase#cite_note-pmid21821789-2 [2]]</sup> | ||
+ | |||
+ | Telomerase is a [https://en.wikipedia.org/wiki/Reverse_transcriptase reverse transcriptase] [https://en.wikipedia.org/wiki/Enzyme enzyme] that carries its own [https://en.wikipedia.org/wiki/Telomerase_RNA_component RNA molecule] (e.g., with the sequence 3′-[https://en.wikipedia.org/wiki/Cytosine C]CC[https://en.wikipedia.org/wiki/Adenine A]A[https://en.wikipedia.org/wiki/Uracil U]CCC-5′ in ''[https://en.wikipedia.org/wiki/Trypanosoma_brucei Trypanosoma brucei]'')<sup id="cite_ref-pmid10097086_3-0">[https://en.wikipedia.org/wiki/Telomerase#cite_note-pmid10097086-3 [3]]</sup> which is used as a template when it elongates telomeres. Telomerase is active in [https://en.wikipedia.org/wiki/Gamete gametes] and most [https://en.wikipedia.org/wiki/Cancer cancer] cells, but is normally absent in most [https://en.wikipedia.org/wiki/Somatic_cell somatic cells].<br/> <br/> | ||
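To see how an RNA template determines the DNA that telomerase adds, the template quoted above can be complemented base by base. This is a minimal sketch, not a model of the enzyme: the function and dictionary names are hypothetical, and it assumes the template is written 3′ to 5′, so that the complementary DNA reads 5′ to 3′ in the same left-to-right order.

```python
# Pairing rules when DNA is synthesized on an RNA template (reverse transcription).
RNA_TO_DNA_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def dna_from_rna_template(template_3_to_5):
    """Return the DNA (written 5'->3') copied from an RNA template written 3'->5'."""
    return "".join(RNA_TO_DNA_COMPLEMENT[base] for base in template_3_to_5)


# Template sequence quoted in the text for Trypanosoma brucei: 3'-CCCAAUCCC-5'
product = dna_from_rna_template("CCCAAUCCC")
print(product)  # GGGTTAGGG
```

The product contains the hexamer TTAGGG, so repeated rounds of copying from this template would extend the 3′ end with a tandem repeat.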
+ | |||
+ | The existence of a compensatory mechanism for telomere shortening was first found by Soviet biologist [https://en.wikipedia.org/wiki/Alexey_Olovnikov Alexey Olovnikov] in 1973,<sup id="cite_ref-pmid4754905_4-0">[https://en.wikipedia.org/wiki/Telomerase#cite_note-pmid4754905-4 [4]]</sup> who also suggested the telomere hypothesis of [https://en.wikipedia.org/wiki/Aging aging] and the telomere's connections to cancer and perhaps some neurodegenerative diseases.<sup id="cite_ref-pmid16247010_5-0">[https://en.wikipedia.org/wiki/Telomerase#cite_note-pmid16247010-5 [5]]</sup> | ||
+ | |||
+ | Telomerase in the ciliate ''[https://en.wikipedia.org/wiki/Tetrahymena Tetrahymena]'' was discovered by [https://en.wikipedia.org/wiki/Carol_W._Greider Carol W. Greider] and [https://en.wikipedia.org/wiki/Elizabeth_Blackburn Elizabeth Blackburn] in 1984.<sup id="cite_ref-pmid3907856_6-0">[https://en.wikipedia.org/wiki/Telomerase#cite_note-pmid3907856-6 [6]]</sup> Together with [https://en.wikipedia.org/wiki/Jack_W._Szostak Jack W. Szostak], Greider and Blackburn were awarded the 2009 [https://en.wikipedia.org/wiki/Nobel_Prize Nobel Prize] in [https://en.wikipedia.org/wiki/Nobel_Prize_in_Physiology_or_Medicine Physiology or Medicine] for their discovery.<sup id="cite_ref-Nobel_Prize_2003_7-0">[https://en.wikipedia.org/wiki/Telomerase#cite_note-Nobel_Prize_2003-7 [7]]</sup> Later the [https://en.wikipedia.org/wiki/Cryogenic_electron_microscopy cryo-EM] structure of telomerase was first reported in ''T. thermophila'', to be followed a few years later by the cryo-EM structure of telomerase in humans.<sup id="cite_ref-pmid31451513_8-0">[https://en.wikipedia.org/wiki/Telomerase#cite_note-pmid31451513-8 [8]]</sup> | ||
+ | |||
+ | The role of telomeres and telomerase in [https://en.wikipedia.org/wiki/Cellular_aging cell aging] and [https://en.wikipedia.org/wiki/Cancer cancer] was established by scientists at [https://en.wikipedia.org/wiki/Biotechnology biotechnology] company [https://en.wikipedia.org/wiki/Geron_Corporation Geron] with the cloning of the [https://en.wikipedia.org/wiki/RNA RNA] and catalytic components of human telomerase<sup id="cite_ref-9">[https://en.wikipedia.org/wiki/Telomerase#cite_note-9 [9]]</sup> and the development of a [https://en.wikipedia.org/wiki/Polymerase_chain_reaction polymerase chain reaction] (PCR) based assay for telomerase activity called the TRAP assay, which surveys telomerase activity in multiple types of cancer.<sup id="cite_ref-10">[https://en.wikipedia.org/wiki/Telomerase#cite_note-10 [10]]</sup> | ||
+ | |||
+ | The [https://en.wikipedia.org/wiki/Negative_stain negative stain] electron microscopy (EM) structures of human and ''Tetrahymena'' telomerases were characterized in 2013.<sup id="cite_ref-11">[https://en.wikipedia.org/wiki/Telomerase#cite_note-11 [11]]</sup><sup id="cite_ref-12">[https://en.wikipedia.org/wiki/Telomerase#cite_note-12 [12]]</sup> Two years later, the first cryo-electron microscopy ([https://en.wikipedia.org/wiki/Cryo-EM cryo-EM]) structure of telomerase holoenzyme (''Tetrahymena'') was determined.<sup id="cite_ref-13">[https://en.wikipedia.org/wiki/Telomerase#cite_note-13 [13]]</sup> In 2018, the structure of human telomerase was determined through cryo-EM by UC Berkeley scientists.<sup id="cite_ref-14">[https://en.wikipedia.org/wiki/Telomerase#cite_note-14 [14]]</sup><br/> <br/> full text link : [https://en.wikipedia.org/wiki/Telomerase https://en.wikipedia.org/wiki/Telomerase] | ||
+ | |||
+ | |||
+ | === DNA replication === | ||
+ | |||
+ | In [https://en.wikipedia.org/wiki/Molecular_biology molecular biology],<sup id="cite_ref-1">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-1 [1]]</sup><sup id="cite_ref-2">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-2 [2]]</sup><sup id="cite_ref-3">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-3 [3]]</sup> '''DNA replication''' is the [https://en.wikipedia.org/wiki/Biological_process biological process] of producing two identical replicas of DNA from one original [https://en.wikipedia.org/wiki/DNA DNA] molecule.<sup id="cite_ref-4">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-4 [4]]</sup> DNA replication occurs in all [https://en.wikipedia.org/wiki/Life living organisms] acting as the most essential part of [https://en.wikipedia.org/wiki/Heredity biological inheritance]. This is essential for cell division during growth and repair of damaged tissues, while it also ensures that each of the new cells receives its own copy of the DNA.<sup id="cite_ref-5">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-5 [5]]</sup> The cell possesses the distinctive property of division, which makes replication of DNA essential. | ||
+ | |||
+ | DNA is made up of a [https://en.wikipedia.org/wiki/Nucleic_acid_double_helix double helix] of two [https://en.wikipedia.org/wiki/Complementary_DNA complementary] [https://en.wikipedia.org/wiki/DNA_strand strands]. The double helix describes the appearance of double-stranded DNA, which is composed of two linear strands that run opposite to each other and twist together to form a helix.<sup id="cite_ref-6">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-6 [6]]</sup> During replication, these strands are separated. Each strand of the original DNA molecule then serves as a template for the production of its counterpart, a process referred to as [https://en.wikipedia.org/wiki/Semiconservative_replication semiconservative replication]. As a result of semi-conservative replication, the new helix will be composed of an original DNA strand as well as a newly synthesized strand.<sup id="cite_ref-7">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-7 [7]]</sup> Cellular [https://en.wikipedia.org/wiki/Proofreading_(Biology) proofreading] and error-checking mechanisms ensure near perfect [https://en.wikipedia.org/wiki/Fidelity fidelity] for DNA replication.<sup id="cite_ref-Berg_8-0">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-Berg-8 [8]]</sup><sup id="cite_ref-Alberts_9-0">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-Alberts-9 [9]]</sup> | ||
+ | |||
+ | In a [https://en.wikipedia.org/wiki/Cell_(biology) cell], DNA replication begins at specific locations, or [https://en.wikipedia.org/wiki/Origin_of_replication origins of replication],<sup id="cite_ref-Hu_352.E2.80.93372_10-0">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-Hu_352–372-10 [10]]</sup> in the [https://en.wikipedia.org/wiki/Genome genome]<sup id="cite_ref-origins_11-0">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-origins-11 [11]]</sup> which contains the genetic material of an organism.<sup id="cite_ref-12">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-12 [12]]</sup> Unwinding of DNA at the origin and synthesis of new strands, accommodated by an [https://en.wikipedia.org/wiki/Enzyme enzyme] known as [https://en.wikipedia.org/wiki/Helicase helicase], results in [https://en.wikipedia.org/wiki/Replication_fork replication forks] growing bi-directionally from the origin. A number of [https://en.wikipedia.org/wiki/Protein proteins] are associated with the replication fork to help in the initiation and continuation of [https://en.wikipedia.org/wiki/DNA_synthesis DNA synthesis]. Most prominently, [https://en.wikipedia.org/wiki/DNA_polymerase DNA polymerase] synthesizes the new strands by adding [https://en.wikipedia.org/wiki/Nucleotide nucleotides] that complement each (template) strand. DNA replication occurs during the S-stage of [https://en.wikipedia.org/wiki/Interphase interphase].<sup id="cite_ref-13">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-13 [13]]</sup> | ||
+ | |||
+ | DNA replication (DNA amplification) can also be performed ''[https://en.wikipedia.org/wiki/In_vitro in vitro]'' (artificially, outside a cell).<sup id="cite_ref-Jarillo-2021_14-0">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-Jarillo-2021-14 [14]]</sup> DNA polymerases isolated from cells and artificial DNA primers can be used to start DNA synthesis at known sequences in a template DNA molecule. [https://en.wikipedia.org/wiki/Polymerase_chain_reaction Polymerase chain reaction] (PCR), [https://en.wikipedia.org/wiki/Ligase_chain_reaction ligase chain reaction] (LCR), and [https://en.wikipedia.org/wiki/Transcription-mediated_amplification transcription-mediated amplification] (TMA) are examples. In March 2021, researchers reported evidence suggesting that a preliminary form of [https://en.wikipedia.org/wiki/Transfer_RNA transfer RNA], a necessary component of [https://en.wikipedia.org/wiki/Translation_(biology) translation], the biological synthesis of new [https://en.wikipedia.org/wiki/Protein proteins] in accordance with the [https://en.wikipedia.org/wiki/Genetic_code genetic code], could have been a replicator molecule itself in the very early development of life, or [https://en.wikipedia.org/wiki/Abiogenesis abiogenesis].<sup id="cite_ref-EL-20210302_15-0">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-EL-20210302-15 [15]]</sup><sup id="cite_ref-STD-20210403_16-0">[16]</sup><br/> | ||
+ | |||
+ | ==== DNA Structure ==== | ||
+ | |||
+ | DNA exists as a double-stranded structure, with both strands coiled together to form the characteristic [https://en.wikipedia.org/wiki/Double_helix double helix]. Each single strand of DNA is a chain of four types of [https://en.wikipedia.org/wiki/Nucleotide nucleotides]. Nucleotides in DNA contain a [https://en.wikipedia.org/wiki/Deoxyribose deoxyribose] sugar, a [https://en.wikipedia.org/wiki/Phosphate phosphate], and a [https://en.wikipedia.org/wiki/Nucleobase nucleobase]. The four types of [https://en.wikipedia.org/wiki/Nucleotide nucleotide] correspond to the four [https://en.wikipedia.org/wiki/Nucleobase nucleobases] [https://en.wikipedia.org/wiki/Adenine adenine], [https://en.wikipedia.org/wiki/Cytosine cytosine], [https://en.wikipedia.org/wiki/Guanine guanine], and [https://en.wikipedia.org/wiki/Thymine thymine], commonly abbreviated as A, C, G, and T. Adenine and guanine are [https://en.wikipedia.org/wiki/Purine purine]<sup id="cite_ref-17">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-17 [17]]</sup> bases, while cytosine and thymine are [https://en.wikipedia.org/wiki/Pyrimidine pyrimidines]. These nucleotides form [https://en.wikipedia.org/wiki/Phosphodiester_bonds phosphodiester bonds], creating the phosphate-deoxyribose backbone of the DNA double helix with the nucleobases pointing inward (i.e., toward the opposing strand). Nucleobases are matched between strands through [https://en.wikipedia.org/wiki/Hydrogen_bonding hydrogen bonds] to form [https://en.wikipedia.org/wiki/Base_pair base pairs]. Adenine pairs with thymine (two hydrogen bonds), and guanine pairs with cytosine (three [https://en.wikipedia.org/wiki/Hydrogen_bonds hydrogen bonds]).<sup id="cite_ref-18">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-18 [18]]</sup> | ||
+ | |||
+ | [https://en.wikipedia.org/wiki/Directionality_(molecular_biology) DNA strands have a directionality], and the different ends of a single strand are called the "3′ (three-prime) end" and the "5′ (five-prime) end". By convention, if the base sequence of a single strand of DNA is given, the left end of the sequence is the 5′ end, while the right end of the sequence is the 3′ end. The strands of the double helix are anti-parallel, with one being 5′ to 3′, and the opposite strand 3′ to 5′. These terms refer to the carbon atom in deoxyribose to which the next phosphate in the chain attaches. Directionality has consequences in DNA synthesis, because DNA polymerase can synthesize DNA in only one direction by adding nucleotides to the 3′ end of a DNA strand. | ||
+ | |||
+ | The pairing of complementary bases in DNA (through [https://en.wikipedia.org/wiki/Hydrogen_bonding hydrogen bonding]) means that the information contained within each strand is redundant. Phosphodiester (intra-strand) bonds are stronger than hydrogen (inter-strand) bonds. Within a strand, phosphodiester bonds connect the 5' carbon atom of one nucleotide to the 3' carbon atom of another nucleotide, while the hydrogen bonds stabilize DNA double helices across the helix axis but not in the direction of the axis.<sup id="cite_ref-19">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-19 [19]]</sup> This makes it possible to separate the strands from one another. The nucleotides on a single strand can therefore be used to reconstruct nucleotides on a newly synthesized partner strand.<sup id="cite_ref-20">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-20 [20]]</sup><br/> | ||
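The redundancy between strands can be shown with a short sketch that rebuilds the partner strand from one strand and counts the hydrogen bonds holding the pair together. The function and dictionary names are hypothetical; the pairing rules and bond counts (two for A–T, three for G–C) and the 5′-to-3′ writing convention are the ones stated above.

```python
DNA_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}
# Hydrogen bonds formed by each base with its partner: A-T pairs 2, G-C pairs 3.
HYDROGEN_BONDS = {"A": 2, "T": 2, "G": 3, "C": 3}


def partner_strand(strand_5_to_3):
    """Return the complementary strand, also written 5'->3'.

    The strands are anti-parallel, so the complement is read in reverse order.
    """
    return "".join(DNA_COMPLEMENT[base] for base in reversed(strand_5_to_3))


def hydrogen_bond_count(strand):
    """Total hydrogen bonds between this strand and its fully paired partner."""
    return sum(HYDROGEN_BONDS[base] for base in strand)


print(partner_strand("ATGC"))       # GCAT
print(hydrogen_bond_count("ATGC"))  # 10
```

Applying `partner_strand` twice returns the original sequence, which is the sense in which each strand carries enough information to reconstruct the other.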
+ | |||
+ | |||
+ | ==== DNA polymerase ==== | ||
+ | |||
+ | [https://en.wikipedia.org/wiki/DNA_polymerase DNA polymerases] are a family of [https://en.wikipedia.org/wiki/Enzyme enzymes] that carry out all forms of DNA replication.<sup id="cite_ref-22">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-22 [22]]</sup> DNA polymerases in general cannot initiate synthesis of new strands but can only extend an existing DNA or RNA strand paired with a template strand. To begin synthesis, a short fragment of RNA, called a [https://en.wikipedia.org/wiki/Primer_(molecular_biology) primer], must be created and paired with the template DNA strand. | ||
+ | |||
+ | DNA polymerase adds a new strand of DNA by extending the 3′ end of an existing nucleotide chain, adding new [https://en.wikipedia.org/wiki/Nucleotide nucleotides] matched to the template strand, one at a time, via the creation of [https://en.wikipedia.org/wiki/Phosphodiester_bond phosphodiester bonds]. The energy for this process of DNA polymerization comes from hydrolysis of the [https://en.wikipedia.org/wiki/High-energy_phosphate high-energy phosphate] (phosphoanhydride) bonds between the three phosphates attached to each unincorporated [https://en.wikipedia.org/wiki/Nucleotide base]. Free bases with their attached phosphate groups are called [https://en.wikipedia.org/wiki/Nucleotide nucleotides]; in particular, bases with three attached phosphate groups are called [https://en.wikipedia.org/wiki/Nucleoside_triphosphate nucleoside triphosphates]. When a nucleotide is being added to a growing DNA strand, the formation of a phosphodiester bond between the proximal phosphate of the nucleotide to the growing chain is accompanied by hydrolysis of a high-energy phosphate bond with release of the two distal phosphate groups as a [https://en.wikipedia.org/wiki/Pyrophosphate pyrophosphate]. Enzymatic hydrolysis of the resulting [https://en.wikipedia.org/wiki/Pyrophosphate pyrophosphate] into inorganic phosphate consumes a second high-energy phosphate bond and renders the reaction effectively irreversible.<sup id="cite_ref-23">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-23 [Note 1]]</sup> | ||
+ | |||
+ | In general, DNA polymerases are highly accurate, with an intrinsic error rate of less than one mistake for every 10<sup>7</sup> nucleotides added.<sup id="cite_ref-pmid18166979_24-0">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-pmid18166979-24 [23]]</sup> Some DNA polymerases can also delete nucleotides from the end of a developing strand in order to fix mismatched bases. This is known as proofreading. Finally, post-replication mismatch repair mechanisms monitor the DNA for errors, being capable of distinguishing mismatches in the newly synthesized DNA strand from the original strand sequence. Together, these three discrimination steps enable replication fidelity of less than one mistake for every 10<sup>9</sup> nucleotides added.<sup id="cite_ref-pmid18166979_24-1">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-pmid18166979-24 [23]]</sup> | ||
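The practical effect of the two fidelity figures can be estimated with simple arithmetic. This is a rough sketch: the rates of one mistake per 10<sup>7</sup> and per 10<sup>9</sup> nucleotides are the upper bounds given above, while the genome size of 3.2 billion nucleotides and the function name are assumptions made for illustration.

```python
def expected_errors(nucleotides_copied, nucleotides_per_error):
    """Upper-bound estimate of mistakes when copying the given number of nucleotides."""
    return nucleotides_copied / nucleotides_per_error


# Assumed genome size for illustration only (not stated in the source).
GENOME_SIZE = 3_200_000_000

polymerase_only = expected_errors(GENOME_SIZE, 10**7)  # intrinsic polymerase accuracy
all_three_steps = expected_errors(GENOME_SIZE, 10**9)  # with proofreading and mismatch repair
print(polymerase_only, all_three_steps)
```

With the assumed genome size, the combined mechanisms reduce the upper bound from hundreds of mistakes per copy to a handful, a hundredfold improvement that follows directly from the ratio of the two rates.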
+ | |||
+ | The rate of DNA replication in a living cell was first measured as the rate of phage T4 DNA elongation in phage-infected ''E. coli''.<sup id="cite_ref-25">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-25 [24]]</sup> During the period of exponential DNA increase at 37 °C, the rate was 749 nucleotides per second. The mutation rate per base pair per replication during phage T4 DNA synthesis is 1.7 per 10<sup>8</sup>.<sup id="cite_ref-26">[https://en.wikipedia.org/wiki/DNA_replication#cite_note-26 [25]]</sup><br/> | ||
+ | |||
+ | |||
+ | === Transposase === | ||
+ | |||
+ | A transposase is an enzyme that binds to the end of a transposon and catalyzes its movement to another part of the genome, typically by a cut-and-paste mechanism or a replicative mechanism, in a process known as transposition. The word "transposase" was first coined by the individuals who cloned the enzyme required for transposition of the [https://en.wikipedia.org/wiki/Tn3_Transposon Tn3 transposon].<sup id="cite_ref-1">[https://en.wikipedia.org/wiki/Transposase#cite_note-1 [1]]</sup> The existence of transposons was postulated in the late 1940s by [https://en.wikipedia.org/wiki/Barbara_McClintock Barbara McClintock], who was studying the inheritance of [https://en.wikipedia.org/wiki/Maize maize], but the actual molecular basis for transposition was described by later groups. McClintock discovered that some segments of [https://en.wikipedia.org/wiki/Chromosome chromosomes] changed their position, jumping between different loci or from one chromosome to another. The repositioning of these transposons (which coded for color) allowed other genes for pigment to be expressed.<sup id="cite_ref-website1_2-0">[https://en.wikipedia.org/wiki/Transposase#cite_note-website1-2 [2]]</sup> Transposition in maize causes changes in color; however, in other organisms, such as bacteria, it can cause [https://en.wikipedia.org/wiki/Antimicrobial_resistance antibiotic resistance].<sup id="cite_ref-website1_2-1">[https://en.wikipedia.org/wiki/Transposase#cite_note-website1-2 [2]]</sup> Transposition is also important in creating genetic diversity within species and generating adaptability to changing living conditions.<sup id="cite_ref-MM_3-0">[https://en.wikipedia.org/wiki/Transposase#cite_note-MM-3 [3]]</sup> | ||
+ | |||
+ | Transposases are classified under [https://en.wikipedia.org/wiki/Enzyme_Commission_number EC number] EC 2.7.7. Genes encoding transposases are widespread in the genomes of most organisms and are the most abundant genes known.<sup id="cite_ref-4">[https://en.wikipedia.org/wiki/Transposase#cite_note-4 [4]]</sup> During the course of human evolution, as much as 40% of the human genome has moved around via methods such as transposition of transposons.<sup id="cite_ref-website1_2-2">[https://en.wikipedia.org/wiki/Transposase#cite_note-website1-2 [2]]</sup><br/> | ||
+ | |||
+ | ==== Transposase Tn5 ==== | ||
+ | |||
+ | Transposase (Tnp) Tn5 is a member of the [https://en.wikipedia.org/wiki/Ribonuclease RNase] superfamily of proteins which includes retroviral [https://en.wikipedia.org/wiki/Integrase integrases]. Tn5 can be found in ''[https://en.wikipedia.org/wiki/Shewanella Shewanella]'' and ''[https://en.wikipedia.org/wiki/Escherichia Escherichia]'' bacteria.<sup id="cite_ref-Website2_5-0">[https://en.wikipedia.org/wiki/Transposase#cite_note-Website2-5 [5]]</sup> The transposon codes for antibiotic resistance to [https://en.wikipedia.org/wiki/Kanamycin kanamycin] and other aminoglycoside antibiotics.<sup id="cite_ref-MM_3-1">[https://en.wikipedia.org/wiki/Transposase#cite_note-MM-3 [3]]</sup><sup id="cite_ref-N1_6-0">[https://en.wikipedia.org/wiki/Transposase#cite_note-N1-6 [6]]</sup> | ||
+ | |||
+ | Tn5 and other transposases are notably inactive. Because DNA transposition events are inherently mutagenic, the low activity of transposases is necessary to reduce the risk of causing a fatal mutation in the host, and thus eliminating the [https://en.wikipedia.org/wiki/Transposable_element transposable element]. One of the reasons Tn5 is so unreactive is because the N- and C-termini are located in relatively close proximity to one another and tend to inhibit each other. This was elucidated by the characterization of several mutations which resulted in hyperactive forms of transposases. One such mutation, L372P, is a mutation of amino acid 372 in the Tn5 transposase. This amino acid is generally a leucine residue in the middle of an alpha helix. When this leucine is replaced with a proline residue the alpha helix is broken, introducing a conformational change to the C-terminal domain, separating it from the N-terminal domain enough to promote higher activity of the protein.<sup id="cite_ref-MM_3-2">[https://en.wikipedia.org/wiki/Transposase#cite_note-MM-3 [3]]</sup> The transposition of a transposon often needs only three pieces: the transposon, the transposase enzyme, and the target DNA for the insertion of the transposon.<sup id="cite_ref-MM_3-3">[https://en.wikipedia.org/wiki/Transposase#cite_note-MM-3 [3]]</sup> This is the case with Tn5, which uses a cut-and-paste mechanism for moving around transposons.<sup id="cite_ref-MM_3-4">[https://en.wikipedia.org/wiki/Transposase#cite_note-MM-3 [3]]</sup> | ||
+ | |||
+ | Tn5 and most other transposases contain a DDE motif, which is the active site that catalyzes the movement of the transposon. Aspartate-97, aspartate-188, and glutamate-326 make up the active site, which is a triad of acidic residues.<sup id="cite_ref-BC_7-0">[https://en.wikipedia.org/wiki/Transposase#cite_note-BC-7 [7]]</sup> The DDE motif is said to coordinate divalent metal ions, most often magnesium and manganese, which are important in the catalytic reaction.<sup id="cite_ref-BC_7-1">[https://en.wikipedia.org/wiki/Transposase#cite_note-BC-7 [7]]</sup> Because transposase is incredibly inactive, the DDE region is mutated so that the transposase becomes hyperactive and catalyzes the movement of the transposon.<sup id="cite_ref-BC_7-2">[https://en.wikipedia.org/wiki/Transposase#cite_note-BC-7 [7]]</sup> The glutamate is transformed into an aspartate and the two aspartates into glutamates.<sup id="cite_ref-BC_7-3">[https://en.wikipedia.org/wiki/Transposase#cite_note-BC-7 [7]]</sup> Through this mutation, the study of Tn5 becomes possible, but some steps in the catalytic process are lost as a result.<sup id="cite_ref-MM_3-5">[https://en.wikipedia.org/wiki/Transposase#cite_note-MM-3 [3]]</sup> | ||
+ | |||
+ | <br/> <br/> <br/> <br/> [https://biolecture.org/Main_Page Main Page] » [https://biolecture.org/UNIST_Geromics_course UNIST Geromics course] » [https://biolecture.org/Geromics_Course_Students_Folder_2024 Geromics Course Students Folder 2024] » [https://biolecture.org/HyoungJinChoi_2024_Geromics_Course HyoungJinChoi 2024 Geromics Course] » [https://biolecture.org/Summary_class_Geromics_2024_HyoungJinChoi Summary class Geromics 2024 HyoungJinCho] |
Latest revision as of 12:58, 10 May 2024
2024.03.06
orientation Geromics
2024.03.08
What is theory?
A theory is a rational type of abstract thinking about a phenomenon, or the results of such thinking. The process of contemplative and rational thinking is often associated with such processes as observational study or research. Theories may be scientific, belong to a non-scientific discipline, or no discipline at all. Depending on the context, a theory's assertions might, for example, include generalized explanations of how nature works. The word has its roots in ancient Greek, but in modern use it has taken on several related meanings.
In modern science, the term "theory" refers to scientific theories, a well-confirmed type of explanation of nature, made in a way consistent with the scientific method, and fulfilling the criteria required by modern science. Such theories are described in such a way that scientific tests should be able to provide empirical support for it, or empirical contradiction ("falsify") of it. Scientific theories are the most reliable, rigorous, and comprehensive form of scientific knowledge,[1] in contrast to more common uses of the word "theory" that imply that something is unproven or speculative (which in formal terms is better characterized by the word hypothesis).[2] Scientific theories are distinguished from hypotheses, which are individual empirically testable conjectures, and from scientific laws, which are descriptive accounts of the way nature behaves under certain conditions.
Theories guide the enterprise of finding facts rather than of reaching goals, and are neutral concerning alternatives among values.[3]: 131 A theory can be a body of knowledge, which may or may not be associated with particular explanatory models. To theorize is to develop this body of knowledge.[4]: 46
The word theory or "in theory" is sometimes used outside of science to refer to something which the speaker did not experience or test before.[5] In science, this same concept is referred to as a hypothesis, and the word "hypothetically" is used both inside and outside of science. In its usage outside of science, the word "theory" is very often contrasted to "practice" (from Greek praxis, πρᾶξις) a Greek term for doing, which is opposed to theory.[6] A "classical example" of the distinction between "theoretical" and "practical" uses the discipline of medicine: medical theory involves trying to understand the causes and nature of health and sickness, while the practical side of medicine is trying to make people healthy. These two things are related but can be independent, because it is possible to research health and sickness without curing specific patients, and it is possible to cure a patient without knowing how the cure worked.[a]
full text link : https://en.wikipedia.org/wiki/Theory
2024.03.22
--
Prepare class
Before you attend this week's lecture, I would like to encourage you to watch the following YouTube video:
- Title: Mitochondrial Regulation of Stem Cell Aging
- Presenter: Danica Chen, PhD (University of California, Berkeley, USA)
- YouTube Link: https://www.youtube.com/watch?v=FoJWmaT1ptM
In this video, Professor Danica Chen discusses how sirtuins can protect mitochondria and reverse stem cell aging.
It's an insightful presentation that will undoubtedly enrich our understanding of the topic before our lecture.
--
Mitochondrial Stress is a Driver of Stem Cell Aging
- Mitochondrial stress increases in stem cells during aging
- Mitochondrial dysfunction and aging produce similar defects in stem cells
- Stem cells do not age at the same rate; about one third of chronologically aged HSCs exhibit regenerative function similar to healthy young HSCs, coinciding with the health of mitochondria.
Stem cell
In multicellular organisms, stem cells are undifferentiated or partially differentiated cells that can change into various types of cells and proliferate indefinitely to produce more of the same stem cell. They are the earliest type of cell in a cell lineage.[1] They are found in both embryonic and adult organisms, but they have slightly different properties in each. They are usually distinguished from progenitor cells, which cannot divide indefinitely, and precursor or blast cells, which are usually committed to differentiating into one cell type.
In mammals, roughly 50 to 150 cells make up the inner cell mass during the blastocyst stage of embryonic development, around days 5–14. These have stem-cell capability. In vivo, they eventually differentiate into all of the body's cell types (making them pluripotent). This process starts with the differentiation into the three germ layers – the ectoderm, mesoderm and endoderm – at the gastrulation stage. However, when they are isolated and cultured in vitro, they can be kept in the stem-cell stage and are known as embryonic stem cells (ESCs).
Adult stem cells are found in a few select locations in the body, known as niches, such as those in the bone marrow or gonads. They exist to replenish rapidly lost cell types and are multipotent or unipotent, meaning they only differentiate into a few cell types or one type of cell. In mammals, they include, among others, hematopoietic stem cells, which replenish blood and immune cells, basal cells, which maintain the skin epithelium, and mesenchymal stem cells, which maintain bone, cartilage, muscle and fat cells. Adult stem cells are a small minority of cells; they are vastly outnumbered by the progenitor cells and terminally differentiated cells that they differentiate into.[1]
Research into stem cells grew out of findings by Canadian biologists Ernest McCulloch, James Till and Andrew J. Becker at the University of Toronto and the Ontario Cancer Institute in the 1960s.[2][3] As of 2016, the only established medical therapy using stem cells is hematopoietic stem cell transplantation,[4] first performed in 1958 by French oncologist Georges Mathé. Since 1998 however, it has been possible to culture and differentiate human embryonic stem cells (in stem-cell lines). The process of isolating these cells has been controversial, because it typically results in the destruction of the embryo. Sources for isolating ESCs have been restricted in some European countries and Canada, but others such as the UK and China have promoted the research.[5] Somatic cell nuclear transfer is a cloning method that can be used to create a cloned embryo for the use of its embryonic stem cells in stem cell therapy.[6] In 2006, a Japanese team led by Shinya Yamanaka discovered a method to convert mature body cells back into stem cells. These were termed induced pluripotent stem cells (iPSCs).[7]
full text link : https://en.wikipedia.org/wiki/Stem_cell
How does the total amount of stem cells in humans change over time?
At what age does it reach its maximum and minimum?
Two anchor points: (1) fertilization, the start of life; (2) 120 years.
If some data points are plotted, it seems feasible to fit a curve through them with statistical methods (taking the number of inflection points into account).
Are there any papers related to the number of stem cells at various ages in a particular sample?
> No results found in the initial search (only 14 minutes spent).
2024.03.29
Occupations with high life expectancy?
full text link : https://www.hani.co.kr/arti/society/rights/471412.html
2024.04.05
DNA
Deoxyribonucleic acid (/diːˈɒksɪˌraɪboʊnjuːˌkliːɪk, -ˌkleɪ-/;[1] DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of all known organisms and many viruses. DNA and ribonucleic acid (RNA) are nucleic acids. Alongside proteins, lipids and complex carbohydrates (polysaccharides), nucleic acids are one of the four major types of macromolecules that are essential for all known forms of life.
The two DNA strands are known as polynucleotides as they are composed of simpler monomeric units called nucleotides.[2][3] Each nucleotide is composed of one of four nitrogen-containing nucleobases (cytosine [C], guanine [G], adenine [A] or thymine [T]), a sugar called deoxyribose, and a phosphate group. The nucleotides are joined to one another in a chain by covalent bonds (known as the phosphodiester linkage) between the sugar of one nucleotide and the phosphate of the next, resulting in an alternating sugar-phosphate backbone. The nitrogenous bases of the two separate polynucleotide strands are bound together, according to base pairing rules (A with T and C with G), with hydrogen bonds to make double-stranded DNA. The complementary nitrogenous bases are divided into two groups, the single-ringed pyrimidines and the double-ringed purines. In DNA, the pyrimidines are thymine and cytosine; the purines are adenine and guanine.
Both strands of double-stranded DNA store the same biological information. This information is replicated when the two strands separate. A large part of DNA (more than 98% for humans) is non-coding, meaning that these sections do not serve as patterns for protein sequences. The two strands of DNA run in opposite directions to each other and are thus antiparallel. Attached to each sugar is one of four types of nucleobases (or bases). It is the sequence of these four nucleobases along the backbone that encodes genetic information. RNA strands are created using DNA strands as a template in a process called transcription, where DNA bases are exchanged for their corresponding bases except in the case of thymine (T), for which RNA substitutes uracil (U).[4] Under the genetic code, these RNA strands specify the sequence of amino acids within proteins in a process called translation.
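The base-pairing and transcription rules described above can be sketched in a few lines of Python. This is only an illustration: the function names and the six-base example sequence are assumptions made for this sketch, not something taken from the lecture or from the quoted article.

```python
# Watson-Crick base pairing in DNA: A pairs with T, C pairs with G.
DNA_COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(dna: str) -> str:
    """Return the antiparallel partner strand, written 5'->3'."""
    return "".join(DNA_COMPLEMENT[base] for base in reversed(dna.upper()))


def transcribe(template: str) -> str:
    """Return the RNA (5'->3') made from a DNA template strand (given 5'->3').

    The RNA is complementary to the template, with uracil (U) where DNA
    would have thymine (T).
    """
    return reverse_complement(template).replace("T", "U")


coding_strand = "ATGGCC"  # hypothetical example sequence
template_strand = reverse_complement(coding_strand)
rna = transcribe(template_strand)

print(template_strand)
print(rna)
```

In this sketch the RNA has the same sequence as the coding strand, except that U replaces T, which is why the two DNA strands can be said to carry the same information.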
Within eukaryotic cells, DNA is organized into long structures called chromosomes. Before typical cell division, these chromosomes are duplicated in the process of DNA replication, providing a complete set of chromosomes for each daughter cell. Eukaryotic organisms (animals, plants, fungi and protists) store most of their DNA inside the cell nucleus as nuclear DNA, and some in the mitochondria as mitochondrial DNA or in chloroplasts as chloroplast DNA.[5] In contrast, prokaryotes (bacteria and archaea) store their DNA only in the cytoplasm, in circular chromosomes. Within eukaryotic chromosomes, chromatin proteins, such as histones, compact and organize DNA. These compacting structures guide the interactions between DNA and other proteins, helping control which parts of the DNA are transcribed.
full text link : https://en.wikipedia.org/wiki/DNA
RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyribonucleic acid (DNA) are nucleic acids. The nucleic acids constitute one of the four major macromolecules essential for all known forms of life. RNA is assembled as a chain of nucleotides. Cellular organisms use messenger RNA (mRNA) to convey genetic information (using the nitrogenous bases of guanine, uracil, adenine, and cytosine, denoted by the letters G, U, A, and C) that directs synthesis of specific proteins. Many viruses encode their genetic information using an RNA genome.
Some RNA molecules play an active role within cells by catalyzing biological reactions, controlling gene expression, or sensing and communicating responses to cellular signals. One of these active processes is protein synthesis, a universal function in which RNA molecules direct the synthesis of proteins on ribosomes. This process uses transfer RNA (tRNA) molecules to deliver amino acids to the ribosome, where ribosomal RNA (rRNA) then links amino acids together to form coded proteins.
It has become widely accepted in science[1] that early in the history of life on Earth, prior to the evolution of DNA and possibly of protein-based enzymes as well, an "RNA world" existed in which RNA served as both living organisms' storage method for genetic information—a role fulfilled today by DNA, except in the case of RNA viruses—and potentially performed catalytic functions in cells—a function performed today by protein enzymes, with the notable and important exception of the ribosome, which is a ribozyme.
Full text link : https://en.wikipedia.org/wiki/RNA
eQTL
Distant and local, trans- and cis-eQTLs, respectively
An expression quantitative trait is an amount of an mRNA transcript or a protein. These are usually the product of a single gene with a specific chromosomal location. This distinguishes expression quantitative traits from most complex traits, which are not the product of the expression of a single gene. Chromosomal loci that explain variance in expression traits are called eQTLs. eQTLs located near the gene-of-origin (gene which produces the transcript or protein) are referred to as local eQTLs or cis-eQTLs. By contrast, those located distant from their gene of origin, often on different chromosomes, are referred to as distant eQTLs or trans-eQTLs.[3] [4] The first genome-wide study of gene expression was carried out in yeast and published in 2002.[5] The initial wave of eQTL studies employed microarrays to measure genome-wide gene expression; more recent studies have employed massively parallel RNA sequencing. Many expression QTL studies were performed in plants and animals, including humans,[6] non-human primates[7][8] and mice.[9]
Some cis eQTLs are detected in many tissue types but the majority of trans eQTLs are tissue-dependent (dynamic).[10] eQTLs may act in cis (locally) or trans (at a distance) to a gene.[11] The abundance of a gene transcript is directly modified by polymorphism in regulatory elements. Consequently, transcript abundance might be considered as a quantitative trait that can be mapped with considerable power. These have been named expression QTLs (eQTLs).[12] The combination of whole-genome genetic association studies and the measurement of global gene expression allows the systematic identification of eQTLs. By assaying gene expression and genetic variation simultaneously on a genome-wide basis in a large number of individuals, statistical genetic methods can be used to map the genetic factors that underpin individual differences in quantitative levels of expression of many thousands of transcripts.[13] Studies have shown that single nucleotide polymorphisms (SNPs) reproducibly associated with complex disorders [14] as well as certain pharmacologic phenotypes [15] are found to be significantly enriched for eQTLs, relative to frequency-matched control SNPs. The integration of eQTLs with GWAS has led to development of the transcriptome-wide association study (TWAS) methodology.[16][17]
Detecting eQTLs
Mapping eQTLs is done using standard QTL mapping methods that test the linkage between variation in expression and genetic polymorphisms. The only considerable difference is that eQTL studies can involve a million or more expression microtraits. Standard gene mapping software packages can be used, although it is often faster to use custom code such as QTL Reaper or the web-based eQTL mapping system GeneNetwork. GeneNetwork hosts many large eQTL mapping data sets and provides access to fast algorithms to map single loci and epistatic interactions. As is true in all QTL mapping studies, the final steps in defining DNA variants that cause variation in traits are usually difficult and require a second round of experimentation. This is especially the case for trans eQTLs that do not benefit from the strong prior probability that relevant variants are in the immediate vicinity of the parent gene. Statistical, graphical, and bioinformatic methods are used to evaluate positional candidate genes and entire systems of interactions.[18][19] The development of single cell technologies, and parallel advances in statistical methods has made it possible to define even subtle changes in eQTLs as cell-states change.[20][21]
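To make the idea of testing association between a variant and expression concrete, here is a minimal sketch that regresses expression on genotype dosage (0, 1 or 2 copies of an allele) for a single SNP-gene pair. The data are made up and the function name is hypothetical; real eQTL studies add covariates, significance testing and multiple-testing correction, and use dedicated tools such as the ones named above.

```python
from statistics import mean


def eqtl_effect(genotypes, expression):
    """Least-squares slope and Pearson r of expression on genotype dosage."""
    g_bar, e_bar = mean(genotypes), mean(expression)
    sxy = sum((g - g_bar) * (e - e_bar) for g, e in zip(genotypes, expression))
    sxx = sum((g - g_bar) ** 2 for g in genotypes)
    syy = sum((e - e_bar) ** 2 for e in expression)
    slope = sxy / sxx               # expression change per extra allele copy
    r = sxy / (sxx * syy) ** 0.5    # strength of the linear association
    return slope, r


# Made-up toy data: six individuals, one SNP, one transcript.
genotypes = [0, 0, 1, 1, 2, 2]
expression = [1.0, 1.2, 2.1, 1.9, 3.0, 3.2]

slope, r = eqtl_effect(genotypes, expression)
print(f"slope per allele: {slope:.2f}, Pearson r: {r:.3f}")
```

A genome-wide scan would repeat this kind of test for very many SNP-transcript pairs, which is why fast specialized implementations matter.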
Full text link : https://en.wikipedia.org/wiki/Expression_quantitative_trait_loci
2024.04.12
Proteomics
Proteomics is the large-scale study of proteins.[1][2] Proteins are vital parts of living organisms, with many functions such as the formation of structural fibers of muscle tissue, enzymatic digestion of food, or synthesis and replication of DNA. In addition, other kinds of proteins include antibodies that protect an organism from infection, and hormones that send important signals throughout the body.
The proteome is the entire set of proteins produced or modified by an organism or system. Proteomics enables the identification of ever-increasing numbers of proteins. The proteome varies with time and with the distinct requirements, or stresses, that a cell or organism undergoes.[3]
Proteomics is an interdisciplinary domain that has benefited greatly from the genetic information of various genome projects, including the Human Genome Project.[4] It covers the exploration of proteomes from the overall level of protein composition, structure, and activity, and is an important component of functional genomics.
Proteomics generally denotes the large-scale experimental analysis of proteins and proteomes, but often refers specifically to protein purification and mass spectrometry. Indeed, mass spectrometry is the most powerful method for analysis of proteomes, both in large samples composed of millions of cells[5] and in single cells.[6][7]
Full text link : https://en.wikipedia.org/wiki/Proteomics
Omics
The branches of science known informally as omics are various disciplines in biology whose names end in the suffix -omics, such as genomics, proteomics, metabolomics, metagenomics, phenomics and transcriptomics. Omics aims at the collective characterization and quantification of pools of biological molecules that translate into the structure, function, and dynamics of an organism or organisms.[1]
The related suffix -ome is used to address the objects of study of such fields, such as the genome, proteome or metabolome respectively. The suffix -ome as used in molecular biology refers to a totality of some sort; it is an example of a "neo-suffix" formed by abstraction from various Greek terms in -ωμα, a sequence that does not form an identifiable suffix in Greek.
Functional genomics aims at identifying the functions of as many genes as possible of a given organism. It combines different -omics techniques such as transcriptomics and proteomics with saturated mutant collections.[2]
Full text link : https://en.wikipedia.org/wiki/Omics
-ology
An ology or -logy is a scientific discipline.
Protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, responding to stimuli, providing structure to cells and organisms, and transporting molecules from one location to another. Proteins differ from one another primarily in their sequence of amino acids, which is dictated by the nucleotide sequence of their genes, and which usually results in protein folding into a specific 3D structure that determines its activity.
A linear chain of amino acid residues is called a polypeptide. A protein contains at least one long polypeptide. Short polypeptides, containing fewer than 20–30 residues, are rarely considered to be proteins and are commonly called peptides. Adjacent amino acid residues are bonded together by peptide bonds. The sequence of amino acid residues in a protein is defined by the sequence of a gene, which is encoded in the genetic code. In general, the genetic code specifies 20 standard amino acids; but in certain organisms the genetic code can include selenocysteine and, in certain archaea, pyrrolysine. Shortly after or even during synthesis, the residues in a protein are often chemically modified by post-translational modification, which alters the physical and chemical properties, folding, stability, activity, and ultimately, the function of the proteins. Some proteins have non-peptide groups attached, which can be called prosthetic groups or cofactors. Proteins can also work together to achieve a particular function, and they often associate to form stable protein complexes.
Once formed, proteins only exist for a certain period and are then degraded and recycled by the cell's machinery through the process of protein turnover. A protein's lifespan is measured in terms of its half-life and covers a wide range. They can exist for minutes or years with an average lifespan of 1–2 days in mammalian cells. Abnormal or misfolded proteins are degraded more rapidly either due to being targeted for destruction or due to being unstable.
Like other biological macromolecules such as polysaccharides and nucleic acids, proteins are essential parts of organisms and participate in virtually every process within cells. Many proteins are enzymes that catalyse biochemical reactions and are vital to metabolism. Proteins also have structural or mechanical functions, such as actin and myosin in muscle and the proteins in the cytoskeleton, which form a system of scaffolding that maintains cell shape. Other proteins are important in cell signaling, immune responses, cell adhesion, and the cell cycle. In animals, proteins are needed in the diet to provide the essential amino acids that cannot be synthesized. Digestion breaks the proteins down for metabolic use.
Proteins may be purified from other cellular components using a variety of techniques such as ultracentrifugation, precipitation, electrophoresis, and chromatography; the advent of genetic engineering has made possible a number of methods to facilitate purification. Methods commonly used to study protein structure and function include immunohistochemistry, site-directed mutagenesis, X-ray crystallography, nuclear magnetic resonance and mass spectrometry.
PPI (Protein-Protein interaction)
Protein–protein interactions (PPIs) are physical contacts of high specificity established between two or more protein molecules as a result of biochemical events steered by interactions that include electrostatic forces, hydrogen bonding and the hydrophobic effect. Many are physical contacts with molecular associations between chains that occur in a cell or in a living organism in a specific biomolecular context.
Proteins rarely act alone as their functions tend to be regulated. Many molecular processes within a cell are carried out by molecular machines that are built from numerous protein components organized by their PPIs. These physiological interactions make up the so-called interactome of the organism, while aberrant PPIs are the basis of multiple aggregation-related diseases, such as Creutzfeldt–Jakob and Alzheimer's diseases.
PPIs have been studied with many methods and from different perspectives: biochemistry, quantum chemistry, molecular dynamics, signal transduction, among others.[1][2][3] All this information enables the creation of large protein interaction networks[4] – similar to metabolic or genetic/epigenetic networks – that empower the current knowledge on biochemical cascades and molecular etiology of disease, as well as the discovery of putative protein targets of therapeutic interest.
full text link : https://en.wikipedia.org/wiki/Protein%E2%80%93protein_interaction
String
in computer science
In computer programming, a string is traditionally a sequence of characters, either as a literal constant or as some kind of variable. The latter may allow its elements to be mutated and the length changed, or it may be fixed (after creation). A string is generally considered as a data type and is often implemented as an array data structure of bytes (or words) that stores a sequence of elements, typically characters, using some character encoding. String may also denote more general arrays or other sequence (or list) data types and structures.
Depending on the programming language and precise data type used, a variable declared to be a string may either cause storage in memory to be statically allocated for a predetermined maximum length or employ dynamic allocation to allow it to hold a variable number of elements.
When a string appears literally in source code, it is known as a string literal or an anonymous string.[1]
In formal languages, which are used in mathematical logic and theoretical computer science, a string is a finite sequence of symbols that are chosen from a set called an alphabet.
full text link : https://en.wikipedia.org/wiki/String_(computer_science)
in structure
String is a long flexible structure made from fibers twisted together into a single strand, or from multiple such strands which are in turn twisted together. String is used to tie, bind, or hang other objects. It is also used as a material to make things, such as textiles, and in arts and crafts. String is a simple tool, and its use by humans is known to have been developed tens of thousands of years ago.[1] In Mesoamerica, for example, string was invented some 20,000 to 30,000 years ago, and was made by twisting plant fibers together.[1] String may also be a component in other tools, and in devices as diverse as weapons, musical instruments, and toys.
full text link : https://en.wikipedia.org/wiki/String_(structure)
2024.04.19
P-value
In null-hypothesis significance testing, the 𝑝-value[note 1] is the probability of obtaining test results at least as extreme as the result actually observed, under the assumption that the null hypothesis is correct.[2][3] A very small p-value means that such an extreme observed outcome would be very unlikely under the null hypothesis. Even though reporting p-values of statistical tests is common practice in academic publications of many quantitative fields, misinterpretation and misuse of p-values is widespread and has been a major topic in mathematics and metascience.[4][5] In 2016, the American Statistical Association (ASA) made a formal statement that "p-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone" and that "a p-value, or statistical significance, does not measure the size of an effect or the importance of a result" or "evidence regarding a model or hypothesis".[6] That said, a 2019 task force by ASA has issued a statement on statistical significance and replicability, concluding with: "p-values and significance tests, when properly applied and interpreted, increase the rigor of the conclusions drawn from data".[7]
In statistics, every conjecture concerning the unknown probability distribution of a collection of random variables representing the observed data 𝑋 in some study is called a statistical hypothesis. If we state one hypothesis only and the aim of the statistical test is to see whether this hypothesis is tenable, but not to investigate other specific hypotheses, then such a test is called a null hypothesis test.
As our statistical hypothesis will, by definition, state some property of the distribution, the null hypothesis is the default hypothesis under which that property does not exist. The null hypothesis is typically that some parameter (such as a correlation or a difference between means) in the populations of interest is zero. Our hypothesis might specify the probability distribution of 𝑋 precisely, or it might only specify that it belongs to some class of distributions. Often, we reduce the data to a single numerical statistic, e.g., 𝑇, whose marginal probability distribution is closely connected to a main question of interest in the study.
The p-value is used in the context of null hypothesis testing in order to quantify the statistical significance of a result, the result being the observed value of the chosen statistic 𝑇.[note 2] The lower the p-value is, the lower the probability of getting that result if the null hypothesis were true. A result is said to be statistically significant if it allows us to reject the null hypothesis. All other things being equal, smaller p-values are taken as stronger evidence against the null hypothesis.
Loosely speaking, rejection of the null hypothesis implies that there is sufficient evidence against it.
As a particular example, if a null hypothesis states that a certain summary statistic 𝑇 follows the standard normal distribution 𝑁(0,1), then the rejection of this null hypothesis could mean that (i) the mean of 𝑇 is not 0, or (ii) the variance of 𝑇 is not 1, or (iii) 𝑇 is not normally distributed. Different tests of the same null hypothesis would be more or less sensitive to different alternatives. However, even if we do manage to reject the null hypothesis for all 3 alternatives, and even if we know that the distribution is normal and variance is 1, the null hypothesis test does not tell us which non-zero values of the mean are now most plausible. The more independent observations from the same probability distribution one has, the more accurate the test will be, and the higher the precision with which one will be able to determine the mean value and show that it is not equal to zero; but this will also increase the importance of evaluating the real-world or scientific relevance of this deviation.
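For the example above, where the statistic 𝑇 follows 𝑁(0,1) under the null hypothesis, a two-sided p-value can be computed directly from the normal tail probability. The sketch below is an illustration under that assumption only; the function name and the observed values are hypothetical.

```python
import math


def two_sided_p_value(t: float) -> float:
    """P(|T| >= |t|) when T ~ N(0, 1) under the null hypothesis.

    Uses the identity 2 * (1 - Phi(|t|)) = erfc(|t| / sqrt(2)).
    """
    return math.erfc(abs(t) / math.sqrt(2))


for t_obs in (0.5, 1.96, 3.0):  # hypothetical observed values of T
    print(f"t = {t_obs:4.2f}  p = {two_sided_p_value(t_obs):.4f}")
```

The more extreme the observed statistic, the smaller the p-value; an observed value of about 1.96 corresponds to the conventional 0.05 threshold for a two-sided test.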
full text link : https://en.wikipedia.org/wiki/P-value
Log
In mathematics, the logarithm is the inverse function to exponentiation. That means that the logarithm of a number x to the base b is the exponent to which b must be raised to produce x. For example, since 1000 = 10^3, the logarithm base 10 of 1000 is 3, or log_10(1000) = 3. The logarithm of x to base b is denoted as log_b(x), or without parentheses, log_b x. When the base is clear from the context or is irrelevant, such as in big O notation, it is sometimes written log x.
The logarithm base 10 is called the decimal or common logarithm and is commonly used in science and engineering. The natural logarithm has the number e ≈ 2.718 as its base; its use is widespread in mathematics and physics, because of its very simple derivative. The binary logarithm uses base 2 and is frequently used in computer science.
Logarithms were introduced by John Napier in 1614 as a means of simplifying calculations.[1] They were rapidly adopted by navigators, scientists, engineers, surveyors, and others to perform high-accuracy computations more easily. Using logarithm tables, tedious multi-digit multiplication steps can be replaced by table look-ups and simpler addition. This is possible because the logarithm of a product is the sum of the logarithms of the factors:
log_b(xy) = log_b x + log_b y,
provided that b, x and y are all positive and b ≠ 1. The slide rule, also based on logarithms, allows quick calculations without tables, but at lower precision. The present-day notion of logarithms comes from Leonhard Euler, who connected them to the exponential function in the 18th century, and who also introduced the letter e as the base of natural logarithms.[2]
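The product rule can be checked numerically with Python's standard `math` module. The values of x, y and b below are arbitrary choices for illustration.

```python
import math

# Arbitrary positive example values, with base b != 1.
x, y, b = 8.0, 32.0, 2.0

lhs = math.log(x * y, b)               # logarithm of the product
rhs = math.log(x, b) + math.log(y, b)  # sum of the logarithms

print(lhs, rhs)
print(math.isclose(lhs, rhs))

# The base-10 example from the text: 1000 is 10 raised to the power 3.
common_log = math.log10(1000)
print(common_log)
```

Because floating-point arithmetic is approximate, the comparison uses `math.isclose` rather than exact equality.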
Logarithmic scales reduce wide-ranging quantities to smaller scopes. For example, the decibel (dB) is a unit used to express ratio as logarithms, mostly for signal power and amplitude (of which sound pressure is a common example). In chemistry, pH is a logarithmic measure for the acidity of an aqueous solution. Logarithms are commonplace in scientific formulae, and in measurements of the complexity of algorithms and of geometric objects called fractals. They help to describe frequency ratios of musical intervals, appear in formulas counting prime numbers or approximating factorials, inform some models in psychophysics, and can aid in forensic accounting.
The concept of logarithm as the inverse of exponentiation extends to other mathematical structures as well. However, in general settings, the logarithm tends to be a multi-valued function. For example, the complex logarithm is the multi-valued inverse of the complex exponential function. Similarly, the discrete logarithm is the multi-valued inverse of the exponential function in finite groups; it has uses in public-key cryptography.
full text link : https://en.wikipedia.org/wiki/Logarithm
Likelihood
The likelihood function (often simply called the likelihood) is the joint probability mass (or probability density) of observed data viewed as a function of the parameters of a statistical model.[1][2][3] Intuitively, the likelihood function 𝐿(𝜃∣𝑥) is the probability of observing data 𝑥 assuming 𝜃 is the actual parameter.
In maximum likelihood estimation, the arg max (over the parameter 𝜃) of the likelihood function serves as a point estimate for 𝜃, while the Fisher information (often approximated by the likelihood's Hessian matrix) indicates the estimate's precision.
In contrast, in Bayesian statistics, parameter estimates are derived from the converse of the likelihood, the so-called posterior probability, which is calculated via Bayes' rule.[4]
The likelihood function, parameterized by a (possibly multivariate) parameter 𝜃, is usually defined differently for discrete and continuous probability distributions (a more general definition is discussed below). Given a probability density or mass function
𝑥↦𝑓(𝑥∣𝜃),
where 𝑥 is a realization of the random variable 𝑋, the likelihood function is 𝜃↦𝑓(𝑥∣𝜃),
often written
𝐿(𝜃∣𝑥).
In other words, when 𝑓(𝑥∣𝜃) is viewed as a function of 𝑥 with 𝜃 fixed, it is a probability density function, and when viewed as a function of 𝜃 with 𝑥 fixed, it is a likelihood function. In the frequentist paradigm, the notation 𝑓(𝑥∣𝜃) is often avoided and instead 𝑓(𝑥;𝜃) or 𝑓(𝑥,𝜃) are used to indicate that 𝜃 is regarded as a fixed unknown quantity rather than as a random variable being conditioned on.
The likelihood function does not specify the probability that 𝜃 is the truth, given the observed sample 𝑋=𝑥. Such an interpretation is a common error, with potentially disastrous consequences (see prosecutor's fallacy).
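A small worked example may help separate the likelihood from a probability distribution over 𝜃. The sketch below assumes a binomial model for coin flips with made-up data (7 heads in 10 flips), finds the maximum likelihood estimate on a coarse grid, and shows that the likelihood does not integrate to 1 over 𝜃. The function name and the grid resolution are illustrative assumptions.

```python
from math import comb


def binomial_likelihood(theta: float, heads: int, n: int) -> float:
    """L(theta | x): probability of `heads` successes in n flips, as a function of theta."""
    return comb(n, heads) * theta**heads * (1 - theta) ** (n - heads)


heads, n = 7, 10                          # made-up observed data
grid = [i / 100 for i in range(101)]      # candidate values of theta in [0, 1]

# Maximum likelihood estimate: the grid value with the highest likelihood.
theta_hat = max(grid, key=lambda t: binomial_likelihood(t, heads, n))

# Approximate area under the likelihood curve over theta (step width 0.01).
area = sum(binomial_likelihood(t, heads, n) for t in grid) * 0.01

print(theta_hat)
print(round(area, 4))
```

The area under the curve is about 1/(n + 1), not 1, so the likelihood cannot be read as the probability that a given 𝜃 is the true parameter.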
full text link : https://en.wikipedia.org/wiki/Likelihood_function
E-value
In statistical hypothesis testing, e-values quantify the evidence in the data against a null hypothesis (e.g., "the coin is fair", or, in a medical context, "this new treatment has no effect"). They serve as a more robust alternative to p-values, addressing some shortcomings of the latter.
In contrast to p-values, e-values can deal with optional continuation: e-values of subsequent experiments (e.g. clinical trials concerning the same treatment) may simply be multiplied to provide a new, "product" e-value that represents the evidence in the joint experiment. This works even if, as often happens in practice, the decision to perform later experiments may depend in vague, unknown ways on the data observed in earlier experiments, and it is not known beforehand how many trials will be conducted: the product e-value remains a meaningful quantity, leading to tests with Type-I error control. For this reason, e-values and their sequential extension, the e-process, are the fundamental building blocks for anytime-valid statistical methods (e.g. confidence sequences). Another advantage over p-values is that any weighted average of e-values remains an e-value, even if the individual e-values are arbitrarily dependent. This is one of the reasons why e-values have also turned out to be useful tools in multiple testing.[1]
E-values can be interpreted in a number of different ways: first, the reciprocal of any e-value is itself a p-value, but a special, conservative one, quite different from p-values used in practice. Second, they are broad generalizations of likelihood ratios and are also related to, yet distinct from, Bayes factors. Third, they have an interpretation as bets. Finally, in a sequential context, they can also be interpreted as increments of nonnegative supermartingales. Interest in e-values has exploded since 2019, when the term 'e-value' was coined and a number of breakthrough results were achieved by several research groups. The first overview article appeared in 2023.[2]
Let the null hypothesis 𝐻0 be given as a set of distributions for data 𝑌. Usually 𝑌=(𝑋1,…,𝑋𝜏) with each 𝑋𝑖 a single outcome and 𝜏 a fixed sample size or some stopping time. We shall refer to such 𝑌, which represent the full sequence of outcomes of a statistical experiment, as a sample or batch of outcomes. But in some cases 𝑌 may also be an unordered bag of outcomes or a single outcome.
An e-variable or e-statistic is a nonnegative random variable 𝐸=𝐸(𝑌) such that under all 𝑃∈𝐻0, its expected value is bounded by 1:
𝐸_𝑃[𝐸] ≤ 1.
The value taken by e-variable 𝐸 is called the e-value. In practice, the term e-value (a number) is often used when one is really referring to the underlying e-variable (a random variable, that is, a measurable function of the data).
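As a minimal sketch of the definition above, a likelihood ratio against a simple null hypothesis is an e-variable. The example assumes a coin-flip setting with null "P(heads) = 0.5" and an alternative of 0.7; the alternative, the flip sequences, and the function names are all hypothetical choices made for illustration.

```python
from itertools import product

def lr_e_value(flips, q=0.7, p0=0.5):
    """Likelihood-ratio e-value for coin flips (1 = heads) against H0: P(heads) = p0."""
    e = 1.0
    for x in flips:
        e *= (q / p0) if x == 1 else ((1 - q) / (1 - p0))
    return e

def expected_e_under_null(n, q=0.7, p0=0.5):
    """Exact expectation of the e-variable under H0, enumerating all 2**n outcomes."""
    total = 0.0
    for flips in product([0, 1], repeat=n):
        prob = 1.0
        for x in flips:
            prob *= p0 if x == 1 else (1 - p0)
        total += prob * lr_e_value(flips, q, p0)
    return total

# Two experiments on the same coin (made-up data).
trial_1 = [1, 1, 0, 1, 1, 1, 0, 1]
trial_2 = [1, 0, 1, 1, 1, 1]
e1, e2 = lr_e_value(trial_1), lr_e_value(trial_2)

combined = e1 * e2            # product e-value for the joint experiment
alpha = 0.05
reject = combined >= 1 / alpha  # reject H0 only if the e-value reaches 1/alpha

print(expected_e_under_null(5), combined, reject)
```

The threshold 1/alpha follows from the statement that the reciprocal of an e-value is a conservative p-value.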
full text link : https://en.wikipedia.org/wiki/E-values
2024.05.03
Tetrahymena
As a ciliated protozoan, Tetrahymena thermophila exhibits nuclear dimorphism: two types of cell nuclei. Each cell has a bigger, non-germline macronucleus and a small, germline micronucleus at the same time, and these two carry out different functions with distinct cytological and biological properties. This unique versatility allows scientists to use Tetrahymena to identify several key factors regarding gene expression and genome integrity. In addition, Tetrahymena possesses hundreds of cilia and has complicated microtubule structures, making it an optimal model to illustrate the diversity and functions of microtubule arrays.
Because Tetrahymena can be grown in a large quantity in the laboratory with ease, it has been a great source for biochemical analysis for years, specifically for enzymatic activities and purification of sub-cellular components. In addition, with the advancement of genetic techniques it has become an excellent model to study the gene function in vivo. The recent sequencing of the macronucleus genome should ensure that Tetrahymena will be continuously used as a model system.
Tetrahymena thermophila exists in 7 different sexes (mating types) that can reproduce in 21 different combinations, and a single Tetrahymena cannot reproduce sexually with itself. Each organism "decides" which sex it will become during mating, through a stochastic process.[5][6]
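The figure of 21 combinations is the number of unordered pairs of two different mating types out of 7, since a cell cannot mate with its own type. A short check, using Roman-numeral labels chosen here only for illustration:

```python
from itertools import combinations
from math import comb

mating_types = ["I", "II", "III", "IV", "V", "VI", "VII"]  # illustrative labels

# Every unordered pair of two distinct mating types is one possible combination.
pairs = list(combinations(mating_types, 2))

print(len(pairs), comb(7, 2))
```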
Studies on Tetrahymena have contributed to several scientific milestones including:
- First cell which showed synchronized division, which led to the first insights into the existence of mechanisms which control the cell cycle.[7]
- Identification and purification of the first cytoskeleton-based motor protein, dynein.[7]
- Aid in the discovery of lysosomes and peroxisomes.[7]
- Early molecular identification of somatic genome rearrangement.[7]
- Discovery of the molecular structure of telomeres, telomerase enzyme, the templating role of telomerase RNA and their roles in cellular senescence and chromosome healing (for which a Nobel Prize was won).[7]
- Nobel Prize–winning co-discovery (1989, in Chemistry) of catalytic RNA (ribozyme).[7][8]
- Discovery of the function of histone acetylation.[7]
- Demonstration of the roles of posttranslational modifications such as acetylation and glycylation on tubulins and discovery of the enzymes responsible for some of these modifications (glutamylation).
- Crystal structure of the 40S ribosome in complex with its initiation factor eIF1.
- First demonstration that two of the "universal" stop codons, UAA and UAG, will code for the amino acid glutamine in some eukaryotes, leaving UGA as the only termination codon in these organisms.[9]
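The last item in the list above can be sketched as a translation with two codon tables. The tables are deliberately partial (only the codons used in the example), and the mRNA string is made up; the only difference between them is the reassignment of UAA and UAG to glutamine (Q) described in the list.

```python
# Partial codon tables for illustration only; "*" marks a stop codon.
STANDARD = {"AUG": "M", "AAA": "K", "CAA": "Q", "CAG": "Q",
            "UAA": "*", "UAG": "*", "UGA": "*"}
# Variant code: UAA and UAG encode glutamine, UGA remains the only stop.
CILIATE = {**STANDARD, "UAA": "Q", "UAG": "Q"}

def translate(mrna: str, table: dict) -> str:
    """Translate an mRNA codon by codon until the first stop codon."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        amino_acid = table[mrna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

mrna = "AUGAAAUAACAGUAGUGA"  # hypothetical message: AUG AAA UAA CAG UAG UGA

print(translate(mrna, STANDARD))  # stops at the first UAA
print(translate(mrna, CILIATE))   # reads through UAA and UAG, stops at UGA
```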
link : https://en.wikipedia.org/wiki/Tetrahymena
telomere
A telomere (/ˈtɛləmɪər, ˈtiːlə-/; from Ancient Greek τέλος (télos) 'end', and μέρος (méros) 'part') is a region of repetitive nucleotide sequences associated with specialized proteins at the ends of linear chromosomes (see Sequences). Telomeres are a widespread genetic feature most commonly found in eukaryotes. In most, if not all species possessing them, they protect the terminal regions of chromosomal DNA from progressive degradation and ensure the integrity of linear chromosomes by preventing DNA repair systems from mistaking the very ends of the DNA strand for a double-strand break.
The existence of a special structure at the ends of chromosomes was independently proposed in 1938 by Hermann Joseph Muller, studying the fruit fly Drosophila melanogaster, and in 1939 by Barbara McClintock, working with maize.[1] Muller observed that the ends of irradiated fruit fly chromosomes did not present alterations such as deletions or inversions. He hypothesized the presence of a protective cap, which he coined "telomeres", from the Greek telos (end) and meros (part).[2]
In the early 1970s, Soviet theorist Alexei Olovnikov first recognized that chromosomes could not completely replicate their ends; this is known as the "end replication problem". Building on this, and accommodating Leonard Hayflick's idea of limited somatic cell division, Olovnikov suggested that DNA sequences are lost every time a cell replicates until the loss reaches a critical level, at which point cell division ends.[3][4][5] According to his theory of marginotomy, DNA sequences at the ends of telomeres are represented by tandem repeats, which create a buffer that determines the number of divisions that a certain cell clone can undergo. Furthermore, it was predicted that a specialized DNA polymerase (originally called a tandem-DNA-polymerase) could extend telomeres in immortal tissues such as germ line, cancer cells and stem cells. It also followed from this hypothesis that organisms with circular genomes, such as bacteria, do not have the end replication problem and therefore do not age.
In 1975–1977, Elizabeth Blackburn, working as a postdoctoral fellow at Yale University with Joseph G. Gall, discovered the unusual nature of telomeres, with their simple repeated DNA sequences composing chromosome ends.[6] Blackburn, Carol Greider, and Jack Szostak were awarded the 2009 Nobel Prize in Physiology or Medicine for the discovery of how chromosomes are protected by telomeres and the enzyme telomerase.[7]
During DNA replication, DNA polymerase cannot replicate the sequences present at the 3' ends of the parent strands. This is a consequence of its unidirectional mode of DNA synthesis: it can only attach new nucleotides to an existing 3'-end (that is, synthesis progresses 5'-3') and thus it requires a primer to initiate replication. On the leading strand (oriented 5'-3' within the replication fork), DNA-polymerase continuously replicates from the point of initiation all the way to the strand's end with the primer (made of RNA) then being excised and substituted by DNA. The lagging strand, however, is oriented 3'-5' with respect to the replication fork so continuous replication by DNA-polymerase is impossible, which necessitates discontinuous replication involving the repeated synthesis of primers further 5' of the site of initiation (see lagging strand replication). The last primer to be involved in lagging-strand replication sits near the 3'-end of the template (corresponding to the potential 5'-end of the lagging-strand). Originally it was believed that the last primer would sit at the very end of the template, thus, once removed, the DNA-polymerase that substitutes primers with DNA (DNA-Pol δ in eukaryotes)[note 1] would be unable to synthesize the "replacement DNA" from the 5'-end of the lagging strand so that the template nucleotides previously paired to the last primer would not be replicated.[8] It has since been questioned whether the last lagging strand primer is placed exactly at the 3'-end of the template and it was demonstrated that it is rather synthesized at a distance of about 70–100 nucleotides which is consistent with the finding that DNA in cultured human cell is shortened by 50–100 base pairs per cell division.[9]
If coding sequences are degraded in this process, potentially vital genetic code would be lost. Telomeres are non-coding, repetitive sequences located at the termini of linear chromosomes to act as buffers for those coding sequences further behind. They "cap" the end-sequences and are progressively degraded in the process of DNA replication.
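The buffering role of telomeres can be put into rough numbers. The sketch below uses the loss of 50–100 base pairs per division quoted above; the starting telomere length (10,000 bp) and the critical length (4,000 bp) are hypothetical values picked only to make the arithmetic concrete.

```python
def divisions_until_critical(start_bp: int, critical_bp: int, loss_per_division_bp: int) -> int:
    """Count the divisions possible before the telomere would drop below the critical length."""
    divisions = 0
    length = start_bp
    while length - loss_per_division_bp >= critical_bp:
        length -= loss_per_division_bp
        divisions += 1
    return divisions

START_BP = 10_000     # assumed starting telomere length
CRITICAL_BP = 4_000   # assumed length at which division stops

fast_loss = divisions_until_critical(START_BP, CRITICAL_BP, 100)
slow_loss = divisions_until_critical(START_BP, CRITICAL_BP, 50)

print(fast_loss, slow_loss)
```

Under these assumptions the buffer allows between 60 and 120 divisions; different starting or critical lengths change the count proportionally.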
The "end replication problem" is exclusive to linear chromosomes as circular chromosomes do not have ends lying without reach of DNA-polymerases. Most prokaryotes, relying on circular chromosomes, accordingly do not possess telomeres.[10] A small fraction of bacterial chromosomes (such as those in Streptomyces, Agrobacterium, and Borrelia), however, are linear and possess telomeres, which are very different from those of the eukaryotic chromosomes in structure and function. The known structures of bacterial telomeres take the form of proteins bound to the ends of linear chromosomes, or hairpin loops of single-stranded DNA at the ends of the linear chromosomes.[11]
Telomerase
Telomerase, also called terminal transferase,[1] is a ribonucleoprotein that adds a species-dependent telomere repeat sequence to the 3' end of telomeres. A telomere is a region of repetitive sequences at each end of the chromosomes of most eukaryotes. Telomeres protect the end of the chromosome from DNA damage or from fusion with neighbouring chromosomes. The fruit fly Drosophila melanogaster lacks telomerase, but instead uses retrotransposons to maintain telomeres.[2]
Telomerase is a reverse transcriptase enzyme that carries its own RNA molecule (e.g., with the sequence 3′-CCCAAUCCC-5′ in Trypanosoma brucei)[3] which is used as a template when it elongates telomeres. Telomerase is active in gametes and most cancer cells, but is normally absent in most somatic cells.
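How the RNA template determines the added repeat can be sketched with simple base pairing. The function name is hypothetical; the template is the Trypanosoma brucei sequence quoted above. Because the template is written 3′ to 5′, pairing each base in order gives the new DNA already written 5′ to 3′.

```python
# RNA template base -> complementary DNA base added by reverse transcription.
RNA_TO_DNA = {"A": "T", "U": "A", "C": "G", "G": "C"}

def reverse_transcribe(template_3_to_5: str) -> str:
    """Return the DNA (written 5'->3') templated by an RNA written 3'->5'."""
    return "".join(RNA_TO_DNA[base] for base in template_3_to_5)

template = "CCCAAUCCC"  # 3'-CCCAAUCCC-5', from the text above
telomeric_dna = reverse_transcribe(template)

print(telomeric_dna)
```

The result contains the motif TTAGGG, a G-rich telomere repeat.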
The existence of a compensatory mechanism for telomere shortening was first found by Soviet biologist Alexey Olovnikov in 1973,[4] who also suggested the telomere hypothesis of aging and the telomere's connections to cancer and perhaps some neurodegenerative diseases.[5]
Telomerase in the ciliate Tetrahymena was discovered by Carol W. Greider and Elizabeth Blackburn in 1984.[6] Together with Jack W. Szostak, Greider and Blackburn were awarded the 2009 Nobel Prize in Physiology or Medicine for their discovery.[7] Later the cryo-EM structure of telomerase was first reported in T. thermophila, to be followed a few years later by the cryo-EM structure of telomerase in humans.[8]
The role of telomeres and telomerase in cell aging and cancer was established by scientists at biotechnology company Geron with the cloning of the RNA and catalytic components of human telomerase[9] and the development of a polymerase chain reaction (PCR) based assay for telomerase activity called the TRAP assay, which surveys telomerase activity in multiple types of cancer.[10]
The negative stain electron microscopy (EM) structures of human and Tetrahymena telomerases were characterized in 2013.[11][12] Two years later, the first cryo-electron microscopy (cryo-EM) structure of telomerase holoenzyme (Tetrahymena) was determined.[13] In 2018, the structure of human telomerase was determined through cryo-EM by UC Berkeley scientists.[14]
full text link : https://en.wikipedia.org/wiki/Telomerase
DNA replication
In molecular biology,[1][2][3] DNA replication is the biological process of producing two identical replicas of DNA from one original DNA molecule.[4] DNA replication occurs in all living organisms acting as the most essential part of biological inheritance. This is essential for cell division during growth and repair of damaged tissues, while it also ensures that each of the new cells receives its own copy of the DNA.[5] The cell possesses the distinctive property of division, which makes replication of DNA essential.
DNA is made up of a double helix of two complementary strands. The double helix describes the appearance of double-stranded DNA, which is composed of two linear strands that run opposite to each other and twist together to form the helix.[6] During replication, these strands are separated. Each strand of the original DNA molecule then serves as a template for the production of its counterpart, a process referred to as semiconservative replication. As a result of semiconservative replication, the new helix will be composed of an original DNA strand as well as a newly synthesized strand.[7] Cellular proofreading and error-checking mechanisms ensure near perfect fidelity for DNA replication.[8][9]
In a cell, DNA replication begins at specific locations, or origins of replication,[10] in the genome[11] which contains the genetic material of an organism.[12] Unwinding of DNA at the origin and synthesis of new strands, accommodated by an enzyme known as helicase, results in replication forks growing bi-directionally from the origin. A number of proteins are associated with the replication fork to help in the initiation and continuation of DNA synthesis. Most prominently, DNA polymerase synthesizes the new strands by adding nucleotides that complement each (template) strand. DNA replication occurs during the S-stage of interphase.[13]
DNA replication (DNA amplification) can also be performed in vitro (artificially, outside a cell).[14] DNA polymerases isolated from cells and artificial DNA primers can be used to start DNA synthesis at known sequences in a template DNA molecule. Polymerase chain reaction (PCR), ligase chain reaction (LCR), and transcription-mediated amplification (TMA) are examples. In March 2021, researchers reported evidence suggesting that a preliminary form of transfer RNA, a necessary component of translation, the biological synthesis of new proteins in accordance with the genetic code, could have been a replicator molecule itself in the very early development of life, or abiogenesis.[15][16]
DNA Structure
DNA exists as a double-stranded structure, with both strands coiled together to form the characteristic double helix. Each single strand of DNA is a chain of four types of nucleotides. Nucleotides in DNA contain a deoxyribose sugar, a phosphate, and a nucleobase. The four types of nucleotide correspond to the four nucleobases adenine, cytosine, guanine, and thymine, commonly abbreviated as A, C, G, and T. Adenine and guanine are purine bases,[17] while cytosine and thymine are pyrimidines. These nucleotides form phosphodiester bonds, creating the phosphate-deoxyribose backbone of the DNA double helix with the nucleobases pointing inward (i.e., toward the opposing strand). Nucleobases are matched between strands through hydrogen bonds to form base pairs. Adenine pairs with thymine (two hydrogen bonds), and guanine pairs with cytosine (three hydrogen bonds).[18]
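The pairing rules above (two hydrogen bonds per A–T pair, three per G–C pair) allow a simple count of the hydrogen bonds holding a duplex together. This is a minimal sketch with hypothetical example sequences; it counts bonds only and is not a model of duplex stability.

```python
# Hydrogen bonds contributed by the base pair formed at each position of one strand.
H_BONDS = {"A": 2, "T": 2, "G": 3, "C": 3}

def hydrogen_bonds(strand: str) -> int:
    """Total hydrogen bonds between a strand and its fully complementary partner."""
    return sum(H_BONDS[base] for base in strand.upper())

print(hydrogen_bonds("AATT"), hydrogen_bonds("GGCC"), hydrogen_bonds("ATGC"))
```

For equal length, the G–C rich sequence has more hydrogen bonds between its strands than the A–T rich one.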
DNA strands have a directionality, and the different ends of a single strand are called the "3′ (three-prime) end" and the "5′ (five-prime) end". By convention, if the base sequence of a single strand of DNA is given, the left end of the sequence is the 5′ end, while the right end of the sequence is the 3′ end. The strands of the double helix are anti-parallel, with one being 5′ to 3′, and the opposite strand 3′ to 5′. These terms refer to the carbon atom in deoxyribose to which the next phosphate in the chain attaches. Directionality has consequences in DNA synthesis, because DNA polymerase can synthesize DNA in only one direction by adding nucleotides to the 3′ end of a DNA strand.
The pairing of complementary bases in DNA (through hydrogen bonding) means that the information contained within each strand is redundant. Phosphodiester (intra-strand) bonds are stronger than hydrogen (inter-strand) bonds. Phosphodiester bonds connect the 5' carbon atom of one nucleotide to the 3' carbon atom of the next nucleotide along a strand, while the hydrogen bonds stabilize DNA double helices across the helix axis but not in the direction of the axis.[19] This makes it possible to separate the strands from one another. The nucleotides on a single strand can therefore be used to reconstruct nucleotides on a newly synthesized partner strand.[20]
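The redundancy described above means a partner strand can be computed from either strand alone. A minimal sketch, assuming sequences are written 5′ to 3′ as in the convention stated earlier (the function name and example sequence are hypothetical):

```python
# Watson-Crick complement for each DNA base.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def partner_strand(strand_5_to_3: str) -> str:
    """Return the complementary strand, also written 5'->3'.

    The strands are anti-parallel, so the complement is reversed
    to keep the 5' end on the left.
    """
    return strand_5_to_3.translate(COMPLEMENT)[::-1]

original = "ATGGCA"
partner = partner_strand(original)

print(partner, partner_strand(partner))
```

Applying the function twice returns the original sequence, which is the sense in which each strand carries the full information.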
DNA polymerase
DNA polymerases are a family of enzymes that carry out all forms of DNA replication.[22] DNA polymerases in general cannot initiate synthesis of new strands but can only extend an existing DNA or RNA strand paired with a template strand. To begin synthesis, a short fragment of RNA, called a primer, must be created and paired with the template DNA strand.
DNA polymerase adds a new strand of DNA by extending the 3′ end of an existing nucleotide chain, adding new nucleotides matched to the template strand, one at a time, via the creation of phosphodiester bonds. The energy for this process of DNA polymerization comes from hydrolysis of the high-energy phosphate (phosphoanhydride) bonds between the three phosphates attached to each unincorporated base. Free bases with their attached phosphate groups are called nucleotides; in particular, bases with three attached phosphate groups are called nucleoside triphosphates. When a nucleotide is being added to a growing DNA strand, the formation of a phosphodiester bond between the proximal phosphate of the nucleotide to the growing chain is accompanied by hydrolysis of a high-energy phosphate bond with release of the two distal phosphate groups as a pyrophosphate. Enzymatic hydrolysis of the resulting pyrophosphate into inorganic phosphate consumes a second high-energy phosphate bond and renders the reaction effectively irreversible.[Note 1]
In general, DNA polymerases are highly accurate, with an intrinsic error rate of less than one mistake for every 10^7 nucleotides added.[23] Some DNA polymerases can also delete nucleotides from the end of a developing strand in order to fix mismatched bases. This is known as proofreading. Finally, post-replication mismatch repair mechanisms monitor the DNA for errors, being capable of distinguishing mismatches in the newly synthesized DNA strand from the original strand sequence. Together, these three discrimination steps enable replication fidelity of less than one mistake for every 10^9 nucleotides added.[23]
The rate of DNA replication in a living cell was first measured as the rate of phage T4 DNA elongation in phage-infected E. coli.[24] During the period of exponential DNA increase at 37 °C, the rate was 749 nucleotides per second. The mutation rate per base pair per replication during phage T4 DNA synthesis is 1.7 per 10^8.[25]
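To see what these fidelity figures mean in practice, the error rates (one per 10^7 for the polymerase alone, one per 10^9 with proofreading and mismatch repair) can be multiplied by a genome size. The genome size used here, about 3.2 billion base pairs for a haploid human genome, is an assumption added for illustration, and the results are upper bounds because the text gives the rates as "less than".

```python
def expected_errors(nucleotides_copied: int, errors_per_nucleotide: float) -> float:
    """Expected number of mistakes when copying a given number of nucleotides."""
    return nucleotides_copied * errors_per_nucleotide

GENOME_BP = 3_200_000_000  # assumed genome size (approximate haploid human genome)

polymerase_only = expected_errors(GENOME_BP, 1e-7)  # intrinsic polymerase accuracy
all_three_steps = expected_errors(GENOME_BP, 1e-9)  # with proofreading and mismatch repair

print(polymerase_only, all_three_steps)
```

Under this assumption the additional checking steps reduce the expected mistakes per genome copy from a few hundred to a few.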
Transposase
A transposase is an enzyme that binds to the end of a transposon and catalyzes its movement to another part of the genome by a cut-and-paste mechanism or a replicative mechanism, in a process known as transposition. The word "transposase" was first coined by the individuals who cloned the enzyme required for transposition of the Tn3 transposon.[1] The existence of transposons was postulated in the late 1940s by Barbara McClintock, who was studying the inheritance of maize, but the actual molecular basis for transposition was described by later groups. McClintock discovered that some segments of chromosomes changed their position, jumping between different loci or from one chromosome to another. The repositioning of these transposons (which coded for color) allowed other genes for pigment to be expressed.[2] Transposition in maize causes changes in color; however, in other organisms, such as bacteria, it can cause antibiotic resistance.[2] Transposition is also important in creating genetic diversity within species and generating adaptability to changing living conditions.[3]
Transposases are classified under EC number EC 2.7.7. Genes encoding transposases are widespread in the genomes of most organisms and are the most abundant genes known.[4] During the course of human evolution, as much as 40% of the human genome has moved around via methods such as transposition of transposons.[2]
Transposase Tn5
Transposase (Tnp) Tn5 is a member of the RNase superfamily of proteins which includes retroviral integrases. Tn5 can be found in Shewanella and Escherichia bacteria.[5] The transposon codes for antibiotic resistance to kanamycin and other aminoglycoside antibiotics.[3][6]
Tn5 and other transposases are notably inactive. Because DNA transposition events are inherently mutagenic, the low activity of transposases is necessary to reduce the risk of causing a fatal mutation in the host, and thus eliminating the transposable element. One of the reasons Tn5 is so unreactive is because the N- and C-termini are located in relatively close proximity to one another and tend to inhibit each other. This was elucidated by the characterization of several mutations which resulted in hyperactive forms of transposases. One such mutation, L372P, is a mutation of amino acid 372 in the Tn5 transposase. This amino acid is generally a leucine residue in the middle of an alpha helix. When this leucine is replaced with a proline residue the alpha helix is broken, introducing a conformational change to the C-terminal domain, separating it from the N-terminal domain enough to promote higher activity of the protein.[3] The transposition of a transposon often needs only three pieces: the transposon, the transposase enzyme, and the target DNA for the insertion of the transposon.[3] This is the case with Tn5, which uses a cut-and-paste mechanism for moving around transposons.[3]
Tn5 and most other transposases contain a DDE motif, which is the active site that catalyzes the movement of the transposon. Aspartate-97, aspartate-188, and glutamate-326 make up the active site, which is a triad of acidic residues.[7] The DDE motif is said to coordinate divalent metal ions, most often magnesium and manganese, which are important in the catalytic reaction.[7] Because transposase is incredibly inactive, the DDE region is mutated so that the transposase becomes hyperactive and catalyzes the movement of the transposon.[7] The glutamate is transformed into an aspartate and the two aspartates into glutamates.[7] Through this mutation, the study of Tn5 becomes possible, but some steps in the catalytic process are lost as a result.[3]