Purines
(1) Adenine – 6-aminopurine
(2) Guanine – 2-amino-6-oxypurine
Pyrimidines
(1) Cytosine (2-oxy-4-aminopyrimidine)
(2) Thymine (2,4-dioxy-5-methylpyrimidine)
(3) Uracil (2,4-dioxypyrimidine)
Bases
1. Purine bases: adenine and guanine (2 rings, in both DNA and RNA)
2. Pyrimidine bases: cytosine (DNA and RNA), thymine (DNA) and uracil (RNA) (1 ring)
Nucleosides and nucleotides
• Nucleoside = sugar + base (base joined to C1 of the sugar by an N-glycosidic bond)
• Nucleotide = sugar + base + phosphate (phosphate group joined to the 5’ carbon of the sugar by an ester bond)
• A nucleotide is a nucleoside mono-, di- or triphosphate.
• The two terminal phosphate bonds are acid anhydride bonds possessing high energy (-7.3 kcal/mol).
Nucleic acid strand
• Nucleotides are linked by a phosphodiester bond between the 3’-OH group of the 1st nucleotide and the
phosphate group at the 5’ position of the pentose of the 2nd nucleotide.
• Each strand has polarity, with a 5’ end and a 3’ end. A phosphate group is usually found at the 5’ end
and a hydroxyl group at the 3’ end.
Structure of DNA
DNA is the chemical basis of heredity. The genetic information of all living organisms, except RNA
viruses, is stored in DNA. The B form was described by Watson and Crick. It is a double-stranded structure
in the form of a right-handed double helix. The two strands are antiparallel, i.e., the 5’ to 3’ directions of
the two polynucleotide strands run opposite to each other, and this arrangement produces a stable association
between the strands. Each strand is a polymer of deoxyribonucleotides (deoxyribose, nitrogenous base and
phosphate). Nucleotides are linked by 3',5'-phosphodiester bonds. The bases in DNA are purines (adenine and
guanine) and pyrimidines (thymine and cytosine). Hydrophilic deoxyribose and phosphate form the backbone
on the outside, while the hydrophobic bases are stacked inside the molecule. The base sequence carries the
genetic information.
The width of the double helix is 20 Å (2 nm). Each turn of the helix is 34 Å (3.4 nm) long and contains
10 base pairs. Intertwining of the two antiparallel strands produces a major groove and a minor groove. The
number of base pairs and the length of DNA vary from species to species. Base pairing and hydrophobic
base-stacking interactions hold the two DNA strands together and maintain the stability of the double helix.
Base pairing is complementary: A and T are paired by 2 hydrogen bonds and G and C by 3 hydrogen bonds.
Because of this specific base pairing (Chargaff’s rule), total purines equal total pyrimidines. The nucleus
contains linear DNA packed into chromosomes, and mitochondria contain circular, bare DNA.
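The pairing rules and helix dimensions above can be checked with a short Python sketch. It is an illustration only: the function names and the sample sequence are made up for this example, and the length estimate assumes ideal B-form geometry (10 bp and 3.4 nm per turn, as stated above).

```python
# Watson-Crick pairing from the text: A-T (2 hydrogen bonds), G-C (3 hydrogen bonds).
PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}
H_BONDS = {"A": 2, "T": 2, "G": 3, "C": 3}
BP_PER_TURN = 10   # B-form value from the text
NM_PER_TURN = 3.4  # B-form value from the text


def complementary_strand(strand_5to3: str) -> str:
    """Return the antiparallel partner strand, also written 5' to 3'."""
    return "".join(PAIR[base] for base in reversed(strand_5to3))


def duplex_summary(strand_5to3: str) -> dict:
    """Summarize the duplex formed by a strand and its partner."""
    partner = complementary_strand(strand_5to3)
    both = strand_5to3 + partner
    return {
        "partner": partner,
        "purines": sum(both.count(base) for base in "AG"),
        "pyrimidines": sum(both.count(base) for base in "CT"),
        "hydrogen_bonds": sum(H_BONDS[base] for base in strand_5to3),
        "length_nm": len(strand_5to3) * NM_PER_TURN / BP_PER_TURN,
    }


summary = duplex_summary("ATGCGGATTC")  # hypothetical 10-bp sequence
print(summary)
```

For any input strand, the purine and pyrimidine counts of the duplex come out equal, which is Chargaff's rule restated.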
Alternative forms of DNA: the A to E and Z families have been shown by X-ray crystallography. The A
form exists at high salt concentration. It is a right-handed helix with 11 bp per turn, but it is not found
in vivo. When the relative humidity of B-form DNA falls below 75%, the B form undergoes a reversible
transition into the A form. The Z form is a left-handed helix with 12 bp per turn, formed by alternating G:C
and C:G sequences. It is favored at high ionic concentration. It has a novel zigzag conformation, and its
function is still not clear.
Functions of DNA
1. Genetic information in DNA serves as the source of information for the synthesis of all proteins of the
cell and organism. Thus, DNA serves as a template for transcription into RNA, which is then translated into
a specific protein.
2. It provides progeny with the genetic information possessed by the parent. Thus, both strands of DNA serve
as templates for replication into daughter DNA.
Chromosome
Every cell of a multicellular organism contains the same genetic material. The human genome is made
up of 3.2 × 10^9 nucleotides distributed over 24 different chromosomes (22 autosomes plus the X and Y
chromosomes). Each chromosome contains between 48 million and 240 million base pairs. In a diploid cell,
the chromosomes are present as 23 pairs of different sizes.
Each human somatic cell contains two copies of each chromosome, one inherited from the mother and
one from the father. The maternal and paternal chromosomes of a pair are called homologous chromosomes
(homologs). The only non-homologous chromosome pair is the sex chromosomes in males, where the Y
chromosome is inherited from the father and the X chromosome from the mother. Somatic cells contain the
diploid number: 22 pairs of autosomes and one pair of sex chromosomes. Germ cells contain the haploid
number: 22 autosomes and either an X or a Y chromosome. The large excess of DNA that does not carry
critical information is called junk DNA. It acts as spacer material and is essential for the long-term
evolution of the species and the proper expression of genes.
Chromosomes are named according to the position of the centromere.
1. Metacentric chromosome: the centromere is near the middle of the chromosome
2. Acrocentric chromosome: the centromere is near the telomere
3. Submetacentric chromosome: the centromere is located between the middle and the telomere
DNA sequences on either side of the centromere are designated as the arms of a chromosome: long arm (q)
and short arm (p). The arms are subdivided into regions.
Maps to describe the location of a particular gene on a chromosome
1. Map using cytogenetic location which is based on a distinctive pattern of bands created by
staining of chromosomes with certain chemicals
2. Map using the molecular location which is based on the sequence of DNA base pairs that describe
the precise location of gene on chromosome
5 2022 Molecular Biology
Each chromosome of a eukaryotic cell contains a single, large duplex DNA molecule. DNA in the
chromosome is tightly associated with histone proteins in nucleosomes. Histones are highly conserved,
positively charged proteins, rich in arginine and lysine. Thus they can bind the negative charges of the
sugar-phosphate backbone of DNA to reduce electrostatic repulsion and allow tighter packing.
Histone remodeling or modification is important for the regulation of chromatin structure and the
control of gene expression. It is done by enzymatic covalent modification such as methylation, acetylation,
ADP-ribosylation, phosphorylation, glycosylation or ubiquitination.
The fiber-like, unpacked chromosome is called chromatin. Nucleosomes are the fundamental units of
chromatin. A nucleosome consists of a histone octamer, (H2A, H2B, H3, H4)₂, encircled by double-stranded
DNA (about 146 bp). Outside the nucleosome, H1 prevents unwinding of the DNA segment from the nucleosome.
Nucleosomes are joined by linker DNA of about 20 to 90 bp.
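The packing figures above lend themselves to a quick back-of-the-envelope calculation. The sketch below estimates how many nucleosomes fit on a stretch of DNA, assuming every repeat is one 146 bp core plus one linker of 20 to 90 bp. The function name and the example length are hypothetical, and real chromatin is less regular than this.

```python
CORE_BP = 146                # DNA wrapped around one histone octamer (from the text)
LINKER_RANGE_BP = (20, 90)   # linker DNA length range (from the text)


def nucleosome_count_range(dna_bp: int) -> tuple:
    """Return (fewest, most) nucleosome repeats that fit on dna_bp of DNA."""
    shortest_linker, longest_linker = LINKER_RANGE_BP
    fewest = dna_bp // (CORE_BP + longest_linker)   # long linkers, fewer repeats
    most = dna_bp // (CORE_BP + shortest_linker)    # short linkers, more repeats
    return fewest, most


# Hypothetical 1660-bp fragment: between 7 and 10 nucleosome repeats.
low, high = nucleosome_count_range(1660)
print(low, high)
```

The same function can be applied to a whole chromosome by passing its length in base pairs.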
The nucleosome cores organize into a structure called the 30 nm fiber. At the next level, the 30 nm fiber
is folded into loops. The loops are bound to a protein scaffold consisting of H1 histone and several
non-histone proteins, Sc1 (a topoisomerase II) and Sc2. The fibers are supercoiled into chromatin, which
then forms the compact chromosome. Because of the nucleosomes, chromatin has a beads-on-a-string appearance
under the electron microscope. There are 2 types of chromatin: transcriptionally inactive heterochromatin
and transcriptionally active euchromatin. Under the electron microscope, heterochromatin is densely packed
and euchromatin is seen as loose strands.
The important proteins in the nucleus are histones and nucleoplasmin. Nucleoplasmin is an anionic
pentameric protein. It binds reversibly to histones but does not bind to DNA or chromatin. It is important
for gene expression and replication.
Cell Cycle
Cell cycle is the orderly sequence of events by which a cell duplicates its chromosomes and other cell
contents followed by division of a cell into two genetically identical daughter cells.
Four sequential phases
1. G1 phase (Gap 1) – period of cell growth and differentiation prior to replication
2. S phase (synthesis) – DNA synthesis occurs, causing duplication of chromosomes
3. G2 phase (Gap 2) – period after replication and preparatory phase before cell division
4. M phase (mitosis) – chromosome segregation and cell division occur. It comprises two major events:
nuclear division or mitosis, during which the duplicated chromosomes are distributed equally and exactly
to the daughter cells, and cytoplasmic division or cytokinesis.
An additional phase is the G0 phase, in which the cell is in a quiescent state. G1, S and G2 together are
called interphase, and gene expression occurs throughout. The two gap phases allow time for cell growth and
provide checkpoints (the G1 and G2 checkpoints).
To re-enter the cell cycle, growth factors are needed. If external conditions are unfavorable, cells delay
progress through G1 and may even enter G0.
The length of the cell cycle varies among different types of cells. In the human body, many cells divide
frequently, e.g., hair follicle and skin cells. The precursors of RBCs divide a number of times. Fibroblasts
and epithelial cells may spend very little or no time in G0, whereas adult liver cells, brain cells and
myocytes spend most of their time in G0. Early cleavage divisions in embryonic cells are rapid: G1 and G2
are completely omitted, and the cells cycle rapidly between M and S phases.
Overview of cell proliferation and growth
Cells of a multicellular organism have to receive positive signals in order to grow and divide. Growth
factors bind to their specific cell surface receptors and initiate selective signaling cascades.
The important checkpoints occur at 3 stages: at the G1-S transition, during S phase and at the G2-M
boundary. The G1 checkpoint is more complex and is under strict control. Four types of cyclins and 5 types
of cyclin-dependent kinases (CDKs) regulate the cell cycle transition points.
Mechanism of Cell cycle progress
All eukaryotic cells have gene products (proteins) that govern the transition from one phase of the cell
cycle to another. Growth factor stimulation induces genes that produce proteins, e.g., cyclins, for
cell cycle progression. Cyclin concentrations rise and fall at specific times, and cyclins are abruptly
destroyed during mitosis.
Cyclins activate specific cyclin-dependent protein kinases (CDKs), which in turn phosphorylate many
proteins needed for progression through the cell cycle. When a cell is exposed to a mitogen in G1 phase, the
cyclin D level rises. By activating CDK4 and CDK6, cyclin D induces the synthesis of cyclin E. Cyclin E and
CDK2 allow the cell to pass the G1 checkpoint and initiate DNA synthesis in early S phase. Cyclin A, CDK1
and CDK2 bring the cell through S phase and remain active through G2 phase. Cyclin B and CDK1 drive the
transition from G2 to M phase.
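The cyclin and CDK partnerships described above can be held in a small lookup table, which makes it easy to ask which cyclins act through a given kinase. This is only a study aid: the stage labels and function names are informal and invented for this sketch.

```python
# Cyclin-CDK pairs in the order the text introduces them.
# The stage labels are informal descriptions, not standard identifiers.
CELL_CYCLE_DRIVERS = [
    ("G1, after mitogen exposure", "cyclin D", ("CDK4", "CDK6")),
    ("G1 checkpoint and start of S", "cyclin E", ("CDK2",)),
    ("S phase through G2", "cyclin A", ("CDK1", "CDK2")),
    ("G2 to M transition", "cyclin B", ("CDK1",)),
]


def partners_of(cyclin: str) -> tuple:
    """Return the CDKs that act together with the given cyclin."""
    for _, name, cdks in CELL_CYCLE_DRIVERS:
        if name == cyclin:
            return cdks
    raise KeyError(cyclin)


def cyclins_for(cdk: str) -> list:
    """Return every cyclin that partners with the given CDK, in cell-cycle order."""
    return [name for _, name, cdks in CELL_CYCLE_DRIVERS if cdk in cdks]


for stage, cyclin, cdks in CELL_CYCLE_DRIVERS:
    print(f"{stage}: {cyclin} + {' / '.join(cdks)}")
```

Keeping the table as an ordered list preserves the sequence of events, so the printed lines read as a timeline from G1 to M.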
Apoptosis
Apoptosis is the process of programmed cell death that limits the growth and proliferation of cells
in multicellular organisms. It is a physiological process involved in various developmental and
physiological processes. Cells that are unnecessary or a threat to the organism undergo apoptosis.
Apoptosis is mediated by proteolytic enzymes called caspases, which trigger cell death by cleaving specific
proteins in the cytoplasm and nucleus, resulting in
• Halting cell cycle progression
• Disabling homeostatic and repair mechanisms
• Initiating the detachment of the cell from its surrounding tissue structures
• Dismantling structural components such as the cytoskeleton
• Flagging the dying cell for phagocytosis
Dramatic morphological changes in the cells are
• Shrinkage of the cytoplasm
• DNA fragmentation
• Plasma membrane blebbing
• Formation of membrane – enclosed vesicles (apoptotic bodies) and engulfment by phagocytes
• Cell death without lysis or damage to neighboring cells
Apoptosis-mediating genes (suicide genes, oncosuppressor genes) are c-fos, p53 and Rb. Apoptosis-
protecting genes are bcl-2 and other oncogenes.
Stress and other stimuli activate certain cell surface receptors, and a cascade of activation takes place.
Cell death receptors such as the tumor necrosis factor receptor (TNF-R) and Fas (also known as CD95 or APO-1)
mediate apoptosis in a number of cell types, especially immune cells. They initiate apoptosis by directly
recruiting procaspases, resulting in a caspase activation cascade.
Caspase activation cascade
Caspases (cysteinyl aspartate-specific proteases) are proteases with cysteine in the active center. They are
synthesized as inactive procaspases and are activated one after another. Caspase 8 is the 1st one activated
(initiator) and the final one is caspase 3, the executor of death (Yama). Cytochrome c released from
mitochondria into the cytosol activates procaspase 9 to caspase 9, causing activation of the whole pathway.
This process is inhibited by B cell lymphoma 2 (bcl-2), which regulates mitochondrial integrity and
cytochrome c release. Tumor suppressor protein p53 activates apoptosis in response to DNA damage.
Biomedical importance
Inappropriate activation of the apoptosis machinery can lead to degenerative disorders, while subversion
or disruption of the apoptosis machinery can result in cancer or autoimmune disease.
Replication
Replication is the process by which each strand of the parental DNA duplex is copied precisely by base
pairing with complementary deoxyribonucleotides to form daughter DNA molecules.
Common features of replication
➢ It must be complete and carried out with high fidelity to maintain genetic stability within the organism
and species.
➢ General steps are similar in eukaryotes and prokaryotes.
➢ Sequences on DNA templates are copied specifically and accurately by complementary base pairing
rule.
➢ Polymerization of the new strand takes place from 5’ to 3’, as the polymerase joins nucleotides only by
3’,5’ phosphodiester bonds.
➢ It occurs during the S phase of cell cycle. In eukaryotes, all parts of the genome are replicated only once
during each cell cycle.
➢ Both strands serve as templates and are replicated simultaneously. Replication proceeds bidirectionally
from the replication bubble.
➢ Semi-discontinuous process: although both strands are polymerized 5’ to 3’, one strand is synthesized
continuously (leading strand) and the other discontinuously (lagging strand).
➢ Semi-conservative: one strand of the parent DNA molecule is conserved in each new double helix, paired
with a newly synthesized complementary strand.
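The semi-conservative rule in the list above can be traced with a small simulation. The sketch below tags every strand with the round in which it was made and copies each old strand by base pairing. It is a simplified model, not a description of the replication machinery: the `Strand` class and function names are invented for this example, and leading and lagging strand synthesis are not distinguished.

```python
from dataclasses import dataclass

PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


@dataclass(frozen=True)
class Strand:
    sequence: str    # always written 5' to 3'
    generation: int  # 0 = parental strand, n = made in replication round n


def copy_strand(template: Strand, generation: int) -> Strand:
    """Build the antiparallel complementary strand by base pairing."""
    new_sequence = "".join(PAIR[base] for base in reversed(template.sequence))
    return Strand(new_sequence, generation)


def replicate(duplex: tuple, generation: int) -> list:
    """Each old strand templates one new strand, giving two daughter duplexes."""
    return [(old, copy_strand(old, generation)) for old in duplex]


top = Strand("ATGCCGTT", 0)           # hypothetical sequence
parent = (top, copy_strand(top, 0))   # parental duplex, both strands generation 0

round_1 = replicate(parent, 1)
round_2 = [d for duplex in round_1 for d in replicate(duplex, 2)]
with_parental = [d for d in round_2 if any(s.generation == 0 for s in d)]
print(len(round_2), len(with_parental))
```

After one round every daughter duplex holds one parental and one new strand. After two rounds only two of the four duplexes still contain a parental strand, which is the pattern expected for semi-conservative copying.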
Biomedical importance
1. Inhibitors of DNA gyrase are potent antibiotics.
E.g., Quinolone and fluoroquinolone; Nalidixic acid, Ciprofloxacin, Norfloxacin
2. Etoposide and doxorubicin are used in cancer treatment. They are inhibitors of topoisomerase II.
3. DNA intercalators (molecules that insert into DNA) prevent unwinding of DNA and are also used against
cancer. E.g., doxorubicin
4. Nucleotide analogs that inhibit deoxyribonucleotide polymerization are generally anti-cancer or
anti-viral agents.
5. In somatic cells, level of telomerase is turned down. Stem cells retain full telomerase activity. Cancer cells
have continued presence of telomerase which is a potential target of new anticancer drugs.
Telomere replication
Telomeres are the repetitive sequences (TTAGGG in humans) at the ends of chromosomal DNA.
During replication, DNA synthesis at the chromosome end is restricted because there is no place to produce
the RNA primer needed to start the last Okazaki fragment on the lagging strand. The new strand therefore
becomes shorter at its 5’ end with each round of replication, and genes would eventually be lost.
The enzyme telomerase is a reverse transcriptase with an internal RNA template. After extension of the
3’ end of the parental DNA strand by telomerase, replication of the lagging strand at the chromosome end can
be completed by the conventional DNA polymerase.
Telomerase level is turned down in somatic cells. Stem cells retain full telomerase activity. Cancer
cells have continued presence of telomerase, so chromosome length equilibrium is maintained, leading to
continued cell division. Telomerase is a potential target for newer anticancer drugs.
Telomere length and aging in humans
• In culture, human cells divide for only 20-70 cell generations before senescence and death occur. A
correlation is observed between telomere length and the number of cell divisions, and also aging.
• In progerias, inherited diseases characterized by premature aging, somatic cells have short telomeres
and exhibit decreased proliferative capacity in culture.
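The link between telomere shortening and a limited number of divisions can be shown with a toy calculation. All numbers in the sketch below (starting length, loss per division, critical length) are hypothetical values chosen for easy arithmetic, not measurements, and the model assumes that active telomerase fully restores what is lost at each division.

```python
def divisions_until_critical(telomere_bp: int, loss_per_division_bp: int,
                             critical_bp: int, telomerase_active: bool = False):
    """Count divisions until the telomere shrinks to the critical length.

    Returns None when telomerase is active, because in this toy model the
    enzyme restores the lost repeats after every division.
    """
    if loss_per_division_bp <= 0:
        raise ValueError("loss_per_division_bp must be positive")
    if telomerase_active:
        return None
    divisions = 0
    while telomere_bp > critical_bp:
        telomere_bp -= loss_per_division_bp
        divisions += 1
    return divisions


# Hypothetical numbers chosen only to make the arithmetic easy to follow.
somatic = divisions_until_critical(10_000, 100, 5_000)
cancer = divisions_until_critical(10_000, 100, 5_000, telomerase_active=True)
print(somatic, cancer)
```

With these made-up inputs the somatic cell stops after 50 divisions, which happens to fall inside the 20-70 range quoted above, while the telomerase-positive cell has no limit in this model.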
Reverse transcription
Reverse transcription is the RNA-directed synthesis of DNA, catalyzed by reverse transcriptase. The genetic
material of some viruses is RNA. Retroviruses, e.g., HIV, contain a virally encoded reverse transcriptase.
The enzyme first synthesizes double-stranded DNA from its RNA template. In many cases, the resulting dsDNA
is integrated into the host genome, from which the viral RNA genome and mRNAs are expressed.
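The two copying steps described above (RNA to a first DNA strand, then a second DNA strand to give dsDNA) follow the usual base-pairing rules and can be written out as a short sketch. The function names and the six-base RNA are invented for illustration, and the enzyme's priming and RNA-degrading steps are left out.

```python
# Base pairing used when DNA is copied from an RNA template, and DNA from DNA.
RNA_TO_DNA = {"A": "T", "U": "A", "G": "C", "C": "G"}
DNA_TO_DNA = {"A": "T", "T": "A", "G": "C", "C": "G"}


def first_strand_cdna(rna_5to3: str) -> str:
    """DNA strand complementary to the RNA template, written 5' to 3'."""
    return "".join(RNA_TO_DNA[base] for base in reversed(rna_5to3))


def second_strand(dna_5to3: str) -> str:
    """DNA strand complementary to the first DNA strand, written 5' to 3'."""
    return "".join(DNA_TO_DNA[base] for base in reversed(dna_5to3))


viral_rna = "AUGGCU"                      # made-up sequence
minus_strand = first_strand_cdna(viral_rna)
plus_strand = second_strand(minus_strand)
print(minus_strand, plus_strand)
```

The second DNA strand carries the same sequence as the original RNA with T in place of U, so the dsDNA copy preserves the viral information.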
Importance of retrovirus
• Integration of dsDNA copy of retrovirus into the chromosome of the infected cell can transform the cells
into cancerous cells.
• HIV retrovirus causes acquired immunodeficiency syndrome (AIDS).
• Retrovirus can be used in gene therapy.
Clinical Importance
1. Several important antiviral drugs are nucleoside analogs. They inhibit reverse transcriptase activity.
e.g., 3’-azido-2’,3’-dideoxythymidine (AZT), 2’,3’-dideoxycytidine (ddC) and dideoxyinosine (ddI) in AIDS
2. Reverse transcriptase can be used to make dsDNA copies from various RNAs in genetic engineering.
Genomic stability
A typical mammalian cell accumulates many thousands of lesions during a 24-hour period. As a result
of DNA repair, fewer than 1 in 1000 becomes a mutation. Genomic stability is important for health of the
individual and for maintenance of the species.
DNA is more stable than RNA but has limited chemical stability. Spontaneous damage of DNA is the
major factor in mutagenesis and ageing.
Cause of DNA damage
1. Spontaneous damage or error during replication is a major factor in mutation and ageing.
e.g. Hydrolysis, oxidation, Non-enzymatic methylation
2. Physical agents such as UV rays and radiation
3. Chemical agents; dyes, drugs, heavy metal, petroleum products
o Aflatoxin, produced by mold on peanuts, undergoes epoxidation by cytochrome P450 and causes base alteration.
o Benzopyrene (in cigarette smoke) causes base-pair alteration.
4. Biological agents: viral infections and fungi
o Base analogs and viral infection change the DNA sequence.
Types of damage
1. Single base alterations due to
• Depurination – purine N-glycosidic bonds are especially labile.
• Deamination of cytosine to uracil, adenine to hypoxanthine, guanine to xanthine
• Insertion or deletion of single nucleotide
• Alkylation
• Base analog incorporation
2. Two base alteration due to
• UV light induced thymine-thymine dimer
• Alkylating agent cross linkage
3. Chain breaks which may be caused by
• Single-stranded breaks by Ionizing radiation, radioactive substance, free radicals
• Double-stranded breaks by Ionizing radiation, some chemotherapeutic agent
4. Cross linkage
• between bases in same or opposite strands
• between DNA & protein molecules
Biomedical importance
Defect in repair system of DNA increases frequency of mutation and can cause cancer.
1. Skin fibroblasts from patients with xeroderma pigmentosum have a defect in the excinuclease of
nucleotide excision repair (NER).
2. Hereditary nonpolyposis colorectal cancer (Lynch syndrome) results from mutation of genes for proteins
involved in the mismatch repair system.
3. Ataxia-telangiectasia is due to defective double-strand break repair.
• A gene is defined as a segment of DNA (or, in a few cases, RNA) that encodes the information required
to produce a functional biological product: a protein or one of several classes of RNA molecules.
• Genes are located on the chromosomes. The site of a gene on a chromosome is called its locus. An
alternative form of a gene is called an allele. The direction of a gene is written 5’ to 3’.
• The main function of a gene is to express the genetic information that it carries. Gene expression
includes transcription (formation of RNA from DNA) and translation (formation of a protein from RNA).
Gene expression is tissue-specific and time-specific in nature, e.g., insulin, and embryonic and fetal
hemoglobin.
• According to the gene expression pattern, there are two types of genes. An inducible gene responds to
regulatory signals. A constitutive gene is expressed at a constant rate and does not respond to
regulation. The protein products of constitutive genes are always necessary for cellular metabolism, and
such genes are called housekeeping genes.
A functional eukaryotic gene contains two large regions.
1. Structural region
It contains alternating (a) exons (coding segments) and (b) introns (non-coding segments). Exons contain
the information for protein synthesis. Introns are important for the structure and regulation of the gene.
They are removed from the precursor RNA before it is transported into the cytoplasm.
2. Regulatory region consists of two classes of elements.
(a) Basal expression region
It is essential for gene expression under basal conditions. It is usually located at the 5’ end and is
cis-acting, as its elements lie close to the gene.
• Proximal component or TATA box, located about 25 to 30 bp upstream (-25 to -30) of the transcription
start site. It binds and directs RNA polymerase to the starting point of transcription. In mammals, the
exact sequence of the TATA box is slightly different and it is known as the Goldberg-Hogness box.
• Upstream element or CAAT box, located at about -70 to -80 bp. It specifies the frequency of
initiation.
(b) Regulated expression region is located in a variety of places. It regulates the rate of gene expression.
These regions respond to various signals such as hormones, metals and chemicals.
• Enhancer elements increase the rate of gene expression. Silencer elements decrease the rate of gene
expression.
• Other regulatory elements, e.g., hormone response element (HRE)
• Genes do not function autonomously. Protein-DNA interactions at the regulatory region, e.g., binding of
transcription factors, regulate gene expression. DNA-binding proteins have 4 structural motifs, namely
helix-turn-helix, helix-loop-helix, zinc finger and leucine zipper.
• Cis-acting elements – DNA sequences in the vicinity of the structural portion of a gene that
are required for gene expression.
• Trans-acting factors – factors, usually considered to be proteins, that bind to the cis-acting
sequences to control gene expression.
Gene Expression
Gene expression is the process by which genetic information, carried as the DNA sequence of an individual
gene, is converted into an individual polypeptide or protein. It can be divided into two major parts:
transcription and translation.
Transcription is a process by which the information contained in DNA (a gene) is copied by base pairing, to
form a complementary sequence of ribonucleotides, the RNA chain.
Template strand or anti-sense strand – the strand (3’ to 5’ direction) that is transcribed into an RNA molecule.
Coding strand or sense strand – the other strand (5’ to 3’ direction)
Characteristics of transcription
1. General steps are similar in eukaryotes and prokaryotes, except that transcription takes place in the
nucleus in eukaryotes and in the cytoplasm in prokaryotes. Nearly all eukaryotic mRNA precursors undergo
processing.
2. In eukaryotes, it occurs in transcriptionally active regions of the chromosome (euchromatin).
3. RNA is transcribed from the template DNA strand (read in the 3’ to 5’ direction). Polymerization of the
RNA strand takes place in the 5' to 3' direction.
4. RNA has the same sequence as the coding strand, except that U replaces T.
5. The base pairing rule is always maintained.
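Points 3 and 4 above can be verified directly: copying the template strand by base pairing yields an RNA that matches the coding strand with U in place of T. The sketch below uses a made-up nine-base gene fragment, and the `transcribe` function is a hypothetical helper, not a model of RNA polymerase.

```python
# Pairing between a DNA template base and the ribonucleotide added opposite it.
DNA_TO_RNA = {"A": "U", "T": "A", "G": "C", "C": "G"}


def transcribe(template_3to5: str) -> str:
    """Read the template 3' to 5' and return the RNA, which grows 5' to 3'."""
    return "".join(DNA_TO_RNA[base] for base in template_3to5)


coding_5to3 = "ATGGCCTAA"     # coding (sense) strand, hypothetical
template_3to5 = "TACCGGATT"   # template (antisense) strand, written 3' to 5'

rna = transcribe(template_3to5)
print(rna)
print(rna == coding_5to3.replace("T", "U"))
```

Because the template is written 3' to 5' here, each base lines up with the RNA base made from it, and no reversal step is needed.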
In eukaryotes, transcription takes place in the nucleus. It requires template DNA, RNA polymerase,
ribonucleoside triphosphates and many transcription factors and co-activator proteins. Splicing occurs
in nearly all RNA precursors.
Eukaryotes have 3 types of RNA polymerase. They differ in template specificity, localization and
susceptibility to inhibitors.
RNA polymerase I located in nucleoli transcribes the genes for 18s, 5.8s, and 28s rRNA.
RNA polymerase II located in nucleoplasm synthesizes the mRNA and snRNA.
RNA polymerase III located in nucleoplasm synthesizes tRNA and 5S rRNA.
In eukaryotes, positioning RNA polymerase at the transcription start site for initiation of transcription
requires the interaction of transcription factors, trans-activator proteins and co-regulator proteins with
the enhancers and other cis-acting sequence elements.
In initiation of transcription, binding of the TATA box binding protein (TBP) to TBP-associated
factors (TAFs) forms TFIID. Binding of the TFIID complex to the TATA box is the 1st step in the
transcription process. Other proteins associated with transcription initiation are TFIIA, B, E, F, H and
RNA polymerase II.
The TATA box, located at about -25 bp from the transcription start site, directs RNA polymerase to the
transcription start site. An additional element, the CAAT box, specifies the rate of transcription, and some
promoters contain a GC box. Transcription is further stimulated by enhancer elements, located either
upstream or downstream of the gene.
RNA chains are synthesized from 5’ to 3’ direction. Four different types of nucleoside triphosphate:
ATP, GTP, CTP and UTP are used as substrates. The mechanism of information transfer is according to
complementary base pairing rule. The 1st and 2nd nucleotides attach to the initiation site and RNA polymerase
catalyzes the formation of the 1st phosphodiester bond.
After formation of 1st phosphodiester bond, elongation starts at transcription bubbles – the region
containing RNA polymerase, DNA and nascent RNA that moves along the DNA template. RNA Polymerase
can perform de novo synthesis of RNA chain and primer formation is not required in transcription. It does not
have nuclease activity and cannot do proof reading.
The signals for termination of transcription by eukaryotic RNA polymerase II are poorly understood.
Formation of phosphodiester bonds ceases when the termination signal is reached. The RNA-DNA hybrid
dissociates and the melted region of the DNA strands rewinds. RNA polymerase is then released from the
template.
RNA processing of primary transcript occurs in the nucleus and forms mature RNA. The process
includes removal of extra nucleotides, base modification, addition of nucleotides and separation of different
RNA sequences by the action of specific nucleases.
or lariat structure that is linked to 3’ splice site. A second cut is made at the junction of intron with 3’ exon
and the lariat structure containing the intron is released and hydrolyzed. The 5’ and 3’ exons are ligated to
form a continuous sequence.
Transcription in prokaryotes
In prokaryotes, both transcription and translation take place in the cytoplasm. There is a single form of
RNA polymerase (RNAP), although different sigma factors may be involved in initiation at different genes.
In E. coli, RNAP is a multi-subunit enzyme (α2ββ’σ).
Subunit Role
α binds the regulatory sequences
β forms phosphodiester bonds
β’ binds the DNA template
σ recognizes promoter and initiates transcription
Identification of the transcription start site is important to obtain the desired mRNA. Many prokaryotic
promoters have 2 conserved regions, located about 10 and 35 nucleotides upstream of the transcription start
site (the -10 and -35 regions). In prokaryotes, transcription factors are not needed, and the sigma subunit
of RNA polymerase recognizes the promoter sites. The promoter thus recruits RNA polymerase to the
transcription start site.
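Locating the two conserved promoter regions in a sequence can be sketched as a simple motif scan. The consensus sequences used below (TTGACA for the -35 region and TATAAT for the -10 region) are the commonly quoted E. coli values and are not given in the text above. The test sequence, its 17-base spacer and the function names are all hypothetical.

```python
MINUS_35 = "TTGACA"  # commonly quoted E. coli -35 consensus (assumption, not from the text)
MINUS_10 = "TATAAT"  # commonly quoted E. coli -10 consensus (assumption, not from the text)


def mismatches(window: str, motif: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    return sum(a != b for a, b in zip(window, motif))


def best_match(sequence: str, motif: str) -> tuple:
    """Return (start_index, mismatch_count) of the best-matching window.

    When several windows tie, the leftmost one is returned.
    """
    starts = range(len(sequence) - len(motif) + 1)
    scored = ((i, mismatches(sequence[i:i + len(motif)], motif)) for i in starts)
    return min(scored, key=lambda pair: pair[1])


# Made-up promoter: 2 filler bases, -35 box, 17-base spacer, -10 box, filler.
promoter = "GG" + MINUS_35 + "CG" * 8 + "C" + MINUS_10 + "GCCGCA"

print(best_match(promoter, MINUS_35))
print(best_match(promoter, MINUS_10))
```

Real promoters rarely match the consensus exactly, which is why the scan reports a mismatch count instead of requiring an exact hit.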
RNA Polymerase can perform de novo synthesis of RNA chain and primer formation is not required
in transcription. It does not have nuclease activity and cannot do proof reading.
RNA polymerase has an intrinsic unwindase activity that opens the DNA helix. A purine nucleotide is
usually the 1st to be polymerized into the RNA molecule. RNA chains are synthesized from 5’ to 3’ direction.
Four different types of nucleoside triphosphate: ATP, GTP, CTP and UTP are used as substrates. The
mechanism of information transfer is according to complementary base pairing rule.
After formation of 1st phosphodiester bond, elongation starts at transcription bubbles – the region
containing RNA polymerase, DNA and nascent RNA that moves along the DNA template. The superhelical
tension in DNA due to unwinding is controlled by the activity of topoisomerase I and II.
Termination of RNA synthesis is signaled by a sequence in the template strand of the DNA molecule.
Termination is either Rho-dependent or Rho-independent.
Rho-dependent termination requires the Rho protein, an ATP-dependent helicase. It unwinds the DNA-RNA
duplex, which dissociates RNA polymerase from the template and stops transcription.
The transcribed region of the DNA template contains stop signals. In Rho-independent termination, a
hairpin loop followed by several U residues leads to termination of transcription. The DNA-RNA hybrid is
unstable because A:U pairs are the least stable base pairs. The nascent RNA dissociates from the DNA
template and then from the enzyme.
Human genome
The total DNA content of a cell is the genome. The genetic information is stored in the base sequence of
DNA. It is specific for each species and almost the same for all members of the species, but unique for each
individual. Cellular DNA contains genes and intergenic regions, both of which serve important functions in
the cell.
The human genome contains 20,000 to 25,000 different protein-coding genes spread over 23 pairs of
chromosomes. Only about 2% of the human genome codes for proteins and functional RNA.
Most of the DNA does not carry critical information (junk DNA), and much of the mammalian genome is
redundant. However, these sequences regulate the expression of genes during development, differentiation
and adaptation to the environment.
RNA molecules are found in the nucleus, cytoplasm and mitochondria. They are important in the production of
proteins in living organisms. RNA forms the genetic material in some viruses.
Different types
1. Messenger RNA (mRNA)
2. Transfer RNA (tRNA)
3. Ribosomal RNA (rRNA)
4. Small nuclear RNA (snRNA) (in eukaryotes)
5. Small cytoplasmic RNA (scRNA) (in eukaryotes)
mRNA (5% of total RNA, 0.5-6+kb)
Structure
It is oriented in the 5’ to 3’ direction. The 5' end is capped by 7-methylguanosine triphosphate to prevent
attack by 5' exonucleases. The 3’ end carries a poly A tail (20 to 250 adenylate residues) to prevent
attack by 3' exonucleases. At the 5’ and 3’ ends, there are base-paired loops known as untranslated regions.
They play an essential role in the regulation of gene expression.
Function
• It serves as a messenger, conveying the genetic information from the nucleus to the protein-synthesizing
machinery.
• It also serves as a template for polymerization of amino acids to protein.
The mammalian ribosome contains 40S and 60S subunits. The 60S subunit contains 5S rRNA, 5.8S rRNA and
28S rRNA. The 40S subunit contains 18S rRNA. In prokaryotes, the 70S ribosome contains 30S (16S rRNA) and
50S (23S and 5S rRNA) subunits combined with proteins.
Function
• It serves as site for protein synthesis.
• 28S rRNA of 60S subunit contains peptidyl transferase activity and is a ribozyme.
Small non-coding RNAs such as miRNA and siRNA typically inhibit gene expression by hybridizing with
targeted mRNA.
DNA versus RNA
Strand
  DNA: Double helix. Polymer of deoxyribonucleotides linked by 3',5' phosphodiester bonds
  RNA: Single strand. Polymer of ribonucleotides linked by 3',5' phosphodiester bonds
Nucleotide
  DNA: Deoxyribose, purine (adenine, guanine), pyrimidine (thymine, cytosine) and phosphate
  RNA: Ribose, purine (adenine, guanine), pyrimidine (uracil, cytosine) and phosphate
Purine & pyrimidine content
  DNA: Equal (G + A = T + C) because the 2 strands are held together by complementary base pairing
  RNA: Not equal (G + A need not equal U + C), but a single strand of RNA can fold on itself like a hairpin
Hydrolysis by alkali
  DNA: Cannot be hydrolyzed to 2’,3’ cyclic diesters of mononucleotides, due to absence of the 2’-OH group
  RNA: Hydrolyzed to 2’,3’ cyclic diesters of mononucleotides
Functions
  DNA: Template for replication and transcription
  RNA: mRNA, tRNA and rRNA are involved in protein synthesis
DNA → (transcription) → RNA → (translation) → Protein
RNA → (reverse transcription) → DNA
Genetic information in most living organisms is stored in the base sequence of DNA (in retroviruses, it is
found in RNA). DNA is packed into structures called chromosomes.
• Gene expression definition - Transcription and Translation
• Transcription definition
• Translation definition
• Replication definition – it occurs before cell division
• Reverse transcription – The genetic information stored in retrovirus is copied to DNA for new viral
generation
25 2022 Molecular Biology
Response element
A response element is a nucleotide sequence that allows a gene to respond to specific stimuli. Response
elements are often part of promoters or enhancers. A single gene may possess a number of different response
elements. Multiple genes may possess the same response element with the same function.
Transcriptional factors
They are proteins that recognize promoters, enhancers and response elements. Many transcription
factors act positively and promote transcription, while others act negatively and promote gene silencing.
Mitochondrial DNA
AUG & AUA = Methionine
UGA = Tryptophan
AGA & AGG = Termination codon
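The mitochondrial reassignments can be viewed as a small set of overrides on the standard code. The sketch below is a minimal illustration: the standard meanings shown (AUA = Ile, UGA = stop, AGA and AGG = Arg) are those of the universal code, and the dictionary and function names are hypothetical.

```python
# Standard (nuclear) meanings of the codons that differ, plus AUG for reference.
STANDARD = {"AUG": "Met", "AUA": "Ile", "UGA": "Stop", "AGA": "Arg", "AGG": "Arg"}
# Mitochondrial reassignments as listed in the notes.
MITOCHONDRIAL = {"AUA": "Met", "UGA": "Trp", "AGA": "Stop", "AGG": "Stop"}

def codon_meaning(codon, mitochondrial=False):
    """Look up a codon, applying the mitochondrial overrides when requested."""
    if mitochondrial and codon in MITOCHONDRIAL:
        return MITOCHONDRIAL[codon]
    return STANDARD[codon]

for codon in STANDARD:
    print(codon, codon_meaning(codon), codon_meaning(codon, mitochondrial=True))
```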
Genetic Code
Nirenberg was awarded the Nobel Prize in 1968 for deciphering the genetic code. The letters A, G, T
and C correspond to the nucleotides found in DNA. Within the protein coding genes, these nucleotides are
organized into three-letter code words called codons. The collection of these codons makes up the genetic
code.
When the four nucleotides are combined three at a time, 64 (4 x 4 x 4) possible codons result. Three
codons, UAA, UAG and UGA, do not specify any amino acid and act as stop codons or non-sense codons. The
codon AUG, which codes for methionine, serves as the start codon for protein synthesis.
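The codon count and the roles of the start and stop codons can be checked with a short Python sketch. It builds the standard codon table from the usual one-letter amino acid string in UCAG order; the example mRNA and the function name `translate` are made up for illustration.

```python
from itertools import product

BASES = "UCAG"
# One-letter amino acids for the 64 codons in UCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

STOP_CODONS = sorted(c for c, aa in CODON_TABLE.items() if aa == "*")

def translate(mrna):
    """Translate from the first AUG to the first in-frame stop codon, reading 5'->3'."""
    start = mrna.find("AUG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

print(len(CODON_TABLE), STOP_CODONS)      # 64 codons, three of them stops
print(translate("GGAUGGCUUGGUAAGC"))      # reads AUG GCU UGG, then stops at UAA
```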
Biomedical importance
The understanding of genetic code provides the foundation for explanation of protein biosynthesis,
mutation and diagnosis and treatment of genetic diseases.
Translation is a complex process by which the information that has been transcribed from DNA to
mRNA directs the ordered polymerization of specific amino acids for the synthesis of proteins. It occurs in
the cytoplasm. Ribosomes serve as the sites of protein synthesis.
mRNA is translated from the 5’ end to the 3’ end. The protein is synthesized starting from the amino terminal
and ending at the carboxy terminal. Translation is generally divided into four steps: formation of aminoacyl
tRNA, initiation, elongation and termination.
2. Initiation
Initiation involves several protein-RNA interaction complexes in the ribosome. It involves tRNA, rRNA,
mRNA and at least 10 eukaryotic initiation factors (eIFs). It can be divided into 4 steps.
a. dissociation of the ribosome into 40S and 60S subunits
b. binding of a ternary complex (the initiator methionyl tRNA, GTP and eIF-2) with the 40S ribosome to form
the 43S pre-initiation complex. In prokaryotes, the initiator tRNA carries formyl-methionine.
c. binding of mRNA to the 43S pre-initiation complex to form the 48S initiation complex
d. combination with the 60S ribosomal subunit to form the 80S initiation complex (80S/Met-tRNA/mRNA)
At the end of initiation, three sites are present on the ribosome: the aminoacyl tRNA binding site (A-site),
the peptidyl tRNA binding site (P-site) and the exit site (E-site). Met-tRNA binds to the P-site and the A-site is free.
2. Elongation
Elongation involves several steps catalyzed by elongation factors (EFs).
a. binding of aminoacyl-tRNA to the A site, assisted by EF and GTP
b. peptide bond formation between the amino acids occupying the A site and the P site, catalyzed by peptidyl
transferase (ribozyme)
c. eEF-2 (translocase) helps to move the ribosome along the mRNA in the 5' to 3' direction by hydrolysis of GTP
d. Thus, the tRNA-peptide chain moves to the P site. The uncharged (free) tRNA originally in the P site moves to
the E site.
e. The whole process recycles for addition of the next amino acid. For each peptide bond formation, 4 high
energy phosphate bonds are used.
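The figure of four high-energy phosphate bonds per peptide bond can be broken down and scaled to a whole protein. The breakdown below follows the usual textbook accounting and is an assumption of this sketch, as are the names used; initiation and termination costs are not included.

```python
# Per-residue cost in high-energy phosphate bonds (textbook accounting, assumed here).
COST_PER_RESIDUE = {
    "aminoacyl-tRNA formation (ATP -> AMP + PPi)": 2,
    "aminoacyl-tRNA binding to the A site (GTP)": 1,
    "translocation by eEF-2 (GTP)": 1,
}

def elongation_cost(n_residues):
    """High-energy phosphate bonds spent adding n_residues amino acids."""
    return n_residues * sum(COST_PER_RESIDUE.values())

print(elongation_cost(1))     # cost of one amino acid added
print(elongation_cost(300))   # cost of elongating a 300-residue chain
```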
3. Termination
After multiple cycles of elongation, polymerization of amino acids to form protein is terminated by the
appearance of a stop codon at the A site. Normally there is no tRNA with an anti-codon capable of recognizing a
termination codon. However, release factors (eRFs) recognize the stop codons (UAA, UAG & UGA) on mRNA. RFs
together with peptidyl transferase cause hydrolysis of the bond between the peptide and tRNA occupying the
P site. The newly synthesized protein, ribosomal subunits, tRNA and mRNA are dissociated from each other.
4. Posttranslational Modification of Protein
Most proteins require post-translational modification to attain their biologically active form.
• Proteolytic cleavage for conversion of preproprotein or proprotein to active protein (preproinsulin)
• Enzymatic glycosylation – carbohydrates are attached to serine or threonine residues e.g., in hormone
receptors, Ig
• Hydroxylation of amino acid residues e.g. lysine to hydroxylysine in collagen
• Gamma carboxylation e.g. glutamic acid residues in prothrombin with vitamin K as cofactor
• Covalent modification: acetylation, phosphorylation, methylation, ubiquitylation
Diphtheria toxin catalyzes the ADP-ribosylation of eEF-2 and inhibits mammalian protein synthesis.
Poliovirus and other picornaviruses can synthesize their own proteins but inhibit host protein synthesis
by disrupting the function of the eIF4F complex.
[eIF4F is a combination of eIF4E, eIF4G, eIF4A. 4E is responsible for recognition of mRNA cap
structure. Its activity is inhibited by binding of inhibitor protein 4EBP1 preventing the formation of
eIF4F.]
3. Protein synthesis is affected by many factors.
The machinery of protein synthesis can respond to environmental threats. Insulin and many growth
factors stimulate eIF4F-cap mRNA complex formation by phosphorylating 4EBP1 and thus enhance protein
synthesis.
Protein Folding
To become functionally active, a newly synthesized protein must be non-covalently folded with the help of
chaperones (a group of specialized proteins). It is an ATP-dependent mechanism.
Polysome
Many ribosomes can translate the same mRNA molecule simultaneously. Multiple ribosomes on the same
mRNA molecule form a polysome or polyribosome.
Mitochondria have their own protein-synthesizing machinery.
Protein Targeting
Signal Hypothesis in synthesis of Export Protein
Proteins destined for cytoplasm or nucleus are translated primarily on free polyribosomes but those
for membrane and for secretion into extracellular space are translated on polyribosomes of rough endoplasmic
reticulum.
In translation of secretory or membrane proteins, shortly after the signal sequence is synthesized, it is
recognized by a signal recognition particle (SRP). The SRP-signal peptide-protein complex binds to the SRP
receptor on the ER membrane. Then SRP is released, the ribosome binds to the translocon (protein-conducting
channel) and the signal peptide inserts into the translocon. The growing polypeptide chain is then fully
translocated across the membrane, driven by ongoing protein synthesis. The signal peptide is cleaved by signal
peptidase and is degraded. The peptide is released into the lumen of the ER after completion of protein
synthesis. Protein folding and modification then occur in the ER, and further modifications occur in the Golgi
apparatus. Finally the protein is distributed to the membrane or secreted extracellularly.
The genetic content of all somatic cells of an organism is the same, but genes are not expressed in all
tissues; expression is tissue specific in nature (tissue specific gene expression). Moreover, organisms can
alter gene expression in response to a variety of changes and stimuli. Not all genes are expressed all the time.
Gene expression, including tissue specific gene expression, is influenced by genetic developmental
programs, hormones, growth factors, heavy metals, metabolic state, environmental challenges and
diseases. Dysregulation of gene expression can lead to human disease. Thus molecular understanding of these
processes can lead to the development of therapeutic agents.
2 Types of genes
1. Constitutive gene or house-keeping gene e.g. enzymes of glycolysis
2. Inducible gene – gene expression is induced or repressed according to the need of the metabolism
The major locus of control of gene expression in prokaryotes is at the transcription level. The operon
model was described by Jacob and Monod in 1961. Negative and positive control of gene expression can
be explained with the E. coli lac operon. The tryptophan and arabinose operons provide other examples.
In prokaryotes, the genes involved in a metabolic pathway are often present in a cluster
called an operon. The operon is composed of structural genes, control elements, a regulator/inhibitor gene,
an operator and a promoter area. The cluster of genes in an operon can be regulated by a single promoter or
regulatory region.
The structural region contains 3 structural genes present as a continuous segment. Adjacent to the
promoter is the lacI gene, which encodes the repressor protein and produces it constitutively. The protein
products of the structural genes of the lac operon are involved in the metabolism of lactose.
✓ Z gene codes for β galactosidase, which acts on lactose to produce glucose and galactose
✓ Y gene codes for lactose permease, which actively transports lactose and galactose into the cell
✓ A gene codes for transacetylase
When E. coli is grown in a medium containing glucose, the lac genes are repressed since utilization of
glucose is preferred. The lac genes are de-repressed only after glucose has been depleted from the medium,
and the bacterium then utilizes lactose to supply glucose as usable energy.
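The behaviour of the lac operon under different sugar conditions can be summarized as a small truth table. The sketch below is a qualitative simplification: the three output levels and the function name are assumptions for illustration, and the "low" level for glucose plus lactose reflects the usual textbook description of catabolite repression rather than a measured value.

```python
def lac_operon_expression(glucose_present, lactose_present):
    """Qualitative transcription level of the lac structural genes (Z, Y, A)."""
    if not lactose_present:
        return "off"     # repressor stays bound to the operator
    if glucose_present:
        return "low"     # repressor released, but glucose is used preferentially
    return "high"        # de-repressed and glucose depleted

for glucose in (True, False):
    for lactose in (True, False):
        print("glucose:", glucose, "lactose:", lactose,
              "->", lac_operon_expression(glucose, lactose))
```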
I. Transcriptional control
Control of gene expression in eukaryotes is primarily at the level of transcription.
1. Histone modification
It regulates chromatin structure and the accessibility of DNA to gene regulatory proteins. Reversible
acetylation of lysine residues of core histones by acetylase weakens the histone-DNA interaction
and relaxes the nucleosome. This facilitates binding of other regulatory proteins and RNA polymerase to
specific elements of DNA so that transcription can commence. Conversely, the removal of acetyl groups by
deacetylase promotes the condensation of chromosomes and inhibits transcription.
[Methylation of the base cytosine is generally associated with inactivation of gene expression. Demethylation of
the promoter or of a coding sequence of the gene is required for efficient gene expression.]
3. Gene amplification
Under certain conditions, single copy genes are amplified many fold during development or in
response to drugs. Resistance of cancer cells to anti-cancer drugs can be due to gene amplification, e.g.,
methotrexate increases the number of genes for dihydrofolate reductase.
[This regulation is seen in cell or tissue specific or at certain stages of development or under certain
conditions. For example, the product of same gene is calcitonin in parafollicular C cell and protein involved in
taste sensation in brain cells.]
2. Editing of RNA
It is the alteration of the sequence of nucleotides in the mRNA. RNA editing involves the enzyme
mediated alteration of RNA before translation. The substitution of one nucleotide for another can result in
tissue specific differences in the transcript. For example, editing of CAA (glutamine) to UAA (termination
codon) in the apolipoprotein B mRNA produces Apo B48 (2152 amino acids, 48%) in enterocytes instead of
Apo B100 (4536 amino acids) in liver.
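The effect of a single C to U edit on the length of the translated product can be shown with a toy transcript. The sequence below is invented and far shorter than the apolipoprotein B mRNA; the function names are hypothetical.

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}

def codons_before_stop(mrna):
    """Number of sense codons read from position 0 until the first in-frame stop."""
    count = 0
    for i in range(0, len(mrna) - 2, 3):
        if mrna[i:i + 3] in STOP_CODONS:
            break
        count += 1
    return count

def edit_c_to_u(mrna, position):
    """Return mrna with the C at `position` (0-based) replaced by U."""
    if mrna[position] != "C":
        raise ValueError("expected C at the edited position")
    return mrna[:position] + "U" + mrna[position + 1:]

unedited = "AUGGCUCAAGGUUGGUAA"       # toy transcript: AUG GCU CAA GGU UGG UAA
edited = edit_c_to_u(unedited, 6)     # CAA -> UAA at the third codon
print(codons_before_stop(unedited), codons_before_stop(edited))
```

The unedited transcript is read through to its normal stop codon, while the edited one terminates early, which mirrors how the same gene gives a full-length product in liver and a truncated one in enterocytes.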
3. Transport of mRNA
Mature mRNA bound to proteins is transported to the cytoplasm through nuclear pores. The 3'
UTR (untranslated region) of mRNA is important for mRNA localization in the cytoplasm.
3. Translation regulation
eIF 2 and eIF4 are the focus of this regulatory mechanism. Activity of these proteins can be controlled
by phosphorylation. Starvation and hormones control these mechanisms. Some viruses inhibit host protein
synthesis.
[E.g., In reticulocytes, globin chain synthesis is regulated at translation level. eIF2 is inactive when
phosphorylated by kinase. Heme prevents phosphorylation of eIF2 by binding with kinase (inactive). Thus
elevated level of heme favors translation of globin chain.]
Epigenetics
The literal meaning is on top of or in addition to genetics. These regulatory mechanisms do not
change the regulated DNA sequence but change the expression pattern of this DNA.
Mechanisms underlying epigenetics
Every cell in the organism carries an identical genome; however, the terminal phenotype within an
organism is not fixed, and deviation is caused by gene expression changes in response to environmental cues.
DNA methylation, histone modification and RNA-associated silencing are the major mechanisms of epigenetic
control.
Genomic instability
Genetic disorder
Gene Mutation
Any permanent change in the nucleotide sequence of a gene is called gene mutation. A single nucleotide
or more than one nucleotide of the gene can change. It may be heritable or non-heritable. It may affect the
pattern of gene expression or the function of proteins.
[Promoter mutation - Mutation can occur in a promoter or other regulatory site. The structure and function of
the proteins are intact but there is a change in the rate of protein synthesis.]
Stem cell
Stem cells are unspecialized cells found in most multicellular organisms.
Two important characteristics
1. Self- renewal- Can replenish their number for long periods through cell division
2. Potency- After receiving certain chemical signals, can differentiate or transform into specialized cells
• A key ethical concept is the moral status of the embryo. The right of a fetus at any particular stage is
balanced against the potentially large benefits that others may gain from research and ultimately, stem
cell-based treatments.
• Use of the pre-14-day embryo, which is still little more than a ball of cells, remains justified.
Control of cell growth is achieved by the complex fine-tuning of many proteins that regulate cell proliferation.
They are divided into 4 groups:
a. Growth factors
b. Growth factor receptors
c. Intracellular signal transducers including nuclear receptor, Cell cycle control proteins
d. DNA repair proteins
Normal cell growth and differentiation is carried out by the coordinated action of proliferating genes or
proto-oncogenes and tumor suppressor genes (differentiation and growth inhibition). Proto-oncogenes encode
various proteins that are involved in normal growth and division of cells e.g., growth factors, Ras, MAPK,
cyclins, DNA binding proteins. Tumor suppressor genes encode proteins that normally suppress cell growth
e.g., retinoblastoma protein, p53. After a loss of function mutation, tumor suppressor genes can no longer
inhibit abnormal cell growth. Gain of function mutation of proto-oncogenes enhances cell proliferation.
Oncogenes
Oncogenes are mutated proliferating genes derived from proto-oncogenes. They were first recognized in
viruses. Their products, oncoproteins, cause aberrant gene regulation, resulting in gain of function or
inappropriate regulation of normal cell growth and leading to cellular transformation.
Mechanism of oncogene
Three general mechanisms by which products of oncogenes (oncoproteins) stimulate growth and division of
cells
1. May imitate the action of growth factor
2. May become an occupied receptor for growth factor
3. May act at key intracellular points involved in growth control e.g., src acting as a protein tyrosine kinase,
myc acting as a DNA binding protein
Tumor markers
Many cancers are associated with the abnormal production of molecules: enzymes, hormones or proteins.
They are known as tumor markers and can be measured in plasma or serum. A tumor marker is a biological
substance which can be produced directly by the tumor, or by non-tumor cells as a response to the presence of
the tumor.
Recombinant or chimeric or hybrid DNA is an altered DNA due to the insertion of a sequence of
deoxyribonucleotide, not previously present, into existing molecules of DNA, by enzymatic or chemical
means.
Genetic recombination
Genetic recombination is the process whereby new linkage relationships are established between genes.
1. General recombination – exchange between homologous chromosome in meiosis
2. Site-specific recombination - Integration of viral DNA into host genomes
Restriction map is the diagrammatic representation of DNA molecule indicating the site of cleavage by
various restriction enzymes.
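A restriction map can be derived computationally by locating each recognition site and measuring the fragments between cuts. The sketch below uses EcoRI (recognition sequence GAATTC, cut after the first G) on an invented linear sequence; the function names are hypothetical, and only the top strand is considered.

```python
def cut_positions(dna, site, cut_offset):
    """0-based positions where the enzyme cuts the top strand of a linear DNA."""
    positions = []
    start = dna.find(site)
    while start != -1:
        positions.append(start + cut_offset)
        start = dna.find(site, start + 1)
    return positions

def fragment_lengths(dna, cuts):
    """Lengths of the fragments produced by cutting linear DNA at `cuts`."""
    edges = [0] + sorted(cuts) + [len(dna)]
    return [right - left for left, right in zip(edges, edges[1:])]

# EcoRI recognizes GAATTC and cuts after the first G (G^AATTC).
dna = "ATGCGAATTCGGATATCCGAATTCTTAA"
cuts = cut_positions(dna, "GAATTC", 1)
print(cuts, fragment_lengths(dna, cuts))
```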
Gel electrophoresis
It is used to determine the length and purity of DNA molecules. Because DNA carries a negative charge,
fragments migrate toward the positive electrode and separate by size. Polyacrylamide or agarose gel is used.
For visualization of DNA, the radioisotope 32P or the dye ethidium bromide is commonly used.
Nucleic acid hybridization & blotting
It is a method using hybridization of oligonucleotide probe (DNA or RNA, 100-300bp long), marked
by radioisotope or chemical to complementary sequence in the sample. This is used to identify presence or
absence of a particular DNA or RNA, amount of RNA transcribed and altered DNA.
A probe is a single stranded DNA or RNA that is complementary to the target DNA. To be
detectable, the probe must be labeled with either a radioactive isotope or a fluorescent group.
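Complementarity between probe and target can be tested by comparing the target with the reverse complement of the probe. The sketch below uses short invented sequences (real probes are much longer) and hypothetical function names, and it requires a perfect match.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(seq):
    """Return the complementary strand of a DNA sequence, written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

def hybridization_sites(target, probe):
    """0-based start positions on `target` (5'->3') where `probe` can base pair."""
    footprint = reverse_complement(probe)
    width = len(footprint)
    return [i for i in range(len(target) - width + 1)
            if target[i:i + width] == footprint]

target = "GGCATTCGAGCTAAGCTTGG"
probe = "AGCTCGAA"        # 5'->3'; its reverse complement is TTCGAGCT
print(hybridization_sites(target, probe))
```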
Nucleic acid hybridization
1. Southern blot – Hybridization of a probe to DNA bound on a nitrocellulose membrane
2. Northern blot - Hybridization of a probe to RNA bound on a nitrocellulose membrane
Restriction fragments obtained from genomic DNA are separated by gel electrophoresis. Blots are
created by laying a membrane over one face of the gel and then creating a flow which carries the molecules in
the gel onto the nitrocellulose membrane. The nucleic acid fragments transferred to the membrane are then
hybridized with a labeled specific probe. The probe binds to the fragment having complementary bases and the
hybrid area can be visualized by an appropriate method such as exposure to X-ray film, UV or densitometry.
The polymerase chain reaction (PCR) is an in vitro (test tube) method of amplifying a target sequence of a DNA
molecule. The process is based on DNA replication, but several steps are achieved by different means (for
example, heat rather than enzymes separates the strands).
Millions of DNA copies can be obtained by PCR within a few hours. The sample can be a very small
amount of genomic DNA, such as a drop of blood or a hair.
The materials required for PCR are
1) DNA of interest
2) Two complementary oligonucleotide DNA primers (20-25 bp) that flank the target DNA sequence
3) heat stable DNA polymerase, Taq polymerase (Thermus aquaticus)
4) mixture of 4dNTPs; dATP, dGTP, dCTP and dTTP
5) reaction buffer containing enhancer and magnesium
6) Thermal cycler
The steps involved in PCR are;
1. Denaturation
The mix is heated to 94 - 98 °C for 5 minutes in order to denature the target DNA.
2. Primer annealing
The mix is then cooled to 50 - 65 °C to allow the primers to anneal to the complementary sequences in the test DNA.
3. Primer extension
The temperature is then raised again to the optimal temperature of Taq polymerase (72 °C) for synthesizing
new DNA strands using complementary dNTPs.
This set of three steps can be considered as one cycle. The target DNA is replicated in each cycle.
The number of DNA copies increases exponentially with repeated cycles, typically 30 to 60 cycles. [PCR can be
used to amplify DNA from buccal smears, single hairs, blood spots, body fluid secretions, fetal blood,
chorionic villi and amniotic fluid, paraffin embedded tissue or fossils.]
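Because the target is replicated in each cycle, the expected number of copies after n cycles is N = N0 x (1 + E)^n, where N0 is the starting number of copies and E is the per-cycle efficiency (E = 1 for perfect doubling). The efficiency parameter and the function name below are assumptions of this sketch; real reactions fall short of perfect doubling and plateau in late cycles.

```python
def pcr_copies(initial_copies, cycles, efficiency=1.0):
    """Expected copies after `cycles`, each multiplying the count by (1 + efficiency)."""
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles

for n in (1, 10, 30):
    print(n, pcr_copies(1, n))          # ideal doubling from a single template

print(pcr_copies(1, 30, efficiency=0.9))  # same run with 90% efficiency per cycle
```

With perfect doubling, 30 cycles turn one template into about a billion copies, which is why a trace sample is sufficient.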
Application of PCR
(1) Detection of infectious agents (bacterial or viral DNA/RNA) in blood or body fluid, especially in the latent
period
(2) Detection of allelic polymorphism, mutation and restriction fragment length polymorphism
(3) Prenatal diagnosis of genetic diseases
(4) Recombinant DNA technology and DNA sequencing
(5) Study of ancient DNA and evolution, using DNA from archeological samples
(6) Establishing precise tissue types in organ transplantation
(7) In forensic medicine, distinguishing one person from another by DNA fingerprinting of specimens usually
found at the scene of a crime
(8) RNA analysis after RNA copying (reverse transcription-PCR), and mRNA quantitation by real time
RT-PCR
(9) Detection of single nucleotide polymorphisms (SNPs). A polymorphism is defined as any DNA sequence
variant for which the population frequency of the less common allele is more than 1%.
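The 1% criterion can be applied directly to genotype counts at a site with two alleles. The sketch below assumes a biallelic site and diploid individuals; the genotype counts and the function names are invented for illustration.

```python
def minor_allele_frequency(n_AA, n_Aa, n_aa):
    """Frequency of the less common allele from genotype counts at a biallelic site."""
    total_alleles = 2 * (n_AA + n_Aa + n_aa)
    freq_A = (2 * n_AA + n_Aa) / total_alleles
    return min(freq_A, 1.0 - freq_A)

def is_polymorphism(n_AA, n_Aa, n_aa, threshold=0.01):
    """True when the less common allele exceeds the 1% population frequency."""
    return minor_allele_frequency(n_AA, n_Aa, n_aa) > threshold

print(minor_allele_frequency(900, 95, 5), is_polymorphism(900, 95, 5))
print(minor_allele_frequency(998, 2, 0), is_polymorphism(998, 2, 0))
```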
Advanced PCR systems for determination of quality and quantity of nucleic acids
• Reverse Transcriptase PCR (RT– PCR)
• Real time PCR (quantitative PCR)
DNA cloning
A clone is a large population of identical molecules or cells that arise from a common ancestor.
Molecular cloning is the propagation and multiplication of selected DNAs in microorganisms.
The DNA fragment of interest is ligated to vector DNA and the chimeric DNA is inserted into host bacteria.
The DNA fragment is cloned as the bacteria grow and divide. Most methods use a cloning vector: plasmid,
bacteriophage or artificial chromosome. The microorganisms containing the specific DNA sequence are
identified by a hybridization method. DNA cloning is mostly used for overexpression of protein.
Cloning vectors
1. Plasmids
Plasmids are small, circular, non-chromosomal duplex DNA molecules in bacterial cells. They function to confer
antibiotic resistance on the host cell. Their replication does not depend on chromosomal DNA replication.
They can accept 6 – 10 kb of foreign DNA.
2. Phages
Phages are viruses that infect bacteria. They have a linear DNA molecule and can accept 10 – 20 kb of
foreign DNA.
3. Cosmids
Cosmids are circular DNA molecules which combine the best features of plasmids and phages. They contain a
plasmid origin of replication (ori), which allows autonomous replication, an antibiotic marker and a cos site.
They can accept 35 – 50 kb of DNA.
4. Bacterial artificial chromosome (BAC) & Yeast artificial chromosome (YAC) can accept > 50 kb.
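Choosing a vector is largely a matter of matching insert size to capacity. The sketch below picks the smallest vector class whose upper limit, as quoted in the list above, accommodates the insert; the function name is hypothetical and real vector choice also depends on the host and the purpose of cloning.

```python
# Upper insert limits in kb, taken from the list above; None means no stated limit.
MAX_INSERT_KB = [
    ("plasmid", 10),
    ("phage", 20),
    ("cosmid", 50),
    ("BAC or YAC", None),
]

def smallest_suitable_vector(insert_kb):
    """Return the first vector class whose quoted upper limit fits the insert."""
    if insert_kb <= 0:
        raise ValueError("insert size must be positive")
    for name, limit in MAX_INSERT_KB:
        if limit is None or insert_kb <= limit:
            return name

for size in (8, 15, 40, 120):
    print(size, "kb ->", smallest_suitable_vector(size))
```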
==================================================================
RFLP (restriction fragment length polymorphism)
A restriction endonuclease cleaves dsDNA at a specific sequence, producing a characteristic set of
smaller DNA fragments. If the DNA deviates from normal, different fragments are produced; these differences
are called RFLPs.
RFLPs are due to a variety of mutations. They are used to facilitate prenatal detection of a number of
hereditary disorders, including sickle cell trait and thalassaemia, and also in human identification. They are
inherited in Mendelian fashion.
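How a point mutation changes the fragment pattern can be shown by digesting a normal and a variant sequence with the same enzyme. Both sequences below are invented, EcoRI (GAATTC, cut after the first G) is used only as a familiar example, and the function name is hypothetical.

```python
def digest(dna, site, cut_offset):
    """Fragment lengths (largest first) after cutting linear DNA at every `site`."""
    cuts = []
    start = dna.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = dna.find(site, start + 1)
    edges = [0] + cuts + [len(dna)]
    return sorted((right - left for left, right in zip(edges, edges[1:])), reverse=True)

normal = "TTACGGAATTCAGGCTAGAATTCCGTA"
variant = normal[:18] + "T" + normal[19:]   # A -> T destroys the second GAATTC site

print(digest(normal, "GAATTC", 1))    # two sites, three fragments
print(digest(variant, "GAATTC", 1))   # one site left, two fragments
```

On a gel the variant would show one larger band in place of two smaller ones, which is the length polymorphism being detected.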
Variable number of Tandem repeat (VNTR)
A VNTR is a location in a genome where a short nucleotide sequence is organized as a tandem repeat. It
can be analyzed by RFLP. VNTRs are found on many chromosomes and show variations in length between
individuals. They are inherited alleles and are useful for personal or parental identification, in genetic and
biological research, and in forensics, DNA fingerprinting and the CODIS database.
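The length variation at a VNTR locus comes down to how many copies of the repeat unit sit back to back. The sketch below counts them with a regular expression; the repeat unit, the flanking sequences and the function name are invented for illustration.

```python
import re

def tandem_repeat_count(dna, unit):
    """Largest number of back-to-back copies of `unit` found anywhere in `dna`."""
    runs = re.findall("(?:%s)+" % re.escape(unit), dna)
    return max((len(run) // len(unit) for run in runs), default=0)

allele_1 = "CCTA" + "GATA" * 5 + "TTGC"   # 5 repeats between the same flanks
allele_2 = "CCTA" + "GATA" * 9 + "TTGC"   # 9 repeats between the same flanks
print(tandem_repeat_count(allele_1, "GATA"), tandem_repeat_count(allele_2, "GATA"))
print(len(allele_2) - len(allele_1))      # length difference between the alleles
```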
Pharmacogenomics is the study of how an individual’s genetic inheritance affects the body’s response to drugs.
It is a combination of pharmaceutical science and biochemistry. It can create personalized or tailor-made drugs
with greater efficacy and safety. A person’s response to a drug is influenced by environment, diet, age, lifestyle
and state of health, and especially by genetic make-up.
Gene Therapy
Gene therapy is a technique for correcting defective genes responsible for disease development.
It is insertion or alteration or removal of genes within an individual’s cell.
Steps of gene therapy
- Isolate the healthy gene along with its control sequence
- Incorporate this gene into a gene vector (retrovirus, adenovirus, adeno-associated virus and Herpes
simplex virus)
- Finally deliver the vector to the target cell.
The Human Genome Project (HGP) was planned from 1988, launched in 1990 and completed in 2003. 18 countries
participated in this project.
The objectives of the International Human Genome Project included;
• Construction of a genetic map
• Identification of all human genes
• Sequencing of the entire genome
• Storage of the information in databases
• Improvement of tools for data analysis, transfer of technology to the private sector, and attention to the
ethical, legal and social issues that arise from the project
Benefits
Improved diagnosis of disease
Early detection of genetic predisposition to disease
Development of rational drug design
Gene therapy
Emergence of pharmacogenomics