BIOLOGY in its broadest
sense addresses the nature of various molecules, small and macro, which
make up the cellular constituents, their structure and function to maintain
the integrity, survival, growth, reproduction and differentiation of
a cell. Several disciplines like biochemistry, cell biology, genetics
and microbiology deal with different facets of both plant and animal
cells. Over the last five decades we have seen a phenomenal growth in
our knowledge about various biological phenomena. It is generally believed
that we will experience a much higher level of understanding of the
biological systems in the next decade. The current excitement in biology
is driven substantially by the completion of several genome projects,
including those related to humans.
In the last decade,
a new discipline has emerged in modern biology which is termed as
genomics. Genomics in its simplistic definition deals with the sequence content
and information present within the DNA which in turn influences various
processes of a cell. The emergence of this new discipline can be traced
to the Human Genome Project undertaken by an international collaborative
effort between both public funded and private enterprise. We have just
witnessed the completion of sequencing of the human genome of approximately
3.3 billion base pairs. During the course of this mega project several
technologies have been developed which are spin-offs of this enterprise.
These are now available for biological and medical scientists to address
several fundamental questions pertaining to life processes, development
and differentiation and disease processes.
The mapping of the
human genome was not the only focus of the human genome project: at
its outset the value of sequencing genomes of model organisms was also
recognized. Some of these organisms have been particularly amenable
to genetic analysis. In part, the sequencing of smaller genomes was
considered as a forerunner for large-scale sequencing. At present genomes
of about 150 organisms have already been sequenced, including the genomes
of several microorganisms related to human disease.
The list of prokaryotic
organisms whose genomes have been sequenced reveals different priorities.
In some cases, the driving force was to understand evolutionary relationships
between different organisms, as in the case of archeal genomes and,
in the case of mycoplasma genitialium,
to understand what constitutes a minimal genome. For many organisms,
however, the primary motivation for genome sequencing was their medical
relevance, particularly in the development of more sensitive diagnostic
tools and new targets for therapeutics and vaccines.
Given the importance of the plant kingdom, the genome
of a simple plant, arabidopsis, has now been sequenced and the sequencing
of the rice genome too is almost complete. In several forums questions
have been asked why India has not embarked on a complete sequencing
of any of the genomes, while a developing country like Brazil has sequenced
the genomes of several model organisms. Many members of the biology
research community in India feel that since such an exercise is extremely
expensive and not hypothesis based, it would affect the funding of smaller
intellect based projects. Others, however, argue that for the sake of
national pride, India should carry out the full sequencing of the genome
of any organism of potential application. Further, we missed a great
opportunity to show the world that Indian scientists have the necessary
skills and expertise in biology at the highest level. On a happier note,
India is a partner in the global network in the rice genome sequencing
project. Indian scientists have successfully completed sequencing the
long arm of chromosome 11 of the rice genome.
Once the sequence
of the human and other genomes is known, what difference will it make?
Certainly there will be a great boost to basic research as we grapple
with the fundamental biological questions of how our genome is interpreted
to specify a particularly function of a cell, tissue, organism and the
human individual. The differentiation and establishment of cellular
diversity in both uni- and multicellular organisms is indicated by a
well-orchestrated and coordinated differential expression of several
genes. Differential gene expression in response to specific stimuli
leads to differentiation of pluripotent stem cells into different cellular
lineages that give rise to different tissue types and organs. Often,
an aberration in expression/processing of a gene or a set of genes leads
to abnormal development and disease processes.
Newer methodologies such as gene micro array analysis
involving DNA chips (that contain thousands of sequence verified cDNA
or oligo-nucleotides) offer a tremendous opportunity to undertake global
gene expression profiling. Conceptually, the technique is relatively
simple and depends on the already available methods conveniently used
for nucleic acid hybridization. However, the new technology demands
a very high resolution analysis of several thousand spots on a single
microscopic slide. Fortunately, the development of software for computers
and statistical analysis has helped increase the utility and power of
these new technologies. Several laboratories in India have already started
using this technology to address important questions in biology, including
pathogenesis of some of infectious diseases and cancer. Globally, several
advances are being made towards identifying key set of genes associated
with a biological phenomenon and certain multigenic disorders like cancer.
It is worth noting that scientists have now identified a group of 10
genes that are commonly associated with tumor metastasis.
The completion of
the human genome sequence is more a starting point rather than a finish
line. One immediate question is what this DNA sequence would mean when
the entire science of genetics is predicated on variation. In other
words, given the vast diversity of human genomes on the planet, how
does one begin to use a whole genome sequence to identify important
traits like disease susceptibility?
To address the sequence variation in human population,
the Wellcome Trust of UK has started an SNP Consortium and the goal
of this consortium is to identify 300,000 single base differences between
genomes from different human populations. While most of these differences
may not cause disease, the theory is that a sufficiently dense SNP map
can provide useful markers for mapping candidate disease genes and studying
gene linkages. Such a study entails genotyping a large number of unrelated
people with complex of diseases like asthma, diabetes, hypertension
and comparing the genotypes to those of healthy controls. It is hoped
that ultimately one could identify all of the SNP variations between
disease-prone and disease-resistant genomes, providing a map of likely
disease-causing loci. There is, however, considerable concern whether
large sums of money should be spent on this effort because the utility
of such a database and a strong association between the SNP haplotype
and the disease phenotype is still not apparent. Nevertheless, some
studies have been initiated in India on the SNP mapping of the genes
involved in drug metabolism, DNA repair, asthma and diabetes.
Although the DNA sequence
arrangement in the genome defines the blueprint of a cell, it is the
proteins that are the final actors of the life processes. They function
as enzymes, structural proteins, receptors and so on. It is increasingly
clear that the number of proteins in a human cell may exceed the total
number of genes, some 30,000, by at least two fold because of the phenomenon
of alternate splicing. The diversity of proteins is further magnified
by post-translational modification such as phosphorylation, acetylation,
methylation and so on. Scientists have now begun to address the cellular
functions with respect to total protein profile of a cell and this area
is called as proteomics. Whereas the human genome sequence is fairly unidimensional
and finite, the proteome is multidimensional and constantly changing
in response to various environment factors, giving rise to nearly infinite
numbers of possible entities.
Similar to the human genome project, a human proteome
project has been initiated with particular reference to plasma proteome
and liver proteome. Several academic institutions and industries in
India have already integrated this new proteomics technology in their
research efforts to address issues related to development and differentiation,
plant biology, infectious diseases and cancer. Substantial progress
is being made in these directions. The technology itself has evolved
over the years due to the integration of two-dimensional polyacrylamide
gel electrophoresis and mass spectrometry. One limitation of the proteomics
technology is that only a subset of total proteome complement can be
analyzed in a single experiment due to the limitations of the resolving
power of the two dimensional gel electrophoresis. There is continuous
effort in evolving newer approaches to take care of this problem.
The functional diversity
of proteins is also influenced by its three-dimensional structure. Research
on structure-function relationship of proteins has been, and continues
to be, a major focus of research in India. One school of thought believes
that the next step in modern biological revolution in the post-genomic
era is the combined structural and functional genomic approach that
will greatly contribute to our understanding of gene function. There
is enough institutional expertise and strength in India to solve three-dimensional
structure of proteins. Several scientists have made significant contributions
in the area of structural biology and a group of Indian scientists are
also a part of an international consortium for mycobacterium tuberculosis
structural genomics. The availability of genomic sequence of several
organisms has also enabled several biochemists and cell biologists to
clone, express and study the biochemical function of any given gene.
It is expected that for the next decade biochemists will be busy deciphering
the function of many unknown genes and may discover new biochemical
pathways in many organisms.
Even as the genomic sequence information of several organisms
is being elucidated, some people believe that we may be seeing the end
of the genomics era. Others argue that it is not that ‘genomics’ has
come and gone but that we are at the beginning of a new phase of biology
research. The question now being asked in biosciences research is what
lies beyond genomics, proteomics and metabolomics. For many biologists,
the answer is systems biology, the integrated quantitative study of
life’s cellular processes in time and space. Whereas biological science
was a mix of descriptive studies and hypothesis testing, systems biology
stresses the value of integrating data to suggest models and generate
hypotheses to help unravel biological mechanisms and enhance predictive
skills.
Systems biology approaches are making inroads into a range
of different academic and research programmes. For example, yeast geneticists
are using such strategies to uncover the interactive biochemical pathway
diagrams of cellular responses to food sources. In microbiology, systems
approaches are being employed to uncover mechanisms of antibiotic resistance.
Developmental biologists have started applying systems analyses to understand
how patterns of gene expression programme the body of early embryos
in organisms such as fruit flies and sea urchins through the orderly
production of transcription factors. Several biotechnology companies
are also using a systems biology approach to pinpoint predictive bio-markers
of diseases to identify potential toxicities of promising drug molecules
in their early stages of development.
Systems biology is
also helping to understand complex pathologies like cancer, cardiovascular
disease and autoimmunity. Many researchers around the world have started
addressing the question of how cells interpret extracellular and intracellular
signals in a context dependent manner. A landmark paper was published
in 2001 in Science entitled ‘Integrated genomic and proteomic analyses
of a systematically perturbed metabolic network.’ By integrating data
from mRNA studies, quantitative measures of specific proteins and assessment
of protein-protein, the authors showed that such an integrated approach
gives a much richer picture of an organism than individual approaches.
Personally, I believe that the systems biology approach gives enormous
scope for understanding many biomedical problems. Such studies have
not yet been initiated in India and it is high time that efforts are
made in this direction.
While the effort to
connect genomics and proteomics, and now metabolomics, is clearly on
the rise, some important ‘omics’ remain unexplored. In future, there
should be progress in areas such as protein-sugar, protein-drug, and
protein-hormone interactions. In addition, now that wiring-diagrams
(output of systems biology) of various sorts are being constructed for
different species, scientists are likely to start comparing systems
between species. It is possible that new disciplines to study how the
wiring diagram differs between species or changes over time within a
species may emerge in response to stress or insults or physiological
stimulus. This new area of ‘Network Biology’ is still in its infancy.
The wiring diagram of cellular signalling is analogous
to the inner workings of a supercomputer or complex electronic equipment
like the systems that keep a passenger plane in the air. It is generally
agreed that signalling mechanisms is one such important process that
needs redundant mechanisms. It is like the four engines of a 747-jet;
if one fails, other engines take over. As with the electronic circuitry
of the plane, biology has also evolved positive and negative signals
(feedback systems). All taken together form the wiring diagram. Eliminating
a single component in this circuit can have unpredictable results. Similarly,
biologists are also knocking out one gene at a time to find out a gene
function in the biological context. Several approaches have been employed
for knockout studies, the latest being the small interfering RNA technology.
This approach permits scientists to study gene function in a much shorter
time scale. This emerging technology is already yielding valuable information
and many scientists in India are employing this approach in their research
efforts.
Another area of biological research which has recently
generated tremendous excitement is ‘stem cell research’. Stem cells
have two important characteristics that distinguish them from other
types of cells. First, they are unspecialized cells that renew themselves
for several generations through cell division. Second, they can be induced
to become specialized cells such as heart muscle or pancreas under certain
physiological or experimental conditions. There are two types of stem
cells derived from animals and humans, namely, adult stem cells and
embryonic stem cells. The adult stem cell is an undifferentiated cell
found in a differentiated tissue that can renew itself and differentiate
to yield all the specialized cell types of the tissue from which it
originated. Embryonic stem cells, on the other hand, are primitive undifferentiated
cells from the embryo that have the potential to become a wide variety
of specialized cell types.
Many biomedical scientists
believe that stem cells can become the basis for treating diseases like
Parkinson’s disease, diabetes and heart disease. Considerable effort
and research is being put on hematopoietic stem cells, rebuilding of
the nervous system with stem cells, repair of damaged heart with stem
cells and genetically modified stem cells in experimental gene therapies.
Although it was initially perceived as science fiction, some recent
success stories provide hope that we may soon understand how to manipulate
the stem cells to generate specific cell types making stem-cell based
therapies a reality. Efforts at stem cell research, particularly with
respect to hematopoietic stem cells, stem cells of the eye and muscle
are now underway in India.
The present decade of the new millennium is witnessing
an incredible volume of information generated by genomics and proteomics
research. The problem of handling this volume of information will not
be solved by technology alone. Some visionary people in the biology
community feel that a new generation of researchers, who can incorporate
information from diverse areas of science into their thought and intellectual
process, needs to be trained. In this context, it is worth looking at
the present structure of science education in the country and the world.
Basic science courses, including biology, chemistry and physics, are
often taught without reference to context and acknowledgement that the
students are taking other courses. Modern day biologists know their
own field very well, but often cannot do basic chemistry or statistics.
Consequently, instead of creating a profession of self-sufficient individuals,
the typical course of study creates biologists who are dependent on
others to get their work done.
The future of biology lies in integrating the knowledge
of classical biologists, modern day genomics and bioinformatics with
the language and skills of physical sciences, chemical sciences, engineering
mechanics and mathematics. It would be worthwhile if a group of scientists
from these diverse areas devise a curriculum that can be taught at the
undergraduate level so that we have a new generation of biologists who
can ask very different and challenging questions. This is already happening
at some of the universities in the USA and UK. The Indian educational
policy-makers should seriously think of this new biology so that we
do not fall behind our counterparts in the West.