tailieunhanh - Artificial Intelligence and Molecular Biology

Historically rich in novel, subtle, often controversial ideas, Molecular Bi- ology has lately become heir to a huge legacy of standardized data in the form of polynucleotide and polypeptide sequences. Fred Sanger received two, well deserved Nobel Prizes for his seminal role in developing the basic technology needed for this reduction of core biological information to one linear dimension. With the explosion of recorded information, biochemists for the first time found it necessary to familiarize themselves with databases and the algorithms needed to extract the correlations of records, and in turn have put these to good use in the exploration of phylogenetic relationships, and in the applied tasks of. | Foreward Joshua Lederberg Historically rich in novel subtle often controversial ideas Molecular Biology has lately become heir to a huge legacy of standardized data in the form of polynucleotide and polypeptide sequences. Fred Sanger received two well deserved Nobel Prizes for his seminal role in developing the basic technology needed for this reduction of core biological information to one linear dimension. With the explosion of recorded information biochemists for the first time found it necessary to familiarize themselves with databases and the algorithms needed to extract the correlations of records and in turn have put these to good use in the exploration of phylogenetic relationships and in the applied tasks of hunting genes and their often valuable products. The formalization of this research challenge in the Human Genome Project has generated a new impetus in datasets to be analyzed and the funds to support that research. There are then good reasons why the management of DNA sequence databases has been the main attractive force to computer science relating to molecular biology. Beyond the pragmatic virtues of access to enormous data the sequences present few complications of representation and the knowledge-acquisition task requires hardly more than the enforcement of agreed standards of deposit of sequence information in centralized network-linked archives. The cell s interpretation of sequences is embedded in a far more intricate context than string-matching. It must be conceded that the rules of base-complementarity in the canonical DNA double-helix and the matching of codons x Artificial Intelligence Molecular Biology to the amino acid sequence of the protein are far more digital in their flavor than anyone could have fantasized 50 years ago at the dawn of both molecular biology and modern computer science. There is far more intricate knowledge to be acquired and the representations will be more problematic when we contemplate the pathways by which a .