tailieunhanh - Báo cáo y học: " Assembling genomes using short-read sequencing technology"

Tuyển tập các báo cáo nghiên cứu về y học được đăng trên tạp chí y học Wertheim cung cấp cho các bạn kiến thức về ngành y đề tài: Assembling genomes using short-read sequencing technology. | Jackman and Birol Genome Biology 2010 11 202 http 2010 11 1 202 w Genome Biology MINIREVIEW L__ Assembling genomes using short-read sequencing technology Shaun D Jackman and inang Birol Abstract Gigabase-scale genome assemblies are now feasible using short-read sequencing technology bringing the cost of such projects below the million-dollar mark. Moore s law is often used as a predictor in the informatics field for the growth of processing power based on the increase in the number of transistors in integrated circuits. It states that according to the historical trend this number doubles roughly every 2 years. A similar trend manifests itself in the number of base pairs deposited in the GenBank database which had a mere 680 338 base pairs bp in its December 1982 release. Twentyseven years later that number reached 110 118 557 163 bp in its core repository and 158 317 168 385 bp in the Whole Genome Shotgun sequencing project repository. This increase corresponds to a doubling roughly every 17 months over 3 decades. If this trend is sustained by the mid-21st century we will have enough sequencing data to cover the genomes of the entire projected human population of 9 billion with more than fivefold redundancy and have several exabases 1018 bp remaining to sequence other species. This gap between the rates of growth of informatics and sequencing throughput is exerting a considerable strain on the development of bioinformatics tools to process the sequencing data generated. Hence we need ever faster and more accurate algorithms to keep up with this increasing gap much as media-specific compression algorithms such as those used by MP3 and DVD filled the gap between the digital media revolution and its storage requirements. This article focuses on three large and two smaller de novo sequencing projects all published within the last 6 months with a special emphasis on the recently published giant panda genome 1 which used a so-called .

TỪ KHÓA LIÊN QUAN