{"product_id":"genomics-in-the-cloud","title":"Genomics in the Cloud","description":"\u003cp\u003e\u003c\/p\u003e\u003cblockquote\u003e\n\u003cbr\u003eThe book provides practical guidance for researchers to work with genomics algorithms in the cloud, using open-source tools such as the Genome Analysis Toolkit (GATK), Docker, WDL, and Terra. It covers essential background information, cloud computing operations, GATK usage, automated analysis with scripted workflows, scaling up workflow execution, interactive analysis with Jupyter notebooks, secure collaboration, and computational reproducibility. \u003c\/blockquote\u003e\u003cp\u003e                                                            \u003cstrong\u003eFormat\u003c\/strong\u003e: Paperback \/ softback\u003cbr\u003e                              \u003cstrong\u003eLength\u003c\/strong\u003e: 475 pages\u003cbr\u003e                              \u003cstrong\u003ePublication date\u003c\/strong\u003e: 24 April 2020\u003cbr\u003e                              \u003cstrong\u003ePublisher\u003c\/strong\u003e: O'Reilly Media, Inc, USA\u003cbr\u003e                          \u003c\/p\u003e \u003cp\u003e\u003cbr\u003eThe genomics field is experiencing a remarkable surge in data, with organizations like the National Institutes of Health (NIH) poised to host over 50 million gigabytes of genomic data in the coming years. To meet the demand for efficient access and analysis of this vast amount of data, researchers are turning to cloud infrastructure. In this comprehensive book, experts guide researchers in adapting analysis tools and protocols to work seamlessly in the cloud.\u003cbr\u003e\u003cbr\u003eThe book begins by providing a foundational understanding of genomics and computing technology, covering essential concepts and tools. It then delves into basic cloud computing operations, enabling researchers to set up and manage cloud environments effectively. The authors introduce the Genome Analysis Toolkit (GATK), a widely used open-source tool for genomics analysis, and demonstrate its usage through three major GATK Best Practices pipelines. They also explore the automation of analysis with scripted workflows using Workflow Description Language (WDL) and Cromwell, enabling efficient and reproducible execution of genomics workflows.\u003cbr\u003e\u003cbr\u003eScaling up workflow execution in the cloud is a crucial aspect, and the book covers parallelization and cost optimization techniques to maximize computational resources. It also introduces interactive analysis in the cloud using Jupyter notebooks, providing a seamless platform for data exploration and analysis. Additionally, the book emphasizes secure collaboration and computational reproducibility using Terra, a distributed computing platform designed for genomics data management.\u003cbr\u003e\u003cbr\u003eBy following the step-by-step instructions and real-world examples provided in this book, researchers will gain the skills and knowledge necessary to work with genomics algorithms in the cloud. Whether they are beginners or experienced practitioners, this practical guide will empower them to leverage the power of cloud infrastructure for efficient and impactful genomics research.\u003c\/p\u003e\u003cp\u003e                            \u003cstrong\u003eWeight\u003c\/strong\u003e: 852g                            \u003cbr\u003e\u003cstrong\u003eDimension\u003c\/strong\u003e: 177 x 234 x 29 (mm)                            \u003cbr\u003e\u003cstrong\u003eISBN-13\u003c\/strong\u003e: 9781491975190                                                      \u003c\/p\u003e","brand":"Geraldine Van Der Auwera,Brian D. O'connor","offers":[{"title":"Paperback \/ softback","offer_id":44100309385466,"sku":"9781491975190","price":51.4,"currency_code":"GBP","in_stock":false}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0522\/4297\/2845\/products\/4eac80dbb3841df793202dea83ee4905.jpg?v=1624585085","url":"https:\/\/shulphink.com\/products\/genomics-in-the-cloud","provider":"Shulph Ink","version":"1.0","type":"link"}