{"product_id":"delta-lake-up-and-running-modern-data-lakehouse-architectures-with-delta-lake-9781098139728","title":"Delta Lake: Up and Running: Modern Data Lakehouse Architectures with Delta Lake","description":"\u003cp\u003e\u003c\/p\u003e\u003cblockquote\u003e\n\u003cbr\u003eDelta Lake is an open-source lakehouse framework that provides a robust way to manage and analyze big data. This book teaches data engineers, data scientists, and data analysts how to use Delta Lake and its features to build data pipelines and applications, ensuring high-quality data and efficient insights. \u003c\/blockquote\u003e\u003cp\u003e\u003cstrong\u003eFormat\u003c\/strong\u003e: Paperback \/ softback\u003cbr\u003e\u003cstrong\u003eLength\u003c\/strong\u003e: 250 pages\u003cbr\u003e\u003cstrong\u003ePublication date\u003c\/strong\u003e: 31 October 2023\u003cbr\u003e\u003cstrong\u003ePublisher\u003c\/strong\u003e: O'Reilly Media\u003cbr\u003e\u003c\/p\u003e \u003cp\u003e\u003cbr\u003eWith the rise of big data and artificial intelligence (AI), organizations have the opportunity to rapidly generate data products. However, the success of their analytics and machine learning models relies heavily on the quality of the data they rely on. Delta Lake, an open-source format, provides a robust lakehouse framework that surpasses platforms like Amazon S3, Azure Data Lake Storage (ADLS), and Google Cloud Storage (GCS). This practical book aims to guide data engineers, data scientists, and data analysts in getting Delta Lake and its features up and running efficiently.\u003cbr\u003e\u003cbr\u003eThe ultimate objective of building data pipelines and applications is to extract valuable insights from the vast amounts of data available. This book will help you understand how your choice of storage solution impacts the robustness and performance of your data pipeline, from raw data to actionable insights. You will learn how to:\u003cbr\u003e\u003cbr\u003eUtilize modern data management and engineering techniques to optimize data processing and storage.\u003cbr\u003e\u003cbr\u003eGain a deep understanding of how ACID transactions ensure reliability and consistency in data lakes, even at scale.\u003cbr\u003e\u003cbr\u003eExecute streaming and batch jobs concurrently against your data lake, maximizing resource utilization.\u003cbr\u003e\u003cbr\u003ePerform update, delete, and merge operations on your data lake with ease.\u003cbr\u003e\u003cbr\u003eEmploy time travel capabilities to roll back and examine previous data versions, aiding in data analysis and troubleshooting.\u003cbr\u003e\u003cbr\u003eBuild a streaming data quality pipeline that follows the medallion architecture, ensuring data integrity and accuracy throughout the pipeline.\u003cbr\u003e\u003cbr\u003eBy leveraging Delta Lake's open-source format and its powerful features, organizations can unlock the full potential of their data and drive innovation in their industries. Whether you are a data engineer, data scientist, or data analyst, this book will provide you with the knowledge and tools you need to succeed in the era of big data and AI.\u003c\/p\u003e\u003cp\u003e\u003cbr\u003e\u003cstrong\u003eDimension\u003c\/strong\u003e: 233 x 178 (mm)\u003cbr\u003e\u003cstrong\u003eISBN-13\u003c\/strong\u003e: 9781098139728\u003c\/p\u003e","brand":"Bennie Haelen,Dan Davis","offers":[{"title":"Paperback \/ softback","offer_id":44735062769914,"sku":"9781098139728","price":38.78,"currency_code":"GBP","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0522\/4297\/2845\/products\/1699031219917_book.jpg?v=1699193516","url":"https:\/\/shulphink.com\/products\/delta-lake-up-and-running-modern-data-lakehouse-architectures-with-delta-lake-9781098139728","provider":"Shulph Ink","version":"1.0","type":"link"}