{"product_id":"automating-data-quality-monitoring-at-scale-scaling-beyond-rules-with-machine-learning-9781098145934","title":"Automating Data Quality Monitoring at Scale: Scaling Beyond Rules with Machine Learning","description":"\u003cp\u003e\u003c\/p\u003e\u003cblockquote\u003e\n\u003cbr\u003eBusinesses generate 2.5 quintillion bytes of data daily, but much of it is poor quality or useless. This book provides practical advice on using automated data quality monitoring to ensure high-quality records, covering all tables efficiently, proactively alerting on issues, and resolving problems immediately. It also helps understand the limits of automated monitoring and how to overcome them, and how to deploy and manage the solution at scale. \u003c\/blockquote\u003e\u003cp\u003e\u003cstrong\u003eFormat\u003c\/strong\u003e: Paperback \/ softback\u003cbr\u003e\u003cstrong\u003eLength\u003c\/strong\u003e: 170 pages\u003cbr\u003e\u003cstrong\u003ePublication date\u003c\/strong\u003e: 30 January 2024\u003cbr\u003e\u003cstrong\u003ePublisher\u003c\/strong\u003e: O'Reilly Media\u003cbr\u003e\u003c\/p\u003e \u003cp\u003e\u003cbr\u003eThe world's businesses ingest a staggering 2.5 quintillion bytes of data every day, a vast amount of information that is used to build products, power AI systems, and drive business decisions. However, the question arises: how much of this data is of poor quality or simply bad? This practical book aims to address this concern and provide guidance on ensuring that the data your organization relies on contains only high-quality records.\u003cbr\u003e\u003cbr\u003eWhile many data engineers, data analysts, and data scientists genuinely care about data quality, they often lack the time, resources, or understanding to create a data quality monitoring solution that succeeds at scale. In this book, Jeremy Stanley and Paige Schwartz from Anomalo offer valuable insights on how to leverage automated data quality monitoring to effectively cover all your tables, proactively alert on every category of issue, and resolve problems immediately.\u003cbr\u003e\u003cbr\u003eHere are some key takeaways from the book:\u003cbr\u003e\u003cbr\u003e* Data quality is a business imperative: Recognize that data quality is not just a technical concern but a critical factor in driving business success. Poor-quality data can lead to incorrect decisions, wasted resources, and customer dissatisfaction.\u003cbr\u003e* Understand and assess unsupervised learning models for detecting data issues: Learn about unsupervised learning models, which are used to identify patterns and anomalies in data without human intervention. These models can help detect data quality issues such as missing values, duplicate records, and outliers.\u003cbr\u003e* Implement notifications that reduce alert fatigue and let you triage and resolve issues quickly: Implement notifications that are tailored to the specific issues and categories of concern. This can help you prioritize and address problems efficiently, reducing alert fatigue and allowing you to focus on resolving critical issues.\u003cbr\u003e* Integrate automated data quality monitoring with data catalogs, orchestration layers, and BI and ML systems: Integrate automated data quality monitoring with your existing data infrastructure to ensure seamless integration and centralized management. This can help you streamline your processes and improve efficiency.\u003cbr\u003e* Understand the limits of automated data quality monitoring and how to overcome them: Understand the limitations of automated data quality monitoring and identify areas where manual intervention may be necessary. This can include identifying complex data relationships, handling sensitive data, and addressing specific business requirements.\u003cbr\u003e* Learn how to deploy and manage your monitoring solution at scale: Deploy your automated data quality monitoring solution in a scalable and efficient manner. This can include optimizing performance, monitoring system health, and ensuring that your monitoring solution can handle the growing volume of data.\u003cbr\u003e* Maintain automated data quality monitoring for the long term: Maintain your automated data quality monitoring solution over time to ensure that it continues to meet your evolving business requirements. This can include regular maintenance, updates, and training to ensure that your team is equipped to handle new challenges and emerging technologies.\u003cbr\u003e\u003cbr\u003eIn conclusion, this book provides valuable insights and practical guidance on ensuring data quality in your organization. By leveraging automated data quality monitoring, you can improve the accuracy, reliability, and usability of your data, leading to better business decisions and increased competitive advantage. Whether you are a data engineer, data analyst, or data scientist, this book will help you build a data quality monitoring solution that succeeds at scale and drives your organization's success.\u003c\/p\u003e\u003cp\u003e\u003cstrong\u003eWeight\u003c\/strong\u003e: 394g\u003cbr\u003e\u003cstrong\u003eDimension\u003c\/strong\u003e: 176 x 234 x 15 (mm)\u003cbr\u003e\u003cstrong\u003eISBN-13\u003c\/strong\u003e: 9781098145934\u003c\/p\u003e","brand":"Jeremy Stanley,Paige Schwartz","offers":[{"title":"Paperback \/ softback","offer_id":45290102751482,"sku":"9781098145934","price":37.83,"currency_code":"GBP","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0522\/4297\/2845\/products\/1706894955417_book.jpg?v=1706943119","url":"https:\/\/shulphink.com\/products\/automating-data-quality-monitoring-at-scale-scaling-beyond-rules-with-machine-learning-9781098145934","provider":"Shulph Ink","version":"1.0","type":"link"}