{"product_id":"handson-entity-resolution-9781098148485","title":"Hands-On Entity Resolution","description":"\u003cp\u003e\u003c\/p\u003e\u003cblockquote\u003e\n\u003cbr\u003eEntity resolution is a crucial analytic technique that identifies multiple data records that refer to the same real-world entity. This book provides practical understanding and techniques to scale up data matching processes and improve the accuracy of reconciliations, using real-world data examples. It covers challenges in deduplicating and joining datasets, extracting, cleansing, and preparing datasets, text matching algorithms, techniques for deduplicating and joining datasets at scale, matching datasets containing persons and organizations, evaluating data matches, optimizing and tuning data matching algorithms, and entity resolution using cloud APIs. \u003c\/blockquote\u003e\u003cp\u003e\u003cstrong\u003eFormat\u003c\/strong\u003e: Paperback \/ softback\u003cbr\u003e\u003cstrong\u003eLength\u003c\/strong\u003e: 200 pages\u003cbr\u003e\u003cstrong\u003ePublication date\u003c\/strong\u003e: 13 February 2024\u003cbr\u003e\u003cstrong\u003ePublisher\u003c\/strong\u003e: O'Reilly Media\u003cbr\u003e\u003c\/p\u003e \u003cp\u003e\u003cbr\u003eEntity resolution is a crucial analytical technique that empowers users to identify multiple data records that correspond to the same real-world entity. This practical guide is designed for product managers, data analysts, and data scientists, offering valuable insights on how to enhance data value through cleansing, analysis, and resolution. Author Michael Shearer guides readers in scaling up their data matching processes and improving the accuracy of reconciliations. By leveraging open-source Python libraries and cloud APIs, individuals will learn how to remove duplicate entries within a single source and join disparate data sources when common keys are unavailable. Through real-world data examples, this book fosters a practical understanding to expedite the delivery of real business value.\u003cbr\u003e\u003cbr\u003eEntity resolution plays a pivotal role in building rich and comprehensive data assets that unveil relationships for marketing and risk management purposes, essential for maximizing the potential of machine learning (ML) and artificial intelligence (AI). This book addresses various challenges associated with deduplicating and joining datasets, including extracting, cleansing, and preparing datasets for matching. It explores text matching algorithms to identify equivalent entities and techniques for deduplicating and joining datasets at scale, encompassing datasets containing persons and organizations. The book also delves into evaluating data matches, optimizing and tuning data matching algorithms, and leveraging cloud APIs for entity resolution. Additionally, it discusses matching using privacy-enhancing technologies, ensuring data security and compliance. By mastering entity resolution, users can unlock the full potential of their data and drive meaningful business outcomes.\u003c\/p\u003e\u003cp\u003e\u003cstrong\u003eWeight\u003c\/strong\u003e: 354g\u003cbr\u003e\u003cstrong\u003eDimension\u003c\/strong\u003e: 175 x 234 x 15 (mm)\u003cbr\u003e\u003cstrong\u003eISBN-13\u003c\/strong\u003e: 9781098148485\u003c\/p\u003e","brand":"Michael Shearer","offers":[{"title":"Paperback \/ softback","offer_id":45282300428538,"sku":"9781098148485","price":39.97,"currency_code":"GBP","in_stock":false}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0522\/4297\/2845\/products\/1709317591838_book.jpg?v=1709552280","url":"https:\/\/shulphink.com\/products\/handson-entity-resolution-9781098148485","provider":"Shulph Ink","version":"1.0","type":"link"}