Skip to product information
1 of 1

Akash Tandon,Sandy Ryza,Uri Laserson,Sean Owen,Josh Wills

Advanced Analytics with PySpark: Patterns for Learning from Data at Scale Using Python and Spark

Advanced Analytics with PySpark: Patterns for Learning from Data at Scale Using Python and Spark

Low Stock: Only 3 copies remaining
Regular price £36.56 GBP
Regular price £52.99 GBP Sale price £36.56 GBP
31% OFF Sold out
Tax included. Shipping calculated at checkout.

YOU SAVE £16.43

  • Condition: Brand new
  • UK Delivery times: Usually arrives within 2 - 3 working days
  • UK Shipping: Fee starts at £2.39. Subject to product weight & dimension
Trustpilot 4.5 stars rating  Excellent
We're rated excellent on Trustpilot.
  • More about Advanced Analytics with PySpark: Patterns for Learning from Data at Scale Using Python and Spark


Apache Spark is the de facto tool for analyzing big data, and this updated guide teaches you how to approach analytics problems using PySpark and other best practices. It covers common techniques such as classification, clustering, collaborative filtering, and anomaly detection in fields such as genomics, security, and finance.

Format: Paperback / softback
Length: 275 pages
Publication date: 24 June 2022
Publisher: O'Reilly Media


The sheer volume of data being generated today is truly astounding, and it continues to expand at an unprecedented rate. In this landscape, Apache Spark has emerged as the go-to tool for analyzing large datasets, playing a pivotal role in the field of data science. This comprehensive guide, updated for Spark 3.0, is designed to empower data scientists with the knowledge and skills necessary to tackle complex analytics problems using PySpark, Spark's Python API, and other best practices in Spark programming.

Led by a team of experienced data scientists, including Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, and Josh Wills, this guide provides a comprehensive introduction to the Spark ecosystem. It delves into various patterns and techniques that apply to diverse fields such as genomics, security, and finance, utilizing common machine learning and statistical approaches.

Furthermore, this updated edition expands its coverage to include natural language processing (NLP) and image processing, making it an invaluable resource for data scientists working with text and visual data. Whether you have a solid foundation in machine learning and statistics or are new to the field, this book will guide you through the process of conducting large-scale data analysis.

By familiarizing yourself with Spark's programming model and ecosystem, you will gain a deep understanding of how to leverage its powerful capabilities. You will explore general approaches in data science, examine complete implementations that analyze large public datasets, and discover which machine learning tools are most suitable for specific problems. Additionally, you will explore code that can be adapted to a wide range of applications, enabling you to apply your knowledge to real-world scenarios.

In conclusion, if you are looking to unlock the full potential of big data and advance your data science skills, this updated guide to Apache Spark is an essential resource. With its comprehensive coverage, practical examples, and expert guidance, it will empower you to tackle complex analytics problems and make informed decisions based on data. So, whether you are a seasoned data scientist or just starting your journey, this book is your key to success in the world of big data analytics.

Weight: 410g
Dimension: 176 x 234 x 15 (mm)
ISBN-13: 9781098103651

This item can be found in:

UK and International shipping information

UK Delivery and returns information:

  • Delivery within 2 - 3 days when ordering in the UK.
  • Shipping fee for UK customers from £2.39. Fully tracked shipping service available.
  • Returns policy: Return within 30 days of receipt for full refund.

International deliveries:

Shulph Ink now ships to Australia, Belgium, Canada, France, Germany, Ireland, Italy, India, Luxembourg Saudi Arabia, Singapore, Spain, Netherlands, New Zealand, United Arab Emirates, United States of America.

  • Delivery times: within 5 - 10 days for international orders.
  • Shipping fee: charges vary for overseas orders. Only tracked services are available for most international orders. Some countries have untracked shipping options.
  • Customs charges: If ordering to addresses outside the United Kingdom, you may or may not incur additional customs and duties fees during local delivery.
View full details