Emil Stolarsky

97 Things Every SRE Should Know

Low Stock: Only 2 copies remaining
Regular price: £39.99 GBP
Sale price: £27.59 GBP (31% off)
Tax included. Shipping calculated at checkout.

YOU SAVE £12.40

  • Condition: Brand new
  • UK Delivery times: Usually arrives within 2 - 3 working days
  • UK Shipping: Fees start at £2.39, subject to product weight and dimensions
Rated Excellent on Trustpilot (4.5 stars).

More about 97 Things Every SRE Should Know

Site reliability engineering (SRE) is a critical skill, and this book provides 97 concise and useful tips from across the industry, including best practices and new approaches to knotty problems.

Format: Paperback / softback
Length: 300 pages
Publication date: 04 December 2020
Publisher: O'Reilly Media, Inc, USA


Site reliability engineering (SRE) is an essential skill in today's digital world, as the reliability of systems has become critical to businesses and organizations. In this practical book, newcomers and experienced professionals alike will explore a wide range of conversations happening in SRE. The editors, Jaime Woo and Emil Stolarsky, co-founders of Incident Labs, have collected 97 concise and useful tips from across the industry, including trusted best practices and innovative approaches to complex problems.

The book is divided into four sections:

Adopting SRE: This section provides actionable advice on how to implement SRE in your organization. It covers topics such as building a team, defining responsibilities, and measuring success.

Why SLOs Matter: SLOs (service-level objectives) are crucial in SRE, as they provide a way to measure the performance of systems and ensure that they meet the needs of users. This section explains how to set and manage SLOs, and how to use them to improve the reliability of your systems.

When to Upgrade Your Incident Response: Incident response is an essential part of SRE, as it helps teams respond to and recover from incidents quickly and efficiently. This section provides advice on when to upgrade your incident response processes, and how to do so effectively.

Monitoring and Observability: Monitoring and observability are key components of SRE, as they allow you to track the performance of your systems and identify potential issues before they become problems. This section explains the different types of monitoring and observability tools available, and how to use them effectively.

Each section of the book includes case studies and examples to illustrate the practical applications of the tips and techniques discussed. The book is written in a clear and concise manner, making it easy to understand and apply to your own SRE practices.

Whether you are a new SRE practitioner or an experienced professional looking to improve your skills, this book is a valuable resource. It will help you grow and refine your SRE skills through sound advice and thought-provoking questions that drive the direction of the field.

Some of the 97 things you should know:

Test Your Disaster Plan: Tanya Reilly provides valuable advice on how to test your disaster plan and ensure that it is effective in the event of an outage or incident.

Integrating Empathy into SRE Tools: Daniella Niyonkuru discusses the importance of empathy in SRE and how to integrate it into your tools and processes.

The Best Advice I Can Give to Teams: Nicole Forsgren shares her best advice for teams working in SRE, including how to communicate and collaborate effectively and how to manage stress.

Where to SRE: Fatema Boxwala provides insights on where to focus your SRE efforts and how to prioritize your work.

Facing That First Page: Andrew Louis offers advice on how to get started in SRE and how to overcome the initial challenges that may arise.

I Have an Error Budget, Now What?: Alex Hidalgo discusses the importance of error budgets in SRE and how to use them effectively.

Get Your Work Recognized: Write a Brag Document: Julia Evans and Karla Burnett provide tips on how to write a brag document that showcases your SRE work and achievements.

In short, this practical book offers insights and advice on implementing SRE effectively, for newcomers and experienced professionals alike.

Weight: 358g
Dimensions: 150 x 226 x 17 mm
ISBN-13: 9781492081494

UK and International shipping information

UK Delivery and returns information:

  • Delivery within 2 - 3 working days when ordering in the UK.
  • Shipping fee for UK customers from £2.39. Fully tracked shipping service available.
  • Returns policy: Return within 30 days of receipt for full refund.

International deliveries:

Shulph Ink now ships to Australia, Belgium, Canada, France, Germany, Ireland, Italy, India, Luxembourg, Saudi Arabia, Singapore, Spain, the Netherlands, New Zealand, the United Arab Emirates, and the United States of America.

  • Delivery times: within 5 - 10 days for international orders.
  • Shipping fee: charges vary for overseas orders. Only tracked services are available for most international orders. Some countries have untracked shipping options.
  • Customs charges: Orders delivered to addresses outside the United Kingdom may incur additional customs charges and duties on local delivery.