Site Reliability Engineering

Site Reliability Engineering: How Google Runs Production Systems

Paperback Published on: 31/10/2026
Price: £63.99
Coming soon
No stock available in any shop.

Synopsis

Google pioneered the discipline of Site Reliability Engineering, applying reliability to the entire user journey for consumer, enterprise, and infrastructure systems. In the years since, many organizations have followed suit, guided by the tenets laid out in this practical book. This fully revised edition brings Site Reliability Engineering up to date with fresh insights on engineering techniques, organizational processes, and case studies that will help you promote and implement greater reliability throughout the engineering lifecycle.

In this collection of essays and articles, key members of Google's Site Reliability Engineering team explore the company's current SRE practices and explain how they've evolved in the decade since the initial publication. Updates cover the value of reliability, cloud reliability, and the impact of AI. You'll learn the principles and practices that enable Google engineers to make some of the world's largest systems scalable, reliable, and efficient, with lessons directly applicable to your organization.

  • Train new Site Reliability Engineers based on the latest practices in the field
  • Develop engineering organizations that support reliability as a feature
  • Build online services that incorporate reliability principles
  • Use AI to improve SRE across the organization and optimize critical areas such as automation and incident detection

Publisher information

  • Publisher: O'Reilly Media
  • ISBN: 9798341607682
  • Number of pages: 600
  • Dimensions: 232 x 178 mm
  • Languages: English
