Site Reliability Engineering

How Google Runs Production Systems



Bookstore > Books > Site Reliability Engineering

Price$11.15 - $62.99
Rating
AuthorsBetsy Beyer, Chris Jones, Jennifer Petoff, Niall Richard Murphy
PublisherO'Reilly Media
Published2016
Pages552
LanguageEnglish
FormatPaper book / ebook (PDF)
ISBN-10149192912X
ISBN-139781491929124
EBook Hardcover Paperback

The overwhelming majority of a software system's lifespan is spent in use, not in design or implementation. So, why does conventional wisdom insist that software engineers focus primarily on the design and development of large-scale computing systems?

In this collection of essays and articles, key members of Google's Site Reliability Team explain how and why their commitment to the entire lifecycle has enabled the company to successfully build, deploy, monitor, and maintain some of the largest software systems in the world. You'll learn the principles and practices that enable Google engineers to make systems more scalable, reliable, and efficient - lessons directly applicable to your organization.

This book is divided into four sections: Introduction - Learn what site reliability engineering is and why it differs from conventional IT industry practices; Principles - Examine the patterns, behaviors, and areas of concern that influence the work of a site reliability engineer (SRE); Practices - Understand the theory and practice of an SRE's day-to-day work: building and operating large distributed computing systems; Management - Explore Google's best practices for training, communication, and meetings that your organization can use.


  1. (3 books)



4 5 1783

Similar Books


Database Reliability Engineering

Database Reliability Engineering

by Laine Campbell, Charity Majors

The infrastructure-as-code revolution in IT is also affecting database administration. With this practical book, developers, system administrators, and junior to mid-level DBAs will learn how the modern practice of site reliability engineering applies to the craft of database architecture and operations. Authors Laine Campbell and Charity...

Price:  $40.71  |  Publisher:  O'Reilly Media  |  Release:  2017

The Site Reliability Workbook

The Site Reliability Workbook

by Niall Murphy, David Rensin, Betsy Beyer, Kent Kawahara, Stephen Thorne

In 2016, Google's Site Reliability Engineering book ignited an industry discussion on what it means to run production services today - and why reliability considerations are fundamental to service design. Now, Google engineers who worked on that bestseller introduce The Site Reliability Workbook, a hands-on companion that uses concrete ex...

Price:  $21.74  |  Publisher:  O'Reilly Media  |  Release:  2018

Seeking SRE

Seeking SRE

by David Blank-Edelman

Organizations - big and small - have started to realize just how crucial system and application reliability is to their business. At the same time, they've also learned just how difficult it is to maintain that reliability while iterating at the speed demanded by the marketplace. Site Reliability Engineering (SRE) is a proven approach to ...

Price:  $29.95  |  Publisher:  O'Reilly Media  |  Release:  2018

Building Secure and Reliable Systems

Building Secure and Reliable Systems

by Betsy Beyer, Piotr Lewandowski, Ana Oprea, Paul Blankinship, Heather Adkins, Adam Stubblefield

Can a system be considered truly reliable if it isn't fundamentally secure? Or can it be considered secure if it's unreliable? Security is crucial to the design and operation of scalable systems in production, as it plays an important part in product quality, performance, and availability. In this book, experts from Google share best prac...

Price:  $49.92  |  Publisher:  O'Reilly Media  |  Release:  2020

Real-World SRE

Real-World SRE

by Nat Welch

Real-World SRE is the go-to survival guide for the software developer in the middle of catastrophic website failure. Site Reliability Engineering (SRE) has emerged on the frontline as businesses strive to maximize uptime. This book is a step-by-step framework to follow when your website is down and the countdown is on to fix it.Nat Welch ...

Price:  $39.99  |  Publisher:  Packt Publishing  |  Release:  2018

Learning Puppet 4

Learning Puppet 4

by Jo Rhett

If you're a system administrator, developer, or site reliability engineer responsible for handling hundreds or even thousands of nodes in your network, the Puppet configuration management tool will make your job a whole lot easier. This practical guide shows you what Puppet does, how it works, and how it can provide significant value to y...

Price:  $36.06  |  Publisher:  O'Reilly Media  |  Release:  2016

Docker: Up & Running

Docker: Up & Running

by Karl Matthias, Sean P. Kane

Docker is quickly changing the way that organizations are deploying software at scale. But understanding how Linux containers fit into your workflow - and getting the integration details right - are not trivial tasks. With this practical guide, you'll learn how to use Docker to package your applications with all of their dependencies, and...

Price:  $24.34  |  Publisher:  O'Reilly Media  |  Release:  2015

Kubernetes Operators

Kubernetes Operators

by Joshua Wood, Jason Dobies

Operators are a way of packaging, deploying, and managing Kubernetes applications. A Kubernetes application doesn't just run on Kubernetes; it's composed and managed in Kubernetes terms. Operators add application-specific operational knowledge to a Kubernetes cluster, making it easier to automate complex, stateful applications and to augm...

Price:  $24.99  |  Publisher:  O'Reilly Media  |  Release:  2020