Finally, use the project management style that suits the project in its current state.
Bobby Marleyhar citeratför 8 år sedan
A matrix of all possible combinations of disasters with plans to address each of these disasters permits you to sleep soundly for at least one night; keeping your recovery plans current and exercised permits you to sleep the other 364 nights of the year.
Bobby Marleyhar citeratför 8 år sedan
The cost of failure is education. Devin Carraway
Bobby Marleyhar citeratför 8 år sedan
Your monitoring system should address two questions: what’s broken, and why?
Bobby Marleyhar citeratför 8 år sedan
In general, an SRE team is responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of their service(s).
Bobby Marleyhar citeratför 8 år sedan
Monitoring should never require a human to interpret any part of the alerting domain.
Darkohar citeratförra månaden
request success rate
Darkohar citeratförra månaden
Whether it is at Google or elsewhere, monitoring is an absolutely essential compo‐
nent of doing the right thing in production. If you can’t monitor a service, you don’t know what’s happening, and if you’re blind to what’s happening, you can’t be reliable.
Darkohar citeratförra månaden
store mapping between BNS paths and IP address:port pairs.
Darkohar citeratförra månaden
Load balancing at the Remote Procedure Call (RPC) level,