About the Authors

Casey Rosenthal formalized the practice of Chaos Engineering by co-writing and publishing the definition http://principlesofchaos.org/ with the Chaos Team at Netflix, which he managed for three years. He put together a conference on the topic called Chaos Community Day, the only conference dedicated to Chaos Engineering, which is now entering its fourth year. Casey also manages the Chaos Engineering Community Google Group and co-wrote the Chaos Engineering O’Reilly report. He is currently CTO at Backplane.io, a company that provides reliable and resilient infrastructure.
Nora Jones is a Senior Software Engineer on the Chaos Engineering team at Netflix where she works on ensuring that Netflix remains resilient in the face of uncertain conditions. She is also a student of Human Factors and Systems Safety at Lund University. She is passionate about resilient software, people, and the intersection of those two worlds. She recently keynoted at AWS re:Invent to an audience of ~40,000 people about the benefits and business case behind implementing Chaos Engineering. Prior to Netflix Nora founded and led the Developer Productivity team at Jet.com.
Nathan Aschbacher is currently CEO of Auxon Corporation. He began his career writing programs for CNC machines where overlooked edge cases resulted in mangled heaps of metal, broken tool bits, and lasting impressions of catastrophic failure. Over the many years since, Nathan has designed fault-tolerant, highly-available, and high-assurance systems for distributed data platforms, machine learning, and global payment processing. Nathan first applied Chaos Engineering principles to problems in the FinTech space, transferred the practice to autonomous vehicle development, and now explores the marriage of formal methods and Chaos Engineering for verifying and validating the resiliency of complex, highly-automated safety-critical systems in a number of different domains (e.g., automotive, industrial automation, and aerospace).

About the Book

There’s more to chaos engineering than deliberately breaking stuff in production. With this book, QA engineers as well as program and product managers will examine the theory, history, and implementation of this full-fledged software engineering discipline. Chaos experts Casey Rosenthal, Nora Jones, and Nathan Aschbacher will bring you up to speed on this practice for finding failures within your application, network, and infrastructure.

As the software industry continues to move toward microservices and other complex, distributed systems, fewer people are able to hold a working picture of the entire system in their minds. Complexity can’t be removed from these systems, but new methodologies allow engineers to navigate the complexity while optimizing for business goals such as feature velocity, performance, and fault tolerance. This book guides you through chaos engineering and demonstrates how this methodology can help you optimize for availability.




Douban Reviews

  • 李Q
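    This book is more about worldview and methodology than engineering practice. The first few chapters build on Netflix's chaos principles, and the later chapters share hands-on experience from several major companies. Each company approaches the topic from a different angle, but because the book is an anthology, a fair amount of content is repeated. The best-practice chapters focus mostly on experiment design, closing the loop with post-incident reviews, and building a chaos engineering culture and process; there is comparatively little discussion of tools and platforms. Interestingly, the different best-practice chapters themselves interpret the chaos principles in quite different ways, which is worth thinking through in light of your own product scenario and infrastructure maturity. In short, if you want to learn about the testing and tooling side of chaos engineering, this is not the book for you, but for teams just starting to consider chaos engineering, it is a good book worth reading through once first. 04-28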
