
Several posts this week feature content from workshops, classes and training sessions. I know just how much effort it takes to put together high-quality learning material like that, and I’m super appreciative and happy that folks are increasingly making this content freely available online.


Essential eBook for DevOps pros: The Dev and Ops Guide to Incident Management offers 25+ pages of insight into building teams and improving your response to downtime.


Lessons learned while writing The DevOps Handbook

In a recent webinar, Gene Kim, co-author of The Phoenix Project, shared the top insights he discovered while co-authoring The DevOps Handbook. Read about the DevOps best practices he shared. As a bonus, you’ll also get lessons learned from Dynatrace’s own DevOps transformation – 26 major releases per year and 170 production deployments.


Designing and building secure software is not easy, and few teams adopt a formal approach to solving the problem. The UK National Cyber Security Centre have published a handy set of 8 principles with which to test your development methodology.

The STELLA report from the SNAFU catchers consortium was published this week and makes interesting reading for anyone working on incident response or running large-scale systems. It contains writeups of several failures involving common technologies, delves into the theory and provides useful recommendations.

An excellent take on why monitoring is changing in the face of the adoption of cloud native platforms and microservice architectures. Nicely ties the co-evolution of the spaces together.

A good take on why Kubernetes is interesting. The point about having a single database of the state of all of your applications is super powerful in my view.

An excellent set of resources for anyone wanting to understand distributed systems. A nice, and not overlong, list of important books, papers and blog posts.

Detailed notes for a class on distributed systems. This is perfect if you already know some bits but maybe not the formal language and want to identify areas for more investigation.

A good post on the tension between the desire to fix the problem when an outage occurs and the need to communicate with users and others about the problem.

A good slide deck exploring practical things you and your team can do to improve the operability of your systems.

Microservices typically present some interesting challenges for testing, which can easily result in expensive, slow and brittle tests quickly coming to dominate. This post has some good tips for unit testing services.

An interesting look at the numbers behind the growth of hyper-scale public cloud provider datacenters, and why the idea that everything will rapidly move to the public cloud might play out more slowly.

Assessing risk, and using that assessment to help design systems or processes, is a useful skill. It’s critical for avoiding under- (or over-) investment in securing systems in particular. This freely available workshop content serves as a great introduction to risk assessment fundamentals.

A nice, detailed look at how Docker handles failure of containers, looking at signals, exit codes, restart policies and health checks.

The Linux Foundation

Free Your Metadata by joining this webinar w/ODPi, IBM, ING, and Hortonworks October 12, Online

We all use metadata every day. Metadata today has gravity; however, it should be able to move with the data and be augmented and processed through APIs for permitted usages. Join us and find out how we’re going to accomplish this.

CNCF - Cloud Native Computing Foundation

Free Webinar - Cloud Native Apps from Development to Production October 12, Online

Learn about the advantages and elements of a cloud-native approach, and how GitLab is making it simpler to deploy cloud-native applications to production using Kubernetes.

KubeCon + CloudNativeCon - Join leading Kubernetes, Docker, and Cloud Native technologists, December 6-8, in Austin for a broad range of technical sessions on the cloud native ecosystem. We sold out in Berlin and are excited to see thousands of you from the community join us, this time in Austin!


DevSecCon is back in London on the 19th and 20th of October. An interesting schedule of topics for operators, developers and security folks with talks on securing Apple laptops, making process secure by default, compliance in a continuous delivery world and more.


Kedge aims to provide a high-level data-centric interface for describing Kubernetes applications, with a focus on good defaults and familiarity for folks already using the lower-level APIs.

The Docker Version Manager (dvm) is a cross-platform command-line tool that helps you install and switch between Docker clients. Handy if you’re working with a range of different server versions.

Like DevOps? Register for All Day DevOps - a FREE online conference this October, offering 100 DevOps-focused sessions across six different tracks. Learn more & register: