1 minute read

A short issue this week, with a few posts on decision making when adopting Kubernetes, and a full series of posts on the issue of alert fatigue and what you can do about it.

From our sponsor, Victorops

How do you respond to an outage? Do you have an internal and external communication plan? A good DevOps-minded team will prepare for failure and understand what’s necessary for communicating during an outage. Learn more:
http://try.victorops.com/devopsweekly/crisis-communication-practices

News

A nice case study of evolving CI/CD practices and tooling in an organisation as it scales up and as tools and processes mature. SOme good tips for chat updates for any sort of integration too.
https://betterment.engineering/ci-cd-at-betterment-part-i-shortening-the-feedback-loop-6d207218e24b

Envoy is a useful modern proxy in its own right, and increasingly being embedded in other tooling too. This post is a nice simple introduction to the basic configuration and moving parts like discovery services.
https://jvns.ca/blog/2018/10/27/envoy-basics/

A concrete example of applying swarming techniques to a support service desk.
https://medium.com/@JonHall_/a-nice-example-of-swarming-in-support-of-a-service-desk-7bdbe8e9890c

A 4 part series on the issue of alert fatigue. Why is this a problem, how to reduce the amount of alerts by filtering out noise and consolidating information into actionable alerts and more.
https://blog.sensu.io/alert-fatigue-part-1-avoidance-and-course-correction
https://blog.sensu.io/alert-fatigue-part-2-alert-reduction-with-sensu-filters-token-substitution
https://blog.sensu.io/alert-fatigue-part-3-automating-triage-remediation-with-checks-hooks-handlers
https://blog.sensu.io/alert-fatigue-part-4-alert-consolidation

A write-up of choosing to build your own Kubernetes cluster, centered around building an understanding of a core piece of technology as well as the cost in differences between the main cloud providers.
https://medium.com/bench-engineering/starting-out-with-kubernetes-why-we-rolled-our-own-despite-the-naysayers-5c8c65e65e76

Another experience report from adopting Kubernetes, this time in video form. Lots of practical real world experience on networking, areas where the lack of documentation causes issues and more.
https://www.youtube.com/watch?v=2dsCwp_j0yQ

Tools

Dive is a handy tool for analysing Docker image layers. It provides a very nice CLI interface showing what changes between layers and giving an idea of space you could save with further optimisation.
https://github.com/wagoodman/dive

Kimchi is a nice looking dashboard for managing KVM guests via libvirt. View templates, running guests and start new instances.
https://github.com/kimchi-project/kimchi/

A collection of tools for working with container registries. Copy images from one place to another, check what tags are available in a registry, get the digest of an image and more.
https://github.com/google/go-containerregistry

How do you respond to an outage? Do you have an internal and external communication plan? A good DevOps-minded team will prepare for failure and understand what’s necessary for communicating during an outage. Learn more:
http://try.victorops.com/devopsweekly/crisis-communication-practices

Updated: