DEVOPS WEEKLY ISSUE #300 - 25th September 2016
Several posts this week on automation and engineering efforts with physical hardware, as well as posts on serverless, unikernels and containers. Generalising when it comes to technology in this space is dangerous, which makes it all the more impressive that the culture and organisation aspects of devops appear to be so generally applicable.
Sponsor
[WEBINAR] The Do’s and Dont’s of Post-Incident Analysis. Join VictorOps and Datadog to get an inside look at how to conduct modern post-incident analysis.
Sign up now: http://bit.ly/2d0Ejyk
Sponsored
Top Lessons Learned While Researching and Writing The DevOps Handbook
Join The DevOps Handbook co-author Gene Kim as he shares first-hand insights and lessons learned during the five years required to create this essential resource, including DevOps transformation case studies around continuous integration and delivery. Register today!
http://ow.ly/EXwE304sS1F
News
Windows 10 is an increasingly interesting development environment. One of the things I’ve been waiting for is the ability to run windows hyper-v containers and linux containers side by side. Here’s how to do it.
https://stefanscherer.github.io/run-linux-and-windows-containers-on-windows-10/
A good reminder that adopting devops means acting to improve the culture of your organisation, and some good tips for building bridges between former silos.
https://speakerdeck.com/kdaniels/building-bridges-with-devops
Most talk of unikernels has been about theory, but these notes from a recent talk discuss getting them into production. Lots of talk of scaling, hosting, testing and general observations of an early adopter.
http://icfp2016.mirage.io/CUFP/baby-steps-to-unikernels-in-pr.md
An argument that end-to-end testing is mainly not worth the effort due to long lead times and maintenance costs, and a more balanced continuous testing approach is more useful.
http://www.slideshare.net/agilestevesmith/endtoend-testing-considered-harmful
As seen in the post above, the best way to test web application is still a matter of opinion. This survey is collecting some data surrounding application testing and I’d be interested in seeing the results.
https://www.surveymonkey.com/r/X7MW3JL
Some good practical tips for anyone getting started with writing incident postmortems focused on learning rather than blame.
https://medium.com/production-ready/writing-your-first-postmortem-8053c678b90f#.6v7rac3tk
Physical infrastructure doesn’t have to mean a lack automation. A solid post on mosting to a fully automated provisioning system using CIMC and Foreman.
https://engineering.linkedin.com/blog/2016/09/project-autobuild
I attended the recent Operability conference in London and these two posts nicely summarise the various talks and provide lots of links to the slides.
https://skeltonthatcher.com/blog/operability-io-2016-operations-is-crucial/
https://skeltonthatcher.com/blog/operability-io-2016-operations-still-relevant-day-1/
An interesting post about rebuilding a common part of a web application stack at scale, in this case how GitHub rebuild their load balancer layer. Good explanation of the design goals.
http://githubengineering.com/introducing-glb/
In 1.12 Docker added the HEALTHCHECK instruction to the Dockerfile syntax. This post covers what it does and why, and shows how you can add healthchecks to your images.
https://blog.newrelic.com/2016/08/24/docker-health-check-instruction/
A detailed slide deck looking at building better security monitoring into your applications. Looks at data collection, visualisation and analysis of the collected data.
http://www.slideshare.net/jtmelton/building-selfdefending-applications-with-owasp-appsensor-javaone-2016
Tools
LambCI is a CI and build system designed to run on AWS Lambda. This makes for some interesting properties, including potentially large cost savings depending on your build profile.
https://medium.com/@hichaelmart/lambci-4c3e29d6599b#.fhhti6fbs
https://github.com/lambci/lambci
Static site generators are nothing new, but Trestus is interesting as it’s very domain specific. It aims to build status pages, for communicating with users around outages, with the data coming from Trello. If you’re using Trello internally to manage an outage that could be a nice workflow.
https://canonical-ols.github.io/trestus/
SQLint is a simple SQL syntax checker, ideal for sanity checking SQL files as part of a continuous integration workflow.
https://github.com/purcell/sqlint
[WEBINAR] The Do’s and Dont’s of Post-Incident Analysis. Join VictorOps and Datadog to get an inside look at how to conduct modern post-incident analysis.
Sign up now: http://bit.ly/2d0Ejyk