Rootconf 2016

Rootconf is India's principal conference where systems and operations engineers share real-world knowledge about building resilient and scalable systems.

saurabh hirani

@saurabh_hirani

The transition: Manual => Automated => Distributed monitoring

Submitted Jan 15, 2016

Everyone talks about the benefits of having an automated monitoring system in place: one that discovers
infrastructure components as they are added, monitors them while they are alive, and stops monitoring them
when they are removed. But no one has chronicled their journey through the process of automating a manually
maintained monitoring system and showcased their battle scars for others to learn from. That’s what
we are going to talk about.

Outline

We moved our monitoring systems from a manually maintained Icinga 1.x (Nagios-compatible) setup to an automated, distributed Icinga 2 infrastructure. This talk covers the following topics:

  • The challenges of monitoring a distributed infrastructure
  • A manually maintained Nagios setup and its limitations
  • Prepping for moving to an automated monitoring setup
  • Collating data from multiple host sources
  • Progressive cutover from manual to automated monitoring
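
As a rough illustration of the "collating data from multiple host sources" item above, here is a minimal Python sketch. It is not the setup described in this talk: the function names (`collate_hosts`, `plan_changes`), the host names, and the two inventory sources are all hypothetical, and a real deployment would read from actual inventories and write monitoring configuration rather than print. The idea is only to merge several host lists into one set and compare it with what is currently monitored, so that new hosts are added and removed hosts are dropped.

```python
def collate_hosts(*sources):
    """Merge host names from several inventory sources into one set.

    Names are stripped and lower-cased so the same host reported by two
    sources with different formatting is counted once.
    """
    merged = set()
    for source in sources:
        merged.update(name.strip().lower() for name in source if name.strip())
    return merged


def plan_changes(discovered, monitored):
    """Return (hosts to start monitoring, hosts to stop monitoring), sorted."""
    to_add = sorted(discovered - monitored)
    to_remove = sorted(monitored - discovered)
    return to_add, to_remove


# Hypothetical inventories, e.g. a cloud API listing and a CMDB export.
cloud_hosts = ["web-1", "web-2", "db-1"]
cmdb_hosts = ["DB-1", "cache-1 "]
# Hypothetical set of hosts the monitoring system already knows about.
currently_monitored = {"web-1", "db-1", "old-batch-1"}

discovered = collate_hosts(cloud_hosts, cmdb_hosts)
to_add, to_remove = plan_changes(discovered, currently_monitored)
print("start monitoring:", to_add)
print("stop monitoring:", to_remove)
```

Computing the difference first, instead of regenerating everything blindly, also suits a progressive cutover: the planned additions and removals can be reviewed before they are applied.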

Requirements

Curiosity and an itch to automate all things manual.

Speaker bio

I currently work as a Senior DevOps engineer at BlueJeans. I enjoy automating infrastructure and contributing to the open source community.

GitHub - https://github.com/saurabh-hirani/
LinkedIn - https://in.linkedin.com/in/shirani
Blog - http://saurabh-hirani.github.io/writing/

Slides

http://saurabh-hirani.github.io/slides/monitoring-transition.html#/

Hosted by

We care about site reliability, cloud costs, security and data privacy