Let’s Talk About Time - Data Driven Organic Monitoring @Directi

May 2015

11 Mon

12 Tue

13 Wed

14 Thu

15 Fri 08:15 AM – 05:30 PM IST

16 Sat 08:15 AM – 05:15 PM IST

17 Sun

Make a submission

Accepting submissions till 02 Jul 2019, 02:31 AM

MLR Convention Centre, J P Nagar

In today’s technology world, operational efficiency is pivotal. It is time to focus on measuring, learning and improving different aspects of infrastructure scaling and development cycles. Not to forget the the culture of DevOps and team dynamics.

Rootconf 2015 focuses on four key aspects of DevOps and scaling infrastructure: virtualization, managing infrastructure at scale, new technologies, and team management.

Target audience:

Rootconf is aimed at DevOps, CTOs, and teams and companies.
Organizations developing a Continuous Integration and Continous Deployment strategy. You will get a headstart by attending Rootconf.

Conference and workshop topics:

Virtualization.
Managing infrastructure at scale:
- Scaling processes for enterprises – log monitoring, security, data management.
- Scaling tools for enterprises – such as using Graphite, logstash and other tools at scale.
Upcoming technologies such as Salt.
Team management / DevOps as culture.

Format:

This year’s edition spans two days of hands-on and conference. We are inviting proposals for:

Full-length 40 minute talks.
Crisp 15-minute talks.
Sponsored sessions, 40 minute duration.
Hands-on Workshop sessions, 3 hour duration

Selection Process:

Proposals will be filtered and shortlisted by an Editorial Panel. We urge you to add links to videos / slide decks when submitting proposals. This will help us understand your past speaking experience. Blurbs or blog posts covering the relevance of a particular problem statement and how it is tackled will help the Editorial Panel better judge your proposals.

Proposers must submit presentation drafts as part of the selection process to ensure that the talk is in line with the theme of the conference, and to help the editorial panel build a strong line-up for the event.

We will notify you about the status of your proposal within two weeks of submission.

There is only one speaker per session. Entry is free for selected speakers. As our budget is limited, we will prefer speakers from locations closer home, but will do our best to cover for anyone exceptional. HasGeek will provide a bursary to cover part of your travel and accommodation in Bangalore. Bursaries are limited and made available to speakers delivering full sessions (40 minutes or longer).

Commitment to Open Source:

HasGeek believes in open source as the binding force of our community. If you are describing a codebase for developers to work with, we’d like it to be available under a permissive open source license. If your software is commercially licensed or available under a combination of commercial and restrictive open source licenses (such as the various forms of the GPL), please consider picking up a sponsorship. We recognize that there are valid reasons for commercial licensing, but ask that you support us in return for giving you an audience. Your session will be marked on the schedule as a sponsored session.

Important dates:

Last date for submitting proposals: 15th April

Shortlisting process: 31st March onwards

Pre-conference workshops: 13th and 14th May

Conference dates: 15th and 16th May

##Venue:
Rootconf will be held in the MLR Convention Centre, J P Nagar.

##Contact:
For more information about speaking proposals, tickets and sponsorships, contact info@hasgeek.com or call +91-7676332020.

Hosted by

Rootconf

Rootconf is a community-funded platform for activities and discussions on the following topics: Site Reliability Engineering (SRE). Infrastructure costs, including Cloud Costs - and optimization. Security - including Cloud Security. more

All submissions

Previous Next

This submission has been added to the schedule

Let’s Talk About Time - Data Driven Organic Monitoring @Directi

Submitted Apr 10, 2015

Section: Crisp talk Technical level: Intermediate

Describe a radical rethink of how we perceive alerts and monitoring and the profound implications it has on how we describe and interact with our infrastructure.
Its production implementation and how it enables us to get a fresh new insight and analysis on how we deal with problems.
What benefits, challenges and constructs it brings

Outline

The human perception of dimensions and space, specifically time is drastically different from that of machines. Yet, almost every monitoring system today completely ignores this key distinction and floods us with thousands of mechanical streams of data-points that are increasingly a burden for an operator to interpret and react on. They often do not account for vital data about the operational response which is absolutely key to how infrastructure is run in any enterprise.

The perception of time is central to this concept, clustering diverse data-points we build an abstract construct that closely mimics how humans perceive and react to events and situations. While raw data-streams are modeled as low dimension immutable facts that systems can rapidly and effectively interpret, this abstraction is mutable and modeled as an FSM (finite state machine), enabling it to hold several derived dimensions that are of great value. These attributes/dimensions can be attained through transformation functions triggered by the event stream.

This abstraction has had a profound impact on how we at Directi interact with our issues and infrastructure and enables us to explore possibilities that didn’t exist before . We shall take a sneak peak of the λ-architecture and Materialized Views in Slant (our platform) that abstracts the standard monitoring layer into a mesh of highly composable and flexible constructs.

This has empowered us to ask and answer complex questions such as - How do I define the virality/relative score of an issue? How did the operations team respond? What caused my team to lose sleep and how they reacted & resolved these issues ? What deployment, support & policy changes impacted operations and how ? How do I deduce optimal alerts and escalations ? What issues are critical and what was the root cause ?

Finally we shall then look at real world examples such as the ability to identify hot points,overlay diverse but related data, define auto aggregations in a more natural form, doing away with trigger level redundancy. How this enables conversation and allows us to organically explore issues and get a top level unified real-time and historical view of issues in our infrastructure and visualize them.

Requirements

No specific requirements, familiarity with monitoring and alerting systems will help.

Speaker bio

Dhananjay Sathe is a former BITS Pilani grad currently working as a Sr Operations Engineer on the Platform Team at Directi/Endurance where he architects and builds the central operations platform and toolchain.
In the past he has contributed OpenSource Projects such as Samba through the GSoC, Gnome and been one of the developers behind the RoboEarth Cloud Engine. His prior speaking engagements include multiple highly rated talks at PyCon India, Rootconf 14 and GoogleIOx.
His favourite hobbies include programming, travelling, exploring adventure sports and craft brews (in no particular order of preference).