Cortex: Horizontally Scalable, Distributed Prometheus

Sep 2019

16 Mon

17 Tue

18 Wed

19 Thu

20 Fri

21 Sat 08:55 AM – 06:20 PM IST

22 Sun

Make a submission

Accepting submissions till 21 Aug 2019, 10:30 AM

St. Laurn Hotel, Pune

Tickets

##About Rootconf Pune:

Rootconf Pune is a conference for:

DevOps engineers
Site Reliability Engineers (SRE)
Security and DevSecOps professionals
Software engineers
Network engineers

The Pune edition will cover talks on:

InfoSec and application security for DevOps programmers
DNS and TLS 1.3
SRE and distributed systems
Containers and scaling

Speakers from Flipkart, Hotstar, Red Hat, Trusting Social, Appsecco, InfraCloud Technologies, among others, will share case studies from their experiences of building security, SRE and Devops in their organizations.

##Workshops:

Two workshops will be held before and after Rootconf Pune:

Full-day Prometheus training workshop on 20 September, conducted by Goutham V, contributor to Prometheus and developer at Grafana Labs. Details about the workshop are available here: https://hasgeek.com/rootconf/2019-prometheus-training-pune/
Full-day DNS deep dive workshop on 22 September by Ashwin Murali: https://hasgeek.com/rootconf/2019-dns-deep-dive-workshop-pune/

##Event venue:
Rootconf Pune will be held on 21 September at St. Laurn Hotel, Koregaon Park, Pune-411001.

#Sponsors:

Click here to view the Sponsorship Deck.
Email sales@hasgeek.com for bulk ticket purchases, and sponsoring the above Rootconf Series.

Rootconf Pune 2019 sponsors:

#Platinum Sponsor

#Bronze Sponsors

#Community Partner

##To know more about Rootconf, check out the following resources:

hasgeek.com/rootconf
hasgeek.com/rootconf/2019
https://hasgeek.tv/rootconf/2019

For information about the event, tickets (bulk discounts automatically apply on 5+ and 10+ tickets) and speaking, call Rootconf on 7676332020 or write to info@hasgeek.com

Hosted by

Rootconf

Rootconf is a community-funded platform for activities and discussions on the following topics: Site Reliability Engineering (SRE). Infrastructure costs, including Cloud Costs - and optimization. Security - including Cloud Security. more

All submissions

Previous Next

Cortex: Horizontally Scalable, Distributed Prometheus

Submitted Feb 14, 2019

Technical level: Intermediate

In this talk we’ll present a horizontally scalable, distributed, Prometheus API-compatible monitoring system called Cortex. Cortex was built to offer a different solution to Prometheus HA and virtually infinite retention. We’ll discuss its architecture, tradeoffs and evolution, with special reference given to the distributed systems algorithms use to provide failure tolerance and scalability.

Cortex turns a lot of the Prometheus architectural assumptions on its head, by marrying a scale-out PromQL query engine with a storage layer based on NOSQL databases such as Bigtable, DynamoDB and Cassandra. We have disaggregated the Prometheus binary into a microservices-style architecture, with separate services for query, ingest, alerting and recording rules. By designing all these services as fungible replicas, this solution can be scaled out with ease and failure of any individual replica can be dealt with gracefully.

Cortex is a CNCF Project and has been in production for over two years now, and the talk will cover some of the many things we have learnt along the way.

Outline

This talk will help the audience understand what Cortex is, how it relates to Prometheus and how to get started with it. The lack of horizontal scalability, replication and long term storage have been cited by some as a reason not to choose Prometheus; Cortex aims to provide a version of Prometheus with these features, removing some of the reasons against adoption.

We start with Prometheus, it’s limitations and then dive into the motivations and architecture behind Cortex. We then talk about users and use-cases before we finally talk about the future of the project.

Speaker bio

Goutham is a developer from India who started his journey as an infra intern at large company where he worked on deploying Prometheus. After that initial encounter, he started contributing to Prometheus and interned with CoreOS, working on Prometheus’ new storage engine. He is now a maintainer for TSDB, the engine behind Prometheus 2.0. He now works at Grafana Labs on open-source observability tools. When not hacking away, he is either on his bike, or is binge watching GCN!

All submissions

Previous Next