[Realtime Metrics Ecosystem @ PhonePe - How we handle more than 400 billion metrics a day]

Nov 22, 2024 (Fri), 09:00 AM – 05:10 PM IST

Bangalore International Centre, Bengaluru

Submitted Oct 21, 2024

Submission type: 40 min talk
Track in which your submission fits: Systems engineering

Have you ever experienced an abrupt service shutdown in production due to the inability to monitor CPU utilization and memory spikes post-deployment? If so, you understand the critical importance of service metrics monitoring.

At PhonePe, we empower our engineers to continuously monitor their systems using OpenTSDB. On top of these metrics, we have built an in-house alerting system, Anomaly Detection, which helps teams get real-time alerts for any anomalies. More than 200 clients push more than 400 billion metrics a day, with peaks touching 5 million metrics per second. We retain raw metrics for 30 days and rolled-up metrics for 365 days. The overall cluster footprint is close to 80 bare-metal servers holding terabytes of data.
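To put the figures above in perspective, here is a back-of-the-envelope calculation. It uses only the numbers quoted in this abstract and treats "more than 400 billion" as exactly 400 billion, so the results are lower bounds. The variable names are illustrative, and the retained-points figure assumes the daily volume stays constant over the 30-day raw retention window.

```python
# Back-of-the-envelope arithmetic from the figures quoted in the abstract.
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds

metrics_per_day = 400_000_000_000  # "more than 400 billion metrics a day"
peak_per_sec = 5_000_000           # "peak touching 5 million metrics per sec"
raw_retention_days = 30            # raw metrics are retained for 30 days

# Average ingest rate if the daily volume were spread evenly over the day.
avg_per_sec = metrics_per_day / SECONDS_PER_DAY

# How far the quoted peak sits above that even-spread average.
peak_to_avg = peak_per_sec / avg_per_sec

# Raw data points held at any time, assuming a constant daily volume.
raw_points_retained = metrics_per_day * raw_retention_days

print(f"average rate : {avg_per_sec:,.0f} metrics/sec")
print(f"peak/average : {peak_to_avg:.2f}")
print(f"raw retained : {raw_points_retained:,} data points")
```

The even-spread average works out to roughly 4.6 million metrics per second, so the quoted peak of 5 million is only about 8% above it. Read at face value, that suggests sustained high load rather than occasional bursts.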

In this talk, we will cover:
The system architecture of our metrics platform, built around OpenTSDB.
The system optimisations we have made over the years to scale Kafka and HBase, which act as the backbone of our platform.
Production outages and their remediations.

Key takeaways
How we scaled OpenTSDB to handle 400 billion metrics a day
Rollup of metrics using Spark
Feedback loop for building an intelligent system
Dos and don'ts for managing large infrastructure
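As a rough illustration of what rolling up metrics means, the sketch below collapses raw data points into fixed time buckets. It is plain Python rather than Spark, and it is not the job described in the talk: the hourly interval, the `(series_key, timestamp, value)` input shape, and the choice of count/sum/min/max aggregates are all assumptions made for the example.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Rollup:
    """Aggregates for one series within one time bucket (hypothetical shape)."""
    count: int = 0
    total: float = 0.0
    minimum: float = float("inf")
    maximum: float = float("-inf")

    def add(self, value: float) -> None:
        self.count += 1
        self.total += value
        self.minimum = min(self.minimum, value)
        self.maximum = max(self.maximum, value)


def rollup(points, interval_s=3600):
    """Group (series_key, unix_ts, value) points into interval_s-wide buckets."""
    buckets = defaultdict(Rollup)
    for key, ts, value in points:
        # Align the timestamp down to the start of its bucket.
        bucket_start = ts - (ts % interval_s)
        buckets[(key, bucket_start)].add(value)
    return dict(buckets)


# Three raw points for one series: two in the first hour, one in the second.
raw = [("cpu.util", 0, 1.0), ("cpu.util", 60, 3.0), ("cpu.util", 3600, 5.0)]
hourly = rollup(raw)
for (key, start), agg in sorted(hourly.items()):
    print(key, start, agg.count, agg.total, agg.minimum, agg.maximum)
```

Keeping the count and sum, instead of a precomputed average, lets averages over longer windows be derived later without going back to the raw points, which matters once those points have aged out of the shorter retention window.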

This session is useful for:
Developers
SREs and DevOps engineers
Engineering managers

Hosted by

Rootconf

We care about site reliability, cloud costs, security and data privacy

Rootconf Mini 2024 (on 22nd & 23rd Nov)
