Submissions

Rootconf Mini 2024 (on 22nd & 23rd Nov)

Geeking out on systems and security since 2012

Accepting submissions

Not accepting submissions

General guidelines for submitting talk and demo proposals We appreciate that many participants create submissions out of a genuine desire to share knowledge with our community, and to contribute in a meaningful way. However, many written submissions fail to capture the attention of the community, o… expand

General guidelines for submitting talk and demo proposals

We appreciate that many participants create submissions out of a genuine desire to share knowledge with our community, and to contribute in a meaningful way. However, many written submissions fail to capture the attention of the community, or meet acceptance through Rootconf’s peer review process. More often than not this is because the content of the submission does not explain what they intend with sufficient clarity or detail.

The template (and example) is an attempt to help you write a better submission, one that is noticed and understood by your intended audience and not lost in the crowd of interesting proposals we receive. Please use this template as a guideline, while ensuring that it is in your own unique and authentic voice.

BEFORE you begin writing your submission, please give some thought to the following:

  1. Who is the audience for your session? Think about their interests, work roles, challenges, age or experience as you decide this.
  2. What problem/pain are you trying to solve (for the audience)? This should be something that is communicated clearly so that they have a sense of your session’s importance.
  3. What will be the scope of your session? This will help identify the central topic or theme and should describe broad areas you plan to cover during the session?
  4. How will participants benefit from your session? Think of practical and specific ways in which they will be able to apply the knowledge they gain, and beyond just general awareness.
  5. What is the appropriate format for your session, given the audience and objectives that you have in mind?

The most successful talks and sessions are those where presenters are able to abstract an actionable insight from a common pain area, enlighten the audience about something new, provide a fresh perspective, and/or demonstrate innovation.

Here’s a guide for speakers to draft their presentations.

You can view talks held at previous editions of Rootconf 2024 for reference:

  1. Cloud Costs Optimization 2023 - http://has.gy/xeb4
  2. Rootconf SRE 2023 - http://has.gy/Lnis
  3. Rootconf Hyderabad 2019 - http://has.gy/7VSz

The call for submissions will be close on 30 October 2024. Talks will be selected on a rolling basis as submissions are made.

Topics for submitting talks

A. Systems engineering:

  • Resilience engineering in complex systems
  • Advanced observability practices
  • Serverless architectures: costs, benefits and challenges
  • Multi-cloud and hybrid cloud strategies
  • Kubernetes in production: lessons learned
  • Continuous Integration and Continuous Delivery (CI/CD) pipeline automation

B. Security engineering:

  • Security engineering case studies - systems/processes/practices that organizations have developed.
  • Security tooling experiences - open source and proprietary
  • Cloud security - case studies.
  • Security for AI; AI and security products

Types of submissions

You can submit a session for:

  1. 40 mins talk
  2. 15 mins demos and experience reports with dev tools and security tools
  3. Birds of Feather (BOF) sessions
  4. Hands-on workshops for three hours duration
  5. Pitch sessions - where you demonstrate MVPs you have built.
Vaidyanathan S

Vaidyanathan S

Moving your Databases to Kubernetes - Flipkart's DBaaS journey.

Abstract When most companies talk about Kubernetes adoption, they talk about the stateless aspect. However most of them shy away from Kubernetes when it comes to the stateful part. In this talk we will explore why Flipkart chose to move to stateful K8s for databases, the challenges we faced in this journey and the road ahead. more
  • 10 comments
  • Confirmed & scheduled
  • 24 Sep 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Achal Shah

Achal Shah

Rebuilding Tecton's Realtime Compute stack (twice)

Overview: This tech talk proposes to dive into the evolution of Tecton’s real-time compute stack, a journey that started with sidecar processes, moved through serverless architecture, and ultimately matured into a native service deployed on virtual machines (VMs). The session will (hopefully) outline the challenges, lessons learned, and engineering decisions made at each stage. more
  • 2 comments
  • Confirmed & scheduled
  • 05 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Jatin Katyal

Jatin Katyal

Cricket Match from a Devops Lens

Tsunami Traffic, Traffic Avalanche, Hockey Stick Curve are some of the common terms laid out as the benchmark for developing systems at scale. As the first hire for the Jiocinema Devops team @ Viacom18 I got the opportunity to work on breaking these benchmarks while maintaining our Infrastructure on a cluster of Kubernetes clusters ;) more
  • 9 comments
  • Confirmed & scheduled
  • 27 Sep 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Swapnil Dubey

Swapnil Dubey

Changing DevOps landscape in FinOps world.

Cloud computing benefits organizations in many ways. The benefits are so numerous that it makes it almost impossible not to consider moving business operations to a cloud-based platform. Easier said than done, multiple organizations get trapped in the pricing model – “Pay as you go”, and this not well understood, has resulted in wastages. As per Finops surveys, the key priorities for 2024 are as … more
  • 5 comments
  • Submitted
  • 26 Sep 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Shubham Kumar

Shubham Kumar Author

Lessons from Optimizing Cloud Costs and Improving System Performance to deliver 9M+ Radiology Reports

Introduction 5C Network is a leading AI-powered platform in the healthcare space, specializing in radiology and medical imaging. We manage and process large volumes of medical data, including over 1 billion DICOM objects, providing critical diagnostic services across India. Our focus on leveraging cutting-edge cloud and AI technologies allows us to deliver high-quality, cost-effective healthcare … more
  • 10 comments
  • Submitted
  • 27 Sep 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Rammohan Thirupasur

Rammohan Thirupasur

Agentic AI Security - An idea whose time has come !!

I intend to deep-dive the security landscape of AI agents. Not many have ventured in to this space. This talk will be niche, original & cutting-edge. more
  • 10 comments
  • Submitted
  • 22 Sep 2024
Submission type: 40 min talk Track in which your submission fits: Security

Dr Sashank Dara

Reimagining Vulnerability Management with AI: A Complete Lifecycle Approach

Abstract Whether it is network vulnerabilities, application security issues or OS level misconfigurations the sheer volume of findings is simply overwhelming to administrators. Prioritizing and remediating them is daunting task given the short number of security experts out there who can intrepret and mitigate them accurately. more
  • 4 comments
  • Submitted
  • 19 Sep 2024
Submission type: 40 min talk Track in which your submission fits: Security
Arun

Arun

AWS Cloud Cost Optimization at Xflow

Introduction As cloud adoption surges, optimizing costs is vital, especially for startups. This article shares Xflow’s successful strategies that slashed our AWS bill from ~$25,000 to ~$11,500, saving ~$12,500 monthly after adjusting for additional non-AWS costs, resulting in a savings of ~$150,000 annually! more
  • 5 comments
  • Submitted
  • 01 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Abhishek Tripathi

Parser Combinators - Embedding Zig language in Elixir

Elixir is a high level language known for fault tolerance. What if you need to write some parts of your project in a low-level language like Rust or Zig? more
  • 2 comments
  • Submitted
  • 21 Sep 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Chandrapal Badshah

With Infinite Scale Comes Infinite Bill (and Bankruptcy)

What can a bored hacker do with $5? They can do one of the below - buy a coffee, subscribe to some video streaming service or make your company bleed 10s to 100s of dollars in cloud bills. more
  • 2 comments
  • Confirmed & scheduled
  • 03 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Security
Arya ketan

Arya ketan

Millions Saved, Lessons Learned: The Cloud Cost Optimization Blueprint

Optimizing cloud costs isn’t just about deploying tools and applying technical best practices—it’s fundamentally tied to a shift in organizational culture, a strategic and deep understanding of latest technology, and the identification of both quick wins and long-term investment areas. more
  • 2 comments
  • Confirmed & scheduled
  • 05 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
cvam

cvam

[Linux kernel devsprint/ sending patches to mainline kernel]

I’ll describe the following in detail, and this session will provide a hands-on experience for the attendees. more
  • 5 comments
  • Confirmed
  • 07 Oct 2024
Submission type: Hands-on workshop Track in which your submission fits: Systems engineering

Lalit Kumar

[Add a catchy title here]

How security teams can leverage GenAI to help them optimize security operations, we will demonstrate threat mitigation with GenAI, Attendees will walk away with the code to build their own GenAI enabled threat mitigation tool. more
  • 1 comment
  • Submitted
  • 07 Oct 2024
Submission type: 15 mins demo or experience report Track in which your submission fits: Security

Sai Sandeep Rangisetti

From Open Access to Hardened Security: Flipkart's Path to Secure Production Access

Flipkart, having grown from a startup to India’s largest e-commerce platform, has continually evolved its security posture to meet the demands of a dynamic, large-scale infrastructure. From an initial state of open access to all developers, our cloud environments have steadily advanced, to a state of centrally orchestrated, timebound, audited and restricted production access. more
  • 1 comment
  • Submitted
  • 08 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Security
Ajay Ravichandran

Ajay Ravichandran

Sobering Noisy Background Jobs at Scale

Background jobs, more commonly referred to as asynchronous tasks or jobs, are a technique in software development for managing tasks that can be executed independently of the primary user interaction or request-response cycle. Background jobs are utilized to enhance system responsiveness, manage time-consuming tasks, and offload resource-intensive operations from the main application thread or pr… more
  • 15 comments
  • Submitted
  • 10 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Madhavi Madangopal

Madhavi Madangopal

Cost Optimisation - Big Impact from Small Changes

Abstract The usual approach to cost optimization is to focus on large scale infrastructure changes that lead to savings in the order of millions of dollars. However, there could be seemingly insignificant issues requiring small configuration fixes, which are often overlooked, but can cumulatively cause significant over expenditure. Overall cost optimization can only be achieved by addressing such… more
  • 7 comments
  • Submitted
  • 10 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Abhimanyu Dhamija

Abhimanyu Dhamija

Github Action CI Security with BOLT

Abstract: CI systems are the security orchestration centre of the SDLC but CI itself has become an attack surface as Solarwinds and Codecov attacks have shown. more
  • 2 comments
  • Confirmed & scheduled
  • 11 Oct 2024
Submission type: 15 mins demo or experience report Track in which your submission fits: Security
Jaideep Khandelwal

Jaideep Khandelwal

ABC of LLMOps - What does it take to run self-hosted LLMs?

LLMs and generative AI have made their way into our day-to-day operations. While the wrappers over GPT are a good starting point, I was intrigued by what it takes for an SRE to understand the domain, identify its operational aspects, and build runbooks around running self-hosted LLM models. more
  • 6 comments
  • Confirmed & scheduled
  • 11 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Kumar Tej Gedala

Kumar Tej Gedala

Navigating the Scale: How Design Patterns Power Our Infrastructure

Abstract Automation and modernization are foundational to support large infrastructure like that of LinkedIn. Managing a fleet of half a million servers(~500K) across our private data centers at LinkedIn is no small feat. This immense scale demands infrastructure solutions that not only expand capacity but also ensure performance, reliability, and efficiency as we grow. Building for scale involve… more
  • 5 comments
  • Submitted
  • 11 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Siddharth Balyan

Siddharth Balyan

Chatting with Logs: An exploratory study on Finetuning LLMs for LogQL

Abstract Monitoring and observability tools are a cornerstone in debugging processing for any large organization. more
  • 7 comments
  • Confirmed & scheduled
  • 12 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Sudheer Srinivasa

Sudheer Srinivasa

Unleashing the Power of Serverless with AWS: A Practical Guide to Building and Scaling Applications

In today’s fast-paced cloud environments, serverless architectures have become a powerful tool for reducing operational complexity while maximizing scalability and efficiency. This session focuses specifically on the Serverless Framework with AWS, providing a hands-on walkthrough of deploying and managing a fully serverless application. more
  • 3 comments
  • Submitted
  • 12 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Preeti

Preeti

Building OpenTelemetry Pipelines

Introduction The observability stack in modern organizations often consists of multiple vendors handling logs, metrics, and traces. This results in inconsistent data formats and conventions, increasing the operational overhead of maintaining these pipelines. more
  • 9 comments
  • Confirmed & scheduled
  • 14 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Sachin

Malicious Hallucinations: Hidden Threats with Indirect Prompt Injection

Large language models (LLMs) are known to generate unintended inaccurate responses, often called hallucinations. Most of these are harmless mistakes, like Google AI Overview suggesting to eat a rock a day. There’s a more concerning possibility: what if an attacker could deliberately cause specific hallucinations? This could allow the stealthy spread of targeted disinformation. more
  • 1 comment
  • Confirmed & scheduled
  • 15 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Security
Abhisek Datta

Abhisek Datta

Paving the Path for Secure Software Engineering for Startups

Startups must move fast. Does this mean compromising on security? Everyone will choose security but no startup will have the resources to establish a matured security program from inception. How do you move fast while staying secure even when you have code contributions from interns, software engineers of different experience levels and multi tasking founders? more
  • 3 comments
  • Confirmed & scheduled
  • 15 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Security

Srujan A

Security illusions and events mayhem

Description: In this 40 minutes, we will start off with a brief introduction of DevSecOps and spend almost 30 minutes on the critical role of Security Information and Event Management (SIEM) components within the DevSecOps framework. more
  • 3 comments
  • Submitted
  • 15 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Security
Arpit Bhayani

Arpit Bhayani

How we made DiceDB a truly real-time reactive database

The talk will deep dive into how we built DiceDB and made it a truly real-time reactive database by eliminating polling inefficiencies. The talk will touch on the internal arch of Redis, persistent connections, and leveraging Pub/Sub patterns to enable instantaneous data flow at low latency and high throughput. more
  • 3 comments
  • Confirmed & scheduled
  • 15 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Sathvik Kallepalli

From Zero to Hero: Building Cloud Security Maturity in Fast-Growing Startups

From Zero to Hero: Building Cloud Security Maturity in Fast-Growing Startups 🚀🔒 more
  • 1 comment
  • Submitted
  • 15 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Security
Mallikarjun

Mallikarjun

Building complex systems with k8s operator pattern using kubebuilder

Kubernetes operators are software extensions to Kubernetes that make use of custom resources to manage applications and their components. The operator pattern aims to capture the key aim of a human operator who is managing a service or set of services. Human operators who look after specific applications and services have deep knowledge of how the system ought to behave, how to deploy it, and how… more
  • 2 comments
  • Confirmed
  • 16 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Srinivas Devaki

Srinivas Devaki

Art of Caching: Ways, Wins, Woes, Weird, Wisdom

TLDR; An advanced exploration of war stories from building caching systems at a decacorn. more
  • 4 comments
  • Confirmed & scheduled
  • 17 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

shantanu joshi

Break Knowledge Silos with AI

Effectively Learning from past incidents is crucial to improving MTTR. Despite implementing blameless postmortems, runbooks, collaborative incident responses, and on-call handoff meetings, organizations struggle to effectively share and leverage collective knowledge. more
  • 0 comments
  • Submitted
  • 17 Oct 2024
Submission type: Product pitch - for MVPs Track in which your submission fits: MVP pitches

Sanjaykumar S

Mitigating Emerging Threats in LLM Security

Abstract Large Language Models (LLMs) are reshaping industries by powering advancements in customer interactions, content generation, and critical business operations. However, these advancements come with significant security challenges, such as data leakage, prompt injection, model inversion, data poisoning, and ethical concerns related to accountability and transparency. These vulnerabilities … more
  • 1 comment
  • Submitted
  • 17 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Security
Rohit kumar

Rohit kumar

Building a Scalable PII and Secrets Detection Framework Across Modern Infrastructure

Introduction: This security tech talk proposes to dive into the evolution of detecting and securing Personally Identifiable Information (PII) and secrets across complex infrastructures. Sensitive data, such as PII and secrets, can be found anywhere—from logging services like Grafana, SaaS apps like Slack and Microsoft Teams, cloud buckets, or even employee desktops and shared folders. As security… more
  • 2 comments
  • Submitted
  • 17 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Security
Mallikarjun

Mallikarjun

Case Study: Handling Multi DC constructs with Apache HBase

Apache HBase is an open-source non-relational distributed database modeled after Google’s Bigtable and written in Java. It is developed as part of Apache Software Foundation’s Apache Hadoop project and runs on top of HDFS, providing Bigtable-like capabilities for Hadoop. more
  • 5 comments
  • Submitted
  • 18 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Nitish Goyal

Nitish Goyal

[Realtime Metrics Ecosystem @ PhonePe - How we handle more than 400 billion metrics a day]

Have you ever experienced an abrupt service shutdown in production due to the inability to monitor CPU utilization and memory spikes post-deployment? If so, you understand the critical importance of service metrics monitoring. more
  • 6 comments
  • Submitted
  • 21 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Priya Ananthasankar

Priya Ananthasankar

Migrating Distributed Systems Infrastructure: Methodology and Insights

Any long running infrastructure in production, reaches its constraints over time. It requires a timely migration to avoid getting into the vicious cycle of tech debt, where upgrading a system evokes fear and reinforces the belief that change is futile. more
  • 4 comments
  • Submitted
  • 22 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Abhishek Gaddhyan

[Deploy Multicluster Kafka on Kubernetes using Strimzi Operator]

Strimzi is a tool with which a full fledged Apache Kafka-cluster including Apache ZooKeeper can be set up on Kubernetes or OpenShift. Strimzi is an open-source project,CNCF Sandbox Project. more
  • 1 comment
  • Submitted
  • 22 Oct 2024
Submission type: 15 mins demo or experience report Track in which your submission fits: Systems engineering

Rohan Birtia

Securing Kubernetes Posture Without Burning Your Budget Using Open-Source Solutions for Maximum Impact

Describe your talk/session in 2-3 paragraphs In this talk, we’ll explore how can manage our Kubernetes security posture without breaking the bank. An enterprise-grade Kubernetes security posture management(KSPM) tool often comes with a hefty price tag, sometimes nearing a million dollars. For even well-funded startups, dedicating such a budget solely to security can be a challenge. However, there… more
  • 2 comments
  • Submitted
  • 23 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Security
Jothir Ganesan

Jothir Ganesan

SRE - A Pioneering Innovation

SRE being a pioneering innovation Reiteration of the golden principles of SRE more
  • 1 comment
  • Submitted
  • 23 Oct 2024
Submission type: Sponsored talk (for sponsors only) Track in which your submission fits: Systems engineering
Vishnu Naini

Vishnu Naini

Santanu Sinha

Santanu Sinha

Drove: A Simple, Performant, and Operations-Friendly Container Orchestrator

We shall discuss Drove, a simple container orchestrator developed at PhonePe that focuses on efficient resource utilization, container performance, straightforward compliance and security models, and ease of management. At PhonePe, containers running on Drove clusters, deployed across our multiple Data Centers and cloud, handle millions of requests per second and power all services and apps acros… more
  • 9 comments
  • Submitted
  • 23 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Srikanth Venugopalan Author

Object storage for new use cases through Indexes on lakehouses

ABSTRACT: Object storage has been around for a long time. While it is a cheap and scalable storage option, it has been traditionally limited to use cases such as storing unstructured data, or as a blob storage for binary data. With data footprints growing at an exponential rate, object storage is being used for a class of use cases that were previously thought to be impossible. While the most wel… more
  • 1 comment
  • Submitted
  • 24 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Lakshminarasimhan Sudarshan Author

Using Java in low-latency applications

Java is not what we think of immediately when it comes to low-latency applications - this is typically the realm of C/C++/Rust, etc. In E6data, we use Java in many parts of the engine and have successfully used it in cases where we need high performance and low latency. more
  • 1 comment
  • Confirmed & scheduled
  • 24 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Ishika Mittal

From Logs to Insights: Building a Scalable Monitoring System with Loki and Mimir

Monitoring and collecting metrics and logs is essential for product and service improvement, particularly at scale due to the vast data size and diverse stakeholders. At e6data, we began our journey with our Gen3 Lake House native compute engine in a “Cloudprem” model. Metrics and logs are crucial for understanding customer interactions, such as query success rates and the reasons behind query fa… more
  • 3 comments
  • Submitted
  • 24 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Kanika Khetawat

Santanu Sinha

Santanu Sinha

Spyglass: Graph based automated RCA tool

Abstract In a microservices architecture, detecting issues quickly becomes a challenge with high scale. At PhonePe we handle about a million requests per second on the edge. This translates to tens and hundreds of millions of service calls across thousands of service containers across the system. Traditional detection mechanisms like distributed tracing typically generate too much data for easy m… more
  • 3 comments
  • Submitted
  • 25 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Kaushik Thirthappa

Kaushik Thirthappa

Panic vs. Precision: Diving Deep into Alerts

Overview Incident management is a critical aspect of operational success, yet many organizations find themselves grappling with repeated incidents and alert fatigue. In our experience, over a period of time, panic with each incident reduce; since many teams often encounter the same issues multiple times. This paradox leads to alerts being perceived more as background noise than urgent calls to ac… more
  • 1 comment
  • Submitted
  • 25 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Jothir Ganesan

Jothir Ganesan

HeatWave Service - Effective Incident Management with TOS:IM

Abstract: The responsibility of SRE more than providing to resolution to the incident. This talk is to explain about the process of Effective Incident Management by having a structured incident management framework that we have in HeatWave Service (aka Oracle Cloud Infrastructure MySQL Service) with various techniques of both internal and external evaluation. more
  • 0 comments
  • Submitted
  • 26 Oct 2024
Submission type: Sponsored talk (for sponsors only) Track in which your submission fits: Systems engineering
Navin Govindarasu

Navin Govindarasu

Getting Started with Azure’s Cloud-Native Security - The Thoughtworks Way

I’ll be drawing from my experience as a Senior Consultant at ThoughtWorks to guide you through Azure’s most powerful security tools and services, transforming how you approach cloud security. Whether you’re securing a small project or protecting large-scale infrastructure, Azure’s cloud-native security offerings provide everything you need to stay one step ahead of potential threats. more
  • 0 comments
  • Submitted
  • 27 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Security
Navin Govindarasu

Navin Govindarasu

Securing the SDLC with a Shift-Left Security Approach - The Thoughtworks Way

Abstract In today’s fast-paced digital world, security must be a priority, not an afterthought. Adopting a “Shift-left” approach means integrating security early in the software development lifecycle (SDLC). This talk will discuss the importance of early security integration, the challenges organizations face, and how to implement security tools throughout the development process to improve appli… more
  • 0 comments
  • Submitted
  • 27 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Security
Robin Tak

Robin Tak

Rollup: Managing 300 Billion Daily Metrics at PhonePe

Overview The Metrics Platform enables our engineers at Phonepe to monitor their services around the clock. This platform stores and serves the data that powers Grafana dashboards and the anomaly detection alert infrastructure. All metrics are stored in time series database - OpenTSDB, a well-established project in the open-source domain. more
  • 4 comments
  • Submitted
  • 28 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Piyush Singh

Streamline Multicloud Infrastructure Management with zop.dev

Managing infrastructure across multiple cloud providers can be complex, with each platform having its own tools, configurations, and networking challenges. In this talk, we will introduce zop.dev, a comprehensive platform designed to simplify multicloud infrastructure provisioning, management, and observability. Whether you’re deploying applications on AWS, Google Cloud, or Azure, zop.dev offers … more
  • 1 comment
  • Submitted
  • 28 Oct 2024
Submission type: Product pitch - for MVPs Track in which your submission fits: Systems engineering
Chakravati Singh

Chakravati Singh

Bhavin Modi

Empowering Mobile UI Automation at Scale: Dynamic Emulator Creation with PhonePe's Drove Infrastructure

Abstract This tech talk dives into PhonePe’s journey on scaling its UI Automation capabilities by building a comprehensive platform to support its fleet of applications and use cases. Investment in automation was made primarily for two reasons: Improving Product Quality and Improving Org Efficiency. more
  • 5 comments
  • Confirmed & scheduled
  • 28 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Snehasish Roy

Clockwork: The Backbone of PhonePe’s 2 Billion Daily Jobs

Overview Have you ever had an alarm fail to wake you up, causing a ripple effect of chaos in your morning? At PhonePe, we understand the criticality of such ‘alarms’ in our digital ecosystem. more
  • 7 comments
  • Submitted
  • 28 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Akash Sethiya

[Add a catchy title here]

Abstract Observability is a key component of resilient and dependable systems. Working with clients like HDFC Bank, Amazon Pay, Zomato - We can’t imagine running blind. more
  • 1 comment
  • Submitted
  • 28 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Snehasish Roy

Zero Downtime, Zero Compromise: How PhonePe's DocStore Handles Billions of Documents

Overview Ever wondered what happens when millions of PhonePe users share documents, buy insurance, or upload KYC information? Enter DocStore - the powerhouse behind PhonePe’s massive document operations. This home-grown object storage platform seamlessly handles thousands of critical transactions, from instant chat attachments to vital insurance documents, powering both customer experiences and d… more
  • 7 comments
  • Submitted
  • 28 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Pallavi Pareek

Creating Safe Workspaces for Global Teams

It is important for organisations, technical and non technical to give space and dignity to the employees. Ungender/GetConduct is suite of applications that help organisations to educate, train on acceptable behaviour and report unacceptable advances or harassment, as per the law across countries. While ours is a technology company, our mission is to ensure safe, conducive environments for all ge… more
  • 0 comments
  • Submitted
  • 28 Oct 2024
Submission type: Product pitch - for MVPs Track in which your submission fits: MVP pitches
Sandesh Kumar Gupta

Sandesh Kumar Gupta

Building Intelligence and resilience for highly available managed DbaaS platforms

Objective At Flipkart, we have seen the huge adoption of the home grown managed platforms running as multi-cloud setup by all the engineering teams working at massive scale, and DbaaS platforms are protagonists of this story. It becomes paramount that these platforms can maintain high resilience, high availability to deliver sustained performance and continuous optimisations to handle adoption at… more
  • 2 comments
  • Submitted
  • 28 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Abhinav Sarkar

Abhinav Sarkar

Managing Personal Servers with Nix

We software developers work on big scalable complex distributed systems in our day job. But some of us also like to run small personal servers to run some software for personal use, and we’d like this setup to be simple, yet reliable. Enter Nix, which lets us do this declaratively. more
  • 2 comments
  • Submitted
  • 28 Oct 2024
Submission type: 15 mins demo or experience report Track in which your submission fits: Systems engineering

Agastya Dev Addepally

Terraform Custom Module Management: A simple CLI tool solving a tech debt landmine ready to happen

Terraform custom modules are the cornerstone of most IaC implementations. In places where they are extensively used, it often leads to a state where you’re not able to track the custom module versions upstream leading to issues such as: more
  • 6 comments
  • Submitted
  • 28 Oct 2024
Submission type: 15 mins demo or experience report Track in which your submission fits: Systems engineering
Umed

Umed Speaker

Krishna Prasanth Co-Author

PPEC Agent: Streamlined VM Management from Creation to Optimization

Overview: (2 mins) This tech talk explores the design and evolution of the PPEC Agent and PPEC Proxy, which form the backbone of a virtualization stack leveraging libvirt, KVM, and QEMU to manage virtual machine (VM) creation, disk attachment, and performance tuning of VMs on bare-metal systems. The session will outline key engineering decisions, the challenges faced in optimizing resource manage… more
  • 8 comments
  • Submitted
  • 28 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Rolland

Navigating the observability maze

Most orgs have monitoring & observability systems in place … and they are expensive or complex or both. Creating a long-term, cost-effective strategy for observability is not easy: more
  • 3 comments
  • Submitted
  • 29 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Amod Malviya

Amod Malviya

Revisiting Abstractions for Fun & Profit

Abstractions are great! They help us think without being overwhelmed by details. But sometimes they can come at the cost of understanding, where they become a wall that we don’t wish to climb. That limits us, because when they change, a lot of us can’t keep up. more
  • 1 comment
  • Confirmed & scheduled
  • 29 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Sasank Chilamkurthy

Sasank Chilamkurthy

Rootless Linux Operating System

Abstract We have been working on shipping an AI server called JOHNAIC for serving cloud like workloads from this edge server. We are developing our operating system with a specific focus on security and usability for developers. With this OS, we are able to deploy SaaS apps, indistinguisable from cloud and expose them over internet. In this session, I will speak about this operating system in det… more
  • 4 comments
  • Submitted
  • 29 Oct 2024
Submission type: 15 mins demo or experience report Track in which your submission fits: Security

Mourjo Sen

Congestion control in web services

TCP achieves reliable communication over an unreliable network (IP). It does so by abstracting the underlying network and observing only the sender and the receiver. This is a cornerstone for the internet’s success and is called the “end-to-end principle”. more
  • 2 comments
  • Submitted
  • 29 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Rohit Raveendran

Rohit Raveendran

Beyond CI/CD: Building Platforms for complex engineering setups

Abstract Remember when every team had their own CI/CD pipeline. Every CI/CD pipeline becomes a distributed monolith - a tangle of Jenkins jobs, GitHub Actions, and custom scripts that only the original team understands. more
  • 1 comment
  • Submitted
  • 29 Oct 2024
Submission type: 15 mins demo or experience report Track in which your submission fits: Systems engineering

Anirudh Singh

Powering Real-Time Gameplay at Scale: Managing Cassandra in Production at Quizizz

Abstract Operating Cassandra at scale for mission-critical workloads presents unique challenges. This talk explores the strategies we use at Quizizz to maintain a resilient Cassandra cluster for real-time gameplay, ensuring high availability and low latency. We discuss architectural considerations for building a scalable, fault-tolerant infrastructure, with a focus on data modeling, performance m… more
  • 1 comment
  • Submitted
  • 29 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Raj Suvariya

Maintenance operator for TiDB running on Kubernetes

Description: TiDB is distributed SQL and horizontally scalable datastore developed by PingCAP. TiDB makes it easy to deploy and run database clusters on kubernetes by providing a official TiDB operator. However, this operator generally assumes that the database is running on network attached disk, which makes it easier for operator to not worry about compute failures - as compute can be reschedul… more
  • 0 comments
  • Submitted
  • 30 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
nitin kumar

nitin kumar

Finding Needles in a Million RPS Haystack : Solving Performance Problems with eBPF

Overview At PhonePe, our in-house API gateway handles over a million requests every second. When you operate at this scale, you encounter performance challenges that are impossible to spot during testing. Using eBPF as our debugging tool, we not only solved these issues but also saved millions in yearly infrastructure costs. more
  • 4 comments
  • Confirmed
  • 30 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Nidhi Agarwal

Nidhi Agarwal

Enhancing resiliency through CI/CD at Zomato: Advanced Automation and Real-Time Safeguards

Abstract Building a CI/CD pipeline capable of supporting 700+ engineers, and managing 600+ deployments across 300+ services daily is essential at Zomato’s scale. Efficient CI/CD pipelines are critical for streamlining the development process and ensuring secure deployments. more
  • 4 comments
  • Submitted
  • 30 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering
Nabarun Pal

Nabarun Pal

Building universe scale control planes the Kubernetes way

Kubernetes has solidified its core technology status in the field of infrastructure software. As per CNCF Annual Surveys, 66% of potential/actual consumers were using Kubernetes in production and an additional 18% were evaluating it. End users of Kubernetes are moving towards hybrid cloud architectures for flexibility, security, cost optimizations, scalability and performance. A staggering 43% of… more
  • 0 comments
  • Submitted
  • 30 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Akshay Sethi

Practical tips for building AI applications using LLMs - Best practices and trade-offs

Overview At KushoAI, we’ve built an AI agent that can autonomously perform API testing for you. While building this, we came across a lot of problems specific to AI applications built on top of LLMs that you don’t see anywhere else. Since this is a fairly new area of development, we had to spend a lot of time figuring out solutions for them on our own. more
  • 0 comments
  • Submitted
  • 30 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Tarushi Bhandari

Skynet for Incidents: Intelligent Incident Management using Ansible Playbooks and RAG based LLM

Idea Deep observability is crucial for any software system. This is to ensure that in cases of failure, we have visibility allowing us to solve issues quickly. To help simplify the handling of these issues, we can create automated runbooks—predefined solutions to common problems which can significantly reduce the time it takes to resolve incidents. more
  • 4 comments
  • Submitted
  • 30 Oct 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Bharadwaj Embar Speaker

Building a seamless hybrid cloud with Kubernetes and Clutch

This talk outlines how we use Clutch as a unified entry point for developers to deploy and monitor workloads across Kubernetes clusters in hybrid cloud environments. By integrating Clutch with ArgoCD, organizations can streamline multi-cloud operations, automate migrations, and proactively manage workload health. more
  • 2 comments
  • Confirmed & scheduled
  • 08 Nov 2024
Submission type: 40 min talk Track in which your submission fits: Systems engineering

Hosted by

We care about site reliability, cloud costs, security and data privacy

Supported by

Platinum Sponsor

Nutanix is a global leader in cloud software, offering organizations a single platform for running apps and data across clouds.

Platinum Sponsor

PhonePe was founded in December 2015 and has emerged as India’s largest payments app, enabling digital inclusion for consumers and merchants alike.

Silver Sponsor

The next-gen analytics engine for heavy workloads.

Sponsor

Community sponsor

Peak XV Partners (formerly Sequoia Capital India & SEA) is a leading venture capital firm investing across India, Southeast Asia and beyond.

Venue host - Rootconf workshops

Thoughtworks is a pioneering global technology consultancy, leading the charge in custom software development and technology innovation.

Community Partner

FOSS United is a non-profit foundation that aims at promoting and strengthening the Free and Open Source Software (FOSS) ecosystem in India. more

Community Partner

A community of Rust language contributors and end-users from Bangalore. We have presence on the following telegram channels https://t.me/RustIndia https://t.me/fpncr LinkedIn: https://www.linkedin.com/company/rust-india/ Twitter (not updated frequently): https://twitter.com/rustlangin more