Rootconf is India's principal conference where systems and operations engineers share real-world knowledge about building resilient and scalable systems.
At the end of this course, participants will be familiar with the Domain Name System and Security Extensions to the Domain Name System (DNSSEC). The course is taught “hands-on” in a virtualized FreeBSD environment using the BIND, NSD and Unbound name server implementations. Participants will configure authoritative and recursive domain name servers and will learn to analyse and debug common misco…
This session offers insights into how to stress test your cloud infrastructure and/or application and how to plan autoscaling to deal with high traffic. It is most helpful for e-commerce sites and other providers who have to handle high traffic.
This talk will primarily focus on something really simple: how to set up a Docker-based environment for scaling web applications. The articles currently out there give a rather vague description of the process and leave a lot for the reader to work out; a reader who doesn’t know DevOps may struggle. In this presentation, I will give a rock-solid guide as to how you’d want …
While we wish systems never went down, it is prudent to build robust systems for every eventuality. In 40 minutes, the objective isn’t to cover everything, but to focus on the 4 most important aspects of planning Disaster Recovery scenarios.
Build and run fault-tolerant and elastic distributed systems using Apache Mesos. Learn how you can deploy and work with your web apps in the cloud on a cluster of machines while treating the cluster as a single compute unit and keeping the cluster in the cloud as a fault-tolerant unit.
Not much is said about how we as humans deal with failure, especially us Ops folks. Given the ever-distracting nature of our jobs, which pushes and pulls us in multiple directions each day, mistakes and failures are bound to happen. I’d like to share my experience of one such incident and the methods I used to cope with failure, and hopefully inspire others to adopt similar methods a…
To discuss our experience of OpenStack setup and testing using Puppet-based CI/CD: what we did, and when and where it fails. At the end of the session, the audience will know what works and helps to set up an OpenStack cluster with no manual intervention, and what can go wrong in the same OpenStack production cluster.
Everyone talks about the benefits of having an automated monitoring system in place: one which can discover infrastructure components as they are added, monitor them while they are alive and stop monitoring them when they are moved out. But no one has chronicled their journey through the process of automating a manually maintained monitoring system and showcased their battle scars for others to learn from.…
Feature Toggle is one of the key practices for Continuous Delivery, but not enough has been said about it. This session gives an introduction to Feature Toggle, explains its advantages over feature branching, and shares my experience of using it over the last few years.
The sudorank framework defines a simple method of evaluating a prospective candidate’s knowledge and hands-on experience. This is a large improvement on the current ad-hoc methodology, which is a mix of Q&A and pairing.
This talk will focus on the innovations in some of today’s monitoring solutions and how monitoring systems have evolved tremendously in the past few years. Also, given the changes in the application landscape today, we will talk about which things are really important to monitor and why.
To discuss resiliency testing challenges in large-scale cloud deployments and how to automate them (think Chaos Monkey, but with a few key differences).
We are all consumers of services in some form or other. Consider cloud computing: it is easy to use, with less hassle and zero maintenance. But what is happening on the other side? How are they managing to keep the lights on? How are the cogs arranged, and who controls their movements?
Those attending will get a basic idea of the concept of failure induction, an architectural overview of Simoorg, the failure induction framework developed at LinkedIn, and the features Simoorg provides.
Learn how Rundeck can be leveraged to do one-click deploys and be integrated into a CI pipeline. If you use cloud environments, you will be dealing with a large number of servers; Rundeck provides a UI and an API to simplify deployment on cloud and on-prem. We will also look into writing Rundeck jobs that offer self-service to other teams.
This workshop covers how the ELK stack can replace the traditional SOC. Apart from using ELK, participants will create their own custom filters and parsers. We also integrate with an alerting system for visibility and a better understanding of attacks, and use Kibana for statistics on infrastructure resources and attacks.
To discuss the internal architecture of Hadoop (HDFS), HBase and Hive. I will also discuss how we designed our data in Hive and HBase based on our needs, what problems we faced in the production cluster, and how we made it scalable and highly available.
To explain the state of the HTTP protocol right now, going through a bit of history, its evolution and the current HTTP/2 implementation, with a small demo of performance improvements at the end.
Coming from a traditional LAMP stack, this talk focuses on how and why we decided to rewrite our core platform in Golang. I’ll also stress the learnings and heartbreaks along this journey.
Systems do fail. There are a multitude of components that could fail at any time. Therefore, one could deliberately introduce factors that might lead to failure, and thus eliminate one angle of a possible future failure. This talk aims to provide some such ideas.
Understanding the life cycle of a Kafka request. Understanding how a trivial change (a metrics addition) caused a Kafka cluster to crumble under high load, causing frontend user impact (KAFKA-2664).
The talk is intended to help understand how to scale operations and developers with the help of cultural and process changes that allow you to innovate faster with higher accountability and reliability.
Discussing the problem that I faced in making web services go live in a hackathon and the workflows I adopted to push apps to the server really fast. I mean really, really, really fast!
While developing systems, we make assumptions about the environment that the system would run in. These are stories about how the real world broke our assumptions, the road(s) to debugging and how my server-side DevOps experience fit right into fixing low-power wireless IPv6 mesh networks so that we could eventually-
In this talk I will provide a survey of these more non-traditional services, I’ll give my honest opinion on which ones I think are ready for production, and I’ll provide a live demo of using one or two of them.
Sharing all the bits and pieces of learnings gained while I interned at Cisco in the summer of 2015, where I worked on building an open-source install architecture for distributed bare-metal hardware that would be generic enough to be used by other big players.
To get an idea of how FreeBSD helps in building a resilient container infrastructure for web applications. Skipping CARP (Cisco VRRP), which is required for an HA firewall, as this is going to be a crisp talk.
The objective of this talk is to tell people about our experiences with implementing a large Puppet deployment in a very traditional enterprise, and to give them tips and pointers on how to do this successfully.
The objective of this workshop is to show attendees how to run a Puppet setup in a production environment. The workshop will include setting up a Puppet code repository from scratch, checking it into a Git repository and making sure it can be deployed properly.
This talk will go through the issues we faced in the Fedora Cloud SIG while testing the cloud images, and how we solved the issues in record time with help from a programming language called ‘Python’. I hope this talk will encourage people to use Python more as a tool to solve problems.
The primary objective of this workshop is to teach the fundamentals of the Go programming language. The workshop will include setting up an environment for Go development, an explanation of syntax and concepts, and problem solving. The participants will be able to write idiomatic and effective Go code after the session. All participants will get a chance to work on a small project.
Teach the benefits of designing software in a crash-only fashion, with examples from large-scale deployments at DigitalOcean, and how to design for graceful handling of failures.
The talk introduces Chaos Engineering to the audience, and talks about how complex distributed systems fail in large-scale internet services. The talk also discusses design patterns for making highly resilient distributed systems which can heal from transient failures.
Microservices are the talk of the town, and newer tools and frameworks like Kafka, Consul and gRPC convince us to be armchair architects. But let’s take a step back to understand the common design principles of a services architecture and the commonality between Unix and microservices.
Understand how to achieve maximum efficiency from your infrastructure using pre-emption-friendly Docker applications and load-aware container scheduling.
The MySQL ecosystem is 21 years old, which means there are plenty of solutions for you to use, or stuff that you may have used. Learn from best practices over the years, and use tools that are current for your next deployment.
Running MySQL? Replication is easy to set up. But how do you handle failover? How do you keep running smoothly over time? Learn best practices from the failures of others.
The objective of this talk is to help other industries benefit from the knowledge of the real-time workflow in a TV newsroom. While industries are distinct in their content and area of specialization, I believe that workflow knowledge systems cut across these barriers. Such knowledge systems when shared can enable industries to learn new skills, trigger thought processes on existing practices a…
To build and run a multi-host Docker cluster and to make the containers talk to each other over the network without using any external software-defined network (SDN).
The objective of this workshop is to highlight the challenges of generic JSON parsing in a strongly typed language such as Golang. At the end of this talk, participants will be able to design config-driven applications, with strong validation and friendly error logs.
Attendees will learn the history and current status of DTrace, as well as a technical overview of how DTrace works and a selection of use cases where DTrace provides valuable debugging information. A number of examples and sample commands will be presented.
The tutorial’s objective is to provide an overview of DTrace and show the student how to become proficient with using DTrace to formulate queries about system behaviour and get the desired answers.