Production is Priority - Self Fix / Heal Techniques

May 2014

12 Mon

13 Tue

14 Wed 10:00 AM – 06:30 PM IST

15 Thu 10:00 AM – 06:30 PM IST

16 Fri 09:30 AM – 10:30 PM IST

17 Sat 09:30 AM – 06:15 PM IST

18 Sun

Make a submission

The Energy & Resources Institute, Bangalore

As Developers / Managers we almost everyday think and talk about faster / shorter Software Development cycles to increase our market presence/reach. Is there a way to measure how fast we are ?

Speaking of cycle: In Cycling a term “Cadence” is used, which simply means the speed at which you pedal. Cyclists measure this in revolutions per minute, or rpm. Similar to Cadence in Cycling, the cadence of a software team is measured by how fast and how frequent you can take your software live. Can you do this on every day, every week ? Do you have the tools for the same to Scale UP ?

While we try to improve the cadence of the team we have many challenges around Infrastructure Scaling, Test Integration, Configuration Management, Monitoring for uptime, Log Management, Security of Servers, Dev-Test-Prod setups, Maintaining single source of truth for your assets, etc… And how does these changes impact team dynamics ? If you have adopted some strategies have you noticed that your team has improved? do you need more QAs or do you need more sysadmins ? do you really need those many routers, servers or backups?

Rootconf is a conference which tries to address some of the challenges we face when we fine tune our infrastructure to be able to appropriately respond to a business need, while we Scale UP our Cloud or Web Infrastructure.

Developing a good Continuous Integration/Deployment/Testing/Delivery strategy is critical to improve the cadence of your team. Infrastructure and DevOps is an upfront investment human, time & money. The challenge always is whether you’re willing to make that investment right away, or in the future at a much higher cost and effort.

Rootconf is a conference which will help you to plan and develop a strategy map for infrastructure and devops. It will show you the building blocks for reaching a strategy for Infrastructure Scaling, Continuous Integration, Deployment and Delivery.

Target audience

Rootconf is targeted at individuals, teams and companies that are seeking to scale the effectiveness of their developer teams and performance of their web stacks, thereby increase the Cadence of their software delivery.

Organizations which need a CI and CD strategy to achieve the above will find a substantial headstart in doing so, by attending Rootconf.

Venue

Workshops

14th and 15th May 2014
The Energy and Research Institute,
4th Main Rd, Domlur II Stage,
Domlur, Bangalore

Conference

16th and 17th May 2014
MLR Convention Centre,
J P Nagar 7th Phase,
Brigade Millenium campus,
Bangalore

Tickets

http://rootconf.doattend.com

Online Presence

Website | Facebook | twitter

For questions about submissions or the conference, write to support@hasgeek.com

Theme

For Rootconf 2014, we are accepting proposals for Full Talks, Crisp Talks & Flash Talks for the Conference, and proposals for hands-on 3 hour workshops on the below topics. For more information on the types of talks, please checkout the Format tab.

Infrastructure Scaling & Automation
- Treating your infrastructure as code.
- How did you do scaling and what were your automation strategies while you were gunning for scaling.
Continuous Integration
- Tell us how you have done it for your organization ?
- Any use case around how it impacted your development team / process.
- Reference Tools – Jenkins, Travis CI, CruiseControl, TeamCity.
Deployment
- Tell us how you have done it for your organization ?
- Any use case on how you reduced your deployment time ? Did you reduce your time to market your product by Adopting CD ?
- Reference Tools – Chef, Puppet, Ansible, Salt
Automating Testing
- How much manual can be automated ?
- How did you automate ? What tools di you use ?
- What framework(s) did you use ?
- Did you use heavy weight Selenium or Watir or Sahi?
- Tools that work across heterogeneous languages (PHP, Java, C, Mobile)
Security
- Code Security
  - Trust no one - including the developer.
  - How are you testing your code ?
  - Do you run vulnerability testing part of the CI ?
  - Best Practices for secure coding
- Server side security
- Data at motion
  - Is internet really safe, how do you protect your data. Is HTTPS alone sufficient ?
- Data at rest
- Do you need to implement standards?
Log monitoring and server monitoring
- The heartbeat / lifeline of your business: tell us more about how you monitor.
- Do you use any of these tools? Graphite, Sentry, CopperEgg, Loggly, Papertrail, Splunk, Nagios, Monit, etc..
Cloud databases:
- NoSQL Databases (DynamoDB, MongoDB, Couch)
- The good and bad of NoSQL
- Automation challenges of NoSQL
Self-healing
- Automatic remediation of services and servers.
- Process Protection using Service Protector, Monit
- Auto Scaling Groups
New tools
- Do you have more tools that makes you a better DevOps Engineer ?

Talks can submitted for the following OSes:

Windows
Linux
Cross-platform

Hosted by

Rootconf

Rootconf is a community-funded platform for activities and discussions on the following topics: Site Reliability Engineering (SRE). Infrastructure costs, including Cloud Costs - and optimization. Security - including Cloud Security. more

All submissions

Previous Next

This submission has been added to the schedule

Production is Priority - Self Fix / Heal Techniques

Submitted Jan 29, 2014

Section: Full talk Technical level: Intermediate

To understand how to

Monitor systems
a. Nagios
b. Ganglia
Analyse Root cause
Automate the fix
Log / Record Incidents

Outline

Production systems are always P1 and keeping them up & scaling them is what keeps everyone on their toes

How ever we have cracked some important automation that could drastically make a devOps engineers life easier.

We let our 1000+ servers across 4 regions heal by themselves, and let the Operations team focus on bigger tasks that could add more impact to the organisation.

This ensures that we are not doing the same task over and over again, increases productivity and scalability across the application stacks.

A simple example would be something like log rotate, which ensures that we don’t keep cleaning logs every day but it does that task over and over again on your behalf to ensure logs get purged everyday

Question : I have a use-case that does not have a solution in the open source community..

Answer : Customise it.. you would be able to plugging scripts and hooks to fix the problem.

Will discuss on how its done by us!