Strategies to achieve high uptime at scale. The points this talk is going to cover are:
- A real-life case study
- Cloud Architecture
- Immutable infrastructure
- Infrastructure as code
- Secrets Management
- Service Discovery
- Container management and scheduling
- Blue Green Deployment
- Observability
- About Me, My Company and My Situation
- Set context for the rest of the headlines
- Touch up on the legacy setup and infrastructure so that people can put the upcoming points in contrast with it
- Causes of our downtime
- Architecture
- Poor Provisioning Practices
- Lack of Monitoring
- Missing Backups, DR and BC
- Poor Technical Choices
- Storing Data on single node
- Scaling storage with LVM
- Node local cache for distributed apps
- Cyclic API calls
- Security
- Checked in secrets
- Publicly accessible resources
- Outdated and vulnerable versions of tools
- Lack of Documentation and Testing
- Takeaway - Typical problems faced in a poorly architected infrastructure
- Architecture
- What’s wrong with it
- Designing immutable infrastructure
- Poor Provisioning Practices
- What’s wrong with it
- Provisioning immutable resources with Terraform
- Deploying and Configuring services in immutable fashion
- Monitoring
- What’s wrong with it
- Implementing Observability
- Backups, DR and BC
- What’s wrong with it
- Automated backups with redundant copies
- Poor Technical Choices
- What’s wrong with it
- Fixing the mistakes made so far
- Lack of Documentation and Testing
- Summary
https://github.com/Gufran
https://twitter.com/freatbeater
https://www.linkedin.com/in/mgufran/
{{ gettext('Login to leave a comment') }}
{{ gettext('Post a comment…') }}{{ errorMsg }}
{{ gettext('No comments posted yet') }}