A Beast to Process Kafka Events
Building an event processing library comes with its own baggage.
No data loss takes the highest priority, followed by performance and scalability. We scaled the tool to millions of messages per minute through architecture rather than language choice, while keeping it generic enough to deploy for any schema/table with just a config change.
This talk will walk you through the journey of building this library and the lessons learned along the way.
We built our own event processing library that consumes events from Kafka and pushes them to BigQuery. All of our microservices are event sourced. We handle hundreds of topics, with a few topics peaking at 21K messages/second.
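The core of the no-data-loss guarantee can be sketched in a few lines of Java. This is an illustration of the general at-least-once pattern, not Beast's actual API; the `Sink` interface and all names here are hypothetical stand-ins for the BigQuery writer. The idea: acknowledge (commit) a Kafka offset only after the batch has been safely accepted downstream, so a crash causes a replay rather than a drop.

```java
import java.util.ArrayList;
import java.util.List;

// Sketch of the at-least-once contract (illustrative names, not Beast's real API):
// the consumer commits an offset only after the sink accepts the batch.
public class AtLeastOnceSketch {
    /** Stand-in for a BigQuery writer; may fail transiently. */
    interface Sink { boolean push(List<String> batch); }

    private final Sink sink;
    private long committedOffset = -1; // last offset safely persisted downstream

    public AtLeastOnceSketch(Sink sink) { this.sink = sink; }

    /** Process a batch starting at `offset`; retry until the sink accepts it. */
    public void process(long offset, List<String> batch) {
        while (!sink.push(batch)) {
            // transient failure: retry the same batch instead of dropping it
        }
        committedOffset = offset + batch.size() - 1; // commit only after success
    }

    public long committedOffset() { return committedOffset; }

    public static void main(String[] args) {
        List<String> delivered = new ArrayList<>();
        // A flaky sink that fails on its first attempt, then succeeds.
        Sink flaky = new Sink() {
            int calls = 0;
            public boolean push(List<String> batch) {
                if (calls++ == 0) return false; // simulate a transient sink error
                delivered.addAll(batch);
                return true;
            }
        };
        AtLeastOnceSketch consumer = new AtLeastOnceSketch(flaky);
        consumer.process(0, List.of("e1", "e2", "e3"));
        System.out.println(delivered.size() + " delivered, committed offset "
                + consumer.committedOffset());
    }
}
```

The trade-off is possible duplicates on replay, which is why downstream monitoring in BigQuery (covered below) still matters.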
In this talk, we will cover:
- Why we built our custom event processing tool, Beast
- Customising code for each input/output combination, and our old way of deployment
- Limitations of existing systems for our use case
- Ensuring no data loss
- How we test the application for data loss
- How we monitor data loss in BigQuery
- How we achieved high throughput with acceptable latency
- Architecture: processing with queues (and why we didn't pick Redis)
- Why we couldn't use Go
- How we achieved scalability using Kubernetes
- Load testing
- Chaos testing
Enhancements (ease of deployment):
- Parser to generate config from proto
- Auto-updating the table schema for new fields in the proto
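The queue-based architecture in the outline can be sketched with plain `java.util.concurrent` primitives (illustrative only, not Beast's actual code): a Kafka-poll thread feeds a bounded in-memory queue, and worker threads drain it toward BigQuery. The bounded queue applies backpressure when workers fall behind, which is one argument for an in-process queue over an external store like Redis.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Sketch of queue-based processing (illustrative, not Beast's real code):
// one poll thread feeds a bounded queue; workers drain it toward the sink.
public class QueuePipelineSketch {
    private static final String POISON = "\u0000STOP"; // shutdown marker

    public static int run(int messages, int workers) throws InterruptedException {
        BlockingQueue<String> queue = new ArrayBlockingQueue<>(100); // bounded => backpressure
        AtomicInteger pushedToSink = new AtomicInteger();

        ExecutorService pool = Executors.newFixedThreadPool(workers);
        for (int w = 0; w < workers; w++) {
            pool.submit(() -> {
                try {
                    while (true) {
                        String msg = queue.take();
                        if (msg == POISON) break;       // drain until shutdown marker
                        pushedToSink.incrementAndGet(); // stand-in for a BigQuery insert
                    }
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
        }

        // "Kafka poll" loop: put() blocks when the queue is full, i.e. backpressure.
        for (int i = 0; i < messages; i++) queue.put("event-" + i);
        for (int w = 0; w < workers; w++) queue.put(POISON);

        pool.shutdown();
        pool.awaitTermination(10, TimeUnit.SECONDS);
        return pushedToSink.get();
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(run(1000, 4) + " messages processed");
    }
}
```

Sizing the queue and the worker pool is where the throughput/latency tuning discussed in the talk happens.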
The learnings are generic, irrespective of language.
- Basic understanding of Kafka or pub/sub tools
- Basic use cases for BigQuery
- Basics of building applications in Java
These will make the session more effective.
Dinesh Kumar is a software developer passionate about building products for impact. He works at Gojek on backend services that serve millions of users. He is a Go enthusiast and an active volunteer and co-organiser in the Go community. Artist at times.