Jul 2018
23 Mon
24 Tue
25 Wed
26 Thu 07:45 AM – 06:15 PM IST
27 Fri 07:45 AM – 05:35 PM IST
28 Sat
29 Sun
Jul 2018
23 Mon
24 Tue
25 Wed
26 Thu 07:45 AM – 06:15 PM IST
27 Fri 07:45 AM – 05:35 PM IST
28 Sat
29 Sun
##About the conference and topics for submitting talks:
The Fifth Elephant is rated as India’s best data conference. It is a conference for practitioners, by practitioners. In 2018, The Fifth Elephant will complete its seventh edition.
The Fifth Elephant is an evolving community of stakeholders invested in data in India. Our goal is to strengthen and grow this community by presenting talks, panels and Off The Record (OTR) sessions that present real insights about:
**
##Target audience:
You should attend and speak at The Fifth Elephant if your work involves:
##Perks for submitting proposals:
Submitting a proposal, especially with our process, is hard work. We appreciate your effort.
We offer one conference ticket at discounted price to each proposer, and a t-shirt.
We only accept one speaker per talk. This is non-negotiable. Workshops may have more than one instructor.
In case of proposals where more than one person has been mentioned as collaborator, we offer the discounted ticket and t-shirt only to the person with who the editorial team corresponded directly during the evaluation process.
##Format:
The Fifth Elephant is a two-day conference with two tracks on each day. Track details will be announced with a draft schedule in February 2018.
We are accepting sessions with the following formats:
##Selection criteria:
The first filter for a proposal is whether the technology or solution you are referring to is open source or not. The following criteria apply for closed source talks:
The criteria for selecting proposals, in the order of importance, are:
No one submits the perfect proposal in the first instance. We therefore encourage you to:
Our editorial team helps potential speakers in honing their speaking skills, fine tuning and rehearsing content at least twice - before the main conference - and sharpening the focus of talks.
##How to submit a proposal (and increase your chances of getting selected):
The following guidelines will help you in submitting a proposal:
To summarize, we do not accept talks that gloss over details or try to deliver high-level knowledge without covering depth. Talks have to be backed with real insights and experiences for the content to be useful to participants.
##Passes and honorarium for speakers:
We pay an honorarium of Rs. 3,000 to each speaker and workshop instructor at the end of their talk/workshop. Confirmed speakers and instructors also get a pass to the conference and networking dinner. We do not provide free passes for speakers’ colleagues and spouses.
##Travel grants for outstation speakers:
Travel grants are available for international and domestic speakers. We evaluate each case on its merits, giving preference to women, people of non-binary gender, and Africans. If you require a grant, request it when you submit your proposal in the field where you add your location. The Fifth Elephant is funded through ticket purchases and sponsorships; travel grant budgets vary.
##Last date for submitting proposals is: 31 March 2018.
You must submit the following details along with your proposal, or within 10 days of submission:
##Contact details:
For more information about the conference, sponsorships, or any other information contact support@hasgeek.com or call 7676332020.
Hosted by
ADDEPALLI GIRIDHAR
@connect2ppl
Submitted Mar 31, 2018
At Walmart TB’s of data gets generated per day via interactions, transactions by our users on walmart.com and other properties(in-store, jet.com etc). As part of our Customer data strategy we strive to increase Reach, Depth, Freshness to know about more customers, more about customers, and in as real-time as possible. Towards this goal, we need to ingest data as when it is generated and process it to gain insights about our customers.
Our current streaming platform is built using Kafka, Storm & Couchbase. As we plan to ingest more data we observed that lookups in Couchbase from our Storm processes over network is a bottleneck. We have evaluated few technologies like Samza, Flink and Kafka Streams which persist state on the machine that is processing the messages, so that network calls are not necessary.
We have chosen Kakfa Streams as a technology over Samza & Flink. Kafka Streams has borrowed few good ideas from Samza.
And has added few important features like Standby Replica (increases availability) & Interactive Queries (makes state queryable).
We have implemented few useful features on top of Kafka Streams like ::
(1) Storage Policies (Archival / TTL / Compaction) which prevents not-so-recent data occupying disk space.
(2) Ability to query state even when task is RESTORATION state (during rebalancing). This increases availability.
(3) Currently, in Kafka Streams two-hops are required for asnwering interactive queries, we have developed mechanism so that only one hop is enough.
(4) Currently, in Kafka Streams Changelog Kafka Topics have infinite retention time to support restoration of state in case of failures. But if state is huge it is not feasible to have infinite retention time. We have developed mechanism so that it is possible to build state even with finite retention time setting for Changelog Kafka Topics.
(5) Added rack-awareness feature that ensures active and standby replica tasks are not scheduled to run on the same rack.
Problem Statement - Build platform that helps in developing stateful applications that ingest events in real-time.
Discuss previous version of Streaming Platform that used Kafka, Storm, Couchbase and discuss its shortcomings.
Discuss few alternatives like Samza, Flink and reasons for our choice of Kafka Streams
Discuss few features we have built on top of Kafka Streams which improves efficiency, availability.
Discuss how we productionalized our streaming platform.
I am Giridhar Addepalli with over 9 years of experience as Software Developer. Currently working as Staff Engineer at WalmartLabs.
https://www.linkedin.com/in/giridhar-addepalli-8828367/
https://www.slideshare.net/giridhar510/kafka-streams-fifth-elephant-2018
Jul 2018
23 Mon
24 Tue
25 Wed
26 Thu 07:45 AM – 06:15 PM IST
27 Fri 07:45 AM – 05:35 PM IST
28 Sat
29 Sun
Hosted by
{{ gettext('Login to leave a comment') }}
{{ gettext('Post a comment…') }}{{ errorMsg }}
{{ gettext('No comments posted yet') }}