Jul 2018
23 Mon
24 Tue
25 Wed
26 Thu 07:45 AM – 06:15 PM IST
27 Fri 07:45 AM – 05:35 PM IST
28 Sat
29 Sun
Jul 2018
23 Mon
24 Tue
25 Wed
26 Thu 07:45 AM – 06:15 PM IST
27 Fri 07:45 AM – 05:35 PM IST
28 Sat
29 Sun
##About the conference and topics for submitting talks:
The Fifth Elephant is rated as India’s best data conference. It is a conference for practitioners, by practitioners. In 2018, The Fifth Elephant will complete its seventh edition.
The Fifth Elephant is an evolving community of stakeholders invested in data in India. Our goal is to strengthen and grow this community by presenting talks, panels and Off The Record (OTR) sessions that present real insights about:
**
##Target audience:
You should attend and speak at The Fifth Elephant if your work involves:
##Perks for submitting proposals:
Submitting a proposal, especially with our process, is hard work. We appreciate your effort.
We offer one conference ticket at discounted price to each proposer, and a t-shirt.
We only accept one speaker per talk. This is non-negotiable. Workshops may have more than one instructor.
In case of proposals where more than one person has been mentioned as collaborator, we offer the discounted ticket and t-shirt only to the person with who the editorial team corresponded directly during the evaluation process.
##Format:
The Fifth Elephant is a two-day conference with two tracks on each day. Track details will be announced with a draft schedule in February 2018.
We are accepting sessions with the following formats:
##Selection criteria:
The first filter for a proposal is whether the technology or solution you are referring to is open source or not. The following criteria apply for closed source talks:
The criteria for selecting proposals, in the order of importance, are:
No one submits the perfect proposal in the first instance. We therefore encourage you to:
Our editorial team helps potential speakers in honing their speaking skills, fine tuning and rehearsing content at least twice - before the main conference - and sharpening the focus of talks.
##How to submit a proposal (and increase your chances of getting selected):
The following guidelines will help you in submitting a proposal:
To summarize, we do not accept talks that gloss over details or try to deliver high-level knowledge without covering depth. Talks have to be backed with real insights and experiences for the content to be useful to participants.
##Passes and honorarium for speakers:
We pay an honorarium of Rs. 3,000 to each speaker and workshop instructor at the end of their talk/workshop. Confirmed speakers and instructors also get a pass to the conference and networking dinner. We do not provide free passes for speakers’ colleagues and spouses.
##Travel grants for outstation speakers:
Travel grants are available for international and domestic speakers. We evaluate each case on its merits, giving preference to women, people of non-binary gender, and Africans. If you require a grant, request it when you submit your proposal in the field where you add your location. The Fifth Elephant is funded through ticket purchases and sponsorships; travel grant budgets vary.
##Last date for submitting proposals is: 31 March 2018.
You must submit the following details along with your proposal, or within 10 days of submission:
##Contact details:
For more information about the conference, sponsorships, or any other information contact support@hasgeek.com or call 7676332020.
Hosted by
Puneet
@puneetkrojha
Submitted Mar 27, 2018
XStream is a Unified Self-Service Analytics ETL & ML Platform Built On Top Of Apache Spark, which allows you to create scalable and fault tolerant pipelines.You can express your Big Data Spark computation logic in a much simpler and intuitive fashion and get your complex pipelines ready in minutes.
XStream is also capable of running Big Data batch jobs as streaming computation on a static data.It allows to switch from batch processing jobs to stream processing jobs and viceversa. XStream provides you with ready to use I/O connectors,interface to use static dataset for joins and lookups and connectors to perform realtime fast lookup on Redis,HBase and BigTable.
Complex and important part of handling job failures gracefully,bad data handling,getting realtime descriptiive and prescriptive metrics dashboard for your running jobs,defining and scheduling workflows on your batch and streaming jobs, are another very important aspect it gracefully handles out of the box for all your pipelines.
XStream also focuses on defining all important data featurization operators needed to create Machine Learning models.It allows to embed online Machine Learning models into XStream pieplines and also create Machine Learning models using it Drag and Drop constructs.
An existing Fortune 500 Online Retailor had their batch Market Propensity models which took around 24 hours to generate updated models to be used in their Machine Learning Pipelines.Due to huge infrastructure cost they created their Models on sample data. Business usecases needed upgrade in existing model to be updated in realtime.They had issues in maintaining realtime customer segment profiles and customer product profiles.
XStream helped not only change the existing Market Propensity pipeline from Batch to Realtime but its effective feature generation operators helped reduce the time and infrastructure cost. Complete input data was used to generate Market Propensity models , Realtime Customer Segment Profiles and Customer Product Profiles.
The customer could use the same pipeline for batch or streaming inputs,on a click of a button, thereby avoiding the re-engineering required to developed two workflows.
We will explain the existing model logic , how it was mapped in XStream by a ETL Developer who could never imagine creating similar workflows like skilled Big Data Developers and run it without much hassle.One doesn’t need to focus on tuning the jobs as the important aspects of connection tuning, getting metrics on input rate,memory usage,shuffle and alert on ill configured job parameters,bypassing and storing bad data records in separate sink are handled by XStream.
Introducing XStream
Features of the Product
Machine Learning Usecase(Realtime Market Propesity Modeling) using XStream
Puneet Kumar Ojha
VP Data Engineering and Analytics
https://www.linkedin.com/in/puneetkumarojha/
Proven Experience in building scalable Big Data and Machine Learning,Data Quality and Analytics Products.He has delivered solutions for Online Retail,AdTech,HeathCare Domains.Experience in architecting solutions scaling to petaByte scale data for low lantecy and high velocity.
Experienced Data Modeler for relational and NoSQL databases.Solved Usecases on Data Convergence-Customer360, Market Propensity,Enterprise Platform Migration - DataCenter to Google Cloud & AWS, Customer Segmentation,Conversational BOT Platform and Realtime Decision Platform for Retail Industry and Connected Devices.
https://www.slideshare.net/PuneetKumar30/market-propensity-modeling-using-xstreams
Jul 2018
23 Mon
24 Tue
25 Wed
26 Thu 07:45 AM – 06:15 PM IST
27 Fri 07:45 AM – 05:35 PM IST
28 Sat
29 Sun
Hosted by
{{ gettext('Login to leave a comment') }}
{{ gettext('Post a comment…') }}{{ errorMsg }}
{{ gettext('No comments posted yet') }}