The Fifth Elephant round the year submissions for 2019
Submit a talk on data, data science, analytics, business intelligence, data engineering and ML engineering
Accepting submissions till 31 Dec 2020, 11:59 PM
Not accepting submissions
If you missed the deadline for submitting your talk for The Fifth Elephant 2019 -- to be held in Bangalore on 25 and 26 July -- you can propose a talk here.
We are accepting talks on:
Accepting submissions till 31 Dec 2020, 11:59 PM
Building a large-scale Data as a Service (DaaS) platform to consistently deliver high-quality datasetsAs a provider of Competitive Intelligence as a Service to eCommerce businesses and consumer brands, DataWeave aggregates and analyses product catalog data from eCommerce websites each day at massive scale. Once aggregated, this data is fed into a complex process of extraction, transformation, machine learning, and analyses. These operations are performed on a consistent basis to provide our custo… more
Session type: Short talk of 20 mins
|
Finding needles in high dimensional haystacks: Product Matching in RetailMatching the same and similar products is a problem fundamental to the online retail industry with multiple applications spanning across price optimization, recommending similar or substitute products to customers, understanding gaps in product assortments, and counterfeit product detection. Given that that there are no standard product identifiers, catalog data is often noisy, incomplete and non… more
Session type: Short talk of 20 mins
|
Websites to DatasetsAs a provider of Competitive Intelligence as a Service to eCommerce businesses and consumer brands, DataWeave aggregates and analyses product catalog data from eCommerce websites each day at massive scale. Once aggregated, this data is fed into a complex process of extraction, transformation, machine learning, and analyses. These operations are performed on a consistent basis to provide our custo… more
Session type: Short talk of 20 mins
|
A Journey of Building Dream11's Data PlatformDream11 is India’s biggest fantasy sports platform that allows users to play fantasy cricket, hockey, football, kabaddi and basketball. Our total user base is over 70 million and expected to cross 100 million by end of 2019. more
|
Deep Learning powered Genomic ResearchThe event disease happens when there is a slip in the finely orchestrated dance between physiology, environment and genes. Treatment with chemicals (natural, synthetic or combination) solved some diseases but others persisted and got propagated along the generations. Molecular basis of disease became prime center of studies to understand and to analyze root cause. Cancer also showed a way that or… more
Session type: Workshop
|
Panel Discussion around Healthcare AnalyticsPanel Discussion around Healthcare Analytics Outline more
Session type: Birds of a Feather session of 1 hour
|
Interpretable NLP ModelsDeep learning models are always known to be a black box and lacks interpretability compared to traditional machine learning models. So,There is alway a hesitation in adopting deep learning models in user facing applications (especially medical applications). Recent progress in NLP with the advent of Attention based models , LIME and other techniques have helped to solve this. I would like to walk… more
Session type: Tutorial
|
Real-Time DataQuality on FlinkMy use case is to provide monitoring, and improving the overall search data quality, also to find the unusual patterns of user’s search behavior, and notifying the intent on-site back to the respective business stakeholders. To achieve the same, I explored various big data processing engines, which can process the huge data with complex business logic in real time. Eventually, I used Flink Stream… more
Session type: Full talk of 40 mins
|
Building a Location Intelligence Platform for audience segmentationThe ROI of OOH (Out of Home Advertisement) depends on precise and intelligent targeting of advertisements. The media buyers therefore require detailed understanding and visibility of the audiences across various attributes so that they can then plan their OOH media buy to specifically target a selected set of audiences. Location information of the user, device level audience data, enriched with r… more
Session type: Short talk of 20 mins
|
How to make a kickass data platform with spark and S3In this talk, we will explore the advantages and challenges faced while running an in-house data platform using spark and S3. We will also discuss how to add some essential features to your platform like autoscaling and access control. The latter part of the talk will also address some ways to organise data in S3, storage formats for big data and indexing to improve read performance for big-data … more
Session type: Full talk of 40 mins
|
Anomaly Detection at Scale: Architectural Choices for Data Pipelines for 7B events per dayCloud-native applications. Multiple Cloud providers. Hybrid Cloud. 1000s of VMs and containers. Complex network policies. Millions of connections and requests in any given time window. This is the typical situation faced by a Security Operations Control (SOC) Analyst every single day. In this talk, the speaker talks about the high-availability and highly scalable data pipelines that he built for … more
Session type: Full talk of 40 mins
|
Deploying Deep Learning models on the Edge (Android, IOS, ...)The ability to train the task specific deep learning models is very easy these days, with the wide range of available libraries and documentation around it. But, the difficulty lies in bringing it to production ready mode. Especially, if the application concentrates on Mobile platform. Though there are existing wrappers of certain libraries to make them work, but, as of now, they are slow and use… more
Session type: Full talk of 40 mins
|
Machine Learning Model Management with MLflowBackground Data is the new oil and its size is growing exponentially day by day. Most of the companies are leveraging data science capabilities extensively to affect business decisions, perform audits on ML patterns, decode faults in business logic, and more. They run large number of machine learning model to produce results. more
Session type: Tutorial
|
Building a data pipeline inside and outside a vehicleAther 450 is a smart electric vehicle with data intensive features on the vehicle as well as on the cloud/mobile app. On the vehicle, the on-board software uses the vehicle data to make decisions regarding the vehicle behaviour and safety, while giving some user delight features like auto-indicator. Via the cloud, user has a mobile app using which the vehicle can be monitored and their ride stati… more
Session type: Short talk of 20 mins
|
Data Science for the discretionary managers: Lessons from a 60 trillion$ traditional industry resistant to change and facing the quant threatInvestment management is a 60 Trillion$ industry, and despite the recent advancements in data science and machine learning, still remains fairly discretionary. Untill recently, less 20% of the funds called themselves quantitative. However, there is an absolutely massive transformation taking place right now within the discretionary investment management industry. Quantitative and systematic strat… more
Session type: Short talk of 20 mins
|
Case study: Outbound logistics optimization for multi depot problem with time windowCase study: Outbound logistics optimization for multi depot problem with time window more
Session type: Short talk of 20 mins
|