Nischal HP, Vice President of Data Engineering and Data Science at Scoutbee. Nischal curated the MLOps conference which was held online between 23 and 27 July 2021.
AI and Research - covers research, findings, and solutions for challenges on building models in various areas such as fraud detection, forecasting, and analytics. This track delves into the latest methodologies for handling challenges such as large-scale data processing, distributed computing, and optimizing model performance.
Industrial applications of ML - covers implementation of AI in the industry, with more focus on the AI models, the issues in training, gathering data so, and so forth. ML is being used at scale in industries such as automotive, mechanical, manufacturing, agriculture, and such domains. This track focuses on the challenges in this space, as we see innovation coming out of these industries in the pursuit of using ML on a second-to-second basis.
AI and Product - covers strategies for building AI products to scale and mitigating challenges. This track provides insights on incorporating AI tools and forecasting techniques to improve model training, developing a working model architecture, and using data in the business context.
The Fifth Elephant 2023 Monsoon edition will be held in-person. Attendance is open to The Fifth Elephant members only. Purchase a membership to attend the conference in-person. If you have questions about participation, post a comment here.
Data/MLOps engineers who want to learn about state-of-the-art tools and techniques, especially from domains such as automobile, agri-tech and mechanical industries.
Data scientists who want a deeper understanding of model deployment/governance.
Architects who are building ML workflows that scale.
Tech founders who are building products that require AI or ML.
Product managers, who want to learn about the process of building AI/ML products.
Directors, VPs and senior tech leadership who are building AI/ML teams.
Infrastructure (GPU, CPU and cloud providers) and developer productivity tool makers who want to evangelise their offering to developers and decision-makers.
Companies seeking tech branding among AI and ML developers.
Venture Capital (VC) firms and investors who want to scan the landscape of innovations and innovators in AI and who want to source leads for investment in the AI and ML space.
The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more
Submission guidelines and selection criteria If you are interested in speaking at The Fifth Elephant, note the following guidelines: Submit a description of your talk, explaining the problem that your talk covers, and one concrete takeaway for audience. Talks have to give at least one practical ins… expand
If you are interested in speaking at The Fifth Elephant, note the following guidelines:
Submit a description of your talk, explaining the problem that your talk covers, and one concrete takeaway for audience. Talks have to give at least one practical insight to the audience.
Preference will be given to talk experiential talks such as case studies, journeyman stories and implementation stories.
Editors will comment on the description. Make sure to watch out for comments from Nischal HP and Sumod Mohan on your submission. Editors’ decision on which talks are selected for the conference will be final. Talks which are not selected will also receive feedback, so that speakers can present their talks at other opportunities under The Fifth Elephant umbrella.
If you submission is included in the shortlist, you will be required to prepare an outline of your talk and go through a rehearsal.
The call for submissions will be closed on 30 June. In the meanwhile, talks will be selected on a rolling basis, as individuals make submissions.
The Fifth Elephant membership funds pay for honorarium for speaker. The honorarium amount will be declared with the funds available in the membership corpus on 31 July 2023.
You can submit an experiential talk on one of the following topics. You are welcome to submit talks on other topics. Focus on the relevance of the talk, who are the stakeholders who will be impacted by your talk, and what is the takeaway for them.
AI and research.
Decisions Intelligence Systems.
Responsible AI.
Privacy.
Industrial applications of ML - implementation of AI, with more focus on the AI models, the issues in training, gathering data so, and so forth in industries such as automotive, mechanical, manufacturing, agriculture, and such. Explain the challenge you are working on, and the innovation coming out of these industries as they pursue ML on a second-to-second basis.
Machine Learning (ML) application lifecycle at an organization and how it is helping the organization scale.
Cloud or tools associated with ML application lifecycle.
Data labeling and classification.
Data security, data privacy and data governance.
“Build versus buy” experiential case studies.
The conference is also accepting:
Lightning talks on ideas and innovations that individuals are testing/implementing using AI.
Showcase for open source projects built for MLOps.
If you have questions about submitting a talk or speaking at the conference, post a comment here.
Background In 2015, I sold the intellectual property (IP) of my Silicon Valley company, Perceptive Code LLC, to Mercedes Benz. Subsequently, I was tasked with meeting certain milestones as part of the handover process. I chose to complete this task in India, where I aways wanted to be. I successfully downsized our research model, initially consuming 6GB of GPU memory, to a mere 300KB of weights. … more
Samsung Ads is an intuitive audience platform that delivers meaningful experiences reaching the right audience across screens, formats and devices. With more than 900M Mobiles and 150M Smart TVs, and the largest first party data set powered by ACR, we help marketers reach targets and enhance experiences that span digital landscapes. The business has grown 10x since 2015. Our foundation is based o… more
Abstract: Echocardiogram(Echo) is one of the common modality that captures state of the heart in the form images and videos. Using Ultrasound technique, an echo study captures multiple cross sections of the heart, termed as Views. Cardiologist utilizes measurements on the basis of these Views to analyse heart functions. In order to automate this process to measure how heart is functioning, an ini… more
Link to presentation: https://docs.google.com/presentation/d/18IgJG7hvPgcQZOVoE9cQpYlp0B6fLrwd/edit?usp=sharing&ouid=116463250234490514040&rtpof=true&sd=true more
Problem Statement In our journey at Squadstack, we encountered a significant challenge in maintaining exceptional customer interactions during telemarketing calls. We realized the need to evaluate callers based on various parameters to identify and flag undesirable interactions, this would become a robust solution and will help streamline the evaluation process and provide targeted training inter… more
Abstract: Companies are now building products using real-time computer vision and machine learning of video from various systems and processes. This requires a full stack system consisting of video ingestion and storage, live inferencing, post-processing of data generated, and use of the data in the business context or customer’s domain language. We demonstrate a preferred architecture and stack … more
I’ve built production LLM systems and made multiple mistakes. The talk will focus on key areas to focus on while deploying LLM and Generative AI systems in production more
A/B testing is a widely-used paradigm within marketing optimization because it promises identification of causal effects, because it is implemented out of the box in most messaging delivery software platforms, but mostly because it is held up as a “gold standard” for evaluating options. This talk will explain why A/B tests are not a particularly sound method, why businesses rarely choose better (… more
Problem statement: Instamart, the quick commerce grocery delivery service of Swiggy gives unparalleled convenience of being able to order, from a huge assortment, across fresh fruits & vegetables/ dairy/FMCG products and accessories for household requirements, parties or festivities, pretty much at any time of the day and also through late night (from 6am to 3pm) and get the delivery in ~10–15 mi… more
Problem : At the TnS(Trust and Safety) team at Swiggy, building powerful fraud detection models that operate at high precision while still capturing maximum fraud has been the uber goal. Our system currently operates at a high level of complexity through various interventions, modelling techniques, and semi-supervised training methods while maintaining robustness. For the final downstream model, … more
Introduction The gap in access to Justice in India is a huge opportunity for AI. Initial field trials from Jugalbandi with farmers, domestic workers and waste pickers presents several lessons in AI product design for all Indians. In this talk we will gain a deeper understanding of the opportunities in addressing access to justice, the design choices in building Jugalbandi and what kind of technic… more
About the Industry Telesales services involving sales of products or services over a phone call is a vast market with a global value of approximately US$ 27 billion in 2022 and is expected to grow to US$ 55 Bn by 2029. Call centre solutions originating from India are a widespread phenomenon and most of these services are for the vast consumer market in India with calls happening in Indic language… more
Problem Paying for deliveries using cash after the delivery is made is a popular mode of payment employed by customers transacting online for the first time or those that prefer to have more control, especially in emerging economies like India. While the cash (or pay)-on-delivery (COD or POD) option helps e-commerce platforms, for example in our food delivery platform, tap into new customers, it … more
Agenda: 1, A deep understanding of Multimodality as a phenomenon 2. The gist of the Axiomatic treatment of multimodality - covering challenges pertaining to Representation, Alignment, Reasoning, Generation, Transference and Quantification 3. The heart of Graph Neural Network Modeling frameworks including GCNs, Graph Attention and their capacity to model Heterogeneity and spatio-temporality 4. How… more
By using advanced analytics and predictive modeling, growth systems have real-time access to customer journey insights. With this data and real time segmentation and prediction models, each stage of the marketing funnel can be optimized to more effectively nurture leads to convert. Which inturn helps drive up ROI. more
Abstract Artificial Intelligence, particularly generative models, are driving unprecedented transformations in products and services across industries. However, integrating generative AI is not just a matter of “plugging in” the technology. more
The integration of computer vision technologies in the automotive industry has revolutionized various aspects of vehicle safety, navigation, and driver assistance systems. However, developing robust and accurate computer vision models for real-world scenarios necessitates large-scale, diverse, and accurately labeled datasets, which can be challenging to obtain. Consider a scenario where the durat… more
Abstract/Proposal Server-driven recommendation systems (RecSys) face significant challenges when it comes to scaling to handle large data volumes and providing real-time recommendations. At Glance, we serve personalized recommendations to over millions of users, prioritizing response time and data privacy. To tackle these challenges, we have turned to edge machine learning (ML). By deploying ML m… more
Abstract With the advent of Large Language Models (LLMs) and Generative AI (GAI), there is growing interest in building LLM-driven applications.. Within the industry, a growing number of companies are exploring whether they can leverage state-of-the-art, open source-driven LLM Models within their own existing k8s clusters and building applications that leverage these bespoke, in-cluster LLMs for … more
ABSTRACT Submitted by Shrinath Bhat Mechanical Engineering Dept., IIT Madras 2020 graduate Senior Data Scientist, BEES Algo Selling team, AB InBev more
Abstract Glance inspires consumers to make the most of every moment by surfacing relevant experiences for them with its ‘smart lock screen’ innovation. More than 225 million consumers enjoy Glance on their Android smartphones across markets. Glance harnesses the power of Machine Learning (ML) to provide consumers with a highly personalized and engaging user experience featuring top content from b… more
Abstract In recent times, Live Streaming platforms are gaining popularity where live content is being shown to users. Typically, the videos created by the creators range from 15 minutes to an hour. After intensive research, it was found that a sizable chunk of users drops within first 30 seconds of the video. Another piece of research shows that, on average, a user only has an attention span of 3… more
Abstract: Nowadays in this digital world efficient extraction of Entities from various Government records like Pan card, Adhar card, Driving License and etc. has become a priority for various use cases like Authentication, KYC Compliance, Partner/Customer Onboarding, Age Validation etc. in a wide number of sectors. Solving such an essential problem also comes with a variety of challenges like var… more
Problem: Myntra is one of the leading fashion e-commerce companies in India. Myntra is focused on delivering best-in-class customer experience for all the fashion lovers from browsing to purchase and post purchase experience. Myntra provides curated, customized shopping experience to every user by matching deep understanding of the user with deep expertise on fashion and trends. Myntra is leverag… more
Problem Myntra is one of the leading fashion e-commerce companies in India. Myntra delivers best-in-class shopping experience by leveraging many advanced machine learning models, deployed for online or real-time inference. The online inference requires streams of data to be processed, machine learning features to be computed, stored and served in (near) real-time, at Myntra scale. more
Problem: The Consumer Packaged Goods (CPG) industry faces unique credit risk challenges such as fluctuating consumer demands, the risk of bad debt, optimal working capital management, and market volatility. These challenges necessitate a robust and dynamic credit risk model to accurately assess and manage credit risks. While similar problems have been addressed in the banking sector, the CPG indu… more
Problem Statement In the retail industry, Category Managers and Buyers often rely on their experience and instincts when planning the allocation of space in a store. However, these traditional approaches may not be accurate or adaptable to changing market dynamics. Furthermore, numerous factors influencing space planning decisions may go unnoticed by decision makers. more
Abstract At LinkedIn, we serve 100000s of inferences per second across 100s of ML models concurrently in our online systems. ML models have different system performance characteristics - ranging from lightweight XGBoosts to memory intensive recommendation models, to the newer Generative AI models, which are both compute and memory intensive. We run these models across different hardware profiles … more
Abstract Auquan is an AI startup that serves institutional investors and investment managers with curated news and documents to help them make better investment decisions. more
In the ever-changing landscape of digital products, algorithms play a vital role in providing personalized experiences for users. This talk will delve into the intersection of product and algorithmic design, addressing the need to create products that optimize algorithmic capabilities. For example, at Glance, a lock screen experience, we have 200m+ daily active users and need to design our user e… more
Abstract Auquan is an AI startup that serves institutional investors and investment managers with curated news and documents to help them make better investment decisions. more
Brief : Beyond the analyst,scientist and ML engineer roles, what are the possible evolutions of roles surrounding data science and how we are seeing it develop across different projects in our agritech startup . more
At DoorDash, machine learning is a key component, used to enhance the experience of merchants, dashers, and customers. As our machine learning use cases keep growing, our forecasting and training pipelines are faced with several challenges like scalability, growing costs, reduced user development velocity and lack of proper debugging/observability. more
Problem: The modern purchasing algorithm at Target is an automated algorithm that places orders to vendors with the details of the Target warehouses (distribution centres – DCs) where these items need to be shipped. Since the DCs serve multiple stores and vendors ship multiple items across categories, a purchase order can be so big that it doesn’t fit in a single truck or so small that it doesn’t… more
My Background I am Kritika Saraswat, a passionate data scientist currently employed at AB InBev, the world’s largest brewing company headquartered in Leuven, Belgium. They own more than 500+ beer brands across the globe. With a strong background in data science and machine learning, I have been actively involved in driving impactful solutions within the domain. Prior to my current role, I gained … more
Reinforcement Learning (RL) is a subfield of machine learning that involves interacting with the environment to improve performance. Scientists have been using various games as a way to test and compare different learning and planning methods. Back in 1992, Gerry Tessauro used Reinforcement Learning to train a neural network to play Backgammon. Since then, similar techniques have been used to cre… more
Abstract: LlamaIndex is a powerful framework that acts as a central interface between Language Model Libraries (LLMs) and external data. In this talk, we will dive into the reasons behind the development of LlamaIndex and explore its various basic building blocks/ components. We will discuss the different abstractions within llamaindex - indexes, retrievers, response synthesis, query engines. We … more
Abstract The success of large-scale recommender systems hinges upon their ability to deliver accurate and timely recommendations to a diverse user base. At Glance, we offer snackable personalized content to the lock screens of 200M smartphones. In this context, continuous monitoring is paramount as it safeguards data integrity, detects drifts, addresses evolving user preferences, optimizes system… more
About me I am working in ABInBev as Senior Manager – Data Science with experience in Statistical/Machine Learning and Predictive Modelling and analytics consulting. With a passion for machine learning and data-driven solutions, I have been actively involved in the development and implementation of advanced analytics frameworks. Currently working as Product Owner and DS lead for AlgoVault. I am ma… more
Abstract LLMs have been demonstrated to perform quite well in question answering tasks and have been shown to generate good answers based on the context provided. In many scenarios, training of LLM becomes challenging due to time and resource constraints. When it comes to adoption of LLM in large organizations, major problem arises due to the confidentially of data and scattered relevant informat… more
The last 12 months have seen a relentless pace of innovation in the field of Artificial Intelligence (AI). Each week hundreds of new models and code repositories are released. It’s clearly a wonderful time to be working in the field but the volume and pace do present their own unique challenges. How to develop products that support rapid prototyping whilst frequently changing the models powering … more
My name is Rahul, and I am the founder and CEO of Akaike.ai, an AI multi-modal company specialising in advanced ML/AI solutions. I am writing to you today to express my interest in speaking at The Fifth Elephant 2023 Conference on the theme of “Industrial Applications of ML.” more
About me I am working in ABInBev as Senior Manager – Data Science with experience in Statistical/Machine Learning and Predictive Modelling and analytics consulting. With a passion for machine learning and data-driven solutions, I have been actively involved in the development and implementation of advanced analytics frameworks. I am master’s in computer science and have 12+ years of experience in… more
Graph databases and social graph Graph databases are the most scalable, high-performance way to query and store highly interconnected data. They help improve intelligence, predictive analytics, social network analysis, and decision and process management – which all involve highly connected data with lots of relationships. A relevant use case for graph databases is the social graph. more
DNN (Deep Neural Network) models are nonlinear and have a high variance, which can be frustrating when preparing a final model for making predictions. In order to get good results with any model, there are certain criteria (data, hyperparameters) that need to be fulfilled. But in the real-world scenario, you might either end up with bad training data or might have a hard time figuring out appropr… more
At Dream 11, we have built a Customer lifetime value(CLTV) model to predict each user’s future lifetime value. There are two broad areas where having a future-looking estimate of customer value can help. more
The talk will be an implementation story which falls under #5 in the Topic for Submission - [https://hasgeek.com/fifthelephant/2023/sub#h:topics-for-submission] section. It will be mainly focused on a problem statement in the Search domain to rank the search results retrieved from the Search engine. It explains in detail about the advanced/SOTA solutions using Deep Learning, Reinforcement Learnin… more
Background Large Language Models have demonstrated amazing capability for solving complex problems. But they can’t answer what they haven’t seen, and to take advantage of these amazing models, we need to expose our data to the model. Fine-tuning is not an option, at least not a cheap one. Prompt engineering is a helpful technique to provide context to LLMs, which helps the model restrict its answ… more
In this talk , I will talk about privacy risks with Machine Learning and explain in detail about Privacy Preserving Machine Learning techqniues. Introudce variious frameworks which can be used to implement to protect ML Models, Training Data, Inference Results from privacy threats. Talk about privacy threats in Large Lanauge Models ( LLM’s) and varous benchmarks in ML with resepct to privacy pres… more
About me I am currently a Data Science Manager in Revenue Growth Management Analytics at AB InBev, the world’s largest beer company. In this role, I harness the power of machine learning (ML) to drive business success. With my expertise in pricing and mix management, combined with a master’s degree in economics specializing in applied quantitative finance, I bring a unique blend of knowledge and … more
Its a no brainer that huge amounts of data in high TBs/PBs are going to be processed for any foundational models or even training of LLMs. In this talk, i propose to discuss the pain points of handling data at this scale more
The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more