Speak at The Fifth Elephant 2026 Annual Conference
Share you work with the community
Jul 2026
13 Mon
14 Tue
15 Wed
16 Thu
17 Fri 09:00 AM – 06:00 PM IST
18 Sat 09:00 AM – 06:00 PM IST
19 Sun
Submitted Jun 21, 2026
Everyone wants AI. Everyone talks about models. But in many organizations, the real battle is neither model selection nor prompt engineering—it is data quality. At Vahan, a large blue-collar recruitment marketplace, we process million of data points daily from our vendors & external partners.
We discovered that 10–40% of incoming data were routinely corrupted before it even reached our systems. The culprit was not faulty databases or broken pipelines, but seemingly harmless operational processes involving spreadsheets, CSV exports, Google Sheets, locale settings, and manual copy-paste workflows. These issues silently introduced date corruption, data fluctuations, retractions, pullbacks, duplication, and inconsistencies that impacted analytics, forecasting, incentive calculations, and downstream AI systems.
This talk is a production engineering war story on how we built a DataOps platform to detect, prevent, correct, and monitor data quality issues at scale. I will share the surprising failure modes we uncovered, the architecture we built around validation engines, business-critical rules, automated correction workflows, and human-in-the-loop operations, and the metrics we used to continuously improve quality.
Attendees will learn how we transformed data quality from roughly 70% to 99%+, creating a trusted foundation for analytics, forecasting, incentives, and AI systems. More importantly, they will leave with a practical, battle-tested DataOps playbook that can be applied to any organization consuming large volumes of operational or third-party data.
Anuj Gupta helps Organizations become AI native in the capacity of Head of AI.
{Add the link to 2-min elevator pitch video}
{{ gettext('Login to leave a comment') }}
{{ gettext('Post a comment…') }}{{ errorMsg }}
{{ gettext('No comments posted yet') }}