The Fifth Elephant 2017

On data engineering and application of ML in diverse domains

Gaurav Godhwani


Transforming India's Budgets into Open Linked Data

Submitted Apr 30, 2017

Indian Budget documents across various tiers of government contain detailed information on allocations made and resources raised in a financial year. Unfortunately, these documents are published as unstructured PDFs, which makes it difficult for researchers, economists, and the general public to analyse and use this crucial data. This session will delve into our journey of developing OpenBudgetsIndia, a collective initiative to make India’s budgets open, usable, and easy to comprehend.

What’s in it for you?

Building data pipelines for civic engagement is still at an embryonic stage in India. This talk will give data enthusiasts an opportunity to learn about, produce, and contribute to open data in their geographies. Attendees will also explore how simple scripts and open-source tools can be used to deal with complex, varied data formats.


The session will be organized as:

  • Setting the scene
  • Major issues with Indian Budget Documents
  • Role of Open Source + Communities
  • Key components of our Open Data Pipeline: Scrape, Parse, Transform, Publish, Analyse
  • Open Linked Data: Benefits and Usage
  • Various Analysis Tools
  • How you can contribute
  • Future
  • Questions

Speaker bio

Building | Doing data-for-good at DataKind Bangalore




Hosted by

All about data science and machine learning