The Fifth Elephant 2017

On data engineering and application of ML in diverse domains

Up next

Transforming India's Budgets into Open Linked Data

GG

Gaurav Godhwani

@gggodhwani

Indian Budget documents across various tiers of government, consist of detailed information on allocations made and resources raised in a financial year. Unfortunately these documents are published in unstrtuctured PDFs which makes it difficult for researchers, economists and general public to analyse and use this crucial data. This session will delve into our journey of developing OpenBudgetsIndia - a collective initiative to make India’s budgets open, usable and easy to comprehend.

What’s in for you?

Building data pipelines for civic-engagement is still in its embryonic stage in India, this talk will give an opportunity to data enthusiasts to learn, produce and contribute to open data in their geographies. People will also explore how we can employ simple scripts and open-source tools to deal with complex multifarious data formats.

Outline

The session will be organized as:

  • Setting the scene
  • Major issues with Indian Budget Documents
  • Role of Open Source + Communities
  • Key components of our Open Data Pipeline: Scrape, Parse, Transform, Publish, Analyse
  • Open Linked Data: Benefits and Usage
  • Various Analysis Tools
  • How you can contribute
  • Future
  • Questions

Speaker bio

Building OpenBudgetsIndia.org | Doing data-for-good at DataKind Bangalore

Links

Slides

https://docs.google.com/presentation/d/1ZC9YuyILxmcpY1-op1KliDmIZQ8XeLJEFyYAkwORfrs/edit?usp=sharing

Comments