Jun 27, 2025 (Fri), 08:00 AM – 05:00 PM IST
Anmol Pahwa
@anmpahwa
Submitted Apr 14, 2025
Publicly available datasets from Indian government agencies and bodies are often published as PDF tables (sometimes even as scanned PDF tables). One such dataset is the daily air quality data that the Central Pollution Control Board publishes at 4:00 PM every day. I wrote an automated Python script that scrapes these tables every day at 4:30 PM, and then automated a Julia script that visualizes the daily air quality data (index, level, and major pollutant). Finally, using Git, I publish these plots on my blog every day at 5:30 PM. This is an effort to make government-sourced data more accessible.
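To give a feel for the scraping step, here is a minimal Python sketch of turning text rows already extracted from a PDF table into structured records. It is not the INDAQ code: the row layout (serial number, city, index value, major pollutant), the `Reading` class, and the `parse_rows` and `aqi_level` helpers are all hypothetical, and the PDF text extraction itself is assumed to have been done by a separate library. The level is derived from the index using the National Air Quality Index bands (Good up to 50, Satisfactory up to 100, Moderate up to 200, Poor up to 300, Very Poor up to 400, Severe above that).

```python
import re
from dataclasses import dataclass
from typing import Iterable, List

# National Air Quality Index bands as (upper bound, label).
AQI_BANDS = [
    (50, "Good"),
    (100, "Satisfactory"),
    (200, "Moderate"),
    (300, "Poor"),
    (400, "Very Poor"),
    (500, "Severe"),
]

# ASSUMPTION: each data row looks like "<serial> <city> <index> <pollutant>",
# e.g. "12 New Delhi 310 PM2.5". The real bulletin layout may differ.
ROW = re.compile(
    r"^\s*\d+\s+(?P<city>.+?)\s+(?P<index>\d{1,3})\s+(?P<pollutant>\S.*?)\s*$"
)


@dataclass
class Reading:
    """One city's daily air quality record (hypothetical structure)."""
    city: str
    index: int
    level: str
    pollutant: str


def aqi_level(index: int) -> str:
    """Map an index value to its air quality level."""
    for upper, label in AQI_BANDS:
        if index <= upper:
            return label
    return "Severe"  # values above the last band are treated as Severe


def parse_rows(lines: Iterable[str]) -> List[Reading]:
    """Parse extracted text lines, skipping headers and other non-data lines."""
    readings = []
    for line in lines:
        match = ROW.match(line)
        if match is None:
            continue  # header, footer, or a row that does not fit the assumed layout
        index = int(match.group("index"))
        readings.append(
            Reading(
                city=match.group("city"),
                index=index,
                level=aqi_level(index),
                pollutant=match.group("pollutant"),
            )
        )
    return readings
```

Records in this shape could then be written to a CSV file for the plotting script to read. Scanned PDFs would additionally need an OCR step before any parsing like this is possible.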
GitHub: INDAQ
Blog: Monitoring Daily Air Quality Data
This session could be of interest to students and researchers across domains, especially those who work with publicly available or open datasets.
I am Anmol Pahwa, Assistant Professor in the Civil Engineering Department at IIT Madras, with research interests pertaining to sustainable and resilient freight transportation.
Website: LogNitiLab