Open Source Tools and Archive for Tackling Misinformation on ChatApps in India

Sat, 23 Nov 2019, 08:30 AM – 05:30 PM IST

Taj M G Road, Bangalore

This submission has been added to the schedule

Submitted Nov 7, 2019

Session type: Short talk of 20 mins

Tattle is a civic tech project in India that is creating an archive of content circulated on WhatsApp and other chat apps, and building open source tools to navigate this archive. Such an archive is useful for research on information networks as well as for increasing the efficiency and reach of fact checking efforts. One of Tattle’s goals is to open the archive to the general public, even if in a limited scope.
We will describe some of the challenges of collecting data on encrypted platforms, and our approach to different kinds of search operations (duplicate, approximate, semantic) on multilingual and multimedia content. We will conclude with some of the ethical considerations in doing this work.
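
To make the difference between duplicate and approximate search concrete, here is a minimal sketch. It is illustrative only and is not taken from Tattle's codebase: the function names, the toy 2x2 "images" and the average-hash fingerprint are all assumptions chosen for brevity. Exact duplicates are found by hashing the raw bytes, while near duplicates are found by comparing short fingerprints with Hamming distance.

```python
import hashlib


def exact_key(data: bytes) -> str:
    """Exact-duplicate key: identical bytes always give identical digests."""
    return hashlib.sha256(data).hexdigest()


def average_hash(pixels: list[list[int]]) -> int:
    """Toy fingerprint (assumed for illustration): one bit per pixel,
    set when the pixel is brighter than the mean of the image."""
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for p in flat:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")


# Hypothetical grayscale images: the second is a lightly altered copy
# of the first (as after recompression), the third is unrelated.
original = [[10, 200], [220, 30]]
recompressed = [[12, 198], [215, 35]]
different = [[200, 10], [30, 220]]

# Byte-level hashing treats the altered copy as a different item.
same_bytes = exact_key(bytes(sum(original, []))) == exact_key(bytes(sum(recompressed, [])))

# Fingerprint distance stays small for the altered copy and large otherwise.
near = hamming(average_hash(original), average_hash(recompressed))
far = hamming(average_hash(original), average_hash(different))
```

In this sketch the byte-level keys of the original and the altered copy differ, while their fingerprints are at distance 0; a real system would pick a distance threshold empirically.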

Outline

Motivation and Goals of the Project
- How does it aim to affect the misinformation challenge in India
Data Collection
- Ways of collecting media from Chat Apps
- Collecting media from allied sources (fact checking websites)
Data Processing (Tools to navigate the archive)
- Duplicate Detection
- Approximate Search
- Semantic Search
- Use of embeddings over hashing
Ethical Considerations in this work
- Consent frameworks for data collection
- Managing access and use
- Managing violent and pornographic content
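
As a rough illustration of the "Semantic Search" and "Use of embeddings over hashing" items in the outline above, the sketch below ranks archive items by cosine similarity between embedding vectors. The three-dimensional vectors and item names are made up for the example; in practice the vectors would come from a text or image model, which is not specified in this proposal.

```python
import math


def cosine(u: list[float], v: list[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def top_k(query: list[float], archive: dict[str, list[float]], k: int = 2) -> list[str]:
    """Return the ids of the k archive items most similar to the query."""
    scored = [(cosine(query, vec), item_id) for item_id, vec in archive.items()]
    return [item_id for _, item_id in sorted(scored, reverse=True)[:k]]


# Hypothetical embeddings: two messages about the same claim in different
# languages sit close together, an unrelated message sits far away.
archive = {
    "msg_hi": [0.9, 0.1, 0.0],
    "msg_en": [0.8, 0.2, 0.1],
    "msg_other": [0.0, 0.1, 0.9],
}
query = [1.0, 0.0, 0.0]

results = top_k(query, archive, k=2)
```

Unlike a hash, which only says whether two items match, the similarity score gives a ranking, so items that express the same idea in different words or languages can still be retrieved together.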

Requirements

An interest in misinformation!

Speaker bio

Keshav Joshi is a data scientist at Tattle, where he works on building an archive of misinformation and developing the project's data science stack. Keshav has several years of experience as a data scientist, researcher and lecturer, and holds two Master's degrees, in Physics and CS, from Georgia Tech.

Slides

https://docs.google.com/presentation/d/1YfKV8MSYy40k36OzDoDmnfdRil-W8LW9RGfObExL8EY/edit#slide=id.g6adfa597e3_0_195

Hybrid access (members only)

Hosted by

Anthill Inside

Anthill Inside is a forum for conversations about risk mitigation and governance in Artificial Intelligence and Deep Learning.

Anthill Inside 2019
