Sourcing, processing, and indexing location data for search
Submitted by oligoglot (@oligoglot) on Friday, 31 August 2012
Section: Technology Technical level: Intermediate Session type: Workshop
One can learn about the following by attending the lecture: 1. Open sources of geodata 2. Tools to process such data 3. Indexing them to build a simple location-enabled search engine
- Open sources for location information associated with structured and semi-structured data Wikipedia, DBPedia, Freebase, Geonames, US Military database etc.
- Tools and methods to process the identified feeds: Google Refine, RDF libraries, custom feed processors
- Building a simple search engine using the data processed above to support the following: Point, range, and neighbourhood searches
Draws from my experience extracting information from various online sources for building a mobile search engine (zook.in) as well as for building custom search systems at Ideophone.
A basic idea of the following: 1. geodata 2. webservices
Sundar has built/worked-on a number of information retrieval systems for Y! Travel, Shopping, Hotjobs, zook.in, and Ideophone. His primary work has been in crawling the web, extracting structured data from semi-structured information and processing that for indexing.