Jul 2012
23 Mon
24 Tue
25 Wed
26 Thu
27 Fri 09:30 AM – 05:30 PM IST
28 Sat 09:30 AM – 05:00 PM IST
29 Sun
Jul 2012
23 Mon
24 Tue
25 Wed
26 Thu
27 Fri 09:30 AM – 05:30 PM IST
28 Sat 09:30 AM – 05:00 PM IST
29 Sun
A overview of the Hadoop ecosystem and how the different parts of the ecosystem interact and fit together.
Hadoop has matured to point where it is not longer just one project but a bunch of projects ranging from getting data onto the cluster to processing and analyzing data to managing the cluster itself. I will be talking from my personal experiences from setting up a hadoop cluster at Inmobi that processes 10TB+ of Data per day (and growing). The several Hadoop clusters in Inmobi are spread over multiple datacenters across continents.
Projects that will be covered in some detail include Hadoop (HDFS and Mapreduce), Hive, HBase, Pig, Mahout, Scribe , Zookeeper and Oozie/Azkhaban.
Should have basic familiarity with Hadoop.
Vinayak Hegde is Head of Engineering (Marketplace Management) at Inmobi. He has been active in opensource software community for more than a decade. He has been writing code in mainstream as well as esoteric programming languages on a variety of operating systems. He is a computer networking and data geek.
Jul 2012
23 Mon
24 Tue
25 Wed
26 Thu
27 Fri 09:30 AM – 05:30 PM IST
28 Sat 09:30 AM – 05:00 PM IST
29 Sun
Hosted by
{{ gettext('Login to leave a comment') }}
{{ gettext('Post a comment…') }}{{ errorMsg }}
{{ gettext('No comments posted yet') }}