What are your users doing on your website or in your store? How do you turn the piles of data your organization generates into actionable information? Where do you get complementary data to make yours more comprehensive? What tech, and what techniques?
The Fifth Elephant is a two-day conference on big data.
Early Geek tickets are available from fifthelephant.doattend.com.
The proposal funnel below lets you submit a session and vote on proposed sessions. When proposing a session, it is good practice to introduce yourself and share details about your work as well as the subject of your talk.
Each community member can vote for or against a talk. A vote from each member of the Editorial Panel is equivalent to two community votes. Both types of votes will be considered for final speaker selection.
It’s useful to keep a few guidelines in mind while submitting proposals:
Describe how to use something that is available under a liberal open source license. Participants can use this without having to pay you anything.
Tell a story of how you did something. If it involves commercial tools, please explain why they made sense.
Buy a slot to pitch whatever commercial tool you are backing.
Speakers will get a free ticket to both days of the event. Proposers whose talks are not on the final schedule will be able to purchase tickets at the Early Geek price of Rs. 1800.
GlusterFS "Big Data" Interface
Infrastructure for Big-Data processing (drop-in replacement for Hadoop Distributed File System - HDFS)
GlusterFS is an open source, distributed file system capable of scaling to several petabytes and handling thousands of clients. GlusterFS clusters together storage building blocks over InfiniBand RDMA or TCP/IP interconnects.
GlusterFS can also be used as a replacement for HDFS, so that Map/Reduce jobs can run on data residing on it. The GlusterFS Hadoop plugin allows existing Map/Reduce jobs to work seamlessly without any changes. It does this by implementing Hadoop’s FileSystem interface and communicating with GlusterFS via its native protocol (using FUSE).
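To illustrate why existing jobs need no changes, here is a minimal Python sketch of the general idea: a job written only against an abstract file system interface keeps working when the storage backend underneath is swapped. All names here (`FileSystem`, `MountBackedFileSystem`, `word_count`, `demo`) are hypothetical and are not part of Hadoop or the GlusterFS plugin; a temporary directory stands in for a FUSE mount point.

```python
import os
import tempfile
from abc import ABC, abstractmethod
from collections import Counter


class FileSystem(ABC):
    """Hypothetical stand-in for the storage interface a job is written against."""

    @abstractmethod
    def list_files(self, path):
        """Return the logical paths of the files directly under `path`."""

    @abstractmethod
    def read_text(self, path):
        """Return the contents of the file at logical `path`."""


class MountBackedFileSystem(FileSystem):
    """Resolves logical paths under a local mount point (assumed: a FUSE mount)."""

    def __init__(self, mount_point):
        self.mount_point = mount_point

    def _resolve(self, path):
        # Map a logical path such as "/input/a.txt" onto the mounted volume.
        return os.path.join(self.mount_point, path.lstrip("/"))

    def list_files(self, path):
        root = self._resolve(path)
        return sorted(
            os.path.join(path, name)
            for name in os.listdir(root)
            if os.path.isfile(os.path.join(root, name))
        )

    def read_text(self, path):
        with open(self._resolve(path), encoding="utf-8") as handle:
            return handle.read()


def word_count(fs, input_dir):
    """Toy map/reduce-style job that only talks to the FileSystem interface."""
    totals = Counter()
    for path in fs.list_files(input_dir):
        # "Map" step: split each file into words; "reduce" step: sum the counts.
        totals.update(fs.read_text(path).split())
    return dict(totals)


def demo():
    # A temporary directory plays the role of the mounted volume.
    with tempfile.TemporaryDirectory() as mount:
        os.makedirs(os.path.join(mount, "input"))
        samples = [("a.txt", "big data big"), ("b.txt", "data elephant")]
        for name, text in samples:
            with open(os.path.join(mount, "input", name), "w", encoding="utf-8") as f:
                f.write(text)
        return word_count(MountBackedFileSystem(mount), "/input")
```

The job function never mentions the backend: pointing `MountBackedFileSystem` at a different mount point, or passing another `FileSystem` subclass, changes where the data lives without touching `word_count`. This is only an analogy for the plugin approach described above, not a description of its actual code.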
Basic know-how of GlusterFS
Distributed File System
Working knowledge of Hadoop
Venky Shankar works on GlusterFS at Red Hat. He is the team lead for the Replication team and is also responsible for designing and implementing the Hadoop compatibility plugin in GlusterFS. He has about six years of experience in the industry. His interests include systems programming, distributed systems, and big data.