GlusterFS "Big Data" Interface

Jul 2012

23 Mon

24 Tue

25 Wed

26 Thu

27 Fri 09:30 AM – 05:30 PM IST

28 Sat 09:30 AM – 05:00 PM IST

29 Sun

Nimhans Convention Centre, Bangalore

This submission has been added to the schedule

Submitted Jul 17, 2012

Section: Big Data Infrastructure & Processing
Technical level: Intermediate
Session type: Demo

Infrastructure for Big-Data processing (drop-in replacement for Hadoop Distributed File System - HDFS)

Outline

GlusterFS is an open source, distributed file system capable of scaling to several petabytes and handling thousands of clients. GlusterFS clusters together storage building blocks over InfiniBand RDMA or TCP/IP interconnects.

GlusterFS can also be used as a replacement for HDFS, so that Map/Reduce jobs run directly on data residing on it. The GlusterFS Hadoop plugin allows existing Map/Reduce jobs to work seamlessly without any changes. It does this by implementing Hadoop's FileSystem interface and communicating with GlusterFS via its native protocol (using FUSE).
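
The pattern described above can be illustrated with a small sketch. This is not the plugin's code and does not use Hadoop; it is a toy model under stated assumptions. The names `FileSystem`, `MountBackedFileSystem`, `word_count`, and the `glusterfs:///` URIs are hypothetical, chosen only for illustration. The idea shown is that a job written against an abstract storage interface keeps working when the backend simply maps paths onto a locally mounted directory, which is the role a FUSE mount of a GlusterFS volume would play.

```python
import os
import tempfile
from abc import ABC, abstractmethod
from collections import Counter
from urllib.parse import urlparse


class FileSystem(ABC):
    """Hypothetical storage interface that job code is written against."""

    @abstractmethod
    def write_file(self, uri, data):
        """Store bytes at the given URI."""

    @abstractmethod
    def read_file(self, uri):
        """Return the bytes stored at the given URI."""

    @abstractmethod
    def list_files(self, uri):
        """Return sorted URIs of the files directly under a directory URI."""


class MountBackedFileSystem(FileSystem):
    """Backend that maps URI paths onto a local directory.

    Assumption: in a real deployment the directory would be a FUSE mount
    of a GlusterFS volume; here any local directory works.
    """

    def __init__(self, mount_point):
        self.mount_point = os.path.abspath(mount_point)

    def _to_local(self, uri):
        # Translate the URI path into a path under the mount point.
        rel = urlparse(uri).path.lstrip("/")
        local = os.path.normpath(os.path.join(self.mount_point, rel))
        inside = (local == self.mount_point
                  or local.startswith(self.mount_point + os.sep))
        if not inside:
            raise ValueError("path escapes the mount point: " + uri)
        return local

    def write_file(self, uri, data):
        local = self._to_local(uri)
        os.makedirs(os.path.dirname(local), exist_ok=True)
        with open(local, "wb") as handle:
            handle.write(data)

    def read_file(self, uri):
        with open(self._to_local(uri), "rb") as handle:
            return handle.read()

    def list_files(self, uri):
        local = self._to_local(uri)
        names = sorted(
            name for name in os.listdir(local)
            if os.path.isfile(os.path.join(local, name))
        )
        return [uri.rstrip("/") + "/" + name for name in names]


def word_count(fs, input_uri):
    """A job that depends only on the FileSystem interface, not the backend."""
    counts = Counter()
    for file_uri in fs.list_files(input_uri):
        counts.update(fs.read_file(file_uri).decode("utf-8").split())
    return dict(counts)


if __name__ == "__main__":
    # A temporary directory stands in for the mounted volume.
    with tempfile.TemporaryDirectory() as mount:
        fs = MountBackedFileSystem(mount)
        fs.write_file("glusterfs:///input/part-0", b"big data on big storage")
        print(word_count(fs, "glusterfs:///input"))
```

Because `word_count` never touches local paths itself, swapping in a different `FileSystem` implementation would not require changing the job, which is the property the plugin relies on.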

Requirements

Basic know-how of GlusterFS
Distributed File System
Working knowledge of Hadoop
UNIX

Speaker bio

Venky Shankar works on GlusterFS at Red Hat. He is a team lead for the Replication team and is also responsible for designing and implementing the Hadoop compatibility plugin in GlusterFS. He has about six years of experience in the industry. His interests include systems programming, distributed systems, and big data.

Hosted by

The Fifth Elephant

The Fifth Elephant 2012
