Architecture of Apache HBase Balancer
Submitted Apr 4, 2025
Topic of your submission:
Distributed data systems
Type of submission:
30 mins talk
I am submitting for:
Rootconf Annual Conference 2025
Description
Apache HBase is an open-source, non-relational distributed database modeled after Google’s Bigtable and written in Java. It began as part of the Apache Hadoop project, is now a top-level Apache Software Foundation project, and runs on top of HDFS, providing Bigtable-like capabilities for Hadoop.
In a distributed database, a load balancer is the component that spreads load across the nodes so that resource utilisation stays within what the configuration allows. Its role depends on the architecture of the database: it can be a stateless layer in front of the data that routes traffic to the server nodes, or each database server process can itself route requests and so act as the load balancer.
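To make the idea of spreading load across nodes concrete, here is a minimal sketch of a count-based balancer. It is an illustration only and is not how the HBase balancer is implemented: the function name `balance_by_count`, the server and region names, and the choice of region count as the only load signal are all assumptions made for this example.

```python
from typing import Dict, List, Tuple


def balance_by_count(assignment: Dict[str, List[str]]) -> List[Tuple[str, str, str]]:
    """Plan (region, source, destination) moves that even out region counts.

    Hypothetical toy balancer: load is measured purely by the number of
    regions hosted on each server, which is an assumption of this sketch.
    """
    # Work on a copy so the caller's view of the cluster is left untouched.
    state = {server: list(regions) for server, regions in assignment.items()}
    moves: List[Tuple[str, str, str]] = []
    if not state:
        return moves

    while True:
        busiest = max(state, key=lambda s: len(state[s]))
        idlest = min(state, key=lambda s: len(state[s]))
        # Stop once no server holds more than one region above any other.
        if len(state[busiest]) - len(state[idlest]) <= 1:
            break
        region = state[busiest].pop()
        state[idlest].append(region)
        moves.append((region, busiest, idlest))
    return moves
```

For a cluster where one server hosts five regions, a second hosts one and a third hosts none, this sketch plans three moves and leaves two regions on each server. A production balancer would weigh more than counts, for example request rate or data locality, and would also limit how many regions move at once.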
This talk will cover:
- What balancers in databases are, why they are needed, and how they are used
- Architecture of Apache HBase Load Balancer
- Performance aspects of the HBase Load Balancer and its tunable configurations, with real-world performance examples
Takeaways
- Internals of how the Apache HBase load balancer works, and which aspects of it can be tuned for optimal performance
- General knowledge of how to design distributed systems and tunable configurations that cater to different kinds of use cases
Audience
- Engineers who are interested in understanding the internals of databases like Apache HBase
- Engineers interested in learning how to design distributed systems and tunable configurations that cater to different kinds of use cases
Bio
I work on the Flipkart Database Team, where my primary responsibility is leading the technical side of Apache HBase.