This course introduces HBase – a NoSQL store on top of Hadoop. The course is intended for developers who will be using HBase to develop applications, and administrators who will manage HBase clusters.
Overview
The course walks developers through HBase architecture, data modelling, and application development on HBase. It also covers using MapReduce with HBase and some administration topics related to performance optimization. The course is very hands-on, with lots of lab exercises.
Course Outline
Section 1: Introduction to Big Data & NoSQL
Big Data ecosystem
NoSQL overview
CAP theorem
When is NoSQL appropriate
Columnar storage
HBase and NoSQL
Section 2 : HBase Intro
Concepts and Design
Architecture (HMaster and Region Server)
Data integrity
HBase ecosystem
Lab : Exploring HBase
Section 3 : HBase Data model
Namespaces, Tables and Regions
Rows, columns, column families, versions
HBase Shell and Admin commands
Lab : HBase Shell
Section 3 : Accessing HBase using Java API
Introduction to Java API
Read / Write path
Time Series data
Scans
MapReduce
Filters
Counters
Co-processors
Labs (multiple) : Using the HBase Java API to implement time series, MapReduce, filters and counters
Section 4 : HBase schema Design : Group session
students are presented with real world use cases
students work in groups to come up with design solutions
discuss / critique and learn from multiple designs
Labs : implement a scenario in HBase
Section 5 : HBase Internals
Understanding HBase under the hood
MemStore / HFile / WAL
HDFS storage
Compactions
Splits
Bloom Filters
Caches
Diagnostics
Section 6 : HBase installation and configuration
hardware selection
install methods
common configurations
Lab : installing HBase
Section 7 : HBase eco-system
developing applications using HBase
interacting with the rest of the Hadoop stack (MapReduce, Pig, Hive)
frameworks around HBase
advanced concepts (co-processors)
Labs : writing HBase applications
Section 8 : Monitoring And Best Practices
monitoring tools and practices
optimizing HBase
HBase in the cloud
real world use cases of HBase
Labs : checking HBase vitals
Requirements
comfortable with the Java programming language
comfortable in a Linux environment (navigate the Linux command line, edit files with vi / nano)
A Java IDE like Eclipse or IntelliJ
Lab environment
A working HBase cluster will be provided for students. Students will need an SSH client and a browser to access the cluster.
Zero Install : There is no need to install HBase software on students’ machines!
NobleProg is an international training and consultancy group, delivering high quality courses to every sector, covering: Cyber Security, Artificial Intelligence, IT, Management, Applied Statistics.
Over the last 17 years, we have trained more than 50,000 people from over 6,000 companies and organisations.
Our courses include classroom (both public and closed) and instructor-led online formats, giving you choice and flexibility to suit your time, budget and level of expertise.
We practice what we preach – we use a great deal of the technologies and methods that we teach, and continuously upgrade and improve our courses, keeping up to date with all the latest developments.
Our trainers are hand-picked and have been through rigorous checks and interviews, and all courses are evaluated by delegates, ensuring continuous feedback and improvement.