Introduction to Apache HBase

This engaging hands-on course introduces HBase, a NoSQL store on top of Hadoop. The course is intended for developers who will be using HBase to develop applications as well as administrators who will manage HBase clusters.
The instructor will walk participants through HBase architecture, data modeling and application development on HBase. Other subjects covered include using MapReduce with HBase and administration topics related to performance optimization. A significant portion of class time will be dedicated to interactive labs.

Apache Kafka for Software Developers

Apache Kafka is a publish-subscribe messaging rethought as a distributed commit log, capable of handling hundreds of megabytes of reads and writes per second from thousands of clients on each broker. A Kafka cluster can serve as the central data backbone for a large organization.

Big Data Overview

Participants of this course will learn the theory and practice of Big Data. Upon completion they will be able to create data-driven applications that make modern enterprise tick. This course contains an ideal mix of theory and practice, with the theory portion illustrated by hands-on examples and accomplished with industry-leading tools.

Machine Learning with Apache Spark

This course teaches machine learning at scale with the popular Apache Spark framework.
No machine learning knowledge is assumed. For each machine learning concept, we first discuss the foundations, its applicability and limitations. Then we explain the implementation and use, covering specific use cases. This is achieved through a combination of 50 percent lecture, 50 percent lab work.
Please note that this course does not include in-depth coverage of the maths/stats behind machine learning.
This course is taught using Spark and Python.

Introduction to Apache Spark

This course will familiarize participants with Apache Spark, explaining how Spark fits into the big data ecosystem and how to use Spark for data analysis. Topics covered will include Spark shell for interactive data analysis, Spark internals, Spark APIs, Spark SQL, Spark streaming, machine learning and GraphX.