Big Data Crash Course
Overview
About the Instructor (2:34)
Course Structure and Approach (3:14)
Course Pre-requisites (2:48)
Course Outcomes (2:08)
Environment Setup
Google Cloud Account Setup (1:39)
Creating a Dataproc Cluster (17:42)
GCP Account Best Practices (2:55)
Twitter Developer Account Setup (3:31)
Data Files
Getting Started with Big Data Journey
Definition of Big Data (10:20)
Data Lake Overview (6:03)
Key Roles in a Big Data Science Project (16:56)
Big Data Logical Architecture (20:00)
Typical Big Data Pipeline (6:00)
Hadoop Overview (9:20)
Bonus for non-Java participants: Demystifying JVM vs JDK vs JRE (6:06)
Hadoop Filesystem
HDFS Overview (6:16)
Small FS vs HDFS (3:12)
HDFS Architecture (11:23)
Hands-on: HDFS (11:32)
Distributed Processing using MapReduce and Beyond
Introduction to MapReduce (5:07)
Logical & Physical Architecture of MR (15:51)
YARN (Distributed OS) (7:08)
YARN Architecture (14:55)
Hands-on: Spark on YARN (12:10)
Data Persistence in Big Database
Why NoSQL? RDBMS USPs and Limitations (6:58)
Polyglot Persistence (6:36)
Why HBase and Limitations (10:50)
HBase Terms (9:42)
HBase Physical Storage (4:38)
HBase Architecture (16:58)
Hands-on: Installing HBase on Dataproc Cluster (5:06)
Hands-on: Installing Confluent Kafka (9:09)
Hands-on: ksqlDB Troubleshooting (2:46)
Hands-on: HBase (8:12)
Data Ingestion using Sqoop
Sqoop Overview (7:06)
Sqoop Architecture (8:51)
Sqoop Installation (11:49)
Hands-on: Sqoop (5:12)
Data Analysis using Hive & Impala
Hive Introduction (8:11)
Hive Architecture (6:54)
Impala Overview (6:52)
Impala Architecture (12:27)
Text vs Binary Data Formats (3:29)
Avro Format (5:16)
Hands-on: Hive (16:19)
Hands-on: Sqoop + Hive Integration (9:27)
Hands-on: Schema Evolution (18:53)
Data Processing using Spark
Spark Overview (6:55)
Spark Logical Architecture (7:17)
Spark Physical Architecture (14:12)
Spark Core vs Spark SQL (4:21)
Spark Execution Modes (8:51)
Hands-on: Spark on Jupyter (13:08)
Streaming Events through Kafka
Introduction to Apache Kafka (9:24)
Evolution of Kafka (4:33)
Why Kafka? (12:06)
Apache Kafka vs Confluent Kafka (7:09)
Kafka Architecture (20:41)
Hands-on: Kafka Console Producer Consumer (6:38)
Building Dataflows using NiFi
NiFi Overview (10:56)
NiFi Use Cases (3:17)
NiFi Limitations (3:18)
NiFi Components and Architecture (9:12)
Hands-on: NiFi Installation on GCP (7:11)
Hands-on: Twitter Data Ingestion Using NiFi Part 1 (15:09)
Hands-on: Twitter Data Ingestion Using NiFi Part 2 (14:01)
Hands-on: Twitter Data Ingestion Using NiFi Part 3 (4:40)
Epilogue
Conclusion (0:39)
HBase Architecture