Session No. | Topic |
1 | Hadoop Foundation |
Linux: File Handling, | |
Text Processing, System Administration, | |
Process Management, | |
File Systems, | |
Advanced Commands. | |
2 | Java: Introduction |
Oops concept | |
Object-Class-Inheritance-Polymorphism-Abstraction-Encapsulation | |
3 | String and String Methods |
Exception Handling | |
4 | Collections:(List Map Set) |
Iterator interface, | |
List | |
Map | |
Set | |
5 | Serialization , Deserialization. |
6 | Hadoop Cluster Administration |
Introduction to Big Data, Hadoop Architecture, MapReduce Framework, | |
A typical Hadoop Cluster, Data Loading into HDFS, Hadoop Cluster Administrator, | |
Roles and Responsibilities | |
7 | Hadoop Architecture and Cluster setup |
Hadoop server roles and their usage, Rack Awareness, Anatomy of Write and | |
Read, Replication Pipeline, Data Processing, Hadoop Installation and Initial | |
Configuration, Deploying Hadoop in pseudo-distributed mode, deploying | |
a multinode Hadoop cluster, Installing Hadoop Clients | |
8 | Hadoop Cluster: Planning and Managing |
Planning the Hadoop Cluster, Cluster Size, Hardware and Software | |
considerations, Managing and Scheduling Jobs, types of schedulers in Hadoop, | |
Configuring the schedulers and run MapReduce jobs, Cluster Monitoring and | |
Troubleshooting. | |
9 | Backup, Recovery and Maintenance |
Configure Rack awareness, Setting up Hadoop Backup | |
data nodes in a cluster, setup quota's, upgrade Hadoop cluster, copy data across | |
clusters , Commissioning and Decommisisioning Data Nodes | |
10 | Advanced Topics: High Availability and Federation |
Configuring Secondary NameNode, Hadoop 1.0, YARN framework, MR, | |
Cluster setup, Deploying Hadoop 1.0 in pseudo-distributed mode, | |
Deploying a multi-node Hadoop 2.0 cluster. | |
11 | Hadoop Implementation |
Installation in Hadoop in Distributed Mode | |
Implementation Modes and running in Distributed mode | |
12 | Hadoop MapReduce Framework |
MapReduce Use Cases, Traditional way Vs MapReduce way, Why | |
MapReduce, Hadoop 1.x MapReduce Architecture, Hadoop 1.x MapReduce | |
Components, YARN MR Application Execution Flow, YARN Workflow, Anatomy of | |
MapReduce Program. | |
13 | Input Splits, Relation between Input Splits and HDFS Blocks, MapReduce Job |
Submission Flow, MapReduce: Combiner & Partitioner | |
Counters, Distributed Cache, MRunit, Reduce Join, Custom Input Format, | |
Sequence Input Format. | |
14 | Pig |
About Pig, MapReduce Vs Pig, Pig Use Cases, Programming Structure in Pig, | |
Pig Running Modes, Pig components, Pig Execution, Pig Latin Program, Data | |
Models in Pig, Pig Data Types-primitive and Scalar. | |
Pig Latin : Arithmetic Operators ,Boolean Operators, Cast Operators, Comparison | |
Operators , Type Construction Operators ,Dereference Operators ,Disambiguate | |
Operator Flatten Operator , | |
15 | File and Storage Functions, Group Operator, cogroup |
Operator, Joins and COGROUP, Union | |
Advance PigLatin: Processing Unstructured data in Pig - Word Count Example, | |
16 | Hive |
Hive Background, Hive Use Case, About Hive, Hive Vs Pig, Hive Architecture | |
and Components, Metastore in Hive, Limitations of Hive, Comparison with | |
Traditional Database, Hive Data Types and Data Models | |
17 | Hive QL: DML:Hive Tables, Types of Tables(Managed Tables and External Tables), |
Importing Data, Loading Data in to Hive, Insert Data in from Hive Queries, | |
Export Data, Querying Data, Managing Outputs, Hive Scripts. | |
18 | Advanced Hive QL: DynamicPartitioning, Adding Partition on External Table, |
Bucketing, Views, Indexing, File Formats in Hive | |
Sampling, | |
19 | BlockSampling, Joins In Hive, Join Optimization, Bucket Optimization, |
Custom User Defined Functions. | |
20 | Introduction to No SQL Databases |
Introduction to NoSQL Databases and HBase, HBase v/s RDBMS, HBase Components, | |
HBase Architecture, HBase Cluster Deployment. | |
21 | HBase Data Model, HBase Shell, HBase Client |
API, Data Loading Techniques. | |
22 | Overview on Hadoop Eco System Tools |
Sqoop Components workflow | |
23 | Flume Components workflow |
.
Read More Read Less