Hadoop
Course highlights :
Indepth personalized training
Extensive curriculum
World class lab ( Development on standalone system and
distributed clusters)
Real time scenarios
Indepth Documentation with real time references for
reference.
Hadoop course content:
• Introduction to Distributed systems
High Availability
Scaling
Advantages
• Introduction to Big Data
Big Data opportunities
Big Data Challenges
• Introduction to Hadoop
Hadoop Distributed File System
Hadoop Architecture
Map Reduce & HDFS
• Hadoop Eco Systems
Introduction to Pig
Introduction to Hive
Introduction to HBase
Other eco system Map
• Hadoop Administration
Hadoop Installation & Configuration
Setting up Standalone system
Setting up pseudo distributed cluster
Setting up distributed cluster
• The Hadoop Distributed File System (HDFS)
HDFS Design & Concepts
Blocks, Name nodes and Data nodes
Hadoop DFS The Command-Line Interface
Basic File System Operations
Reading Data from a Hadoop URL
Reading Data Using the File System API
• Map Reduce
Map and Reduce Basics.
How Map Reduce Works
Anatomy of a Map Reduce Job Run
Job Submission, Job Initialization, Task Assignment, Task Execution
Progress and Status Updates
Job Completion, Failures
Shuffling and Sorting.
Combiner
Hadoop Streaming
• Map/Reduce Programming - Java
Hands on "Word Count" in Map/Reduce in Eclipse
Sorting files using Hadoop Configuration API discussion
Emulating "grep" for searching inside a file in Hadoop
Chain Mapping API discussion
Job Dependency API discussion and Hands on
Input Format API discussion and hands on
Input Split API discussion and hands on
Custom Data type creation in Hadoop
Discussion on some business use cases
Ecosystem includes the below :
Hive
Pig
HBase
Sqoop
Flume
Oozie
Call Us : 0801-987-1850
Mail Us :infoprofessionalonlinetraining@gmail.com
No comments:
Post a Comment