HADOOP

Hadoop is used mainly for handling big data. Much of that data is unstructured: log files, audio, video, images, emails, books, documents, and so on. In the real world, many sources generate big data every day, and most client projects require big data handling.

Length: 60 Hours

Introduction to Hadoop

What is Big Data?

What is Hadoop?

Challenges With Big Data

Comparison With Other Technologies

Components of the Hadoop Ecosystem

HDFS

Significance of HDFS in Hadoop

Features of HDFS

Storage aspects of HDFS

HDFS Architecture

Accessing HDFS

MapReduce

Why MapReduce Is Essential in Hadoop

Processing Daemons of Hadoop

Input Split

Life Cycle

Data Types

Driver Code

Mapper Code

Reducer Code

Input Format in MapReduce

Output Format in MapReduce

MapReduce API

Combiner

Partitioner

Oozie

YARN

Hive

Pig

HBase

ZooKeeper

MySQL

Sqoop

MongoDB

Scala

Spark

Hue