Apache Hadoop is a collection of open-source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model.
The main components of the Hadoop ecosystem are as follows.
The Hadoop Distributed File System (HDFS) is the core storage component, or the backbone, of the Hadoop ecosystem.
YARN (Yet Another Resource Negotiator) is the brain of the Hadoop ecosystem. It manages all processing activities by allocating resources and scheduling tasks.
MapReduce is a software framework for writing applications that process large data sets using distributed and parallel algorithms inside the Hadoop environment.
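To make the programming model concrete, here is a minimal sketch of the classic word-count job written in plain Python. It only imitates the map, shuffle, and reduce phases in a single process and does not use Hadoop itself; the function names (map_phase, shuffle_phase, reduce_phase, word_count) are illustrative assumptions, not part of any Hadoop API.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key (the word)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reducer: combine all counts for one word into a single total."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence on a list of text lines."""
    # In a real cluster each line (or block of lines) would be mapped
    # on a different node; here everything runs in one process.
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["Hadoop stores data", "Hadoop processes data"]
    print(word_count(sample))
```

In a real Hadoop job, the framework would run many mappers and reducers in parallel across the cluster and handle the shuffle between them; the sketch keeps only the logic that the application author would write.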
Pig has two parts: Pig Latin, the language, and the Pig runtime, the execution environment. The relationship is comparable to that between Java and the JVM.
Facebook created Hive for people who are fluent in SQL, so Hive makes them feel at home while working in the Hadoop ecosystem.
HBase is an open-source, non-relational, distributed database. In other words, it is a NoSQL database.
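As a rough illustration of what "non-relational" means here, the sketch below models an HBase-style table in plain Python: each value is addressed by a row key and a "family:qualifier" column, rows may hold different columns, and the newest version of a cell is the one returned. The class ToyTable and its methods are hypothetical and exist only for this example; this is not the HBase client API.

```python
import time


class ToyTable:
    """In-memory toy imitating a sparse, versioned, column-family table."""

    def __init__(self, column_families):
        # Column families are fixed up front; qualifiers inside them are free-form.
        self.column_families = set(column_families)
        self.rows = {}  # row_key -> {"family:qualifier": [(timestamp, value), ...]}

    def put(self, row_key, column, value, timestamp=None):
        """Store one version of a cell under row_key and 'family:qualifier'."""
        family, _, _qualifier = column.partition(":")
        if family not in self.column_families:
            raise KeyError(f"unknown column family: {family}")
        ts = timestamp if timestamp is not None else time.time_ns()
        self.rows.setdefault(row_key, {}).setdefault(column, []).append((ts, value))

    def get(self, row_key, column):
        """Return the newest value of a cell, or None if the cell is absent."""
        versions = self.rows.get(row_key, {}).get(column)
        if not versions:
            return None
        return max(versions, key=lambda version: version[0])[1]


if __name__ == "__main__":
    users = ToyTable(["info"])
    users.put("user1", "info:name", "Ada", timestamp=1)
    users.put("user1", "info:name", "Ada L.", timestamp=2)
    users.put("user2", "info:email", "bob@example.com", timestamp=1)
    print(users.get("user1", "info:name"))
    print(users.get("user2", "info:name"))
```

Unlike a relational table, no schema forces every row to carry the same columns, so user2 simply has no info:name cell.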
The following course contents are offered for Big Data / Apache Hadoop:
Download Apache Hadoop course plan