Some of the Hadoop projects we will talk about are:
HDFS : A distributed filesystem that runs on large clusters of commodity machines.
Map Reduce: A distributed data processing model and execution environment that runs on large clusters of commodity machines.
Pig: A data flow language and execution environment for exploring very large datasets. Pig runs on HDFS and MapReduce clusters.
HBase: A distributed, column-oriented database. HBase uses HDFS for its underlying storage, and supports both batch-style computations using MapReduce and point queries (random reads).
ZooKeeper: A distributed, highly available coordination service. ZooKeeper provides primitives such as distributed locks that can be used for building distributed applications.
Oozie: Oozie is a workflow scheduler system to manage Apache Hadoop jobs.