Home > Apache Hadoop > Apache Hadoop – A powerful folk for big data processing

Apache Hadoop – A powerful folk for big data processing


What Is Apache Hadoop?

The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing.

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-avaiability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-availabile service on top of a cluster of computers, each of which may be prone to failures.

Apache Hadoop Homepage: http://hadoop.apache.org/

Hadoop Wiki: http://wiki.apache.org/hadoop/

Apache Hadoop Wikipedia: http://en.wikipedia.org/wiki/Apache_Hadoop

Yahoo! Hadoop Tutorial: http://developer.yahoo.com/hadoop/tutorial/

Cloudera: http://www.cloudera.com/what-is-hadoop/

 

Guide for installation:

Running Hadoop On Ubuntu Linux (Single-Node Cluster)

Running Hadoop On Ubuntu Linux (Multi-Node Cluster)

 

Cheers,

Pete Houston

Advertisements
Categories: Apache Hadoop Tags: , ,
  1. No comments yet.
  1. No trackbacks yet.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: