Tuesday, September 24, 2019

what is hadoop



Hadoop is one of the first popular open source big data technologies. It is a scalable
fault-tolerant system for processing large datasets across a cluster of commodity hardware.

Internal components:
HDFS & YARN with Mapreduce

No comments: