New posts in hadoop

Avro vs. Parquet

Datanode process not running in Hadoop

DIY Hadoop Cluster - Heat & Dust issues?

Can apache spark run without hadoop?

zfs for Hadoop cloud instead of ext4 [closed]

The way to check a HDFS directory's size?

Hive : How to flatten an array?

What is Memory reserved on Yarn

Remove log4j v1 dependency on the existing services in hadoop

connect to host localhost port 22: Connection refused

How does the MapReduce sort algorithm work?

Why are Hadoop and Spark not in the official Ubuntu repositories?

what's the difference between "hadoop fs" shell commands and "hdfs dfs" shell commands?

How to open/stream .zip files through Spark?

How does Hadoop Namenode failover process works?

How to transpose/pivot data in hive?

Moving the SecondaryName Node in a Cloudera HBase Cluster

Calling a mapreduce job from a simple java program

How does Hadoop process records split across block boundaries?

Difference between Hive internal tables and external tables?