New posts in hadoop

Avro vs. Parquet

hadoop avro parquet

Datanode process not running in Hadoop

hadoop configuration process

DIY Hadoop Cluster - Heat & Dust issues?

hardware cluster hadoop physical-environment

Can apache spark run without hadoop?

hadoop amazon-s3 apache-spark mapreduce mesos

zfs for Hadoop cloud instead of ext4 [closed]

zfs ext4 hadoop

The way to check a HDFS directory's size?

hadoop command-line directory hdfs

Hive: How to flatten an array?

java sql hadoop hive hiveql

What is Memory reserved on Yarn

hadoop apache-spark hadoop-yarn hadoop2

Remove log4j v1 dependency on the existing services in hadoop

connect to host localhost port 22: Connection refused

linux hadoop ssh

How does the MapReduce sort algorithm work?

algorithm sorting parallel-processing hadoop mapreduce

Why are Hadoop and Spark not in the official Ubuntu repositories?

package-management cloud hadoop official-repositories

What's the difference between "hadoop fs" shell commands and "hdfs dfs" shell commands?

How to open/stream .zip files through Spark?

hadoop apache-spark

How does the Hadoop Namenode failover process work?

hadoop hdfs hadoop2 failover namenode

How to transpose/pivot data in hive?

Moving the Secondary NameNode in a Cloudera HBase Cluster

hadoop hbase cloudera

Calling a mapreduce job from a simple java program

java hadoop mapreduce

How does Hadoop process records split across block boundaries?

hadoop split mapreduce block hdfs

Difference between Hive internal tables and external tables?

hadoop hive hiveql