New posts in mapreduce

Hadoop speculative task execution

hadoop mapreduce

How to get the input file name in the mapper in a Hadoop program?

hadoop mapreduce

MultipleOutputFormat in Hadoop

java hadoop mapreduce

Is gzip format supported in Spark?

java scala mapreduce gzip apache-spark

Find all duplicate documents in a MongoDB collection by a key field

mongodb mapreduce duplicates aggregation-framework

Data block size in HDFS, why 64MB?

database hadoop mapreduce block hdfs

Integration testing Hive jobs

java testing hadoop mapreduce hive

Reading file as single record in Hadoop

java hadoop mapreduce

Why is the final reduce step extremely slow in this MapReduce? (HiveQL, HDFS MapReduce)

performance hive mapreduce hdfs reduce

Kotlin - How to convert a list of objects into a single one after map operation?

kotlin functional-programming mapreduce reduce

Hadoop Word count: receive the total number of words that start with the letter "c"

java hadoop mapreduce

Hadoop FileAlreadyExistsException: Output directory hdfs://<namenode public dns>:9000/input already exists

ubuntu hadoop mapreduce

Remove Duplicates from MongoDB

mongodb mapreduce mongodb-query aggregation-framework

What is Hive: Return Code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask

hadoop mapreduce hive

Hadoop DistributedCache is deprecated - what is the preferred API?

java hadoop mapreduce

Count lines in large files

linux mapreduce

MongoDB Stored Procedure Equivalent

stored-procedures mongodb geolocation mapreduce

Oozie: Launch Map-Reduce from Oozie <java> action?

java hadoop mapreduce oozie avro

Hadoop truncated/inconsistent counter name

java hadoop mapreduce hadoop-yarn

MongoDB aggregation comparison: group(), $group and MapReduce

mongodb mapreduce mongodb-query aggregation-framework