Most common action type you will find in oozie workflow is <map-reduce> action type. In this blog we will see how to define a map-reduce action type. the
Your reading schema doesn’t has to be same as that of the writing schema. You can add new fields or remove the existing fields(projection). If a new field
Avro Data Files are portable across platforms. You can read the Data Files written by java program from a python program. Data Files carry the schema with them.In
Avro DataFiles are binary files that carry the schema with them. They are splittable and allows seeking to a random position. You can sync with record boundary . You need
We have already seen how to use Junit to write unit tests for your java classes. There is specialized test suite for testing mapreduce jobs, known as MRUnit.