Skip to end of metadata
Go to start of metadata

This job will use a Merge transform to ensure it is not pushed down to the Hadoop cluster in case you haven't setup Pig yet.

  1. Create a new HDFS file format pointing to a file in your HDFS.
    • Change the Type if the file to be read isn’t delimited.
    • Specify a Name.
    • Specify the File name(s) to where your file resides in your HDFS using a full path.
  2. Connect your HDFS file format as a source to a Merge transform.
  3. Create a new HDFS file format pointing to a new file in your HDFS.
    • Change the Type if the file to be written isn’t delimited.
    • Specify a Name.
    • Specify the File name(s) for the new file where it should reside in your HDFS using a full path.
  4. Connect your HDFS file format as a target for the Merge transform.
  5. Run the job.
  6. View the new file in your HDFS using the Hadoop CLI to verify it contains the contents of your input file.
  • No labels