Skip to end of metadata
Go to start of metadata
  1. Create a new HDFS Delimited file format pointing to a file in your HDFS.
    • Specify a Name.
    • Specify the File name(s) to where your file resides in your HDFS.
  2. Connect your HDFS file format as a source to a Query transform.
  3. Configure the Query transform to substring Field1 to 20 characters. If you changed the input file format schema, use your own varchar field.
  4. Create a new HDFS file format pointing to a new file in your HDFS
    • Change the Type if the file to be written isn’t delimited.
    • Specify a Name.
    • Specify the Root directory to where your new file should reside in the HDFS.
    • Specify the File name(s) for the new file.
  5. Connect your HDFS file format as a target for the Query transform.
  6. Run the job.
  7. View the new file in your HDFS using the Hadoop CLI to verify it contains the manipulated contents of your input file.
  • No labels