DataStage: Prepare the data source files for Cloud Storage connection

Prepare the data source files and store them in your cloud-based storage system. The files must be in one of the following:

  • An AWS S3 bucket.
  • An Azure Data Lake Storage container.
  • A Google Cloud Storage bucket.
  1. Export the DataStage project files (DSX) for which you want to create a technical lineage. Exclude executables when you export the files.

    Tip You can either export a DataStage project manually or automatically via command line.
  2. Store the DataStage files in your cloud-based storage system.
  3. Optionally, if your DataStage project uses environment variables, manually export the environment files (ENV).

  4. Give the environment files the same name as the DataStage project files. For example, if your project file is named datastage-project-1.dmx, name your environment file datastage-project-1.env.

  5. Store the environment files in the same bucket or container in your cloud-based storage system.

    Important  
    • CollibraData Lineage only supports DSX and ENV files.
    • You can have one DSX file per DataStage project.
    • You can have more than one DSX file in the bucket or container in your cloud-based storage system.
    • You can have one or none ENV file per DSX file.
    • The name of the DSX file and the ENV file has to be the same.

What's next

Add the Technical Lineage for DataStage (Cloud) capability for Cloud Storage connections.