0% found this document useful (0 votes)

298 views13 pages

Importing Sequential Files in IBM Server

This document discusses importing sequential files into IBM Information Server for data lineage analysis. It describes how to import sequential files from the DataStage and QualityStage designer, which creates a table definition representing the file structure. It recommends publishing the table definition as a shared table so the metadata workbench can analyze dependencies on the file. The metadata workbench can then link files between jobs based on file name and location.

Uploaded by

Karthic Vijay D

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

298 views13 pages

Importing Sequential Files in IBM Server

Uploaded by

Karthic Vijay D

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

IMPORTING SEQUENTIAL FILES

INFORMATION SERVER V8.7

Prepared by: March Haber, march@il.ibm.com

Last Updated: January, 2012
IBM MetaData Workbench Enablement Series

Table of Contents:

Table of Contents:................................................................................................................... 2
Introduction............................................................................................................................... 3
Objective ................................................................................................................................... 3
Working with Sequential Files.................................................................................................. 4
Data Files .................................................................................................................................. 5
Metadata Workbench Data Lineage Analysis........................................................................... 5
Importing Sequential Files From the DataStage and QualityStage Designer.................... 6
Publish the Sequential File Definition From the DataStage and QualityStage Designer .. 8
Modify the Data File From the DataStage and QualityStage Designer........................... 10
Deleting Data Files - From the Metadata Asset Manager....................................................... 11
Synchronizing Published Sequential Files.............................................................................. 12
Summary ................................................................................................................................. 13

IBM Confidential Information Page 2 of 13

IBM MetaData Workbench Enablement Series

Introduction

Data is at the core of our business and therefore proper management and understanding of such is essential.
Furthermore, data remains at the hub of the IBM Information Server. Whether it is the ETL processes of
DataStage and QualityStage, profiling by Information Analyzer or definition and understanding by Business
Glossary and MetaData Workbench, all require and reference a data source.

For purposes of identification and re-use, it becomes imperative that care be taken in how data structures are
imported into the IBM Information Server. Several mechanisms for import exist, however in all cases the
imported structure can be accessed and utilized by any of the IBM Information Server applications.

Objective
Learn how to import sequential files or complex flat file structures, displaying and re-using such structures
within the IBM Information Server applications.

For users of IBM DataStage and QualityStage:

ETL Developers will often as parts of their development cycle create Table Definitions. These definitions are
viewed as templates or user-defined structures, often identical to an actual physical source. They shortcut the
development process, but due to their flexible nature and disassociation from the physical source, they are not
considered for Lineage within the IBM MetaData Workbench.

IBM Confidential Information Page 3 of 13

IBM MetaData Workbench Enablement Series

Working with Sequential Files

Sequential Files are often the source of a DataStage and QualityStage Job or provide Lookup Data for such
Jobs. Such files may be imported into the IBM Information Server for purposes of analyzing and
understanding their dependencies and usages.

Sequential Files are imported from the DataStage and QualityStage Designer, by invoking the Sequential File
Definitions import or the ODBC Connector. When complete, this import process creates a Table Definition
representing the structure of the Sequential File, including Column Definitions and their datatypes.

It is recommended to publish the created Table Definition as a Shared Table. This will allow the IBM
MetaData Workbench analysis services to report on the data dependencies of the Sequential File, including
searching on the File and viewing the DataStage Jobs which read or write from the File.

It is not required to import Sequential Files to facilitate Data Lineage analysis within the Metadata
Workbench. The Metadata Workbench will link DataStage File Stages, from different Jobs, together, when
one Stage is reading and the other is writing to an identical File. An identical file is determined by the
defined file name and location of the DataStage Stage.

File Name: #FileDir#/DataFile.txt

IBM Confidential Information Page 4 of 13

IBM MetaData Workbench Enablement Series

Data Files

Data Files further may be assigned a Business Term, Business Label or a Data Steward via InfoSphere
Business Glossary or InfoSphere MetaData Workbench, in addition to allowing the authoring of its
Description or Business Name.

Data Files and Relational Databases are collectively referred as Implemented Data Resources, within the
Information Server.

When published, the following components and relationships are captured and defined:

Data File A user-defined structure defined during import

Data File Structure The component of the File, defined during publication

Data File Field The fields of the File, defined during import

Metadata Workbench Data Lineage Analysis

Data Lineage Analysis reports include the display of Data Files, as the Source or Target of a DataStage Job.

The Data File created by the publication method, must reflect the fully qualified file name and location
defined within the DataStage File Stages.

When the file name or location includes Job Parameters or Environment Variables, those are replaced with
their Default Values when evaluating Design Metadata, and with their Runtime Values when evaluating
Operational Metadata.

Note the example below the defined file is evaluated as such:

InputData (EWS_ProductionSourceStaging) connected to Datafile:

C:/EWS/Prod/AmericaProd.txt,(found physical table)

For more information please refer to the Metadata Workbench Administration Guide.

IBM Confidential Information Page 5 of 13

IBM MetaData Workbench Enablement Series

Importing Sequential Files From the DataStage and QualityStage Designer

Sequential File Definition import wizard:

Launch the DataStage and QualityStage client.

Select the menu item Import | Table Definitions | Sequential File Definitions. The Import dialog
displays.

o Select the Directory containing the Sequential File or Complex Flat File to import.
o Select the File, from the list of displayed files, to be imported.
o Set the DataStage Project Folder to contain the Table Definition to be created by the import
process.
o Click Import. The Define Sequential MetaData dialog appears.

o Choose the appropriate delimiters and options for the File.

o Click Preview. Ensure the data preview correctly displays the columns of the File to be
imported.

IBM Confidential Information Page 6 of 13

IBM MetaData Workbench Enablement Series

o Click the Define Tab. Set the Column Names, SQL Type, Length, Description and other
properties as appropriate.

o Click OK to complete the import process. The import process creates a Table Definition
within the current DataStage Project. Table Definitions are specific to the DataStage Project
and are not included in the display of Data Files from the Metadata Workbench or Business
Glossary.
Select another File for import, or click Close to close the Import MetaData dialog.

A Table Definition
The Table Definition which has been created is identified as a Sequential File. It may be necessary to
view or edit the Table Definition properties to ensure the Locator Table Type indicated Sequential.

IBM Confidential Information Page 7 of 13

IBM MetaData Workbench Enablement Series

Publish the Sequential File Definition From the DataStage and QualityStage Designer
Shared Table Creation Wizard to publish a Table Definition as a Data File:

Launch the DataStage and QualityStage client.

From the DataStage Repository viewer, browse and select newly created Table Definition
representing the imported Sequential File.
Optional: Double-click the Table Definition to view the Table Definition details. Ensure the locator
defining the type of Table Definition is set to Sequential.

From the DataStage Repository viewer, select and Right-Click the Table Definition. Select Shared
Table Creation Wizard from the menu. The Shared Table Creation Wizard dialog appears.

IBM Confidential Information Page 8 of 13

IBM MetaData Workbench Enablement Series

o Select the Table Definition, click Next.

o Define the identity parameters of the Table Definition to be published. Identity parameters
include the Host System and Path of the Data File.
Select Create New from the list of Association types.
From the Create New Table dialog, select an existing Host System or type the name
of a new Host System to be created. The host system must reflect the server, on
which the Sequential File exists.
Enter the complete Directory Path where the Sequential File is located. The
Directory Path is used to uniquely identify the Sequential File and should not
include a final slash (/ or \).
Click OK to confirm the Identity details of the Sequential File.

o Click Next to proceed.

o Click Create to complete the process. A Data File, Data File Structure and Data File Field
Assets are created within the Information Server Metadata Repository. The Table Definition
displayed within the DataStage Project is updated, and bound to the published Data File.

IBM Confidential Information Page 9 of 13

IBM MetaData Workbench Enablement Series

Modify the Data File From the DataStage and QualityStage Designer
Metadata Management

1. Launch the DataStage and QualityStage client.

2. Select the menu item Repository | Metadata Sharing | Management file menu. The Metadata
Sharing dialog opens.
3. Browse and select the newly created Data File
4. Click Repository | Edit to edit the Data File
The name must reflect the fully qualified name as defined within the DataStage Stage.
The path must reflect the fully qualified file location as defined within the DataStage Stage.
Optional: Enter a Short or Long Description to describe and annotate the Data File asset.
5. Click Close to save the changes.

6. Optional: Click Repository | Delete to remove the selected Data File.

7. Optional: Select the Columns tab, to view the list and structure of the contained Data File Fields

IBM Confidential Information Page 10 of 13

IBM MetaData Workbench Enablement Series

Deleting Data Files - From the Metadata Asset Manager

Host Systems and Data Files may be removed from the IBM InfoSphere Metadata Asset Manager application.

Browse to the Metadata Asset Manager: http://ServerName:9080/ibm/imam/console, and logon to the

application with the appropriate credentials, which must include Common Metadata Administrator.
Select the Repository Management Tab.
Expand Browse Assets from the left navigation pane. Select Implemented Data Resources. A list of
Host Systems will display.
Select and expand a specific Host System to view its contained Data Files.
Select a Data File to view the Asset details.
Optional: Expand the Usage section of the Asset details, to view the dependency upon the Data File by
other components. Click Retrieve Usage to update the list of dependencies.

Select Delete from the toolbar menu item to remove the selected Asset. Click Yes to confirm the
removal of the selected Asset. Deletion of a Data File will additionally remove the contained Structure
and Fields.
Optional: Select More Actions from the toolbar menu to view the Asset within the IBM InfoSphere
Metadata Workbench.

IBM Confidential Information Page 11 of 13

IBM MetaData Workbench Enablement Series

Synchronizing Published Sequential Files

Introduction

As development and changes are made to Databases or Files and their structures, their will come a time where
those changes will need to be synchronized with existing Physical Data Sources previously imported into the
IBM Information Server. This synchronization should be seamless, by identifying current Information
Assets and any changed content.

Synchronization

Synchronization requires the re-import of the Physical Data Sources. Data that has changed, will be deleted
and imported, this will cause any alterations of the data, such as Definitions or Classification, to be lost. Data
that has remained the same will not be affected.

For example, changing a Field name will cause only the corresponding Field to be imported anew.

Upon re-importing a Sequential File from within DataStage, please keep the following in mind:

When re-importing the File, a user will be prompted that the Shared Data File, which has been
previously been published, will be disconnected.
After re-importing the File, the changed Table Definition must be re-published as a Shared Data
File.
When re-publishing the File, the identical Host and Data File previously associated with the File
should be selected.

IBM Confidential Information Page 12 of 13

IBM MetaData Workbench Enablement Series

Summary

It is good practice to import the data structures of all sources into the IBM Information Server. This allows
for a single point of reference for governance, development, definition and reporting. ETL Developers can
reference the same Data Source which has been classified within Business Glossary; enriching their
understanding, analyzed within Information Analyzer or depicted within a Data Lineage report from the
Metadata Workbench.

IBM Confidential Information Page 13 of 13

C Optimize Ds Job For Lineage
No ratings yet
C Optimize Ds Job For Lineage
7 pages
Data Stage
100% (2)
Data Stage
299 pages
IBM Infosphere Metadata Workbench v8 7 Tutorial
No ratings yet
IBM Infosphere Metadata Workbench v8 7 Tutorial
44 pages
DataStage Manager: Metadata Management Guide
No ratings yet
DataStage Manager: Metadata Management Guide
15 pages
Administrator's Guide Datastage
No ratings yet
Administrator's Guide Datastage
177 pages
Luncheon Webinar Series June 3Rd, 2010: Deep Dive - Metadata Workbench
No ratings yet
Luncheon Webinar Series June 3Rd, 2010: Deep Dive - Metadata Workbench
46 pages
DataStage Metadata Management
No ratings yet
DataStage Metadata Management
23 pages
Intersecting With Other Information Server Tools: Datastage Essentials V8.5
No ratings yet
Intersecting With Other Information Server Tools: Datastage Essentials V8.5
27 pages
DataStage vs Informatica: ETL Comparison
No ratings yet
DataStage vs Informatica: ETL Comparison
9 pages
Course
No ratings yet
Course
663 pages
DataStage Manager: Metadata Management Guide
No ratings yet
DataStage Manager: Metadata Management Guide
13 pages
DataStage ETL Training Course Overview
100% (2)
DataStage ETL Training Course Overview
133 pages
Introduction To Datastage: Ibm Infosphere Datastage V11.5
No ratings yet
Introduction To Datastage: Ibm Infosphere Datastage V11.5
23 pages
IBM InfoSphere DataStage Overview
No ratings yet
IBM InfoSphere DataStage Overview
68 pages
DataStage Basics for Data Warehousing
50% (2)
DataStage Basics for Data Warehousing
90 pages
Data Profiling with IBM Quality Stage
No ratings yet
Data Profiling with IBM Quality Stage
2 pages
Web Services Transformer
No ratings yet
Web Services Transformer
20 pages
Streams - Datastage Integration
No ratings yet
Streams - Datastage Integration
19 pages
DataMasking Using DataStage
No ratings yet
DataMasking Using DataStage
60 pages
DataStage PPT
No ratings yet
DataStage PPT
94 pages
QS Essentials
No ratings yet
QS Essentials
327 pages
InfoSphereDataStageEssentials PDF
No ratings yet
InfoSphereDataStageEssentials PDF
110 pages
Datastage Interview
100% (1)
Datastage Interview
161 pages
Chapter 02 - Application Server Files
No ratings yet
Chapter 02 - Application Server Files
19 pages
Informatica Data Integration Overview
No ratings yet
Informatica Data Integration Overview
16 pages
DataStage Job Performance Optimization Guide
No ratings yet
DataStage Job Performance Optimization Guide
74 pages
Datastage Best Practices
No ratings yet
Datastage Best Practices
29 pages
Quick Start Guide: IBM Information Server
No ratings yet
Quick Start Guide: IBM Information Server
4 pages
ABAP File Processing on Application Server
No ratings yet
ABAP File Processing on Application Server
19 pages
IBM InfoSphere Information Server Overview
No ratings yet
IBM InfoSphere Information Server Overview
72 pages
Importing Text File
No ratings yet
Importing Text File
19 pages
DataStage Tricks & Tips
No ratings yet
DataStage Tricks & Tips
41 pages
Introduction To ETL and DataStage
No ratings yet
Introduction To ETL and DataStage
48 pages
Data Stage Designer 8.5
100% (1)
Data Stage Designer 8.5
269 pages
Mastering Data Integration With Ibm Datastage
No ratings yet
Mastering Data Integration With Ibm Datastage
286 pages
DataStage 8.7
No ratings yet
DataStage 8.7
28 pages
Free DataStage Tutorials and Guides
0% (4)
Free DataStage Tutorials and Guides
3 pages
What'S New in Ibm Infosphere Information Server 8.7
No ratings yet
What'S New in Ibm Infosphere Information Server 8.7
28 pages
IBM DataStage Performance Tuning Guide
No ratings yet
IBM DataStage Performance Tuning Guide
9 pages
DataStage EE Overview and Management Guide
No ratings yet
DataStage EE Overview and Management Guide
88 pages
Data Quality with IBM Quality Stage
No ratings yet
Data Quality with IBM Quality Stage
9 pages
Datastage Developer Guide
No ratings yet
Datastage Developer Guide
362 pages
DataStage Course Overview and Objectives
No ratings yet
DataStage Course Overview and Objectives
3 pages
Datastage Enterprise Edition
No ratings yet
Datastage Enterprise Edition
374 pages
Ibm Datastage - Training Day1
No ratings yet
Ibm Datastage - Training Day1
77 pages
A-Introduction To ETL and DataStage
No ratings yet
A-Introduction To ETL and DataStage
48 pages
DataStage Administrator Guide and Management
No ratings yet
DataStage Administrator Guide and Management
20 pages
Understanding ETL with Informatica
No ratings yet
Understanding ETL with Informatica
5 pages
IBM Infosphere Information Analyzer v8 7 User Guide PDF
No ratings yet
IBM Infosphere Information Analyzer v8 7 User Guide PDF
483 pages
Purpose of Database System
No ratings yet
Purpose of Database System
13 pages
Database Administrator File MCA Semester 3
100% (1)
Database Administrator File MCA Semester 3
70 pages
NPTEL Cloud Computing Week 3 Answers
No ratings yet
NPTEL Cloud Computing Week 3 Answers
4 pages
FADA Academy - Excellence in Excel Ver2
No ratings yet
FADA Academy - Excellence in Excel Ver2
8 pages
Transact-SQL Querying Basics Lab
No ratings yet
Transact-SQL Querying Basics Lab
12 pages
Chapter - 3 TRANSACTION PROCESSING
No ratings yet
Chapter - 3 TRANSACTION PROCESSING
51 pages
Basic Application Software
No ratings yet
Basic Application Software
3 pages
Database Management Key Concepts Explained
No ratings yet
Database Management Key Concepts Explained
4 pages
EXAM TEST 1z0-150-22
No ratings yet
EXAM TEST 1z0-150-22
3 pages
Database System Abstraction Levels Explained
No ratings yet
Database System Abstraction Levels Explained
9 pages
Sas Base & Advance
No ratings yet
Sas Base & Advance
4 pages
DA NayanHore
No ratings yet
DA NayanHore
1 page
DOS Commands Exercise for IT Students
No ratings yet
DOS Commands Exercise for IT Students
10 pages
Database Management System & SQL 2 Mark Questions
No ratings yet
Database Management System & SQL 2 Mark Questions
28 pages
? - Databricks Data Engineer Associate Exam - Reference
No ratings yet
? - Databricks Data Engineer Associate Exam - Reference
33 pages
Sohail Resume-1
No ratings yet
Sohail Resume-1
1 page
SQL Basics for Beginners
No ratings yet
SQL Basics for Beginners
23 pages
OLTP vs OLAP: Key Differences Explained
No ratings yet
OLTP vs OLAP: Key Differences Explained
33 pages
ODS for Real-Time Reporting
No ratings yet
ODS for Real-Time Reporting
3 pages
JDBC Interview Questions and Answers
No ratings yet
JDBC Interview Questions and Answers
63 pages
Batch Apex in Salesforce
No ratings yet
Batch Apex in Salesforce
3 pages
Data Structures & Algorithms for IR
No ratings yet
Data Structures & Algorithms for IR
34 pages
Map-Reduce Algorithms for Data Analysis
0% (1)
Map-Reduce Algorithms for Data Analysis
2 pages
Advanced Database Management System-Mcq
No ratings yet
Advanced Database Management System-Mcq
8 pages
DBMS Module 1
No ratings yet
DBMS Module 1
40 pages
OWL Ontology Management Platform
No ratings yet
OWL Ontology Management Platform
1 page
Romi Gupta Data Analyst
No ratings yet
Romi Gupta Data Analyst
1 page
Importance of Database Systems
No ratings yet
Importance of Database Systems
26 pages
Sem 7 - COMP - BDA
No ratings yet
Sem 7 - COMP - BDA
16 pages
Effective Customer Query Resolution Techniques
No ratings yet
Effective Customer Query Resolution Techniques
63 pages

Importing Sequential Files in IBM Server

Uploaded by

Importing Sequential Files in IBM Server

Uploaded by

IMPORTING SEQUENTIAL FILES

INFORMATION SERVER V8.7

Prepared by: March Haber, march@il.ibm.com

IBM Confidential Information Page 2 of 13

For users of IBM DataStage and QualityStage:

IBM Confidential Information Page 3 of 13

Working with Sequential Files

File Name: #FileDir#/DataFile.txt

IBM Confidential Information Page 4 of 13

Data File A user-defined structure defined during import

Metadata Workbench Data Lineage Analysis

Note the example below the defined file is evaluated as such:

InputData (EWS_ProductionSourceStaging) connected to Datafile:

IBM Confidential Information Page 5 of 13

Importing Sequential Files From the DataStage and QualityStage Designer

Launch the DataStage and QualityStage client.

o Choose the appropriate delimiters and options for the File.

IBM Confidential Information Page 6 of 13

IBM Confidential Information Page 7 of 13

Launch the DataStage and QualityStage client.

IBM Confidential Information Page 8 of 13

o Select the Table Definition, click Next.

o Click Next to proceed.

IBM Confidential Information Page 9 of 13

1. Launch the DataStage and QualityStage client.

6. Optional: Click Repository | Delete to remove the selected Data File.

IBM Confidential Information Page 10 of 13

Deleting Data Files - From the Metadata Asset Manager

Browse to the Metadata Asset Manager: http://ServerName:9080/ibm/imam/console, and logon to the

IBM Confidential Information Page 11 of 13

Synchronizing Published Sequential Files

IBM Confidential Information Page 12 of 13

IBM Confidential Information Page 13 of 13

You might also like