Welcome to Scribd, the world's digital library. Read, publish, and share books and documents. See more
Download
Standard view
Full view
of .
Look up keyword
Like this
14Activity
0 of .
Results for:
No results containing your search query
P. 1
Unstructured Data Transformation Overview

Unstructured Data Transformation Overview

Ratings: (0)|Views: 2,809|Likes:
Published by ypraju

More info:

Published by: ypraju on Dec 21, 2009
Copyright:Attribution Non-commercial

Availability:

Read on Scribd mobile: iPhone, iPad and Android.
download as DOC, PDF, TXT or read online from Scribd
See more
See less

03/28/2013

pdf

text

original

 
Unstructured Data Transformation Overview 
By PenchalaRaju.Yanamala
Transformation type:Active/PassiveConnectedThe Unstructured Data transformation is a transformation that processesunstructured and semi-structured file formats, such as messaging formats, HTMLpages and PDF documents. It
 
also transforms structured formats such asACORD, HIPAA, HL7, EDI-X12, EDIFACT, AFP, and SWIFT.The Unstructured Data transformation calls a Data Transformation service from aPowerCenter session. Data Transformation is the application that transforms theunstructured and semi-structured file formats. You can pass data from theUnstructured Data transformation to a Data Transformation service, transformthe data, and return the transformed data to the pipeline.Data Transformation has the following components:
Data Transformation Studio
. A visual editor to design and configuretransformation projects.
Data Transformation Service
. A Data Transformation project that is deployedto the Data Transformation Repository and is ready to run.
Data Transformation repository.
A directory that stores executable servicesthat you create in Data Transformation Studio. You can deploy projects todifferent repositories, such as repositories for test and production services.
Data Transformation Engine.
A processor that runs the services that youdeploy to the repository.When Data Transformation Engine runs a service, it writes the output data, or itreturns output data to the Integration Service. When Data Transformation Enginereturns output to the Integration Service, it returns XML data. You can configurethe Unstructured Data transformation to return the XML in an output port, or youcan configure output groups to return row data.
 
Configuring the Unstructured Data Option
The Unstructured Data transformation is installed with PowerCenter. DataTransformation has a separate installer. Install the Data Transformation Server and Client components after you install PowerCenter.To install the Unstructured Data option, complete the following steps:1. Install PowerCenter.2.Install Data Transformation. For information about installing DataTransformation, see the Data Transformation Administrator Guide.3. Configure the Data Transformation repository folder.
Configuring the Data Transformation Repository Directory
The Data Transformation repository contains executable Data Transformationservices. When you install Data Transformation, the installation creates thefollowing folder:<Data_Transformation_install_dir>\ServiceDBTo configure a different repository folder location, open Data TransformationConfiguration from the Windows Start menu. The repository location is in thefollowing path in the Data Transformation Configuration:CM Configuration > CM Repository > File System > Base PathIf Data Transformation Studio can access the remote file system, you canchange the Data Transformation repository to a remote location and deploy
 
services directly from Data Transformation Studio to the system that runs theIntegration Service. For more information about deploying services to remotemachines, see the Data Transformation Studio User Guide.Copy custom files from the Data Transformation autoInclude\user or theexternLibs\user directory to the autoInclude\user or externLibs\user directory onthe machine that runs the Integration Service. For more information about thesedirectories, see the Data Transformation Engine Developer Guide.
Data Transformation Service Types
When you create a project in Data Transformation Studio, you choose a DataTransformation service type to define the project. Data Transformation has thefollowing types of services that transform data:
Parser 
. Converts source documents to XML. The output of a parser is alwaysXML. The input can have any format, such as text, HTML, Word, PDF, or HL7.
Serializer 
. Converts an XML file to an output document of any format. Theoutput of a serializer can be any format, such as a text document, an HTMLdocument, or a PDF.
Mapper 
. Converts an XML source document to another XML structure or schema. A mapper processes the XML input similarly to a serializer. Itgenerates XML output similarly to a parser. The input and the output are fullystructured XML.
Transformer 
. Modifies the data in any format. Adds, removes, converts, or changes text. Use transformers with a parser, mapper, or serializer. You canalso run a transformer as stand-alone component.
Streamer.
Splits large input documents, such as multi-gigabyte data streams,into segments. The streamer processes documents that have multiplemessages or records in them, such as HIPAA or EDI files.For more information about creating projects with Data Transformation, seeGetting Started with Data Transformation.
Unstructured Data Transformation Components
The Unstructured Data transformation contains the following tabs:
Transformation.
Enter the name and description of the transformation. Thenaming convention for an Unstructured Data transformation isUD_TransformationName. You can also make the Unstructured Datatransformation reusable.
Properties
. Configure the Unstructured Data transformation general propertiessuch as IsPartitionable and Output is Repeatable.
UDT Settings
. Modify Unstructured Data transformation settings such as inputtype, output type, and service name.
UDT Ports
. Configure Unstructured Data transformation ports and attributes.
Relational Hierarchy.
Define a hierarchy of output groups and ports to enablethe Unstructured Data transformation to write rows to relational targets.
Properties Tab

Activity (14)

You've already reviewed this. Edit your review.
1 hundred reads
1 thousand reads
Evan Cutler liked this
Nagaraj Kulkarni liked this
amitbbs liked this
veerendrakbongu liked this
mandamurthy liked this
nirmalrajj liked this
natrajdreams liked this
sruti_2003 liked this

You're Reading a Free Preview

Download
scribd
/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->