
Pentaho Data Integration Fundamentals

DI1000

COURSE DESCRIPTION

Course Description
This instructor-led course* introduces the Pentaho Data Integration (PDI) platform. It
covers the basic functions, explains the capabilities of PDI, and describes the best
practices for using it successfully. Course demonstrations, combined with practice, prepare
you to use PDI for real-world cases. You also benefit from practicing the concepts you
learn in a PDI development environment during the course.

Pentaho Data Integration prepares and blends data to create a complete picture of
your business that drives actionable insights. The complete data integration platform
delivers accurate, “analytics ready” data to end users from any source. With visual tools
to eliminate coding and complexity, Pentaho puts big data and all data sources at the
fingertips of business and IT users alike.

*Note: An instructor-led course has the advantage of being interactive, but a self-paced
online version, DI1000W, is also available, either by itself or as part of a Self-Paced Library.

Delivery Type
●● Instructor-led Training (ILT)
●● Virtual Instructor-led (vILT)

Duration
●● 3 days

Course Availability
●● Employees
●● Customers
●● Partners

Target Audience
●● ETL Developers
●● Data Analysts

Required Knowledge and Skills
●● Experience in ETL concepts is preferred

Prerequisites
●● None

Supplemental Courses
●● None

Course Objectives
When you complete this course, you should be able to:
●● Describe the Pentaho Data Integration (PDI) platform, its components, and their common uses
●● List the parts of transformations and describe how they execute
●● Create, preview, run, and troubleshoot a transformation using best practices and modular design principles
●● Read and write data to and from various file formats
●● Perform calculations, merges, and lookups
●● Use the PDI enterprise repository, scheduling, and monitoring capabilities
●● Log execution metrics to database tables

To register or for more information, go to:
Hitachi Vantara Learning Center (customers/partners)
Hitachi University (employees)

Join the Conversation
Ask questions and connect with other Hitachi Vantara customers, partners and employees
within the Hitachi Vantara Community.
community.HitachiVantara.com

Course Outline
Content Modules
●● Introduction to Pentaho Data Integration
• Objectives and Class Logistics
• Pentaho Platform and Architecture
●● Transformations
• Transformation Concepts
• Learning the PDI User Interface
• Creating and Running Transformations
• Introduction to Repositories
●● Reading and Writing Files
• Input and Output Steps
• PDI’s Home Directory
• Parameterization
●● Working with Databases
• Connecting to and Exploring a Database
• Table Input and Output Steps
• Insert / Update and Delete Steps
• Filtering and Sorting Data
• Variables and Unnamed Parameters in SQL
●● Data Flow and Lookups
• Data Movement and Step Copies
• Lookups and Merge
●● Calculations
• Grouping
• Calculation and Scripting Steps
●● Jobs Orchestration
• Introduction to Jobs
• Explore Common Job Entries
●● Exploring the Pentaho Repository
• The Pentaho Repository
●● Scheduling and Monitoring
• Setting up the Scheduler
• Monitoring Scheduled Tasks
●● Logging
• Introduction to Logging
• File-based Logging
• Logging Execution Metrics to Databases

All modules listed above include guided demonstrations and exercises in which students
practice the concepts, techniques, and features covered, using their HALO environment
(see below).
Note: Hitachi Automated Labs Online (HALO) is a 24/7, self-service portal that provides unlimited
access to Hitachi Vantara software technologies, and some third-party solutions, through a virtual,
hands-on lab environment.

Hitachi Vantara
Corporate Headquarters
2535 Augustine Drive
Santa Clara, CA 95054 USA
hitachivantara.com | community.hitachivantara.com

Contact Information
USA: 1-800-446-0744
Global: 1-858-547-4526
hitachivantara.com/contact

HITACHI is a registered trademark of Hitachi, Ltd. Hitachi Content Platform Anywhere, Live Insight, VSP, ShadowImage, TrueCopy and Hi-Track are trademarks or registered trademarks of
Hitachi Vantara Corporation. IBM and FlashCopy are trademarks or registered trademarks of International Business Machines Corporation. Microsoft, Azure and Windows are trademarks
or registered trademarks of Microsoft Corporation. All other trademarks, service marks and company names are properties of their respective owners.
DS-DI1000 ANT January 2020
