Welcome to Scribd, the world's digital library. Read, publish, and share books and documents. See more
Buy Now $50.00
Standard view
Full view
of .
Look up keyword or section
Like this
20Activity

Table Of Contents

Getting Started with Pentaho
BI Developer Examples: Regional Sales - HTML
Directory Navigation
Command History
Using Symbolic Links
Creating Symbolic Links in Ubuntu
Creating Symlinks in Windows Vista
Java Installation and Configuration
Installing Java on Ubuntu Linux
Installing Java on Windows
MySQL Installation
Installing MySQL Server and Client on Ubuntu
Installing MySQL Server and Client on Windows
MySQL GUI Tools
Ubuntu Install
Windows Install
Database Tools
Power*Architect and Other Design Tools
Squirrel SQL Client
SQLeonardo
Server Configuration
Installation
Installation Directory
User Account
Configuring Tomcat
Automatic Startup
Managing Database Drivers
Driver Location for the Server
Driver Location for the Administration Console
System Databases
Setting Up the MySQL Schemas
Configuring Quartz and Hibernate
Configuring JDBC Security
Sample Data
Modify the Pentaho Startup Scripts
E-mail
Basic SMTP Configuration
Secure SMTP Configuration
Testing E-mail Configuration
Publisher Password
Administrative Tasks
The Pentaho Administration Console
Basic PAC Configuration
Starting and Stopping PAC
The PAC Front End
Configuring PAC Security and Credentials
User Management
Data Sources
Other Administrative Tasks
Pentaho BI Stack Perspectives
Functionality
Server, Web Client, and Desktop Programs
Front-Ends and Back-Ends
Underlying Technology
The Pentaho Business Intelligence Server
The platform
The Platform
The Solution Repository and the Solution Engine
Database Connection Pool Management
User Authentication and Authorization
Task Scheduling
E-mail Services
BI Components
Ad hoc Reporting Service
The ETL Engine
Reporting Engines
The OLAP Engine
The Data Mining Engine
The Presentation Layer
Underlying Java Servlet Technology
Desktop Programs
Pentaho Enterprise Edition and Community Edition
Pentaho Design Studio (Eclipse) Primer
The Action Sequence Editor
Anatomy of an Action Sequence
Inputs
Outputs
Actions
World Class Movies: The Basics
The WCM Data
Obtaining and Generating Data
WCM Database: The Big Picture
DVD Catalog
Customers
Employees
Purchase Orders
Customer Orders and Promotions
Inventory Management
Why Do You Need a Data Warehouse?
The Big Debate: Inmon Versus Kimball
Data Warehouse Architecture
The Staging Area
The Central Data Warehouse
Data Marts
OLAP Cubes
Storage Formats and MDX
Data Warehouse Challenges
Data Quality
Data Vault and Data Quality
Using Reference and Master Data
Data Volume and Performance
Changed Data Capture
Source Data-Based CDC
Trigger-Based CDC
Snapshot-Based CDC
Log-Based CDC
Which CDC Alternative Should You Choose?
Changing User Requirements
Data Warehouse Trends
Virtual Data Warehousing
Real-Time Data Warehousing
Analytical Databases
Data Warehouse Appliances
On Demand Data Warehousing
What Is a Star Schema?
Dimension Tables and Fact Tables
Fact Table Types
Querying Star Schemas
Join Types
Applying Restrictions in a Query
Combining Multiple Restrictions
Restricting Aggregate Results
Ordering Data
The Bus Architecture
Design Principles
Using Surrogate Keys
Naming and Type Conventions
Granularity and Aggregation
Audit Columns
Modeling Date and Time
Time Dimension Granularity
Local Versus UTC Time
Smart Date Keys
Handling Relative Time
Monster Dimensions
Role-Playing Dimensions
Multi-Valued Dimensions and Bridge Tables
Building Hierarchies
Snowflakes and Clustering Dimensions
Outriggers
Consolidating Multi-Grain Tables
Working with Regular Expressions
Profiling and Exploring Results
Validating and Comparing Data
Developing the Model
Data Modeling with Power*Architect
Building the WCM Data Marts
Generating the Database
Generating Static Dimensions
Special Date Fields and Calculations
Source to Target Mapping
Data Integration Overview
Data Integration Activities
Extraction
Change Data Capture
Data Staging
Data Validation
Data Cleansing
Decoding and Renaming
Key Management
Aggregation
Dimension and Bridge Table Maintenance
Loading Fact Tables
Tools and Utilities
The Data Integration Engine
Repository
Jobs and Transformations
Plug-in Architecture
Getting Started with Spoon
Launching the Spoon Application
A Simple ‘‘Hello, World!’’ Example
Building the Transformation
Running the Transformation
The Execution Results Pane
The Output
Checking Consistency and Dependencies
Logical Consistency
Resource Dependencies
Verifying the Transformation
Working with Database Connections
JDBC and ODBC Connectivity
Creating a Database Connection
Testing Database Connections
How Database Connections Are Used
A Database-Enabled ‘‘Hello, World!’’ Example
Generic Database Connections
Generating Dimension Table Data
Using Stored Procedures
Loading a Simple Date Dimension
Days Sequence: The Add Sequence Step
Calculate and Format Dates: The Calculator Step
The Value Mapper Step
Load dim_date: The Table Output Step
More Advanced Date Dimension Features
ISO Week and Year
Current and Last Year Indicators
Internationalization and Locale Support
Loading a Simple Time Dimension
Calculate Time: Again, the Calculator Step
Loading the Demography Dimension
Generating Age and Income Groups
Multiple Incoming and Outgoing Streams
Loading Data from Source Systems
Staging Lookup Values
The stage_lookup_data Job
The START Job Entry
Transformation Job Entries
Mail Success and Mail Failure
The stage_lookup_data Transformation
The Filter rows Step
Create Staging Table: Executing Dynamic SQL
The Dummy Step
The Stream Lookup Step
Sort on Lookup Type: The Sort Rows Step
The Promotion Dimension
Promotion Mappings
Promotion Data Changes
Synchronization Frequency
The load_dim_promotion Job
The extract_promotion Transformation
Determining Promotion Data Changes
Saving the Extract and Passing on the File Name
Picking Up the File and Loading the Extract
Configuration Management
Using Variables
Variables in Configuration Properties
User-Defined Variables
Built-in Variables
More About the Set Variables Step
Set Variables Step Gotchas
Using JNDI Connections
What Is JNDI?
Creating a JNDI Connection
JNDI Connections and Deployment
Working with the PDI Repository
Creating a PDI Repository
Connecting to the Repository
The Repository Explorer
How PDI Keeps Track of Repositories
Upgrading an Existing Repository
Running in the Deployment Environment
Running from the Command Line
Command-Line Parameters
Running Jobs with Kitchen
Running Transformations with Pan
Using Custom Command-line Parameters
Using Obfuscated Database Passwords
Running Inside the Pentaho BI Server
Transformations in Action Sequences
Jobs in Action Sequences
The Pentaho BI Server and the PDI Repository
Remote Execution with Carte
Why Remote Execution?
Running Carte
Creating Slave Servers
Remotely Executing a Transformation or Job
Clustering
Metadata Overview
What Is Metadata?
The Advantages of the Metadata Layer
Adding Flexibility and Schema Independence
Refining Access Privileges
Handling Localization
Enforcing Consistent Formatting and Behavior
Scope and Usage of the Metadata Layer
The Logical Layer
The Delivery Layer
Deploying and Using Metadata
Exporting and Importing XMI files
Publishing the Metadata to the Server
Refreshing the Metadata
Reporting Architecture
Web-Based Reporting
Practical Uses of WAQR
Pentaho Report Designer
The PRD Screen
Report Structure
Report Elements
Creating Data Sets
Creating SQL Queries Using JDBC
Creating Metadata Queries
Example Data Set
Adding and Using Parameters
Layout and Formatting
Alternate Row Colors: Row Banding
Grouping and Summarizing Data
Adding and Modifying Groups
Using Functions
Using Formulas
Adding Charts and Graphs
Adding a Bar Chart
Pie Charts
Working with Images
Working with Subreports
Passing Parameter Values to Subreports
Publishing and Exporting Reports
Exporting Reports
Scheduling
Scheduler Concepts
Public and Private Schedules
Content Repository
Creating a New Schedule
Running Schedules
Suspending and Resuming Schedules
Deleting Schedules
Add Job
Suspend Job, Resume Job, and Delete Job
Other Scheduler Process Actions
Scheduler Alternatives
UNIX-Based Systems: Cron
Windows: The at Utility and the Task Scheduler
Background Execution and Subscription
How Background Execution Works
How Subscription Works
Allowing Users to Subscribe
Granting Execute and Schedule Privileges
The Actual Subscription
The User’s Workspace
Viewing the Contents of the Workspace
The Public Schedules Pane
The Server Administrator’s Workspace
Cleaning Out the Workspace
Bursting
Implementation of Bursting in Pentaho
Bursting Example: Rental Reminder E-mails
Step 2: Looping Through the Customers
Step 4: Running the Reminder Report
Step 5: Sending the Report via E-mail
Other Bursting Implementations
Overview of Pentaho Analysis Services
Architecture
Schema
Schema Design Tools
Aggregate Tables
MDX Primer
Cubes, Dimensions, and Measures
The Cube Concept
Star Schema Analogy
Cube Visualization
Hierarchies, Levels, and Members
Hierarchies
Levels and Members
The All Level, All Member, and Default Member
Member Sets
Multiple Hierarchies
Cube Family Relationships
Relative Time Relationships
MDX Query Syntax
Basic MDX Query
Axes: ON ROWS and ON COLUMNS
Looking at a Part of the Data
Dimension on Only One Axis
More MDX Examples: a Simple Cube
The FILTER Function
The ORDER Function
Using TOPCOUNT and BOTTOMCOUNT
Using NON EMPTY
Working with Sets and the WITH Clause
Using Calculated Members
Creating Mondrian Schemas
Getting Started with Pentaho Schema Workbench
Downloading Mondrian
Installing Pentaho Schema Workbench
Starting Pentaho Schema Workbench
Establishing a Connection
JDBC Explorer
Using the Schema Editor
Creating a New Schema
Saving the Schema on Disk
Editing Object Attributes
Changing Edit Mode
Creating and Editing a Basic Schema
Basic Schema Editing Tasks
Creating cubes
Creating a Cube
Choosing a Fact Table
Adding Measures
Adding Dimensions
Adding Hierarchy Levels
Associating Cubes with Shared Dimensions
Adding the DVD and Customer Dimensions
XML Listing
Testing and Deployment
Using the MDX Query Tool
Publishing the Cube
Schema Design Topics We Didn’t Cover
Visualizing Mondrian Cubes with JPivot
Getting Started with the Analysis View
Using the JPivot Toolbar
Drilling
Drilling Flavors
Drill Member and Drill Position
Drill Replace
Drill Through
The OLAP Navigator
Controlling Placement of Dimensions on Axes
Slicing with the OLAP Navigator
Displaying Multiple Measures
Miscellaneous Features
MDX Query Pane
PDF and Excel Export
Chart
Aggregation Benefits
Extending Mondrian with Aggregate Tables
Pentaho Aggregate Designer
Alternative Solutions
Data Mining Primer
Data Mining Process
Data Mining Toolset
Classification
Association
Numeric Prediction (Regression)
Data Mining Algorithms
Training and Testing
Stratified Cross-Validation
The Weka Workbench
P. 1
Pentaho Solutions: Business Intelligence and Data Warehousing with Pentaho and MySQL

Pentaho Solutions: Business Intelligence and Data Warehousing with Pentaho and MySQL

Ratings: (0)|Views: 4,221|Likes:
Published by Wiley
Your all-in-one resource for using Pentaho with MySQL forBusiness Intelligence and Data Warehousing

Open-source Pentaho provides business intelligence (BI) and datawarehousing solutions at a fraction of the cost of proprietarysolutions. Now you can take advantage of Pentaho for your businessneeds with this practical guide written by two major participantsin the Pentaho community.

The book covers all components of the Pentaho BI Suite. You'lllearn to install, use, and maintain Pentaho-and find plenty ofbackground discussion that will bring you thoroughly up to speed onBI and Pentaho concepts.

Of all available open source BI products, Pentaho offers themost comprehensive toolset and is the fastest growing open sourceproduct suite Explains how to build and load a data warehouse with PentahoKettle for data integration/ETL, manually create JFree (pentahoreporting services) reports using direct SQL queries, and createMondrian (Pentaho analysis services) cubes and attach them to aJPivot cube browser Review deploying reports, cubes and metadata to the Pentahoplatform in order to distribute BI solutions to end-users Shows how to set up scheduling, subscription and automaticdistribution

The companion Web site provides complete source code examples,sample data, and links to related resources.

Your all-in-one resource for using Pentaho with MySQL forBusiness Intelligence and Data Warehousing

Open-source Pentaho provides business intelligence (BI) and datawarehousing solutions at a fraction of the cost of proprietarysolutions. Now you can take advantage of Pentaho for your businessneeds with this practical guide written by two major participantsin the Pentaho community.

The book covers all components of the Pentaho BI Suite. You'lllearn to install, use, and maintain Pentaho-and find plenty ofbackground discussion that will bring you thoroughly up to speed onBI and Pentaho concepts.

Of all available open source BI products, Pentaho offers themost comprehensive toolset and is the fastest growing open sourceproduct suite Explains how to build and load a data warehouse with PentahoKettle for data integration/ETL, manually create JFree (pentahoreporting services) reports using direct SQL queries, and createMondrian (Pentaho analysis services) cubes and attach them to aJPivot cube browser Review deploying reports, cubes and metadata to the Pentahoplatform in order to distribute BI solutions to end-users Shows how to set up scheduling, subscription and automaticdistribution

The companion Web site provides complete source code examples,sample data, and links to related resources.

More info:

Publish date: Sep 23, 2010
Added to Scribd: Feb 05, 2010
Copyright:Traditional Copyright: All rights reservedISBN:9780470572726
List Price: $50.00 Buy Now

Availability:

Read on Scribd mobile: iPhone, iPad and Android.
This book can be read on up to 6 mobile devices.
See more
See less

02/17/2015

651

9780470572726

$50.00

USD

You're Reading a Free Preview
Pages 14 to 57 are not shown in this preview.
You're Reading a Free Preview
Pages 71 to 213 are not shown in this preview.
You're Reading a Free Preview
Pages 227 to 236 are not shown in this preview.
You're Reading a Free Preview
Pages 250 to 397 are not shown in this preview.
You're Reading a Free Preview
Pages 411 to 651 are not shown in this preview.

Activity (20)

You've already reviewed this. Edit your review.
1 hundred reads
1 thousand reads
jaos liked this
benben08 liked this
maureliomsilva liked this
Notre1 liked this
mds_88 liked this
taxel liked this
vishalharle8919 liked this

You're Reading a Free Preview

Download
scribd