Professional Documents
Culture Documents
Introduction To The Penn State Data Warehouse
Introduction To The Penn State Data Warehouse
Transactions processed on the mainframe -ISIS, IBIS, ADIS Many years of historical data, millions of mainframe records
Accessing mainframe data requires knowledge and skills in Natural programming language
Complete form. Specify purpose of request, what data is required. Signed by Access and Security Representative (ASR). Must be approved by all data stewards (may be several).
forms to get access to files Write a Natural program Write the JCL
Use Roscoe system to write Job Control Language.
forms to get access to files Write a Natural program Write the JCL Submit the job
Goes into a queue with other similar jobs: first in, first out.
forms to get access to files Write a Natural program Write the JCL Submit the job Wait at least overnight
Jobs in queue run only at night. Queues can be quite long. Some jobs will wait several days in the queue until they reach the top.
forms to get access to files Write a Natural program Write the JCL Submit the job Wait at least overnight Job failed? Repeat steps
Check the status. If anything failed, fix problems and repeat the steps.
Security forms to get access to files Write a Natural program Write the JCL Submit the job Wait at least overnight Job failed? Repeat steps Days or weeks to complete
Experienced Natural programmer writing a simple job will take several days of coding, testing, running. Anything complicated can take longer.
ADVANTAGES: No programming required Easy to use Available to everyone, even from terminals Eliminated a lot of ad-hoc programming
The
consolidation of data from mainframe legacy systems into subjectoriented tables that are accessible through desktop tools
the data from the mainframe to a server where it can be accessed from Data extracted periodically. the users PC Changes on the mainframe
may not be reflected on the warehouse for a week or more.
Snapshot
Transcripts are on the warehouse, but official transcripts are only available through ISIS.
Use
EIS
summary data
Enterprise Information System extracts data from the warehouse and summarizes it Data Warehouse extracts detail data from the mainframe on a periodic schedule
Data Warehouse
detail data extracted periodically
Data Transformation
Data goes through a series of steps as it is moved to the warehouse: Extract programs
Write Natural programs to extract data from the mainframe data base
Data Transformation
Data goes through a series of steps as it is moved to the warehouse: Extract programs Verify data
Verify accuracy and consistency of data -- ensure data legibility
Data Transformation
Data goes through a series of steps as it is moved to the warehouse: Extract programs Verify data Create tables
Create normalized tables on the warehouse -eliminate data redundancy (i.e. address appears in one place only)
Data Transformation
Data goes through a series of steps as it is moved to the warehouse: Extract programs Verify data Create tables Load tables
Load warehouse tables with extracted data
Data Transformation
Data goes through a series of steps as it is moved to the warehouse: Extract programs Verify data Create tables Load tables Refresh data
Establish a schedule to refresh the data. Frequency depends on volatility of the data. Some refreshed weekly, some once per semester
End Users
Data Warehouse
Query criteria can be easily and quickly adjusted to modify results or obtain additional data
Data can be exported to other desktop software for formatting and analysis
Amount of data retrieved limited only by query tool and size of PC hard drive
Microsoft
If you have Microsoft Office, you already have three query tools on your desktop that will work with Penn States data warehouse.
Microsoft
This understanding is critical and not simple to acquire. If youve used AIDA or the mainframe systems, you already have some of this knowledge. Takes time and training.
Two
Understand
Backbone with TCP/IP communications software SQL Client License or MS BackOffice license (Purchased from the Microcomputer Order Center)
For Windows, SQL Client Tools (provided by the MOC when the SQL Client License is purchased) must be installed on the PC. Not required for Macintosh.
Open Data Base Connectivity (ODBC) compliant query tool
Request access for each database separately. Requests submitted via e-mail. Instructions on the web site.
authorized as a warehouse user Set up your workstation Select a query tool Get training
Training offered through HRDC and TLT: hands-on and web-based
authorized as a warehouse user Set up your workstation Select a query tool Get training Learn the data
Work with it; documentation on the web site; help available from steward offices.
authorized as a warehouse user Set up your workstation Select a query tool Get training Learn the data Get help
User groups, listservs, web site
Mainframe can be used for transaction processing. Programmers can use skills to enhance the mainframe systems instead of writing ad-hoc reports.
Data can be retrieved with limitless combinations of criteria and can be exported into other desktop tools for analysis and manipulation.
Mainframe