0% found this document useful (0 votes)

173 views24 pages

Data Pump

The document discusses Oracle Database 10g's Data Pump utilities for exporting and importing data and metadata. Data Pump provides high-performance loading and unloading through parallelism, direct path loading, checkpoint/restart capabilities, and fine-grained object selection. Performance tests showed Data Pump Export taking half the time and Data Pump Import taking one-tenth to one-fortieth the time of the original Export and Import utilities.

Uploaded by

Baba Ib

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

173 views24 pages

Data Pump

Uploaded by

Baba Ib

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

Session id: 40043

Data Pump in Oracle Database 10g : Foundation for Ultra-High Speed Data Movement Utilities George H. Claborn
Data Pump Technical Project Leader

Oracle Corporation New England Development Center

Data Pump: Overview

What is it? Main Features Architecture Performance Things to keep in mind Some thoughts on original exp / imp

Data Pump: What is it?

Server-based facility for high performance loading and unloading of data and metadata Callable: DBMS_DATAPUMP. Internally uses DBMS_METADATA Data written in Direct Path stream format. Metadata written as XML New clients expdp and impdp: Supersets of original exp / imp. Foundation for Streams, Logical Standby, Grid, Transportable Tablespaces and Data Mining initial instantiation.

Features: Performance!!
Automatic, two-level parallelism

Direct Path for inter-partition parallelism External Tables for intra-partition parallelism Simple: parallel=<number of active threads> Dynamic: Workers can be added and removed from a running job in Enterprise Edition Index builds automatically parallelized up to degree of job

Simultaneous data and metadata unload Single thread of data unload: 1.5-2X exp Single thread of data load: 15X-40X imp With index builds: 4-10X imp

Features: Checkpoint / Restart

Job progress recorded in a Master Table May be explicitly stopped and restarted later:

Stop after current item finishes or stop immediate

Abnormally terminated job is also restartable Current objects can be skipped on restart if problematic

Features: Network Mode

Network import: Load one database directly from another Network export: Unload a remote database to a local dumpfile set

Allows export of read-only databases

Data Pump runs locally, Metadata API runs remotely. Uses DB links / listener service names, not pipes. Data is moved as insert into <local table> select from <remote table>@service_name Direct path engine is used on both ends Its easy to swamp network bandwidth: Be careful!

Features: Fine-Grained Object Selection

All object types are supported for both operations: export and import Exclude: Specified object types are excluded from the operation Include: Only the specified object types are included. E.g, just retrieve packages, functions and procedures More than one of each can be specified, but use of both is prohibited by new clients Both take an optional name filter for even finer granularity:

INCLUDE PACKAGE: LIKE PAYROLL% EXCLUDE TABLE: IN (FOO,BAR, )

Features: Monitoring
Flexible GET_STATUS call Per-worker status showing current object and percent done Initial job space estimate and overall percent done Job state and description Work-in-progress and errors

Features: Dump File Set Management

Directory based: E.g, DMPDIR:export01.dmp where DMPDIR created as:
SQL> create directory dmpdir as /data/dumps

Multiple, wildcarded file specifications supported:

dumpfile=dmp1dir:full1%u.dmp, dmp2dir:full2%u.dmp

Files are created as needed on a round-robin basis from available file specifications

File size can be limited for manageability Dump file set coherency automatically maintained

New Clients expdp / impdp

Similar (but not identical) look and feel to exp / imp All modes supported: full, schema, table, tablespace, transportable. Superset of exp / imp Flashback is supported Query supported by both expdp and impdp and on a per-table basis! Detach from and attach to running jobs Multiple clients per job allowed; but a single client can attach to only one job at a time If privileged, attach to and control other users jobs

New Clients expdp / impdp

Interactive mode entered via Ctl-C:

ADD_FILE: Add dump files and wildcard specs. to job PARALLEL: Dynamically add or remove workers STATUS: Get detailed per-worker status and change reporting interval STOP_JOB{=IMMEDIATE}: Stop job, leaving it restartable. Immediate doesnt wait for workers to finish current work items theyll be re-done at restart START_JOB: Restart a previously stopped job KILL_JOB: Stop job and delete all its resources (master table, dump files) leaving it unrestartable CONTINUE: Leave interactive mode, continue logging EXIT: Exit client, leave job running

Features: Other Cool Stuff

DDL transformations are easy with XML:

REMAP_SCHEMA REMAP_TABLESPACE REMAP_DATAFILE Segment and storage attributes can be suppressed

Can extract and load just data, just metadata or both SQLFILE operation generates executable DDL script If a table pre-exists at load time, you can: skip it (default), replace it, truncate then load or append to it. Space estimates based on allocated blocks (default) or statistics if available Enterprise Manager interface integrates 9i and 10g Callable!

Architecture: Block Diagram

expdp impdp
Enterprise Manager
Other Clients: Data Mining, etc

Data Pump
DBMS_DATAPUMP
Data/Metadata movement engine

Oracle_ Loader

Oracle_ DataPump

Direct Path API

Metadata API:
DBMS_METADATA

External Table API

Architecture: Flow Diagram

User A: expdp
Status queue: Work-in-progress and errors Master Control Process User As shadow process

Dump File Set:

Data, metadata & master table

Master table
Worker A: metadata

Worker B: Direct path Parallel Proc. 01

Log File

User B: OEM

User Bs shadow process

Worker C: Ext. Table

Dynamic commands Command and (stop, start, parallel etc) control queue

Parallel Proc. 02

No Clients Required !

Dump File Set:

Data, metadata & master table

Master table
Worker A: metadata

Master Control Process

Worker B: Direct path Parallel Proc. 01

Log File

Worker C: Ext. Table

Parallel Proc. 02

Data Pump: Performance Tuning

Default initialization parameters are fine!

Make sure disk_asynch_io remains TRUE

Spread the I/O! Parallel= no more than 2X number of CPUs: Do not exceed disk spindle capacity.

Corollary:

SPREAD THE I/O !!!

Sufficient SGA for AQ messaging and metadata API queries Sufficient rollback for long running queries

Thats it!

Large Internet Company

2 Fact Tables: 16.2M rows, 2 Gb
Program
exp out of the box: direct=y exp tuned: direct=y buffer=2M recordlength=64K expdp out of the box: Parallel=1

Elapsed
0 hr 10 min 40 sec 0 hr 04 min 08 sec 0 hr 03 min 12 sec

imp out of the box

imp tuned: buffer=2M recordlength=64K impdp out of the box: Parallel=1

2 hr 26 min 10 sec
2 hr 18 min 37 sec 0 hr 03 min 05 sec

With one index per table

imp tuned: buffer=2M recordlength=64K 2 hr 40 min 17 sec

impdp: Parallel=1

0 hr 25 min 10 sec

Oracle Applications Seed Database:

Metadata intensive: 392K objects, 200 schemas, 10K tables, 2.1 Gb of data total Original exp / imp total: 32 hrs 50 min

exp:

2 hr 13 min

imp:

30 hrs 37 min.

Data Pump expdp / impdp total:

15 hrs 40 min

expdp: 1 hr 55 min impdp: 13 hrs 45 min. Parallel=2 for both expdp and impdp

Keep in Mind:
Designed for *big* jobs with lots of data.

Metadata performance is about the same More complex infrastructure, longer startup

XML is bigger than DDL, but much more flexible Data format in dump files is ~15% more compact than exp Import subsetting is accomplished by pruning the Master Table

Original exp and imp

Original imp will be supported forever to allow loading of V5 V9i dump files Original exp will ship at least in 10g, but may not support all new functionality. 9i exp may be used for downgrades from 10g Original and Data Pump dump file formats are not compatible

10g Beta Feedback

British Telecom:
Ian Crocker, Performance & Storage Consultant
We have tested Oracle Data Pump, the new Oracle10g Export and Import Utilities. Data Pump Export performed twice as fast as Original Export, and Data Pump Import performed ten times faster than Original Import. The new manageability features should give us much greater flexibility in monitoring job status.

Airbus Deutschland:
Werner Kawollek, Operation Application Management
We have tested the Oracle Data Pump Export and Import utilities and are impressed by their rich functionality. First tests have shown tremendous performance gains in comparison to the original export and import utilities"

Please visit our Demo!

Oracle Database 10g Data Pump: Faster and Better Export / Import

Try out Data Pumps Tutorial in the Oracle By Example (OBE) area: Unloading and Loading Data Base Contents

Q U E S T I O N S A N S W E R S

Datapump in Oracle 10G
No ratings yet
Datapump in Oracle 10G
23 pages
Oracle 10g Data Pump Overview and Guide
No ratings yet
Oracle 10g Data Pump Overview and Guide
47 pages
OracleData Pump
No ratings yet
OracleData Pump
19 pages
Oracle Data Pump: Features & Benefits
No ratings yet
Oracle Data Pump: Features & Benefits
8 pages
Oracle Data Pump in Oracle Database 10g: Getting Started
No ratings yet
Oracle Data Pump in Oracle Database 10g: Getting Started
10 pages
Overview of Oracle Data Pump
No ratings yet
Overview of Oracle Data Pump
34 pages
Oracle Data Pump Export and Import
No ratings yet
Oracle Data Pump Export and Import
4 pages
Oracle DataPump Guide for DBAs
No ratings yet
Oracle DataPump Guide for DBAs
41 pages
DataPump PDF
No ratings yet
DataPump PDF
15 pages
Oracle Data Pump Overview and Features
No ratings yet
Oracle Data Pump Overview and Features
3 pages
Oracle Data Pump Overview & Interview Q&A - Brijesh Mehra
No ratings yet
Oracle Data Pump Overview & Interview Q&A - Brijesh Mehra
42 pages
Oracle 11g Data Pump Overview
No ratings yet
Oracle 11g Data Pump Overview
22 pages
Datapump Scenarios
No ratings yet
Datapump Scenarios
10 pages
Oracle 11G Datapump Overview-Part I
No ratings yet
Oracle 11G Datapump Overview-Part I
36 pages
DOAG2021 DataPumpDeepDive
No ratings yet
DOAG2021 DataPumpDeepDive
61 pages
Oracle Data Pump for DBAs
No ratings yet
Oracle Data Pump for DBAs
10 pages
8 Moving Data
No ratings yet
8 Moving Data
36 pages
Oracle
No ratings yet
Oracle
13 pages
Oracle 10g Data Pump Overview
No ratings yet
Oracle 10g Data Pump Overview
12 pages
Q04. How To Check The Datapump Import Jobs Are Running or Not ?
No ratings yet
Q04. How To Check The Datapump Import Jobs Are Running or Not ?
6 pages
Oracle Data Pump FAQ Guide
100% (1)
Oracle Data Pump FAQ Guide
5 pages
ORACLE-BASE - Oracle Data Pump (Expdp and Impdp) in Oracle Database 10g
100% (1)
ORACLE-BASE - Oracle Data Pump (Expdp and Impdp) in Oracle Database 10g
18 pages
Ora 10 GUtilities
No ratings yet
Ora 10 GUtilities
62 pages
07 Data Pump
No ratings yet
07 Data Pump
10 pages
Oracle Data Pump Utility Guide
No ratings yet
Oracle Data Pump Utility Guide
3 pages
LRN2959 Data Pump 23ai
No ratings yet
LRN2959 Data Pump 23ai
97 pages
Oracle Export/Import Options Explained
No ratings yet
Oracle Export/Import Options Explained
3 pages
Original Export and Import Versus Data Pump Export and Import
No ratings yet
Original Export and Import Versus Data Pump Export and Import
3 pages
Oracle Database 10g - DBA
100% (1)
Oracle Database 10g - DBA
98 pages
Oracle Exadata: Extreme Performance Insights
No ratings yet
Oracle Exadata: Extreme Performance Insights
45 pages
Less16 MovingData
No ratings yet
Less16 MovingData
25 pages
Oracle Data Pump
No ratings yet
Oracle Data Pump
14 pages
Oracle Export/Import & Data Pump FAQs
No ratings yet
Oracle Export/Import & Data Pump FAQs
3 pages
Oracle Data Pump Export/Import Guide
No ratings yet
Oracle Data Pump Export/Import Guide
17 pages
Oracle 10g Data Pump Overview
No ratings yet
Oracle 10g Data Pump Overview
50 pages
Exadata X8M
100% (1)
Exadata X8M
29 pages
DWH Tutorial
No ratings yet
DWH Tutorial
108 pages
8 Algorithms For Rubik's Cube
100% (3)
8 Algorithms For Rubik's Cube
7 pages
Rubikscubeinstructions Full
No ratings yet
Rubikscubeinstructions Full
6 pages
DWH Tutorial
No ratings yet
DWH Tutorial
108 pages
Lecture2 Oracle
No ratings yet
Lecture2 Oracle
20 pages