Oracle Tips and Techs: Datapump

Monday, May 21, 2012

Datapump - some tips

Data Pump is a utility for unloading/loading data and metadata into a set of operating system files called a dump file set. The dump file set can be imported only by the Data Pump Import utility. The dump file set can be imported on the same system or it can be moved to another system and loaded there.

In this post, let us see some tips and tricks that can done with Datapump.

Tip #1 : Using PARALLEL parameter

PARALLEL parameter is used to improve the speed of the export. But this will be more effective when you split the dumpfiles with DUMPFILE parameter across the filesystem.

Create 2 or 3 directories in different filesystems and use the commands effectively.

expdp / dumpfile=dir1:test_1.dmp, dir1:test_2.dmp, dir2:test_3.dmp, dir3:test_4.dmp logfile=dir1:test.log full=y parallel=4

where dir1, dir2 and dir3 are directory names created in the database.

Tip #2 : Using FILESIZE parameter

FILESIZE parameter is used to limit the dumpfile size. For eg., if you want to limit your dumpfiles to 5gb, you can issue command as below

expdp / directory=dir1 dumpfile=test1.dmp,test2.dmp,test3.dmp logfile=test.log filesize=5120m

expdp / directory=dir1 dumpfile=test_%U.dmp logfile=test.log filesize=5120m full=y

where %U will assign numbers automatically from 1 to 99.

Note: If you use %U, dumpfile number 100 can't be created and export fails with "dumpfile exhausted" error.

Update (23 July 2013): If you want to create more than 99 files, you can use this work around.

expdp / directory=dir1 dumpfile=test_%U.dmp dumpfile=test_1%U.dmp logfile=test.log filesize=5120m full=y

This will create files in a round robin method like below.

test_01.dmp

test_101.dmp

test_02.dmp

test_102.dmp

Tip #3 : Usage of VERSION parameter

VERSION parameter is used while taking export if you want to create a dumpfile which should be imported into a DB which is lower than the source DB.

For eg., if your source DB is 11g and target DB is 10g, you can't use the dumpfile taken from 11g expdp utility to import into 10g DB.

This throws the below error.

ORA-39142: incompatible version number 3.1 in dump file "/u02/dpump/test.dmp"

To overcome this we can use the VERSION parameter.

VERSION={COMPATIBLE | LATEST | version_string}
For example

expdp / directory=dir1 dumpfile=test_1.dmp logfile=test.log VERSION=10.2.0

Tip #4 : PARALLEL with single DUMPFILE

When you use PARALLEL parameter and use only one dumpfile to unload datas from the DB, you may get the below error.

expdp / directory=dir1 dumpfile=test_1.dmp logfile=test.log parallel=4

ORA-39095: Dump file space has been exhausted: Unable to allocate 8192 bytes 
Job "USER"."TABLE_UNLOAD" stopped due to fatal error at 00:37:29

Now a simple work around is to remove the PARALLEL parameter or add dumpfiles. This will over come the error.

expdp / directory=dir1 dumpfile=test_1.dmp logfile=test.log

expdp / directory=dir1 dumpfile=test_1.dmp,test_2.dmp,test_3.dmp, test_4.dmp logfile=test.log parallel=4

expdp / directory=dir1 dumpfile=test_%U.dmp logfile=test.log parallel=4

Tip #5 : Drop dba_datapump_job rows

Sometimes before the export completes or when the export encounters a resumable wait or you would have stopped the export job in between. Now you start the DataPump job that stopped. Then the dump file has been removed from the directory location. You are not able to attach to the job.

You will get an error like this.

ORA-39000: bad dump file specification
ORA-31640: unable to open dump file "/oracle/product/10.2.0/db_2/rdbms/log/test.dmp" for read
ORA-27037: unable to obtain file status
Linux Error: 2: No such file or directory

But you will see the row updated in view dba_datapump_jobs

SQL> select * from dba_datapump_jobs;
OWNER JOB_NAME                       OPERATI JOB_M STATE                    DEGREE ATTACHED_SESSIONS DATAPUMP_SESSIONS
----- ------------------------------ ------- ----- -------------------- ---------- ----------------- -----------------
SYS   SYS_EXPORT_FULL_01             EXPORT  FULL  NOT RUNNING                   0        ##########                 0

You are not able to remove the row from dba_datapump_jobs as you are not able to attach to the export job with expdp client to kill the job.

In this case you can remove the row by dropping the master table created by the datapump export.

SQL> drop table SYS_EXPORT_FULL_01 purge;

Table dropped.
SQL> select * from dba_datapump_jobs;
no rows selected

Now you can see the row is deleted from the dba_datapump_jobs view.

Tip #6 : FLASHBACK_SCN and FLASHBACK_TIME

Do not use FLASHBACK_SCN and FLASHBACK_TIME as these parameters slow down the performace of export.

Tip #7 : Effective EXCLUDE

Import of full database should be split as tables first and indexes next. Use the parameter exclude effectively to improve the speed of import.

EXCLUDE = INDEX,STATISTICS

This will not import the indexes and statistics which in turn only import the tables, hence improving the performance.

Tip #8 : INDEXFILE=<filename> usage

After the import of tables has been completed, you can create the indexes and collect statistics of the tables. To get the indexes creation ddl, you can use the INDEXFILE = <filename> parameter to get all the indexes creation statements which were involved in the import operation.

Example of effective import

impdp / directory=dir1,dir2,dir3 dumpfile=test_%U.dmp logfile=test.log EXCLUDE=STATISTICS Full=Y INDEXFILE=index_ddl.sql

The above will turn on the legacy mode import of datapump as the parameter indexfile is present instead of SQLFILE parameter.

Indexfile parameter is available in imp and sqlfile parameter with impdp. However you can use indexfile parameter in impdp which will turn on legacy mode import which is as below.

;;; Legacy Mode Active due to the following parameters:
;;; Legacy Mode Parameter: "indexfile=testindex.sql" Location: Command Line, Replaced with: "sqlfile=index_ddl.sql include=index"

Hence to extract only the indexes the statement should be as below.

impdp / directory=dir1,dir2,dir3 dumpfile=test_%U.dmp logfile=test.log EXCLUDE=STATISTICS Full=Y SQLFILE=index_ddl.sql INCLUDE=INDEX

Note: Tip #8 edited as per comment from Eric below.

Tip #9 : Contents of Dump file

If you are not sure about the schemas that were present in the dumpfile or tablespaces present inside the dumpfile, etc., you can easily check the dumpfile for those information using the below command

grep -a "CREATE USER" test_1.dmp
grep -a "CREATE TABLESPACE" test_1.dmp

-a is not a recognised flag in some OS and hence command works without the flag. Mind, the dumpfile created is a binary file.

The above command gives all the CREATE USER statements and CREATE TABLESPACE statements which will be useful in many cases. You can also get the INDEXES and TABLES creation ddl from the dumpfile as well.

Tip #10 : init.ora parameter cursor_sharing

Always set init.ora parameter cursor_sharing to exact which has a good effect on import's performance.

Tip #11 : STATUS parameter usage

You can check the on going datapump export/import operation with the use of STATUS parameter and track the progress by yourself. You can attach to a export/import session and check the status.

For example:

[oracle@ini8115l3aa2ba-136018207027 ~]$ expdp attach=SYS_EXPORT_FULL_01
Export: Release 11.2.0.1.0 - Production on Mon May 21 10:56:28 2012
Copyright (c) 1982, 2009, Oracle and/or its affiliates.  All rights reserved.
Username: sys as sysdba
Password:
Connected to: Oracle Database 11g Enterprise Edition Release 11.2.0.1.0 - Production
With the Partitioning, OLAP, Data Mining and Real Application Testing options
Job: SYS_EXPORT_FULL_01
  Owner: SYS
  Operation: EXPORT
  Creator Privs: TRUE
  GUID: C08622D4FB5571E4E04012881BCF4C92
  Start Time: Monday, 21 May, 2012 10:55:55
  Mode: FULL
  Instance: newdb
  Max Parallelism: 1
  EXPORT Job Parameters:
  Parameter Name      Parameter Value:
     CLIENT_COMMAND        sys/******** AS SYSDBA directory=dmpdir full=y
  State: EXECUTING
  Bytes Processed: 0
  Current Parallelism: 1
  Job Error Count: 0
  Dump File: /u02/dpump/expdat.dmp
    bytes written: 4,096
Worker 1 Status:
  Process Name: DW00
  State: EXECUTING
  Object Type: DATABASE_EXPORT/SCHEMA/TABLE/COMMENT
  Completed Objects: 400
  Worker Parallelism: 1
Export> status
Job: SYS_EXPORT_FULL_01
  Operation: EXPORT
  Mode: FULL
  State: COMPLETING
  Bytes Processed: 37,121
  Percent Done: 100
  Current Parallelism: 1
  Job Error Count: 0
  Dump File: /u02/dpump/expdat.dmp
    bytes written: 561,152
Worker 1 Status:
  Process Name: DW00
  State: WORK WAITING

Here you can see the bytes written which will be progressing and you can track the export/import job easily.

Note: The parameter ATTACH when used, it cannot be combined with any other parameter other than the USERID parameter.

$ expdp ATTACH= JOB_NAME

I’ll be updating the post whenever I come across things that can help improving the performance of datapump. J

Selvakumar Nagulan

A person with hard working ability, will power and passion to finish what he starts. An Oracle database administrator trying to explore every nook and corner possible in Oracle technology. A blogger, a story and poem writer at leisure. A guy who maintain at most patience even when the whole world rejects him.

22 comments:

nikolayivankinMay 21, 2012 at 11:33 AM
Nice article, but I would say, that dump file can be imported/exported not only by DataPump utility, but also with the help of DBMS_DATAPUMP package.
ReplyDelete
Replies
AnonymousMay 26, 2012 at 9:41 AM
Thanks Mamukutti for sharing these tips ... for tip#8 should that be INCLUDE instead of EXCLUDE=.

--Vinay
ReplyDelete
Replies
Mamu kuttiMay 28, 2012 at 8:06 PM
@Nikolayivankin Thank you for the comment. :)
ReplyDelete
Replies
Mamu kuttiMay 28, 2012 at 8:12 PM
@Vinay It should be EXCLUDE. If you want to speed up the import, you should not import indexes and statistics as they take significant amount of time. So you have to exclude from the dump while loading the datas.
ReplyDelete
Replies
DON!VDecember 17, 2012 at 1:56 AM
Really very good points. I really appreciate your post. Thanks
ReplyDelete
Replies
NandhuJanuary 11, 2013 at 1:51 PM
Thank u mamu kutty, its really helpfull... Expecting some more articles from u...
ReplyDelete
Replies
shamsJune 7, 2013 at 2:36 AM
A very good place you have given for Datapump users. Nice article
ReplyDelete
Replies
Palpandi AyyaswamyJuly 5, 2013 at 9:25 AM
hi
very nice tips mamu kutti.it is very very useful for dba.good helpful for dba presentation
ReplyDelete
Replies
Shanmugam Vellore ThyagarajanAugust 29, 2013 at 5:46 PM
Thanks MK for sharing tip#11 to monitor the percentage of datapump jobs, is there any query to monitor the percentage from db side?
ReplyDelete
Replies
Mamu kuttiOctober 19, 2013 at 1:30 AM
Hi Shanmugam,

Apologies for delayed reply.
You can check with the below query.

alter session set nls_date_format='dd/mm/yy hh24:mi:ss'
select sid, USERNAME, OPNAME, start_time, totalwork, sofar, (sofar/totalwork) * 100 pct_done from v$session_longops where totalwork > sofar and opname like '%EXPORT%';

Thank you!!
ReplyDelete
Replies
Eric ThomsonJune 23, 2014 at 7:26 AM
Hi ,
Just clarify about the indexfile in impdp - Is it available imp or impdp
What is the meaning of SQLFILE in impdp?
Your example seems to be confusing and misleading

Example of effective import
impdp / directory=dir1,dir2,dir3 dumpfile=test_%U.dmp logfile=test.log EXCLUDE=INDEX,STATISTICS Full=Y INDEXFILE=index_ddl.sql

Again why you need to use session long_ops instead of attach= and query the status through impdp utility itself

Manoharan
manoharansk@yahoo.com
ReplyDelete
Replies
Karthik SupremeJune 26, 2014 at 7:02 AM
Hello sir pls clear me this doubts
why we use directory and where the dumpfile will be stored ?
where the log file will be stored?
suppose if we export a table from one database to another database where the dumpfile will be stored?
should we name the dumpfile to export a table?
and also pls tell me whats the use of remap?
pls sir iam going to attend an interview surely they will ask this export import pls help out
thanks in advance.
ReplyDelete
Replies
AnonymousFebruary 27, 2017 at 10:50 PM
How can we easily split the sqlfile into multiple files so we can run them in parallel?
ReplyDelete
Replies
AnonymousNovember 25, 2022 at 10:17 AM
ORA-39097: Data Pump job encountered unexpected error -4031
ORA-39065: unexpected master process exception in MAIN
ORA-04031: unable to allocate 1160 bytes of shared memory ("shared pool","unknown object","sga heap(2,1)","KQR X PO")

Job "SYS"."SYS_EXPORT_FULL_08" stopped due to fatal error at 00:37:35
ReplyDelete
Replies

Add comment

Looking for something? Try here..

Monday, May 21, 2012

Datapump - some tips

22 comments:

Certified Specialist

Microsoft Certified: Azure Fundamentals

Certified Associate