Data Load in a Data Warehouse
The ultimate goal and the responsibility of the load manager is to get data loaded into the data warehouse database. There are a number of different methods for getting the data from the source into the data warehouse. Some of the more commonly used approaches are: loading from flat files, 3GL programs for extracting and loading data, gateway products for connecting different RDBMSs, and copy management tools.
Some of these tools, such as the gateway products and some of the third-party tools, are designed to select data directly from the source system itself. Others rely on the data being transferred from the source system as a file. If large amounts of data need to be loaded, then it is likely that you will need to use the RDBMS’s own loader software. These loader programs will sometimes have direct path load mechanisms, which bypass the SQL layer of the RDBMS and build whole database blocks directly. This will greatly improve the performance of the load.
Generally, the load performance can be further improved if there is.a no-logging option. This option switches off journaling for the load operation. Note, however, that if this option is used it needs to be taken into consideration in your backup recovery strategy. If no-logging is to be used you also need to design the load jobs to be re-startable.
iEntry 10th Anniversary
News
Articles