IBM InfoSphere DataStage and QualityStage

IBM® InfoSphere® DataStage® and QualityStage® provides a graphical framework that you use to design and run the jobs that transform and cleanse your data.

Depending on which products you have licensed, you can develop parallel jobs to transform and cleanse data and server jobs to transform data. Parallel and server jobs are run on the IBM InfoSphere Information Server engine. Mainframe jobs produce COBOL code that runs on a mainframe computer.

Note: Mainframe jobs are not supported in this version of IBM InfoSphere Information Server.

You design jobs in the IBM InfoSphere DataStage and QualityStage Designer client and run them in the IBM InfoSphere DataStage and QualityStage Director client. Jobs are organized into projects, and you can administer these projects by using the IBM InfoSphere DataStage and QualityStage Administrator client. You can deploy your job designs and job design collateral by using the InfoSphere Information Server Manager.

Information roadmap for InfoSphere DataStage
This document provides links to the information resources that are available for IBM InfoSphere DataStage.

Information roadmap for InfoSphere QualityStage
This document provides links to the information resources that are available for IBM InfoSphere QualityStage.

Alphabetical list of stages
This document lists the stages that are available in IBM InfoSphere Information Server, as included with the base installation or with add-on installations.

Overview of InfoSphere DataStage
IBM InfoSphere DataStage is a data integration tool for designing, developing, and running jobs that move and transform data.

Getting started
Use these tutorials to learn the basic skills that you need to develop parallel jobs that transform data and parallel jobs that cleanse data.

Designing jobs
You design IBM InfoSphere DataStage and QualityStage jobs by using the Designer client. The Designer client is like a workbench or a blank canvas, with a palette of tools that form the basic building blocks of a job: stages, links, and annotations.

Developing parallel jobs
You design parallel jobs to transform and to cleanse data. Parallel jobs consist of individual stages. Each stage describes a particular process, such as accessing a database or transforming data in some way. Parallel jobs bring the power of parallel processing to your data extraction and transformation applications.

Developing server jobs
Server jobs are compiled and run on the server engine. Such jobs connect to a data source, extract and transform data, and write data to a target database or file, such as a data warehouse.

Cleansing data with InfoSphere QualityStage jobs
The cleansing process can include, but is not limited to, eliminating redundant, obsolete, or inaccurate data. Clean data is a critical component for accurate information, reports, and analyses. Throughout your organization, people make business decisions based on data that is provided to them. By cleansing data, you provide high-quality data.

Deploying jobs and accessing version control
Use the InfoSphere Information Server Manager to move IBM InfoSphere DataStage and QualityStage objects between projects on the same engine or on different engines. You can also use the InfoSphere Information Server Manager to move objects from one domain to another.

Running jobs
You run your IBM InfoSphere DataStage and QualityStage jobs from the Director client.

Administering workload management
You can use the workload management queues to control the starting of parallel and server jobs.

Monitoring jobs
You can use the Operations Database and the Operations Console to better monitor the job runs, services, and system resources on several InfoSphere DataStage engines.

Administering projects
IBM InfoSphere DataStage and QualityStage jobs are organized in projects, along with associated design items. Different users can be granted access to different projects.

Reference
The reference topics provide more in-depth information about IBM InfoSphere DataStage and QualityStage. You can use these topics to help you fine-tune your jobs and to produce custom components to use in your jobs.

InfoSphere Information Server suite-wide glossary
This glossary contains terms and definitions for InfoSphere Information Server.