IBM Data Refinery on Cloud Pak for Data
This comprehensive course empowers you to master data integration and transformation pipelines. You’ll efficiently manage DataStage Flows, handle file and data processing stages, and seamlessly connect to cloud and database connectors. Learn to automate pipelines with Watson Pipelines, ensuring optimized performance and streamlined data workflows. By the end of this course, you’ll be equipped with the skills to handle parallel frameworks, multi-instance jobs, and advanced data transformations.
Audience:
This course is designed for Business Users, Business Analysts, Data Stewards, Governance Users, Data Quality Analysts, Data Scientists, Catalog Administrators, Metadata Administrators, Data Engineers, and Developers looking to enhance their data refinery, data governance and management skills with IBM Cloud Pak for Data.
Objectives: This is a 1-day instructor-led course, conducted over 5 hours per day.
Overview of Cloud Pak for Data and IBM Knowledge Catalog
- IBM Cloud Pak for Data
- IBM Knowledge Catalog
Workspaces in Cloud Pak for Data
- Category
- Catalog
- Project
- Collaborators for Workspaces
Data Refinery
- Adding Data into Project
- Profiling Data
- Refining Data
- Data Refinery Flow
- Working with Jobs