Get in Touch

Course Outline

Advanced Transformation Building Blocks

  • Handling complex data types.
  • Managing fields, metadata, and dynamic structures.
  • Implementing reusable transformation patterns.

Parameters, Variables, and Job-Oriented Design

  • Understanding runtime variables and scoping rules.
  • Parameterizing transformations for flexibility.
  • Structuring parent-child job hierarchies.

Database Integration and Lookup Strategies

  • Utilizing advanced lookup steps.
  • Employing effective caching strategies.
  • Designing efficient join operations.

Working with Files, APIs, and External Systems

  • Processing JSON and XML formats.
  • Invoking REST and SOAP services.
  • Managing streaming and batch data loads.

Error Handling and Data Quality Techniques

  • Capturing and routing errors effectively.
  • Applying data validation patterns.
  • Implementing auditing and logging mechanisms.

Performance Tuning Essentials

  • Optimizing step design for efficiency.
  • Considering memory usage and threading configurations.
  • Identifying and resolving bottlenecks.

Introduction to Repository-Based Development

  • Leveraging the Pentaho repository.
  • Managing versions of transformations and jobs.
  • Adopting team collaboration best practices.

Deployment and Migration Practices

  • Promoting jobs across different environments.
  • Managing configurations effectively.
  • Following operational best practices.

Summary and Next Steps

Requirements

  • A foundational understanding of ETL concepts.
  • Prior experience working with Pentaho Data Integration.
  • Basic knowledge of data warehousing principles.

Target Audience

  • ETL developers.
  • Data engineers.
  • Technical professionals seeking to expand their PDI expertise.
 21 Hours

Number of participants


Price per participant

Testimonials (2)

Upcoming Courses

Related Categories