Pentaho Data Integration Beginner’s Guide
Pentaho Data Integration (PDI) is a suite of tools that are still commonly referred to by their original names from the "Kettle" project:
Pan : A command-line tool specifically for executing transformations.
Kitchen : A command-line tool used to execute jobs.
Carte : A lightweight web server that allows for remote execution and monitoring of transformations and jobs.
Key Concepts: Transformations vs. Jobs
For beginners, understanding the distinction between these two building blocks is critical. Transformations are executed with Pan, and jobs are executed with Kitchen.
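The split between the two command-line tools can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it assumes the Linux launcher scripts `pan.sh` and `kitchen.sh` are on the path, that transformations are saved as `.ktr` files and jobs as `.kjb` files, and the helper name `build_pdi_command` and the file names are hypothetical. The function only builds the argument list and does not start PDI.

```python
from typing import Dict, List, Optional


def build_pdi_command(
    path: str,
    params: Optional[Dict[str, str]] = None,
    level: str = "Basic",
) -> List[str]:
    """Build the argument list for running a PDI file from the command line.

    Hypothetical helper: it picks Pan for transformations (.ktr) and
    Kitchen for jobs (.kjb). It does not execute anything.
    """
    if path.endswith(".ktr"):
        launcher = "pan.sh"      # Pan executes transformations
    elif path.endswith(".kjb"):
        launcher = "kitchen.sh"  # Kitchen executes jobs
    else:
        raise ValueError(f"expected a .ktr or .kjb file, got: {path}")

    command = [launcher, f"-file={path}", f"-level={level}"]
    # Named parameters are passed as -param:NAME=VALUE
    for name, value in (params or {}).items():
        command.append(f"-param:{name}={value}")
    return command


if __name__ == "__main__":
    # File names below are placeholders, not files from this guide.
    print(" ".join(build_pdi_command("load_customers.ktr")))
    print(" ".join(build_pdi_command("nightly_load.kjb", {"RUN_DATE": "2024-01-01"})))
```

The resulting list can be handed to `subprocess.run` on a machine where PDI is installed. Keeping command construction separate from execution makes the choice between Pan and Kitchen easy to check without a PDI installation.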
Features include advanced data cleansing, filtering "junk" data, and handling slowly changing dimensions for data warehousing.
A common first exercise is to create a simple transformation that reads a file, applies a basic change (such as splitting a name field), and writes the result in a new format.
Spoon lets you preview the data at any step of a transformation, so the logic can be verified before a full run.
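To make the read, split, and write flow of a first transformation concrete, here is a minimal sketch in plain Python. It is a stand-in for the logic only, not PDI itself: in Spoon the same flow would typically be assembled from steps such as "CSV file input", "Split Fields", and "Text file output". The column names (`id`, `name`), the semicolon output delimiter, and the function names are assumptions made for the example. The `preview` helper loosely mirrors the idea of inspecting a few rows at a step before running everything.

```python
import csv
import io
import itertools
from typing import Dict, Iterable, Iterator, List


def split_name(rows: Iterable[Dict[str, str]], field: str = "name") -> Iterator[Dict[str, str]]:
    """Replace a full-name field with first_name and last_name fields."""
    for row in rows:
        out = dict(row)
        full = (out.pop(field, "") or "").strip()
        # Split on the first space; a single-word name gets an empty last_name
        first, _, last = full.partition(" ")
        out["first_name"] = first
        out["last_name"] = last.strip()
        yield out


def preview(rows: Iterable[Dict[str, str]], limit: int = 5) -> List[Dict[str, str]]:
    """Return at most `limit` rows so a step's output can be checked early."""
    return list(itertools.islice(rows, limit))


def run_transformation(source_text: str) -> str:
    """Read comma-separated input, split the name, write semicolon-separated output."""
    reader = csv.DictReader(io.StringIO(source_text))
    rows = list(split_name(reader))
    if not rows:
        return ""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(rows[0].keys()), delimiter=";", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    sample = "id,name\n1,Ada Lovelace\n2,Grace Hopper\n"  # made-up sample data
    print(preview(split_name(csv.DictReader(io.StringIO(sample))), limit=1))
    print(run_transformation(sample))
```

Each function plays the role of one step, and rows flow from one to the next, which is roughly how a transformation passes rows between steps.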