Spark
Spark step module
koheesio.spark.SparkStep
Base class for a Spark step
Extends the Step class with SparkSession support. The following applies:
- Spark steps are expected to return a Spark DataFrame as output.
- The spark property provides access to the active SparkSession instance.
- A SparkSession instance can also be passed to the constructor through the spark parameter.
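The three points above can be sketched in plain Python. This is a minimal illustration of the pattern only, not koheesio's implementation: StubSession, StubFrame, SketchStep and RangeStep are hypothetical stand-ins invented for this example, so that it runs without pyspark or koheesio installed. It also assumes that a session passed to the constructor takes precedence over the active one, which the description above does not state explicitly.

```python
from typing import List, Optional


class StubSession:
    """Hypothetical stand-in for a SparkSession (not pyspark, not koheesio)."""

    _active: Optional["StubSession"] = None

    def __init__(self, name: str) -> None:
        self.name = name

    def activate(self) -> "StubSession":
        # Mark this session as the "active" one, loosely mirroring how a
        # Spark application tracks its current session.
        StubSession._active = self
        return self

    @classmethod
    def get_active(cls) -> Optional["StubSession"]:
        return cls._active


class StubFrame:
    """Hypothetical stand-in for a Spark DataFrame."""

    def __init__(self, rows: List[int], session: Optional[StubSession]) -> None:
        self.rows = rows
        self.session = session


class SketchStep:
    """Illustrative base class following the three points listed above."""

    def __init__(self, spark: Optional[StubSession] = None) -> None:
        # The session can be supplied through the spark parameter.
        self._spark = spark

    @property
    def spark(self) -> Optional[StubSession]:
        # Assumption: an explicitly passed session wins; otherwise fall back
        # to whichever session is currently active.
        if self._spark is not None:
            return self._spark
        return StubSession.get_active()

    def execute(self) -> StubFrame:
        # Subclasses are expected to return a DataFrame-like object.
        raise NotImplementedError


class RangeStep(SketchStep):
    """Hypothetical concrete step that produces the rows 0..n-1."""

    def __init__(self, n: int, spark: Optional[StubSession] = None) -> None:
        super().__init__(spark=spark)
        self.n = n

    def execute(self) -> StubFrame:
        return StubFrame(rows=list(range(self.n)), session=self.spark)


default_session = StubSession("default").activate()
explicit_session = StubSession("explicit")

implicit_df = RangeStep(3).execute()
explicit_df = RangeStep(3, spark=explicit_session).execute()
```

Here implicit_df is built with the active session because no spark argument was given, while explicit_df uses the session passed to the constructor. With the real library, the stubs would be replaced by pyspark's SparkSession and DataFrame and the base class by SparkStep; check the koheesio documentation for the exact output and execution conventions.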