Teradata
Teradata reader.
koheesio.spark.readers.teradata.TeradataReader
Wrapper around JdbcReader for Teradata.
Notes
- Consider using a synthetic partitioning column for partitioned reads:
MOD(HASHBUCKET(HASHROW(<TABLE>.<COLUMN>)), <NUM_PARTITIONS>)
- The relevant JARs must be added to the Spark session manually; this class does not do that.
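The synthetic partitioning expression from the first note can be generated rather than typed by hand. The sketch below is illustrative only: the helper names, the `t` and `src` aliases, and the `part_id` column name are assumptions, not part of koheesio. It wraps the table in a subquery so that the synthetic column exists as a named column, then pairs it with the standard Spark JDBC partitioning options (`partitionColumn`, `lowerBound`, `upperBound`, `numPartitions`).

```python
from typing import Any, Dict


def synthetic_partition_expr(table: str, column: str, num_partitions: int) -> str:
    """Return MOD(HASHBUCKET(HASHROW(<table>.<column>)), <num_partitions>)."""
    if num_partitions < 1:
        raise ValueError("num_partitions must be a positive integer")
    return f"MOD(HASHBUCKET(HASHROW({table}.{column})), {num_partitions})"


def partitioned_subquery(table: str, column: str, num_partitions: int,
                         alias: str = "part_id") -> str:
    """Wrap the table in a subquery that exposes the synthetic column (hypothetical helper)."""
    # "t" is an assumed alias for the source table inside the subquery.
    expr = synthetic_partition_expr("t", column, num_partitions)
    return f"(SELECT t.*, {expr} AS {alias} FROM {table} t) AS src"


def partition_options(num_partitions: int, alias: str = "part_id") -> Dict[str, Any]:
    """Spark JDBC options that give each partition one value of the synthetic column."""
    return {
        "partitionColumn": alias,
        "lowerBound": 0,
        "upperBound": num_partitions,
        "numPartitions": num_partitions,
    }


# Hypothetical table and column names, for illustration only.
dbtable = partitioned_subquery("sales.orders", "order_id", 10)
options = partition_options(10)
```

The resulting `dbtable` string and `options` dictionary correspond to the `dbtable` and `options` parameters of this reader. Whether a given Teradata and Spark version accepts this exact subquery form should be checked against the target environment.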
See Also
- Refer to JdbcReader for the list of all available parameters.
- Refer to Teradata docs for the list of all available connection string parameters: https://teradata-docs.s3.amazonaws.com/doc/connectivity/jdbc/reference/current/jdbcug_chapter_2.html#BABJIHBJ
Example
This example depends on the Teradata terajdbc4 JAR, e.g. terajdbc4-17.20.00.15. Keep in mind that older versions of the terajdbc4 driver also require the tdgssconfig JAR.
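As a minimal sketch of how the required arguments fit together, the snippet below assembles a Teradata JDBC URL of the form `jdbc:teradata://<host>/<NAME>=<VALUE>,...` and collects the keyword arguments listed under Parameters. The helper `teradata_jdbc_url`, the host name, the database, the table and the credentials are all placeholders chosen for illustration; none of them come from koheesio.

```python
def teradata_jdbc_url(host: str, **params: str) -> str:
    """Build jdbc:teradata://<host>/<NAME>=<VALUE>,... (hypothetical helper)."""
    base = f"jdbc:teradata://{host}"
    if not params:
        return base
    # Connection parameters are comma-separated NAME=VALUE pairs after the slash.
    joined = ",".join(f"{name}={value}" for name, value in params.items())
    return f"{base}/{joined}"


# Placeholder values only; substitute real connection details.
reader_kwargs = {
    "url": teradata_jdbc_url("td.example.com", DATABASE="sales", CHARSET="UTF8"),
    "user": "YOUR_USERNAME",
    "password": "***",
    "dbtable": "sales.orders",
}
```

With koheesio installed and the driver JAR available to the Spark session, these arguments could be passed as `TeradataReader(**reader_kwargs)`.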
Parameters:
Name | Type | Description | Default |
---|---|---|---|
url | str | JDBC connection string. Refer to Teradata docs for the list of all available connection string parameters. | required |
user | str | Username | required |
password | SecretStr | Password | required |
dbtable | str | Database table name, including the schema name | required |
options | Optional[Dict[str, Any]] | Extra options to pass to the Teradata JDBC driver. Refer to Teradata docs for the list of all available connection string parameters. | {"fetchsize": 2000, "numPartitions": 10} |
query | Optional[str] | Query | None |
format | str | The type of format to load. Defaults to 'jdbc'. Should not be changed. | required |
driver | str | Driver name. Be aware that the driver jar needs to be passed to the task. Should not be changed. | required |