Manage your Greenplum data in Databricks with Talend

Greenplum uses massively parallel processing (MPP) architecture and a database based on PostgreSQL to deliver analytics on large datasets. Greenplum uses MPP to distribute the big data workload to data warehouses and maximize a system’s resources in parallel. The Greenplum Database software can be deployed on on-premises hardware or cloud servers, and parent company Dell EMC also sells it as part of a hardware bundle. Manage Greenplum data in Databricks with Talend's suite of data integration tools.

Connecting to Greenplum

To connect to a Greenplum database, use the tGreenplumConnection component. Choose a property type (built-in or repository) and additional details such as the host, port, database, schema, and username and password.

Learn more about connecting to Greenplum

More about integrating Greenplum data

Talend has detailed documentation on how to ETL your Greenplum data for a better view of the business.

Connecting to Databricks

Databricks is the leader in unified analytics and founded by the original creators of Apache Spark™.

The Talend tDBFSConnection module connects to DBFS (the Databricks Filesystem) system. DBFS components are designed for quick and straightforward data transferring with Databricks. For more sophisticated scenarios you can also use Spark Jobs with Databricks.

Get more from your Greenplum data

Deliver data your organization can trust... Get started today.

Explore Talend's full suite of apps