Manage your Apache Impala data in Databricks with Talend

Apache Impala is an open source massively parallel processing SQL query engine for data stored in Apache Hadoop. It allows data analysts to run low latency and high concurrency queries on Hadoop for business intelligence. Manage Apache Impala data in Databricks with Talend's suite of data integration tools.

Connecting to Apache Impala

To connect to Impala, use the tImpalaConnection component. Choose a property type (built-in or repository) and additional details such as the host, port, database, and username.

Learn more about connecting to Impala

More about integrating Apache Impala data

Talend has detailed documentation on how to ETL your Apache Impala data for a better view of the business.

Connecting to Databricks

Databricks is the leader in unified analytics and founded by the original creators of Apache Spark™.

The Talend tDBFSConnection module connects to DBFS (the Databricks Filesystem) system. DBFS components are designed for quick and straightforward data transferring with Databricks. For more sophisticated scenarios you can also use Spark Jobs with Databricks.

Get more from your Apache Impala data

Deliver data your organization can trust... Get started today.

Explore Talend's full suite of apps