Manage your JDBC data in HDFS with Talend

JDBC (Java Database Connectivity) is a widely used Java API for connecting to a database, issuing queries and commands, and handling the datasets returned. Virtually all relational databases provide a way for other programs to connect with them using JDBC. Manage JDBC data in HDFS with Talend's suite of data integration tools.

Connecting to JDBC

To use JDBC to connect to a database, use the tELTInput component and enter the default table and schema names.

Learn more about connecting with JDBC

More about integrating JDBC data

Talend has detailed documentation on how to ETL your JDBC data for a better view of the business.

Connecting to HDFS

Apache HDFS (Hadoop Distributed File Systems) provides a software framework for distributed storage and processing of big data. In combination with tools such as MapReduce, Yarn, and other core modules, HDFS lets organizations build Apache Hadoop clusters of hundreds or thousands of nodes that can handle datasets of terabyte size. A robust ecosystem of other tools can take advantage of data stored in HDFS.

To connect to HDFS, use the Component tab of the tHDFSExist component. Enter the Hadoop distribution and version, the HDFS directory, and name of the file you want to use.

Learn more about connecting to HDFS

Work with your JDBC data

Aggregate table columns and filter Use JDBC to gather data from several input DB table schemas and implement a clause to filter the output using an SQL statement See how >

Get more from your JDBC data

Deliver data your organization can trust... Get started today.

Explore Talend's full suite of apps