Manage your PostgreSQL data in HDFS with Talend

PostgreSQL (a.k.a. Postgres) is a popular open source object-relational database system that runs on all major operating systems. It emphasizes ANSI SQL conformance, and provides the ability to define custom data types and add custom functions. It's known for its stability and its ability to handle high volumes of transactions. Manage PostgreSQL data in HDFS with Talend's suite of data integration tools.

Connecting to PostgreSQL

To get started, first decide what data in Postgres you want to work with, then log in to Talend Data Fabric and choose a Talend app from the drop-down list. Use the tPostgreSQLConnection component. Enter your username, password, and a JDBC URL that identifies your Postgres database.

Learn more about connecting to PostgeSQL

More info on JDBC URLs

More about integrating PostgreSQL data

Talend has detailed documentation on how to ETL your PostgreSQL data for a better view of the business.

Connecting to HDFS

Apache HDFS (Hadoop Distributed File Systems) provides a software framework for distributed storage and processing of big data. In combination with tools such as MapReduce, Yarn, and other core modules, HDFS lets organizations build Apache Hadoop clusters of hundreds or thousands of nodes that can handle datasets of terabyte size. A robust ecosystem of other tools can take advantage of data stored in HDFS.

To connect to HDFS, use the Component tab of the tHDFSExist component. Enter the Hadoop distribution and version, the HDFS directory, and name of the file you want to use.

Learn more about connecting to HDFS

Get more from your PostgreSQL data

Deliver data your organization can trust... Get started today.

Explore Talend's full suite of apps