Last month Stitch became part of Talend. Talend is a global open source big data and cloud integration software company whose mission is "to make your data better, more trustworthy, and more available to drive business value." That maps naturally to Stitch's mission "to inspire and empower data-driven people."
Talend offers a wide range of products to complement Stitch's frictionless SaaS ETL platform.
Free and open source
Talend's open source product line is founded on Talend Open Studio for Data Integration, which the company first released in 2006. It's a free, Eclipse-based graphical tool and Java code generator for creating ETL and ELT jobs using drag-and-drop building blocks. You can use Open Studio to map, aggregate, sort, enrich, and merge data from multiple sources to multiple destinations. It comes with connectors for databases, SaaS platforms, and data center applications. The code it generates can run locally, or you can deploy it to the Talend Administration Center to manage the job. Open Studio runs locally on either Microsoft Windows or macOS desktops.
There are several flavors of Open Studio:
- Open Studio for Big Data – offers features that support the Hadoop ecosystem. It can generate Spark or MapReduce code, and includes machine learning components to use in the pipeline.
- Open Studio for ESB – offers tools to help build service-oriented architectures (SOA) with orchestration.
- Open Studio for Data Quality – includes advanced data profiling features.
- Open Studio for MDM – provides model-driven user interface for profiling data.
- Open Studio for Data Preparation – offers tools to import, cleanse, enrich, combine, and export data.
Talend has a couple more open source applications in addition to Open Studio:
Data Preparation Free Desktop Edition is a free data prep tool to clean up data in minutes.
Data Streams Free Edition is an embedded code editor that lets data scientists, analysts, and engineers integrate streaming data and build pipelines powered by Apache Beam that run on AWS.
Talend Data Fabric
Talend's commercial offerings are aimed at enterprises with complex data integration and transformation needs. They fall under the umbrella of Talend Data Fabric, an end-to-end data integration and management platform across cloud and on-premises environments. It comprises products that support big data, machine learning, data quality, governance, and API development lifecycle support.
Talend Cloud is an integration platform as a service. It comes in three editions — Cloud Data Integration, Cloud Data Management Platform, and Cloud Real-Time Big Data Platform — and new users can get a 30-day free trial.
Talend Cloud lets customers build cloud data lakes and data warehouses for fast analytics. Talend Cloud offers:
- The ability to connect to on-premises databases, SaaS apps, AWS, Microsoft Azure, Google Cloud Platform, Snowflake, and more
- Native Spark and Hadoop support
- Built-in data quality tools
- Governance and self-service capabilities such as data catalog, data preparation, and data stewardship
- Software development lifecycle (SDLC) and multicloud support
Big Data Platform simplifies and automates big data integration with graphical tools and wizards that generate native code that works with Apache Hadoop, Apache Spark, Spark Streaming, and NoSQL databases. The company offers a free trial big data and machine learning sandbox.
Companies without big data can use Talend's Data Integration Platform.
Cloud API Services provides a unified environment for the full API development lifecycle, including design, test, documentation, implementation, and deployment.
Data Catalog is a tool for creating a central, governed, shareable catalog of data. Data Catalog automatically crawls, profiles, organizes, links, and enriches all of an organization's metadata.
Data Quality profiles, cleanses, and masks data, while monitoring data quality over time. It integrates with other Talend products and handles data deduplication, validation, and standardization, and enriches data with external sources for things like postal validation, business identification, and credit score information.
Data Preparation provides self-service tools for discovering, cleansing, and sharing data. In addition to the aforementioned Free Desktop Edition, there's the commercial Talend Data Preparation and Talend Cloud Data Preparation.
Master Data Management lets you create a "single version of the truth" for cloud, big data, and mobile applications.
So there you have it — Talend in a nutshell. But don't imagine that Stitch is going away. Talend teamed up with Stitch because they believe our zero-configuration ETL pipeline offers value that complements that of its other products. They plan to invest more in Stitch and grow our team. Sign up for Stitch and see what Talend liked so much they bought the company.