Q: Where's the T in your ETL tool?

tl;dr We believe that transformation is best done in your cloud data warehouse, so we focus on E and L.

A few years ago, the only data warehouses available were expensive, on-premises appliances, and it took weeks or months for organizations to add additional capacity. In that world, it made sense to do extract, transform, and load in that order. ETL tools that were built 10 or more years ago were set up to do as much prep work as possible, including transformation, prior to loading data into data warehouses. Today, however, cloud data warehouses like Amazon Redshift, Google BigQuery, and Snowflake can elastically scale up and down in seconds or minutes, so you can skip the preload transformations and dump all of your raw data into your data warehouse. You can then define transformations in SQL and run them in the data warehouse at query time.

You may also be interested in a blog post our CTO wrote on this subject.

Q: How does Stitch compare to [another ETL platform]?

Some of the most common reasons our customers choose Stitch are:

Q: Do you charge per integration, by volume of rows/events, or some other way?

Our self-serve plans are tiered by data volume. We have a Free plan for organizations that move fewer than 5 million rows or events per month. In all of our plans, you can use as many of our integrations as you like at no extra cost, although our premium integrations are available only to customers on one of our paid plans. If you're already using Stitch for one data source, we encourage you to add more.

Our Enterprise plans are custom-built based on the needs of your organization. If you’re interested in an enterprise-grade ETL platform for your mission-critical data, please contact our Sales team for more details.

Q: How quickly will data be available in my data warehouse?

Within a small number of minutes. If you need a data latency SLA, please contact our Sales team for more details.

Q: How does Stitch determine when to replicate my data?

You can specify the Replication Frequency on an integration-by-integration basis, which determines how often Stitch will attempt to extract data from an each data source.

Q: How secure is Stitch?

A: We take security very seriously – see our security page and, for more details, our security FAQ.

Q: I see you support a lot of integrations – what about [integration we don’t support yet]?

There are three paths for adding new integrations. If you need an integration for a new data source immediately, you can build integrations using the open source Singer framework, and they'll run in Stitch; check out the Singer Getting Started guide, and bring any questions you have to the Singer Slack group. You can also work with one of our implementation partners, which are experienced in building custom integrations for use with Stitch. Finally, we can include custom integration development and commercial support for community-developed integrations for Enterprise customers.

Q: What is Singer?

Singer is an open source platform that lets anyone write and collaborate on scripts that move data between databases, web APIs, file queues, and just about anything else you can think of. You can submit Singer integrations to our Product team for inclusion in Stitch; once accepted, you can use Stitch to run any integration written in the Singer format. By running a Singer integration on Stitch's platform you get auto-scaling, a secure infrastructure, credential management, monitoring, and alerting. Singer integrations can also be run on hardware that you manage.

Singer is made up of three parts:

All taps and targets can be mixed and matched, so changing the destination you're loading data into is easy. Since it's all open source, community members can leverage each other's improvements.

Check out the Singer Getting Started guide, and bring join the Singer Slack group to get help from the community and see what other people are working on.

Q: What happens to data in the pipeline if the data warehouse gets disconnected? Could I lose data, or wind up with duplicate data when the pipeline is reconnected?

Stitch is architected to prevent data loss or duplication. We buffer data once it's in the pipeline, so if a data warehouse gets disconnected, nothing will be lost as long as it's reconnected before the buffer expires. Most customers have a two-week buffer; Enterprise customers can define custom data retention policies and expiration intervals.

Q: What's the difference between your Certified and Community integrations?

Both Certified and Community integrations offer a number of benefits:

The key difference is that Stitch provides commercial support for Certified integrations but not for Community integrations – though support for Community integrations can be included in Enterprise contracts. Commercial support is a guarantee that the Stitch team will fix bugs and adapt to new versions of third-party APIs. Maintenance of Community integrations is handled by members of the Singer open source community.

Q: What do I need to do to get started using Stitch?

Sign up, add a data source, a destination, and you’re ready to go. We offer an unlimited 14-day free trial, so feel free to connect all of your systems to Stitch. Our Getting Started guide walks you through the process, and our Support team can help out with any bumps you hit along the way.