Why and how to replicate MySQL to your data warehouse

Do you use MySQL? To get the most value out of your data, you can use Stitch to replicate it to a data warehouse, where you can analyze it in conjunction with data from other sources using business intelligence (BI) software.

A data warehouse stores data in a columnar database. Each column of data is stored in one or more contiguous blocks, which makes the kinds of information you might want to analyze – pricing information or ad usage, for instance – much faster to retrieve.

MySQL is a traditional row-oriented database. Each row in each table is stored together within one or more contiguous storage blocks, which makes it an excellent platform for storing data that's characterized by frequent writes. But for analytical queries, you don't care about row-by-row values, but rather the information in certain columns for all rows. For instance, a retailer might want see how price affects sales, or to zero in on the referrers that send it the most traffic. Queries run against data in a data warehouse will complete much faster than analytics using a row-oriented database.

Want to get started replicating MySQL to your data warehouse so you can use it with your BI platform? Let's walk through the process. If you're not already using Stitch, sign up to replicate MySQL and start moving data to Amazon Redshift, Google BigQuery, Snowflake, or another data warehouse in less than 15 minutes.

Add MySQL as a Stitch data source

To set up a MySQL integration, follow the steps in our MySQL setup guide, then sign in to Stitch and connect to MySQL. Specify your credentials and connection details, and choose a replication schedule, following the advice in the documentation.

Once you're connected, choose the database you want to replicate. Stitch will display a list of tables available for replication. For each table, Stitch will ask you the replication method you want to use. It will also give you the chance to choose which columns in the table to replicate..

Once you finalize your selection the status may show as pending while Stitch schedules the initial replication job.

MySQL-ss-4a

We have troubleshooting guides that can help you if you run into an error message during the extraction phase or encounter destination issues. We also have detailed MySQL documentation to get you started. If you need additional help, click on the pink Support icon in the lower right of the app.

Hosted and compatible MySQL platforms

The procedure outlined here is the same for Amazon RDS for MySQL, Amazon Aurora (MySQL-compatible edition), and Google Cloud SQL for MySQL, and because MariaDB is a binary drop-in replacement for MySQL, the same process should work with MariaDB too.

Let the data flow

Stitch makes it easy to replicate MySQL data to any destination – and Stitch gives you free row replication for the first seven days of a new integration, so go and replicate your MySQL data, or any of the more than 80 data sources we support.

MySQL-ss-5