Google Analytics integration summary

Stitch’s Google Analytics integration replicates data using the Google Analytics Reporting API v4. Refer to the Schema section for details about the data Stitch will replicate.

Note: A single Google Analytics integration is limited to 10 Metrics and 7 Dimensions. This is due to limits enforced by Google. Refer to Google’s documentation for more info.

Google Analytics feature snapshot

A high-level look at Stitch's Google Analytics (v14-09-2016) integration, including release status, useful links, and the features supported in Stitch.

STITCH
Release Status Released Supported By

Stitch

Stitch Plan

Free

DATA SELECTION
Table Selection

Unsupported

Column Selection

Unsupported

REPLICATION SETTINGS
Anchor Scheduling

Unsupported

Advanced Scheduling

Unsupported

Table-level Reset

Unsupported

Configurable Replication Methods

Unsupported

TRANSPARENCY
Extraction Logs

Unsupported

Loading Reports

Supported

Connecting Google Analytics

Google Analytics setup requirements

To set up Google Analytics in Stitch, you need:

  • At least Read & Analyze permissions in the account you want to connect to Stitch. See Google’s documentation for more info.

  • To have recent data in the account you want to connect to Stitch. Verify that there is data from the past 30 days in the account before continuing.

  • To pause all ad-blocking software. Because Google authentication uses pop ups, you may encounter issues if ad blockers aren’t disabled during the setup.

Step 1: Add Google Analytics as a Stitch data source

  1. Sign into your Stitch account.
  2. On the Stitch Dashboard page, click the Add Integration button.

  3. Click the Google Analytics icon.

  4. Enter a name for the integration. This is the name that will display on the Stitch Dashboard for the integration; it’ll also be used to create the schema in your destination.

    For example, the name “Stitch Google Analytics” would create a schema called stitch_google_analytics in the destination. Note: Schema names cannot be changed after you save the integration.

Step 2: Define the historical sync

The Sync Historical Data setting will define the starting date for your Google Analytics integration. This means that data equal to or newer than this date will be replicated to your data warehouse.

Change this setting if you want to replicate data beyond Google Analytics’s default setting of 30 days. For a detailed look at historical replication jobs, check out the Syncing Historical SaaS Data guide.

Step 3: Create a replication schedule

In the Replication Frequency section, you’ll create the integration’s replication schedule. An integration’s replication schedule determines how often Stitch runs a replication job, and the time that job begins.

Google Analytics integrations support the following replication scheduling methods:

To keep your row usage low, consider setting the integration to replicate less frequently. See the Understanding and Reducing Your Row Usage guide for tips on reducing your usage.

Step 4: Authorize Stitch and select a Google Analytics profile

  1. Next, you’ll be prompted to log into your Google account and to approve Stitch’s access to your Google Analytics data. Note that we will only ever read your data.
  2. Click Allow to continue.
  3. After your credentials are validated, you’ll be prompted to select the Google Analytics profile you want to connect to Stitch:

    Selecting a Google Analytics profile.

    Remember: Profiles need to have Read & Analyze permissions to be detected by Stitch. If you don’t see the profile you want in this list, we recommend that you double-check the permission settings.

  4. When finished, click Continue.

Step 5: Select metrics and dimensions

  1. In the Metrics and Dimensions fields, you can search or use the drop-down to explore your options:

    Selecting Metrics & Dimensions.

  2. Click the Metric or Dimension in the menu to add it to the configuration.
  3. To add a Custom Metric or Dimension, type out the name exactly in its entirety. If you try to search for it and add a Metric/Dimension that looks like this - ga:customMetricXX - you’ll run into issues.

    For example: let’s say you want to add custom metric 10 to the configuration. To add it, you would type ga:metric10 in the Choose Metrics field like this:

    Adding a custom metric.

    1. As you add Metrics and Dimensions, Stitch will check for compatibility. If there are any conflicts, you’ll need to resolve them before you can save the integration. Use Google’s Dimensions & Metrics Explorer as a guide when selecting Metrics and Dimensions.

    2. Review your selections. Remember: once saved, Metrics and Dimensions can’t be added or removed.

    3. When you’re finished, click Save Integration.

Initial and historical replication jobs

After you finish setting up Google Analytics, its Sync Status may show as Pending on either the Stitch Dashboard or in the Integration Details page.

For a new integration, a Pending status indicates that Stitch is in the process of scheduling the initial replication job for the integration. This may take some time to complete.

Free historical data loads

The first seven days of replication, beginning when data is first replicated, are free. Rows replicated from the new integration during this time won’t count towards your quota. Stitch offers this as a way of testing new integrations, measuring usage, and ensuring historical data volumes don’t quickly consume your quota.


Google Analytics replication

Stitch replicates data in this way to account for updates made to existing records within the default attribution window of 15 days, thus ensuring you won’t make decisions based on stale (or false) data. As a result, you may see a higher number of replicated rows than what’s being generated in Google Analytics.

Setting the Replication Frequency to a higher frequency - like 30 minutes - can result in re-replicating recent data and contribute to greater row usage. Selecting a lower frequency can help keep your row count low.


Google Analytics table schemas

report

Replication Method :

Key-based Incremental

Replication Key :

start-date

Primary Key :

start_date : end_date : [DIMENSIONS_YOU_SELECT]

The report table will contain the Metrics and Dimensions selected during integration setup.

Data pagination

Google Analytics data is paginated on a daily basis. This means that a single row in this table pertains to a specific day. Use the start_date and end_date columns to identify what day the row is for.

Primary keys

The Primary Key for this table is a composite key made up of all Dimension columns and the start_date and end_date columns.

For example: If referralPath and country were Dimension columns in the table, both columns, along with start_date and end_date would make up the Primary Key for the table.

start_date
DATE-TIME

The start date for the record.

end_date
DATE-TIME

The end date for the record.

[DIMENSIONS_YOU_SELECT]
VARIES

A column will be created for every Dimension selected during setup. Every Dimension column comprises the composite Primary Key for the table.

[METRICS_YOU_SELECT]
VARIES

A column will be created for every Metric selected during setup.



Questions? Feedback?

Did this article help? If you have questions or feedback, feel free to submit a pull request with your suggestions, open an issue on GitHub, or reach out to us.