This integration is certified by Stitch. For support, contact Stitch support.
Google Analytics snapshot
A high-level look at Stitch's Google Analytics integration, including release status, useful links, and the features supported in Stitch.
|Configurable Replication Methods||
Connecting Google Analytics
Connecting your Google Analytics data to Stitch is a five-step process:
- Add Google Analytics as a Stitch data source
- Define the Historical Sync
- Define the Replication Frequency
- Authorize Stitch & select a Google Analytics profile to track
- Select metrics and dimensions to sync
Before you get started, you should verify that:
The user creating the integration has at least Read & Analyze permissions and that there’s recent data in the account. If the profile you use to connect doesn’t have these permissions (or there’s no data in the account), you’ll receive an error message like this:
“Something went wrong. None of the Google Analytics profiles associated with the credentials you’ve supplied contain data that Stitch can access. Please make sure that the credentials you’ve supplied have appropriate access.”
All ad-blocking software you are currently using is paused. Because Google authentication uses pop ups, you may encounter issues if ad blockers aren’t disabled during the setup.
Add Google Analytics as a Stitch data source
- Sign into your Stitch account.
On the Stitch Dashboard page, click the Add Integration button.
Click the Google Analytics icon.
Enter a name for the integration. This is the name that will display on the Stitch Dashboard for the integration; it’ll also be used to create the schema in your destination.
For example, the name “Stitch Google Analytics” would create a schema called
stitch_google_analyticsin the destination. Note: Schema names cannot be changed after you save the integration.
Define the Historical Sync
The Sync Historical Data setting will define the starting date for your Google Analytics integration. This means that data equal to or newer than this date will be replicated to your data warehouse.
Change this setting if you want to replicate data beyond Google Analytics’s default setting of 30 days. For a detailed look at historical replication jobs, check out the Syncing Historical SaaS Data guide.
Create a replication schedule
In the Replication Frequency section, you’ll create the integration’s replication schedule. An integration’s replication schedule determines how often Stitch runs a replication job, and the time that job begins.
Stitch offers two methods of creating a replication schedule:
- Replication Frequency: This method requires selecting the interval you want replication to run for the integration. Start times of replication jobs are based on the start time and duration of the previous job. Refer to the Replication Frequency documentation for more information and examples.
Anchor scheduling: Based on the Replication Frequency, or interval, you select, this method “anchors” the start times of this integration’s replication jobs to a time you select to create a predictable schedule. Anchor scheduling is a combination of the Anchor Time and Replication Frequency settings, which must both be defined to use this method. Additionally, note that:
- A Replication Frequency of at least one hour is required to use anchor scheduling.
An initial replication job may not begin immediately after saving the integration, depending on the selected Replication Frequency and Anchor Time. Refer to the Anchor Scheduling documentation for more information.
- You’ll need to contact support to request using an Anchor Time with this integration.
To keep your row usage low, consider setting the integration to replicate less frequently. See the Understanding and Reducing Your Row Usage guide for tips on reducing your usage.
Authorize Stitch & Select a Google Analytics Profile
- Next, you’ll be prompted to log into your Google account and to approve Stitch’s access to your Google Analytics data. Note that we will only ever read your data.
- Click Allow to continue.
After your credentials are validated, you’ll be prompted to select the Google Analytics profile you want to connect to Stitch:
Remember: profiles need to have Read & Analyze permissions to be detected by Stitch. If you don’t see the profile you want in this list, we recommend that you double-check the permission settings.
- When finished, click Continue.
Select Metrics & Dimensions to Sync
After you grant Stitch access to your Google Analytics profile, you can select the specific Metrics and Dimensions you want to replicate to your destination.
Before you get started, note the following:
Metric/Dimension combos that adhere to Google’s compatibility rules can be saved. Stitch will display a notification if a conflict is found while adding Metrics and Dimensions. Integrations with incompatible Metric/Dimension combos can’t be saved.
We recommend creating additional Google Analytics integrations for different reports if you run into compatibility issues.
Metric/Dimension combos can’t be changed after the integration is saved. The Primary Key Stitch creates for Google Analytics integration tables is a composite key composed of the Dimensions selected during setup. Adding or removing Dimensions will change the Primary Key, potentially leading to issues with identifying new data for replication or de-duping data.
Google limits the number of Metrics and Dimensions you can select. You can select up to 10 Metrics and seven Dimensions per integration. Refer to Google’s documentation for more info on these limits.
Segments and Filters aren’t currently supported. If you’re interested in us adding these features, please get in touch with us.
Select Metrics & Dimensions
In the Metrics and Dimensions fields, you can search or use the drop-down to explore your options:
- Click the Metric or Dimension in the menu to add it to the configuration.
To add a Custom Metric or Dimension, type out the name exactly in its entirety. If you try to search for it and add a Metric/Dimension that looks like this -
ga:customMetricXX- you’ll run into issues.
For example: let’s say you want to add
custom metric 10to the configuration. To add it, you would type
ga:metric10in the Choose Metrics field like this:Tip: Not sure what the name of a custom Metric or Dimension is? Sign into your Google Analytics Dashboard and open the Custom Metrics or Dimensions page (Admin > Property Column > Custom Definitions > Custom Metric/Dimension).
In the row for the Metric or Dimension, take the number in the Index column and append it to either
ga:dimension. For example: if the index for a custom Dimension is
1, the name would be
- As you add Metrics and Dimensions, Stitch will check for compatibility. If there are any conflicts, you’ll need to resolve them before you can save the integration. Use Google’s Dimensions & Metrics Explorer as a guide when selecting Metrics and Dimensions.
- Review your selections. Remember: once saved, Metrics and Dimensions can’t be added or removed.
- When you’re finished, click Save Integration.
Initial and historical replication jobs
After you finish setting up Google Analytics, its Sync Status may show as Pending on either the Stitch Dashboard or in the Integration Details page.
For a new integration, a Pending status indicates that Stitch is in the process of scheduling the initial replication job for the integration. This may take some time to complete.
Initial replication jobs with Anchor Scheduling
If using Anchor Scheduling, an initial replication job may not kick off immediately. This depends on the selected Replication Frequency and Anchor Time. Refer to the Anchor Scheduling documentation for more information.
Free historical data loads
The first seven days of replication, beginning when data is first replicated, are free. Rows replicated from the new integration during this time won’t count towards your quota. Stitch offers this as a way of testing new integrations, measuring usage, and ensuring historical data volumes don’t quickly consume your quota.
Replicating Google Analytics Data
Every time Stitch runs a replication job for Google Analytics, the last 15 days’ worth of data will be replicated.
This is applicable to all tables in the integration.
Stitch replicates data in this way to account for updates made to existing records within the default attribution window of 15 days, thus ensuring you won’t make decisions based on stale (or false) data. As a result, you may see a higher number of replicated rows than what’s being generated in Google Analytics.
Setting the Replication Frequency to a higher frequency - like 30 minutes - can result in re-replicating recent data and contribute to greater row usage. Selecting a lower frequency can help keep your row count low.
Google Analytics Schema
A single table - called
report - will be created in your data warehouse for each Google Analytics integration you create.
The schema of this table will be composed of the Metrics and Dimensions you selected during the setup process and two other columns:
If, for example, you selected the following Metrics and Dimensions during setup:
The table would look like the
The Primary Key for a Google Analytics table is a composite key made up of the Dimension columns and the
For example: the Primary Key for the
SAMPLE_TABLE would be:
Table Rows & Data Pagination
Google Analytics data is paginated on a daily basis. This means that a single row in the
report table pertains to a specific day. Use the
end_date columns to identify what day the row is for.
Replication Method: Key-based Incremental
Contains Nested Structures?: No
This is a sample table. The section below contains the real info for the
report table that will be created in your data warehouse.
ga:country Metrics & Dimensions were selected during setup, the schema of the table would include these attributes:
Replication Method: Key-based Incremental
Contains Nested Structures?: No
This is the table that will be created in your data warehouse. The columns in this table will be the Metrics and Dimensions you selected during setup.
Replication & Attribution Windows
Every time a replication job runs for Google Analytics, the past 15 days' worth of data will be replicated for this table. As a result, you may see a higher number of replicated rows than what's being generated in Google Analytics.
Stitch replicates data in this way to account for updates made to existing records within Google Analytics's default attribution window, thus ensuring you won't make decisions based on stale (or false) data.
For more info on the Metrics and Dimensions in Google Analytics, check out their documentation.
Metrics selected by you during setup
Dimensions selected by you during setup
Google Analytics & (not set) Values
According to Google’s documentation:
(not set)is a placeholder name that Analytics uses when it hasn’t received any information for a dimension.
In Google Analytics, the reasons for a
(not set) value can vary depending on the report you’re looking at. More info for the various report types can be found here in Google’s documentation.