Refresh Your GA4: Clear Out Data Clutter
I hosted this webinar a few months ago and am bringing it back again because you should be checking your GA4 every few months for new data clutter!
Prefer video? Here you go:
Why your GA4 setup needs regular maintenance
Just as your home needs the mattress flipped and windows washed, your GA4 also needs regular check ups to prevent these common issues:
Whether your account has been accidentally neglected or changed hands multiple times, this webinar will help you find what’s broken and fix it up.
Configuration issues: the foundation of clean data
The most common configuration issues that I encounter are leftovers from when GA4 was originally set up, likely in a rush, and then forgotten about.
Universal Analytics connection check
✅ What to look for
Check if GA4 is still receiving data via the outdated Universal Analytics connection. This method could impact your data quality, especially since UA has sunset. You’ll want to ensure you’re using proper GA4 data collection instead.
🛠 Steps to take
💡 What I recommend
Time zone and currency settings
✅ What to look for
Make sure that your time zone is correct for your business location and reporting needs, and your currency is set correctly to match your transaction values.
🛠 Steps to take
💡 What I recommend
Data retention settings
✅ What to look for
Check if your data retention period is set to 14 months (instead of the default 2 months). This setting only impacts Explorations and some advanced reports—not standard GA4 reports.
🛠 Steps to take
💡 What I recommend
Tracking issues: are you collecting the right data?
Tracking issues are when your GA4 setup says it’s tracking something important—like form submissions or button clicks—but it isn’t working as expected.
“It drives me bananas when people track a click on the submit button and call it a form submission. That’s not a submission!”
Multiple web data streams
✅ What to look for
Check if your property has more than one web data stream. Users are not deduplicated between different web streams, which will result in fragmentation of user data and elevated user and session counts.
🛠 Steps to take
💡 What I recommend
“Users aren’t combined between streams—so attribution breaks and reporting becomes a mess.”
Enhanced measurement events
✅ What to look for
Enhanced measurement events don’t always work correctly for all websites. Plus they can be disabled, potentially limiting your ability to track important engagement events.
🛠 Steps to take
💡 What I recommend
Google Signals
✅ What to look for
Check whether Google Signals is enabled, as it has significant privacy implications, especially for businesses operating in regions with strict data protection laws.
“Don’t assume the right person made the decision. Just because it’s on doesn’t mean it should be.”
🛠 Steps to take
💡 What I recommend
“I’ve seen this turned on way too many times by PPC teams who were told to do it by a Google Ads help rep. Don’t let that be your only reason.”
Traffic and channel issues: where are your conversions coming from?
When your traffic isn’t properly sorted into channels in GA4, it can lead to attribution issues and missed opportunities.
Unassigned traffic
✅ What to look for
Review how much traffic is falling into the “Unassigned” channel, which I call “the junk drawer of GA4” where valuable traffic can get lost.
🛠 Steps to take
Look at your GA4 traffic breakdown to see how much is falling under “Unassigned.” Use tools like my GA4 Audit Dashboard to help identify where this traffic is coming from and why.
💡 What I recommend
Mixed case sources, mediums, and URLs
✅ What to look for
All data In GA4 is case-sensitive. This causes particular issues with URL and attribution data. For example, a source of Facebook and another source of facebook are treated as two entirely separate sources—splitting your data and making reporting messy or misleading. Similarly, page views of /about/ and /About/ will be split up.
🛠 Steps to take
Review your traffic reports (or use the GA4 Audit Dashboard) to identify inconsistencies in capitalization or spacing across your source/medium values.
💡 What I recommend
Event and conversion tracking: are you measuring what matters?
Event and conversion tracking in GA4 is where things can go wrong either by tracking too much, too little, or the wrong things altogether.
Event quantity and quality
✅ What to look for
Review your event list for cluttered, redundant, or poorly named events that make reporting difficult.
🛠 Steps to take
Audit your full list of events and ask: Are these useful? Are there duplicates? Are naming conventions consistent?
💡 What I recommend
Form submit tracking
✅ What to look for
Form submit tracking is often inaccurate—many properties track clicks on a submit button rather than actual successful submissions. This inflates conversion numbers and leads to misleading reporting.
🛠 Steps to take
💡 What I recommend
Custom dimensions and metrics
✅ What to look for
Many GA4 properties either don’t use custom dimensions at all or configure them unnecessarily. Poor use of custom dimensions, such as registering something GA4 already tracks (e.g. page_location) can lead to cardinality issues. This can cause sampling, misreporting, or entire rows of data grouped under “(other),” making it harder to get clean, actionable insights.
🛠 Steps to take
💡 What I recommend
“Custom dimensions are your best friend when it comes to debugging.”
Governance: who's minding your analytics?
Even the best GA4 setup can be undermined by poor governance. Too many admin users, product links set up by long-gone employees, and a lack of annotations can cause issues down the road.
Product link credentials
✅ What to look for
Connections between GA4 and other products—like Google Ads or BigQuery—should be set up using credentials your organization controls, not those belonging to an agency or individual. Otherwise, you risk broken links and data loss if someone leaves or access is revoked.
🛠 Steps to take
💡 What I recommend
Access management
✅ What to look for
Too many users—or the wrong users—may have access to your GA4 property, often with more permissions than necessary. It’s common to find former employees, agencies, or random Gmail addresses with admin-level access, which can put your data at risk.
🛠 Steps to take
💡 What I recommend
Annotations
✅ What to look for
GA4 now supports annotations, but many properties haven’t started using them—or are using them inconsistently. Missing or incomplete annotations make it harder to connect traffic changes with real-world events.
🛠 Steps to take
💡 What I recommend
Key takeaways from the Q&A
Has GA4 fixed the issue where it was firing a form submit for the Meta pixel?
Yes, this has been fixed. Previously, the Meta (formerly Facebook) pixel would trigger a form submit event, but Meta has fixed this issue. If you’re still seeing it, make sure you’re using the most recent version of the Meta pixel code.
My GA4 is showing traffic, but under data streams I get this warning that data collection is not yet active for my website. Why?
This warning typically appears for the first 72 hours up to a week after setup. If you’re seeing traffic data, it’s working correctly despite the warning. If it persists beyond a week, check that all pages have the GA4 tag.
Regarding AI traffic that shows up under Unassigned, should we just wait for Google to catch up and assign it to an appropriate channel?
Don’t wait. Create a custom channel for AI traffic in GA4. This issue is partly due to how tools like ChatGPT set up their source/medium parameters incorrectly and we can’t wait for them to fix up their tracking.
Do you know the best way to track indirect conversions in Looker Studio on an asset-based basis? For example, conversions from users who view a piece of content, went to another page, and then converted.
Use audiences in GA4. Create segments in explorations for users who viewed specific content and then converted. You can then create audiences from these segments and use them in Looker Studio reports. (We cover this technique in our Practical GA4 course!)
I’m seeing a lot of "not set" in reports. What can we do to fix this and get more helpful data?
The “not set” value would appear for data in the last 24-48 hours but that was changed to “data not available” on March 10, 2025. Any more recent “not set” data is likely due to either:
Consider reporting on the 3rd-5th of the month rather than the 1st to allow GA4 time to process data from the end of the previous month.
For audiences, do actions have to happen in the same session or do they cross over?
When creating audiences, you can choose a condition for the scope to be for a user/device across all sessions, within the same session, or within the same event. Remember that without user ID tracking, “user” just means device.
What other GA4 questions do you have? Drop them in the comments!