There’s a new type of Google Analytics spam in town and it’s eroding the accuracy of your GA data. It’s being referred to as Language Spam.
Take a look at your Language report in Google Analytics under Audience > Geo > Language. It should look like this:
You can see that it’s nice and logical, with the different languages broken out and easy to analyse.
Now there’s a new type of spam / fake traffic which ruins our language analysis. It looks like this:
It’s yet another threat to the accuracy of your GA data.
What you might also be interested to know is that the example above is taken from a site that hasn’t installed Google Analytics. This means that data is being pushed into Google Analytics without the code being present on any pages.
Why is this happening?
Spammers want to acquire traffic. The tactics used in this particular campaign are an interesting blend of old-school marketing psychology.
The tactics look like this:
- Leave an entry in GA reports to a sub-domain that appears to be owned by Google.
- Make people curious: Secret
- Imply scarcity: Enter only with this ticket URL
- Add a hot topic: Vote for Trump!
Their goal is then to bake the ingredients into a data payload and automate delivery to as many website owners’ Google Analytics accounts as possible. Once people visit the faked Google domain, the spammers can obtain PII, deliver viruses, generate ad revenue, etc.
The Language Spam Solution
Thankfully this threat can be taken down swiftly and without too much hassle. Make sure you’re logged into Google Analytics and we’ll walk you through the next steps.
Step 1 – Create a Custom Segment
Create a Custom Segment to test your filter pattern and preview your historic data. Best practice is to always configure a Custom Segment in your Master View where you can safely test the filter logic and see how different filter settings change your data.
1. Click +Add Segment
2. Set Up the Custom Segment Pattern
- Use clear descriptive labelling
- Set ‘Language’ to ‘does not contain’ and the pattern as a full stop: ‘.’ Language codes never contain a full stop, so this setting excludes any entry which includes one.
- Click Preview to see the effect
- Use the custom segment for any historic reporting to view your data without Language Spam.
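The segment rule above boils down to a simple substring test. Here’s a minimal Python sketch of that logic, assuming the Language report has been exported as plain (language, sessions) rows. The sample rows and the function name `is_language_spam` are made up for illustration and are not real report data.

```python
def is_language_spam(language: str) -> bool:
    """Mirror the segment rule: genuine language codes never contain a full stop."""
    return "." in language


# Hypothetical rows: (language value, sessions). Invented for this example.
rows = [
    ("en-gb", 120),
    ("en-us", 80),
    ("secret.example.com You are invited!", 50),
    ("(not set)", 10),
]

# Keep only the rows the segment would keep.
clean_rows = [(lang, sessions) for lang, sessions in rows if not is_language_spam(lang)]

# Work out what share of sessions the segment removes.
spam_sessions = sum(sessions for lang, sessions in rows if is_language_spam(lang))
total_sessions = sum(sessions for _, sessions in rows)
spam_share = spam_sessions / total_sessions

print(clean_rows)
print(f"{spam_share:.2%} of sessions flagged as Language Spam")
```

Note that `(not set)` contains no full stop, so this rule leaves it in place.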
Here’s the data with the Custom Segment applied. As you’ll see, we’ve removed 25.64% of Language Spam using this segment:
If the custom segment works as expected, save it. I know what you’re thinking: what about the (not set) bucket? We’ll look into (not set) in another blog post.
Step 2 – Add the Filter to your Test View
It’s best practice to run the filter on your Test View for a period before applying it to the master view at a later stage.
1. Select your Test View
2. Click Admin > Filters
3. Select + Add Filter
4. Give the filter a descriptive name, e.g. ‘Exclude Language Spam Referrers’
5. Select ‘Custom’ in the Filter Type
6. Click the ‘Exclude’ radio button
7. Set the Filter Field to ‘Language Settings’ and add ‘\.’ as the Filter Pattern (that’s an escaped full stop)
8. Click ‘Save’
9. Leave the filter in place for a few days to allow sufficient traffic to collect
10. Compare the Test View Language report entries with your Master View report entries. You should be able to see differences in data between your reports.
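Why does the full stop need escaping in the filter but not in the segment? The filter pattern is a regular expression, where a bare ‘.’ matches any character. The sketch below shows the difference. It uses Python’s `re` module as a stand-in for GA’s own regex handling, which is an assumption on our part, and the helper name `excluded` and the sample values are hypothetical.

```python
import re

FILTER_PATTERN = r"\."    # escaped: matches a literal full stop only
UNESCAPED_PATTERN = r"."  # unescaped: matches any single character


def excluded(pattern: str, language: str) -> bool:
    """Return True if an exclude filter with this pattern would drop the hit."""
    return re.search(pattern, language) is not None


# Hypothetical language values, invented for this example.
samples = ["en-gb", "fr", "secret.example.com Vote now"]

escaped_result = [excluded(FILTER_PATTERN, s) for s in samples]
unescaped_result = [excluded(UNESCAPED_PATTERN, s) for s in samples]

print(escaped_result)    # only the spam entry is excluded
print(unescaped_result)  # every entry is excluded
```

In other words, forgetting the backslash would throw away all of your traffic, which is another good reason to try the filter on a Test View first.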
Step 3 – Apply the Language Spam Filter to your Master View
Once you’ve collected enough data in your Test View and are satisfied that no Language Spam is being collected any more, apply the Filter to your Master View. Filters are not retrospective, so if you need to see historic data without any Language Spam, you’ll need to apply the Custom Segment you created in Step 1.
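If you export the Language report from both views, the “am I satisfied?” check can be sketched in a few lines of Python. This is only an illustration: the function names `spam_only_in_master` and `view_is_clean` and the sample lists are hypothetical, and it assumes each export is reduced to a plain list of language values.

```python
def spam_only_in_master(master_languages, test_languages):
    """List the entries that appear in the Master View but not in the Test View."""
    return sorted(set(master_languages) - set(test_languages))


def view_is_clean(languages):
    """A view is clean when no language value contains a full stop."""
    return not any("." in lang for lang in languages)


# Hypothetical exports, invented for this example.
master = ["en-gb", "en-us", "secret.example.com Vote now", "(not set)"]
test = ["en-gb", "en-us", "(not set)"]

removed = spam_only_in_master(master, test)
ready_to_promote = view_is_clean(test)

print(removed)
print(ready_to_promote)
```

If `removed` contains only spam entries and the Test View comes back clean, the filter is doing its job.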
You’re not done. There’s other fake traffic creeping into your reports, like Ghost Spam, Spam Referrers, Event Spam, Campaign Spam… Don’t worry, you’ll be clearing out more fake traffic in a follow-up to this post when we take a look at Ghost Spam – wooooOOOOooo.
If you liked this post, you might find our Guide to Setting Up Google Analytics Like a Pro useful. You can download that for free by filling out the form below:
There are a lot of resources talking about GA spam, but Mike Sullivan’s guide provides solid, trusted advice and is highly recommended. Get coffee and chocolate sorted before you click this link!