Collect Google Analytics events in your own cheap AWS warehouse with Snowplow

Few weeks ago we wrote about how to set-up your own web tracking infrastructure and data warehouse with Snowplow and AWS in very cost efficient way, close to if not exactly $1/month. This post is about how to collect Google Analytic events to your own data warehouse using similar approach.

Review of Snowplow/S3/Athena costs

It’s September now, so good time to check what was the real cost to run our Snowplow/AWS website visitors tracking set-up. When we set-up our AWS environment, we created a service tag for billing to track all costs related with our solution: S3, Cloudfront, Lambda, Athena. Here is what we got for this tag for …

Connecting R to Athena to analyse Snowplow events

We previously wrote about how to collect website data with Snowplow Analytics, save it to AWS S3, process and enrich with Lambda function and retrieve with AWS Athena. In this post I’ll write about how to access this data through one of the most popular languages used in data science R. AWS Athena query result …