Ad blocking is a big deal. It’s had it’s time in European Court and has been ruled legal but what is the impact it’s making on our online society? This paper asks what if 40% of diners at a restaurant just walked out and didn’t pay in the same way 40 ish % of people don’t consume advertisements? But first a word from our sponsors …
Shiller, Benjamin, Joel Waldfogel, and Johnny Ryan. “The effect of ad blocking on website traffic and quality.” The RAND Journal of Economics 49, no. 1 (2018): 43-63.
Continue reading “The effect of ad blocking on website traffic and quality – Shiller Wadfogel & Ryan”
So I’m sure you’re totally interested in what I do at work BUT this is a cute, pretty and clever… I think!
So the notebook is here on my github but here are some of the highlights that I’m really happy with.
Continue reading “Clustering Search Volumes With KMeans”
If you want to learn how to do a technique then it might be an idea to check the source of the technique in the first place. Whilst Rayson and Garside didn’t invent the technique, they perfected it! In the last post I explained how I implemented their work, this post is all about the ins and outs of their paper that has been cited a huge 492 times!
Rayson, P., & Garside, R. (2000, October). Comparing corpora using frequency profiling. In Proceedings of the workshop on Comparing Corpora(pp. 1-6). Association for Computational Linguistics.
Continue reading “Comparing Corpora using Frequency Profiling – Rayson & Garside”
Yep, the stats are back this week and they are even better! I took my lunch break to implement the log likelihood ratio that is described in this fascinating paper. It took me less time to code than the blasted chi squared and runs at least 10 times as fast. Here’s how I did it!
Continue reading “How to implement a log likelihood test on corpora using Python”