If you want to learn a technique, it is a good idea to go back to its source. Whilst Rayson and Garside didn’t invent the technique, they perfected it! In the last post I explained how I implemented their work; this post is all about the ins and outs of their paper, which has been cited a huge 492 times!
Rayson, P., & Garside, R. (2000, October). Comparing corpora using frequency profiling. In Proceedings of the workshop on Comparing Corpora (pp. 1-6). Association for Computational Linguistics.
Continue reading “Comparing Corpora using Frequency Profiling – Rayson & Garside”
You know me, I’m fascinated by masculinities online and when I came across this citation I just couldn’t resist! I’m usually a stickler for methodology in gender research but this paper really got me thinking. I’ll admit it’s not my perfect cup of tea…
But it’s pretty close!
Schmitz, R. M., & Kazyak, E. (2016). Masculinities in Cyberspace: An Analysis of Portrayals of Manhood in Men’s Rights Activist Websites. Social Sciences, 5(2), 18.
Continue reading “Masculinities in Cyberspace – Schmitz & Kazyak”
Yep, the stats are back this week and they are even better! I took my lunch break to implement the log likelihood ratio that is described in this fascinating paper. It took me less time to code than the blasted chi squared and runs at least 10 times as fast. Here’s how I did it!
Continue reading “How to implement a log likelihood test on corpora using Python”
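For readers who want the gist before clicking through, here is a minimal sketch of the log-likelihood statistic as set out by Rayson and Garside (2000). It is not the code from the post: the function name `log_likelihood`, its parameter names, and the toy counts are all assumptions made for illustration.

```python
import math


def log_likelihood(freq_a, freq_b, size_a, size_b):
    """Log-likelihood score for one word across two corpora.

    freq_a, freq_b: occurrences of the word in corpus A and corpus B.
    size_a, size_b: total number of words in corpus A and corpus B.
    All names here are hypothetical, chosen for this sketch.
    """
    total = size_a + size_b
    combined = freq_a + freq_b
    # Expected counts if the word were equally frequent in both corpora.
    expected_a = size_a * combined / total
    expected_b = size_b * combined / total

    score = 0.0
    for observed, expected in ((freq_a, expected_a), (freq_b, expected_b)):
        # A zero count contributes nothing (the limit of x * ln(x) is 0).
        if observed > 0:
            score += observed * math.log(observed / expected)
    return 2 * score


# Toy counts: a word seen 20 times in corpus A and never in corpus B,
# with both corpora 1,000 words long.
print(log_likelihood(20, 0, 1000, 1000))
```

A higher score means the word's frequency differs more between the two corpora. With one degree of freedom, a score above 3.84 corresponds to p < 0.05.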
Why and how is black masculinity replicated by white boys in a mixed-race US high school? The answer may only concern one boy in one racially charged situation, but it does offer some pointers on how masculinity is represented in linguistics.
Bucholtz, M. (1999). You da man: Narrating the racial other in the production of white masculinity. Journal of Sociolinguistics, 3(4), 443-460.
Continue reading “You Da Man – Bucholtz”
Following on from the corpus chapter that I read through here, I thought that I would pick up an old project that I might explain in another post. Long story short, I wanted to try to build a system that takes input text and returns innuendo. I chose innuendo as a form of humour because of the seeming ease with which anything can be twisted, meaning training material for the system would be plentiful.
Continue reading “Corpus Annotation”
I went into this chapter (24 in the Oxford Handbook of Computational Linguistics) to answer a question that motivated me to get the book in the first place: “How should I extract a quantitative proof from a corpus?”. Unfortunately, it didn’t answer this question, but it did provide a great jumping-off point for further research.
Mitkov, R. (2005). The Oxford handbook of computational linguistics. Oxford University Press.
Continue reading “Corpus Linguistics – Tony McEnery”