On the Research Gap Regarding the Study of the Content, Dissemination, and Life Cycle of Memes
There are numerous avenues for additional research on the topic of how the content of memes affects the how they spread, live, and die. In my lit review, I discuss one experiment by Barnes et al. (2021) in which the researchers use 129,000 memes scraped from reddit and fed into machine learning models for training so they could use it to predict what factors in the text and images of memes make them most likely to go viral. Yet, their entire sample was taken just on the first week of the COVID-19 pandemic in the US (mid-March 2020). The research team acknowledged that the timing, extraordinariness of the moment, and short span of data collection were all limitations that potentially affect the generalizability of their results.
Another study using machine learning by Ling et al. (2021) used a much larger sample of 160 million meme images collected over a period of about 10 years, but the place they gathered their sample from deletes posts periodically, making replicability impossible.
I would want to propose an experiment similar to Barnes et al (2021). but with a sample gathered similarly to Ling et al, (2021) except with a sample location that isn't ephemeral. Doing so would overcome the issues of both samples and provide a much bigger sample overall to train the machine learning models to use for predicting what meme content factors impact viral spread.
There is just one major problem. I'm not qualified/don't know anywhere near enough about:
- how to scrape and cluster big data samples
- how to use or train machine learning programs
- how to do advanced analysis on results that would be necessary.
Let me know what you think about which direction I should go.
Hi Mike,
ReplyDeleteI really like your idea on qualitative study here that kind of mirrors Jones et al. (i.e., "On the qualitative side..." paragraph). If it's me, I'd replicate Jones et al.'s qualitative approach but on a diff. set of data (i.e., another context/set of memes/etc.) to add to the literature that is specific to the field of TWDR. You can also get mileage out of this topic and become a known scholar on memes with more studies for you to conduct after your first findings, this time with quantitative analysis in collaboration with a statistician (I wish Utah Tech has an operating Stat Center), and so on. Of course, if you'd like to design a method involving both quant. and qual. approaches, that would be good too. Let me know if you want to Zoom -- am here. :)