|COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring.|
Language and Demographics on Twitter: Inferring Latent User Attributes from Streaming Communications
If you have a question about this talk, please contact Ekaterina Kochmar.
Content shared locally within a user’s social network can reveal latent attributes of a user. However, not all attributes are pronounced equally given similar amounts of content (some attributes are harder to predict). We explore various network structures on Twitter for the prediction of attributes of varying levels of difficulty (gender, age, and political beliefs), examining the impact of graph-type and amount of available content. We show that even when limited or no self-authored data is available, language from neighbor communications provide sufficient evidence for prediction. We find that a friend graph leads to highest accuracy for gender, while a follower-graph is preferred for age, and a retweet-graph is best for political belief classification.
However, the above models for social media personal analytics assume access to thousands of messages per user, even though most users author content only sporadically over time. Given this sparsity, we: (i) leverage content from the local neighborhood of a user and (ii) estimate the amount of time and tweets required for a dynamic model to predict user preferences. When updating our dynamic models over time, we find that political beliefs can be often predicted using roughly 100 tweets, depending on the context of user selection, where this could mean hours, or weeks, based on the author’s tweeting frequency.
This talk is part of the NLIP Seminar Series series.
This talk is included in these lists:
Note that ex-directory lists are not shown.
Other listsCambridge University European Society Global Sustainability Institute Seminars & Events Innovation Forum
Other talksMigration in Science Wild Immunology Was the Cat in the Hat Black?: The Hidden Racism of Children’s Literature, and the Need for Diverse Books Art speak Personal data: freedom and rights tba