COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > NLIP Seminar Series > Texts Come from People - How Demographic Factors Influence NLP Models
Texts Come from People - How Demographic Factors Influence NLP ModelsAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Kris Cao. The way we express ourselves is heavily influenced by our demographic background. I.e., we don’t expect teenagers to talk the same way as retirees. Natural Language Processing (NLP) models, however, are based on a small demographic sample and approach all language as uniform. As a result, NLP models perform worse on language from demographic groups that differ from the training data, i.e., they encode a demographic bias. This bias harms performance and can disadvantage entire user groups. Sociolinguistics has long investigated the interplay of demographic factors and language use, and it seems likely that the same factors are also present in the data we use to train NLP systems. In this talk, I will show how we can combine statistical NLP methods and sociolinguistic theories to the benefit of both fields. I present ongoing research into large-scale statistical analysis of demographic language variation to detect factors that influence the performance (and fairness) of NLP systems, and how we can incorporate demographic information into statistical models to address both problems. This talk is part of the NLIP Seminar Series series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsCambridge University Eco Racing Enterprise Tuesday 2016-2017 Cambridge Global Food Security CMIH short course: Image Reconstruction in Biomedical Imaging Cyber Security Society talks UK-Japan network for high-speed microscopy in cellsOther talksEffective Conference Presentations and Networking Bioengineering conference: Innovation through convergence Athena SWAN Network Event: Changing Culture “It’s like they’re speaking a different language!” Investigating an accidental resistance to school mathematics reform Bank credit rating changes, capital structure adjustments and lending CANCELLED: How and why the growth and biomass varies across the tropics |