COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Language Technology Lab Seminars > Teaching Artificial Agents to Understand Language by Modelling Reward
Teaching Artificial Agents to Understand Language by Modelling RewardAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Edoardo Maria Ponti. Abstract: Recent progress in Deep Reinforcement Learning has shown that agents can be taught complex behaviour and solve difficult tasks, such as playing video games from pixel observations, or mastering the game of Go without observing human games, with relatively little prior information. Building on these successes, researchers such as Hermann and colleagues have sought to apply these methods to teach–in simulation–agents to complete a variety of tasks specified by combinatorially rich instruction languages. In this talk, we discuss some of these highlights and some of the limitations which inhibit scalability of such approaches to more complex instruction languages (including natural language). Following this, we introduce a new approach, inspired by recent work in adversarial reward modelling, which constitutes a first step towards scaling instruction-conditional agent training to “real world” language. Bio: Edward Grefenstette is a Research Scientist at Facebook AI Research, and Honorary Associate Professor at UCL . Prior to this, he was a Staff Research Scientist at DeepMind. He completed his DPhil (PhD) at the University of Oxford in 2013 under the supervision of Profs Coecke and Pulman, and Dr Sadrzadeh, working on applying category-theoretic tools–initially developed to model quantum information flow–to model compositionality of distributed representations in natural language semantics. His recent research has covered topics at the intersection of deep learning and machine reasoning, addressing questions such as how neural networks can model or understand logic and mathematics, infer implicit or human-readable programs, or learn to understand instructions from simulation. This talk is part of the Language Technology Lab Seminars series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsAutism TQS Journal Clubs ji247's listOther talksStudents, tourists and farmers: the publics of botanic gardens in the 18th century Emergence and Control in UK Energy Democratisation Local and global synaptic credit assignment HuR vs. TTP: Who is really the Master Regulator of CD4+ T cell Differentiation? Keeping authorities honest with verifiable append-only logs, and making backdoored software updates detectable Measuring g(r) by test-particle insertion |