Value Reasoning and Test-Time Verification for Trustworthy LLMs
- 👤 Speaker: Prof. Lu Wang (University of Michigan)
- 📅 Date & Time: Thursday 19 June 2025, 14:00 - 15:00
- 📍 Venue: https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09
Abstract
Abstract: Despite their impressive capabilities, large language models (LLMs) continue to face significant limitations in complex real-world settings, particularly when navigating high-stakes moral reasoning or when efficient and trustworthy test-time behavior is required. This talk explores two complementary directions that address these challenges: evaluating limitations in value reasoning and scaling verification through efficient process supervision.
First, I introduce CLASH , a new benchmark that examines how well LLMs reason about dilemmas involving conflicting values. CLASH enables a structured analysis of decision ambivalence, psychological discomfort, and value shifts over time. The benchmark reveals the difficulty LLMs have in representing nuanced human value reasoning, especially in ambiguous or temporally dynamic contexts.
Second, I present ThinkPRM, a generative process reward model that enables step-by-step verification using long chain-of-thought reasoning. Unlike traditional discriminative PRMs that require extensive labeled supervision, ThinkPRM is trained on only a fraction of the process data by leveraging LLMs’ inherent reasoning abilities to generate and verify each step in a solution. This approach supports more scalable and efficient test-time oversight, outperforming strong baselines in various domains.
Bio: Lu Wang is an Associate Professor in Computer Science and Engineering at University of Michigan, Ann Arbor. Previously, she was an Assistant Professor in Khoury College of Computer Sciences at Northeastern University. She received her Ph.D. in Computer Science from Cornell University. Her research focuses on building trustworthy large language models that produce factual, accurate, and safe content. She has been working on problems of summarization, reasoning, evaluation, as well as applications in AI for education and computational social science. Lu has received paper awards at ACL , CHI, and SIGDIAL . She won the NSF CAREER award in 2021.
Series This talk is part of the Language Technology Lab Seminars series.
Included in Lists
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Guy Emerson's list
- https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09
- Interested Talks
- Language Sciences for Graduate Students
- Language Technology Lab Seminars
- ndk22's list
- ob366-ai4er
- rp587
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Prof. Lu Wang (University of Michigan)
Thursday 19 June 2025, 14:00-15:00