Extending a Surface Realizer to Generate Coherent Discourse
- đ¤ Speaker: Eva Banik, Open University
- đ Date & Time: Friday 05 June 2009, 12:00 - 13:00
- đ Venue: SW01, Computer Laboratory
Abstract
The ultimate aim of research on natural language generation is to develop large-scale, domain independent NLG systems, which are able to generate high quality, fluent and well-formatted texts. In order to produce high quality, coherent text, generators need to be able to model referential coherence and pronominalization, insert appropriate discourse connectives using appropriate constructions (e.g. preposed, postposed or interposed subordinate clauses) and provide a way for the user to specify which bits of information should be emphasized in the text.
Many NLG systems use a pipeline architecture where linguistic information is distributed across several system modules. These systems typically introduce additional modules (e.g. an aggregation or revision module) in order to model the above phenomena, resulting in more complex systems with limited flexibility. Using this approach, the research challenges in NLG become system engineering tasks, limited to questions such as: what modules should a system have, how should these modules be ordered, and how should the interactions between modules be handled.
In this talk I would like to present a slightly different perspective, where some of the research challenges in NLG are reformulated as grammar engineering tasks. I will argue that when linguistic resources in an NLG system are centralized we can model constraints on discourse coherence by simply incorporating more linguistic information into the grammar of a surface realizer. This approach improves the flexibility of the system (i.e. produces more paraphrases for the same input) and makes it possible to generate coherent text without additional modules.
Series This talk is part of the NLIP Seminar Series series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- SW01, Computer Laboratory
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Eva Banik, Open University
Friday 05 June 2009, 12:00-13:00