Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

NLIP Reading Group: Unsupervised Decomposition of a Document into Authorial Components

Add to your list(s) Download to your calendar using vCal

Thomas Lippincott (University of Cambridge)
Thursday 13 October 2011, 12:00-13:00
GS15, Computer Laboratory.

If you have a question about this talk, please contact Jimme Jardine.

Tom will be presenting

@inproceedings{koppel2011unsupervised , author = {Koppel, M. and Akiva, N. and Dershowitz, I. and Dershowitz, N.} , title = {Unsupervised decomposition of a document into authorial components} , booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1} , year = {2011} , pages = {1356—1364} , organization = {Association for Computational Linguistics} }

Abstract We propose a novel unsupervised method for separating out distinct authorial components of a document. In particular, we show that, given a book artificially “munged” from two thematically similar biblical books, we can separate out the two constituent books almost perfectly. This allows us to automatically recapitulate many conclusions reached by Bible scholars over centuries of research. One of the key elements of our method is exploitation of differences in synonym choice by different authors.

This talk is part of the Natural Language Processing Reading Group series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

NLIP Reading Group: Unsupervised Decomposition of a Document into Authorial Components

This talk is included in these lists:

Other lists

Other talks