University of Cambridge > Talks.cam > Natural Language Processing Reading Group > NLIP Reading Group: Unsupervised Decomposition of a Document into Authorial Components

NLIP Reading Group: Unsupervised Decomposition of a Document into Authorial Components

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact Jimme Jardine.

Tom will be presenting

@inproceedings{koppel2011unsupervised , author = {Koppel, M. and Akiva, N. and Dershowitz, I. and Dershowitz, N.} , title = {Unsupervised decomposition of a document into authorial components} , booktitle = {Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1} , year = {2011} , pages = {1356—1364} , organization = {Association for Computational Linguistics} }

Abstract We propose a novel unsupervised method for separating out distinct authorial components of a document. In particular, we show that, given a book artificially “munged” from two thematically similar biblical books, we can separate out the two constituent books almost perfectly. This allows us to automatically recapitulate many conclusions reached by Bible scholars over centuries of research. One of the key elements of our method is exploitation of differences in synonym choice by different authors.

This talk is part of the Natural Language Processing Reading Group series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2024 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity