Discovering interesting usage patterns in text collections

Marek Rei and Helen Yannakoudakis
Monday 11 January 2010, 12:30-13:30
GS15, Computer Laboratory.

If you have a question about this talk, please contact Diarmuid Ó Séaghdha.

At this session of the NLIP Reading Group we’ll be discussing the following paper:

Anthony Don, Elena Zheleva, Machon Gregory, Sureyya Tarkan, Loretta Auvil, Tanya Clement, Ben Shneiderman and Catherine Plaisant. 2007. Discovering interesting usage patterns in text collections: integrating text mining with visualization. In Proceedings of CIKM '07.

Abstract: This paper addresses the problem of making text mining results more comprehensible to humanities scholars, journalists, intelligence analysts, and other researchers, in order to support the analysis of text collections. Our system, FeatureLens, visualizes a text collection at several levels of granularity and enables users to explore interesting text patterns. The current implementation focuses on frequent itemsets of n-grams, as they capture the repetition of exact or similar expressions in the collection. Users can find meaningful co-occurrences of text patterns by visualizing them within and across documents in the collection. This also permits users to identify the temporal evolution of usage such as increasing, decreasing or sudden appearance of text patterns. The interface could be used to explore other text features as well. Initial studies suggest that FeatureLens helped a literary scholar and 8 users generate new hypotheses and interesting insights using 2 text collections.

This talk is part of the Natural Language Processing Reading Group series.
