Text Data Mining using Topic Modeling

Ioana Bica, Churchill College
Wednesday 02 November 2016, 19:00-19:30
Club Room, Churchill College.

If you have a question about this talk, please contact Matthew Ireland.

Room changed: club room

As we gather more and more data, finding the information we need becomes increasingly difficult. Text data mining tools can help by organizing this information in a useful and accessible way. In particular, topic modeling discovers the thematic patterns in documents, which lets us annotate and search them by theme. This talk presents how Latent Dirichlet Allocation extracts a fixed, pre-specified number of topics from documents, using a probabilistic model that assumes each document arises from a generative process. We shall also investigate how a Bayesian nonparametric model, the Chinese Restaurant Process, can be employed when the number of topics is not known in advance. Finally, we shall see how topic hierarchies can be built using the Nested Chinese Restaurant Process.
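
As a rough illustration of the Chinese Restaurant Process mentioned above, the following minimal sketch seats "customers" at "tables" one at a time. It is not taken from the talk: the function name `chinese_restaurant_process` and the parameters `num_customers`, `alpha`, and `seed` are assumptions chosen for this example. In a topic-modeling setting the tables would loosely correspond to topics, so the number of topics is not fixed beforehand but grows with the data.

```python
import random


def chinese_restaurant_process(num_customers, alpha, seed=None):
    """Seat customers one at a time and return (assignments, table_sizes).

    Illustrative sketch only; `alpha` (> 0) is the concentration parameter.
    """
    rng = random.Random(seed)
    table_sizes = []   # table_sizes[k] = number of customers at table k
    assignments = []   # assignments[n] = table index chosen by customer n
    for n in range(num_customers):
        # With n customers already seated, the next customer joins table k
        # with probability table_sizes[k] / (n + alpha) and opens a new
        # table with probability alpha / (n + alpha).
        weights = table_sizes + [alpha]
        table = rng.choices(range(len(weights)), weights=weights)[0]
        if table == len(table_sizes):
            table_sizes.append(1)      # open a new table
        else:
            table_sizes[table] += 1    # join an existing table
        assignments.append(table)
    return assignments, table_sizes


assignments, table_sizes = chinese_restaurant_process(100, alpha=1.0, seed=0)
print(len(table_sizes), "tables used for", len(assignments), "customers")
```

Larger values of `alpha` tend to produce more tables, while popular tables attract further customers, which gives the "rich get richer" clustering behaviour associated with this process.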

This talk is part of the Churchill CompSci Talks series.
