Semi-supervised Training of a Statistical Parser from Unlabeled Partially-bracketed Data

John Carroll - Department of Informatics, University of Sussex
Friday 15 June 2007, 15:00-16:00
SW01 Computer Laboratory.

If you have a question about this talk, please contact NLIP Seminars.

We compare the accuracy of a statistical parse ranking model trained from a fully-annotated portion of the Susanne treebank with one trained from unlabeled partially-bracketed sentences derived from this treebank and from the Penn Treebank. We demonstrate that confidence-based semi-supervised techniques similar to self-training outperform expectation maximization when both are constrained by partial bracketing. Both methods based on partially-bracketed training data outperform the fully supervised technique, and both can, in principle, be applied to any statistical parser whose output is consistent with such partial bracketing. We also explore tuning the model to a different domain and the effect of in-domain data in the semi-supervised training processes.

(This is joint work with Rebecca Watson and Ted Briscoe.)
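
As a rough illustration of the idea in the abstract, the following minimal Python sketch shows one way a parser's candidate analyses could be filtered by partial bracketing and then selected by confidence. It is not the speakers' implementation: the span representation, the function names (`crosses`, `consistent`, `select_confident`), the score normalisation, and the `threshold` value are all assumptions made for illustration.

```python
from typing import Iterable, List, Optional, Sequence, Tuple

Span = Tuple[int, int]  # half-open token span [start, end); an assumed representation


def crosses(a: Span, b: Span) -> bool:
    """True if the two spans overlap without one containing the other."""
    return a[0] < b[0] < a[1] < b[1] or b[0] < a[0] < b[1] < a[1]


def consistent(parse_spans: Iterable[Span], brackets: Sequence[Span]) -> bool:
    """A parse is consistent if none of its constituents crosses a given bracket."""
    return not any(crosses(p, b) for p in parse_spans for b in brackets)


def select_confident(
    candidates: Sequence[Tuple[List[Span], float]],
    brackets: Sequence[Span],
    threshold: float = 0.8,  # hypothetical confidence cut-off
) -> Optional[List[Span]]:
    """Return the top-scoring bracket-consistent parse if it is confident enough.

    `candidates` pairs each parse (as a list of spans) with a non-negative score.
    Confidence here is the top score divided by the total score of all
    consistent parses, which is an assumption of this sketch.
    """
    kept = [(spans, score) for spans, score in candidates if consistent(spans, brackets)]
    total = sum(score for _, score in kept)
    if total <= 0:
        return None  # no consistent parse, or no probability mass left
    best_spans, best_score = max(kept, key=lambda c: c[1])
    return best_spans if best_score / total >= threshold else None


if __name__ == "__main__":
    brackets = [(0, 2)]  # partial bracketing: tokens 0-1 form a constituent
    candidates = [
        ([(0, 2), (0, 3)], 0.6),  # respects the bracket
        ([(1, 3), (0, 3)], 0.3),  # (1, 3) crosses (0, 2), so it is discarded
        ([(0, 3)], 0.1),          # flat analysis, also consistent
    ]
    print(select_confident(candidates, brackets))
```

In a self-training-style loop, sentences for which a parse is returned would be added to the training data, while the rest would be skipped. An expectation-maximization variant would instead weight every consistent parse by its normalised score.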

This talk is part of the NLIP Seminar Series.
