
Portuguese Text Simplification for Digital Inclusion and Accessibility


If you have a question about this talk, please contact Johanna Geiss.

I will present PorSimples, a project to develop text simplification technology for the Portuguese language. We focus on syntactic simplification, which consists of breaking up complex syntactic constructions so that sentences are easier to read for people with poor reading skills. Our text simplification system has two modules: a machine learning-based module that decides when a sentence needs to be simplified, and a rule-based module that simplifies the sentences. The machine learning module collects sentence features from a corpus of manually simplified texts and decides when a simplification operation is required, so that the output text is “natural” and not oversimplified. The rule-based module executes simplification operations for the syntactic phenomena that are considered complex. I will detail both modules and present our experimental results so far.
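As a rough illustration of the two-module design described in the abstract, the sketch below pairs a decision step with a single rewriting rule. It is not the PorSimples implementation: the names `needs_simplification`, `split_coordination`, `simplify` and `MAX_WORDS` are hypothetical, the hand-set threshold merely stands in for a classifier trained on a corpus of manually simplified texts, and the one rule (splitting at a comma followed by "and" or "but") is a toy operating on English for readability.

```python
import re
from typing import List

# Hypothetical threshold; the real system learns its decision from a corpus.
MAX_WORDS = 12

# Toy pattern for one "complex" construction: clauses joined by ", and" / ", but".
COORDINATION = re.compile(r",\s+(?:and|but)\s+")


def needs_simplification(sentence: str) -> bool:
    """Stand-in for the learned decision module.

    Uses two hand-picked features (word count, presence of a coordinating
    connective) instead of a trained model.
    """
    word_count = len(sentence.split())
    has_coordination = COORDINATION.search(sentence) is not None
    return word_count > MAX_WORDS and has_coordination


def split_coordination(sentence: str) -> List[str]:
    """Toy rule: break coordinated clauses into separate sentences."""
    clauses = COORDINATION.split(sentence.rstrip("."))
    return [c[0].upper() + c[1:] + "." for c in clauses if c]


def simplify(sentences: List[str]) -> List[str]:
    """Apply the rule only where the decision step asks for it."""
    output: List[str] = []
    for sentence in sentences:
        if needs_simplification(sentence):
            output.extend(split_coordination(sentence))
        else:
            output.append(sentence)  # left untouched to avoid oversimplifying
    return output
```

Keeping the decision separate from the rules means a short sentence such as "The cat sat, and the dog barked." passes through unchanged even though the rule could apply to it, which reflects the abstract's goal of output that stays natural.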

This talk is part of the NLIP Seminar Series.


© 2006-2020 Talks.cam, University of Cambridge.