Home page
Home page

Projects

SMSCollector

DESCRIPTION

SMSCollector was a job order from a telephone multinational company. The aims of this project were the collection and the annotation of a SMS corpus both in textual and speech form.

CELCT’S ROLE:

In the first part of the project, CELCT worked on recruitment of 1,700 subjects that inserted through a web interface 120.000 SMS, for a total of 2 million tokens.
Then the collected SMS have been manually annotated with named entities and time expressions.
Finally, the Center worked on the registration of 100 speakers reading a part of this corpus and on the manual annotation of the speech files produced.

© 2009 - Celct