Computer Scientist working in the field of corpora and annotations:
formalization/constitution of corpora,
automatic annotation (mainly at the phonetic level, also at the discourse level),
multimodality (annotation, exploration, extraction of annotated data),
multilinguality (methods and algorithms).
Author and developer of SPPAS - Automatic Annotation of Speech
Tutorial scopes
This tutorial will report on methodology for the manual and/or automatic annotation and analysis of a recorded speech corpus.
We illustrate the steps to take in the perspective of:
obtaining rich and broad-coverage speech annotation
and initial analysis of such a corpus
both with a specific focus on SPPAS software.
Corpus annotation "can be defined as the practice of adding interpretative, linguistic information to an electronic corpus of spoken and/or written language data. 'Annotation' can also refer to the end-product of this process"(Leech, 1997).