The automatic annotation and analysis of speech

Linguistic resources of the supported languages

SPPAS can annotate speech automatically only if the necessary linguistic resources are available for the language; without them, annotation isn't possible. So why do some languages have resources while others don't? Simply because people have helped the SPPAS author create them, or have shared their own resources online under a license that allows redistribution. If the language you're looking for isn't on the list, why not contribute by creating and sharing the resources yourself?

This page describes the linguistic resources available for use in SPPAS. They enable five automatic annotations: normalization, phonetization, alignment, syllabification, and cued speech. Among other things, these annotations make it possible to obtain speech segments for phones and words.

Table of contents

  1. Introduction
  2. French
  3. English
  4. Mandarin
  5. Italian
  6. Spanish
  7. Catalan
  8. Polish
  9. German
  10. Portuguese
  11. Southern Min (or Min Nan)
  12. Cantonese
  13. Japanese
  14. Korean
  15. Nigerian Pidgin
  16. Bengali
  17. Persian
  18. Dutch
  19. Version history
  20. Appendix