The automatic annotation and analysis of speech


SPPAS: Scientific research software

SPPAS offers open source cross-platform, customizable automatic annotation and analysis solutions for audio and video media.


SPPAS was awarded by the French Ministry of Higher Education, Research and Innovation at the 2022 Open Source Research Software Competition.


Brigitte Bigi is the author of SPPAS: she's a computer scientist, researcher at Laboratoire Parole et Langage, Aix-en-Provence, France.


SPPAS is registered with the program protection agency under the reference IDDN.FR.001.500008.000.S.C.2024.000.31235

French Open Science Award
Honourable Mention to the Special Jury Prize

Annotate

SPPAS produces automatically annotations from a recorded audio and its orthographic transcription and/or from a video.

Analyze

SPPAS helps for the analysis of annotated files: statistics, requests, view and edit files to annotate manually.

Convert

SPPAS converts annotated files from/to a wide range of formats: xra, TextGrid, eaf, trs...

SPPAS enables easy and efficient access to a wide scope of customizable features:
annotate, analyze and manage files becomes available to everyone.

Hot topics

2023: Annotate videos in the "Edit" page

The editor page of the Graphical User Interface allows annotating manually. This interface is already offering an easy and powerful solution to modify labels of annotations. In 2023 release 4.14, it is possible to adjust boundaries of video annotations with a very high precision, in a convenient window displaying a sequence of three video frames.

SPPAS Edit Screenshot

SPPAS 4.2: The Edit page with an audio file, a video file, the manual orthographic transcription and 2 automatic annotations.

2023-2026: Cued Speech Automatic Generation




Cued speech keys generator was introduced the first time in version 3.9, August 2021. Then, a Proof of Concept (PoC) of an augmented reality system was firstly proposed in version 4.2.

The PoC was turned into a stable version in 2023. In future works, some models based on the analyses of CLeLfPC will be implemented (statistical distributions analyses, machine learning, ...).

This is part of a project funded by FIRAH.

Orthographic transcription of the video Cette vidéo est une démonstration de la génération automatique des clés LPC par le logiciel SPPAS.
Audio of the video
Result of the proof-of-concept of "Cued Speech automatic annotation" (SPPAS 4.3)

Why should you trust SPPAS?

Because...

The SPPAS software tool is reliable: the application performs the features that the documentation described. It can tolerate the user making mistakes or using the software in unexpected ways: in these situations, an error identifier with an error message is displayed. Its performances are good enough for the required uses cases, under the expected load and data volume.

SPPAS is installed on your computer: your corpus won't be transferred on the web. No statistics, no personal data are collected.

Its ongoing maintenance by the author: fixing bugs, keeping its systems operational, investigating failures, checking it on three platforms, modifying it for new use cases, adding new features and last but not least adding new language resources, updating the documentation and the website.

SPPAS is an open source package. You can edit the source code of the software tool, you can modify it, you can re-distribute it, etc.

Key Facts and Figures

  • Documentations:
    • The SPPAS documentation: 160 pages
    • The resources documentation: 60 pages
    • The XRA file format: 15 pages
    • The transcription convention: 5 pages
  • References: 30
    • 4 about the software tool itself
    • 19 about the annotations
    • 3 about the linguistic resources construction
    • 2 about the analyses
    • 2 about the data representation

How to cite?

Supports

Current ones

Past ones

Previously, SPPAS was partly supported by the following projects:

Logo ORTOLANG

ORTOLANG - Investissement d'Avenir

2012-2019

ORTOLANG receives state aid under the « Investissements d’avenir » program (ANR–11–EQPX–0032)

Logo CoFee Project

CoFee - Conversational Feedback

2013-2015

Multidimensional analyses and modeling (ANR-12-JCJC-JSH2-006-01)

Logo Variamu Project

Variamu - Variations in Action

2014-2015

a Multilingual approach (Campus France - Procore PHC)

Logo Polytechnic University of Hong-Kong Project

Adding Cantonese into SPPAS

2015-2016

In collaboration with PolyU (Campus France - Procore PHC)

Logo NaijaSynCor Project

NaijaSynCor - Common Nigerian Pidgin

2017-2021

a corpus-based study of the nature and functions of Naija in Nigeria (ANR-16-CE27-0007)

Logo VAPVISIO Project

VAPVISIO

2018-2022

the training of language trainers in online environments using videoconferencing (ANR-18-CE28-0011)