Debian Accessibility Project
Debian Accessibility Speech Synthesis

This metapackage will install packages which are useful for Speech Synthesis and related APIs or applications.

Description

If you discover a project that looks like a good candidate for Debian Accessibility, or if you have prepared an unofficial Debian package, please send a description of that project to the Debian Accessibility mailing list.

Debian Accessibility Speech Synthesis packages

Official Debian packages with high relevance

daisy-player
player for DAISY Digital Talking Books
Versions of package daisy-player
Release   Version  Architectures
bullseye  12.1-1   amd64, arm64, armhf, i386
sid       13.0-5   amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
forky     13.0-5   amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
trixie    13.0-4   amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
bookworm  13.0-4   amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
Debtags of package daisy-player:
interface: text-mode
role: program
scope: utility
sound: player
uitoolkit: ncurses
use: learning, playing
works-with: audio
works-with-format: mp3
Popcon: 5 users (4 upd.)*
License: DFSG free

Daisy-player is a command-line player for talking books based on the Digital Accessible Information System protocol. It is comparable in functionality, features, and ease of use with commercial players, and has a simple user interface appropriate for Braille terminals.

eflite
Festival-Lite based emacspeak speech server
Versions of package eflite
Release   Version   Architectures
bullseye  0.4.1-12  amd64, arm64, armhf, i386
bookworm  0.4.1-13  amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
trixie    0.4.1-13  amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
forky     0.4.1-14  amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
sid       0.4.1-14  amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
Debtags of package eflite:
accessibility: speech
role: plugin
suite: emacs
works-with: audio
Popcon: 4 users (3 upd.)*
License: DFSG free

EFlite is a speech server for Emacspeak and other screen readers that allows them to interface with Festival Lite, a free text-to-speech engine developed at the CMU Speech Center as an off-shoot of Festival.

Due to limitations inherited from its backend, EFlite currently supports only English.

espeak
Multi-lingual software speech synthesizer
Versions of package espeak
Release   Version         Architectures
bullseye  1.48.15+dfsg-2  amd64, arm64, armhf, i386
sid       1.48.15+dfsg-3  amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
forky     1.48.15+dfsg-3  amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
trixie    1.48.15+dfsg-3  amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
bookworm  1.48.15+dfsg-3  amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
Debtags of package espeak:
interface: commandline
role: program
sound: speech
works-with: audio
Popcon: 241 users (175 upd.)*
License: DFSG free

eSpeak is a software speech synthesizer for English and some other languages.

eSpeak produces good quality English speech. It uses a different synthesis method from other open source text to speech (TTS) engines, and sounds quite different. It's perhaps not as natural or "smooth", but some find the articulation clearer and easier to listen to for long periods.

It can run as a command line program to speak text from a file or from stdin.

  • Includes different Voices, whose characteristics can be altered.
  • Can produce speech output as a WAV file.
  • Can translate text to phoneme codes, so it could be adapted as a front end for another speech synthesis engine.
  • Potential for other languages. More than 40 languages are included.
  • Compact size. The program and its data total about 350 kbytes.
  • Written in C++.
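The command-line usage described above can be sketched as a small Python helper that only assembles an espeak argument list. It assumes the classic espeak options -v (voice), -w (write a WAV file), -f (read text from a file), --stdin, and -q with -x (print phoneme mnemonics without speaking). The function name and its parameters are illustrative, not part of eSpeak.

```python
import shlex


def espeak_command(text=None, *, text_file=None, voice=None,
                   wav_path=None, phonemes=False):
    """Build an argument list for the espeak CLI; nothing is executed here."""
    cmd = ["espeak"]
    if voice:
        cmd += ["-v", voice]       # select a voice, for example "en" or "de"
    if wav_path:
        cmd += ["-w", wav_path]    # write a WAV file instead of playing audio
    if phonemes:
        cmd += ["-q", "-x"]        # stay quiet and print phoneme mnemonics
    if text_file:
        cmd += ["-f", text_file]   # read the text to speak from a file
    elif text is None:
        cmd.append("--stdin")      # read the text from standard input
    else:
        cmd.append(text)           # speak the text given on the command line
    return cmd


if __name__ == "__main__":
    # Print a shell-quoted command line for inspection.
    print(shlex.join(espeak_command("Hello from Debian",
                                    voice="en", wav_path="hello.wav")))
```

On a system with the espeak package installed, the returned list can be passed to subprocess.run; building the list separately keeps the sketch usable where espeak is absent.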
flite
Small run-time speech synthesis engine
Versions of package flite
Release   Version  Architectures
bullseye  2.2-2    amd64, arm64, armhf, i386
sid       2.2-7    amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
forky     2.2-7    amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
trixie    2.2-7    amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
bookworm  2.2-5    amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
Debtags of package flite:
accessibility: speech
interface: commandline
role: program
scope: utility
works-with: audio
Popcon: 79 users (723 upd.)*
License: DFSG free

Flite is a small fast run-time speech synthesis engine. It is the latest addition to the suite of free software synthesis tools including University of Edinburgh's Festival Speech Synthesis System and Carnegie Mellon University's FestVox project, tools, scripts and documentation for building synthetic voices. However, flite itself does not require either of these systems to run.

It currently only supports the English and Indic languages.

This package contains the executables and documentation.

speech-dispatcher
Common interface to speech synthesizers
Versions of package speech-dispatcher
Release             Version           Architectures
bullseye            0.10.2-2+deb11u2  amd64, arm64, armhf, i386
trixie-backports    0.12.1-2~bpo13+1  amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
bookworm            0.11.4-2          amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
trixie              0.12.0-5          amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
bookworm-backports  0.12.0-4~bpo12+1  amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
sid                 0.12.1-2          amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
forky               0.12.1-2          amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
Debtags of package speech-dispatcher:
accessibility: speech
interface: daemon
network: server
role: program
works-with: audio
Popcon: 95640 users (16166 upd.)*
License: DFSG free

Speech Dispatcher provides a device-independent layer for speech synthesis. It supports various software and hardware speech synthesizers as backends and gives applications a generic layer for synthesizing speech and playing back PCM data through those backends.

High-level concepts such as enqueueing versus interrupting speech and application-specific user configurations are implemented in a device-independent way, so application programmers do not have to reinvent them.

This package contains Speech Dispatcher itself.

The package is enhanced by the following packages: sd-multispeech
speech-tools
Edinburgh Speech Tools - user binaries
Versions of package speech-tools
Release   Version   Architectures
forky     2.5.0-14  amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
trixie    2.5.0-13  amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
sid       2.5.0-14  amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
bullseye  2.5.0-11  amd64, arm64, armhf, i386
bookworm  2.5.0-13  amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
Debtags of package speech-tools:
accessibility: speech
field: linguistics
interface: commandline, text-mode
role: program
scope: utility
uitoolkit: ncurses
use: playing
Popcon: 6 users (1 upd.)*
License: DFSG free

This package contains utility programs that use and accompany the Edinburgh Speech Tools Library, including audio software and some basic signal processing software.

The following programs are available:

  • na_play: generic playback program for use with net_audio and CSTR ao.
  • ch_wave: waveform file conversion program.
  • ch_lab: label file conversion program.
  • ch_track: track file conversion program.
  • wagon: a CART tree build and test program.

See /usr/share/doc/speech-tools/README for a detailed list of the available programs.

Official Debian packages with lower relevance

festvox-ru
Russian male speaker for Festival
Versions of package festvox-ru
Release   Version     Architectures
bullseye  0.5+dfsg-5  all
bookworm  0.5+dfsg-6  all
trixie    0.5+dfsg-6  all
forky     0.5+dfsg-6  all
sid       0.5+dfsg-6  all
Debtags of package festvox-ru:
accessibility: speech
culture: russian
role: app-data
sound: speech
Popcon: 0 users (0 upd.)*
License: DFSG free

This package adds Russian support to the Festival speech synthesis system.

freetts
speech synthesis system
Maintainer: Bdale Garbee
Versions of package freetts
Release   Version  Architectures
bookworm  1.2.2-7  all
sid       1.2.2-8  all
forky     1.2.2-8  all
trixie    1.2.2-8  all
bullseye  1.2.2-7  all
Debtags of package freetts:
accessibility: speech
role: program
Popcon: 6 users (2 upd.)*
License: DFSG free

FreeTTS is a speech synthesis system written entirely in the Java(TM) programming language. It is based upon Flite, a small run-time speech synthesis engine developed at Carnegie Mellon University. Flite in turn is derived from the Festival Speech Synthesis System from the University of Edinburgh and the FestVox project from Carnegie Mellon University.

saytime
speaks the current time through your sound card
Maintainer: Holger Levsen
Versions of package saytime
Release   Version  Architectures
bullseye  1.0-34   amd64, arm64, armhf, i386
sid       1.0-37   amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
forky     1.0-37   amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
trixie    1.0-36   amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
bookworm  1.0-35   amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
Debtags of package saytime:
accessibility: speech
interface: commandline
role: program
scope: utility
sound: player
use: timekeeping
works-with: audio
Popcon: 11 users (8 upd.)*
License: DFSG free

Speaks the current time through your sound card. It requires an available sound output device.

sonic
simple utility to speed up or slow down speech
Versions of package sonic
Release   Version   Architectures
sid       0.2.0-13  amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
forky     0.2.0-13  amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
trixie    0.2.0-13  amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
bookworm  0.2.0-12  amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
bullseye  0.2.0-10  amd64, arm64, armhf, i386
Debtags of package sonic:
role: program
scope: utility
use: editing
works-with: audio
Popcon: 3 users (3 upd.)*
License: DFSG free

Sonic is a very simple utility that reads and writes WAV files and speeds them up or slows them down with little distortion. Its key new feature compared with other libraries is high quality at speed-up factors above 2x.

speech-dispatcher-festival
Festival support for Speech Dispatcher
Versions of package speech-dispatcher-festival
Release             Version           Architectures
bookworm-backports  0.12.0-4~bpo12+1  amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
bookworm            0.11.4-2          amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
bullseye            0.10.2-2+deb11u2  amd64, arm64, armhf, i386
trixie              0.12.0-5          amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
trixie-backports    0.12.1-2~bpo13+1  amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
forky               0.12.1-2          amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
sid                 0.12.1-2          amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
Debtags of package speech-dispatcher-festival:
accessibility: speech
role: metapackage
works-with: audio
Popcon: 2 users (0 upd.)*
License: DFSG free

Speech Dispatcher provides a device-independent layer for speech synthesis. It supports various software and hardware speech synthesizers as backends and gives applications a generic layer for synthesizing speech and playing back PCM data through those backends.

High-level concepts such as enqueueing versus interrupting speech and application-specific user configurations are implemented in a device-independent way, so application programmers do not have to reinvent them.

This package contains dependencies on packages necessary for running Speech Dispatcher with Festival.

Debian packages in contrib or non-free

libttspico-utils
Small Footprint TTS (binaries)
Versions of package libttspico-utils
Release   Version                             Architectures
bullseye  1.0+git20130326-11 (non-free)       amd64, arm64, armhf, i386
bookworm  1.0+git20130326-13 (non-free)       amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
trixie    1.0+git20130326-14.1 (non-free)     amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
forky     1.0+git20130326-14.1 (non-free)     amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
sid       1.0+git20130326-14.1 (non-free)     amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
Debtags of package libttspico-utils:
role: program
Popcon: 36 users (17 upd.)*
License: non-free

The SVOX Pico engine is a software speech synthesizer for German, English (GB and US), Spanish, French and Italian.

SVOX produces a clear and distinct speech output made possible by the use of Hidden Markov Model (HMM) algorithms.

This package contains binary files including pico2wave.

mbrola
Multilingual software speech synthesizer
Maintainer: Samuel Thibault
Versions of package mbrola
Release   Version                       Architectures
forky     3.3+dfsg-9 (contrib)          amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
trixie    3.3+dfsg-9 (contrib)          amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
bookworm  3.3+dfsg-9 (contrib)          amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
bullseye  3.3+dfsg-4+deb11u1 (contrib)  amd64, arm64, armhf, i386
sid       3.3+dfsg-9 (contrib)          amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
Debtags of package mbrola:
role: program
sound: speech
Popcon: 194 users (19 upd.)*
License: DFSG free, but needs non-free components

Mbrola is Thierry Dutoit's phonemizer for multilingual speech synthesis. The various diphone databases are distributed in separate packages, but for licensing reasons they may be used only with Mbrola. See the copyright file for details.

Mbrola itself doesn't provide full TTS. It is a speech synthesizer based on the concatenation of diphones. It takes a list of phonemes as input, together with prosodic information (duration of phonemes and a piecewise linear description of pitch), and produces speech samples on 16 bits (linear), at the sampling frequency of the diphone database.
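The input described above can be illustrated with a short Python sketch that formats phoneme lines. It assumes the plain-text layout described in MBROLA's documentation: one phoneme per line, followed by its duration in milliseconds and optional pairs of pitch points (position in percent of the duration, pitch in Hz). The function names and the sample phoneme symbols are hypothetical; the valid symbols depend on the diphone database in use.

```python
def pho_line(phoneme, duration_ms, pitch_points=()):
    """Format one phoneme line: name, duration, then optional pitch points.

    pitch_points is a sequence of (position_percent, pitch_hz) pairs that
    together give a piecewise linear description of pitch.
    """
    fields = [phoneme, str(duration_ms)]
    for position, pitch in pitch_points:
        fields += [str(position), str(pitch)]
    return " ".join(fields)


def pho_document(segments):
    """Join several segments into one newline-terminated text block."""
    return "\n".join(pho_line(*segment) for segment in segments) + "\n"


if __name__ == "__main__":
    # Illustrative symbols only; "_" is used here as a silence marker.
    segments = [
        ("_", 100),
        ("h", 60),
        ("@", 80, [(50, 120)]),
        ("l", 70),
        ("@U", 200, [(0, 120), (100, 90)]),
        ("_", 100),
    ]
    print(pho_document(segments), end="")
```

A front end such as espeak or cicero would produce this kind of phoneme list from text; the sketch only shows the shape of the data Mbrola consumes.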

Use Mbrola together with Freephone, cicero or espeak to get a complete text-to-speech system for English.

Packaging has started and developers might try the packaging code in VCS

cicero
French and English Text-To-Speech for MBROLA
Versions of package cicero
Release  Version  Architectures
VCS      0.7.2-7  all
License: free
Debian package not available
Version: 0.7.2-7

This Text-To-Speech (TTS) engine speaks French; preliminary English support is also offered. The engine uses context-sensitive rules to produce phonemes from the text. It relies on MBROLA to generate actual audio output from the phonemes. The TTS engine is implemented in Python.

The upstream authors wrote this TTS engine to meet their own needs as blind users. It is designed to serve as the speech output of screen-review software, BRLTTY first of all. They favor speed and intelligibility over perfect pronunciation. Cicero aims for a quick response time, the ability to stop speaking immediately and skip to another utterance, intelligibility where it counts (not perfect pronunciation), the ability to track speech progression, relative simplicity (hackability) and relatively small code size.

gespeaker
GTK+ front-end for eSpeak and mbrola
Responsible: Fabio Castelli (Sandro Tosi)
Versions of package gespeaker
Release  Version  Architectures
VCS      0.8.6-2  all
License: GPL-2+
Debian package not available
Version: 0.8.6-2

Gespeaker is a GTK+ frontend for eSpeak and mbrola. It reads text aloud in many languages, with settings for voice, pitch, volume, speed and word gap.

Since version 0.6 it can use the mbrola package and its voices for a more realistic reading experience.

*Popularity contest (popcon) results: number of people who use this package regularly (number of people who upgraded this package recently), out of 269205.