Summary
Speech Synthesis
Debian Accessibility Speech Synthesis
This metapackage will install packages which are useful for
Speech Synthesis and related APIs or applications.
Description
For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:
If you discover a project which looks like a good candidate for Debian Accessibility
to you, or if you have prepared an unofficial Debian package, please do not hesitate to
send a description of that project to the Debian Accessibility mailing list
Links to other tasks
|
Debian Accessibility Speech Synthesis packages
Official Debian packages with high relevance
daisy-player
player for DAISY Digital Talking Books
|
Versions of package daisy-player |
Release | Version | Architectures |
buster | 11.6.2.1-2 | amd64,arm64,armhf,i386 |
stretch | 10.3-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 13.0-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 13.0-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 12.1-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 13.0-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 9.0.0-1 | amd64,armel,armhf,i386 |
Debtags of package daisy-player: |
interface | text-mode |
role | program |
scope | utility |
sound | player |
uitoolkit | ncurses |
use | learning, playing |
works-with | audio |
works-with-format | mp3 |
|
License: DFSG free
|
Daisy-player is a command-line player for talking books based on the
Digital Accessible Information System protocol. It is comparable in
functionality, features, and ease of use with commercial players, and
has a simple user interface appropriate for Braille terminals.
|
|
eflite
Festival-Lite based emacspeak speech server
|
Versions of package eflite |
Release | Version | Architectures |
bullseye | 0.4.1-12 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.4.1-13 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.4.1-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.4.1-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 0.4.1-6 | amd64,armel,armhf,i386 |
stretch | 0.4.1-8 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.4.1-9 | amd64,arm64,armhf,i386 |
Debtags of package eflite: |
accessibility | speech |
role | plugin |
suite | emacs |
works-with | audio |
|
License: DFSG free
|
EFlite is a speech server for Emacspeak and other screen readers that
allows them to interface with Festival Lite, a free text-to-speech
engine developed at the CMU Speech Center as an off-shoot of Festival.
Due to limitations inherited from its backend, EFlite does only provide
support for the English language at the moment.
|
|
espeak
Multi-lingual software speech synthesizer
|
Versions of package espeak |
Release | Version | Architectures |
buster | 1.48.04+dfsg-7+deb10u1 | amd64,arm64,armhf,i386 |
bullseye | 1.48.15+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.48.15+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.48.15+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.48.15+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 1.48.04+dfsg-1 | amd64,armel,armhf,i386 |
stretch | 1.48.04+dfsg-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package espeak: |
interface | commandline |
role | program |
sound | speech |
works-with | audio |
|
License: DFSG free
|
eSpeak is a software speech synthesizer for English, and some other
languages.
eSpeak produces good quality English speech. It uses a different synthesis
method from other open source text to speech (TTS) engines, and sounds quite
different. It's perhaps not as natural or "smooth", but some find the
articulation clearer and easier to listen to for long periods.
It can run as a command line program to speak text from a file or from stdin.
- Includes different Voices, whose characteristics can be altered.
- Can produce speech output as a WAV file.
- Can translate text to phoneme codes, so it could be adapted as a front end
for another speech synthesis engine.
- Potential for other languages. More than 40 languages are included.
- Compact size. The program and its data total about 350 kbytes.
- Written in C++.
|
|
flite
Small run-time speech synthesis engine
|
Versions of package flite |
Release | Version | Architectures |
bullseye | 2.2-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 1.4-release-12 | amd64,armel,armhf,i386 |
sid | 2.2-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 2.0.0-release-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 2.2-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 2.2-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 2.1-release-3 | amd64,arm64,armhf,i386 |
Debtags of package flite: |
accessibility | speech |
interface | commandline |
role | program |
scope | utility |
works-with | audio |
|
License: DFSG free
|
Flite is a small fast run-time speech synthesis engine. It is the
latest addition to the suite of free software synthesis tools
including University of Edinburgh's Festival Speech Synthesis System
and Carnegie Mellon University's FestVox project, tools, scripts and
documentation for building synthetic voices. However, flite itself
does not require either of these systems to run.
It currently only supports the English and Indic languages.
This package contains the executables and documentation.
|
|
speech-dispatcher
Common interface to speech synthesizers
|
Versions of package speech-dispatcher |
Release | Version | Architectures |
bookworm | 0.11.4-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.9.0-5+deb10u1 | amd64,arm64,armhf,i386 |
bookworm-backports | 0.11.5-4~bpo12+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.11.5-5.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 0.8.6-4+deb9u1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.8-7 | amd64,armel,armhf,i386 |
bullseye-backports | 0.11.4-2~bpo11+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
experimental | 0.12.0~rc4-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.10.2-2+deb11u2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.11.5-5.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package speech-dispatcher: |
accessibility | speech |
interface | daemon |
network | server |
role | program |
works-with | audio |
|
License: DFSG free
|
Speech Dispatcher provides a device independent layer for speech synthesis.
It supports various software and hardware speech synthesizers as
backends and provides a generic layer for synthesizing speech and
playing back PCM data via those different backends to applications.
Various high level concepts like enqueueing vs. interrupting speech and
application specific user configurations are implemented in a device
independent way, therefore freeing the application programmer from
having to yet again reinvent the wheel.
This package contains Speech Dispatcher itself.
|
|
speech-tools
Edinburgh Speech Tools - user binaries
|
Versions of package speech-tools |
Release | Version | Architectures |
sid | 2.5.0-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 2.5.0-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 2.5.0-13 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.5.0-11 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 2.5.0-5 | amd64,arm64,armhf,i386 |
stretch | 2.4~release-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 2.1~release-8 | amd64,armel,armhf,i386 |
Debtags of package speech-tools: |
accessibility | speech |
field | linguistics |
interface | commandline, text-mode |
role | program |
scope | utility |
uitoolkit | ncurses |
use | playing |
|
License: DFSG free
|
This package contains the various highly useful utility programs that use and
accompany the Edinburgh Speech Tools Library. Audio software and some basic
signal processing software is included in this package.
The following programs are available:
na_play: generic playback program for use with net_audio and CSTR ao.
ch_wave: Waveform file conversion program.
ch_lab: label file conversion program.
ch_track: Track file conversion program.
wagon: a CART tree build and test program
See /usr/share/doc/speech-tools/README for detail list of programs available.
|
|
Official Debian packages with lower relevance
festvox-ru
Russian male speaker for Festival
|
Versions of package festvox-ru |
Release | Version | Architectures |
bookworm | 0.5+dfsg-6 | all |
trixie | 0.5+dfsg-6 | all |
sid | 0.5+dfsg-6 | all |
jessie | 0.5+dfsg-3 | all |
stretch | 0.5+dfsg-3 | all |
buster | 0.5+dfsg-4 | all |
bullseye | 0.5+dfsg-5 | all |
Debtags of package festvox-ru: |
accessibility | speech |
culture | russian |
role | app-data |
sound | speech |
|
License: DFSG free
|
This package provides Russian support to Festival speech
synthesis system.
|
|
freetts
|
Versions of package freetts |
Release | Version | Architectures |
stretch | 1.2.2-3 | all |
sid | 1.2.2-7 | all |
trixie | 1.2.2-7 | all |
bookworm | 1.2.2-7 | all |
bullseye | 1.2.2-7 | all |
buster | 1.2.2-6 | all |
jessie | 1.2.2-3 | all |
Debtags of package freetts: |
accessibility | speech |
role | program |
|
License: DFSG free
|
FreeTTS is a speech synthesis system written entirely in the Java(TM)
programming language. It is based upon Flite, a small run-time speech
synthesis engine developed at Carnegie Mellon University. Flite in turn
is derived from the Festival Speech Synthesis System from the University
of Edinburgh and the FestVox project from Carnegie Mellon University.
|
|
gespeaker
GTK+ front-end for eSpeak and mbrola
|
Versions of package gespeaker |
Release | Version | Architectures |
stretch | 0.8.6-1 | all |
buster | 0.8.6-1 | all |
jessie | 0.8.5-1 | all |
Debtags of package gespeaker: |
accessibility | speech |
interface | x11 |
role | program |
scope | application |
sound | speech |
uitoolkit | gtk |
use | entertaining |
works-with | audio, text |
x11 | application |
|
License: DFSG free
|
Gespeaker is a GTK+ frontend for eSpeak and mbrola.
It allows one to play a text in many languages with settings
for voice, pitch, volume, speed and word gap.
Since version 0.6 it can use mbrola package and voices to
obtain a more realistic text reading experience.
|
|
recite
??? missing short description for package recite :-(
|
Versions of package recite |
Release | Version | Architectures |
jessie | 1.0-8.2 | amd64,armel,armhf,i386 |
Debtags of package recite: |
accessibility | speech |
interface | commandline |
role | program |
scope | utility |
works-with | text |
|
License: DFSG free
|
|
|
saytime
|
Versions of package saytime |
Release | Version | Architectures |
sid | 1.0-35 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.0-35 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 1.0-26 | amd64,armel,armhf,i386 |
stretch | 1.0-27 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 1.0-30 | amd64,arm64,armhf,i386 |
bullseye | 1.0-34 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.0-35 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package saytime: |
accessibility | speech |
interface | commandline |
role | program |
scope | utility |
sound | player |
use | timekeeping |
works-with | audio |
|
License: DFSG free
|
使用你的声卡说出当前时间。需要系统有可用的声音输出设备。
|
|
sonic
Simple utility to speed up or slow down speech
|
Versions of package sonic |
Release | Version | Architectures |
buster | 0.2.0-7 | amd64,arm64,armhf,i386 |
stretch | 0.2.0-4 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.1.17-1.1 | amd64,armel,armhf,i386 |
sid | 0.2.0-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.2.0-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.2.0-12 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.2.0-10 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package sonic: |
role | program |
scope | utility |
use | editing |
works-with | audio |
|
License: DFSG free
|
Sonic is a very simple utility that reads and writes wav files,
and speeds them up or slows them down, with low distortion.
The key new feature in Sonic versus other libraries is very
high quality at speed up factors well over 2X.
|
|
speech-dispatcher-festival
Festival support for Speech Dispatcher
|
Versions of package speech-dispatcher-festival |
Release | Version | Architectures |
bookworm | 0.11.4-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
experimental | 0.12.0~rc4-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.11.5-5.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.11.5-5.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm-backports | 0.11.5-4~bpo12+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye-backports | 0.11.4-2~bpo11+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.10.2-2+deb11u2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.9.0-5+deb10u1 | amd64,arm64,armhf,i386 |
stretch | 0.8.6-4+deb9u1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.8-7 | amd64,armel,armhf,i386 |
Debtags of package speech-dispatcher-festival: |
accessibility | speech |
role | metapackage |
works-with | audio |
|
License: DFSG free
|
Speech Dispatcher provides a device independent layer for speech synthesis.
It supports various software and hardware speech synthesizers as
backends and provides a generic layer for synthesizing speech and
playing back PCM data via those different backends to applications.
Various high level concepts like enqueueing vs. interrupting speech and
application specific user configurations are implemented in a device
independent way, therefore freeing the application programmer from
having to yet again reinvent the wheel.
This package contains dependencies on packages necessary for running Speech
Dispatcher with Festival.
|
|
Debian packages in contrib or non-free
cicero
French and English Text-To-Speech for MBROLA
|
Versions of package cicero |
Release | Version | Architectures |
buster | 0.7.2-4 (contrib) | all |
stretch | 0.7.2-3 (contrib) | all |
jessie | 0.7.2-3 (contrib) | all |
Debtags of package cicero: |
accessibility | speech |
culture | british, french |
role | program |
|
License: DFSG free, but needs non-free components
|
This Text-To-Speech (TTS) engine speaks French; a preliminary English support
is also offered.
The engine uses context-sensitive rules to produce phonemes from the text. It
relies on MBROLA to generate actual audio output from the phonemes. The TTS
engine is implemented using the Python programming language.
The upstream authors have come up with this TTS to try and meet their own needs
as blind users.
It's designed to be plugged as output to some screen-review software, firstly
with BRLTTY.
They favor speed and intelligibility over perfect pronunciation.
Cicero is aimed to have a quick response time, the ability to quickly shut-up
and skip to another utterance, intelligibility where it counts (not perfect
pronunciation), the ability to track speech progression, relative simplicity
(hackability) and relative small code size.
|
gnome-speech-dectalk
??? missing short description for package gnome-speech-dectalk :-(
|
Versions of package gnome-speech-dectalk |
Release | Version | Architectures |
jessie | 0.4.25-5 (contrib) | i386 |
stretch | 0.4.25-6 (contrib) | i386 |
Debtags of package gnome-speech-dectalk: |
accessibility | speech |
|
License: DFSG free, but needs non-free components
|
|
gnome-speech-ibmtts
??? missing short description for package gnome-speech-ibmtts :-(
|
Versions of package gnome-speech-ibmtts |
Release | Version | Architectures |
jessie | 0.4.25-5 (contrib) | i386 |
stretch | 0.4.25-6 (contrib) | i386 |
Debtags of package gnome-speech-ibmtts: |
accessibility | speech |
|
License: DFSG free, but needs non-free components
|
|
gnome-speech-swift
??? missing short description for package gnome-speech-swift :-(
|
Versions of package gnome-speech-swift |
Release | Version | Architectures |
stretch | 0.4.25-6 (contrib) | amd64,i386 |
jessie | 0.4.25-5 (contrib) | amd64,i386 |
Debtags of package gnome-speech-swift: |
accessibility | speech |
|
License: DFSG free, but needs non-free components
|
|
libttspico-utils
Small Footprint TTS (binaries)
|
Versions of package libttspico-utils |
Release | Version | Architectures |
stretch | 1.0+git20130326-5 (non-free) | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 1.0+git20130326-3 (non-free) | amd64,armel,armhf,i386 |
buster | 1.0+git20130326-9 (non-free) | amd64,arm64,armhf,i386 |
bullseye | 1.0+git20130326-11 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.0+git20130326-13 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.0+git20130326-14.1 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.0+git20130326-14.1 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package libttspico-utils: |
role | program |
|
License: non-free
|
The SVOX Pico engine is a software speech synthesizer for German, English (GB
and US), Spanish, French and Italian.
SVOX produces a clear and distinct speech output made possible by the use of
Hidden Markov Model (HMM) algorithms.
This package contains binary files including pico2wave.
|
mbrola
Multilingual software speech synthesizer
|
Versions of package mbrola |
Release | Version | Architectures |
stretch | 3.01h+2-3 (non-free) | amd64,armel,armhf,i386 |
jessie | 3.01h+1-2 (non-free) | amd64,armel,i386 |
buster | 3.02b+dfsg-4 (contrib) | amd64,arm64,armhf,i386 |
buster-backports | 3.3+dfsg-4+deb11u1~bpo10+1 (contrib) | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 3.3+dfsg-4+deb11u1 (contrib) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 3.3+dfsg-9 (contrib) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 3.3+dfsg-9 (contrib) | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 3.3+dfsg-9 (contrib) | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package mbrola: |
role | program |
sound | speech |
|
License: DFSG free, but needs non-free components
|
Mbrola is Thierry Dutoit's phonemizer for multilingual speech synthesis. The
various diphone databases are distributed on separate packages, but they
must be used with and only with Mbrola because of license matters. Read the
copyright for details.
Mbrola itself doesn't provide full TTS. It is a speech synthesizer based on
the concatenation of diphones. It takes a list of phonemes as input,
together with prosodic information (duration of phonemes and a piecewise linear
description of pitch), and produces speech samples on 16 bits (linear),
at the sampling frequency of the diphone database.
Use Mbrola along with Freephone, cicero or espeak to have a complete
text-to-speech in English.
|
|