Debian Accessibility Project
Summary
Speech Synthesis
Debian Accessibility Speech Synthesis

This metapackage will install packages which are useful for Speech Synthesis and related APIs or applications.

Description

For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:

If you discover a project which looks like a good candidate for Debian Accessibility to you, or if you have prepared an unofficial Debian package, please do not hesitate to send a description of that project to the Debian Accessibility mailing list

Links to other tasks

Debian Accessibility Speech Synthesis packages

Official Debian packages with high relevance

daisy-player
player for DAISY Digital Talking Books
Versions of package daisy-player
ReleaseVersionArchitectures
buster11.6.2.1-2amd64,arm64,armhf,i386
stretch10.3-3amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
trixie13.0-4amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
bookworm13.0-4amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye12.1-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid13.0-4amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
jessie9.0.0-1amd64,armel,armhf,i386
Debtags of package daisy-player:
interfacetext-mode
roleprogram
scopeutility
soundplayer
uitoolkitncurses
uselearning, playing
works-withaudio
works-with-formatmp3
Popcon: 4 users (12 upd.)*
Versions and Archs
License: DFSG free
Git

Daisy-player is a command-line player for talking books based on the Digital Accessible Information System protocol. It is comparable in functionality, features, and ease of use with commercial players, and has a simple user interface appropriate for Braille terminals.

Screenshots of package daisy-player
eflite
Festival-Lite based emacspeak speech server
Versions of package eflite
ReleaseVersionArchitectures
bullseye0.4.1-12amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm0.4.1-13amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.4.1-13amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
sid0.4.1-13amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
jessie0.4.1-6amd64,armel,armhf,i386
stretch0.4.1-8amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
buster0.4.1-9amd64,arm64,armhf,i386
Debtags of package eflite:
accessibilityspeech
roleplugin
suiteemacs
works-withaudio
Popcon: 4 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

EFlite is a speech server for Emacspeak and other screen readers that allows them to interface with Festival Lite, a free text-to-speech engine developed at the CMU Speech Center as an off-shoot of Festival.

Due to limitations inherited from its backend, EFlite does only provide support for the English language at the moment.

espeak
다중 언어 음성 합성 소프트웨어
Versions of package espeak
ReleaseVersionArchitectures
buster1.48.04+dfsg-7+deb10u1amd64,arm64,armhf,i386
bullseye1.48.15+dfsg-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm1.48.15+dfsg-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid1.48.15+dfsg-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie1.48.15+dfsg-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
jessie1.48.04+dfsg-1amd64,armel,armhf,i386
stretch1.48.04+dfsg-5amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
Debtags of package espeak:
interfacecommandline
roleprogram
soundspeech
works-withaudio
Popcon: 242 users (105 upd.)*
Versions and Archs
License: DFSG free
Git

eSpeak은 영어 및 기타 다른 언어를 위한 음성 합성 소프트웨어 입니다.

eSpeak는 좋은 품질에 영어 음성을 만듭니다. 이 프로그램은 서로 다른 오픈소스 text to speech (TTS) 엔진에서 다른 합성 방법을 사용하며, 그리고 전혀 다른 소리가 납니다. 아마도 자연스럽거나 부드럽지는 않겠지만, 그러나 일부는 오랬 동안 듣고 있으면 발음이 명료하고 쉽게 들리는 것을 찾을 수 있습니다.

eSpeak은 파일 또는 표준 입력으로 text를 말하기위해 명령행 프로그램으로 실행할 수도 있습니다.

  • 특성을 변경할 수 있도록, 다른 음색을 포함합니다.
  • WAV 파일로 음성 출력을 만들 수 있습니다.
  • 텍스트를 음소 코드로 번역할 수 있으며, 그래서 다른 음성 합성 엔진에 대한 프론트앤드로 구성될 수도 있습니다.
  • 다른 언어에 대한 잠재성. 40 이상의 언어를 포함합니다.
  • 작은 사이즈. 프로그램과 데이타 총 350kbytes 정도입니다.
  • C++로 개발.
flite
Small run-time speech synthesis engine
Versions of package flite
ReleaseVersionArchitectures
bullseye2.2-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
jessie1.4-release-12amd64,armel,armhf,i386
sid2.2-6amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
stretch2.0.0-release-3amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
trixie2.2-6amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
bookworm2.2-5amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
buster2.1-release-3amd64,arm64,armhf,i386
Debtags of package flite:
accessibilityspeech
interfacecommandline
roleprogram
scopeutility
works-withaudio
Popcon: 27 users (73 upd.)*
Versions and Archs
License: DFSG free
Git

Flite is a small fast run-time speech synthesis engine. It is the latest addition to the suite of free software synthesis tools including University of Edinburgh's Festival Speech Synthesis System and Carnegie Mellon University's FestVox project, tools, scripts and documentation for building synthetic voices. However, flite itself does not require either of these systems to run.

It currently only supports the English and Indic languages.

This package contains the executables and documentation.

speech-dispatcher
Common interface to speech synthesizers
Versions of package speech-dispatcher
ReleaseVersionArchitectures
trixie0.11.5-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
experimental0.12.0~rc2-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid0.11.5-4amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm-backports0.11.5-1~bpo12+1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm0.11.4-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye-backports0.11.4-2~bpo11+1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye0.10.2-2+deb11u2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
buster0.9.0-5+deb10u1amd64,arm64,armhf,i386
stretch0.8.6-4+deb9u1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
jessie0.8-7amd64,armel,armhf,i386
Debtags of package speech-dispatcher:
accessibilityspeech
interfacedaemon
networkserver
roleprogram
works-withaudio
Popcon: 78693 users (12441 upd.)*
Versions and Archs
License: DFSG free
Git

Speech Dispatcher provides a device independent layer for speech synthesis. It supports various software and hardware speech synthesizers as backends and provides a generic layer for synthesizing speech and playing back PCM data via those different backends to applications.

Various high level concepts like enqueueing vs. interrupting speech and application specific user configurations are implemented in a device independent way, therefore freeing the application programmer from having to yet again reinvent the wheel.

This package contains Speech Dispatcher itself.

speech-tools
Edinburgh Speech Tools - user binaries
Versions of package speech-tools
ReleaseVersionArchitectures
sid2.5.0-13amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie2.5.0-13amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
bookworm2.5.0-13amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye2.5.0-11amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
buster2.5.0-5amd64,arm64,armhf,i386
stretch2.4~release-5amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
jessie2.1~release-8amd64,armel,armhf,i386
Debtags of package speech-tools:
accessibilityspeech
fieldlinguistics
interfacecommandline, text-mode
roleprogram
scopeutility
uitoolkitncurses
useplaying
Popcon: 9 users (3 upd.)*
Versions and Archs
License: DFSG free
Git

This package contains the various highly useful utility programs that use and accompany the Edinburgh Speech Tools Library. Audio software and some basic signal processing software is included in this package.

The following programs are available: na_play: generic playback program for use with net_audio and CSTR ao. ch_wave: Waveform file conversion program. ch_lab: label file conversion program. ch_track: Track file conversion program. wagon: a CART tree build and test program

See /usr/share/doc/speech-tools/README for detail list of programs available.

Official Debian packages with lower relevance

festvox-ru
Russian male speaker for Festival
Versions of package festvox-ru
ReleaseVersionArchitectures
bookworm0.5+dfsg-6all
trixie0.5+dfsg-6all
sid0.5+dfsg-6all
jessie0.5+dfsg-3all
stretch0.5+dfsg-3all
buster0.5+dfsg-4all
bullseye0.5+dfsg-5all
Debtags of package festvox-ru:
accessibilityspeech
culturerussian
roleapp-data
soundspeech
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

This package provides Russian support to Festival speech synthesis system.

freetts
음성 합성 시스템
Maintainer: Bdale Garbee
Versions of package freetts
ReleaseVersionArchitectures
stretch1.2.2-3all
sid1.2.2-7all
trixie1.2.2-7all
bookworm1.2.2-7all
bullseye1.2.2-7all
buster1.2.2-6all
jessie1.2.2-3all
Debtags of package freetts:
accessibilityspeech
roleprogram
Popcon: 7 users (4 upd.)*
Versions and Archs
License: DFSG free
Git

FreeTTS는 전체를 Java(TM) 프로그래밍 언어로 개발한 음성 합성 시스템입니다. 이는 Carnegie Mellon 대학에서 개발한 런타임 음성 합성 엔진인 Flite를 기반으로 합니다. Flite는 Edinburgh 대학의 Festival Speech Synthesis System과 Carnegie Mellon 대학의 FestVox 프로젝트에서 파생되었습니다.

gespeaker
GTK+ front-end for eSpeak and mbrola
Maintainer: Fabio Castelli
Versions of package gespeaker
ReleaseVersionArchitectures
jessie0.8.5-1all
stretch0.8.6-1all
buster0.8.6-1all
Debtags of package gespeaker:
accessibilityspeech
interfacex11
roleprogram
scopeapplication
soundspeech
uitoolkitgtk
useentertaining
works-withaudio, text
x11application
Popcon: 3 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

Gespeaker is a GTK+ frontend for eSpeak and mbrola. It allows one to play a text in many languages with settings for voice, pitch, volume, speed and word gap.

Since version 0.6 it can use mbrola package and voices to obtain a more realistic text reading experience.

recite
English text speech synthesizer
Versions of package recite
ReleaseVersionArchitectures
jessie1.0-8.2amd64,armel,armhf,i386
Debtags of package recite:
accessibilityspeech
interfacecommandline
roleprogram
scopeutility
works-withtext
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free

Recite is a program to do speech synthesis. The quality of sound produced is not terribly good, but it should be adequate for reporting the occasional error message verbally.

Given some English text, recite will convert it to a series of phonemes, then convert the phonemes to a sequence of vocal tract parameters, and then synthesise the sound a vocal tract would make to say the sentence. Recite can perform a subset of these operations, so it can be used to convert text into phonemes, or to produce an utterance based on vocal tract parameters computed by another program.

saytime
사운드 카드를 통해서 현재 시간을 알려줌
Maintainer: Holger Levsen
Versions of package saytime
ReleaseVersionArchitectures
jessie1.0-26amd64,armel,armhf,i386
bookworm1.0-35amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie1.0-35amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
sid1.0-35amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
buster1.0-30amd64,arm64,armhf,i386
stretch1.0-27amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
bullseye1.0-34amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Debtags of package saytime:
accessibilityspeech
interfacecommandline
roleprogram
scopeutility
soundplayer
usetimekeeping
works-withaudio
Popcon: 7 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

사용자의 사운드 카드를 통해서 현재 시간을 알려줍니다. 사운드 출력 장치가 있 어야 합니다.

Screenshots of package saytime
sonic
말하기 속도를 높이거나 낮추는 간단한 유틸리티
Versions of package sonic
ReleaseVersionArchitectures
trixie0.2.0-13amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
buster0.2.0-7amd64,arm64,armhf,i386
bullseye0.2.0-10amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm0.2.0-12amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
stretch0.2.0-4amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
sid0.2.0-13amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
jessie0.1.17-1.1amd64,armel,armhf,i386
Debtags of package sonic:
roleprogram
scopeutility
useediting
works-withaudio
Popcon: 6 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

Sonic은 wav 파일을 읽고 쓰는 매우 간단한 유틸리티로 적은 왜곡으로 파일 속도를 높이거나 낮춥니다. 다른 라이브러리와 비교해서 Sonic에 새로운 주요 기능으로 재생 속도를 2배 이상으로 높여도 퀄리티가 매우 높다는 것 입니다.

Screenshots of package sonic
speech-dispatcher-festival
스피치 디스패처용 페스티벌 지원
Versions of package speech-dispatcher-festival
ReleaseVersionArchitectures
experimental0.12.0~rc2-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
buster0.9.0-5+deb10u1amd64,arm64,armhf,i386
bullseye0.10.2-2+deb11u2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye-backports0.11.4-2~bpo11+1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm0.11.4-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm-backports0.11.5-1~bpo12+1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.11.5-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
sid0.11.5-4amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
jessie0.8-7amd64,armel,armhf,i386
stretch0.8.6-4+deb9u1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
Debtags of package speech-dispatcher-festival:
accessibilityspeech
rolemetapackage
works-withaudio
Popcon: 3 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

스피치 디스패처는 음성 합성을 위한 장치 독립 계층을 제공합니다. 다양한 소프트웨어 및 하드웨어 음성 신디사이저를 백엔드로 지원하며 어플리케이션에 대한 다양한 백엔드를 통해 음성을 합성하고 PCM 데이터를 재생하기 위한 일반 계층을 지원합니다.

큐잉 대 음성 인터럽트, 그리고 어플리케이션별 사용자 구성과 같은 다양한 고급 개념이 장치 독립적인 방식으로 구현되므로 어플리케이션 프로그래머가 바퀴를 두 번 만드는 일은 필요치 않습니다.

이 패키지는 페스트벌과 함께 음성 디스패처를 실행하는데 필요한 패키지에 대한 의존성을 포함합니다.

Debian packages in contrib or non-free

cicero
French and English Text-To-Speech for MBROLA
Versions of package cicero
ReleaseVersionArchitectures
buster0.7.2-4 (contrib)all
stretch0.7.2-3 (contrib)all
jessie0.7.2-3 (contrib)all
Debtags of package cicero:
accessibilityspeech
culturebritish, french
roleprogram
Popcon: 3 users (0 upd.)*
Versions and Archs
License: DFSG free, but needs non-free components
Git

This Text-To-Speech (TTS) engine speaks French; a preliminary English support is also offered. The engine uses context-sensitive rules to produce phonemes from the text. It relies on MBROLA to generate actual audio output from the phonemes. The TTS engine is implemented using the Python programming language.

The upstream authors have come up with this TTS to try and meet their own needs as blind users. It's designed to be plugged as output to some screen-review software, firstly with BRLTTY. They favor speed and intelligibility over perfect pronunciation. Cicero is aimed to have a quick response time, the ability to quickly shut-up and skip to another utterance, intelligibility where it counts (not perfect pronunciation), the ability to track speech progression, relative simplicity (hackability) and relative small code size.

gnome-speech-dectalk
GNOME text-to-speech library (Fonix DECtalk engine support)
Versions of package gnome-speech-dectalk
ReleaseVersionArchitectures
jessie0.4.25-5 (contrib)i386
stretch0.4.25-6 (contrib)i386
Debtags of package gnome-speech-dectalk:
accessibilityspeech
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free, but needs non-free components
Svn

The GNOME Speech library gives a simple yet general API for programs to convert text into speech, as well as speech input.

This package provides the source code required to compile a driver for the commercial DECtalk software speech synthesis engine and voices from Fonix (http://www.fonix.com/). Upon installation, it will automatically attempt to compile and install the dectalk-synthesis-driver binary required to use GNOME Speech with dectalk.

This package is only useful if the dectalk engine is already installed on the system.

gnome-speech-ibmtts
GNOME text-to-speech library (IBMTTS engine support)
Versions of package gnome-speech-ibmtts
ReleaseVersionArchitectures
stretch0.4.25-6 (contrib)i386
jessie0.4.25-5 (contrib)i386
Debtags of package gnome-speech-ibmtts:
accessibilityspeech
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free, but needs non-free components
Svn

The GNOME Speech library gives a simple yet general API for programs to convert text into speech, as well as speech input.

This package provides the source code required to compile a driver for the commercial IBMTTS speech synthesis engine available from http://ttsynth.com/. Upon installation, it will automatically attempt to compile and install the voiavoice-synthesis-driver binary required to use GNOME Speech with IBMTTS.

This package is only useful if the IBMTTS (TTSynth) engine is already installed on the system.

gnome-speech-swift
GNOME text-to-speech library (Cepstral swift engine support)
Versions of package gnome-speech-swift
ReleaseVersionArchitectures
jessie0.4.25-5 (contrib)amd64,i386
stretch0.4.25-6 (contrib)amd64,i386
Debtags of package gnome-speech-swift:
accessibilityspeech
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free, but needs non-free components
Svn

The GNOME Speech library gives a simple yet general API for programs to convert text into speech, as well as speech input.

This package provides the source code required to compile a driver for the commercial swift speech synthesis engine and voices from Cepstral (http://www.cepstral.com/). Upon installation, it will automatically attempt to compile and install the swift-synthesis-driver binary required to use GNOME Speech with swift.

This package is only useful if the swift engine is already installed on the system.

libttspico-utils
Small Footprint TTS (binaries)
Versions of package libttspico-utils
ReleaseVersionArchitectures
stretch1.0+git20130326-5 (non-free)amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
buster1.0+git20130326-9 (non-free)amd64,arm64,armhf,i386
bullseye1.0+git20130326-11 (non-free)amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm1.0+git20130326-13 (non-free)amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie1.0+git20130326-14.1 (non-free)amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
sid1.0+git20130326-14.1 (non-free)amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
jessie1.0+git20130326-3 (non-free)amd64,armel,armhf,i386
Debtags of package libttspico-utils:
roleprogram
Popcon: 34 users (19 upd.)*
Versions and Archs
License: non-free
Git

The SVOX Pico engine is a software speech synthesizer for German, English (GB and US), Spanish, French and Italian.

SVOX produces a clear and distinct speech output made possible by the use of Hidden Markov Model (HMM) algorithms.

This package contains binary files including pico2wave.

mbrola
Multilingual software speech synthesizer
Maintainer: Samuel Thibault
Versions of package mbrola
ReleaseVersionArchitectures
stretch3.01h+2-3 (non-free)amd64,armel,armhf,i386
jessie3.01h+1-2 (non-free)amd64,armel,i386
buster3.02b+dfsg-4 (contrib)amd64,arm64,armhf,i386
buster-backports3.3+dfsg-4+deb11u1~bpo10+1 (contrib)amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
bullseye3.3+dfsg-4+deb11u1 (contrib)amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm3.3+dfsg-9 (contrib)amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie3.3+dfsg-9 (contrib)amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
sid3.3+dfsg-9 (contrib)amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Debtags of package mbrola:
roleprogram
soundspeech
Popcon: 185 users (7 upd.)*
Versions and Archs
License: DFSG free, but needs non-free components
Git

Mbrola is Thierry Dutoit's phonemizer for multilingual speech synthesis. The various diphone databases are distributed on separate packages, but they must be used with and only with Mbrola because of license matters. Read the copyright for details.

Mbrola itself doesn't provide full TTS. It is a speech synthesizer based on the concatenation of diphones. It takes a list of phonemes as input, together with prosodic information (duration of phonemes and a piecewise linear description of pitch), and produces speech samples on 16 bits (linear), at the sampling frequency of the diphone database.

Use Mbrola along with Freephone, cicero or espeak to have a complete text-to-speech in English.

*Popularitycontest results: number of people who use this package regularly (number of people who upgraded this package recently) out of 236226