Summary
Speech synthesis
Debian Accessibility Speech Synthesis
This metapackage will install packages which are useful for
Speech Synthesis and related APIs or applications.
Description
For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:
If you discover a project which looks like a good candidate for Debian Accessibility
to you, or if you have prepared an unofficial Debian package, please do not hesitate to
send a description of that project to the Debian Accessibility mailing list
Links to other tasks
|
Debian Accessibility Speech synthesis packages
Official Debian packages with high relevance
Daisy-player
player for DAISY Digital Talking Books
|
Versions of package daisy-player |
Release | Version | Architectures |
buster | 11.6.2.1-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
stretch | 10.3-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 9.0.0-1 | amd64,armel,armhf,i386 |
bookworm | 13.0-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 12.1-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 13.0-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package daisy-player: |
interface | text-mode |
role | program |
scope | utility |
sound | player |
uitoolkit | ncurses |
use | learning, playing |
works-with | audio |
works-with-format | mp3 |
|
License: DFSG free
|
Daisy-player is a command-line player for talking books based on the
Digital Accessible Information System protocol. It is comparable in
functionality, features, and ease of use with commercial players, and
has a simple user interface appropriate for Braille terminals.
|
|
Eflite
Festival-Lite based emacspeak speech server
|
Versions of package eflite |
Release | Version | Architectures |
bullseye | 0.4.1-12 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.4.1-13 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.4.1-13 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 0.4.1-8 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.4.1-6 | amd64,armel,armhf,i386 |
buster | 0.4.1-9 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package eflite: |
accessibility | speech |
role | plugin |
suite | emacs |
works-with | audio |
|
License: DFSG free
|
EFlite is a speech server for Emacspeak and other screen readers that
allows them to interface with Festival Lite, a free text-to-speech
engine developed at the CMU Speech Center as an off-shoot of Festival.
Due to limitations inherited from its backend, EFlite does only provide
support for the English language at the moment.
|
|
Espeak
|
Versions of package espeak |
Release | Version | Architectures |
buster | 1.48.04+dfsg-7+deb10u1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.48.15+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.48.15+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.48.15+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 1.48.04+dfsg-1 | amd64,armel,armhf,i386 |
stretch | 1.48.04+dfsg-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package espeak: |
interface | commandline |
role | program |
sound | speech |
works-with | audio |
|
License: DFSG free
|
eSpeak은 영어 및 기타 다른 언어를 위한 음성 합성 소프트웨어 입니다.
eSpeak는 좋은 품질에 영어 음성을 만듭니다. 이 프로그램은 서로 다른 오픈소스
text to speech (TTS) 엔진에서 다른 합성 방법을 사용하며, 그리고 전혀 다른
소리가 납니다. 아마도 자연스럽거나 부드럽지는 않겠지만, 그러나 일부는 오랬
동안 듣고 있으면 발음이 명료하고 쉽게 들리는 것을 찾을 수 있습니다.
eSpeak은 파일 또는 표준 입력으로 text를 말하기위해 명령행 프로그램으로 실행할
수도 있습니다.
- 특성을 변경할 수 있도록, 다른 음색을 포함합니다.
- WAV 파일로 음성 출력을 만들 수 있습니다.
- 텍스트를 음소 코드로 번역할 수 있으며, 그래서 다른 음성 합성 엔진에 대한
프론트앤드로 구성될 수도 있습니다.
- 다른 언어에 대한 잠재성. 40 이상의 언어를 포함합니다.
- 작은 사이즈. 프로그램과 데이타 총 350kbytes 정도입니다.
- C++로 개발.
|
|
Flite
Small run-time speech synthesis engine
|
Versions of package flite |
Release | Version | Architectures |
bookworm | 2.2-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 1.4-release-12 | amd64,armel,armhf,i386 |
sid | 2.2-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 2.0.0-release-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 2.1-release-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.2-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package flite: |
accessibility | speech |
interface | commandline |
role | program |
scope | utility |
works-with | audio |
|
License: DFSG free
|
Flite is a small fast run-time speech synthesis engine. It is the
latest addition to the suite of free software synthesis tools
including University of Edinburgh's Festival Speech Synthesis System
and Carnegie Mellon University's FestVox project, tools, scripts and
documentation for building synthetic voices. However, flite itself
does not require either of these systems to run.
It currently only supports the English and Indic languages.
This package contains the executables and documentation.
|
|
Speech-dispatcher
Common interface to speech synthesizers
|
Versions of package speech-dispatcher |
Release | Version | Architectures |
bullseye | 0.10.2-2+deb11u2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.9.0-5+deb10u1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
stretch | 0.8.6-4+deb9u1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.8-7 | amd64,armel,armhf,i386 |
sid | 0.11.4-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.11.4-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye-backports | 0.11.4-2~bpo11+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package speech-dispatcher: |
accessibility | speech |
interface | daemon |
network | server |
role | program |
works-with | audio |
|
License: DFSG free
|
Speech Dispatcher provides a device independent layer for speech synthesis.
It supports various software and hardware speech synthesizers as
backends and provides a generic layer for synthesizing speech and
playing back PCM data via those different backends to applications.
Various high level concepts like enqueueing vs. interrupting speech and
application specific user configurations are implemented in a device
independent way, therefore freeing the application programmer from
having to yet again reinvent the wheel.
This package contains Speech Dispatcher itself.
|
|
Speech-tools
Edinburgh Speech Tools - user binaries
|
Versions of package speech-tools |
Release | Version | Architectures |
buster | 2.5.0-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
stretch | 2.4~release-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.5.0-11 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 2.5.0-13 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 2.1~release-8 | amd64,armel,armhf,i386 |
sid | 2.5.0-13 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package speech-tools: |
accessibility | speech |
field | linguistics |
interface | commandline, text-mode |
role | program |
scope | utility |
uitoolkit | ncurses |
use | playing |
|
License: DFSG free
|
This package contains the various highly useful utility programs that use and
accompany the Edinburgh Speech Tools Library. Audio software and some basic
signal processing software is included in this package.
The following programs are available:
na_play: generic playback program for use with net_audio and CSTR ao.
ch_wave: Waveform file conversion program.
ch_lab: label file conversion program.
ch_track: Track file conversion program.
wagon: a CART tree build and test program
See /usr/share/doc/speech-tools/README for detail list of programs available.
|
|
Official Debian packages with lower relevance
Festvox-ru
Russian male speaker for Festival
|
Versions of package festvox-ru |
Release | Version | Architectures |
bullseye | 0.5+dfsg-5 | all |
bookworm | 0.5+dfsg-6 | all |
sid | 0.5+dfsg-6 | all |
jessie | 0.5+dfsg-3 | all |
stretch | 0.5+dfsg-3 | all |
buster | 0.5+dfsg-4 | all |
Debtags of package festvox-ru: |
accessibility | speech |
culture | russian |
role | app-data |
sound | speech |
|
License: DFSG free
|
This package provides Russian support to Festival speech
synthesis system.
|
|
Freetts
|
Versions of package freetts |
Release | Version | Architectures |
stretch | 1.2.2-3 | all |
sid | 1.2.2-7 | all |
bookworm | 1.2.2-7 | all |
bullseye | 1.2.2-7 | all |
buster | 1.2.2-6 | all |
jessie | 1.2.2-3 | all |
Debtags of package freetts: |
accessibility | speech |
role | program |
|
License: DFSG free
|
FreeTTS는 전체를 Java(TM) 프로그래밍 언어로 개발한 음성 합성 시스템입니다. 이는 Carnegie Mellon 대학에서 개발한 런타임 음성 합성 엔진인 Flite를 기반으로 합니다. Flite는 Edinburgh 대학의 Festival Speech Synthesis System과 Carnegie Mellon 대학의 FestVox 프로젝트에서 파생되었습니다.
|
|
Gespeaker
GTK+ front-end for eSpeak and mbrola
|
Versions of package gespeaker |
Release | Version | Architectures |
stretch | 0.8.6-1 | all |
buster | 0.8.6-1 | all |
jessie | 0.8.5-1 | all |
Debtags of package gespeaker: |
accessibility | speech |
interface | x11 |
role | program |
scope | application |
sound | speech |
uitoolkit | gtk |
use | entertaining |
works-with | audio, text |
x11 | application |
|
License: DFSG free
|
Gespeaker is a GTK+ frontend for eSpeak and mbrola.
It allows one to play a text in many languages with settings
for voice, pitch, volume, speed and word gap.
Since version 0.6 it can use mbrola package and voices to
obtain a more realistic text reading experience.
|
|
Recite
English text speech synthesizer
|
Versions of package recite |
Release | Version | Architectures |
jessie | 1.0-8.2 | amd64,armel,armhf,i386 |
Debtags of package recite: |
accessibility | speech |
interface | commandline |
role | program |
scope | utility |
works-with | text |
|
License: DFSG free
|
Recite is a program to do speech synthesis. The quality of sound
produced is not terribly good, but it should be adequate for reporting
the occasional error message verbally.
Given some English text, recite will convert it to a series of phonemes,
then convert the phonemes to a sequence of vocal tract parameters, and
then synthesise the sound a vocal tract would make to say the sentence.
Recite can perform a subset of these operations, so it can be used to
convert text into phonemes, or to produce an utterance based on vocal
tract parameters computed by another program.
|
|
Saytime
|
Versions of package saytime |
Release | Version | Architectures |
jessie | 1.0-26 | amd64,armel,armhf,i386 |
sid | 1.0-35 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.0-35 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.0-34 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 1.0-30 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
stretch | 1.0-27 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package saytime: |
accessibility | speech |
interface | commandline |
role | program |
scope | utility |
sound | player |
use | timekeeping |
works-with | audio |
|
License: DFSG free
|
사용자의 사운드 카드를 통해서 현재 시간을 알려줍니다. 사운드 출력 장치가 있
어야 합니다.
|
|
Sonic
말하기 속도를 높이거나 낮추는 간단한 유틸리티
|
Versions of package sonic |
Release | Version | Architectures |
stretch | 0.2.0-4 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.2.0-7 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 0.2.0-12 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 0.1.17-1.1 | amd64,armel,armhf,i386 |
bullseye | 0.2.0-10 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.2.0-12 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package sonic: |
role | program |
scope | utility |
use | editing |
works-with | audio |
|
License: DFSG free
|
Sonic은 wav 파일을 읽고 쓰는 매우 간단한 유틸리티로 적은 왜곡으로 파일 속도를 높이거나 낮춥니다. 다른 라이브러리와 비교해서 Sonic에 새로운 주요 기능으로 재생 속도를 2배 이상으로 높여도 퀄리티가 매우 높다는 것 입니다.
|
|
Speech-dispatcher-festival
Festival support for Speech Dispatcher
|
Versions of package speech-dispatcher-festival |
Release | Version | Architectures |
jessie | 0.8-7 | amd64,armel,armhf,i386 |
stretch | 0.8.6-4+deb9u1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.9.0-5+deb10u1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.10.2-2+deb11u2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.11.4-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye-backports | 0.11.4-2~bpo11+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.11.4-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package speech-dispatcher-festival: |
accessibility | speech |
role | metapackage |
works-with | audio |
|
License: DFSG free
|
Speech Dispatcher provides a device independent layer for speech synthesis.
It supports various software and hardware speech synthesizers as
backends and provides a generic layer for synthesizing speech and
playing back PCM data via those different backends to applications.
Various high level concepts like enqueueing vs. interrupting speech and
application specific user configurations are implemented in a device
independent way, therefore freeing the application programmer from
having to yet again reinvent the wheel.
This package contains dependencies on packages necessary for running Speech
Dispatcher with Festival.
|
|
Debian packages in contrib or non-free
Cicero
French and English Text-To-Speech for MBROLA
|
Versions of package cicero |
Release | Version | Architectures |
buster | 0.7.2-4 (contrib) | all |
stretch | 0.7.2-3 (contrib) | all |
jessie | 0.7.2-3 (contrib) | all |
Debtags of package cicero: |
accessibility | speech |
culture | british, french |
role | program |
|
License: DFSG free, but needs non-free components
|
This Text-To-Speech (TTS) engine speaks French; a preliminary English support
is also offered.
The engine uses context-sensitive rules to produce phonemes from the text. It
relies on MBROLA to generate actual audio output from the phonemes. The TTS
engine is implemented using the Python programming language.
The upstream authors have come up with this TTS to try and meet their own needs
as blind users.
It's designed to be plugged as output to some screen-review software, firstly
with BRLTTY.
They favor speed and intelligibility over perfect pronunciation.
Cicero is aimed to have a quick response time, the ability to quickly shut-up
and skip to another utterance, intelligibility where it counts (not perfect
pronunciation), the ability to track speech progression, relative simplicity
(hackability) and relative small code size.
|
Gnome-speech-dectalk
GNOME text-to-speech library (Fonix DECtalk engine support)
|
Versions of package gnome-speech-dectalk |
Release | Version | Architectures |
stretch | 0.4.25-6 (contrib) | i386 |
jessie | 0.4.25-5 (contrib) | i386 |
Debtags of package gnome-speech-dectalk: |
accessibility | speech |
|
License: DFSG free, but needs non-free components
|
The GNOME Speech library gives a simple yet general API for programs
to convert text into speech, as well as speech input.
This package provides the source code required to compile a driver
for the commercial DECtalk software speech synthesis engine and voices from
Fonix (http://www.fonix.com/).
Upon installation, it will automatically attempt to compile and install
the dectalk-synthesis-driver binary required to use GNOME Speech with
dectalk.
This package is only useful if the dectalk engine is already installed on
the system.
|
Gnome-speech-ibmtts
GNOME text-to-speech library (IBMTTS engine support)
|
Versions of package gnome-speech-ibmtts |
Release | Version | Architectures |
stretch | 0.4.25-6 (contrib) | i386 |
jessie | 0.4.25-5 (contrib) | i386 |
Debtags of package gnome-speech-ibmtts: |
accessibility | speech |
|
License: DFSG free, but needs non-free components
|
The GNOME Speech library gives a simple yet general API for programs
to convert text into speech, as well as speech input.
This package provides the source code required to compile a driver
for the commercial IBMTTS speech synthesis engine available
from http://ttsynth.com/.
Upon installation, it will automatically attempt to compile and install
the voiavoice-synthesis-driver binary required to use GNOME Speech with
IBMTTS.
This package is only useful if the IBMTTS (TTSynth) engine is already
installed on the system.
|
Gnome-speech-swift
GNOME text-to-speech library (Cepstral swift engine support)
|
Versions of package gnome-speech-swift |
Release | Version | Architectures |
stretch | 0.4.25-6 (contrib) | amd64,i386 |
jessie | 0.4.25-5 (contrib) | amd64,i386 |
Debtags of package gnome-speech-swift: |
accessibility | speech |
|
License: DFSG free, but needs non-free components
|
The GNOME Speech library gives a simple yet general API for programs
to convert text into speech, as well as speech input.
This package provides the source code required to compile a driver
for the commercial swift speech synthesis engine and voices from
Cepstral (http://www.cepstral.com/).
Upon installation, it will automatically attempt to compile and install
the swift-synthesis-driver binary required to use GNOME Speech with swift.
This package is only useful if the swift engine is already installed on the
system.
|
Libttspico-utils
Small Footprint TTS (binaries)
|
Versions of package libttspico-utils |
Release | Version | Architectures |
buster | 1.0+git20130326-9 (non-free) | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 1.0+git20130326-3 (non-free) | amd64,armel,armhf,i386 |
stretch | 1.0+git20130326-5 (non-free) | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 1.0+git20130326-13 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.0+git20130326-13 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.0+git20130326-11 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package libttspico-utils: |
role | program |
|
License: non-free
|
The SVOX Pico engine is a software speech synthesizer for German, English (GB
and US), Spanish, French and Italian.
SVOX produces a clear and distinct speech output made possible by the use of
Hidden Markov Model (HMM) algorithms.
This package contains binary files including pico2wave.
|
Mbrola
Multilingual software speech synthesizer
|
Versions of package mbrola |
Release | Version | Architectures |
stretch | 3.01h+2-3 (non-free) | amd64,armel,armhf,i386 |
jessie | 3.01h+1-2 (non-free) | amd64,armel,i386 |
buster | 3.02b+dfsg-4 (contrib) | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster-backports | 3.3+dfsg-4+deb11u1~bpo10+1 (contrib) | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 3.3+dfsg-4+deb11u1 (contrib) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 3.3+dfsg-9 (contrib) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 3.3+dfsg-9 (contrib) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package mbrola: |
role | program |
sound | speech |
|
License: DFSG free, but needs non-free components
|
Mbrola is Thierry Dutoit's phonemizer for multilingual speech synthesis. The
various diphone databases are distributed on separate packages, but they
must be used with and only with Mbrola because of license matters. Read the
copyright for details.
Mbrola itself doesn't provide full TTS. It is a speech synthesizer based on
the concatenation of diphones. It takes a list of phonemes as input,
together with prosodic information (duration of phonemes and a piecewise linear
description of pitch), and produces speech samples on 16 bits (linear),
at the sampling frequency of the diphone database.
Use Mbrola along with Freephone, cicero or espeak to have a complete
text-to-speech in English.
|
|