Summary
Speech Synthesis
Debian Accessibility Speech Synthesis
This metapackage will install packages which are useful for
Speech Synthesis and related APIs or applications.
Description
If you discover a project that looks like a good candidate for Debian Accessibility,
or if you have prepared an unofficial Debian package, please do not hesitate to
send a description of that project to the Debian Accessibility mailing list.
Debian Accessibility Speech Synthesis packages
Official Debian packages with high relevance
daisy-player
player for DAISY Digital Talking Books
|
Versions of package daisy-player |
Release | Version | Architectures |
buster | 11.6.2.1-2 | amd64,arm64,armhf,i386 |
stretch | 10.3-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 13.0-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 13.0-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 12.1-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 13.0-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 9.0.0-1 | amd64,armel,armhf,i386 |
Debtags of package daisy-player: |
interface | text-mode |
role | program |
scope | utility |
sound | player |
uitoolkit | ncurses |
use | learning, playing |
works-with | audio |
works-with-format | mp3 |
|
License: DFSG free
|
Daisy-player is a command-line player for talking books based on the
Digital Accessible Information System protocol. It is comparable in
functionality, features, and ease of use with commercial players, and
has a simple user interface appropriate for Braille terminals.
|
|
eflite
Festival-Lite based emacspeak speech server
|
Versions of package eflite |
Release | Version | Architectures |
bullseye | 0.4.1-12 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.4.1-13 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.4.1-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.4.1-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 0.4.1-6 | amd64,armel,armhf,i386 |
stretch | 0.4.1-8 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.4.1-9 | amd64,arm64,armhf,i386 |
Debtags of package eflite: |
accessibility | speech |
role | plugin |
suite | emacs |
works-with | audio |
|
License: DFSG free
|
EFlite is a speech server for Emacspeak and other screen readers that
allows them to interface with Festival Lite, a free text-to-speech
engine developed at the CMU Speech Center as an off-shoot of Festival.
Due to limitations inherited from its backend, EFlite currently supports
only the English language.
|
|
espeak
|
Versions of package espeak |
Release | Version | Architectures |
buster | 1.48.04+dfsg-7+deb10u1 | amd64,arm64,armhf,i386 |
bullseye | 1.48.15+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.48.15+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.48.15+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.48.15+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 1.48.04+dfsg-1 | amd64,armel,armhf,i386 |
stretch | 1.48.04+dfsg-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package espeak: |
interface | commandline |
role | program |
sound | speech |
works-with | audio |
|
License: DFSG free
|
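eSpeak is a software speech synthesizer for English and some other languages.
eSpeak produces good quality English speech. It uses a different synthesis
method from other open source text to speech (TTS) engines, and sounds quite
different. It is perhaps not as natural or smooth, but some people may find
the articulation clearer and easier to listen to for long periods.
eSpeak can also run as a command-line program to speak text from a file or
from standard input.
- Includes different voices, whose characteristics can be altered.
- Can produce speech output as a WAV file.
- Can translate text to phoneme codes, so it could be adapted as a front end
for another speech synthesis engine.
- Potential for other languages. More than 40 languages are included.
- Compact size. The program and its data total about 350 kbytes.
- Written in C++.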
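The command-line usage described above can be sketched as a small Python helper that builds an argument list for the espeak program. This is a minimal sketch, not part of the package: the helper name `espeak_command` and the file names are hypothetical, and it assumes the commonly documented espeak options `-v` (voice), `-x` (print phoneme mnemonics), `-w` (write a WAV file), `-f` (read text from a file) and `--stdin`.

```python
import shlex


def espeak_command(text_file=None, wav_file=None, voice=None, phonemes=False):
    """Build an argv list for the espeak command-line program.

    Hypothetical helper for illustration; it only assembles arguments
    and does not run espeak itself.
    """
    argv = ["espeak"]
    if voice:
        argv += ["-v", voice]       # select a voice, for example "en"
    if phonemes:
        argv.append("-x")           # also write phoneme mnemonics to stdout
    if wav_file:
        argv += ["-w", wav_file]    # write speech to a WAV file instead of playing it
    if text_file:
        argv += ["-f", text_file]   # read the text to speak from a file
    else:
        argv.append("--stdin")      # read the text to speak from standard input
    return argv


if __name__ == "__main__":
    # Show the shell command line that the argument list corresponds to.
    print(shlex.join(espeak_command(text_file="notes.txt", wav_file="notes.wav", voice="en")))
```

On a system where espeak is installed, the returned list could be passed to `subprocess.run`; when no text file is given, the text would be supplied on standard input.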
|
|
flite
Small run-time speech synthesis engine
|
Versions of package flite |
Release | Version | Architectures |
bullseye | 2.2-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 1.4-release-12 | amd64,armel,armhf,i386 |
sid | 2.2-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 2.0.0-release-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 2.2-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 2.2-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 2.1-release-3 | amd64,arm64,armhf,i386 |
Debtags of package flite: |
accessibility | speech |
interface | commandline |
role | program |
scope | utility |
works-with | audio |
|
License: DFSG free
|
Flite is a small fast run-time speech synthesis engine. It is the
latest addition to the suite of free software synthesis tools
including University of Edinburgh's Festival Speech Synthesis System
and Carnegie Mellon University's FestVox project, tools, scripts and
documentation for building synthetic voices. However, flite itself
does not require either of these systems to run.
It currently only supports the English and Indic languages.
This package contains the executables and documentation.
|
|
speech-dispatcher
Common interface to speech synthesizers
|
Versions of package speech-dispatcher |
Release | Version | Architectures |
bookworm | 0.11.4-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.9.0-5+deb10u1 | amd64,arm64,armhf,i386 |
bookworm-backports | 0.11.5-4~bpo12+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.11.5-5.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 0.8.6-4+deb9u1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.8-7 | amd64,armel,armhf,i386 |
bullseye-backports | 0.11.4-2~bpo11+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
experimental | 0.12.0~rc4-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.10.2-2+deb11u2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.11.5-5.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package speech-dispatcher: |
accessibility | speech |
interface | daemon |
network | server |
role | program |
works-with | audio |
|
License: DFSG free
|
Speech Dispatcher provides a device independent layer for speech synthesis.
It supports various software and hardware speech synthesizers as
backends and provides a generic layer for synthesizing speech and
playing back PCM data via those different backends to applications.
Various high level concepts like enqueueing vs. interrupting speech and
application specific user configurations are implemented in a device
independent way, therefore freeing the application programmer from
having to yet again reinvent the wheel.
This package contains Speech Dispatcher itself.
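The difference between enqueueing and interrupting speech can be illustrated with a toy model. This is only a sketch of the general idea under simplifying assumptions: the class `ToySpeechQueue` and its methods are hypothetical and are not Speech Dispatcher's API or implementation.

```python
from collections import deque


class ToySpeechQueue:
    """Toy model of a message queue (hypothetical, not Speech Dispatcher code)."""

    def __init__(self):
        self.pending = deque()

    def enqueue(self, text):
        # Enqueueing: the new message waits behind everything already pending.
        self.pending.append(text)

    def interrupt(self, text):
        # Interrupting: pending messages are discarded so the new one is next.
        self.pending.clear()
        self.pending.append(text)

    def next_message(self):
        # Return the next message to speak, or None if nothing is pending.
        return self.pending.popleft() if self.pending else None


if __name__ == "__main__":
    queue = ToySpeechQueue()
    queue.enqueue("first line")
    queue.enqueue("second line")
    queue.interrupt("urgent notice")
    print(queue.next_message())
```

In this model an application only chooses between the two calls; how the message is actually rendered is left to whichever backend is in use.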
|
|
speech-tools
Edinburgh Speech Tools - user binaries
|
Versions of package speech-tools |
Release | Version | Architectures |
sid | 2.5.0-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 2.5.0-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 2.5.0-13 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.5.0-11 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 2.5.0-5 | amd64,arm64,armhf,i386 |
stretch | 2.4~release-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 2.1~release-8 | amd64,armel,armhf,i386 |
Debtags of package speech-tools: |
accessibility | speech |
field | linguistics |
interface | commandline, text-mode |
role | program |
scope | utility |
uitoolkit | ncurses |
use | playing |
|
License: DFSG free
|
This package contains utility programs that use and accompany the
Edinburgh Speech Tools Library. Audio software and some basic
signal processing software are included in this package.
The following programs are available:
na_play: generic playback program for use with net_audio and CSTR ao.
ch_wave: waveform file conversion program.
ch_lab: label file conversion program.
ch_track: track file conversion program.
wagon: a CART tree build and test program.
See /usr/share/doc/speech-tools/README for a detailed list of the programs available.
|
|
Official Debian packages with lower relevance
festvox-ru
Russian male speaker for Festival
|
Versions of package festvox-ru |
Release | Version | Architectures |
bookworm | 0.5+dfsg-6 | all |
trixie | 0.5+dfsg-6 | all |
sid | 0.5+dfsg-6 | all |
jessie | 0.5+dfsg-3 | all |
stretch | 0.5+dfsg-3 | all |
buster | 0.5+dfsg-4 | all |
bullseye | 0.5+dfsg-5 | all |
Debtags of package festvox-ru: |
accessibility | speech |
culture | russian |
role | app-data |
sound | speech |
|
License: DFSG free
|
This package provides Russian support for the Festival speech
synthesis system.
|
|
freetts
|
Versions of package freetts |
Release | Version | Architectures |
stretch | 1.2.2-3 | all |
sid | 1.2.2-7 | all |
trixie | 1.2.2-7 | all |
bookworm | 1.2.2-7 | all |
bullseye | 1.2.2-7 | all |
buster | 1.2.2-6 | all |
jessie | 1.2.2-3 | all |
Debtags of package freetts: |
accessibility | speech |
role | program |
|
License: DFSG free
|
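FreeTTS is a speech synthesis system written entirely in the Java(TM) programming language. It is based on Flite, a run-time speech synthesis engine developed at Carnegie Mellon University. Flite is derived from the Festival Speech Synthesis System from the University of Edinburgh and the FestVox project from Carnegie Mellon University.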
|
|
gespeaker
GTK+ front-end for eSpeak and mbrola
|
Versions of package gespeaker |
Release | Version | Architectures |
stretch | 0.8.6-1 | all |
buster | 0.8.6-1 | all |
jessie | 0.8.5-1 | all |
Debtags of package gespeaker: |
accessibility | speech |
interface | x11 |
role | program |
scope | application |
sound | speech |
uitoolkit | gtk |
use | entertaining |
works-with | audio, text |
x11 | application |
|
License: DFSG free
|
Gespeaker is a GTK+ frontend for eSpeak and mbrola.
It allows one to play a text in many languages with settings
for voice, pitch, volume, speed and word gap.
Since version 0.6 it can use the mbrola package and voices to
obtain a more realistic text reading experience.
|
|
recite
|
Versions of package recite |
Release | Version | Architectures |
jessie | 1.0-8.2 | amd64,armel,armhf,i386 |
Debtags of package recite: |
accessibility | speech |
interface | commandline |
role | program |
scope | utility |
works-with | text |
|
License: DFSG free
|
|
|
saytime
|
Versions of package saytime |
Release | Version | Architectures |
sid | 1.0-35 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.0-35 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 1.0-26 | amd64,armel,armhf,i386 |
stretch | 1.0-27 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 1.0-30 | amd64,arm64,armhf,i386 |
bullseye | 1.0-34 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.0-35 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package saytime: |
accessibility | speech |
interface | commandline |
role | program |
scope | utility |
sound | player |
use | timekeeping |
works-with | audio |
|
License: DFSG free
|
Speaks the current time through your sound card. A sound output device
is required.
|
|
sonic
Simple utility to speed up or slow down speech
|
Versions of package sonic |
Release | Version | Architectures |
buster | 0.2.0-7 | amd64,arm64,armhf,i386 |
stretch | 0.2.0-4 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.1.17-1.1 | amd64,armel,armhf,i386 |
sid | 0.2.0-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.2.0-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.2.0-12 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.2.0-10 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package sonic: |
role | program |
scope | utility |
use | editing |
works-with | audio |
|
License: DFSG free
|
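Sonic is a very simple utility that reads and writes wav files, speeding them up or slowing them down with low distortion. The key new feature of Sonic compared with other libraries is that quality remains very high even when playback is sped up by a factor of more than 2.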
|
|
speech-dispatcher-festival
|
Versions of package speech-dispatcher-festival |
Release | Version | Architectures |
bookworm | 0.11.4-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
experimental | 0.12.0~rc4-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.11.5-5.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.11.5-5.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm-backports | 0.11.5-4~bpo12+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye-backports | 0.11.4-2~bpo11+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.10.2-2+deb11u2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.9.0-5+deb10u1 | amd64,arm64,armhf,i386 |
stretch | 0.8.6-4+deb9u1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.8-7 | amd64,armel,armhf,i386 |
Debtags of package speech-dispatcher-festival: |
accessibility | speech |
role | metapackage |
works-with | audio |
|
License: DFSG free
|
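Speech Dispatcher provides a device independent layer for speech synthesis. It supports various software and hardware speech synthesizers as backends and provides a generic layer for synthesizing speech and playing back PCM data via those different backends to applications.
Various high level concepts like enqueueing vs. interrupting speech and application specific user configurations are implemented in a device independent way, therefore freeing the application programmer from having to reinvent the wheel.
This package depends on the packages needed to run Speech Dispatcher with Festival.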
|
|
Debian packages in contrib or non-free
cicero
French and English Text-To-Speech for MBROLA
|
Versions of package cicero |
Release | Version | Architectures |
buster | 0.7.2-4 (contrib) | all |
stretch | 0.7.2-3 (contrib) | all |
jessie | 0.7.2-3 (contrib) | all |
Debtags of package cicero: |
accessibility | speech |
culture | british, french |
role | program |
|
License: DFSG free, but needs non-free components
|
This Text-To-Speech (TTS) engine speaks French; preliminary English support
is also offered.
The engine uses context-sensitive rules to produce phonemes from the text. It
relies on MBROLA to generate actual audio output from the phonemes. The TTS
engine is implemented in the Python programming language.
The upstream authors created this TTS to meet their own needs
as blind users.
It is designed to be plugged in as the output of screen-review software,
primarily BRLTTY.
They favor speed and intelligibility over perfect pronunciation.
Cicero aims for a quick response time, the ability to quickly shut up
and skip to another utterance, intelligibility where it counts (not perfect
pronunciation), the ability to track speech progression, relative simplicity
(hackability) and relatively small code size.
|
gnome-speech-dectalk
|
Versions of package gnome-speech-dectalk |
Release | Version | Architectures |
jessie | 0.4.25-5 (contrib) | i386 |
stretch | 0.4.25-6 (contrib) | i386 |
Debtags of package gnome-speech-dectalk: |
accessibility | speech |
|
License: DFSG free, but needs non-free components
|
|
gnome-speech-ibmtts
|
Versions of package gnome-speech-ibmtts |
Release | Version | Architectures |
jessie | 0.4.25-5 (contrib) | i386 |
stretch | 0.4.25-6 (contrib) | i386 |
Debtags of package gnome-speech-ibmtts: |
accessibility | speech |
|
License: DFSG free, but needs non-free components
|
|
gnome-speech-swift
|
Versions of package gnome-speech-swift |
Release | Version | Architectures |
stretch | 0.4.25-6 (contrib) | amd64,i386 |
jessie | 0.4.25-5 (contrib) | amd64,i386 |
Debtags of package gnome-speech-swift: |
accessibility | speech |
|
License: DFSG free, but needs non-free components
|
|
libttspico-utils
Small Footprint TTS (binaries)
|
Versions of package libttspico-utils |
Release | Version | Architectures |
stretch | 1.0+git20130326-5 (non-free) | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 1.0+git20130326-3 (non-free) | amd64,armel,armhf,i386 |
buster | 1.0+git20130326-9 (non-free) | amd64,arm64,armhf,i386 |
bullseye | 1.0+git20130326-11 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.0+git20130326-13 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.0+git20130326-14.1 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.0+git20130326-14.1 (non-free) | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package libttspico-utils: |
role | program |
|
License: non-free
|
The SVOX Pico engine is a software speech synthesizer for German, English (GB
and US), Spanish, French and Italian.
SVOX produces a clear and distinct speech output made possible by the use of
Hidden Markov Model (HMM) algorithms.
This package contains binary files including pico2wave.
|
mbrola
Multilingual software speech synthesizer
|
Versions of package mbrola |
Release | Version | Architectures |
stretch | 3.01h+2-3 (non-free) | amd64,armel,armhf,i386 |
jessie | 3.01h+1-2 (non-free) | amd64,armel,i386 |
buster | 3.02b+dfsg-4 (contrib) | amd64,arm64,armhf,i386 |
buster-backports | 3.3+dfsg-4+deb11u1~bpo10+1 (contrib) | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 3.3+dfsg-4+deb11u1 (contrib) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 3.3+dfsg-9 (contrib) | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 3.3+dfsg-9 (contrib) | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 3.3+dfsg-9 (contrib) | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package mbrola: |
role | program |
sound | speech |
|
License: DFSG free, but needs non-free components
|
Mbrola is Thierry Dutoit's phonemizer for multilingual speech synthesis. The
various diphone databases are distributed in separate packages, but they
must be used with Mbrola, and only with Mbrola, for licensing reasons. Read the
copyright file for details.
Mbrola itself doesn't provide full TTS. It is a speech synthesizer based on
the concatenation of diphones. It takes a list of phonemes as input,
together with prosodic information (duration of phonemes and a piecewise linear
description of pitch), and produces 16-bit linear speech samples
at the sampling frequency of the diphone database.
Use Mbrola along with Freephone, cicero or espeak to get a complete
text-to-speech system for English.
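The input described above (a phoneme, its duration, and optional pitch targets) can be sketched as a small formatter for the line-oriented .pho text that Mbrola reads. This is a minimal sketch under stated assumptions: each line is taken to hold a phoneme symbol, a duration in milliseconds, and zero or more pairs of position (percent of the phoneme's duration) and pitch (Hz). The helper names and the phoneme symbols in the example are hypothetical, since valid symbols depend on the diphone database in use.

```python
def pho_line(phoneme, duration_ms, pitch_points=()):
    """Format one line of .pho input (hypothetical helper).

    pitch_points is a sequence of (position_percent, pitch_hz) pairs that
    describe the piecewise linear pitch contour within the phoneme.
    """
    fields = [phoneme, str(duration_ms)]
    for position_percent, pitch_hz in pitch_points:
        fields += [str(position_percent), str(pitch_hz)]
    return " ".join(fields)


def pho_text(segments):
    """Join several (phoneme, duration[, pitch_points]) tuples into .pho text."""
    return "\n".join(pho_line(*segment) for segment in segments) + "\n"


if __name__ == "__main__":
    # Illustrative symbols only; "_" is used here for silence.
    example = [
        ("_", 100),
        ("a", 120, [(0, 120), (100, 140)]),
        ("_", 100),
    ]
    print(pho_text(example), end="")
```

A front end such as the ones named above would produce this kind of text from ordinary written input, and Mbrola would turn it into audio samples using a diphone database.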
|
|