Summary
Linguistics
Debian Science Linguistics packages
This metapackage is part of the Debian Pure Blend "Debian Science"
and installs packages related to Linguistics.
Description
For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:
If you discover a project which looks like a good candidate for Debian Science
to you, or if you have prepared an unofficial Debian package, please do not hesitate to
send a description of that project to the Debian Science mailing list
Links to other tasks
|
Debian Science Linguistics packages
Official Debian packages with high relevance
apertium
jadro na strojový preklad pomocou plytkého prekladu
|
Versions of package apertium |
Release | Version | Architectures |
jessie | 3.1.0-2 | amd64,armel,armhf,i386 |
sid | 3.9.4-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 3.9.4-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 3.5.2-1 | amd64,arm64,armhf,i386 |
bullseye | 3.7.1-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 3.8.3-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 3.4.0~r61013-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package apertium: |
field | linguistics |
role | program |
|
License: DFSG free
|
Apertium, open source jadro na strojový preklad pomocou plytkého prekladu,
je v prvom rade zamerané na súvisiace dvojice jazykov.
Na lexikálne spracovanie používa konečné prevodníky, skryté Markovove
modely na označovanie slovných druhov a konečné rozdeľovanie na prenos
štruktúry.
Systém je vo veľkej miere založený na systémoch, ktoré už vyvinula skupina
Transducens na Universitat d'Alacant, ako interNOSTRUM (Spanish-Catalan,
http://www.internostrum.com/welcome.php) a Traductor Universia (Spanish-
Portuguese, http://traductor.universia.net).
Jednoduchým poskytnutím lingvistických dát v správnom formáte bude možné
použiť Apertium na zostavenie systémov na strojový preklad pre rozličné
dvojice súvisiacich jazykov.
|
|
apertium-eval-translator
Evaluate machine translation output against reference
|
Versions of package apertium-eval-translator |
Release | Version | Architectures |
sid | 1.2.1-3 | all |
bullseye | 1.2.1-2 | all |
bookworm | 1.2.1-3 | all |
trixie | 1.2.1-3 | all |
|
License: DFSG free
|
This package contails Perl scripts to evaluate Apertium-based machine
translation output against reference: WER, PER, TER, BLEU.
|
|
apertium-lex-tools
Constraint-based lexical selection module
|
Versions of package apertium-lex-tools |
Release | Version | Architectures |
buster | 0.2.1-1 | amd64,arm64,armhf,i386 |
stretch | 0.1.1~r66150-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.2.7-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.4.2-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.4.2-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.4.2-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
Module for compiling lexical selection rules and processing
them in the pipeline.
|
|
artha
užitočný offline synonymický slovník založený na WordNet
|
Versions of package artha |
Release | Version | Architectures |
sid | 1.0.5-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.0.5-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 1.0.3-1 | amd64,armel,armhf,i386 |
bookworm | 1.0.5-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.0.5-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 1.0.3-3 | amd64,arm64,armhf,i386 |
stretch | 1.0.3-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package artha: |
field | linguistics |
interface | x11 |
role | program |
uitoolkit | gtk |
use | learning |
x11 | application |
|
License: DFSG free
|
Artha je offline anglický synonymický slovník s výraznými vlastnosťami:
- vyhľadanie slova stlačením klávesovej skratky (vyberte text
v ľubovoľnom okne, stlačte kláves a vyhľadáte slovo)
- hľadanie podľa regulárnych výrazov (rozšírenie vyhľadávania pomocou
zástupných znakov ako *, ? atď.)
- pasívne oznámenia prac. prostredia (definícií slov na nerušenú prácu)
- pravopisné návrhy (keď nie je presný pravopis známy alebo je nejasný)
Po spustení monitoruje nastavenú klávesovú skratku. Ak je v niektorom okne
vybraný text a stlačená klávesová skratka, zobrazí sa okno s vyhľadaným
slovom. Ak používateľ uprednostňuje pasívne oznámenia, môže si ich zapnúť.
Ak sa hľadaný termín nenájde alebo je nejasný, je možné buď vo
vyhľadávacom reťazci rozšíriť vyhľadávanie pomocou regulárnych výrazov
(*, ? atď.) alebo použiť návrhy, ak je termín nesprávny.
Aby regulárne výrazy fungovali, musíte mať nainštalovaný balík
wordnet-sense-index.
|
|
cg3
Tools for using the 3rd edition of Constraint Grammar (CG-3)
|
Versions of package cg3 |
Release | Version | Architectures |
buster | 1.1.7-1 | amd64,arm64,armhf,i386 |
sid | 1.4.6-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 1.3.2-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.3.9-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 0.9.9~r11624-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 1.4.6-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
Constraint Grammar compiler and applicator for the 3rd edition of CG
that is developed and maintained by VISL SDU and GrammarSoft ApS.
CG-3 can be used for disambiguation of morphology, syntax, semantics, etc;
dependency markup, target language lemma choice for MT, QA systems, and
much more. The core idea is that you choose what to do based on the whole
available context, as opposed to n-grams.
See https://visl.sdu.dk/cg3.html for more documentation
|
|
collatinus
lemmatisation of latin text
|
Versions of package collatinus |
Release | Version | Architectures |
trixie | 12.2-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 12.2-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch-backports | 11-1~bpo9+1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
stretch | 10.2-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bookworm | 12.1-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 11-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 10.2-2 | amd64,armel,armhf,i386 |
buster | 11-1 | amd64,arm64,armhf,i386 |
Debtags of package collatinus: |
field | linguistics |
interface | x11 |
role | dummy, program |
scope | application |
uitoolkit | gtk |
use | learning |
x11 | application |
|
License: DFSG free
|
Collatinus can be used to lemmatise latin texts, i.e. extract words and
make a lexicon which indicates for each word its canonic form, and how
the form actually found in the text was derived from it, for instance by
declining it. Example : rosam gives : rosa-rosae -- acc. sing.
Collatinus provides a nice graphic front-end to each operation.
Collatinus-nouus (stands for Collatinus, new generation) replaces every
previous version of Collatinus.
This package provides a documentation in HTML format.
|
|
dimbl
Distributed Memory Based Learner
|
Versions of package dimbl |
Release | Version | Architectures |
bullseye | 0.15-2.1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.15-2.1 | amd64,arm64,armhf,i386 |
jessie | 0.12-2 | amd64,armel,armhf,i386 |
stretch | 0.15-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 0.17-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.17-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.15-2.1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
upstream | 0.18 |
Debtags of package dimbl: |
role | program |
|
License: DFSG free
|
Dimbl obaľuje klasifikátor k najbližších susedov v TiMBL. Ponúka paralelnú
klasifikáciu na počítačoch s viacerými CPU. Dimbl rozdelí pôvodnú
tréningovú množinu, zostaví samostatné klasifikátory TiMBL pre každú
podmnožinu a zlúči ich sady najbližších susedov podľa klasifikovanej
inštancie.
Vlastnosti Dimbl:
- je pekným obalom TiMBL a zachováva voľby príkazového riadka
- vie pracovať s viacerými jadrami
- využíva špecifikáciu OpenMP na paralelné programovanie
- dosahuje superlineárne zlepšenie rýchlosti v porovnaní so štandardným
TiMBL
Dimbl je produktom výskumnej skupiny ILK (Tilburg University, Holandsko).
Ak sa venujete výskumu spracovania prirodzených jazykov pomocou metódy
pamäťovej výuky, Dimbl sa vám pravdepodobne hodí.
|
|
fasttext
Efficient learning of word representations and sentence classification library
|
Versions of package fasttext |
Release | Version | Architectures |
sid | 0.9.2+ds-7 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.9.2+ds-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.9.2-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.9.2+ds-7 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
fastText is a library for efficient learning of word representations
and sentence classification, which refers subword information to
enrich word vectors.
|
|
frog
tagger and parser for natural languages (runtime)
|
Versions of package frog |
Release | Version | Architectures |
jessie | 0.12.17-7.1 | amd64,armel,armhf,i386 |
stretch | 0.13.7-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.15-1 | amd64,arm64,armhf,i386 |
bullseye | 0.20-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.20-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.32-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.32-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
upstream | 0.34 |
|
License: DFSG free
|
Memory-Based Learning (MBL) is a machine-learning method applicable to a wide
range of tasks in Natural Language Processing (NLP).
Frog is a modular system integrating a morphosyntactic tagger, lemmatizer,
morphological analyzer, and dependency parser for natural languages. It is
based upon it's predecessor TADPOLE (TAgger, Dependency Parser, and
mOrphoLogical analyzEr). Using Memory-Based Learning techniques, frog
tokenizes, tags, lemmatizes, and morphologically segments word tokens in
incoming UTF-8 text files, and assigns a dependency graph to each sentence.
Frog is particularly targeted at the increasing need for fast, automatic NLP
systems applicable to very large (multi-million to billion word) document
collections that are becoming available due to the progressive digitization of
both new and old textual data. Up to now, frog has only been tested and used
using corpora of Dutch natural language (see the frogdata package for samples).
Frog is a product of the Centre of Language and Speech Technology at
Radboud University Nijmegen, it subsumes previous work by the
ILK Research Group (Tilburg University, The Netherlands) and
the CLiPS Research Centre (University of Antwerp, Belgium). It is
currently maintained at the KNAW Humanities Cluster.
If you do scientific research in NLP, Frog will likely be of use to you.
|
|
giella-sme
Giellatekno single language data for North Saami
|
Versions of package giella-sme |
Release | Version | Architectures |
stretch | 0.0.20150917~r121176-1 | all |
buster | 0.0.20150917~r121176-3 | all |
|
License: DFSG free
|
Data package providing Giellatekno language resources for North Saami.
|
|
hfst
Helsinki Finite-State Transducer Technology
|
Versions of package hfst |
Release | Version | Architectures |
bookworm | 3.16.0-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 3.15.1-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 3.15.0-1.1~deb10u1 | amd64,arm64,armhf,i386 |
stretch | 3.10.0~r2798-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 3.16.1-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 3.16.1-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
The Helsinki Finite-State Transducer software is intended for the
implementation of morphological analysers and other tools which are
based on weighted and unweighted finite-state transducer technology.
|
|
hfst-ospell
Spell checker library and tool based on HFST
|
Versions of package hfst-ospell |
Release | Version | Architectures |
trixie | 0.5.4-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.5.2-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 0.4.0~r4643-4 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.5.0-2 | amd64,arm64,armhf,i386 |
bookworm | 0.5.3-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.5.4-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
Minimal HFST optimized lookup format based spell checker library and
a demonstrational implementation of command line based spell checker.
|
|
irstlm
IRST Language Modeling Toolkit
|
Versions of package irstlm |
Release | Version | Architectures |
trixie | 6.00.05-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 6.00.05-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 6.00.05-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 6.00.05-2 | amd64,arm64,armhf,i386 |
stretch | 6.00.05-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 6.00.05-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
The IRST Language Modeling Toolkit can be used to learn a language model
from data. The generated n-gram models should be usable on any system
supporting ARPA language model format.
This package provides the command line tools.
|
|
libcld2-dev
Compact Language Detector 2, development package
|
Versions of package libcld2-dev |
Release | Version | Architectures |
sid | 0.0.0-git20150806-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64 |
bullseye | 0.0.0-git20150806-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
bookworm | 0.0.0-git20150806-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
trixie | 0.0.0-git20150806-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64 |
stretch | 0.0.0-git20150806-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
buster | 0.0.0-git20150806-6 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
Detects over 80 languages in UTF-8 text, based largely on groups
of four letters.
Also tables for 160+ language version.
This is the development package.
|
|
link-grammar
syntaktický analyzátor spojových gramatík Carnegie Mellon University
|
Versions of package link-grammar |
Release | Version | Architectures |
jessie | 4.7.4-2 | amd64,armel,armhf,i386 |
sid | 5.12.5~dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 5.12.5~dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 5.12.0~dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 5.8.1-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 5.5.1-6 | amd64,arm64,armhf,i386 |
stretch | 5.3.14-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package link-grammar: |
field | linguistics |
interface | commandline |
role | program |
use | checking |
works-with | dictionary |
|
License: DFSG free
|
Sleator, D. a Temperley, D. v „Parsing English with a Link Grammar“ (1991)
definujú nový systém formálnych gramatík zvaný „spojová gramatika“.
Postupnosť slov je v jazyku spojovej gramatiky ak existuje možnosť ako
nakresliť „spojenia“ medzi slovami tak, aby boli uspokojené lokálne
požiadavky každého slova, aby sa spojenia nepretínali a slová tvorili
súvislý graf. Autori do takéhoto systému zakódovali gramatiku angličtiny a
napísali tento program na syntaktickú analýzu angličtiny pomocou tejto
gramatiky.
link-grammar je možné použiť na lingvistickú syntaktickú analýzu za účelom
získavania informácií alebo extrakcie z dokumentov v prirodzenom jazyku.
Tiež je možné použiť ho na kontrolu gramatiky.
Tento balík obsahuje používateľský spustiteľný súbor.
|
|
lttoolbox
Apertium lexical processing modules and tools
|
Versions of package lttoolbox |
Release | Version | Architectures |
trixie | 3.7.6-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 3.3.3~r68466-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 3.1.0-1.2 | amd64,armel,armhf,i386 |
bookworm | 3.7.1-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 3.7.6-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 3.5.0-3 | amd64,arm64,armhf,i386 |
bullseye | 3.5.3-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package lttoolbox: |
field | linguistics |
role | program |
|
License: DFSG free
|
The lttoolbox contains the augmented letter transducer tools for natural
language processing used by Apertium, a platform for building rule-based
and hybrid machine translation systems. The software is also useful
for making morphological analysers and generators for natural language
processing applications.
|
|
mbt
memory-based tagger-generator and tagger
|
Versions of package mbt |
Release | Version | Architectures |
buster | 3.4-1 | amd64,arm64,armhf,i386 |
trixie | 3.10-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 3.10-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 3.6-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 3.6-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 3.2.10-4 | amd64,armel,armhf,i386 |
stretch | 3.2.16-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
upstream | 3.11 |
Debtags of package mbt: |
field | linguistics |
role | program |
|
License: DFSG free
|
MBT is a memory-based tagger-generator and tagger in one. The tagger-generator
part can generate a sequence tagger on the basis of a training set of tagged
sequences; the tagger part can tag new sequences. MBT can, for instance, be
used to generate part-of-speech taggers or chunkers for natural language
processing. Features:
- Tagger generation: tagged text in, tagger out,
- Optional feedback loop: feed previous tag decision back to input of next
decision,
- Easily customizable feature representation; can incorporate user-provided
features,
- Automatic generation of separate sub-taggers for known words and unknown
words,
- Can make use of full algorithmic parameters of TiMBL.
MBT is a product of the Centre of Language and Speech Technology (Radboud
University Nijmegen, The Netherlands), the ILK Research Group (Tilburg
University, The Netherlands) and the CLiPS Research Centre (University
of Antwerp, Belgium).
If you do scientific research in natural language processing, MBT will
likely be of use to you.
|
|
mbtserver
Server extensions for the MBT tagger
|
Versions of package mbtserver |
Release | Version | Architectures |
stretch | 0.11-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.14-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.16-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.16-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 0.7-3 | amd64,armel,armhf,i386 |
buster | 0.12-1 | amd64,arm64,armhf,i386 |
bullseye | 0.14-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
upstream | 0.17 |
|
License: DFSG free
|
MbtServer extends Mbt with a server layer, running as a TCP server. Mbt is a
memory-based tagger-generator and tagger for natural language processing.
MbtServer provides the possibility to access a trained tagger from multiple
sessions. It also allows one to run and access different taggers in parallel.
MbtServer is a product of the Centre for Language and Speech Technology
(Radboud University, Nijmegen, The Netherlands), the ILK Research Group
(Tilburg University, The Netherlands) and the CLiPS Research Centre
(University of Antwerp, Belgium).
If you do scientific research in natural language processing, MbtServer will
likely be of use to you.
|
|
opennlp
wrapper for Apache OpenNLP natural language text processing toolkit
|
Versions of package opennlp |
Release | Version | Architectures |
trixie | 2.5.1-1 | all |
bookworm | 2.1.0-1 | all |
sid | 2.5.1-1 | all |
bullseye | 1.9.3-1 | all |
|
License: DFSG free
|
The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging, named
entity extraction, chunking, parsing, and coreference resolution. These tasks
are usually required to build more advanced text processing services. OpenNLP
also included maximum entropy and perceptron based machine learning.
This package contains the command line wrapper.
|
|
python3-pynlpl
PyNLPl is a library for Natural Language Processing (Python 3 version)
|
Versions of package python3-pynlpl |
Release | Version | Architectures |
buster | 1.1.2-1 | all |
bullseye | 1.2.9-1 | all |
trixie | 1.2.9-1 | all |
sid | 1.2.9-1 | all |
stretch | 1.1.2-1 | all |
bookworm | 1.2.9-1 | all |
|
License: DFSG free
|
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language
Processing. It contains various modules useful for common, and less common,
NLP tasks. PyNLPl can be used for basic tasks such as the extraction of
n-grams and frequency lists, and to build simple language models. It also
contains complex data types and algorithms. Moreover, it includes parsers for
file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL) and clients
to interface with various NLP specific servers. PyNLPl most notably features a
very extensive library for working with FoLiA XML (Format for Linguistic
Annotation).
This is the Python 3 version.
|
|
python3-thinc
Practical Machine Learning for NLP in Python
|
Versions of package python3-thinc |
Release | Version | Architectures |
bookworm | 8.1.7-1 | amd64,arm64,armhf,i386,mips64el,s390x |
sid | 9.0.0-2 | amd64,arm64,armhf,i386,mips64el,riscv64,s390x |
buster | 6.12.1-1 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
Thinc is the machine learning library powering spaCy https://spacy.io.
It features a battle-tested linear model designed for large sparse
learning problems, and a flexible neural network model under development
for spaCy v2.0 https://spacy.io/usage/v2.
Thinc is a practical toolkit for implementing models that follow the
"Embed, encode, attend, predict" architecture. It's designed to be easy
to install, efficient for CPU usage and optimised for NLP and deep
learning with text – in particular, hierarchically structured input
and variable-length sequences.
|
|
r-cran-lexrankr
extractive summarization of text with the LexRank algorithm
|
Versions of package r-cran-lexrankr |
Release | Version | Architectures |
sid | 0.5.2-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.5.2-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.5.2-8 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.5.2-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 0.5.0-2 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
An R implementation of the LexRank algorithm implementing stochastic
graph-based method for computing relative importance of textual units
for Natural Language Processing. The technique on the problem
of Text Summarization (TS) is tested. Extractive TS relies on the concept of
sentence salience to identify the most important sentences in a
document or set of documents. Salience is typically defined in terms of
the presence of particular important words or in terms of similarity to
a centroid pseudo-sentence.
|
|
r-cran-snowballc
Snowball stemmers based on the C libstemmer UTF-8 library
|
Versions of package r-cran-snowballc |
Release | Version | Architectures |
buster | 0.6.0-1 | amd64,arm64,armhf,i386 |
sid | 0.7.1-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.7.1-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.7.0-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.7.0-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
An R interface to the C libstemmer library that implements Porter's word
stemming algorithm for collapsing words to a common root to aid
comparison of vocabulary. Currently supported languages are Danish,
Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian,
Portuguese, Romanian, Russian, Spanish, Swedish and Turkish.
|
|
sentencepiece
Unsupervised text tokenizer and detokenizer
|
Versions of package sentencepiece |
Release | Version | Architectures |
sid | 0.2.0-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.2.0-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.1.95-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.1.97-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
SentencePiece is an unsupervised text tokenizer/detokenizer mainly
designed for Neural Network-based text generation systems where the
vocabulary size is predetermined prior to the neural model training.
|
|
timbl
Tilburg Memory Based Learner
|
Versions of package timbl |
Release | Version | Architectures |
bullseye | 6.5-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 6.4.13-1 | amd64,arm64,armhf,i386 |
stretch | 6.4.8-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 6.4.4-4 | amd64,armel,armhf,i386 |
trixie | 6.9-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 6.9-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 6.5-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
upstream | 6.10 |
Debtags of package timbl: |
role | program |
|
License: DFSG free
|
Memory-Based Learning (MBL) is a machine-learning method applicable to a wide
range of tasks in Natural Language Processing (NLP).
The Tilburg Memory Based Learner, TiMBL, is a tool for NLP research, and for
many other domains where classification tasks are learned from examples. It
is an efficient implementation of k-nearest neighbor classifier.
TiMBL's features are:
-
Fast, decision-tree-based implementation of k-nearest neighbor
classification;
-
Implementations of IB1 and IB2, IGTree, TRIBL, and TRIBL2 algorithms;
- Similarity metrics: Overlap, MVDM, Jeffrey Divergence, Dot product, Cosine;
-
Feature weighting metrics: information gain, gain ratio, chi squared,
shared variance;
-
Distance weighting metrics: inverse, inverse linear, exponential decay;
- Extensive verbosity options to inspect nearest neighbor sets;
- Server functionality and extensive API;
- Fast leave-one-out testing and internal cross-validation;
- and Handles user-defined example weighting.
TiMBL is a product of the Centre of Language and Speech Technology
(Radboud University, Nijmegen, The Netherlands), the ILK Research Group
(Tilburg University, The Netherlands) and the CLiPS Research Centre
(University of Antwerp, Belgium).
If you do scientific research in NLP, timbl will likely be of use to you.
|
|
timblserver
Server extensions for Timbl
|
Versions of package timblserver |
Release | Version | Architectures |
bullseye | 1.14-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 1.12-1 | amd64,arm64,armhf,i386 |
stretch | 1.11-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 1.7-4 | amd64,armel,armhf,i386 |
sid | 1.18-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.18-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.14-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
upstream | 1.19 |
Debtags of package timblserver: |
role | program |
|
License: DFSG free
|
timblserver is a TiMBL wrapper; it adds server functionality to TiMBL. It
allows TiMBL to run multiple experiments as a TCP server, optionally via HTTP.
The Tilburg Memory Based Learner, TiMBL, is a tool for Natural Language
Processing research, and for many other domains where classification tasks are
learned from examples.
TimblServer is a product of the ILK Research Group (Tilburg University, The
Netherlands) and the CLiPS Research Centre (University of Antwerp, Belgium).
If you do scientific research in NLP, TimblServer will likely be of use to you.
|
|
ucto
|
Versions of package ucto |
Release | Version | Architectures |
sid | 0.30-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.30-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.21.1-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 0.9.6-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.14-2 | amd64,arm64,armhf,i386 |
bullseye | 0.21.1-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 0.5.3-3.1 | amd64,armel,armhf,i386 |
upstream | 0.35 |
Debtags of package ucto: |
role | program |
|
License: DFSG free
|
Ucto can tokenize UTF-8 encoded text files (i.e. separate words from
punctuation, split sentences, generate n-grams), and offers several other
basic preprocessing steps that make your text suited for further processing
such as indexing, part-of-speech tagging, or machine translation.
This package provides the command-line tool itself.
Ucto was written by Maarten van Gompel and Ko van der Sloot. Work on Ucto
was funded by NWO, the Netherlands Organisation for Scientific Research,
under the Implicit Linguistics project, the CLARIN-NL program, and the
CLARIAH project.
Ucto is a product of the Centre of Language and Speech Technology (Radboud
University Nijmegen), and previously the ILK Research Group
(Tilburg University, The Netherlands).
If you are interested in machine parsing of UTF-8 encoded text files, e.g. to
do scientific research in natural language processing, ucto will likely be of
use to you.
|
|
uctodata
|
Versions of package uctodata |
Release | Version | Architectures |
bookworm | 0.8-2 | all |
bullseye | 0.8-2 | all |
trixie | 0.9.1-1 | all |
sid | 0.9.1-1 | all |
buster | 0.8-2 | all |
stretch | 0.4-1 | all |
upstream | 0.11 |
|
License: DFSG free
|
Ucto can tokenize UTF-8 encoded text files (i.e. separate words from
punctuation, split sentences, generate n-grams), and offers several other
basic preprocessing steps that make your text suited for further processing
such as indexing, part-of-speech tagging, or machine translation.
This package provides necessary language-specific datafiles for running Ucto.
Ucto was written by Maarten van Gompel and Ko van der Sloot. Work on Ucto
was funded by NWO, the Netherlands Organisation for Scientific Research,
under the Implicit Linguistics project, the CLARIN-NL program, and the
CLARIAH project.
Ucto is a product of the Centre of Language and Speech Technology (Radboud
University Nijmegen), and previously the ILK Research Group (Tilburg
University, The Netherlands).
|
|
wordnet
elektronická lexikálna databáza anglického jazyka
|
Versions of package wordnet |
Release | Version | Architectures |
bookworm | 3.0-37 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 3.0-35 | amd64,arm64,armhf,i386 |
trixie | 3.0-38 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 3.0-38 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 3.0-33 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 3.0-33 | amd64,armel,armhf,i386 |
bullseye | 3.0-36 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package wordnet: |
field | linguistics |
interface | x11 |
role | program |
scope | application |
uitoolkit | tk |
use | checking |
works-with | dictionary |
x11 | application |
|
License: DFSG free
|
WordNet(C) je online systém lexikálnych odkazov, ktorého dizajn sa
inšpiruje aktuálnymi psycholinguistickými teóriami ľudskej lexikálnej
pamäti. Anglické podstatné mená, slovesá, prídavné mená a príslovky sú
organizované do množín synoným, z ktorých každá reprezentuje ich lexikálny
koncept. Množiny synoným sú prepojené pomocou rôznych vzťahov.
WordNet vyvinuli v Cognitive Science Laboratory na Princeton University
pod vedením hlavného riešiteľa, ktorým bol profesor George A. Miller.
WordNet sa považuje za najdôležitejší dostupný zdroj pre v oblasti
výskumníkov výpočtovej lingvistiky, analýzy diskurzu a mnohých súvisiacich
oblastiach.
Spustiteľný súbor, manuálové stránky programu WordNet a tiež všeobecné
manuálové stránky.
|
|
Official Debian packages with lower relevance
apertium-af-nl
Transitional dummy package for apertium-afr-nld
|
Versions of package apertium-af-nl |
Release | Version | Architectures |
buster | 0.2.0~r58256-2 | all |
stretch | 0.2.0~r58256-1 | all |
bullseye | 0.3.0-2 | all |
bookworm | 0.3.0-3 | all |
trixie | 0.3.0-3 | all |
sid | 0.3.0-3 | all |
|
License: DFSG free
|
This is a transitional dummy package. It can safely be removed.
|
|
apertium-apy
|
Versions of package apertium-apy |
Release | Version | Architectures |
stretch | 0.9.1~r343-2 | all |
buster | 0.11.4-2 | all |
bullseye | 0.11.7-2 | all |
bookworm | 0.11.7-2.1 | all |
trixie | 0.11.7-2.2 | all |
sid | 0.11.7-2.2 | all |
upstream | 0.12.1 |
|
License: DFSG free
|
Tento balík obsahuje Apertium APY, čo je jednoduché API Apertium napísané v
jazyku Python 3 určené ako okamžitá náhrada za ScaleMT.
|
|
apertium-arg
Apertium - dáta jednotlivého jazyka - aragónčina
|
Versions of package apertium-arg |
Release | Version | Architectures |
buster | 0.1.2~r65494-2 | all |
stretch | 0.1.2~r65494-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium v jazyku aragónčina.
|
|
apertium-arg-cat
Apertium - prekladové dáta dvojice aragónčina-katalánčina
|
Versions of package apertium-arg-cat |
Release | Version | Architectures |
sid | 0.3.0-2 | all |
trixie | 0.3.0-2 | all |
bullseye | 0.2.0-2 | all |
bookworm | 0.2.0-3 | all |
buster | 0.1.0~r64925-2 | all |
stretch | 0.1.0~r64925-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
aragónčina a katalánčina.
|
|
apertium-bel
Apertium single language data for Belarusian
|
Versions of package apertium-bel |
Release | Version | Architectures |
buster | 0.1.0~r81357-2 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Belarusian.
|
|
apertium-bel-rus
Apertium translation data for the Belarusian-Russian pair
|
Versions of package apertium-bel-rus |
Release | Version | Architectures |
bullseye | 0.2.1-1 | all |
sid | 0.2.1-2 | all |
trixie | 0.2.1-2 | all |
bookworm | 0.2.1-2 | all |
buster | 0.2.0~r81186-2 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for translating
between the Belarusian and Russian languages.
|
|
apertium-br-fr
Apertium - prekladové dáta dvojice bretónčina-francúzština
|
Versions of package apertium-br-fr |
Release | Version | Architectures |
sid | 0.5.1-1 | all |
stretch | 0.5.0~r61325-2 | all |
buster | 0.5.0~r61325-3 | all |
bullseye | 0.5.1-1 | all |
bookworm | 0.5.1-1 | all |
trixie | 0.5.1-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
bretónčina a francúzština.
|
|
apertium-ca-it
Transitional dummy package for apertium-cat-ita
|
Versions of package apertium-ca-it |
Release | Version | Architectures |
bookworm | 0.2.2-1 | all |
trixie | 1.1.0-1 | all |
sid | 1.1.0-1 | all |
buster | 0.1.1~r57554-2 | all |
stretch | 0.1.1~r57554-1 | all |
bullseye | 0.2.1-3 | all |
|
License: DFSG free
|
This is a transitional dummy package. It can safely be removed.
|
|
apertium-cat
Apertium single language data for Catalan
|
Versions of package apertium-cat |
Release | Version | Architectures |
buster | 2.6.0-1 | all |
stretch | 1.0.0~r65787-1 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Catalan.
|
|
apertium-cat-srd
Apertium translation data for the Catalan-Sardinian pair
|
Versions of package apertium-cat-srd |
Release | Version | Architectures |
buster | 1.0.0~r82995-2 | all |
sid | 1.2.0-1 | all |
trixie | 1.2.0-1 | all |
bookworm | 1.1.0-2 | all |
bullseye | 1.1.0-1 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for translating
between the Catalan and Sardinian languages.
|
|
apertium-crh
Apertium single language data for Crimean Tatar
|
Versions of package apertium-crh |
Release | Version | Architectures |
buster | 0.2.0~r83161-2 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Crimean Tatar
|
|
apertium-crh-tur
Apertium translation data for the Crimean Tatar-Turkish pair
|
Versions of package apertium-crh-tur |
Release | Version | Architectures |
buster | 0.3.0~r83159-2 | all |
bullseye | 0.3.0-1 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for translating
between the Crimean Tatar and Turkish languages.
|
|
apertium-cy-en
Apertium - prekladové dáta dvojice waleština-angličtina
|
Versions of package apertium-cy-en |
Release | Version | Architectures |
buster | 0.1.1~r57554-4 | all |
bullseye | 0.1.1~r57554-7 | all |
stretch | 0.1.1~r57554-3 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
waleština a angličtina.
|
|
apertium-dan
Apertium single language data for Danish
|
Versions of package apertium-dan |
Release | Version | Architectures |
buster | 0.5.0~r67099-2 | all |
stretch | 0.5.0~r67099-1 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Danish.
|
|
apertium-dan-nor
Apertium translation data for the Danish-Norwegian pair
|
Versions of package apertium-dan-nor |
Release | Version | Architectures |
bullseye | 1.4.1-2 | all |
stretch | 1.3.0~r67099-1 | all |
trixie | 1.5.0-2 | all |
bookworm | 1.5.0-2 | all |
sid | 1.5.0-2 | all |
buster | 1.3.0~r67099-2 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for translating
from the Danish to the Norwegian Nynorsk/Norwegian Bokmål variants
and from Danish to Norwegian Nynorsk.
|
|
apertium-en-ca
Transitional dummy package for apertium-eng-cat
|
Versions of package apertium-en-ca |
Release | Version | Architectures |
stretch | 0.9.3~r61328-1 | all |
jessie | 0.8.9-1 | amd64,armel,armhf,i386 |
bullseye | 1.0.1-4 | all |
bookworm | 1.0.1-5 | all |
trixie | 1.0.1-5 | all |
sid | 1.0.1-5 | all |
buster | 0.9.3~r61328-2 | all |
Debtags of package apertium-en-ca: |
culture | catalan |
field | linguistics |
role | app-data |
|
License: DFSG free
|
This is a transitional dummy package. It can safely be removed.
|
|
apertium-en-es
Transitional dummy package for apertium-eng-spa
|
Versions of package apertium-en-es |
Release | Version | Architectures |
jessie | 0.6.0-1.1 | amd64,armel,armhf,i386 |
buster | 0.8.0~r57502-4 | all |
stretch | 0.8.0~r57502-2 | all |
bullseye | 0.8.0~r57502-5 | all |
sid | 0.8.1-2 | all |
trixie | 0.8.1-2 | all |
bookworm | 0.8.1-2 | all |
Debtags of package apertium-en-es: |
culture | spanish |
field | linguistics |
role | app-data |
|
License: DFSG free
|
This is a transitional dummy package. It can safely be removed.
|
|
apertium-en-gl
Apertium - prekladové dáta dvojice angličtina-galícijčina
|
Versions of package apertium-en-gl |
Release | Version | Architectures |
buster | 0.5.2~r57551-2 | all |
bullseye | 0.5.2~r57551-3 | all |
sid | 0.5.4-2 | all |
bookworm | 0.5.4-1 | all |
stretch | 0.5.2~r57551-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
angličtina a galícijčina.
|
|
apertium-eo-ca
Apertium - prekladové dáta dvojice esperanto-katalánčina
|
Versions of package apertium-eo-ca |
Release | Version | Architectures |
stretch | 0.9.1~r60655-1 | all |
jessie | 0.9.0-1.1 | amd64,armel,armhf,i386 |
bookworm | 0.9.2-1 | all |
buster | 0.9.1~r60655-3 | all |
trixie | 0.9.2-1 | all |
sid | 0.9.2-1 | all |
bullseye | 0.9.2-1 | all |
Debtags of package apertium-eo-ca: |
culture | catalan, esperanto |
field | linguistics |
role | app-data |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
esperanto a katalánčina.
|
|
apertium-eo-en
Apertium linguistic data to translate between Esperanto and English
|
Versions of package apertium-eo-en |
Release | Version | Architectures |
bookworm | 1.0.2-1 | all |
bullseye | 1.0.0~r63833-3 | all |
stretch | 1.0.0~r63833-1 | all |
buster | 1.0.0~r63833-2 | all |
sid | 1.0.2-1 | all |
|
License: DFSG free
|
This is a linguistic package for the Apertium shallow-transfer
machine translation system. The package can be used to translate
between Esperanto and English.
|
|
apertium-eo-es
Apertium - prekladové dáta dvojice esperanto-španielčina
|
Versions of package apertium-eo-es |
Release | Version | Architectures |
buster | 0.9.1~r60655-3 | all |
stretch | 0.9.1~r60655-1 | all |
sid | 0.9.2-1 | all |
trixie | 0.9.2-1 | all |
jessie | 0.9.0-1.1 | amd64,armel,armhf,i386 |
bullseye | 0.9.1~r60655-4 | all |
bookworm | 0.9.2-1 | all |
Debtags of package apertium-eo-es: |
culture | esperanto, spanish |
field | linguistics |
role | app-data |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
esperanto a španielčina.
|
|
apertium-eo-fr
Apertium - prekladové dáta dvojice esperanto-francúzština
|
Versions of package apertium-eo-fr |
Release | Version | Architectures |
sid | 0.9.1-1 | all |
bullseye | 0.9.1-1 | all |
bookworm | 0.9.1-1 | all |
trixie | 0.9.1-1 | all |
buster | 0.9.0~r57551-2 | all |
stretch | 0.9.0~r57551-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
esperanto a francúzština.
|
|
apertium-es-ast
Transitional dummy package for apertium-spa-ast
|
Versions of package apertium-es-ast |
Release | Version | Architectures |
bookworm | 1.1.1-2 | all |
bullseye | 1.1.0~r51165-3 | all |
buster | 1.1.0~r51165-2 | all |
stretch | 1.1.0~r51165-1 | all |
sid | 1.1.1-2 | all |
trixie | 1.1.1-2 | all |
|
License: DFSG free
|
This is a transitional dummy package. It can safely be removed.
|
|
apertium-es-ca
prechodový fiktívny balík na apertium-spa-cat
|
Versions of package apertium-es-ca |
Release | Version | Architectures |
sid | 2.2.0-3 | all |
bullseye | 2.2.0-2 | all |
bookworm | 2.2.0-3 | all |
trixie | 2.2.0-3 | all |
buster | 2.1.0~r79717-2 | all |
stretch | 1.2.1+svn~57448-4 | all |
jessie | 1.1.0-1.1 | amd64,armel,armhf,i386 |
Debtags of package apertium-es-ca: |
culture | catalan, spanish |
field | linguistics |
role | app-data |
|
License: DFSG free
|
Toto je prechodový fiktívny balík. Je možné ho bezpečne odstrániť.
|
|
apertium-es-gl
Apertium - prekladové dáta dvojice španielčina-galícijčina
|
Versions of package apertium-es-gl |
Release | Version | Architectures |
jessie | 1.0.7-1 | amd64,armel,armhf,i386 |
bullseye | 1.0.8~r57542-4 | all |
buster | 1.0.8~r57542-3 | all |
stretch | 1.0.8~r57542-2 | all |
bookworm | 1.0.9-3 | all |
trixie | 1.0.9-3 | all |
sid | 1.0.9-3 | all |
Debtags of package apertium-es-gl: |
culture | galician, spanish |
field | linguistics |
role | app-data |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
španielčina a galícijčina.
|
|
apertium-es-it
prechodový fiktívny balík na apertium-spa-ita
|
Versions of package apertium-es-it |
Release | Version | Architectures |
bullseye | 0.2.0~r78826-2.1 | all |
stretch | 0.1.0~r51165-1 | all |
sid | 0.2.1-3 | all |
bookworm | 0.2.1-3 | all |
trixie | 0.2.1-3 | all |
buster | 0.2.0~r78826-2 | all |
|
License: DFSG free
|
Toto je prechodový fiktívny balík. Je možné ho bezpečne odstrániť.
|
|
apertium-es-pt
Apertium - prekladové dáta dvojice španielčina-portugalčina
|
Versions of package apertium-es-pt |
Release | Version | Architectures |
bookworm | 1.1.6-1 | all |
sid | 1.1.6-1 | all |
trixie | 1.1.6-1 | all |
bullseye | 1.1.5+svn~57507-5 | all |
buster | 1.1.5+svn~57507-4 | all |
stretch | 1.1.5+svn~57507-3 | all |
jessie | 1.0.3-2.1 | amd64,armel,armhf,i386 |
Debtags of package apertium-es-pt: |
culture | esperanto, portuguese, spanish |
field | linguistics |
role | app-data |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
španielčina a portugalčina.
|
|
apertium-es-ro
Apertium - prekladové dáta dvojice španielčina-rumunčina
|
Versions of package apertium-es-ro |
Release | Version | Architectures |
stretch | 0.7.3~r57551-2 | all |
buster | 0.7.3~r57551-3 | all |
sid | 0.7.5-1 | all |
jessie | 0.7.1-2.1 | amd64,armel,armhf,i386 |
bookworm | 0.7.5-1 | all |
bullseye | 0.7.3~r57551-4 | all |
Debtags of package apertium-es-ro: |
culture | romanian, spanish |
field | linguistics |
role | app-data |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
španielčina a rumunčina.
|
|
apertium-eu-en
Apertium - prekladové dáta dvojice baskičtina-angličtina
|
Versions of package apertium-eu-en |
Release | Version | Architectures |
buster | 0.3.1~r56205-2 | all |
bullseye | 0.3.1~r56205-3 | all |
bookworm | 0.3.3-1 | all |
sid | 0.3.3-1 | all |
stretch | 0.3.1~r56205-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
baskičtina a angličtina.
|
|
apertium-eu-es
Apertium - prekladové dáta dvojice baskičtina-španielčina
|
Versions of package apertium-eu-es |
Release | Version | Architectures |
trixie | 0.3.4-1 | all |
jessie | 0.3.1-1 | amd64,armel,armhf,i386 |
stretch | 0.3.3~r56159-2 | all |
buster | 0.3.3~r56159-3 | all |
bullseye | 0.3.3~r56159-4 | all |
bookworm | 0.3.4-1 | all |
sid | 0.3.4-1 | all |
Debtags of package apertium-eu-es: |
culture | basque, spanish |
field | linguistics |
role | app-data |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
baskičtina a španielčina.
|
|
apertium-fr-ca
??? missing short description for package apertium-fr-ca :-(
|
Versions of package apertium-fr-ca |
Release | Version | Architectures |
stretch | 1.1.0~r64309-1 | all |
jessie | 1.0.2-1 | amd64,armel,armhf,i386 |
Debtags of package apertium-fr-ca: |
culture | catalan, french |
field | linguistics |
role | app-data |
|
License: DFSG free
|
|
|
apertium-fr-es
Apertium - prekladové dáta dvojice francúzština-španielčina
|
Versions of package apertium-fr-es |
Release | Version | Architectures |
sid | 0.9.4-1 | all |
trixie | 0.9.4-1 | all |
jessie | 0.9.0-1 | amd64,armel,armhf,i386 |
buster | 0.9.2~r61322-3 | all |
bookworm | 0.9.4-1 | all |
bullseye | 0.9.2~r61322-4 | all |
stretch | 0.9.2~r61322-2 | all |
Debtags of package apertium-fr-es: |
culture | french, spanish |
field | linguistics |
role | app-data |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
francúzština a španielčina.
|
|
apertium-fra
Apertium single language data for French
|
Versions of package apertium-fra |
Release | Version | Architectures |
buster | 1.5.0-1 | all |
stretch | 1.0.0~r65786-1 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for French.
|
|
apertium-fra-cat
Apertium - prekladové dáta dvojice francúzština-katalánčina
|
Versions of package apertium-fra-cat |
Release | Version | Architectures |
bullseye | 1.9.0-1 | all |
bookworm | 1.10.0-1 | all |
stretch | 1.1.0~r64309-1 | all |
buster | 1.5.0-1 | all |
sid | 1.10.0-1 | all |
trixie | 1.10.0-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
francúzština a katalánčina.
|
|
apertium-hbs
Apertium single language data for Serbo-Croatian
|
Versions of package apertium-hbs |
Release | Version | Architectures |
buster | 0.5.0~r68212-3 | all |
stretch | 0.5.0~r68212-2 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Serbo-Croatian.
|
|
apertium-hbs-eng
Apertium - prekladové dáta dvojice srbochorvátčina-angličtina
|
Versions of package apertium-hbs-eng |
Release | Version | Architectures |
buster | 0.1.0~r57598-2 | all |
bullseye | 0.5.1-1 | all |
bookworm | 0.5.1-2 | all |
trixie | 0.5.1-2 | all |
sid | 0.5.1-2 | all |
stretch | 0.1.0~r57598-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
srbochorvátčina a angličtina.
|
|
apertium-hbs-mkd
Apertium - prekladové dáta dvojice srbochorvátčina-macedónčina
|
Versions of package apertium-hbs-mkd |
Release | Version | Architectures |
buster | 0.1.0~r76450-2.1 | all |
bookworm | 0.1.1-1 | all |
bullseye | 0.1.0~r76450-4 | all |
trixie | 0.1.1-1 | all |
sid | 0.1.1-1 | all |
stretch | 0.1.0~r57554-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
srbochorvátčina a macedónčina.
|
|
apertium-hbs-slv
Apertium - prekladové dáta dvojice srbochorvátčina-slovinčina
|
Versions of package apertium-hbs-slv |
Release | Version | Architectures |
sid | 0.5.1-2 | all |
bullseye | 0.5.1-1 | all |
trixie | 0.5.1-2 | all |
buster | 0.1.0~r59294-2 | all |
bookworm | 0.5.1-2 | all |
stretch | 0.1.0~r59294-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
srbochorvátčina a slovinčina.
|
|
apertium-hin
Apertium single language data for Hindi
|
Versions of package apertium-hin |
Release | Version | Architectures |
bookworm | 0.1.0~r59158-4 | all |
buster | 0.1.0~r59158-2 | all |
stretch | 0.1.0~r59158-1 | all |
sid | 0.1.0~r59158-4 | all |
trixie | 0.1.0~r59158-4 | all |
bullseye | 0.1.0~r59158-2.1 | all |
upstream | 0.1.0 |
|
License: DFSG free
|
Data package providing Apertium language resources for Hindi.
|
|
apertium-id-ms
Transitional dummy package for apertium-ind-zlm
|
Versions of package apertium-id-ms |
Release | Version | Architectures |
bookworm | 0.1.2-3 | all |
stretch | 0.1.1~r57551-1 | all |
buster | 0.1.1~r57551-2 | all |
bullseye | 0.1.2-3 | all |
trixie | 0.1.2-3 | all |
sid | 0.1.2-3 | all |
|
License: DFSG free
|
This is a transitional dummy package. It can safely be removed.
|
|
apertium-is-sv
Transitional dummy package for apertium-isl-swe
|
Versions of package apertium-is-sv |
Release | Version | Architectures |
bookworm | 0.1.1-2 | all |
stretch | 0.1.0~r56030-1 | all |
buster | 0.1.0~r76450-2 | all |
bullseye | 0.1.0~r76450-3 | all |
sid | 0.1.1-2 | all |
trixie | 0.1.1-2 | all |
|
License: DFSG free
|
This is a transitional dummy package. It can safely be removed.
|
|
apertium-isl
Apertium single language data for Icelandic
|
Versions of package apertium-isl |
Release | Version | Architectures |
stretch | 0.1.0~r65494-1 | all |
bullseye | 0.1.0~r65494-2.1 | all |
buster | 0.1.0~r65494-2 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Icelandic.
|
|
apertium-isl-eng
Apertium - prekladové dáta dvojice islandčina-angličtina
|
Versions of package apertium-isl-eng |
Release | Version | Architectures |
sid | 0.1.2-1 | all |
bullseye | 0.1.0~r66083-3 | all |
bookworm | 0.1.2-1 | all |
stretch | 0.1.0~r66083-1 | all |
buster | 0.1.0~r66083-2 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
islandčina a angličtina.
|
|
apertium-ita
Apertium single language data for Italian
|
Versions of package apertium-ita |
Release | Version | Architectures |
bullseye | 0.10.0~r82237-2.1 | all |
stretch | 0.9.0~r72553-1 | all |
buster | 0.10.0~r82237-2 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Italian.
|
|
apertium-kaz
Apertium - dáta jednotlivého jazyka - kazaština
|
Versions of package apertium-kaz |
Release | Version | Architectures |
stretch | 0.1.0~r61338-1 | all |
buster | 0.1.0~r61338-2 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium v jazyku kazaština.
|
|
apertium-kaz-tat
Apertium - prekladové dáta dvojice kazaština-tatárčina
|
Versions of package apertium-kaz-tat |
Release | Version | Architectures |
buster | 0.2.1~r57554-2 | all |
bullseye | 0.2.1-1 | all |
stretch | 0.2.1~r57554-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
kazaština a tatárčina.
|
|
apertium-mk-bg
Transitional dummy package for apertium-mkd-bul
|
Versions of package apertium-mk-bg |
Release | Version | Architectures |
trixie | 0.2.1-2 | all |
bookworm | 0.2.1-2 | all |
buster | 0.2.0~r49489-2 | all |
stretch | 0.2.0~r49489-1 | all |
bullseye | 0.2.0~r49489-3 | all |
sid | 0.2.1-2 | all |
|
License: DFSG free
|
This is a transitional dummy package. It can safely be removed.
|
|
apertium-mk-en
Transitional dummy package for apertium-mkd-eng
|
Versions of package apertium-mk-en |
Release | Version | Architectures |
stretch | 0.1.1~r57554-1 | all |
sid | 0.1.3-2 | all |
bookworm | 0.1.3-2 | all |
trixie | 0.1.3-2 | all |
bullseye | 0.1.1~r57554-3 | all |
buster | 0.1.1~r57554-2 | all |
|
License: DFSG free
|
This is a transitional dummy package. It can safely be removed.
|
|
apertium-mlt-ara
Apertium - prekladové dáta dvojice maltčina-arabčina
|
Versions of package apertium-mlt-ara |
Release | Version | Architectures |
buster | 0.2.0~r62623-2 | all |
bullseye | 0.2.0~r62623-2.1 | all |
stretch | 0.2.0~r62623-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
maltčina a arabčina.
|
|
apertium-nno
Apertium single language data for Norwegian Nynorsk
|
Versions of package apertium-nno |
Release | Version | Architectures |
stretch | 0.9.0~r69513-1 | all |
buster | 0.9.0~r69513-3 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Norwegian Nynorsk.
|
|
apertium-nno-nob
Apertium - prekladové dáta dvojice nórsky nynorsk-nórsky bokmål
|
Versions of package apertium-nno-nob |
Release | Version | Architectures |
sid | 1.5.0-1 | all |
buster | 1.1.0~r66076-2 | all |
stretch | 1.1.0~r66076-1 | all |
bullseye | 1.3.0-1 | all |
bookworm | 1.5.0-1 | all |
trixie | 1.5.0-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
nórsky nynorsk a nórsky bokmål.
|
|
apertium-nob
Apertium single language data for Norwegian Bokmål
|
Versions of package apertium-nob |
Release | Version | Architectures |
buster | 0.9.0~r69513-2 | all |
stretch | 0.9.0~r69513-1 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Norwegian Bokmål.
|
|
apertium-oc-ca
Apertium - prekladové dáta dvojice okcitánčina-katalánčina
|
Versions of package apertium-oc-ca |
Release | Version | Architectures |
jessie | 1.0.5-1.1 | amd64,armel,armhf,i386 |
stretch | 1.0.6~r57551-2 | all |
buster | 1.0.6~r57551-3 | all |
bullseye | 1.0.6~r57551-4 | all |
bookworm | 1.0.7-1 | all |
trixie | 1.0.7-1 | all |
sid | 1.0.7-1 | all |
Debtags of package apertium-oc-ca: |
culture | catalan |
field | linguistics |
role | app-data |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
okcitánčina a katalánčina.
|
|
apertium-oc-es
Apertium - prekladové dáta dvojice okcitánčina-španielčina
|
Versions of package apertium-oc-es |
Release | Version | Architectures |
bookworm | 1.0.8-1 | all |
sid | 1.0.8-1 | all |
bullseye | 1.0.6~r57551-4 | all |
buster | 1.0.6~r57551-3 | all |
stretch | 1.0.6~r57551-2 | all |
jessie | 1.0.5-1.1 | amd64,armel,armhf,i386 |
Debtags of package apertium-oc-es: |
culture | spanish |
field | linguistics |
role | app-data |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
okcitánčina a španielčina.
|
|
apertium-oci
Apertium single language data for Occitan
|
Versions of package apertium-oci |
Release | Version | Architectures |
buster | 0.1.0-1 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Occitan.
|
|
apertium-pol
Apertium single language data for Polish
|
Versions of package apertium-pol |
Release | Version | Architectures |
buster | 0.1.1-1 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Polish.
|
|
apertium-pol-szl
Apertium translation data for the Polish-Silesian pair
|
Versions of package apertium-pol-szl |
Release | Version | Architectures |
bullseye | 0.2.1-2 | all |
trixie | 0.2.1-3 | all |
bookworm | 0.2.1-3 | all |
sid | 0.2.1-3 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for translating
between the Polish and Silesian languages.
|
|
apertium-pt-ca
Transitional dummy package for apertium-por-cat
|
Versions of package apertium-pt-ca |
Release | Version | Architectures |
trixie | 0.10.1-2 | all |
jessie | 0.8.1-1 | amd64,armel,armhf,i386 |
bookworm | 0.10.1-2 | all |
stretch | 0.8.2+svn~57507-3 | all |
buster | 0.8.2+svn~57507-4 | all |
sid | 0.10.1-2 | all |
bullseye | 0.10.0-1 | all |
Debtags of package apertium-pt-ca: |
culture | catalan, portuguese |
field | linguistics |
role | app-data |
|
License: DFSG free
|
This is a transitional dummy package. It can safely be removed.
|
|
apertium-pt-gl
Apertium - prekladové dáta dvojice portugalčina-galícijčina
|
Versions of package apertium-pt-gl |
Release | Version | Architectures |
jessie | 0.9.1-1 | amd64,armel,armhf,i386 |
stretch | 0.9.2~r57551-2 | all |
buster | 0.9.2~r57551-3 | all |
bullseye | 0.9.2~r57551-4 | all |
bookworm | 0.9.3-1 | all |
trixie | 0.9.3-1 | all |
sid | 0.9.3-1 | all |
Debtags of package apertium-pt-gl: |
culture | galician, portuguese |
field | linguistics |
role | app-data |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
portugalčina a galícijčina.
|
|
apertium-rus
Apertium single language data for Russian
|
Versions of package apertium-rus |
Release | Version | Architectures |
buster | 0.2.0~r82706-1 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Russian
|
|
apertium-separable
Reordering separable/discontiguous multiwords
|
Versions of package apertium-separable |
Release | Version | Architectures |
buster | 0.3.2-1 | amd64,arm64,armhf,i386 |
sid | 0.6.1-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.3.6-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.6.1-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.6.1-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Apertium module for reordering separable/discontiguous multiwords.
|
|
apertium-sme-nob
Apertium - prekladové dáta dvojice severná saamčina-nórsky bokmål
|
Versions of package apertium-sme-nob |
Release | Version | Architectures |
bullseye | 0.6.1+ds.1-2 | all |
buster | 0.6.0~r61921-2 | all |
stretch | 0.6.0~r61921-1 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
severná saamčina a nórsky bokmål.
|
|
apertium-spa
Apertium single language data for Spanish
|
Versions of package apertium-spa |
Release | Version | Architectures |
buster | 1.1.0~r79716-2 | all |
stretch | 0.1.0~r65494-1 | all |
bullseye | 1.1.0~r79716-2.1 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Spanish.
|
|
apertium-spa-arg
Apertium - prekladové dáta dvojice španielčina-aragónčina
|
Versions of package apertium-spa-arg |
Release | Version | Architectures |
trixie | 0.6.0-2 | all |
buster | 0.4.0~r64399-2 | all |
bookworm | 0.5.0-2 | all |
bullseye | 0.5.0-1 | all |
stretch | 0.4.0~r64399-1 | all |
sid | 0.6.0-2 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
španielčina a aragónčina.
|
|
apertium-srd
Apertium single language data for Sardinian
|
Versions of package apertium-srd |
Release | Version | Architectures |
stretch | 0.9.0~r72792-1 | all |
buster | 1.2.0~r82994-2 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Sardinian.
|
|
apertium-srd-ita
Apertium - prekladové dáta dvojice sardínčina-taliančina
|
Versions of package apertium-srd-ita |
Release | Version | Architectures |
stretch | 0.9.0~r72554-1 | all |
bullseye | 1.1.0-1 | all |
buster | 0.9.5~r82237-2 | all |
trixie | 1.3.0-1 | all |
sid | 1.3.0-1 | all |
bookworm | 1.1.0-2 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
sardínčina a taliančina.
|
|
apertium-swe
Apertium single language data for Swedish
|
Versions of package apertium-swe |
Release | Version | Architectures |
stretch | 0.7.0~r69513-1 | all |
buster | 0.7.0~r69513-2 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Swedish.
|
|
apertium-swe-dan
Apertium - prekladové dáta dvojice švédčina-dánčina
|
Versions of package apertium-swe-dan |
Release | Version | Architectures |
stretch | 0.7.0~r66063-1 | all |
trixie | 0.8.1-3 | all |
sid | 0.8.1-3 | all |
bullseye | 0.8.1-2 | all |
buster | 0.7.0~r66063-2 | all |
bookworm | 0.8.1-3 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
švédčina a dánčina.
|
|
apertium-swe-nor
Apertium - prekladové dáta dvojice švédčina-nórčina
|
Versions of package apertium-swe-nor |
Release | Version | Architectures |
trixie | 0.4.0-1 | all |
sid | 0.4.0-1 | all |
bookworm | 0.4.0-1 | all |
bullseye | 0.3.1-1 | all |
buster | 0.2.0~r69544-2 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
švédčina a nórčina.
|
|
apertium-szl
Apertium single language data for Silesian
|
Versions of package apertium-szl |
Release | Version | Architectures |
buster | 0.1.0-1 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Silesian.
|
|
apertium-tat
Apertium - dáta jednotlivého jazyka - tatárčina
|
Versions of package apertium-tat |
Release | Version | Architectures |
stretch | 0.1.0~r60887-1 | all |
buster | 0.1.0~r60887-2 | all |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium v jazyku tatárčina.
|
|
apertium-tur
Apertium single language data for Turkish
|
Versions of package apertium-tur |
Release | Version | Architectures |
buster | 0.2.0~r83161-2 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Turkish.
|
|
apertium-ukr
Apertium single language data for Ukrainian
|
Versions of package apertium-ukr |
Release | Version | Architectures |
buster | 0.1.0~r82563-2 | all |
|
License: DFSG free
|
Data package providing Apertium language resources for Ukrainian.
|
|
apertium-urd
Apertium single language data for Urdu
|
Versions of package apertium-urd |
Release | Version | Architectures |
trixie | 0.1.0~r61311-3 | all |
sid | 0.1.0~r61311-3 | all |
stretch | 0.1.0~r61311-1 | all |
buster | 0.1.0~r61311-2 | all |
bullseye | 0.1.0~r61311-2.1 | all |
bookworm | 0.1.0~r61311-3 | all |
upstream | 0.1.0 |
|
License: DFSG free
|
Data package providing Apertium language resources for Urdu.
|
|
apertium-urd-hin
Apertium - prekladové dáta dvojice urdčina-hindčina
|
Versions of package apertium-urd-hin |
Release | Version | Architectures |
sid | 0.1.0~r64379-4 | all |
trixie | 0.1.0~r64379-4 | all |
stretch | 0.1.0~r64379-1 | all |
bullseye | 0.1.0~r64379-2.1 | all |
buster | 0.1.0~r64379-2 | all |
bookworm | 0.1.0~r64379-4 | all |
upstream | 0.1.0 |
|
License: DFSG free
|
Balík dát, ktorý poskytuje zdroje Apertium na preklad medzi jazykmi
urdčina a hindčina.
|
|
frogdata
|
Versions of package frogdata |
Release | Version | Architectures |
buster | 0.16-1 | all |
bookworm | 0.18-2 | all |
trixie | 0.22-1 | all |
jessie | 0.4-1 | all |
bullseye | 0.18-1 | all |
sid | 0.22-1 | all |
stretch | 0.13-1 | all |
|
License: DFSG free
|
Frog is a modular system integrating a morphosyntactic tagger, lemmatizer,
morphological analyzer, and dependency parser for the Dutch language.
This package provided necessary datafiles for running Frog.
Frog is a product of the Centre for Language and Speech Technology
(Radboud University, Nijmegen) and prior to that of ILK Research Group
(Tilburg University, The Netherlands) and the CLiPS Research Centre
(University of Antwerp, Belgium). It is currently maintained at the
KNAW Humanities Cluster.
|
|
libapache-opennlp-java
machine learning based toolkit for the processing of natural language text
|
Versions of package libapache-opennlp-java |
Release | Version | Architectures |
bullseye | 1.9.3-1 | all |
trixie | 2.5.1-1 | all |
bookworm | 2.1.0-1 | all |
sid | 2.5.1-1 | all |
|
License: DFSG free
|
The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging, named
entity extraction, chunking, parsing, and coreference resolution. These tasks
are usually required to build more advanced text processing services. OpenNLP
also included maximum entropy and perceptron based machine learning.
|
|
libcg3-dev
Headers and shared files to develop using the CG-3 library
|
Versions of package libcg3-dev |
Release | Version | Architectures |
trixie | 1.4.6-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.3.9-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 0.9.9~r11624-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 1.4.6-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 1.3.2-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 1.1.7-1 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
Development files to use the CG-3 API.
It is recommended to instrument the CLI tools instead of using this API.
See https://visl.sdu.dk/cg3.html for more documentation
|
|
libfasttext-dev
|
Versions of package libfasttext-dev |
Release | Version | Architectures |
bookworm | 0.9.2+ds-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.9.2+ds-7 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.9.2+ds-7 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.9.2-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
fastText is a library for efficient learning of word representations
and sentence classification, which refers subword information to
enrich word vectors.
This package contains header files for development.
|
|
libfolia-dev
Implementation of the FoLiA document format (C++ headers)
|
Versions of package libfolia-dev |
Release | Version | Architectures |
trixie | 2.17-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 1.6-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 2.17-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1.15-1 | amd64,arm64,armhf,i386 |
jessie | 0.10-4.2 | amd64,armel,armhf,i386 |
bullseye | 2.4-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 2.4-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
upstream | 2.21 |
Debtags of package libfolia-dev: |
devel | library |
role | devel-lib |
|
License: DFSG free
|
FoLiA is an XML-based format for Linguistic Annotation suitable for
representing written language resources such as corpora.
Its goal is to unify a variety of linguistic annotations in one single rich
format, without committing to any particular standard annotation set.
Instead, it seeks to accommodate any desired system or tagset, and so offer
maximum flexibility. This makes FoLiA language independent.
see https://proycon.github.io/folia for more information.
libfolia is a product of the Centre of Language and Speech Technology, Radboud
University Nijmegen (The Netherlands), it was previously developed at the ILK
Research Group, Tilburg University. Work on libfolia is funded by NWO, the
Netherlands Organisation for Scientific Research, in the scope of projects
like CLARIN-NL and CLARIAH.
This package provides the FoLiA header files required to compile C++ programs
that use libfolia and implements FoLiA v2.5.1.
|
|
libmbt-dev
memory-based tagger-generator and tagger - development
|
Versions of package libmbt-dev |
Release | Version | Architectures |
bookworm | 3.6-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 3.4-1 | amd64,arm64,armhf,i386 |
trixie | 3.10-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 3.10-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 3.6-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
upstream | 3.11 |
Debtags of package libmbt-dev: |
devel | library |
role | devel-lib |
|
License: DFSG free
|
MBT is a memory-based tagger-generator and tagger in one. The tagger-generator
part can generate a sequence tagger on the basis of a training set of tagged
sequences; the tagger part can tag new sequences. MBT can, for instance, be
used to generate part-of-speech taggers or chunkers for natural language
processing.
MBT is a product of the Centre of Language and Speech Technology (Radboud
University Nijmegen, The Netherlands), the ILK Research Group (Tilburg
University, The Netherlands) and the CLiPS Research Centre (University
of Antwerp, Belgium).
If you do scientific research in natural language processing, MBT will
likely be of use to you.
This package provides the header files required to compile C++ programs that
use libmbt.
|
|
libopennlp-maxent-java
OpenNLP Maximum Entropy Package
|
Versions of package libopennlp-maxent-java |
Release | Version | Architectures |
bullseye | 3.0.0+ds-2 | all |
trixie | 3.0.0+ds-2 | all |
sid | 3.0.0+ds-2 | all |
bookworm | 3.0.0+ds-2 | all |
|
License: DFSG free
|
Maximum entropy is a powerful method for constructing statistical models of
classification tasks, such as part of speech tagging in Natural Language
Processing. Several example applications using maxent can be found in the
OpenNLP Tools Library.
|
|
libsentencepiece-dev
Header files of SentencePiece
|
Versions of package libsentencepiece-dev |
Release | Version | Architectures |
bookworm | 0.1.97-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.2.0-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.2.0-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.1.95-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
SentencePiece is an unsupervised text tokenizer/detokenizer mainly
designed for Neural Network-based text generation systems where the
vocabulary size is predetermined prior to the neural model training.
|
|
libticcutils-dev
utility functions used in the context of Natural Language Processing (headers)
|
Versions of package libticcutils-dev |
Release | Version | Architectures |
buster | 0.20-1 | amd64,arm64,armhf,i386 |
sid | 0.34-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.34-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.24-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.24-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
upstream | 0.36 |
Debtags of package libticcutils-dev: |
devel | library |
role | devel-lib |
|
License: DFSG free
|
The TiCC utils C++ library contains useful functions and other goodies for
general use in TiMBL and other parts of the TiCC software stack and beyond.
TiCC utils is a product of the Tilburg centre for Cognition and Communication
(Tilburg University, The Netherlands). If you do scientific research in
Natural Language Processing, TiCC software will likely be of use to you.
This package provides the header files required to compile C++ programs
that use libticcutils.
|
|
libticcutils2-dev
??? missing short description for package libticcutils2-dev :-(
|
Versions of package libticcutils2-dev |
Release | Version | Architectures |
stretch | 0.14-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.4-5.1 | amd64,armel,armhf,i386 |
upstream | 0.36 |
Debtags of package libticcutils2-dev: |
devel | library |
role | devel-lib |
|
License: DFSG free
|
|
|
libtimbl-dev
Tilburg Memory Based Learner - development
|
Versions of package libtimbl-dev |
Release | Version | Architectures |
bookworm | 6.5-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 6.4.13-1 | amd64,arm64,armhf,i386 |
sid | 6.9-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 6.9-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 6.5-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
upstream | 6.10 |
Debtags of package libtimbl-dev: |
devel | library |
role | devel-lib |
|
License: DFSG free
|
The Tilburg Memory Based Learner, TiMBL, is a tool for Natural Language
Processing research, and for many other domains where classification tasks are
learned from examples. It is an efficient implementation of k-nearest neighbor
classifier.
TiMBL is a product of the Centre of Language and Speech Technology
(Radboud University, Nijmegen, The Netherlands), the ILK Research Group
(Tilburg University, The Netherlands) and the CLiPS Research Centre
(University of Antwerp, Belgium).
This package provides the TiMBL header files required to compile C++ programs
that use TiMBL.
|
|
libtimblserver-dev
Server extensions for Timbl - development
|
Versions of package libtimblserver-dev |
Release | Version | Architectures |
sid | 1.18-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.14-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.14-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 1.12-1 | amd64,arm64,armhf,i386 |
trixie | 1.18-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
upstream | 1.19 |
Debtags of package libtimblserver-dev: |
devel | library |
role | devel-lib |
|
License: DFSG free
|
timblserver is a TiMBL wrapper; it adds server functionality to TiMBL. It
allows TiMBL to run multiple experiments as a TCP server, optionally via HTTP.
The Tilburg Memory Based Learner, TiMBL, is a tool for Natural Language
Processing research, and for many other domains where classification tasks are
learned from examples.
TimblServer is a product of the ILK Research Group (Tilburg University, The
Netherlands) and the CLiPS Research Centre (University of Antwerp, Belgium).
This package provides the header files required to compile C++ programs that
use timblserver.
|
|
libucto-dev
Unicode Tokenizer - development
|
Versions of package libucto-dev |
Release | Version | Architectures |
bullseye | 0.21.1-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 0.9.6-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.21.1-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 0.5.3-3.1 | amd64,armel,armhf,i386 |
trixie | 0.30-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.30-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 0.14-2 | amd64,arm64,armhf,i386 |
upstream | 0.35 |
Debtags of package libucto-dev: |
devel | library |
role | devel-lib |
|
License: DFSG free
|
Ucto can tokenize UTF-8 encoded text files (i.e. separate words from
punctuation, split sentences, generate n-grams), and offers several other
basic preprocessing steps that make your text suited for further processing
such as indexing, part-of-speech tagging, or machine translation.
This package provides C++ headers for the programming library.
Ucto was written by Maarten van Gompel and Ko van der Sloot. Work on Ucto
was funded by NWO, the Netherlands Organisation for Scientific Research,
under the Implicit Linguistics project, the CLARIN-NL program, and the
CLARIAH project.
Ucto is a product of the Centre of Language and Speech Technology (Radboud
University Nijmegen), the KNAW Humanities Cluster, and previously the ILK
Research Group (Tilburg University, The Netherlands).
If you are interested in machine parsing of UTF-8 encoded text files, e.g. to
do scientific research in natural language processing, ucto will likely be of
use to you.
|
|
python3-fasttext
fastText binding for Python3
|
Versions of package python3-fasttext |
Release | Version | Architectures |
trixie | 0.9.2+ds-7 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.9.2+ds-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.9.2+ds-7 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.9.2-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
fastText is a library for efficient learning of word representations
and sentence classification, which refers subword information to
enrich word vectors.
python3-fasttext is its binding for Python3.
|
|
python3-gensim
Python framework for fast Vector Space Modelling
|
Versions of package python3-gensim |
Release | Version | Architectures |
sid | 4.3.3+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 4.2.0+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Gensim is a Python library for topic modelling, document indexing
and similarity retrieval with large corpora. The target audience
is the natural language processing (NLP) and information retrieval
(IR) community.
|
|
python3-nltk
Python3 libraries for natural language processing
|
Versions of package python3-nltk |
Release | Version | Architectures |
sid | 3.9.1-2 | all |
jessie | 3.0.0-1 | all |
stretch | 3.2.1-2 | all |
buster | 3.4-1 | all |
bullseye | 3.5-1 | all |
bookworm | 3.8-1 | all |
trixie | 3.9.1-2 | all |
|
License: DFSG free
|
The Natural Language Toolkit (NLTK) is a leading platform for building
Python programs to work with human language data. It provides easy-to-use
interfaces to over 50 corpora and lexical resources such as WordNet,
along with a suite of text processing libraries for classification,
tokenization, stemming, tagging, parsing, and semantic reasoning.
This package contains the modules for Python3.
Please cite:
Steven Bird, Ewan Klein and Edward Loper:
(2009)
|
|
python3-sentencepiece
SentencePiece binding for Python3
|
Versions of package python3-sentencepiece |
Release | Version | Architectures |
trixie | 0.2.0-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.1.95-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.2.0-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.1.97-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
SentencePiece is an unsupervised text tokenizer/detokenizer mainly
designed for Neural Network-based text generation systems where the
vocabulary size is predetermined prior to the neural model training.
python3-sentencepiece is its binding for Python3.
|
|
python3-snowballstemmer
Pure Python Snowball stemming library
|
Versions of package python3-snowballstemmer |
Release | Version | Architectures |
bookworm | 2.2.0-2 | all |
bullseye | 2.1.0-1 | all |
stretch | 1.2.1-1 | all |
buster | 1.2.1-1 | all |
sid | 2.2.0-4 | all |
trixie | 2.2.0-4 | all |
|
License: DFSG free
|
Snowball provides access to efficient algorithms for calculating a
"stemmed" form of a word. This is a form with most of the common
morphological endings removed; hopefully representing a common
linguistic base form. This is most useful in building search engines
and information retrieval software; for example, a search with stemming
enabled should be able to find a document containing "cycling" given the
query "cycles".
Snowball provides algorithms for several (mainly European) languages.
It also provides access to the classic Porter stemming algorithm for
English: although this has been superseded by an improved algorithm, the
original algorithm may be of interest to information retrieval
researchers wishing to reproduce results of earlier experiments.
This package contains the pure Python module that implements Snowball
algorithms. When python3-stemmer package (which contains the C extension)
is installed, it uses that extension instead of the pure Python code.
|
|
python3-streamparser
Python library to parse Apertium stream format
|
Versions of package python3-streamparser |
Release | Version | Architectures |
sid | 5.0.2-2 | all |
trixie | 5.0.2-2 | all |
buster | 5.0.2-1 | all |
bullseye | 5.0.2-2 | all |
bookworm | 5.0.2-2 | all |
|
License: DFSG free
|
This package provides Python 3 library, streamparser, to parse
Apertium stream format.
|
|
r-cran-nlp
Natural Language Processing Infrastructure for R
|
Versions of package r-cran-nlp |
Release | Version | Architectures |
sid | 0.2-1-1 | all |
bookworm | 0.2-1-1 | all |
bullseye | 0.2-1-1 | all |
buster | 0.2-0-1 | all |
stretch-backports | 0.2-0-1~bpo9+1 | all |
stretch | 0.1-9-1 | all |
trixie | 0.2-1-1 | all |
upstream | 0.3-2 |
|
License: DFSG free
|
Basic classes and methods for Natural Language Processing in R.
|
|
r-cran-tm
Text Mining functionality for R
|
Versions of package r-cran-tm |
Release | Version | Architectures |
trixie | 0.7-14-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch-backports | 0.7-6-1~bpo9+1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.7-6-1 | amd64,arm64,armhf,i386 |
stretch | 0.6-2-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 0.7-14-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.7-8-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.7-11-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
upstream | 0.7-15 |
|
License: DFSG free
|
A framework for text mining applications within R.
|
|
tfdocgen
TiLP framework documentation generator
|
Versions of package tfdocgen |
Release | Version | Architectures |
bullseye | 1.0-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 1.0-1 | amd64,armel,armhf,i386 |
stretch | 1.0-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 1.0-2 | amd64,arm64,armhf,i386 |
bookworm | 1.0-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.0-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.0-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package tfdocgen: |
devel | docsystem |
role | program |
|
License: DFSG free
|
The tfdocgen program is a program used by the libti2 libraries to generate
their HTML documentation from sources and misc files. You don't need this
package unless you want to develop on the libti2 libraries.
|
|
Packaging has started and developers might try the packaging code in VCS
spacy
Industrial-strength Natural Language Processing (NLP)
|
Versions of package spacy |
Release | Version | Architectures |
VCS | 2.2.3-1 | all |
|
License: MIT
Debian package not available
Version: 2.2.3-1
|
spaCy is a library for advanced Natural Language Processing in Python
and Cython. It’s built on the very latest research, and was designed
from day one to be used in real products. spaCy comes with pre-trained
statistical models and word vectors, and currently supports tokenization
for 30+ languages. It features the fastest syntactic parser in the
world, convolutional neural network models for tagging, parsing and
named entity recognition and easy deep learning integration.
|
travatar
tree based machine translation toolkit
|
Versions of package travatar |
Release | Version | Architectures |
VCS | 0.1.0+git20131221-1 | all |
|
License: LGPL-3.0+
Debian package not available
Version: 0.1.0+git20131221-1
|
Travatar is tree based statistical machine translation system containing
Tree-to-String (T2S) and Forest-to-String (F2S).
Tree based translation uses syntax trees of natural language and it's
particularly effective for language pairs that require a large amount of
reordering, such as English-Japanese translation.
|
No known packages available but some record of interest (WNPP bug)
Python bindings for the Tilburg Memory Based Learner (Timbl)
|
|
License: unknown
Debian package not available
|
python-timbl is a Python extension module wrapping the full TiMBL C++
programming interface. With this module, all functionality exposed
through the C++ interface is also available to Python scripts. Being
able to access the API from Python greatly facilitates prototyping
TiMBL-based applications.
TiMBL is an open source software package implementing several
memory-based learning algorithms, among which IB1-IG, an
implementation of k-nearest neighbor classification with feature
weighting suitable for symbolic feature spaces, and IGTree, a
decision-tree approximation of IB1-IG. All implemented algorithms have
in common that they store some representation of the training set
explicitly in memory. During testing, new cases are classified by
extrapolation from the most similar stored cases.
The Python module offers both a high-level as well as a low-level
interface, the former is very Pythonic and easy to use while the
latter offers the full API.
|
No known packages available
wnsqlbuilder
SQL version of WordNet 3.0
|
|
License: GPL
Debian package not available
|
WordNet SQL Builder is a Java utility to generate SQL database from
WordNet standard database as released by the WordNet Project (Princeton
University)
Features
- Support for MySql and PostGreSQL.
- Complete port (however, orphaned morphological forms are dropped, and
so are VerbNet/XWordNet data that cannot be linked to WordNet entries).
- Incremental build support.
- Retains synset index as primary key allowing easy reference to wordnet
original database
- Includes support for WordNet 3.0
- Includes support for WordNet 2.0 to 2.1, 2.1 to 3.0, 2.0 to 3.0 sense maps
- Includes support for VerbNet 2.3
- Includes support for XWordNet 2.0-1.1
- Ready-to-use database (see wnsqldatabase package in download section) including
- WordNet 3.0
- WordNet 2.0 to 2.1, 2.1 to 3.0, 2.0 to 3.0 sense maps
- VerbNet 2.3
- XWordNet 2.0-1.1
- British National Corpus statistical data (for commonly used-words)
|
|