Debian Science Project
Summary
Data Management
Debian Science Data Management-pakker

Denne metapakke vil installere pakker til at assistere med datahåndteringsopgaver, såsom indhentelse af data fra eksterne ressourcer, holde data under versionskontrol etc.

Description

For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:

If you discover a project which looks like a good candidate for Debian Science to you, or if you have prepared an unofficial Debian package, please do not hesitate to send a description of that project to the Debian Science mailing list

Links to other tasks

Debian Science Data Management packages

Official Debian packages with high relevance

datalad
Håndtering af og distributionsplatform for datafiler
Versions of package datalad
ReleaseVersionArchitectures
sid0.19.6-2all
stretch0.4.1-1all
trixie0.19.6-2all
bookworm0.18.1-2all
bullseye0.14.0-1all
buster0.11.2-2all
upstream1.0.2
Popcon: 42 users (5 upd.)*
Newer upstream!
License: DFSG free
Git

DataLad er en datahåndterings- og distributionsplatform, som tilbyder adgang til en bred vifte af dataressourcer allerede tilgængelige på nettet. Bruger git-annex som motor for datalogistik og tilbyder de følgende indbyggede eller tilgængelige funktioner via udvidelser:

  • grænseflader for kommandolinjen og Python til at manipulere samlinger af datasæt (installer, fjern, opdater, udgiv, gem etc.)og separate filer/mapper (add, get)
  • udtræk, saml og søg igennem diverse metadatakilder (xmp, EXIF, etc; installer datalad-neuroimaging for DICOM-, BIDS-, NIfTI-understøttelse)
  • gennemløb internetsider for automatisk at forberede og opdatere git-annex-arkiver med indhold fra internetsider, S3 etc. (installer datalad-crawler)
datalad-container
DataLad-udvidelse til arbejdet med containermiljøer
Maintainer: Yaroslav Halchenko
Versions of package datalad-container
ReleaseVersionArchitectures
sid1.2.5-1all
buster0.2.2-2all
bullseye1.1.2-1all
bookworm1.1.9-1all
trixie1.2.5-1all
Popcon: 4 users (3 upd.)*
Versions and Archs
License: DFSG free

Denne udvidelse forbedrer DataLad (http://datalad.org) til arbejdet med beregningscontainere.

git-annex
Håndter filer med git, uden at tjekke deres indhold ind i git
Versions of package git-annex
ReleaseVersionArchitectures
sid10.20240129-1amd64,arm64,i386,mips64el,ppc64el,riscv64,s390x
bullseye8.20210223-2amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
buster-backports8.20200330-1~bpo10+1amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
buster7.20190129-3amd64,arm64,armhf,i386
stretch-backports7.20190129-2~bpo9+1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
stretch-backports7.20181211-2~bpo9+1mips
stretch-backports6.20180913-1~bpo9+1mipsel
stretch6.20170101-1+deb9u2amd64,arm64,i386,mips,mips64el,mipsel,ppc64el,s390x
stretch-security6.20170101-1+deb9u1amd64,i386
jessie-security5.20141125+oops-1+deb8u2amd64,armel,armhf,i386
jessie5.20141125+deb8u1amd64,armel,armhf,i386
trixie10.20240129-1amd64,arm64,i386,mips64el,ppc64el,s390x
bookworm-backports10.20240129-1~bpo12+1amd64,arm64,armel,i386,mips64el,mipsel,ppc64el,s390x
bookworm10.20230126-3amd64,arm64,i386,mips64el,mipsel,ppc64el,s390x
Debtags of package git-annex:
develrcs
roleprogram
works-withfile
Popcon: 426 users (33 upd.)*
Versions and Archs
License: DFSG free
Git

Git-annex tillader håndtering af filer med git, uden at lagre filindholdet i git. Programmet kan synkronisere, lave sikkerhedskopi og arkivere dine data, lokalt eller på nettet. Kontrolsummer og kryptering holder dine data sikre. Hent kraften og den distribuerede natur i git til dine store filer med git-annex.

Programmet kan lagre store filer på mange steder, fra lokale harddiske til et stort antal skytjenester, inklusive S3, WebDAV og rsync, med et stort antal skyleverandører via udvidelsesmoduler. Filer kan lagres krypteret med gpg, så at skyleverandøren ikke kan se dine data. Git-annex holder styr på hvor hver fil er lagret, så den ved hvor mange kopier, der er tilgængelige, og har mange faciliteter til at sikre at dine data bevares.

Git-annex kan også bruges til at holde en mappe i synkronisering mellem computere, ved at holde øje med hvornår filer ændres og automatisk sende dem til git og overføre dem til andre computere. Netprogrammet for git-annex gør det nemt at opsætte og bruge git-annex på denne måde.

The package is enhanced by the following packages: elpa-git-annex elpa-magit-annex keysafe
Screenshots of package git-annex
hdf5-filter-plugin
Eksterne filtre for HDF5 - LZ4, BZip2, Bitshuffle
Versions of package hdf5-filter-plugin
ReleaseVersionArchitectures
bookworm0.0~git20221111.49e3b65-4amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.0~git20221111.49e3b65-4amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
sid0.0~git20221111.49e3b65-4amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Den eksterne filtermekanisme introduceret med HDF5 1.8.12 tillader at programmer kan udnytte tilpassede filtre ikke indeholdt af HDF5-grundbiblioteket uden at kompilere dit program igen. Denne pakke tilbyder eksterne filtre for HDF5 for

  • Lz4-kompressionalgoritmen
  • BZip2-kompression
hdf5-filter-plugin-blosc-serial
Blokering, blanding og kompressionsbibliotek uden kvalitetstab
Versions of package hdf5-filter-plugin-blosc-serial
ReleaseVersionArchitectures
sid0.0~git20220616.9683f7d-5amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie0.0~git20220616.9683f7d-5amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
bookworm0.0~git20220616.9683f7d-5amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Popcon: 0 users (14 upd.)*
Versions and Archs
License: DFSG free
Git

Denne pakke indeholder et filter for HDF5, der bruger Blosc-kompressoren. Ved at installere dette filter, så kan du læse og skrive HDF5-filer med Blosc-komprimerede datasæt.

hdf5-filter-plugin-zfp-serial
Kompressionsudvidelsesmodul for HDF5-biblioteket via ZFP-kompression
Versions of package hdf5-filter-plugin-zfp-serial
ReleaseVersionArchitectures
sid1.1.1-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64
bookworm1.1.0+git20221021-4amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el
experimental1.1.0+git20230428-0+exp2amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie1.1.1-2amd64,arm64,armel,armhf,i386,mips64el,ppc64el
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

H5Z-ZFP er et kompressionsfilter for HDF5 via ZFP-kompressionsbiblioteket, der understøtter kompression med og uden kvalitetstab for kommatal og heltal for at møde bithastighed, nøjagtighed og/eller præcisionsmål.

nexus-tools
NeXus scientific data file format - applications
Versions of package nexus-tools
ReleaseVersionArchitectures
trixie4.4.3-6amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
bookworm4.4.3-5amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
jessie4.3.2-svn1921-2amd64,armel,armhf,i386
bullseye4.4.3-5amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid4.4.3-6amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 4 users (6 upd.)*
Versions and Archs
License: DFSG free
Git

NeXus is a common data format for neutron, X-ray, and muon science. It is being developed as an international standard by scientists and programmers representing major scientific facilities in Europe, Asia, Australia, and North America in order to facilitate greater cooperation in the analysis and visualization of neutron, X-ray, and muon data.

This is the package containing some applications for reading and writing NeXus files.

plfit
Tilpasning af power-law-distributioner til empiriske data - grænseflader
Versions of package plfit
ReleaseVersionArchitectures
sid0.9.6+ds-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm0.9.4+ds-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el
trixie0.9.4+ds-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el
Popcon: 2 users (2 upd.)*
Versions and Archs
License: DFSG free
Git

Programmet plfit tilpasser power-law-distributioner til empiriske (diskrete eller sammenhængende) data, jævnfør metoden fra Clauset, Shalizi og Newman [SIAM Review 51, 661-703 (2009)].

Denne pakke tilbyder to kommandolinjeredskaber, plfit og plgen.

The package is enhanced by the following packages: plfit-doc
python3-jdata
JData-koder/afkoder for Python 3
Versions of package python3-jdata
ReleaseVersionArchitectures
sid0.3.6-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bullseye0.3.6-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm0.3.6-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie0.3.6-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
Popcon: 1 users (2 upd.)*
Versions and Archs
License: DFSG free
Git

JData-specifikationen (https://github.com/fangq/jdata/) definerer en simpel sproguafhængig annotationsgrænseflade for data målrettet lagring og deling af komplekse datastrukturer på tværs af forskellige programmeringssprog såsom MATLAB, JavaScript, Python etc. Ved at bruge JDATA-formater kan komplekse Pythonstrukturer kodes som et »dict«-objekt, der nemt kan serialiseres som JSON/binær JSON-fil og dele sådanne data mellem programmer i forskellige sprog.

python3-mdp
Modulært værktøjssæt for databehandling
Versions of package python3-mdp
ReleaseVersionArchitectures
jessie3.3-2all
stretch3.5-1all
bullseye3.6-1.1all
bookworm3.6-2amd64,arm64,mips64el,ppc64el
trixie3.6-7all
sid3.6-7all
Popcon: 10 users (5 upd.)*
Versions and Archs
License: DFSG free
Git

Databehandlingsramme til Python for bygning af komplekse databehandlingsprogrammer ved at kombinere udbredte algoritmer for maskinlæring til datakanaler og netværk. Implementerede algoritmer inkluderer: Principal Component Analysis (PCA), Independent Component Analysis (ICA), Slow Feature Analysis (SFA), Independent Slow Feature Analysis (ISFA), Growing Neural Gas (GNG), Factor Analysis, Fisher Discriminant Analysis (FDA) og gaussiske klassifikationer.

The package is enhanced by the following packages: python3-sklearn
python3-nxs
NeXus scientific data file format - Python 3 binding
Versions of package python3-nxs
ReleaseVersionArchitectures
sid4.4.1-4all
trixie4.4.1-4all
bookworm4.4.1-4all
bullseye4.4.1-3all
Popcon: 2 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

NeXus is a common data format for neutron, X-ray, and muon science. It is being developed as an international standard by scientists and programmers representing major scientific facilities in Europe, Asia, Australia, and North America in order to facilitate greater cooperation in the analysis and visualization of neutron, X-ray, and muon data.

This is the package containing the Python 3 bindings.

python3-pyzoltan
Wrapper for the Zoltan data management library
Versions of package python3-pyzoltan
ReleaseVersionArchitectures
bullseye1.0.1-2+deb11u1amd64,arm64,ppc64el,s390x
bookworm1.0.1-5+deb12u1amd64,arm64,ppc64el,s390x
trixie1.0.1-9amd64,arm64,ppc64el,s390x
sid1.0.1-9amd64,arm64,ppc64el,riscv64,s390x
Popcon: 4 users (9 upd.)*
Versions and Archs
License: DFSG free
Git

PyZoltan is as the name suggests, is a Python wrapper for the Zoltan data management library.

In PyZoltan, only specific routines and objects are wrapped. The following features of Zoltan are currently supported:

  • Dynamic load balancing using geometric algorithms
  • Unstructured point-to-point communication
  • Distributed data directories
virtuoso-opensource
database med høj ydelse
Versions of package virtuoso-opensource
ReleaseVersionArchitectures
bookworm7.2.5.1+dfsg1-0.3all
sid7.2.5.1+dfsg1-0.8all
experimental7.2.12+dfsg-0.1all
jessie6.1.6+dfsg2-2all
buster6.1.6+dfsg2-4all
stretch6.1.6+dfsg2-4all
bullseye7.2.5.1+dfsg1-0.1all
upstream7.2.12
Debtags of package virtuoso-opensource:
rolemetapackage, program
works-withdb
Popcon: 0 users (0 upd.)*
Newer upstream!
License: DFSG free
Git

OpenLink Virtuoso er en objekt-relation SQL-database med høj ydelse. Den tilbyder transaktioner, en smart SQL-compiler, hot backup, SQL:1999-understøttelse, et kraffuldt stored-procedure sprog der understøtter serverside Java eller .NET, og mere. Den understøtter alle væsentlige grænseflader for datatilgang, inklusiv ODBC, JDBC, ADO.NET og OLE/DB.

Virtuoso understøtter SPARQL indlejret i SQL for forespørgsler til RDF-data gemt i sin database. SPARQL har fordel af understøttelse i motoren på lavt niveau, såsom SPARQL-opmærksomme type-casting regler og en dedikeret IRI-datatype.

Installer denne metapakke for den fulde programpakke som udgør Virtuoso OSE (»Open-Source Edition«)

visidata
Hurtigt undersøge kolonnedata i terminalen
Versions of package visidata
ReleaseVersionArchitectures
bullseye2.2.1-1all
bookworm2.11-1all
sid3.0.2-1all
buster1.5.2-1all
trixie3.0.2-1all
Popcon: 33 users (14 upd.)*
Versions and Archs
License: DFSG free
Git

VisiData er et terminalredskab for flere formål til at undersøge, rense, restrukturere og analysere tabeldata. Kilder understøttet i øjeblikket er TSV, CSV, tekst med fast bredde, JSON, SQLite, HTTP, HTML, .xls og .xlsx (Microsoft Excel).

Official Debian packages with lower relevance

libnexus-dev
NeXus scientific data file format - development libraries
Versions of package libnexus-dev
ReleaseVersionArchitectures
sid4.4.3-6amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm4.4.3-5amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie4.4.3-6amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x
bullseye4.4.3-5amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

NeXus is a common data format for neutron, X-ray, and muon science. It is being developed as an international standard by scientists and programmers representing major scientific facilities in Europe, Asia, Australia, and North America in order to facilitate greater cooperation in the analysis and visualization of neutron, X-ray, and muon data.

This is the package containing the development libraries.

libnexus-java
NeXus scientific data file format - java libraries
Versions of package libnexus-java
ReleaseVersionArchitectures
sid4.4.3-6all
bullseye4.4.3-5all
bookworm4.4.3-5all
trixie4.4.3-6all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

NeXus is a common data format for neutron, X-ray, and muon science. It is being developed as an international standard by scientists and programmers representing major scientific facilities in Europe, Asia, Australia, and North America in order to facilitate greater cooperation in the analysis and visualization of neutron, X-ray, and muon data.

This is the package containing the java libraries.

libplfit-dev
Tilpasning af power-law-distributioner til empiriske data - udvikling
Versions of package libplfit-dev
ReleaseVersionArchitectures
sid0.9.6+ds-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm0.9.4+ds-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el
trixie0.9.4+ds-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el
Popcon: 0 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

Programmet plfit tilpasser power-law-distributioner til empiriske (diskrete eller sammenhængende) data, jævnfør metoden fra Clauset, Shalizi og Newman [SIAM Review 51, 661-703 (2009)].

Denne pakke indeholder teksthovedfiler, statiske biblioteker og symbolske henvisninger som udviklere, der bruger biblioteket plfit, skal bruge.

The package is enhanced by the following packages: plfit-doc
python3-openpyxl
Python 3-modul til at læse/skrive OpenXML xlsx/xlsm-filer
Versions of package python3-openpyxl
ReleaseVersionArchitectures
trixie3.1.2+dfsg-6all
bullseye3.0.3-1all
bookworm3.0.9-1all
buster2.4.9-1all
sid3.1.2+dfsg-6all
stretch2.3.0-3all
Popcon: 246 users (291 upd.)*
Versions and Archs
License: DFSG free
Git

Openpyxl er et rent Python 3-modul til at læse/skrive Excel 2007 (OpenXML) xlsx/xlsm-filer.

Denne pakke indeholder selve modulet.

python3-opentsne
t-Distributed Stochastic Neighbor Embedding algorithm
Versions of package python3-opentsne
ReleaseVersionArchitectures
sid1.0.0-1amd64,arm64,armel,armhf,mips64el,ppc64el,riscv64,s390x
sid0.5.0-2i386
upstream1.0.1
Popcon: 0 users (0 upd.)*
Newer upstream!
License: DFSG free
Git

Modular Python implementation of t-Distributed Stochasitc Neighbor Embedding (t-SNE), a popular dimensionality-reduction algorithm for visualizing high-dimensional data sets. openTSNE incorporates the latest improvements to the t-SNE algorithm, including the ability to add new data points to existing embeddings, massive speed improvements, enabling t-SNE to scale to millions of data points and various tricks to improve global alignment of the resulting visualizations.

python3-plfit
Tilpasning af power-law-distributioner til empiriske data - Python
Versions of package python3-plfit
ReleaseVersionArchitectures
sid0.9.6+ds-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie0.9.4+ds-1amd64,arm64,armel,armhf,i386,mips64el,ppc64el
bookworm0.9.4+ds-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Programmet plfit tilpasser power-law-distributioner til empiriske (diskrete eller sammenhængende) data, jævnfør metoden fra Clauset, Shalizi og Newman [SIAM Review 51, 661-703 (2009)].

Denne pakke indeholder et Pythonmodul.

The package is enhanced by the following packages: plfit-doc
*Popularitycontest results: number of people who use this package regularly (number of people who upgraded this package recently) out of 236283