Debian Science Project
Summary
Data management
Debian Science Data Management packages

This metapackage will install packages to assist with data management tasks, such as obtaining data from remote resources, keeping data under version control, etc.

Description

For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:

If you discover a project which looks like a good candidate for Debian Science to you, or if you have prepared an unofficial Debian package, please do not hesitate to send a description of that project to the Debian Science mailing list

Links to other tasks

Debian Science Data management packages

Official Debian packages with high relevance

Datalad
data files management and distribution platform
Versions of package datalad
ReleaseVersionArchitectures
buster0.11.2-2all
sid0.13.5-1all
stretch0.4.1-1all
Popcon: 28 users (17 upd.)*
Versions and Archs
License: DFSG free
Git

DataLad is a data management and distribution platform providing access to a wide range of data resources already available online. Using git-annex as its backend for data logistics it provides following facilities built-in or available through additional extensions

  • command line and Python interfaces for manipulation of collections of datasets (install, uninstall, update, publish, save, etc.) and separate files/directories (add, get)

  • extract, aggregate, and search through various sources of metadata (xmp, EXIF, etc; install datalad-neuroimaging for DICOM, BIDS, NIfTI support)

  • crawl web sites to automatically prepare and update git-annex repositories with content from online websites, S3, etc (install datalad-crawler)
Datalad-container
DataLad-udvidelse til arbejdet med containermiljøer
Maintainer: Yaroslav Halchenko
Versions of package datalad-container
ReleaseVersionArchitectures
sid1.0.1-1all
buster0.2.2-2all
Popcon: 3 users (2 upd.)*
Versions and Archs
License: DFSG free

Denne udvidelse forbedrer DataLad (http://datalad.org) til arbejdet med beregningscontainere.

Git-annex
Håndter filer med git, uden at tjekke deres indhold ind i git
Versions of package git-annex
ReleaseVersionArchitectures
stretch-security6.20170101-1+deb9u1amd64,i386
stretch6.20170101-1+deb9u2amd64,arm64,i386,mips,mips64el,mipsel,ppc64el,s390x
buster7.20190129-3amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
bullseye8.20200908-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid8.20200908-1ppc64el
sid8.20201103-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,s390x
wheezy3.20120629amd64,armel,armhf,i386,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
wheezy-security3.20120629+deb7u1amd64,armel,armhf,i386
jessie5.20141125+deb8u1amd64,armel,armhf,i386
jessie-security5.20141125+oops-1+deb8u2amd64,armel,armhf,i386
Debtags of package git-annex:
develrcs
roleprogram
works-withfile
Popcon: 493 users (93 upd.)*
Versions and Archs
License: DFSG free
Git

git-annex tillader håndtering af filer med git, uden at tjekke filindholdet ind i git. Selv om dette kan lyde paradoksalt, så er det nyttigt ved håndtering af filer, der er større end hvad git med lethed kan klare, hvad enten det skyldes begrænsninger i hukommelse, tid eller diskplads.

Programmet kan lagre store filer på mange steder, fra lokale harddiske til et stort antal skytjenester, inklusive S3, WebDAV og rsync, med et dusion skyleverandører via udvidelsesmoduler. Filer kan lagres krypteret med gpg, så at skyleverandøren ikke kan se dine data. Git-annex holder styr på hvor hver fil er lagret, så den ved hvor mange kopier, der er tilgængelige, og har mange faciliteter til at sikre at dine data bevares.

Git-annex kan også bruges til at holde en mappe i synkronisering mellem computere, ved at holde øje med hvornår filer ændres og automatisk sende dem til git og overføre dem til andre computere. Netprogrammet for git-annex gør det nemt at opsætte og bruge git-annex på denne måde.

The package is enhanced by the following packages: elpa-git-annex elpa-magit-annex keysafe
*Popularitycontest results: number of people who use this package regularly (number of people who upgraded this package recently) out of 200793