DebiChem Project
Summary
Cheminformatics
DebiChem Cheminformatics

This metapackage will install cheminformatics packages useful for chemists.

Description

For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:

If you discover a project which looks like a good candidate for DebiChem to you, or if you have prepared an unofficial Debian package, please do not hesitate to send a description of that project to the DebiChem mailing list

Links to other tasks

DebiChem Cheminformatics packages

Official Debian packages with high relevance

libcdk-java
Chemistry Development Kit (CDK) Java libraries
Versions of package libcdk-java
ReleaseVersionArchitectures
trixie2.9-1all
bookworm2.8-2all
bullseye2.3.134.g1bb9a64587-2all
stretch1.2.10-6all
jessie1.2.10-6all
sid2.9-1all
buster1.2.10-7all
Debtags of package libcdk-java:
devellang:java, library
fieldchemistry
roledevel-lib
Popcon: 4 users (5 upd.)*
Versions and Archs
License: DFSG free
Git

The CDK is a library of Java classes used in computational and information chemistry and in bioinformatics. It includes renderers, file IO, SMILES generation/parsing, maximal common substructure algorithms, fingerprinting and much, much more.

Please cite: C. Steinbeck, Y. Han, S. Kuhn, O. Horlacher, E. Luttmann and E. L. Willighagen: The Chemistry Development Kit (CDK): An Open-Source Java Library for Chemo- and Bioinformatics. (2003)
libchemicaltagger-java
tool for semantic text-mining in chemistry
Versions of package libchemicaltagger-java
ReleaseVersionArchitectures
sid1.6.2-4all
trixie1.6.2-4all
bookworm1.6.2-2all
Popcon: users ( upd.)*
Versions and Archs
License: DFSG free
Git

ChemicalTagger parser is a medium-depth, phrase-based semantic NLP tool for the language of chemical experiments. Tagging is based on a modular architecture and uses a combination of OSCAR, domain-specific regex and English taggers to identify parts-of-speech. The ANTLR grammar is used to structure this into tree-based phrases.

Please cite: Lezan Hawizy, David M Jessop, Nico Adams and Peter Murray-Rust: ChemicalTagger: A tool for semantic text-mining in chemistry. Journal of Cheminformatics 3(1) (2011)
libinchi-bin
International Chemical Identifier (InChI) algorithm (executable)
Versions of package libinchi-bin
ReleaseVersionArchitectures
trixie1.07.1+dfsg-4amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
bookworm1.03+dfsg-4amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid1.07.1+dfsg-4amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 2 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

The International Chemical Identifier (InChI) is an Open Source chemical structure representation algorithm.

InChIs are text strings comprising different layers and sublayers of information separated by slashes (/). Each InChI strings starts with the InChI version number followed by the main layer. This main layer contains sublayers for chemical formula, atom connections and hydrogen atoms. Depending on the structure of the molecule the main layer may be followed by additional layers e. g. for charge, stereochemical and/or isotop information.

This package contains the executable, inchi_main.

libjni-inchi-java
Java Native Interface wrapper for InChI
Versions of package libjni-inchi-java
ReleaseVersionArchitectures
sid0.8+dfsg-8all
bullseye0.8+dfsg-6all
trixie0.8+dfsg-8all
bookworm0.8+dfsg-7all
Popcon: 56 users (68 upd.)*
Versions and Archs
License: DFSG free
Git

JNI-InChI provides JNI (Java Native Interface) wrappers for the InChI (International Chemical Identifier) C library distributed by IUPAC. All of the features from the InChI libarary are supported:

  • Standard and Non-Standard InChI generation from structures with 3D, 2D, or no coordinates
  • Structure generation (without coordinates) from InChI
  • InChIKey generation
  • Check InChI / InChIKey
  • InChI-to-InChI conversion
  • AuxInfo to InChI input
  • Access to the full range of options supported by InChI
  • Full support for InChI's handling of stereochemistry
libopenchemlib-java
framework providing cheminformatics core functionality
Versions of package libopenchemlib-java
ReleaseVersionArchitectures
bullseye2021.2.0+dfsg-1all
bookworm2023.1.1+dfsg-1all
sid2023.1.1+dfsg-1all
trixie2023.1.1+dfsg-1all
upstream2024.11.1
Popcon: 0 users (0 upd.)*
Newer upstream!
License: DFSG free
Git

OpenChemLib is Java based framework providing cheminformatics core functionality and user interface components. Its main focus is on organics chemistry and small molecules. It is built around a StereoMolecule class, which represents a molecule using atom and bond tables, provides atom neighbours, ring and aromaticity information, and supports MDL's concept of enhanced stereo representation. Additional classes provide, 2D-depiction, descriptor calculation, molecular similarity and substructure search, reaction search, property prediction, conformer generation, support for molfile and SMILES formats, energy minimization, ligand-protein interactions, and more. OpenChemLib's idcode represents molecules, fragments or reactions as canonical, very compact string that includes stereo and query features.

Different to other cheminformatics frameworks, OpenChemLib also provides user interface components that allow one to easily embed chemical functionality into Java applications, e.g. to display or edit chemical structures or reactions.

libopsin-java
Chemical name to structure converter
Versions of package libopsin-java
ReleaseVersionArchitectures
trixie2.8.0-1all
jessie1.5.0-1all
stretch1.5.0-1all
buster2.3.1-1all
bullseye2.5.0-1all
bookworm2.7.0-2all
sid2.8.0-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

OPSIN (Open Parser for Systematic IUPAC Nomenclature) converts IUPAC compound names to semantic chemical information and outputs a structure either as CML (Chemical Markup Language), SMILES, or InChI.

This package contains the Java library.

liboscar4-java
automated annotation of chemistry in scientific articles
Versions of package liboscar4-java
ReleaseVersionArchitectures
trixie5.2.0+dfsg-2all
bookworm5.2.0+dfsg-2all
sid5.2.0+dfsg-2all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

OSCAR (Open Source Chemistry Analysis Routines) is an open source extensible system for the automated annotation of chemistry in scientific articles. It can be used to identify chemical names, reaction names, ontology terms, enzymes and chemical prefixes and adjectives, and chemical data such as state, yield, IR, NMR and mass spectra and elemental analyses. In addition, where possible, any chemical names detected will be annotated with structures derived either by lookup, or name-to-structure parsing using OPSIN or with identifiers from the ChEBI (Chemical Entities of Biological Interest) ontology.

Please cite: David M Jessop, Sam E Adams, Egon L Willighagen, Lezan Hawizy and Peter Murray-Rust: OSCAR4: a flexible architecture for chemical text-mining. Journal of Cheminformatics 3(1) (2011)
openbabel
Chemical toolbox utilities (cli)
Versions of package openbabel
ReleaseVersionArchitectures
stretch2.3.2+dfsg-3amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x
buster2.4.1+dfsg-3amd64,arm64,armhf,i386
bullseye3.1.1+dfsg-6amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm3.1.1+dfsg-9amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie3.1.1+dfsg-11amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid3.1.1+dfsg-11amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
jessie2.3.2+dfsg-2amd64,armel,armhf,i386
Debtags of package openbabel:
fieldchemistry
interfacecommandline
roleprogram
scopeutility
useconverting
Popcon: 73 users (59 upd.)*
Versions and Archs
License: DFSG free
Git

Open Babel is a chemical toolbox designed to speak the many languages of chemical data. It allows one to search, convert, analyze, or store data from molecular modeling, chemistry, solid-state materials, biochemistry, or related areas. Features include:

  • Hydrogen addition and deletion
  • Support for Molecular Mechanics
  • Support for SMARTS molecular matching syntax
  • Automatic feature perception (rings, bonds, hybridization, aromaticity)
  • Flexible atom typer and perception of multiple bonds from atomic coordinates
  • Gasteiger-Marsili partial charge calculation

File formats Open Babel supports include PDB, XYZ, CIF, CML, SMILES, MDL Molfile, ChemDraw, Gaussian, GAMESS, MOPAC and MPQC.

This package includes the following utilities:

  • obabel: Convert between various chemical file formats
  • obenergy: Calculate the energy for a molecule
  • obminimize: Optimize the geometry, minimize the energy for a molecule
  • obgrep: Molecular search program using SMARTS pattern
  • obgen: Generate 3D coordinates for a molecule
  • obprop: Print standard molecular properties
  • obfit: Superimpose two molecules based on a pattern
  • obrotamer: Generate conformer/rotamer coordinates
  • obconformer: Generate low-energy conformers
  • obchiral: Print molecular chirality information
  • obrotate: Rotate dihedral angle of molecules in batch mode
  • obprobe: Create electrostatic probe grid
Screenshots of package openbabel
osra
optical structure recognition
Versions of package osra
ReleaseVersionArchitectures
bookworm2.1.3-3amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
trixie2.1.3-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
sid2.1.3-3amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 1 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

OSRA is a utility designed to convert graphical representations of chemical structures, as they appear in journal articles, patent documents, textbooks, trade magazines etc., into SMILES (Simplified Molecular Input Line Entry Specification) - a computer recognizable molecular structure format. OSRA can read a document in any of the over 90 graphical formats parseable by ImageMagick - including GIF, JPEG, PNG, TIFF, PDF, PS etc., and generate the SMILES representation of the molecular structure images encountered within that document.

python3-indigo
Organic Chemistry Toolkit (Python module)
Versions of package python3-indigo
ReleaseVersionArchitectures
bullseye1.2.3-3.1all
sid1.2.3-3.1all
trixie1.2.3-3.1all
bookworm1.2.3-3.1all
Popcon: 2 users (1 upd.)*
Versions and Archs
License: DFSG free
Git

Indigo is a C++ based organic chemistry and cheminformatics software environment. Features Include:

  • Molecule and reaction rendering including SVG support
  • Automatic layout for SMILES-represented molecules and reactions
  • Canonical (isomeric) SMILES computation
  • Exact matching, substructure matching, SMARTS matching
  • Matching of tautomers and resonance structures
  • Molecule fingerprinting, molecule similarity computation
  • Fast enumeration of SSSR rings, subtrees, and edge sugraphs
  • Molecular weight, molecular formula computation
  • R-Group deconvolution and scaffold detection
  • Computation of the exact maximum common substructure for an arbitrary amount of input structures
  • Combinatorial chemistry
  • Plugin support in the API

File formats Indigo support include MDL Mol, SDF, RDF, CML, SMILES and SMARTS.

This package contains the Python modules.

python3-openbabel
Chemical toolbox library (Python bindings)
Versions of package python3-openbabel
ReleaseVersionArchitectures
bullseye3.1.1+dfsg-6amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bookworm3.1.1+dfsg-9amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid3.1.1+dfsg-11amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
trixie3.1.1+dfsg-11amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
Popcon: 16 users (11 upd.)*
Versions and Archs
License: DFSG free
Git

Open Babel is a chemical toolbox designed to speak the many languages of chemical data. It allows one to search, convert, analyze, or store data from molecular modeling, chemistry, solid-state materials, biochemistry, or related areas. Features include:

  • Hydrogen addition and deletion
  • Support for Molecular Mechanics
  • Support for SMARTS molecular matching syntax
  • Automatic feature perception (rings, bonds, hybridization, aromaticity)
  • Flexible atom typer and perception of multiple bonds from atomic coordinates
  • Gasteiger-Marsili partial charge calculation

File formats Open Babel supports include PDB, XYZ, CIF, CML, SMILES, MDL Molfile, ChemDraw, Gaussian, GAMESS, MOPAC and MPQC.

This package contains the Python binding.

python3-rdkit
Collection of cheminformatics and machine-learning software
Versions of package python3-rdkit
ReleaseVersionArchitectures
bookworm202209.3-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
bullseye202009.4-1amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x
sid202309.3-7amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x
upstream202409.2
Popcon: 35 users (0 upd.)*
Newer upstream!
License: DFSG free
Git

RDKit is a Python/C++ based cheminformatics and machine-learning software environment. Features Include:

  • Chemical reaction handling and transforms
  • Substructure searching with SMARTS
  • Canonical SMILES
  • Molecule-molecule alignment
  • Large number of molecular descriptors, including topological, compositional, EState, SlogP/SMR, VSA and Feature-map vectors
  • Fragmentation using RECAP rules
  • 2D coordinate generation and depiction, including constrained depiction
  • 3D coordinate generation using geometry embedding
  • UFF and MMFF94 forcefields
  • Chirality support, including calculation of (R/S) stereochemistry codes
  • 2D pharmacophore searching
  • Fingerprinting, including Daylight-like, atom pairs, topological torsions, Morgan algorithm and MACCS keys
  • Calculation of shape similarity
  • Multi-molecule maximum common substructure
  • Machine-learning via clustering and information theory algorithms
  • Gasteiger-Marsili partial charge calculation

File formats RDKit supports include MDL Mol, PDB, SDF, TDT, SMILES and RDKit binary format.

Registry entries: SciCrunch 
smiles-scripts
command line tools to handle SMILES descriptors
Versions of package smiles-scripts
ReleaseVersionArchitectures
bookworm0.2.0+dfsg1-4all
sid0.2.0+dfsg1-6all
trixie0.2.0+dfsg1-6all
Popcon: 1 users (0 upd.)*
Versions and Archs
License: DFSG free
Git

Package provides command line tools to work with Simplified Molecular-Input Line-Entry System (SMILES) descriptors. Package contains command line interface scripts for Chemistry Development Kit (CDK), as well as a syntax checker for SMILES.

Please cite: Miguel Quirós, Saulius Gražulis, Saulė Girdzijauskaitė, Andrius Merkys and Antanas Vaitkus: Using SMILES strings for the description of chemical connectivity in the Crystallography Open Database. Journal of Cheminformatics 10(1) (2018)
*Popularitycontest results: number of people who use this package regularly (number of people who upgraded this package recently) out of 245926