adabmDCA 2.0 - Direct Coupling Analysis in C/C++

Authors:

Lorenzo Rosset (Ecole Normale Supérieure ENS, Sorbonne Université)
Roberto Netti (Sorbonne Université)
Anna Paola Muntoni (Politecnico di Torino)
Martin Weigt (Sorbonne Université)
Francesco Zamponi (Sapienza Università di Roma)

Maintainer: Anna Paola Muntoni

Overview

adabmDCA 2.0 is a flexible yet easy-to-use implementation of Direct Coupling Analysis (DCA) based on Boltzmann machine learning. This package provides tools for analyzing residue-residue contacts, predicting mutational effects, scoring sequence libraries, and generating artificial sequences, applicable to both protein and RNA families. The package is designed for flexibility and performance, supporting multiple programming languages (C++, Julia, Python) and architectures (single-core/multi-core CPUs and GPUs).
This repository contains the C/C++ version of adabmDCA, maintained by Anna Paola Muntoni.

The project's main repository can be found at adabmDCA 2.0.

Features

Direct Coupling Analysis (DCA) based on Boltzmann machine learning.
Support for dense and sparse generative DCA models.
Available on multiple architectures: single-core and multi-core CPUs, GPUs.
Ready-to-use for residue-residue contact prediction, mutational-effect prediction, and sequence design.
Compatible with protein and RNA family analysis.

Installation

In the src folder run

make

This generates the executable file adabmDCA. To use the main script, also run chmod +x adabmDCA.sh in the main folder. See

./adabmDCA.sh [ train | sample | energies | DMS | contacts ] --help

for the basic commands, or consult the project's main page.

Alternative usage

Learning a Potts model from an MSA

./adabmDCA -f <MSA file> -a <output folder> -k <label> -m <nsave> -L

Output files are saved every nsave iterations, as set by the -m flag.
Output is written to the folder given by the -a flag, and files are labeled with the argument of the -k flag.
Use -w <file name> to supply a custom weights file (optional).
For RNA, set the flag -b n; for a custom alphabet, set -b <alphabet>, where <alphabet> is a string.

Learning a sparse Potts model from an MSA

Decimation from converged run

./adabmDCA -f <MSA file> -k <label> -a <output folder> -p <params> -c <convergence threshold> -x <required sparsity> -L

The -A flag removes the gauge invariance at the beginning of training.
Additional option: -V <drate>.

Activation from profile model

./adabmDCA -f <MSA file> -k <label> -a <output folder> -I 0. -Z -c <convergence threshold> -X <gsteps> -L -e <nsweep>

The -Z flag deactivates all couplings at the beginning of training.
-I 0. starts the training from a profile model.
Additional option: -U <factivate>.
Training converges once the target Pearson correlation is reached, whatever the density of couplings.
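
The Pearson criterion above can be illustrated with a short Python sketch. It assumes the common Boltzmann-machine DCA reading, in which the Pearson correlation is computed between the connected two-site correlations of the data and those of the model, and training stops once it reaches a target value. The function names and the target argument are hypothetical and are not part of the adabmDCA interface.

```python
import math
from typing import Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("inputs must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0.0 or var_y == 0.0:
        raise ValueError("Pearson correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


def has_converged(data_corr: Sequence[float],
                  model_corr: Sequence[float],
                  target: float) -> bool:
    """Hypothetical stopping rule: Pearson(data, model) >= target."""
    return pearson(data_corr, model_corr) >= target
```

Here data_corr and model_corr stand for flattened lists of two-site correlations. How the package estimates them, and which threshold it uses, is not shown.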

Restore training

Add the flag --restore to restart the training from the checkpoint saved in the output folder.

Sampling

Use

./adabmDCA -p <params> -f <MSA file> -i 0 -S -L -s <nconfig>

Optional: -W <nmix>.

Computing energies

./adabmDCA --energies -p <params> -f <MSA file>
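
As an illustration of what an energy means for a Potts model, the following Python sketch evaluates the standard DCA energy E(a) = -sum_i h_i(a_i) - sum_{i<j} J_ij(a_i, a_j) for a sequence encoded as integers. The data layout (a list of per-site fields and a sparse dictionary of couplings) and the toy parameters are assumptions made for the example. They do not reflect the parameter file format read by -p.

```python
from typing import Dict, List, Tuple

# Sparse couplings: (i, j, a, b) -> J_ij(a, b) with i < j; missing entries are 0.
Couplings = Dict[Tuple[int, int, int, int], float]


def potts_energy(seq: List[int],
                 fields: List[List[float]],
                 couplings: Couplings) -> float:
    """Potts energy of an integer-encoded sequence (lower means more favorable)."""
    energy = 0.0
    n_sites = len(seq)
    for i in range(n_sites):
        energy -= fields[i][seq[i]]  # single-site field term
        for j in range(i + 1, n_sites):
            # pairwise coupling term, zero if the coupling is absent
            energy -= couplings.get((i, j, seq[i], seq[j]), 0.0)
    return energy


# Toy model (hypothetical values): 3 sites, 2 symbols per site.
fields = [[0.1, -0.2], [0.0, 0.3], [0.5, 0.0]]
couplings = {(0, 1, 0, 1): 1.0, (1, 2, 1, 0): -0.5}
energy = potts_energy([0, 1, 0], fields, couplings)
print(energy)
```

Storing couplings in a dictionary keeps the sketch valid for both dense and sparse models, since absent couplings simply contribute zero.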

DMS scores

./adabmDCA --dms -p <params> -f <wild type>
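
A deep mutational scan (DMS) score is commonly defined in DCA as the energy difference between a single-site mutant and the wild type. The Python sketch below assumes that definition, with score = E(mutant) - E(wild type). The sign convention, the data layout, and the toy parameters are assumptions for illustration, and the scores written by the package may be defined differently.

```python
from typing import Dict, List, Tuple

Couplings = Dict[Tuple[int, int, int, int], float]


def potts_energy(seq: List[int],
                 fields: List[List[float]],
                 couplings: Couplings) -> float:
    """E(a) = -sum_i h_i(a_i) - sum_{i<j} J_ij(a_i, a_j)."""
    energy = 0.0
    for i in range(len(seq)):
        energy -= fields[i][seq[i]]
        for j in range(i + 1, len(seq)):
            energy -= couplings.get((i, j, seq[i], seq[j]), 0.0)
    return energy


def dms_scores(wild_type: List[int],
               fields: List[List[float]],
               couplings: Couplings) -> Dict[Tuple[int, int], float]:
    """Map (site, new symbol) -> E(mutant) - E(wild type) for all single mutants."""
    e_wt = potts_energy(wild_type, fields, couplings)
    scores = {}
    for i, original in enumerate(wild_type):
        for b in range(len(fields[i])):
            if b == original:
                continue  # skip the wild-type symbol itself
            mutant = list(wild_type)
            mutant[i] = b
            scores[(i, b)] = potts_energy(mutant, fields, couplings) - e_wt
    return scores


# Toy model (hypothetical values): 3 sites, 2 symbols per site.
fields = [[0.1, -0.2], [0.0, 0.3], [0.5, 0.0]]
couplings = {(0, 1, 0, 1): 1.0, (1, 2, 1, 0): -0.5}
scores = dms_scores([0, 1, 0], fields, couplings)
```

Under this convention a positive score means the mutant has a higher energy than the wild type, so the model regards it as less favorable.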

Other features

See

./adabmDCA -h

for a complete list.

License

This package is open-sourced under the MIT License.

Citation

If you use this code, even partially, please cite:

Rosset, L., Netti, R., Muntoni, A.P., Weigt, M., & Zamponi, F. (2024). adabmDCA 2.0: A flexible but easy-to-use package for Direct Coupling Analysis.

Acknowledgments

This work was developed in collaboration with Sorbonne Université, Sapienza Università di Roma, and Politecnico di Torino.
