Aarav H. Dave1
This repository implements MSAffect, a computational pipeline for generating, perturbing, and analyzing multiple-sequence alignments (MSAs) in the context of protein structure prediction with AlphaFold2. It automates:
- Baseline MSA generation via ColabFold/MMseqs2
- Three adversarial perturbations (deletion, residue-level mutation, row-and-column shuffle)
- AlphaFold2 runs on both unperturbed and perturbed MSAs
- Extraction of confidence scores (pLDDT) and structural deviation metrics (RMSD)
- Generation of summary tables, bar plots, scatterplots, and 3D visualizations
This repository hosts project files (2025.1α) for public usage dictated by its license.
Coming Soon
Coming Soon
- Open
MSAffect.ipynb
in a Google Colaboratory environment. - Install dependencies by running “Initiate MSAffect Environment” cell, which will install ColabFold, Biopython, NumPy, Matplotlib, and py3Dmol.
- Prepare your FASTA inputs by placing one or more
.fasta
files in themsa/
directory.ubiquitin.fasta
,bpti.fasta
, andcalmodulin.fasta
are included as examples. - Execute the pipeline by running the “Run MSAffect” cell. The script will process every FASTA in
msa/
, create subfolders underresults/
, and produce summary tables and figures. You can visualize your results in 3D by running the "3D Visualize Results" cell. - Contribute edits by adding your GitHub token and running the "Push to MSAffect GitHub" cell.
Coming Soon
This software utilizes:
- Google Colaboratory for interactive execution and free GPU access
- ColabFold (AlphaFold2) for accelerated MSA generation and structure prediction
- Python 3.11 as the execution environment
- Biopython for MSA parsing and PDB handling
- NumPy for numerical operations
- Matplotlib (and optional Seaborn) for plotting
- py3Dmol for in-notebook 3D molecular visualization
This software, as with all subsequent versions of the software, is protected by the CC-BY-NC-ND license. In summary, this does not allow commercial usage, distribution, or distribution of modifications of the software. In additon, you are required to credit authorship and state any changes you may have made.
For more information, please refer to the
LICENSE
file.
For questions concerning the contents of this repository, please contact contact [at] aaravdave [dot] org.
Footnotes
-
Lowndes High School, Valdosta, GA ↩