Installation

Quick Installation

The simplest way to get started with TRIFID is to install it via pip directly from GitHub:

pip install git+https://github.com/fpozoc/trifid.git

This method is ideal if you want to:

Use TRIFID’s preprocessing modules (QSplice, Pfam effects)
Load and analyze pre-computed predictions
Integrate TRIFID into your analysis pipeline

Development Installation

For development work or to reproduce the complete TRIFID methodology from scratch, follow these steps:

Install Conda/Mamba

First, ensure you have Miniconda or Anaconda with mamba installed. If not, install Miniconda:

wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh -b -p $HOME/miniconda3

Install mamba for faster dependency resolution:

conda install -c conda-forge mamba

Clone the Repository

Clone the TRIFID repository and navigate to the directory:

git clone git@github.com:fpozoc/trifid.git
cd trifid

Create Environment

Create the conda environment from the provided configuration:

mamba env create -f environment.yml
conda activate trifid

Install Pre-commit Hooks

Set up development tools and pre-commit hooks:

pre-commit install

Verify Installation

Test that everything is working correctly:

# Run pre-commit checks
pre-commit run --all-files

# Run tests
pytest -v

System Requirements

Minimum Requirements:

Python 3.7+
8GB RAM (16GB recommended for genome-wide analysis)
Linux or macOS (Windows via WSL)
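
The requirements above can be checked before installing. The following is a minimal sketch using only the Python standard library; the function name check_system is illustrative and not part of TRIFID, and the sketch does not check available RAM.

```python
import platform
import sys

MIN_PYTHON = (3, 7)  # minimum version stated in the requirements above
SUPPORTED_SYSTEMS = {"Linux", "Darwin"}  # Darwin is macOS; WSL reports "Linux"


def check_system(version=None, system=None):
    """Return a list of human-readable problems; an empty list means OK.

    Both arguments default to the running interpreter and platform and
    exist mainly so the check can be exercised with other values.
    """
    version = tuple(version or sys.version_info[:2])
    system = system or platform.system()
    problems = []
    if version < MIN_PYTHON:
        problems.append(
            "Python %d.%d found, but %d.%d or newer is required"
            % (version + MIN_PYTHON)
        )
    if system not in SUPPORTED_SYSTEMS:
        problems.append("%s is not listed as supported; consider WSL" % system)
    return problems


if __name__ == "__main__":
    issues = check_system()
    print("OK" if not issues else "\n".join(issues))
```

An empty result only means the interpreter version and operating system match the list above; it says nothing about memory or disk space.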

Core Dependencies

TRIFID requires the following Python packages:

biopython
numpy
pandas>=1.0
scikit-learn
joblib
cython
pyyaml
gtfparse
loguru
muscle
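
To see which of the packages listed above are already available in the active environment, a short check along these lines may help. It is a sketch, not part of TRIFID: the helper names are illustrative, the import names (for example Bio for biopython and sklearn for scikit-learn) are the ones those projects publish, and treating muscle as an executable on the PATH rather than an importable module is an assumption.

```python
import importlib.util
import shutil

# Distribution name -> import name. The two differ for a few packages.
PYTHON_PACKAGES = {
    "biopython": "Bio",
    "numpy": "numpy",
    "pandas": "pandas",
    "scikit-learn": "sklearn",
    "joblib": "joblib",
    "cython": "Cython",
    "pyyaml": "yaml",
    "gtfparse": "gtfparse",
    "loguru": "loguru",
}


def missing_packages(packages=PYTHON_PACKAGES):
    """Return the distribution names whose import name cannot be found."""
    return sorted(
        name
        for name, module in packages.items()
        if importlib.util.find_spec(module) is None
    )


def muscle_on_path():
    """Assumption: 'muscle' refers to the MUSCLE aligner executable."""
    return shutil.which("muscle") is not None


if __name__ == "__main__":
    print("Missing Python packages:", missing_packages() or "none")
    print("muscle executable found:", muscle_on_path())
```

The check only tests whether each module can be located; it does not verify versions such as pandas>=1.0.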

Install Optional Dependencies

Depending on your use case, you may want additional features:

Run these commands from the root of the cloned repository. The extras are quoted so that shells such as zsh do not interpret the square brackets as a glob pattern.

Visualization

For generating plots and interpreting model predictions:

pip install -e ".[extra]"

This includes: matplotlib, altair, shap, eli5, mlxtend, rfpimp

Interactive Analysis

For Jupyter notebook support:

pip install -e ".[interactive]"

This includes: jupyterlab, watermark

All Features

Install everything:

pip install -e ".[dev,extra,interactive]"

Pre-computed Predictions

Instead of running TRIFID from scratch, you can download pre-computed predictions for multiple genome assemblies. This is the recommended approach for most users.

Available Genomes

Human (Homo sapiens)

Assembly	Database	Version	Release Date	Download
GRCh38	GENCODE	v27	Aug 2017	Link
GRCh38	GENCODE	v37	Feb 2021	Link
GRCh38	GENCODE	v42	Apr 2022	Link
GRCh38	RefSeq	110	Feb 2020	Link
GRCh37	GENCODE	v19	Dec 2013	Link
GRCh37	RefSeq	105	Feb 2020	Link

Model Organisms

Species	Assembly	Database	Version	Download
Mouse	GRCm39	GENCODE	M31	Link
Mouse	GRCm38	GENCODE	M25	Link
Rat	mRatBN7.2	Ensembl	105	Link
Zebrafish	GRCz11	Ensembl	104	Link
Fruitfly	BDGP6	Ensembl	107	Link
Worm	WBcel235	Ensembl	108	Link

Other Vertebrates

Species	Assembly	Database	Version	Download
Chicken	GRCg7b	Ensembl	108	Link
Chimpanzee	Pan_tro_3.0	Ensembl	104	Link
Pig	Sscrofa11.1	Ensembl	108	Link
Cow	ARS-UCD1.2	Ensembl	104	Link
Macaque	Mmul_10	Ensembl	105	Link

Download and Extract

Each prediction package contains:

trifid_predictions.tsv.gz - TRIFID scores for all isoforms
trifid_db.tsv.gz - Complete feature matrix
Feature description files

# Example: Download human GRCh38 GENCODE 27 predictions
wget https://drive.google.com/... -O trifid_predictions.tsv.gz

# Extract and view
gunzip trifid_predictions.tsv.gz
head trifid_predictions.tsv

Prediction files can be large (>1GB compressed). Ensure you have sufficient disk space.
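
Because the files are large, it can be convenient to inspect the first rows without decompressing the whole archive to disk. The sketch below reads a gzip-compressed TSV with the standard library only. The helper name peek_tsv_gz is illustrative, and no column names are assumed because this page does not list them.

```python
import csv
import gzip
import itertools


def peek_tsv_gz(path, n_rows=5):
    """Return (header, first n_rows data rows) from a gzip-compressed TSV.

    The file is streamed, so only the requested rows are decompressed.
    """
    with gzip.open(path, mode="rt", encoding="utf-8", newline="") as handle:
        reader = csv.reader(handle, delimiter="\t")
        header = next(reader, [])  # empty list if the file has no lines
        rows = list(itertools.islice(reader, n_rows))
    return header, rows
```

For example, header, rows = peek_tsv_gz("trifid_predictions.tsv.gz") returns the column names and the first five records of the downloaded file, leaving the compressed file untouched.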

Additional Resources

For advanced usage and model training, you may need additional data files:

Training Data

TRIFID training set (GENCODE 27)

Pre-trained Model

TRIFID model v1.0.4 (pickle format)

Tutorial Notebook

Complete tutorial with examples

Figures Notebook

Reproduce paper figures
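
The pre-trained model listed above is distributed in pickle format. A minimal sketch of loading such a file with the standard library follows. The function name and the file name in the usage note are hypothetical, and this page does not describe the type or interface of the loaded object, so nothing is assumed about it here.

```python
import pickle


def load_pickled_model(path):
    """Load a pickled object from disk.

    Unpickling can execute arbitrary code, so only use this on files
    obtained from a source you trust.
    """
    with open(path, "rb") as handle:
        return pickle.load(handle)
```

Usage would look like model = load_pickled_model("trifid_model.pkl"), where the file name is a placeholder for the downloaded file. Loading generally requires that the libraries used to create the object are installed in the environment, ideally in compatible versions.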

Need Help?

If you encounter issues during installation:

Check the GitHub Issues for known problems
Ensure all system requirements are met
Try creating a fresh conda environment
Open a new issue with your error log

For species or genome versions not listed, you can request custom predictions by opening an issue on GitHub.

Next Steps

Quick Start Guide

Learn how to load predictions and analyze your first gene
