#
IntroductionWe’re proud to introduce scvi‑tools v1.3, encompassing major advances in modeling, data loading, computational scalability, metric integration, and interpretability in single-cell analytics.
Featuring nine new or enhanced models—optimized for spatial, cytometry, methylation, perturbation, and multi‑omic data—it also introduces streaming data loaders for large-scale datasets, multi‑GPU model training, on-the-fly metric tuning, and integrated model interpretability.
This article delves into each enhancement with depth, including detailed insights, illustrative figures, and manuscript references.
#
1. 🔬 New Models#
ResolVIResolVI1 is a spatial transcriptomics denoising model that reallocates mis-assigned gene counts among true cells, neighborhood leakage, and background. It employs a Gaussian-mixture latent prior to learn corrected counts and interpretable embeddings. This approach is highly scalable (handling >1 million spots) and offers downstream capabilities like differential expression and transfer learning on corrected data
In tutorial, it has been shown to markedly enhance spatial expression accuracy in noisy segmentation settings, enabling reliable differential expression—especially in high-throughput ST datasets.

Figure 1: ResolVI cell type annotations based on noisy cellular segmentation Xenium data of a mouse brain. The left hemisphere for model training and the right hemisphere for transfer mapping. Original labeles were based on Leiden clustering of the ProSeg algorithm
#
scVIVAscVIVA2 augments spatial transcriptomics analysis by jointly modeling each cell’s own expression and its micro-environmental context (neighborhood composition and gene counts). This niche-aware VAE embeds both cellular identity and environmental features, revealing tissue-specific patterns and environment-driven variation. Its latent embeddings delineate tissue-specific structures—ideal for spatial differential abundance or niche-focused clustering studies.
Dedicated tutorial showcase how scVIVA enables niche-focused clustering and differential abundance analyses

Figure 2: scVIVA results: median Log-Fold Change (LFC) of upregulated genes in vs displayed on the x-axis, while we compare differential expression computed between and on the y-axis. Genes are colored by their marker label (yellow=significantly upregulated in vs , green otherwise). We also display the classifier decision boundary (the predicted probability of being in the yellow class).
#
CytoVICytoVI3 brings totalVI-inspired modeling to cytometry and mass cytometry data. It models protein-marker distributions, corrects for dropouts and technical batch variation, and generates embeddings for downstream clustering and abundance inference.
Early tutorial already demonstrate clear delineation of immune subpopulations across batch-affected datasets.
#
VIVSVariational Inference for Variable Selection (VIVS4) identifies associations across modalities—such as gene–protein couplings—while rigorously controlling false discovery rates using conditional randomization. VIVS achieves interpretable and scalable feature selection, enabling discovery of biologically meaningful links in paired datasets
Tutorial in scvi-tools soon to be updated.
#
SysVISysVI5 tackles major batch effects—such as those arising from cross-species or organoid-versus-tissue studies—using latent cycle-consistency and VampPrior regularization. Compared to Harmony or regular scVI models, SysVI excels at aligning technical systems while preserving true biological variance, producing embedding spaces where analogous cell types across batches cluster coherently.
in the tutorial, we show the power of sysVI with data integration between human and mouse immune cells.

Figure 3: Example results of integration between human and mouse immune cells
#
DecipherDecipher6 Designed to dissect perturbation effects (e.g., disease versus control), Decipher disentangles shared and condition-specific variation within a VAE framework. Demonstrated on AML (acute myeloid leukemia) data, it uncovers latent axes aligning with known disease signatures and identifies corresponding differentially expressed markers - bridging latent space analysis and functional biology.
#
MethylVIMethylVI7 is a VAE tailored for single-cell bisulfite sequencing (scBS‑seq). By modeling methylation probabilities at genomic regions, it captures epigenetic heterogeneity and learns latent spaces that integrate multiple batches. Tutorial show it outperforms linear methods like PCA in retaining biologically meaningful methylation structures.
#
MethylANVIMethylANVI extends the MethylVI framework with annotation-aware modeling: it jointly integrates methylation profiles with metadata-driven cell-type labels. This supervised model supports both clustering and label transfer, all while capturing latent biological variation across methylome profiles—ideal for epigenetic atlas-building.
#
totalANVItotalANVI brings supervised annotation to CITE‑seq-style multi-omic integration. Leveraging a VAE for RNA + protein, it jointly learns latent embeddings and cell-type classifiers. The model simultaneously performs dimensionality reduction, denoising, differential expression, and accurate cell-type annotation in one cohesive model.
Tutorial in scvi-tools soon to be updated.
#
2. 🧩 Custom Dataloadersscvi‑tools v1.3 introduces three scalable streaming dataloaders—LaminDB, Census, and AnnCollection—enabling out-of-core and federated training without memory overload.
#
LaminDBstreams data directly from Lamindb collections, letting users train SCVI-like models on terabyte-scale datasets stored on disk, with no changes to the training pipeline or performance drop-offs.
#
Censusemploys TileDB for atlas-scale tensor-backed data, offering similar streaming capabilities but enhanced support for multi-dimensional genomic inputs and federated study designs.
#
AnnCollectionallows seamless union of multiple AnnData sources—perfect for interfacing independent experiments—without manual concatenation or feature alignment. The system handles missing modalities and automatically harmonizes variables across datasets.
These backends support full compatibility with scvi‑tools' data registration and training workflows, offering both scale and convenience to large projects.
#
3. ⚙️ Core Enhancements#
Multi‑GPU TrainingBuilt on PyTorch Lightning, v1.3 empowers all major models (SCVI, totalVI, SysVI, etc.) to run across multiple GPUs with a single API flag. Training benchmarks on million-cell datasets show training times reduced by 2–3×, with full gradient synchronization and no code modifications needed.
#
scIB‑Metrics Optimizationscvi‑tools now integrates evaluation of scIB clustering metrics (like NMI and batch ARI) during training via ScibCallback
, and supports hyperparameter tuning via AutotuneExperiment
. This enables scientists to automatically optimize latent dimension size, learning rate, or architecture based on validation performance—making the model training process more principled and outcome-driven.
#
Explainability & Interpretabilityv1.3 brings native support for Integrated Gradients (IG) across generative models like CondSCVI, peakVI, and totalVI. Users can compute gene-level attribution scores tied to latent dimensions or differential axes. This complements get_normalized_expression
, delivering a full pipeline that links model representations back to biologically interpretable molecular mechanisms.
#
Summaryscvi‑tools v1.3 is a landmark release that advances the field across three foundational pillars:
Innovative modeling across nine tailored VAEs for spatial, protein, methylation, perturbation, and multi‑omic data.
Scalable data processing with three new streaming backends, enabling efficient handling of federated, atlas-scale datasets.
Infrastructure and transparency with multi-GPU training, metric-aware tuning, and demonstrable model interpretability.
Together, these developments empower researchers to build, train, and interpret probabilistic models at scale—in a reproducible, transparent, and biologically meaningful way.
#
References- ResolVI: addressing noise and bias in spatial transcriptomics / Ergen et al.↩
- scVIVA: a probabilistic framework for representation of cells and their environments in spatial transcriptomics / Levy et al.↩
- CytoVI: Deep generative modeling of antibody-based single cell technologies / Ingelfinger et al.↩
- VI-VS: calibrated identification of feature dependencies in single-cell multiomics / Boyeau et al.↩
- sysVI: Integrating single-cell RNA-seq datasets with substantial batch effects / Hrovatin et al.↩
- Decipher: Joint representation and visualization of derailed cell states with Decipher / Nazaret et al.↩
- MethylVI: A deep generative model of single-cell methylomic data / Weinberger et al.↩