BioNeuralNet: Graph Neural Networks for Multi-Omics Network Analysis

https://img.shields.io/badge/License-CC%20BY--NC--ND%204.0-lightgrey.svg https://img.shields.io/pypi/v/bioneuralnet https://static.pepy.tech/badge/bioneuralnet https://img.shields.io/badge/GitHub-View%20Code-blue https://zenodo.org/badge/DOI/10.5281/zenodo.17503083.svg
BioNeuralNet Logo

Installation

BioNeuralNet is available as a Python package on PyPI:

pip install bioneuralnet

For additional installation details and troubleshooting, see Installation.

Quick Start Examples

Get started quickly with these end-to-end examples demonstrating the BioNeuralNet workflow:

BioNeuralNet Workflow Overview

BioNeuralNet Workflow Overview

View BioNeuralNet Workflow.

What is BioNeuralNet?

BioNeuralNet is a flexible and modular Python framework tailored for end-to-end network-based multi-omics data analysis. It leverages Graph Neural Networks (GNNs) to learn biologically meaningful low-dimensional representations from multi-omics networks, converting complex molecular interactions into versatile embeddings.

Core Analytical Modules:

  • Network Construction: Build informative networks from raw tabular data using strategies like Similarity, Correlation, Neighborhood-based, or Phenotype-driven (e.g., SmCCNet) approaches. [1]

  • Biomarker Discovery: Identify biological modules and key molecular interactions that drive disease phenotypes.

  • Disease Prediction: Implement end-to-end supervised disease classification using the DPMON (Disease Prediction using Multi-Omics Networks) module. [2]

  • Subject Representation: Generate enhanced subject-level embeddings for stratification and clustering.

Visualizing Multi-Omics Networks

BioNeuralNet allows you to inspect the topology of your constructed networks. The visualization below, from our TCGA Lower Grade Glioma (LGG) analysis, highlights a survival-associated module of highly correlated omics features identified by HybridLouvain.

Multi-Omics Network Visualization

Network visualization of a highly connected gene module identified in the TCGA-LGG dataset. See Network Full Size

Top Identified Biomarkers (Hub Omics)

The table below lists the top hub features identified in the network above, ranked by their degree centrality.

Omics with high degree

Feature Name (Omic)

Index

Degree

Source

HIVEP3

20

7

RNA

DBH

19

7

RNA

ERMP1

8

7

RNA

LFNG

12

6

RNA

MIR23A

21

6

miRNA

THADA

4

6

RNA

Why Graph Neural Networks for Multi-Omics?

Traditional statistical methods typically represent multi-omics data as high-dimensional tabular matrices, often overlooking the intricate relationships and interactions between biomolecular entities. BioNeuralNet overcomes these limitations by using Graph Neural Networks (GNNs) to explicitly model multi-omics data as biological networks.

BioNeuralNet supports several GNN architectures suited to different biological contexts:

  • GCN: Effective for uniformly connected graphs.

  • GAT: Uses attention mechanisms to highlight key biological relationships.

  • GraphSAGE: Designed for large or dynamic datasets.

  • GIN: Sensitive to subtle feature variations in molecular structures.

Network Embeddings

By projecting high-dimensional omics networks into latent spaces, BioNeuralNet distills complex, nonlinear molecular relationships into compact vectorized representations. The t-SNE projection below reveals distinct clusters corresponding to different omics modalities (e.g., DNA Methylation, RNA, miRNA).

t-SNE visualization of Network Embeddings

2D projection of Network Embeddings showing distinct separation between omics modalities. See Embeddings Full Size

For detailed explanations of BioNeuralNet’s supported GNN architectures, see GNN Embeddings.

Key Considerations for Robust Analysis

Multi-omics pipelines involve sequential decisions across data alignment, feature selection, network construction, and downstream modeling. Each stage shapes the one that follows. BioNeuralNet provides a structured Data Decision Framework to guide these choices with concrete parameter recommendations grounded in empirical results from TCGA and COPD workflows.

BioNeuralNet Data-Driven Decision Flowchart

Data-driven decision flowchart for navigating the BioNeuralNet pipeline. See Flow Chart Full Size

For the full stage-by-stage parameter reference, see Data Decision Framework. For preprocessing utilities, see Preprocessing Utilities. Per-cohort feature implementation details are available in the Notebooks.

Explore BioNeuralNet’s Documentation

For detailed examples and tutorials, visit:

  • Quick Start: A series of walkthroughs demonstrating the BioNeuralNet workflow from start to finish.

  • Notebooks: A collection of demonstration notebooks showcasing end-to-end analyses on TCGA datasets.

Documentation Sections:

  • Quick Start: End-to-end notebook walkthrough using a synthetic demo dataset, covering network construction, subgraph detection, and disease prediction.

  • Data Decision Framework: Comprehensive stage-by-stage parameter reference and decision guide grounded in empirical results from TCGA and COPD workflows.

  • Notebooks: Demonstration notebooks showcasing end-to-end analyses on TCGA-BRCA, TCGA-LGG, TCGA-KIPAN, and ROSMAP datasets.

  • GNN Embeddings: Overview of supported GNN architectures (GCN, GAT, GraphSAGE, GIN) and embedding generation.

  • Subgraph Detection & Biomarkers: Phenotype-aware subgraph detection using CorrelatedLouvain, CorrelatedPageRank, and HybridLouvain, with TCGA-LGG and ROSMAP case studies.

  • Network Construction & Analysis: Tools for network construction, topology analysis, and automated network search.

  • Downstream Tasks: Downstream analysis pipelines including DPMON for phenotype prediction and SubjectRepresentation for patient-level profiling.

  • Metrics and Visualization: Visualization, quality evaluation, and performance benchmarking utilities.

  • Utils: Data preprocessing, feature selection, imputation, normalization, and network pruning.

  • Datasets Guide: Built-in multi-omics benchmark datasets (BRCA, LGG, KIPAN, ROSMAP) with cohort summaries and feature selection details.

  • External Tools: Utility functions for interoperability with R-based tools including SmCCNet cross-validation fold export.

  • User API: Full API reference for developers and advanced users.

Contributing to BioNeuralNet

We welcome contributions to BioNeuralNet! If you have ideas for new features, improvements, or bug fixes, please follow these steps:

  • Ways to contribute:

    • Report issues or bugs on our GitHub Issues page.

    • Suggest new features or improvements.

    • Share your experiences or use cases with the community.

  • Implementing new features:

    • Fork the repo and create a feature branch UCD-BDLab/BioNeuralNet.

    • Add tests and documentation for new features.

    • Run the test suite and and pre-commit hooks before opening a Pull Request(PR).

    • A new PR should pass all tests and adhere to the project’s coding standards.

    git clone https://github.com/UCD-BDLab/BioNeuralNet.git
    cd BioNeuralNet
    pip install -r requirements-dev.txt
    pre-commit install
    pytest --cov=bioneuralnet
    

Citation

If you use BioNeuralNet in your research, we kindly ask that you cite our paper:

Ramos, V., Hussein, S., et al. (2025). BioNeuralNet: A Graph Neural Network based Multi-Omics Network Data Analysis Tool. arXiv preprint arXiv:2507.20440 | DOI: 10.48550/arXiv.2507.20440.

Contents:

Indices and References