ComptoxAI logo

ComptoxAI

by EPAepa.gov
VISIT OFFICIAL WEBSITE →

OVERVIEW

A free, public, and open-source AI toolkit and graph-structured knowledge base for computational and predictive toxicology research.

ComptoxAI is a new data infrastructure and toolkit designed to enable computational and artificial intelligence (AI) research in predictive toxicology. It is a free, public, and open-source resource developed by the Romano Lab (University of Pennsylvania) with contributions from the US EPA, and is supported by NIH grant funding.

At its core, ComptoxAI features a large, graph-formatted knowledge base, initially implemented in Neo4j, that rigorously aggregates and describes entities and relationships relevant to computational toxicology. This multimodal graph-formatted knowledge base integrates data from a diverse array of public third-party databases, including AOP-DB, DSSTox, Drugbank, Hetionet, PubChem, Reactome, and Tox21.

The platform provides diverse classes of users, including biomedical researchers, public health and regulatory officials, and the general public, with multiple interfaces for access and analysis. Users can access the knowledge base via:

  • A Python package for programmatic access and data analysis.
  • A REST web API for integration into other applications.
  • A browser-based graphical interface for simplified data query and visualization.
  • Direct access via the Cypher query language to a public copy of the Neo4j database.

Key capabilities demonstrated by ComptoxAI include the use of a “shortest path” module to identify mechanistic links between chemical exposure and disease, an “expand network” module to identify communities linked to toxicity, and a quantitative structure–activity relationship (QSAR) dataset generator. The goal is to rapidly answer complex questions about toxicology that are infeasible using previous technologies and data resources.

RATING & STATS

Founded
2021

KEY FEATURES

  • Graph-formatted Knowledge Base (Neo4j)
  • REST Web API for data access
  • Python API for programmatic access
  • Graph Machine Learning Toolkit
  • Shortest Path Query Module
  • Network Expansion Module
  • QSAR Dataset Generator
  • OWL Ontology for data structure

PRICING

Model: free
ComptoxAI is a free, public, and open-source toolkit, offered under a dual-license model including the GNU GPLv3. It is a research resource supported by US National Institutes of Health (NIH) grant funding.
FREE TIER

TECHNICAL DETAILS

Deployment: cloud, on_premise
Platforms: web, windows, mac, linux
🔌 API Available⚡ Open Source

USE CASES

Predictive Toxicology ResearchIdentifying Mechanistic Links between exposure and diseaseQuantitative Structure-Activity Relationship (QSAR) modelingChemical Toxicity Network AnalysisInformation Retrieval from Toxicology Knowledge Base

INTEGRATIONS

AOP-DBAOP-WikiDrugbankDSSToxHetionetNCBI GeneNCBI OMIMPubChemReactomeTox21

SUPPORT & IMPLEMENTATION

Support: email
Implementation Time: < 1 week
Target Company Size: small, medium, enterprise
TRAINING AVAILABLE

PROS & CONS

✓ Pros:
  • +Free, public, and open-source under GNU GPLv3
  • +Utilizes a graph-formatted knowledge base (Neo4j) for complex, relational queries
  • +Aggregates and integrates data from numerous high-value public toxicology databases
  • +Provides multiple access methods: Python API, REST API, and a browser-based interface
  • +Specifically designed for AI/Machine Learning research in computational toxicology
✗ Cons:
  • -Primarily a research tool, not a commercial-grade application
  • -Requires technical knowledge (Python/Cypher) for full functionality
  • -Under active development (e.g., transition from Neo4j to Memgraph)
  • -No dedicated commercial support or compliance certifications (HIPAA/SOC2)

ABOUT EPA