`rel-trial` Clinical trial database

Database Description: The clinical trial database is curated from AACT initiative, which consolidates all protocol and results data from studies registered on ClinicalTrials.gov. It offers extensive information about clinical trials, including study designs, participant demographics, intervention details, and outcomes. It is an important resource for health research, policy making, and therapeutic development.

Database Statistics:

Domain	Medical
Num of Tables	15
Num of Rows	5,852,157
Num of Columns	140
Starting Time	2000-01-01
Validation timestamp	2020-01-01
Testing timestamp	2021-01-01
Time window	1 year

Database schema:

To load this relational database in RelBench, do:

from relbench.datasets import get_dataset
dataset = get_dataset("rel-trial")

References:

[1] Clinical Trials Transformation Initiative.

Dataset License: Not specified.

Node Classification Tasks

`study-outcome`

Task Description: Predict if the trials will achieve its primary outcome (defined as p-value < 0.05).

Evaluation metric: AUROC

Node Regression Tasks

`study-adverse`

Task Description: Predict the number of affected patients with severe advsere events/death for the trial.

Evaluation metric: MAE

`site-success`

Task Description: Predict the success rate of a trial site in the next 1 year.

Evaluation metric: MAE

Link Prediction Tasks

`condition-sponsor-run`

Task Description: Predict whether the sponsor (pharma/hospital) will run clinical trials for the condition (disease)in next year

Evaluation metric: MAP

`site-sponsor-run`

Task Description: Predict whether this sponsor (pharma/hospital) will have a trial in the facility in next year

Evaluation metric: MAP

rel-trial Clinical trial database

Node Classification Tasks

study-outcome

Node Regression Tasks

study-adverse

site-success