The datasets used in this benchmark are protected and therefore need to be downloaded from their respective sources:
- MIMIC: MIMIC-IV, MIMIC-IV-Note, and MIMIC-IV-ED
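As a minimal sketch, credentialed PhysioNet datasets are typically fetched with `wget` after signing the data use agreement; the project URLs and version numbers below are assumptions and should be checked against the current releases on physionet.org:

```bash
# Hedged example of downloading the MIMIC sources from PhysioNet.
# Requires credentialed access and a signed DUA; the version paths
# (3.1 / 2.2) are placeholders -- verify the current release numbers.
wget -r -N -c -np --user "$PHYSIONET_USER" --ask-password \
  https://physionet.org/files/mimiciv/3.1/
wget -r -N -c -np --user "$PHYSIONET_USER" --ask-password \
  https://physionet.org/files/mimic-iv-note/2.2/
wget -r -N -c -np --user "$PHYSIONET_USER" --ask-password \
  https://physionet.org/files/mimic-iv-ed/2.2/
```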
For each TASK, run the relevant files under `process_data` and `benchmarks`; the dataset will be created under `data/{TASK}`. Set `DATA_DIR` to `data/{TASK}` for each task.
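As a minimal sketch, assuming `DATA_DIR` is read from the environment (if it is instead a constant in a config file, set it there) and using a hypothetical task name:

```bash
# "sepsis" is a placeholder task name -- substitute an actual benchmark task.
TASK=sepsis
export DATA_DIR="data/${TASK}"
ls "$DATA_DIR"   # the processed dataset should appear here after running the scripts above
```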
- For the classical models XGBoost and Random Forest, run `classical/trainer --task {TASK}`. Additionally, for stability, run `classical/stability --task {TASK}` (a combined example of these commands is sketched after this list).
- For the transformers GPT2, GPT2-AR, and Mamba, run `python trainer_binary.py --task {TASK}`. The steps for reproducing the tokenizer are under `tokenizer`. Optionally, to pre-train the model, use `python pretrain/trainer.py`, and modify the loader in `pretrain/trainer.py` to load the dataset you want to pre-train on.
- For LLMs, refer to the README.md in the `llm` directory. Note that we ran experiments on an Nvidia A100 80GB GPU, and the code is not optimized for other GPUs. PhysioNet policies for the MIMIC dataset prevent naively using API providers such as OpenAI or Claude; refer here for details.
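For concreteness, a hedged end-to-end sketch of the training commands above. The task name is a placeholder, and the invocations assume plain Python entry points (e.g. `python classical/trainer.py`); adjust them to the repository's actual layout:

```bash
# "sepsis" is a hypothetical task name -- substitute an actual benchmark task.
TASK=sepsis
export DATA_DIR="data/${TASK}"

# Classical baselines (XGBoost, Random Forest) and their stability analysis
python classical/trainer.py   --task "$TASK"
python classical/stability.py --task "$TASK"

# Transformer baselines (GPT2, GPT2-AR, Mamba)
python trainer_binary.py --task "$TASK"

# Optional pre-training (edit the loader in pretrain/trainer.py beforehand
# so it points at the dataset you want to pre-train on)
python pretrain/trainer.py
```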