# SchizoNegSympAI

This repository contains the official implementation of the research paper *Analyzing Generative AI and Machine Learning in Auto-Assessing Schizophrenia's Negative Symptoms*.
The study introduces an automated system for the assessment of negative symptoms in schizophrenia based on the Clinical Assessment Interview for Negative Symptoms (CAINS) scale. It evaluates two primary domains:
- EXP Domain (Expression): Extracts and analyzes features from video and audio data using ensemble learning that combines ordinal and nominal classifiers.
- MAP Domain (Motivation and Pleasure): Leverages large language models with zero-shot prompting techniques to analyze clinical interview transcripts.
The results show strong alignment with expert ratings:
- EXP Assessment (ML-based): ICC(3,1) = 0.65 and weighted κ = 0.62 (moderate to substantial reliability).
- MAP Assessment (GenAI-based): ICC(3,1) = 0.82 and weighted κ = 0.77 (good reliability).
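For context, the weighted κ reported above penalizes rater disagreements by their distance on the ordinal rating scale, so near-misses count less than large discrepancies. A minimal pure-Python sketch of the statistic (quadratic weighting is an assumption; the exact weighting scheme used in the paper is not stated in this README):

```python
def weighted_kappa(r1, r2, categories, weights="quadratic"):
    """Cohen's weighted kappa between two raters over ordinal categories."""
    n = len(r1)
    idx = {c: i for i, c in enumerate(categories)}
    k = len(categories)
    # Observed joint distribution of (rater1, rater2) labels.
    obs = [[0.0] * k for _ in range(k)]
    for a, b in zip(r1, r2):
        obs[idx[a]][idx[b]] += 1 / n
    # Marginal distribution of each rater.
    p1 = [sum(row) for row in obs]
    p2 = [sum(obs[i][j] for i in range(k)) for j in range(k)]
    # Disagreement weight grows with distance on the ordinal scale.
    def w(i, j):
        d = abs(i - j)
        return d * d if weights == "quadratic" else d
    observed = sum(w(i, j) * obs[i][j] for i in range(k) for j in range(k))
    expected = sum(w(i, j) * p1[i] * p2[j] for i in range(k) for j in range(k))
    return 1.0 - observed / expected

# Adjacent-category disagreements are penalized only lightly.
kappa = weighted_kappa([0, 1, 1, 2, 3], [0, 1, 2, 2, 3], categories=[0, 1, 2, 3])
```

Perfect agreement yields κ = 1, and agreement at chance level yields κ = 0.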
## Project Structure

```
/
├── assets/                              # Resource files
└── src/
    ├── core/                            # Shared utilities and helper functions
    ├── ensemble/                        # Ensemble learning components
    │   ├── feature_extractor/           # Multimodal feature extraction
    │   │   ├── spkr_diarization.py      # Speaker diarization and separation
    │   │   ├── holistic_tracking.py     # Facial and pose analysis
    │   │   └── encoder.py               # Feature encoding
    │   └── estimator/                   # Classification models
    │       ├── train.py                 # Training pipeline for ensemble models
    │       └── inference.py             # Assessment using trained models
    ├── llm/                             # LLM-based MAP assessment
    │   ├── transcriber.py               # Audio-to-text transcription
    │   └── estimator.py                 # LLM-based assessment logic
    ├── config.yaml                      # Configuration file
    └── .env.example                     # Environment variable template
```
## Installation

- Build the Docker image:

  ```bash
  docker build --build-arg UID=$(id -u) --build-arg USERNAME=$(whoami) -t schizo-neg-symp-ai:1.0 .
  ```
- Run the Docker container, then attach a shell inside it:

  ```bash
  docker run --rm -it -d -v $(pwd):/home/$(whoami)/SchizoNegSympAI --gpus all --name schizo-neg-symp-ai schizo-neg-symp-ai:1.0
  docker exec -it schizo-neg-symp-ai /bin/bash
  ```
- Copy the environment template:

  ```bash
  cp src/.env.example src/.env
  ```
- Add your API keys to `src/.env`.
- Configure model parameters in `src/config.yaml`.
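The configuration keys referenced throughout this README follow a nested layout. A hypothetical sketch of `src/config.yaml`, showing only the keys this README mentions (all values are placeholders, and the real file likely contains additional keys):

```yaml
ensemble:
  path:
    embed_dir: ./data/embeddings        # where extracted feature files are written
    label_file: ./assets/labels.csv     # ground-truth CAINS ratings
    result_dir: ./results/ensemble      # training/inference outputs
  training:
    experiment_name: demo               # used in run-folder names
llm:
  path:
    transcript_dir: ./data/transcripts  # transcriber.py output
    result_dir: ./results/llm           # estimator.py output
  estimator:
    experiment_name: demo               # used in run-folder names
```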
## EXP Assessment

Evaluates expression-related symptoms (facial expression, vocal expression, expressive gestures, and quantity of speech).
- Place your audio and video files in `./data/audio` and `./data/video`, respectively. Ensure that the audio and video files for each subject share the same filename.
- Run speaker diarization:

  ```bash
  python src/ensemble/feature_extractor/spkr_diarization.py ./data/audio
  ```
- Run pose landmark detection:

  ```bash
  python src/ensemble/feature_extractor/holistic_tracking.py ./data/video
  ```
- Generate features:

  ```bash
  python src/ensemble/feature_extractor/encoder.py
  ```

  Feature files for each subject will be saved in the directory specified by `ensemble.path.embed_dir` in `src/config.yaml`.
- Prepare a ground-truth file in the format of `assets/labels.csv`, where "C1" represents the first assessment item in CAINS. Place this file at the path specified by `ensemble.path.label_file` in `src/config.yaml`.
- Train the ensemble classifiers:

  ```bash
  python src/ensemble/estimator/train.py
  ```

  The trained models will be saved in the folder specified by `ensemble.path.result_dir` in `src/config.yaml`. Each run creates a subfolder named `train_<exp_name>_<timestamp>`, where `exp_name` is set by `ensemble.training.experiment_name` in the config file.
- Run assessment on new data:

  ```bash
  python src/ensemble/estimator/inference.py --exp-dir </path/to/training_experiment>
  ```

  Results will be saved in the folder specified by `ensemble.path.result_dir` in `src/config.yaml`. Each run creates a subfolder named `inference_<exp_name>_<timestamp>`, where `exp_name` is set by `ensemble.training.experiment_name` in the config file.
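The per-run subfolder naming above can be illustrated with a small helper. A sketch under the assumption that the timestamp uses a sortable `YYYYmmdd_HHMMSS` format (the repo's actual timestamp format may differ):

```python
from datetime import datetime
from pathlib import Path

def run_dir(result_dir: str, exp_name: str, prefix: str) -> Path:
    """Build a per-run output folder named <prefix>_<exp_name>_<timestamp>."""
    # Timestamp format is an assumption, chosen so folder names sort by time.
    stamp = datetime.now().strftime("%Y%m%d_%H%M%S")
    return Path(result_dir) / f"{prefix}_{exp_name}_{stamp}"

train_out = run_dir("./results", "demo", prefix="train")
infer_out = run_dir("./results", "demo", prefix="inference")
```

Each invocation produces a fresh folder, so repeated runs never overwrite earlier results.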
## MAP Assessment

Assesses motivation-related symptoms (social relationships, work/school activities, and recreational activities) from clinical interview transcripts.
- Place your audio files in `./data/audio`.
- Transcribe audio to text:

  ```bash
  python src/llm/transcriber.py ./data/audio
  ```

  Transcripts for each subject will be saved in the directory specified by `llm.path.transcript_dir` in `src/config.yaml`.
- Perform MAP scoring with an LLM:

  ```bash
  python src/llm/estimator.py --model <model_id> [--output-dir </path/to/results>]
  ```

  If `--output-dir` is not specified, results will be saved in the directory specified by `llm.path.result_dir` in `src/config.yaml`. Each run creates a subfolder named `<exp_name>_<timestamp>`, where `exp_name` is set by `llm.estimator.experiment_name` in the config file.
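Zero-shot prompting means the model receives only the rating instructions and the transcript, with no labeled examples. A hypothetical prompt builder showing the general shape of such a request (the wording, 0–4 anchors, and item name are illustrative assumptions, not the repo's actual prompt):

```python
def build_map_prompt(item: str, transcript: str) -> str:
    """Assemble a zero-shot CAINS-style rating prompt for one MAP item."""
    # All instruction wording below is a hypothetical sketch.
    return (
        "You are a clinical rater using the CAINS scale.\n"
        f"Rate the patient's {item} on a 0-4 scale "
        "(0 = no impairment, 4 = severe impairment), "
        "based solely on the interview transcript below.\n"
        "Reply with a single integer.\n\n"
        f"Transcript:\n{transcript}"
    )

prompt = build_map_prompt(
    "motivation for social relationships",  # hypothetical item name
    "Interviewer: How often do you see friends? Patient: Rarely.",
)
```

The resulting string would then be sent to the chosen `<model_id>` via its API, with the integer reply parsed as the item score.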
## Acknowledgments

This work builds upon several open-source tools and APIs. We thank the teams behind the following projects for their contributions:
- Claude 3 Haiku
- fairseq
- Gemini API
- Hartley Spectral Pooling
- GPT-3.5 Turbo
- MediaPipe
- NeMo
- Ordinal Classifier
- Random Forest
- Whisper
## Citation

```bibtex
@article{10.1093/schbul/sbaf102,
  author  = {Liu, Chih-Min and Chan, Yi-Hsuan and Ho, Ming-Yang and Liu, Chen-Chung and Lu, Ming-Hsuan and Liao, Yi-An and Hsieh, Ming-Hsien and Tseng, Yufeng Jane},
  title   = {Analyzing Generative AI and Machine Learning in Auto-Assessing Schizophrenia's Negative Symptoms},
  journal = {Schizophrenia Bulletin},
  pages   = {sbaf102},
  year    = {2025},
  month   = {08},
  issn    = {1745-1701},
  doi     = {10.1093/schbul/sbaf102}
}
```
## License

Licensed under the MIT License.
