Skip to content
View yzhao062's full-sized avatar
💜
Very Busy Since Joined USC
💜
Very Busy Since Joined USC

Organizations

@pygod-team @Open-Source-ML @USC-FORTIS

Block or report yzhao062

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yzhao062/README.md

Note on External Advisory/Consultancy:
Dr. Zhao occasionally provides technical advice to selected projects on topics such as privacy-preserving AI and secure machine learning systems.
These collaborations are strictly technical in nature, with no involvement in financial operations, external fundraising, or investment-related activities.


😄 I am an Assistant Professor at USC Computer Science; see the latest information at my homepage.

🌱 Research Interests

My research centers on building reliable, safe, and scalable AI systems, with a focus on understanding and mitigating failure modes in modern foundation models and agentic systems.
I organize my work into two tightly connected tiers:

  • Tier 1: advancing the scientific foundations of reliability and safety in modern AI systems
  • Tier 2: translating these foundations into system-level evaluation frameworks and high-impact scientific and societal applications

Tier 1: Foundations of Reliable & Safe AI

I study why and how modern AI systems fail under distribution shift, uncertainty, and strategic pressure, and develop methods to make their behavior more predictable and reliable.
This tier comprises two complementary directions:

  • LLM & Agent Safety
    Understanding and mitigating failure modes in large language models and agentic systems, including hallucinations, jailbreaks, privacy leakage, model extraction, and multi-agent instability.

  • Robustness & Failure Detection
    Developing algorithms and benchmarks to identify abnormal or unreliable behavior, grounded in robustness, out-of-distribution generalization, and anomaly detection.

Keywords:
LLM Safety, Robustness, Agents, Hallucination Mitigation, Jailbreak Detection, OOD Generalization, Failure Analysis


Tier 2: System-Level Evaluation & Scientific/Societal Impact

I adopt a system-oriented perspective to evaluate, stress-test, and deploy reliable AI in realistic settings, and apply these methods to domains where failures carry high cost.
This tier focuses on two areas that operationalize foundational advances:

  • Evaluation & Benchmarking
    Designing scalable evaluation frameworks, benchmarks, and workflows that probe model and agent behavior under realistic and adversarial conditions.

  • AI for Science & Society
    Applying reliable foundation models to high-impact domains, including climate and weather forecasting, healthcare and biomedicine, and political or social decision-making.

Keywords:
Evaluation, Benchmarking, System-Level Analysis, AI for Science, Scientific Foundation Models, Climate & Weather Modeling, AI for Healthcare


📫 Contact me by:


Pinned Loading

  1. pyod pyod Public

    A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques

    Python 9.7k 1.5k

  2. USC-FORTIS/AD-AGENT USC-FORTIS/AD-AGENT Public

    A multi-agent framework to fully automate anomaly detection in different modalities, tabular, graph, time series, and more (work in progress)!

    Python 84 29

  3. anomaly-detection-resources anomaly-detection-resources Public

    Anomaly detection related books, papers, videos, and toolboxes. Last update late 2025 for LLM and VLM works!

    Python 9.1k 1.8k

  4. USC-FORTIS/NLP-ADBench USC-FORTIS/NLP-ADBench Public

    [EMNLP Findings 2025]. NLP-ADBench is a comprehensive benchmarking tool designed for Anomaly Detection in Natural Language Processing (NLP).

    Python 20

  5. Minqi824/ADBench Minqi824/ADBench Public

    Official Implement of "ADBench: Anomaly Detection Benchmark", NeurIPS 2022.

    Python 1k 149

  6. USC-FORTIS/AD-LLM USC-FORTIS/AD-LLM Public

    [ACL Findings 2025] A benchmark for anomaly detection using large language models. It supports zero-shot detection, data augmentation, and model selection, with scripts and data for GPT-4 and Llama…

    Python 37 8