BillChan226

Follow

🐝

learning

Zhaorun Chen BillChan226

🐝

learning

Follow

PhD student at Secure Learning Lab at the CS department of the University of Chicago

38 followers · 15 following

Achievements

Achievements

Highlights

Pro

Organizations

BillChan226/README.md

Hi there, I'm Zhaorun Personal Website 👋

Connect with me:

🏖️ My Research Interests

Trustworthy deployment and safe interactions with large foundation models and agents from both a theoretical and empirical perspective.
enhancing LLM's trustworthiness via retrieval-augmented generation (RAG) and robustness certificates for hallucination, alignment, jailbreaks and privacy.

Pinned Loading

HALC HALC Public

[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"

Python 74 1
AI-secure/AgentPoison AI-secure/AgentPoison Public

[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"

Python 80 8
MJ-Bench/MJ-Bench MJ-Bench/MJ-Bench Public

Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"

Jupyter Notebook 40 5
SafeWatch SafeWatch Public

Official implementation for "SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations"

4