Lipeng (Tony) He's picture

2 6 25

Lipeng (Tony) He

ttttonyhe

·

https://lipeng.ac

ttttonyhe

AI & ML interests

Trustworthy Machine Learning

Recent Activity

updated a collection 17 days ago

updated a collection 22 days ago

updated a collection 22 days ago

Red-Teaming Models & Datasets

View all activity

Organizations

commented a paper 3 months ago

Locket: Robust Feature-Locking Technique for Language Models

Paper • 2510.12117 • Published Oct 14, 2025 • 1 •

commented a paper 11 months ago

Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense

Paper • 2502.00840 • Published Feb 2, 2025 •