About

I am on the job market this year (2025-2026)!

I am a final-year Ph.D. student in Computer Science at Purdue University. I am fortunate to work with Professor Xiangyu Zhang.

My research focuses on building secure and trustworthy foundations for modern generative and agentic AI systems (LLMs/VLMs/DMs/Agents) by mitigating adversarial threats such as backdoors, jailbreaks, and AIGC-based deception. I aim to endow these models with human-level risk awareness through a unified, scalable framework that integrates threat characterization, red-teaming, and awareness-oriented alignment training.

I serve as the team leader and core member of the Perspecta-PurdueUMass team, which competes in the TrojAI Program, an AI backdoor detection competition held by IARPA. Over the past four years, our team has achieved top-tier performance, securing leading positions in 14 out of 20 rounds. In the course of this competition, I have developed and refined a suite of scanning methodologies for detecting backdoors across a variety of machine learning models, including object detection systems, malware detectors, and large language models. I also participate in the Amazon Nova AI Challenge as the co-lead of the PurCL team, where we develop advanced red-teaming tools to help build trustworthy AI coding agents. Our team was awarded first place as the winning attacker team ($250,000) in the finals.

📢: I am always open to discussions and collaborations. If you are interested in exploring ideas related to AI safety and security, please feel free to contact me via email.

News

Dec. 2025: Two papers on VLM-based deepfake detection and backdoor mitigation got accepted to WACV 2026.

Nov. 2025: Our paper on AI-generated text origin detection got accepted to EMNLP 2025.

Nov. 2025: Three papers got accepted to ResponsibleFM Workshop@NeurIPS 2025.

Oct. 2025: Our paper on benchmarking diffusion model safety got accepted to ICCV 2025.

Jul. 2025: Our team PurCL won first place in the Amazon Nova AI Challenge. Shout out to the team co-lead Xiangzhe and all teammates/advisors! BOILER UP!

Selected Publications [Full List] (* equal contribution)

AuthGuard: Generalizable Deepfake Detection via Language Guidance
Guangyu Shen, Zhihua Li, Xiang Xu, Tianchen Zhao, Zheng Zhang, Dongsheng An, Zhuowen Tu, Yifan Xing, Qin Zhang
Winter Conference on Applications of Computer Vision 2026 (WACV 2026)
From Poisoned to Aware: Fostering Backdoor Self-Awareness in LLMs
Guangyu Shen, Siyuan Cheng, Xiangzhe Xu, Yuan Zhou, Hanxi Guo, Zhuo Zhang, Xiangyu Zhang
NeurIPS 2025 Workshop on Socially Responsible and Trustworthy Foundation Models (ResponsibleFM)
ASTRA: Autonomous Spatial-Temporal Red-teaming for AI Software Assistants
Xiangzhe Xu^*, Guangyu Shen^*, Zian Su, Siyuan Cheng, Hanxi Guo, Lu Yan, Xuan Chen, Jiasheng Jiang, Xiaolong Jin, Chengpeng Wang, Zhuo Zhang, Xiangyu Zhang
NeurIPS 2025 Workshop on Socially Responsible and Trustworthy Foundation Models (ResponsibleFM)
🏆 Winning red-teaming solution in Amazon Nova AI Challenge
Rapid Optimization for Jailbreaking LLMs via Subconscious Exploitation and Echopraxia
Guangyu Shen^*, Siyuan Cheng^*, Kaiyuan Zhang, Lu Yan, Shengwei An, Zhuo Zhang, Guanhong Tao, Shiqing Ma, Xiangyu Zhang
NeurIPS 2025 Workshop on Socially Responsible and Trustworthy Foundation Models (ResponsibleFM)
BAIT: Large Language Model Backdoor Scanning by Inverting Attack Target
Guangyu Shen^*, Siyuan Cheng^*, Zhuo Zhang, Guanhong Tao, Kaiyuan Zhang, Hanxi Guo, Lu Yan, Xiaolong Jin, Shengwei An, Shiqing Ma, Xiangyu Zhang
Proceedings of the 46th IEEE Symposium on Security and Privacy (S&P 2025)
ODSCAN: Backdoor Scanning for Object Detection Models
Siyuan Cheng^*, Guangyu Shen^*, Guanhong Tao, Kaiyuan Zhang, Zhuo Zhang, Shengwei An, Xiangzhe Xu, Yingqi Liu, Shiqing Ma, Xiangyu Zhang
Proceedings of the 45th IEEE Symposium on Security and Privacy (S&P 2024)
UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Siyuan Cheng^*, Guangyu Shen^*, Kaiyuan Zhang, Guanhong Tao, Shengwei An, Hanxi Guo, Shiqing Ma, Xiangyu Zhang
The 18th European Conference on Computer Vision (ECCV 2024)
Django: Detecting Trojans in Object Detection Models via Gaussian Focus Calibration
Guangyu Shen^*, Siyuan Cheng^*, Guanhong Tao, Kaiyuan Zhang, Yingqi Liu, Shengwei An, Shiqing Ma, Xiangyu Zhang
Proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)
PICCOLO: Exposing Complex Backdoors in NLP Transformer Models
Yingqi Liu^*, Guangyu Shen^*, Guanhong Tao, Shengwei An, Shiqing Ma, Xiangyu Zhang
Proceedings of the 43rd IEEE Symposium on Security and Privacy (S&P 2022)
Constrained Optimization with Dynamic Bound-scaling for Effective NLP Backdoor Defense
Guangyu Shen^*, Yingqi Liu^*, Guanhong Tao, Qiuling Xu, Zhuo Zhang, Shengwei An, Shiqing Ma, Xiangyu Zhang
Proceedings of the 39th International Conference on Machine Learning (ICML 2022)
Complex Backdoor Detection by Symmetric Feature Differencing
Yingqi Liu^*, Guangyu Shen^*, Guanhong Tao, Zhenting Wang, Shiqing Ma, Xiangyu Zhang
IEEE/CVF Conference on Computer Vision and Pattern Recognition 2022 (CVPR 2022)
Backdoor Scanning for Deep Neural Networks through K-Arm Optimization
Guangyu Shen^*, Yingqi Liu^*, Guanhong Tao, Shengwei An, Qiuling Xu, Siyuan Cheng, Shiqing Ma, Xiangyu Zhang
Proceedings of the 38th International Conference on Machine Learning (ICML 2021)

Awards & Honors

Fellowship

Bilsland Dissertation Fellowship, Purdue, 2025

Competition Record

1st place for Amazon Nova AI Challenge ($250,000)
1st place for TrojAI in 14 out of 20 rounds
2nd place for 2 tracks in Trojan Detection Competition (TDC2022)
- Target Label Prediction
- Trigger Synthesis
3rd place and most efficient method award for Track II: Backdoor Trigger Recovery for Models in The Competition for LLM and Agent Safety (CLAS2024)

Teaching

Guest lecture at Virginia Tech, CS 6804: AI Security and Privacy, invited by Prof. Shengwei An, Nov. 2025
Guest lecture at Rice University, COMP 620: Machine Learning System Seminar, invited by Prof. Yuke Wang, Nov. 2025
Guest lecture at University of Utah, CS 6958: Machine Learning Security, invited by Prof. Guanhong Tao, Oct. 2025
Guest lecture at University of Georgia, CSCI 8000: Red-Teaming for LLM Agents, invited by Prof. Zhen Xiang, Oct. 2025
Guest lecture at University of Utah, CS 6958: Machine Learning Security, invited by Prof. Guanhong Tao, Oct. 2024
Teaching Assistant at Purdue University, CS 59200 - AI and Security, Aug. 2024
Guest lecture at University of Massachusetts, Amherst, COMPSCI 360: Introduction to Computer and Network Security, invited by Prof. Shiqing Ma, Feb. 2024

Services

Competition Co-chair

IEEE Trojan Removal Competition, 2022

Program Committee

ACM Conference on Computer and Communications Security (CCS): 2025, 2026
8th Deep Learning Security and Privacy Workshop (DLSP): 2025
Workshop on Backdoors in Deep Learning: The Good, the Bad, and the Ugly (BUGS), NeurIPS 2023
Workshop on Secure and Trustworthy Deep Learning Systems (SecTL), AsiaCCS 2023

Reviewer

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR): 2022, 2023
International Conference on Machine Learning (ICML): 2022, 2023, 2024, 2025
European Conference on Computer Vision (ECCV): 2022
International Conference on Computer Vision (ICCV): 2023
Conference on Neural Information Processing Systems (NeurIPS): 2022, 2023
International Conference on Learning Representations (ICLR): 2025

Experiences

Applied Scientist Intern, Amazon AWS AI Lab, May 2024-Aug. 2024, May 2023-Aug. 2023
Research Assistant, working with Prof. Baijian Yang, Purdue University, Aug. 2019-Jan. 2020
Summer Research Intern, working with Prof. Junfeng Yang and Prof. Baishakhi Ray, Columbia University, May 2019-Aug. 2019

Personal

I love movies and music. 🏂 is my new favorite sport. :p

Guangyu (Noah) Shen