publications
2024
- PreprintUnveiling Concept Attribution in Diffusion Models2024
- Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor AttacksIn International Conference on Learning Representations, 2024
- PreprintMetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs2024
- Fooling the Textual Fooler via Randomizing Latent RepresentationsIn Findings of the Association for Computational Linguistics: ACL 2024, 2024
- Understanding the Robustness of Randomized Feature Defense Against Query-Based Adversarial AttacksIn The Twelfth International Conference on Learning Representations, 2024
2023
- PreprintSynthesizing Physical Backdoor Datasets: An Automated Framework Leveraging Deep Generative Models2023
- Clean-label Backdoor Attacks by Selectively Poisoning with Limited Information from Target ClassIn NeurIPS 2023 Workshop on Backdoors in Deep Learning-The Good, the Bad, and the Ugly, 2023
- PreprintEveryone Can Attack: Repurpose Lossy Compression as a Natural Backdoor Attack2023
- A Cosine Similarity-based Method for Out-of-Distribution DetectionIn ICML 2023: The Second Workshop on Spurious Correlations, Invariance and Stability , 2023