Prompt Corruption
ICLR
Expand ICLR
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| Improving Complex Reasoning with Dynamic Prompt Corruption: A Soft Prompt Optimization Approach. | ICLR | 2025 | Link |
arXiv
Expand arXiv
2024
| Title | Venue | Year | Link |
|---|---|---|---|
| Brain Surgery: Ensuring GDPR Compliance in Large Language Models via Concept Erasure | arXiv | 2024 | Link |
| Mechanistic interpretability of large language models with applications to the financial services industry | arXiv | 2024 | Link |
| Soft Begging: Modular and Efficient Shielding of LLMs against Prompt Injection and Jailbreaking based on Prompt Tuning | arXiv | 2024 | Link |