L L M Unlearning
Table of Contents
NeurIPS
ICLR
KDD
AAAI
USENIX Security Symposium
COLING
CIKM
Expert Syst. Appl.
Neural Networks
IEEE Trans. Knowl. Data Eng.
Nat. Mac. Intell.
NeurIPS
Expand NeurIPS
2024
| Title | Venue | Year | Link |
|---|---|---|---|
| Large Language Model Unlearning via Embedding-Corrupted Prompts. | NeurIPS | 2024 | Link |
| Large Language Model Unlearning. | NeurIPS | 2024 | Link |
| RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. | NeurIPS | 2024 | Link |
| Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference. | NeurIPS | 2024 | Link |
| Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large Language Models. | NeurIPS | 2024 | Link |
| Soft Prompt Threats: Attacking Safety Alignment and Unlearning in Open-Source LLMs through the Embedding Space. | NeurIPS | 2024 | Link |
| WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models. | NeurIPS | 2024 | Link |
ICML
Expand ICML
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| Adaptive Localization of Knowledge Negation for Continual LLM Unlearning. | ICML | 2025 | Link |
| Exploring Criteria of Loss Reweighting to Enhance LLM Unlearning. | ICML | 2025 | Link |
| Fast Exact Unlearning for In-Context Learning Data for LLMs. | ICML | 2025 | Link |
| GRU: Mitigating the Trade-off between Unlearning and Retention for LLMs. | ICML | 2025 | Link |
| Invariance Makes LLM Unlearning Resilient Even to Unanticipated Downstream Fine-Tuning. | ICML | 2025 | Link |
| Tool Unlearning for Tool-Augmented LLMs. | ICML | 2025 | Link |
| Towards LLM Unlearning Resilient to Relearning Attacks: A Sharpness-Aware Minimization Perspective and Beyond. | ICML | 2025 | Link |
| Underestimated Privacy Risks for Minority Populations in Large Language Model Unlearning. | ICML | 2025 | Link |
2024
| Title | Venue | Year | Link |
|---|---|---|---|
| In-Context Unlearning: Language Models as Few-Shot Unlearners. | ICML | 2024 | Link |
| To Each (Textual Sequence) Its Own: Improving Memorized-Data Unlearning in Large Language Models. | ICML | 2024 | Link |
ICLR
Expand ICLR
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| A Closer Look at Machine Unlearning for Large Language Models. | ICLR | 2025 | Link |
| A Probabilistic Perspective on Unlearning and Alignment for Large Language Models. | ICLR | 2025 | Link |
| Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset. | ICLR | 2025 | Link |
| Catastrophic Failure of LLM Unlearning via Quantization. | ICLR | 2025 | Link |
| LLM Unlearning via Loss Adjustment with Only Forget Data. | ICLR | 2025 | Link |
| MUSE: Machine Unlearning Six-Way Evaluation for Language Models. | ICLR | 2025 | Link |
| On Large Language Model Continual Unlearning. | ICLR | 2025 | Link |
| Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond. | ICLR | 2025 | Link |
| Towards Effective Evaluations and Comparisons for LLM Unlearning Methods. | ICLR | 2025 | Link |
| Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs. | ICLR | 2025 | Link |
| Unified Parameter-Efficient Unlearning for LLMs. | ICLR | 2025 | Link |
| Unlearning or Obfuscating? Jogging the Memory of Unlearned LLMs via Benign Relearning. | ICLR | 2025 | Link |
KDD
Expand KDD
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| LLM-Eraser: Optimizing Large Language Model Unlearning through Selective Pruning. | KDD | 2025 | Link |
ACL
Expand ACL
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| A General Framework to Enhance Fine-tuning-based LLM Unlearning. | ACL | 2025 | Link |
| Answer When Needed, Forget When Not: Language Models Pretend to Forget via In-Context Knowledge Unlearning. | ACL | 2025 | Link |
| Beyond Single-Value Metrics: Evaluating and Enhancing LLM Unlearning with Cognitive Diagnosis. | ACL | 2025 | Link |
| Decoupling Memories, Muting Neurons: Towards Practical Machine Unlearning for Large Language Models. | ACL | 2025 | Link |
| Disentangling Biased Knowledge from Reasoning in Large Language Models via Machine Unlearning. | ACL | 2025 | Link |
| From Evasion to Concealment: Stealthy Knowledge Unlearning for LLMs. | ACL | 2025 | Link |
| MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models. | ACL | 2025 | Link |
| Modality-Aware Neuron Pruning for Unlearning in Multimodal Large Language Models. | ACL | 2025 | Link |
| Opt-Out: Investigating Entity-Level Unlearning for Large Language Models via Optimal Transport. | ACL | 2025 | Link |
| REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space. | ACL | 2025 | Link |
| ReLearn: Unlearning via Learning for Large Language Models. | ACL | 2025 | Link |
| Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning. | ACL | 2025 | Link |
| SEUF: Is Unlearning One Expert Enough for Mixture-of-Experts LLMs? | ACL | 2025 | Link |
| SafeEraser: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine Unlearning. | ACL | 2025 | Link |
| Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training. | ACL | 2025 | Link |
| Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation. | ACL | 2025 | Link |
| Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation. | ACL | 2025 | Link |
| Which Retain Set Matters for LLM Unlearning? A Case Study on Entity Unlearning. | ACL | 2025 | Link |
2024
| Title | Venue | Year | Link |
|---|---|---|---|
| Deciphering the Impact of Pretraining Data on Large Language Models through Machine Unlearning. | ACL | 2024 | Link |
| Machine Unlearning of Pre-trained Large Language Models. | ACL | 2024 | Link |
| Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models. | ACL | 2024 | Link |
| Towards Safer Large Language Models through Machine Unlearning. | ACL | 2024 | Link |
| Unlearning Traces the Influential Training Data of Language Models. | ACL | 2024 | Link |
2023
| Title | Venue | Year | Link |
|---|---|---|---|
| Knowledge Unlearning for Mitigating Privacy Risks in Language Models. | ACL | 2023 | Link |
| Unlearning Bias in Language Models by Partitioning Gradients. | ACL | 2023 | Link |
EMNLP
Expand EMNLP
2024
| Title | Venue | Year | Link |
|---|---|---|---|
| Can Machine Unlearning Reduce Social Bias in Language Models? | EMNLP | 2024 | Link |
| Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language Models. | EMNLP | 2024 | Link |
| Dissecting Fine-Tuning Unlearning in Large Language Models. | EMNLP | 2024 | Link |
| EFUF: Efficient Fine-Grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models. | EMNLP | 2024 | Link |
| Fine-grained Pluggable Gradient Ascent for Knowledge Unlearning in Language Models. | EMNLP | 2024 | Link |
| SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning. | EMNLP | 2024 | Link |
| To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models. | EMNLP | 2024 | Link |
| Towards Robust Evaluation of Unlearning in LLMs via Data Transformations. | EMNLP | 2024 | Link |
| ULMR: Unlearning Large Language Models via Negative Response and Model Parameter Average. | EMNLP | 2024 | Link |
2023
| Title | Venue | Year | Link |
|---|---|---|---|
| Preserving Privacy Through Dememorization: An Unlearning Technique For Mitigating Memorization Risks In Language Models. | EMNLP | 2023 | Link |
| Unlearn What You Want to Forget: Efficient Unlearning for LLMs. | EMNLP | 2023 | Link |
AAAI
Expand AAAI
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| Backdoor Token Unlearning: Exposing and Defending Backdoors in Pretrained Language Models. | AAAI | 2025 | Link |
| Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage. | AAAI | 2025 | Link |
| On Effects of Steering Latent Representation for Large Language Model Unlearning. | AAAI | 2025 | Link |
| Selective Forgetting: Advancing Machine Unlearning Techniques and Evaluation in Language Models. | AAAI | 2025 | Link |
| Towards Robust Knowledge Unlearning: An Adversarial Framework for Assessing and Improving Unlearning Robustness in Large Language Models. | AAAI | 2025 | Link |
USENIX Security Symposium
Expand USENIX Security Symposium
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| Refusal Is Not an Option: Unlearning Safety Alignment of Large Language Models. | USENIX Security Symposium | 2025 | Link |
COLING
Expand COLING
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| Alternate Preference Optimization for Unlearning Factual Knowledge in Large Language Models. | COLING | 2025 | Link |
| Unveiling Entity-Level Unlearning for Large Language Models: A Comprehensive Analysis. | COLING | 2025 | Link |
CIKM
Expand CIKM
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| Pseudo-Inverse Prefix Tuning for Effective Unlearning in LLMs. | CIKM | 2025 | Link |
Expert Syst. Appl.
Expand Expert Syst. Appl.
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| Law LLM unlearning via interfere prompt, review output and update parameter: new challenges, method and baseline. | Expert Syst. Appl. | 2025 | Link |
Neural Networks
Expand Neural Networks
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| DP2Unlearning: An efficient and guaranteed unlearning framework for LLMs. | Neural Networks | 2025 | Link |
IEEE Trans. Knowl. Data Eng.
Expand IEEE Trans. Knowl. Data Eng.
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| Exact and Efficient Unlearning for Large Language Model-Based Recommendation. | IEEE Trans. Knowl. Data Eng. | 2025 | Link |
Nat. Mac. Intell.
Expand Nat. Mac. Intell.
2025
| Title | Venue | Year | Link |
|---|---|---|---|
| Rethinking machine unlearning for large language models. | Nat. Mac. Intell. | 2025 | Link |