LLM Unlearning

Table of Contents
NeurIPS
ICML
ICLR
KDD
ACL
EMNLP
AAAI
USENIX Security Symposium
COLING
CIKM
Expert Syst. Appl.
Neural Networks
IEEE Trans. Knowl. Data Eng.
Nat. Mac. Intell.

NeurIPS

2024

Title Venue Year Link
Large Language Model Unlearning via Embedding-Corrupted Prompts. NeurIPS 2024 Link
Large Language Model Unlearning. NeurIPS 2024 Link
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024 Link
Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference. NeurIPS 2024 Link
Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large Language Models. NeurIPS 2024 Link
Soft Prompt Threats: Attacking Safety Alignment and Unlearning in Open-Source LLMs through the Embedding Space. NeurIPS 2024 Link
WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models. NeurIPS 2024 Link

ICML

2025

Title Venue Year Link
Adaptive Localization of Knowledge Negation for Continual LLM Unlearning. ICML 2025 Link
Exploring Criteria of Loss Reweighting to Enhance LLM Unlearning. ICML 2025 Link
Fast Exact Unlearning for In-Context Learning Data for LLMs. ICML 2025 Link
GRU: Mitigating the Trade-off between Unlearning and Retention for LLMs. ICML 2025 Link
Invariance Makes LLM Unlearning Resilient Even to Unanticipated Downstream Fine-Tuning. ICML 2025 Link
Tool Unlearning for Tool-Augmented LLMs. ICML 2025 Link
Towards LLM Unlearning Resilient to Relearning Attacks: A Sharpness-Aware Minimization Perspective and Beyond. ICML 2025 Link
Underestimated Privacy Risks for Minority Populations in Large Language Model Unlearning. ICML 2025 Link

2024

Title Venue Year Link
In-Context Unlearning: Language Models as Few-Shot Unlearners. ICML 2024 Link
To Each (Textual Sequence) Its Own: Improving Memorized-Data Unlearning in Large Language Models. ICML 2024 Link

ICLR

2025

Title Venue Year Link
A Closer Look at Machine Unlearning for Large Language Models. ICLR 2025 Link
A Probabilistic Perspective on Unlearning and Alignment for Large Language Models. ICLR 2025 Link
Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset. ICLR 2025 Link
Catastrophic Failure of LLM Unlearning via Quantization. ICLR 2025 Link
LLM Unlearning via Loss Adjustment with Only Forget Data. ICLR 2025 Link
MUSE: Machine Unlearning Six-Way Evaluation for Language Models. ICLR 2025 Link
On Large Language Model Continual Unlearning. ICLR 2025 Link
Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond. ICLR 2025 Link
Towards Effective Evaluations and Comparisons for LLM Unlearning Methods. ICLR 2025 Link
Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs. ICLR 2025 Link
Unified Parameter-Efficient Unlearning for LLMs. ICLR 2025 Link
Unlearning or Obfuscating? Jogging the Memory of Unlearned LLMs via Benign Relearning. ICLR 2025 Link

KDD

2025

Title Venue Year Link
LLM-Eraser: Optimizing Large Language Model Unlearning through Selective Pruning. KDD 2025 Link

ACL

2025

Title Venue Year Link
A General Framework to Enhance Fine-tuning-based LLM Unlearning. ACL 2025 Link
Answer When Needed, Forget When Not: Language Models Pretend to Forget via In-Context Knowledge Unlearning. ACL 2025 Link
Beyond Single-Value Metrics: Evaluating and Enhancing LLM Unlearning with Cognitive Diagnosis. ACL 2025 Link
Decoupling Memories, Muting Neurons: Towards Practical Machine Unlearning for Large Language Models. ACL 2025 Link
Disentangling Biased Knowledge from Reasoning in Large Language Models via Machine Unlearning. ACL 2025 Link
From Evasion to Concealment: Stealthy Knowledge Unlearning for LLMs. ACL 2025 Link
MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models. ACL 2025 Link
Modality-Aware Neuron Pruning for Unlearning in Multimodal Large Language Models. ACL 2025 Link
Opt-Out: Investigating Entity-Level Unlearning for Large Language Models via Optimal Transport. ACL 2025 Link
REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space. ACL 2025 Link
ReLearn: Unlearning via Learning for Large Language Models. ACL 2025 Link
Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning. ACL 2025 Link
SEUF: Is Unlearning One Expert Enough for Mixture-of-Experts LLMs? ACL 2025 Link
SafeEraser: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine Unlearning. ACL 2025 Link
Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training. ACL 2025 Link
Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation. ACL 2025 Link
Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation. ACL 2025 Link
Which Retain Set Matters for LLM Unlearning? A Case Study on Entity Unlearning. ACL 2025 Link

2024

Title Venue Year Link
Deciphering the Impact of Pretraining Data on Large Language Models through Machine Unlearning. ACL 2024 Link
Machine Unlearning of Pre-trained Large Language Models. ACL 2024 Link
Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models. ACL 2024 Link
Towards Safer Large Language Models through Machine Unlearning. ACL 2024 Link
Unlearning Traces the Influential Training Data of Language Models. ACL 2024 Link

2023

Title Venue Year Link
Knowledge Unlearning for Mitigating Privacy Risks in Language Models. ACL 2023 Link
Unlearning Bias in Language Models by Partitioning Gradients. ACL 2023 Link

EMNLP

2024

Title Venue Year Link
Can Machine Unlearning Reduce Social Bias in Language Models? EMNLP 2024 Link
Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language Models. EMNLP 2024 Link
Dissecting Fine-Tuning Unlearning in Large Language Models. EMNLP 2024 Link
EFUF: Efficient Fine-Grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models. EMNLP 2024 Link
Fine-grained Pluggable Gradient Ascent for Knowledge Unlearning in Language Models. EMNLP 2024 Link
SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning. EMNLP 2024 Link
To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models. EMNLP 2024 Link
Towards Robust Evaluation of Unlearning in LLMs via Data Transformations. EMNLP 2024 Link
ULMR: Unlearning Large Language Models via Negative Response and Model Parameter Average. EMNLP 2024 Link

2023

Title Venue Year Link
Preserving Privacy Through Dememorization: An Unlearning Technique For Mitigating Memorization Risks In Language Models. EMNLP 2023 Link
Unlearn What You Want to Forget: Efficient Unlearning for LLMs. EMNLP 2023 Link

AAAI

2025

Title Venue Year Link
Backdoor Token Unlearning: Exposing and Defending Backdoors in Pretrained Language Models. AAAI 2025 Link
Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage. AAAI 2025 Link
On Effects of Steering Latent Representation for Large Language Model Unlearning. AAAI 2025 Link
Selective Forgetting: Advancing Machine Unlearning Techniques and Evaluation in Language Models. AAAI 2025 Link
Towards Robust Knowledge Unlearning: An Adversarial Framework for Assessing and Improving Unlearning Robustness in Large Language Models. AAAI 2025 Link

USENIX Security Symposium

2025

Title Venue Year Link
Refusal Is Not an Option: Unlearning Safety Alignment of Large Language Models. USENIX Security Symposium 2025 Link

COLING

2025

Title Venue Year Link
Alternate Preference Optimization for Unlearning Factual Knowledge in Large Language Models. COLING 2025 Link
Unveiling Entity-Level Unlearning for Large Language Models: A Comprehensive Analysis. COLING 2025 Link

CIKM

2025

Title Venue Year Link
Pseudo-Inverse Prefix Tuning for Effective Unlearning in LLMs. CIKM 2025 Link

Expert Syst. Appl.

2025

Title Venue Year Link
Law LLM unlearning via interfere prompt, review output and update parameter: new challenges, method and baseline. Expert Syst. Appl. 2025 Link

Neural Networks

2025

Title Venue Year Link
DP2Unlearning: An efficient and guaranteed unlearning framework for LLMs. Neural Networks 2025 Link

IEEE Trans. Knowl. Data Eng.

2025

Title Venue Year Link
Exact and Efficient Unlearning for Large Language Model-Based Recommendation. IEEE Trans. Knowl. Data Eng. 2025 Link

Nat. Mac. Intell.

2025

Title Venue Year Link
Rethinking machine unlearning for large language models. Nat. Mac. Intell. 2025 Link