Token Reduction in ViTs

18 notes in this category

Token Reduction in ViTs — Overview

ViT token efficiency — Pruning · Merging · Pooling · Hybrid, with 17 key papers at a glance.

Survey

2026 · survey
[Token Cropr] Token Cropr: Faster ViTs for Quite a Few Tasks

Prunes tokens by task relevance using auxiliary cross-attention heads that are thrown away after training, plus Last Layer Fusion to revive pruned tokens for dense tasks.

CVPR 2025

2023 · pruning
[Frequency-Aware TR] Frequency-Aware Token Reduction for Efficient Vision Transformer

Reads token reduction through a frequency lens: keeps high-frequency tokens (which fight rank collapse) and squeezes the low-frequency rest into a compact DC token.

NeurIPS 2025

2023 · pruning
[MCTF] Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers

Fuses tokens by a product of three criteria (similarity, informativeness, size) with one-step-ahead attention and bidirectional bipartite matching, improving accuracy over the base model while cutting FLOPs.

CVPR 2024

2023 · merging
[STAR] Synergistic Patch Pruning for Vision Transformer: Unifying Intra- & Inter-Layer Patch Importance

Fuses online intra-layer [CLS] attention with offline inter-layer LRP importance, and auto-tunes per-layer retention rates from patch similarity.

ICLR 2024

2023 · pruning
[Token Fusion / ToFu] Bridging the Gap between Token Pruning and Token Merging

Switches between pruning (early layers) and merging (later layers) by each layer's functional linearity, with a norm-preserving MLERP merge.

WACV 2024

2023 · hybrid
[DTEM] Learning to Merge Tokens via Decoupled Embedding for Efficient Vision Transformers

Learns a lightweight embedding dedicated to merging — decoupled from the ViT forward pass — via a continuously relaxed (differentiable) token merging.

NeurIPS 2024

2023 · merging
[Zero-TPrune] Zero-Shot Token Pruning through Leveraging of the Attention Graph in Pre-Trained Transformers

Treats the attention matrix as a directed graph and ranks tokens with a Weighted PageRank — pruning without any fine-tuning.

CVPR 2024

2023 · pruning
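
The graph-ranking idea behind Zero-TPrune can be illustrated with a plain power iteration over an attention matrix. This is a simplified sketch, not the paper's full Weighted PageRank procedure: `attention_pagerank` and `keep_top_k` are hypothetical names, `attn[j][i]` is assumed to be the attention token j pays to token i (each row sums to 1), and the iteration count and toy values are arbitrary.

```python
def attention_pagerank(attn, iters=50):
    """Score tokens by iterating s <- A^T s over a row-stochastic attention matrix."""
    n = len(attn)
    score = [1.0 / n] * n  # start from a uniform importance distribution
    for _ in range(iters):
        # A token gains importance when important tokens attend to it.
        new = [sum(attn[j][i] * score[j] for j in range(n)) for i in range(n)]
        total = sum(new)
        score = [s / total for s in new]  # renormalize to guard against drift
    return score


def keep_top_k(score, k):
    """Return indices of the k highest-scoring tokens, in original order."""
    ranked = sorted(range(len(score)), key=lambda i: score[i], reverse=True)
    return sorted(ranked[:k])


# Toy 3-token attention matrix (assumed values): every token attends mostly to token 0.
attn = [
    [0.8, 0.1, 0.1],
    [0.6, 0.3, 0.1],
    [0.7, 0.1, 0.2],
]
scores = attention_pagerank(attn)
kept = keep_top_k(scores, 2)
```

Because the scores come only from attention weights the pre-trained model already computes, a ranking of this kind needs no extra parameters or fine-tuning.
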
[Token Pooling] Token Pooling in Vision Transformers

Reframes token downsampling as minimizing reconstruction error, and solves it with simple, parameter-free clustering (K-Means / K-Medoids).

WACV 2023

2023 · pooling
[TPS] Joint Token Pruning & Squeezing Towards More Aggressive Compression of Vision Transformers

Instead of throwing pruned tokens away, squeezes their information into the surviving 'host' tokens — parameter-free matching + similarity-based fusing.

CVPR 2023

2023 · pruning
[DiffRate] Differentiable Compression Rate for Efficient Vision Transformers

Makes the per-layer compression rate differentiable, and prunes + merges in one unified framework.

ICCV 2023

2023 · hybrid
[ToMe] Token Merging: Your ViT But Faster

Combines similar tokens instead of pruning them, using bipartite soft matching; as fast as pruning, and works even without training.

ICLR 2023

2023 · merging
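
Bipartite soft matching can be sketched in a few lines. The version below is a minimal illustration under stated assumptions: it measures similarity on the token vectors themselves (rather than attention keys), merges by a plain average, and ignores token-size tracking; `cosine` and `bipartite_soft_matching` are hypothetical names, and `r` is the number of tokens to remove.

```python
import math


def cosine(u, v):
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(u, v))
    nu = math.sqrt(sum(x * x for x in u))
    nv = math.sqrt(sum(y * y for y in v))
    return dot / (nu * nv) if nu and nv else 0.0


def bipartite_soft_matching(tokens, r):
    """Merge the r most similar A->B pairs; returns unmerged A tokens, then B tokens."""
    a_idx = list(range(0, len(tokens), 2))  # set A: even positions
    b_idx = list(range(1, len(tokens), 2))  # set B: odd positions
    if not b_idx:
        return [list(t) for t in tokens]
    # Each A token proposes one edge to its most similar B token.
    edges = []
    for i in a_idx:
        j = max(b_idx, key=lambda b: cosine(tokens[i], tokens[b]))
        edges.append((cosine(tokens[i], tokens[j]), i, j))
    edges.sort(reverse=True)
    chosen = edges[: min(r, len(edges))]  # keep only the r strongest edges
    merged_src = {i for _, i, _ in chosen}
    # Accumulate sums and counts so each B token becomes the mean of its group.
    sums = {j: list(tokens[j]) for j in b_idx}
    counts = {j: 1 for j in b_idx}
    for _, i, j in chosen:
        sums[j] = [s + x for s, x in zip(sums[j], tokens[i])]
        counts[j] += 1
    unmerged = [list(tokens[i]) for i in a_idx if i not in merged_src]
    merged = [[s / counts[j] for s in sums[j]] for j in b_idx]
    return unmerged + merged


# Toy input: tokens 0 and 1 are identical, so they form the strongest edge.
tokens = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
reduced = bipartite_soft_matching(tokens, r=1)
```

Since every A token proposes exactly one edge, the matching needs no iterative clustering, which is what keeps the cost close to that of pruning.
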
[AS-ViT] Adaptive Sparse ViT: Learnable Adaptive Token Pruning by Fully Exploiting Self-Attention

Learnable thresholds replace fixed keep-ratios, scoring tokens for free from MHSA's own intermediate results.

IJCAI 2023

2023 · pruning
[ATS] Adaptive Token Sampling for Efficient Vision Transformers

Parameter-free: picks a variable number of tokens per image by inverse-transform sampling over the CDF of attention-based token scores.

ECCV 2022

2022 · pruning
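
How sampling a CDF yields a variable token count can be shown with a small sketch. It assumes per-token significance scores are already given (here as raw non-negative numbers) and uses evenly spaced sampling points; `sample_tokens` is a hypothetical name and `k` is the maximum token budget.

```python
from bisect import bisect_left
from itertools import accumulate


def sample_tokens(scores, k):
    """Pick at most k token indices by inverse-transform sampling of the score CDF."""
    total = sum(scores)
    cdf = [c / total for c in accumulate(scores)]  # normalized cumulative scores
    # Evenly spaced points in (0, 1); an assumption made for this sketch.
    points = [(2 * i + 1) / (2 * k) for i in range(k)]
    # Each point maps to the first token whose CDF value reaches it.
    picked = {min(bisect_left(cdf, u), len(scores) - 1) for u in points}
    return sorted(picked)  # duplicates collapse, so fewer than k tokens may remain


# One dominant token absorbs most sampling points, so few tokens survive.
peaked = sample_tokens([7, 1, 1, 1], k=4)
# Uniform scores spread the points out, so every token is kept.
flat = sample_tokens([1, 1, 1, 1], k=4)
```

Images whose scores concentrate on a few tokens end up with fewer selected tokens, while images with evenly spread scores keep more, all without any learned parameters.
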
[Evo-ViT] Slow-Fast Token Evolution for Dynamic Vision Transformer

Keeps all tokens but updates informative tokens on a slow (full) path and placeholder tokens on a fast (cheap) path.

AAAI 2022

2022 · pruning
[EViT] Not All Patches Are What You Need: Expediting ViTs via Token Reorganizations

Keeps the top-k tokens ranked by [CLS] attention and fuses the rest into a single token.

ICLR 2022

2022 · pruning
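
The keep-and-fuse step summarized for EViT can be sketched as follows. This is an illustration under assumptions, not the authors' code: `evit_reorganize` is a hypothetical name, `cls_attn[i]` is assumed to be the attention from [CLS] to patch token i, and the fused token is taken as an attention-weighted average of the discarded tokens.

```python
def evit_reorganize(tokens, cls_attn, k):
    """Keep the k tokens with highest [CLS] attention; fuse the rest into one token."""
    order = sorted(range(len(tokens)), key=lambda i: cls_attn[i], reverse=True)
    keep_idx = sorted(order[:k])  # restore original spatial order
    rest_idx = order[k:]
    kept = [list(tokens[i]) for i in keep_idx]
    if not rest_idx:
        return kept  # nothing to fuse
    weights = [cls_attn[i] for i in rest_idx]
    total = sum(weights)
    if total == 0:
        # Fall back to a plain mean when all discarded tokens have zero attention.
        weights = [1.0] * len(rest_idx)
        total = float(len(rest_idx))
    dim = len(tokens[0])
    fused = [
        sum(w * tokens[i][d] for w, i in zip(weights, rest_idx)) / total
        for d in range(dim)
    ]
    return kept + [fused]


# Toy example (assumed values): 4 patch tokens of dimension 2, keep the top 2.
tokens = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0], [4.0, 0.0]]
cls_attn = [0.1, 0.4, 0.3, 0.2]
out = evit_reorganize(tokens, cls_attn, k=2)
```

The output has k + 1 tokens: the attentive ones unchanged, plus one summary token that retains some information from the discarded patches.
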
[TokenLearner] What Can 8 Learned Tokens Do for Images and Videos?

Learns a handful of adaptive tokens instead of a dense uniform grid.

NeurIPS 2021

2021 · pooling
[DynamicViT] Efficient Vision Transformers with Dynamic Token Sparsification

Dynamically drops redundant tokens per input to speed up ViTs.

NeurIPS 2021

2021 · pruning