Wenfang Sun

Updates

News

Jul 2026: Joined ICVSS 2026 in Sicily, Italy 🇮🇹.
Apr 2026: Selected as a finalist for the 2026 Qualcomm Innovation Fellowship for Europe.
Jan 2026: One paper was accepted by ICLR 2026.
Nov 2025: One paper was accepted by WACV 2026.
Sep 2025: One paper was accepted by NeurIPS 2025.
Sep 2024: One paper was accepted by NeurIPS 2024.
Apr 2024: One paper was accepted by CVPR 2024 Workshop.
Apr 2023: One paper was accepted by ICML 2023.

Selected Publications

Research

ICLR 2026

RegionReasoner: Region-Grounded Multi-Round Visual Reasoning

Wenfang Sun*, Hao Chen*, Yingjun Du, Yefeng Zheng, Cees G. M. Snoek

RegionReasoner improves multi-round visual reasoning by grounding each reasoning trace in explicit regions and aligning local visual details with global scene semantics.

Paper Code

WACV 2026

QUOTA: Quantifying Objects with Text-to-Image Models for Any Domain

Wenfang Sun, Yingjun Du, Gaowen Liu, Cees G. M. Snoek

QUOTA uses a domain-agnostic optimization framework to improve object-count control for text-to-image models across unseen domains without retraining.

Paper Code

NeurIPS 2025

The Curse of Depth in Large Language Models

Wenfang Sun*, Xinyuan Song*, Pengxiang Li*, Lu Yin, Yefeng Zheng, Shiwei Liu

LayerNorm Scaling mitigates variance explosion in deep Transformer layers, helping large language models benefit more consistently from depth.

Paper Code

NeurIPS 2024

IPO: Interpretable Prompt Optimization for Vision-Language Models

Yingjun Du*, Wenfang Sun*, Cees G. M. Snoek

IPO uses language and multimodal models to generate dataset-specific, human-readable prompts that improve generalization for vision-language models.

Paper Code

CVPR Workshop 2024

Training-Free Semantic Segmentation via LLM-Supervision

Wenfang Sun*, Yingjun Du*, Gaowen Liu, Ramana Rao Kompella, Cees G. M. Snoek

A training-free semantic segmentation framework that uses large language models to build richer class descriptors and ensemble subclass-level predictions.

Paper Code

ICML 2023

MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks

Wenfang Sun*, Yingjun Du*, Xiantong Zhen, Fan Wang, Ling Wang, Cees G. M. Snoek

MetaModulation increases task diversity in few-shot learning by modulating feature hierarchies, including variational variants for task uncertainty.

Paper Code