Archival Track
VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks
Authors: Juhwan Choi (AITRICS); Junehyoung Kwon (Chung-Ang University); JungMin Yun (Chung-Ang University); Seunguk Yu (Chung-Ang University); YoungBin Kim (Chung-Ang University)
arXiv
Objaverse++: Curated 3D Object Dataset with Quality Annotations
Authors: Chendi Lin (CMU); Heshan Liu (CMU); Qunshu Lin (Zhejiang University); Zachary Bright (Exascale Labs); Shitao Tang (Simon Fraser University); Yihui He (CMU); Minghao Liu (2077AI); Ling Zhu (Exascale Labs); Cindy Le (Google)
arXiv | Project Page
Few-Shot Vision-Language Reasoning for Satellite Imagery via Verifiable Rewards
Authors: Aybora Köksal (METU); A. Aydın Alatan (METU)
arXiv
LG-Traj: LLM Guided Pedestrian Trajectory Prediction
Authors: Pranav singh chib (IIT Roorkee); Pravendra Singh (IIT Roorkee)
PDF
Federated Active Learning for Target Domain Generalisation
Authors: Razvan Caramalau (UCL); Binod Bhattarai (Aberdeen); Danail Stoyanov (UCL)
arXiv
DISC-GAN: Disentangling Style and Content for Cluster-Specific Synthetic Underwater Image Generation
Authors: Sneha Varur (KLE Technological University); Anirudh R Hanchinamani (KLE Technological University); Tarun S Bagewadi (KLE Technological University); Uma Mudenagudi (KLE Technological University), Chaitra D Desai (KLE Technological University), Sujata C (KLE Technological University); Padmashree Desai (KLE Technological University); Sumit Meharwade (KLE Technological University);
PDF
OmViD: Omni-supervised active learning for video action detection
Authors: Aayush Rana (Qualcomm), Akash Kumar (UCF), Vibhav Vineet (Microsoft), Yogesh S Rawat (UCF)
arXiv | Project Page
Class-Proportional Coreset Selection for Difficulty-Separable Data
Authors: Elisa Tsai (Michigan), Haizhong Zheng (CMU), Atul Prakash (Michigan)
arXiv
Task-Specific Generative Dataset Distillation with Difficulty-Guided Sampling
Authors: Mingzhuo Li (Hokkaido University), Guang Li(Hokkaido University), Jiafeng Mao (The University of Tokyo), Linfeng Ye (University of Toronto), Takahiro Ogawa (Hokkaido University), Miki Haseyama (Hokkaido University)
arXiv | Project Page
Synthetic Captions for Open-Vocabulary Zero-Shot Segmentation
Authors: Tim Lebailly (Meta, KU Leuven); Vijay Veerabadran (Meta); Satwik Kottur (Meta); Karl Ridgeway (Meta); Michael Louis Iuzzolino (Meta)
arXiv
Efficient Learning for Product Attributes with Compact Multimodal Models
Authors: Mandar Kulkarni (Flipkart)
arXiv
SYM3D: Canonicalizing Triplanes via Symmetry for Single-View 3D Learning
Authors: Jing Yang, Kyle Fogarty, Fangcheng Zhong, Cengiz Oztireli (Cambridge)
PDF
How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions
Authors: Manuel Brack, Sudeep Katakol, Felix Friedrich, Patrick Schramowski, Hareesh Ravi, Kristian Kersting, Ajinkya Kale
arXiv
Non-Archival Track
Data-Efficient Learning with Sparse Adversarial Coresets
Authors: Manasa Madabhushi; Tushar Shinde
Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision
Authors: Xiao Fang (CMU); Minhyek Jeon (CMU); Zheyang Qin (CMU); Stanislav Panev (CMU); Celso de Melo (DEVCOM); Shuowen Hu (DEVCOM); Shayok Chakraborty (CMU, FSU); Fernando De la Torre (CMU)
arXiv | Project Page
A CLIP-Powered Framework for Robust and Generalizable Data Selection;Dynamic Soft Data Pruning: Adaptive Selection and Scheduling for Data-Efficient Learning
Authors: Suorong Yang (NJU); Junda Yu (FDU); Jucheng Hu (UCL);
arXiv
Making Something from (Almost) Nothing: Extreme Low-Resource Visual Learning with Diffusion Synthesis and Self-Supervised Distillation
Authors: Xuying Li (Hydrox AI)
PDF
ControlTac: Force- and Position-Controlled Tactile Data Augmentation with a Single Reference Image
Authors: Dongyu Luo(UMD, HKU), Kelin Yu(UMD), Amir Hossein Shahidzadeh(UMD), Cornelia Fermuller(UMD), Yiannis Aloimonos(UMD), Ruohan Gao(UMD)
PDF | Project Page
RF PRIOR: Preserving Global-Context Priors for Efficient Instance Segmentation Transfer
Authors: Jason K. Nam (Hongik University, ETRI)
VILA$^2$: Towards VLM Augmentation via Self-Improvement
Authors: Yunhao Fang (Nvidia), Ligeng Zhu (Nvidia), Yao Lu (Nvidia), Yan Wang (Nvidia), Pavlo Molchanov (Nvidia), Jan Kautz (Nvidia), Jang Hyun Cho (UT Austin), Marco Pavone (Nvidia), Song Han (Nvidia, MIT), Hongxu Yin (Nvidia)
arXiv
SIDA: Synthetic Image Driven Zero-shot Domain Adaptation
Authors: Ye-Chan Kim (Hanyang University); SeungJu Cha (Hanyang University); Si-Woo Kim (Hanyang University); Taewhan Kim (Hanyang University); Dong-Jin Kim (Hanyang University)
arXiv
arXivBench: A Curated Benchmark for Evaluating LLM Accuracy in Assisting Academic Writing
Authors: Ning Li (UCLA); Jingran Zhang (UCLA), Justin Cui (UCLA)
arXiv | Project Page
Data-Efficient Ensemble Weather Forecasting with Diffusion Models
Authors: Kevin Valencia (UCLA), Ziyang Liu (UCLA), Justin Cui (UCLA)
arXiv | PDF
Generating Fine Details of Entity Interactions
Authors: Xinyi Gu (MIT); Jiayuan Mao (MIT)
arXiv | Project Page
Dynamic Soft Data Pruning: Adaptive Selection and Scheduling for Data-Efficient Learning
Authors: Junda Yu (FDU); Suorong Yang (NJU); Jucheng Hu (UCL); Dongzhan Zhou (SHLab); Peng Ye (FDU); Tao Chen (FDU)
SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image Captioning
Authors: Si-Woo Kim (Hanyang University); MinJu Jeon (Hanyang University); Ye-Chan Kim (Hanyang University); Soeun Lee(AI R&D Division, CJ Group); Taewhan Kim (Hanyang University); Dong-Jin Kim (Hanyang University)
arXiv
Role Bias in Text-to-Image Diffusion Models: Diagnosing and Mitigating Compositional Failures through Intermediate Decomposition
Authors: Sina Malakouti (University of Pittsburgh); Adriana Kovashka (University of Pittsburgh)
arXiv
AnyBald: Toward Realistic Diffusion-Based Hair Removal In-The-Wild
Authors: Yongjun Choi(UNIST), Seungoh Han(UNIST), Soomin Kim(Ewha Woman’s University), Sumin Son(Ewha Woman’s University), Mohsen Rohani(L’Oréal), Edgar Maucourant(L’Oréal), Dongbo Min(Ewha Woman’s University), Kyungdon Joo(UNIST)
PDF
Efficient Long-Tail Learning in Latent Space by sampling Synthetic Data
Authors: Nakul Sharma (Independent Researcher)
PDF
Autoguided Online Data Curation for Diffusion Model Training
Authors: Valeria Pais (University of Glasgow); Luis Oala (Dotphoton); Daniele Faccio (University of Glasgow); Marco Aversa (Dotphoton)
PDF
Learning Hyperspectral Images with Curated Text Prompts for Efficient Multimodal Alignment
Authors: Abhiroop Chatterjee (Jadavpur University); Susmita Ghosh (Jadavpur University)
PDF
ChartGen: Scaling Chart Understanding Via Code-Guided Synthetic Chart Generation
Authors: Jovana Kondic (MIT), Pengyuan Li (IBM Research), Dhiraj Joshi (IBM Research), Zexue He (MIT-IBM Watson AI Labs), Shafiq Abedin (IBM Research), Jennifer Sun (MIT), Ben Wiesel (IBM Research), Eli Schwartz (IBM Research), Ahmed Nassar (IBM Research), Bo Wu (MIT-IBM Watson AI Labs, IBM Research), Assaf Arbelle (IBM Research), Aude Oliva (MIT, MIT-IBM Watson AI Labs), Dan Gutfreund (MIT-IBM Watson AI Labs, IBM Research), Leonid Karlinsky (MIT-IBM Watson AI Labs, IBM Research), Rogerio Feris (MIT-IBM Watson AI Labs, IBM Research)
arXiv
VILA^2: Towards VLM Augmentation via Self-Improvement
Authors: Yunhao Fang (NVIDIA), Ligeng Zhu (NVIDIA), Yao Lu (NVIDIA), Yan Wang (NVIDIA), Pavlo Molchanov (NVIDIA), Jang Hyun Cho (NVIDIA), Marco Pavone (NVIDIA), Song Han (NVIDIA), Hongxu Yin (NVIDIA)
arXiv
Revisiting Semi-Supervised Learning in the Era of Foundation Models
Authors: Ping Zhang (OSU), Zheda Mai(OSU), Quang-Huy Nguyen(OSU), Wei-Lun Chao(OSU)
arXiv