Schedule
Welcome
Invited Speaker: Zhuang Liu
Oral Session 1
How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions
Manuel Brack, Sudeep Katakol, Felix Friedrich, Patrick Schramowski, Hareesh Ravi, Kristian Kersting, Ajinkya Kale
Generating Fine Details of Entity Interactions
Xinyi Gu, Jiayuan Mao
Role Bias in Text-to-Image Diffusion Models: Diagnosing and Mitigating Compositional Failures through Intermediate Decomposition
Sina Malakouti, Adriana Kovashka
Invited Speaker: Sara Beery
Poster Session
Exhibit Hall II, Boards 231-264
Lunch Break
Oral Session 2
Synthetic Captions for Open-Vocabulary Zero-Shot Segmentation
Tim Lebailly, Vijay Veerabadran, Satwik Kottur, Karl Ridgeway, Michael Louis Iuzzolino
A CLIP-Powered Framework for Robust and Generalizable Data Selection
Suorong Yang, Peng Ye, Wanli Ouyang, Dongzhan Zhou, Furao Shen
VILA²: Towards VLM Augmentation via Self-Improvement
Yunhao Fang, Ligeng Zhu, Yao Lu, Yan Wang, Pavlo Molchanov, Jan Kautz, Jang Hyun Cho, Marco Pavone, Song Han, Hongxu Yin
Invited Speaker: Andrew Owens
Oral Session 3
SYM3D: Canonicalizing Triplanes via Symmetry for Single-View 3D Learning
Jing Yang, Kyle Fogarty, Fangcheng Zhong, Cengiz Oztireli
Objaverse++: Curated 3D Object Dataset with Quality Annotations
Chendi Lin, Heshan Liu, Qunshu Lin, Zachary Bright, Shitao Tang, Yihui He, Minghao Liu, Ling Zhu, Cindy Le
ControlTac: Force- and Position-Controlled Tactile Data Augmentation with a Single Reference Image
Dongyu Luo, Kelin Yu, Amir-Hossein Shahidzadeh, Cornelia Fermüller, Yiannis Aloimonos, Ruohan Gao