Talk by Liang Zheng »The many meanings with image pairs »

Date : 27/10/2024

Lieu : CentraleSupélec Bât. Eiffel

The seminar will take place at CentraleSupelec on Friday, September 27, at 10:00 AM in Amphi VI, located in the Eiffel building of CentraleSupelec.

Title: The many meanings with image pairs

Abstract: Training AI models with image pairs has been studied for a long time and proven very useful. In this talk, I will first revisit popular practices of using data pairs in various computer vision tasks: from face recognition, person re-identification, to contrastive learning in foundation models. I will then discuss human preference data: between a pair of images, people may generally prefer one over the other. This type of data pair can be used to align diffusion models with human preference, so that diffusion models are more likely to generate images that people like. I will describe how we address this problem by aligning human preference at different denoising steps. This method effectively improves stable diffusion (SD) and SDXL models while accelerating the fine-tuning process by 10 times compared with existing methods.

Bio: Dr Liang Zheng is an Associate Professor (tenured) and ARC Future Fellow at the Australian National University. He obtained his Bachelor (2010) and PhD (2015) degrees from Tsinghua University. He is best known for his contributions to the field of object re-identification through useful datasets and algorithms, including Market-1501 (ICCV 2015) and part-based convolutional baseline (ECCV 2018). He also developed a few widely used methods in image classification and multi-object tracking such as random erasing (AAAI 2020) and joint detection and embedding (ECCV 2020). He regularly serves as an area chair for conferences like CVPR and NeurIPS and co-organises the AI CITY Challenges and Vision Datasets Understanding workshops. He is a program chair for ACM Multimedia 2024.