16:00-16:10
Workshop Opening
Keynote & Invited Talks
16:10-16:50
Bridging Isolated Islands in Human Activity Understanding
Cewu Lu, Shanghai Jiao Tong University, China
16:50-17:30
Learning to Anticipate Human Actions from Videos
Basura Fernando, Institute of High Performance Computing (IHPC) and Centre for Frontier AI Research (CFAR) of Agency for Science, Technology and Research (A*STAR), Singapore
17:30-18:10
Neural Digital Human Avatar: Motion Capture, Reconstruction and Rendering
Yebin Liu, Tsinghua University, China
18:10-18:20
Tea Break
Session 1: Oral Presentation
18:20-18:40
Anonym-Recognizer: Relationship-preserving Face Anonymization and Recognition
Chunlei Peng, Shuang Wan, Zimin Miao, Decheng Liu, Yu Zheng, Nannan Wang
18:40-19:00
Cycle-Consistent Learning for Weakly Supervised Semantic Segmentation
Bin Wang, Yu Qiao, Dahua Lin, Stephen D.H. Yang, Weijia Li
19:00-19:20
Dual Domain-Adversarial Learning for Audio-Visual Saliency Prediction
Yingzi Fan, Longfei Han, Yue Zhang, Lechao Cheng, Chen Xia, Di Hu
19:20-19:40
Multi-level Multi-modal Feature Fusion for Action Recognition in Videos
Xinghang Hu, Yanli Ji, Gedamu Alemu Kumie
19:40-20:00
Two-branch Objectness-centric Open World Detection
Yan Wu, Xiaowei Zhao, Yuqing Ma, Duorui Wang, Xianglong Liu
18:20-20:00
Session 2: Poster
Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation
Shuning Chang, Pichao Wang, Fan Wang, Hao Li, Mike Zheng Shou
Domain Camera Adaptation and Collaborative Multiple Feature Clustering for Unsupervised Person Re-ID
Yuanpeng Tu
PSINet: Progressive Saliency Iteration Network for RGB-D Salient Object Detection
Songsong Duan, Chengxing Xia, Xianjin Fang, Bin Ge, Xiuju Gao, Kuan-Ching Li
Multimodal Network with Cross-Modal Attention for Audio-Visual Event Localization
Qianchao Tan
Real-time Embedded Demo System for Fall Detection under 15W Power
Junyi Lu, Wenxiang Jiang, Yang Xiao, Tingbing Yan, Zhiguo Cao, Zhiwen Fang, Joey Tianyi Zhou
Face Clustering via Adaptive Aggregation of Clean Neighbors
Shiyong Hong, Yaobin Zhang, Xu Ling, Weihong Deng, Yin Yunfeng, Zhang Yingjie, Hongzhi Shi, Dongchao Wen
Cross-modal Token Selection for Video Understanding
Liyong Pan, Zechao Li, Rui Yan, Henghao Zhao