Publications

For the full publication list, refer to Google Scholar or ORCID.

Selected Conference Papers

[*: As Corresponding Author, †: My Mentored Student]

M. Chen†, J. Chen, Z. Fan, Y. Lee, Z. Dang†, L. Wang, Y. Cui, L.-P. Chau, Y. Wang*, “HVG-3D: Bridging Real and Simulation Domains for 3D-Conditional Hand-Object Interaction Video Synthesis,” In CVPR, 2026.
L. Yao†, Y. Chen, Y. Su†, Y. Wang*, M. Liu, L.-P. Chau, “HAMMER: Harnessing MLLMs via Cross-Modal Integration for Intention-Driven 3D Affordance Grounding,” In CVPR, 2026.
T. Liu†, Y. Lu, L. Zhang, C. Cai, J. Gao, Y. Wang, K.-H. Yap, L.-P. Chau, “Accelerating Diffusion-based Video Editing via Heterogeneous Caching: Beyond Full Computing at Sampled Denoising Timestep,” In CVPR, 2026.
Y. Su†, Y. Wang*, L. Yao†, C. Cui, L.-P. Chau, “Interaction-aware Representation Modeling With Co-Occurrence Consistency for Egocentric Hand-Object Parsing,” In ICLR, 2026.
J. Lian†, J. Pan†, L. Wang, Y. Wang*, S. Mei, and L.-P. Chau, “Semantic Representation Attack against Aligned Large Language Models,” In NeurIPS, 2025.
L. Yao†, Y. Wang*, Y. Zhang†, M. Liu, and L.-P. Chau, “GaussianCross: Cross-modal Self-supervised 3D Representation Learning via Gaussian Splatting,” In ACM Multimedia, 2025.
C. Cai†, T. Liu†, J. Gao†, W. Liu†, K. Wu, R. Wang, Y. Wang*, and S. C. Liew, “From Semantics, Scene to Instance-awareness: Distilling Foundation Model for Open-vocabulary Situation Recognition,” In ACM Multimedia, 2025.
T. Liu†, K. Wu, C. Cai†, Y. Wang, K.-H. Yap*, and L.-P. Chau, “Towards Blind Bitstream-corrupted Video Recovery via a Visual Foundation Model-driven Framework,” In ACM Multimedia, 2025.
Y. Su†, Y. Wang*, Q. Hu†, C. Yang, L.-P. Chau*, “ANNEXE: Unified Analyzing, Answering, and Pixel Grounding for Egocentric Interaction,” In CVPR, 2025.
J. Chen†, H. Xu, Y. Wang*, and L.-P. Chau*, “OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework,” In ICLR, 2025.
W. Liu†, Y. Wang*, K.-H. Yap*, and L.-P. Chau, “Bitstream-Corrupted JPEG Images are Restorable: Two-stage Compensation and Alignment Framework for Image Restoration,” In CVPR, pp. 9979-9988, 2023.
T. Liu†, K. Wu, Y. Wang*, W. Liu, K.-H. Yap*, and L.-P. Chau, “Bitstream-Corrupted Video Recovery: A Novel Benchmark Dataset and Method,” In NeurIPS, vol. 36, 2023.

Selected Journal Papers

J. Lian†, J. Pan†, L. Wang, Y. Wang*, X. Wang, Y. Lu, S. Mei, L.-P. Chau, “Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models,” Nature Communications, 2026.
H. Zhu†, Y. Zhang†, L. Yao†, L.-P. Chau, and Y. Wang*, “MASS: Mesh-inellipse Aligned Deformable Surfel Splatting for Hand Reconstruction and Rendering from Egocentric Monocular Video,” In CVM, 2026. Recommended to IEEE Transactions on Visualization and Computer Graphics, 2026. (Top 1.4%, Best Paper Candidate)
Y. Su†, Y. Wang, and L.-P. Chau, “CaRe-Ego: Contact-aware Relationship Modeling for Egocentric Interactive Hand-object Segmentation,” Expert Systems with Applications, vol. 296, pp. 129148, Jul. 2025.
Y. Zhang†, Y. Wang, Y. Cui, and L.-P. Chau, “3DGeoDet: General-purpose Geometry-aware Image-based 3D Object Detection,” IEEE Transactions on Multimedia, vol. 27, pp. 6235-6247, Jun. 2025.
S. Meng†, Y. Wang, H. Xu, L.-P. Chau, “Contrastive learning-based place descriptor representation for cross-modality place recognition,” Information Fusion, vol. 124, pp. 103351, Jun. 2025.
J. Gao†, Y. Wang, K.-H. Yap, K. Garg, B. S. Han, “Occlutrack: Rethinking awareness of occlusion for enhancing multiple pedestrian tracking,” IEEE Transactions on Intelligent Transportation Systems, vol. 26, no. 7, pp. 9852-9866, May 2025.
L. Yao†, Y. Wang, M. Liu, and L.-P. Chau, “SGIFormer: Semantic-guided and geometric-enhanced interleaving transformer for 3D instance segmentation,” IEEE Transactions on Circuits and Systems for Video Technology, Nov. 2024.
L. Tang†, Y. Wang*, and L.-P. Chau, “Weakly-supervised part-attention and mentored networks for vehicle re-identification,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 32, no. 12, pp. 8887-8898, 2022.
Y. Zhou†, Y. Wang, and L.-P. Chau*, “Moving towards centers: Re-ranking with attention and memory for re-identification,” IEEE Transactions on Multimedia, 2022.
Y. Wang, J. Hou, X. Hou, and L.-P. Chau*, “A self-training approach for point-supervised object detection and counting in crowds,” IEEE Transactions on Image Processing, vol. 30, pp. 2876-2887, 2021.
Y. Wang, Z.-P. Bian, Y. Zhou, and L.-P. Chau*, “Rethinking and designing a high-performing automatic license plate recognition approach,” IEEE Transactions on Intelligent Transportation Systems, vol. 23, no. 7, pp. 8868-8880, 2021.
Y. Wang, Z.-P. Bian, J. Hou, and L.-P. Chau*, “Convolutional neural networks with dynamic regularization,” IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 5, pp. 2299-2304, 2021.
H. Zhuang, Y. Wang, Q. Liu, and Z. Lin*, “Fully decoupled neural network learning using delayed gradients,” IEEE Transactions on Neural Networks and Learning Systems, vol. 33, no. 10, pp. 6013-6020, 2021.
Y. Wang, H. Liu, and L.-P. Chau*, “Single underwater image restoration using adaptive attenuation-curve prior,” IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 65, no. 3, pp. 992-1002, 2018.

PhD Thesis

Y. Wang, “Dense prediction and deep learning in complex visual scenes,” PhD thesis, Nanyang Technological University, 2021.


Dr. WANG Yi
