About me

Welcome! I am currently a Research Assistant Professor in the Department of Electrical and Electronic Engineering (EEE), The Hong Kong Polytechnic University (香港理工大学, PolyU). I am also working in the JC STEM Lab of Machine Learning and Computer Vision with Prof. CHAU Lap Pui. My research interests include Visual Information Processing, Egocentric Vision, Embodied Cognition, and Multimedia Forensics.

I received a BEng in Electronic and Information Engineering and an MEng in Signal and Information Processing from Northwestern Polytechnical University (西北工业大学, NPU), China, and earned a PhD from the School of Electrical and Electronic Engineering (EEE) at Nanyang Technological University (南洋理工大学, NTU), Singapore, in 2021. My PhD supervisor was Dr. CHAU Lap Pui, who was an Associate Professor (A/P) at NTU until 2022 and is currently a Professor at PolyU.

Prior to joining PolyU, I worked as a Research Fellow in EEE, NTU, until March 2023 with PI A/P YAP Kim Hui. Prior to going to Singapore, I was involved in national projects and a robotics center at NPU, China, supervised by Prof. WAN Shuai and Prof. MEI Shaohui.

News

  • [2025.03] [CVPR 2025] Egocentric Vision: ANNEXE: Unified Analyzing, Answering, and Pixel Grounding for Egocentric Interaction, with code to appear.
  • [2025.02] [ICLR 2025] Autonomous Driving Perception: OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework, with code.
  • [2025.01] [IEEE JSTARS] (JCR Q1) We organized a special issue, Scene Analysis and Understanding in the Intelligent Transportation for Urban Area. Deadline: Jul 31, 2025.
  • [2024.12] [IEEE TMM] (IF: 8.4) Two papers were accepted: WHANet: Wavelet-based Hybrid Asymmetric Network for Spectral Super-Resolution from RGB Inputs.
  • [2024.12] [IEEE TMM] ByteNet: Rethinking multimedia file fragment classification through visual perspectives.
  • [2024.11] [Remote Sensing] (JCR Q1) We organized a special issue, Recent Advances in Multimodal Hyperspectral Remote Sensing. Deadline: May 28, 2025.
  • [2024.11] [IEEE TCSVT] (IF: 8.3) 3D Point Cloud Segmentation: SGIFormer: Semantic-guided and Geometric-enhanced Interleaving Transformer for 3D Instance Segmentation, with code.
  • [2024.10] [IEEE TCSVT] (IF: 8.3) OWOD Survey: Open World Object Detection: A Survey, with code.
  • [2024.10] Outstanding Reviewer in ACM Multimedia 2024.
  • [2024.09] [Information Fusion] (IF: 14.7) Autonomous Driving Survey: A survey on occupancy perception for autonomous driving: The information fusion perspective, with GitHub link.
  • [2024.06] We won first place in the Multi-Instance Retrieval Track of the EPIC-Kitchens Challenges at CVPR 2024. Champion solution and paper link.
  • [2024.05] We chaired a special session at ISCAS 2024.
  • [2024.04] We chaired a special session at ICASSP 2024.
  • [2024.02] [Knowledge-Based Systems] (IF: 7.2) Intra- and Inter-sector Contextual Information Fusion with Joint Self-Attention for File Fragment Classification, with code.
  • [2024.01] [Knowledge-Based Systems] (IF: 7.2) Weakly-Supervised Grounded Image Captioning.
  • [2024.01] [Remote Sensing] Remote Sensing Image Change Captioning.
  • [2023.10] I am serving as a Special Session Chair and a Review Committee Member for ISCAS 2024.
  • [2023.09] We organized a special issue, AI-powered Multimedia Computing, in the journal Multimedia Tools and Applications, as Guest Editors.
  • [2023.09] [NeurIPS 2023] Bitstream-corrupted Video Recovery: A Novel Benchmark Dataset and Method, with dataset.
  • [2023.09] I have been teaching the subject EIE546 Video Technology at PolyU.
  • [2023.08] Our group has recruited 8 PhD students, 1 research fellow, and 1 MPhil student, whom I supervise or co-supervise. The lab's website will be set up.
  • [2023.06] I have been working with Prof. CHAU Lap Pui at PolyU to set up the JC STEM Lab of Machine Learning and Computer Vision.
  • [2023.05] I took on the role of Associate Editor of the journal The Visual Computer.
  • [2023.04] I joined The Hong Kong Polytechnic University (PolyU) as a Research Assistant Professor.
  • [2023.03] [CVPR 2023] Bitstream-corrupted image restoration, with code.

Recruitment

We are actively recruiting self-motivated PhD students (including self-financed students) interested in image/video processing, computer vision, and 3D vision to join our research group at The Hong Kong Polytechnic University. Strong candidates can contact me to be nominated for the PolyU Presidential PhD Fellowship, Joint PhD Supervision Leading to a PolyU degree, and Joint PhD Supervision with specific universities/institutes.