Discipline direction

Current Location: Home > Discipline Construction > Discipline direction > Content

Computer Vision

Time:2022-12-19    Click:
  • Responsible Person:Hongbin Zha
  • Member:Chao Xu, Jufu Feng, Xianghua Ying, Gang Zeng, Yuru Pei, Jinshi Cui
负责人 Hongbin Zha 成员 Chao Xu, Jufu Feng, Xianghua Ying, Gang Zeng, Yuru Pei, Jinshi Cui


Visual perception is an important sensory channel for humans to understand the structure and changes in the external environment. As one of the core areas of artificial intelligence, computer vision aims to utilize principles from cognitive science and computer technology to replicate human visual capabilities. The goal is to enable intelligent machines to autonomously understand and interact efficiently with the environment.

The main research topics in computer vision include image recognition and detection, semantic segmentation of images and videos, biometric recognition, 3D scene reconstruction, dynamic and active vision, and human behavior analysis and intention understanding. The primary objective is to equip intelligent perception systems with reliable environmental analysis and understanding capabilities, enabling them to adapt to complex real-world scenarios.

Our research team has undertaken dozens of national, provincial, and industry-funded projects, including the National Natural Science Foundation, the 863 Program, the 973 Program, the Ministry of Education's Innovation Team, and the National Key Research and Development Program. Our research outcomes have been published in prestigious conferences and journals in the field of computer vision and machine learning, such as TPAMI, IJCV, TVCG, PR, CVPR, ICCV, ECCV, NeurIPS, ICML, AAAI, and ICRA. We have received awards and recognition for our research achievements, including the National Technology Invention Second Prize, the Ministry of Education Technology Invention First Prize, the Ministry of Public Security Scientific and Technological Progress Second Prize, the Guangzhou Scientific and Technological Progress Third Prize, and the Chinese Academy of Sciences Scientific and Technological Progress Second Prize. Our applications have been implemented in areas such as criminal investigation, healthcare, and cultural heritage preservation. The Digital Longmen Grottoes project has been featured in mainstream media including CCTV. Our wavelet-based image compression system has been applied in high-speed imaging systems in aerospace and digital movie playback systems in cinemas. Our achievements in image understanding, such as multi-view and multi-label learning, have been recognized as key results of the National 973 Program. Furthermore, our team members have received honors and awards at various academic conferences both domestically and internationally, including the Best Paper/Poster Awards at Euromed 2010, ACPR 2017, CCCV 2017, ICPR 2018, and FG 2020, as well as securing the first place in the V-SLAM competition at ISMAR 2019.



Close

Address: No. 5, Yiheyuan Road, Haidian District, Beijing Feedback: its@pku.edu.cn

Copyright © All Rights Reserved.