黄庆明-中国科学院大学-UCAS

基本信息

黄庆明讲席教授博导计算机科学与技术学院
国家杰出青年科学基金获得者，百千万人才工程国家级人选，IEEE Fellow

电子邮件： qmhuang@ucas.ac.cn
通信地址：海淀区石景山区玉泉路19号甲
邮政编码： 100190

研究领域

多媒体计算，图像处理，模式识别，机器学习，计算机视觉

教育背景

1988-09--1994-08 哈尔滨工业大学博士生
1984-09--1988-07 哈尔滨工业大学本科生

学历

1984-1988年哈尔滨工业大学获计算机系计算机软件专业本科

1988-1991年哈尔滨工业大学获计算机系计算机应用专业硕士（直接攻博）

1991-1994年哈尔滨工业大学获计算机系计算机应用专业博士

学位

1988年7月哈尔滨工业大学获计算机系计算机软件专业工学学士

1994年12月哈尔滨工业大学获计算机系计算机应用专业工学博士

出国学习工作

1995-1996年新加坡国立大学工学院博士后

工作经历

2003-05--今中国科学院大学讲席教授、博士生导师，国家杰出青年科学基金获得者，百千万人才工程国家级人选， IEEE Fellow

2002-06--2003-05 新加坡威实录科技公司首席工程师

2000-04--2002-05 新加坡资讯与通信研究院研究员

1996-11--2000-03 新加坡资讯与通信研究院副研究员

1995-05--1996-11 新加坡国立大学博士后

1994-08--1995-05 哈尔滨工业大学讲师

工作简历

2003-05~现在, 中国科学院大学, 讲席教授、博士生导师，国家杰出青年科学基金获得者，百千万人才工程国家级人选， IEEE Fellow
2002-06~2003-05,新加坡威实录科技公司, 首席工程师
2000-04~2002-05,新加坡资讯与通信研究院, 研究员
1996-11~2000-03,新加坡资讯与通信研究院, 副研究员
1995-05~1996-11,新加坡国立大学, 博士后
1994-08~1995-05,哈尔滨工业大学, 讲师

社会兼职

2020-01-11-今,中国计算机学会, 理事
2018-01-01-今,IEEE, 会士（Fellow）
2018-01-01-今,中国计算机学会, 会士
2017-01-01-今,IEEE CASS Beijing Chapter, Chair
2016-09-15-今,北京市图像图形学会, 副理事长
2016-01-31-今,中国计算机学会多媒体技术专业委员会, 副主任

教授课程

视觉信息学习与分析
模式识别与机器学习
模式识别在图像与视频分析中的应用
模式识别
多媒体数字视频压缩原理技术与标准

专利与奖励

奖励信息

（1）视觉媒体的局部关联分析与表示, 二等奖, 其他, 2020
（2）图像视频的多尺度表征与语义映射, 一等奖, 其他, 2020
（3）视觉底层特性与高层语义对应性研究, 二等奖, 省级, 2015
（4）大规模网络视频内容分析关键技术及应用, 一等奖, 其他, 2012
（5）大规模数字图书与视频资源库建设, 一等奖, 部委级, 2010

专利成果

[1] 苏荔, 崔哲, 黄庆明, 李国荣, 李亮. 一种注视点指导的显著目标检测方法. CN: CN112699878A, 2021-04-23.
[2] 李国荣, 杨一帆, 黄庆明, 苏荔. 一种基于排序网络的弱监督物体数目估计方法. CN: CN112101122A, 2020-12-18.
[3] 黄庆明, 滕尚志, 张史梁. 一种面向航拍影像的无监督车辆重识别方法. CN: CN111950367A, 2020-11-17.
[4] 张晨, 李国荣, 苏荔, 黄庆明. 一种基于开放数据过滤和域适应的视频异常检测方法. CN: CN111950363A, 2020-11-17.
[5] 黄庆明, 独大为, 齐洪钢. 基于受限结构图搜索的目标跟踪方法. CN: CN107194951B, 2020-08-21.
[6] 黄庆明, 苏荔, 周志达, 杨士杰, 吴益灵. 一种基于多模态融合的网络直播内容分析方法. CN: CN111031330A, 2020-04-17.
[7] 李国荣, 于洪洋, 黄庆明, 苏荔. 一种基于运动和表观适应融合的无人机多目标跟踪方法. CN: CN110675430A, 2020-01-10.
[8] 张朋, 苏荔, 黄庆明, 李国荣, 李亮. 一种有效的显著性预测模型方法. CN: CN110443784A, 2019-11-12.
[9] 李国荣, 徐凯, 黄庆明. 一种基于时空卷积神经网络的视频目标分割方法. CN: CN110222595A, 2019-09-10.
[10] 廖昌粟, 苏荔, 黄庆明. 一种基于类别表示指导特征选择的零次学习分类算法. CN: CN110210616A, 2019-09-06.
[11] 卿来云, 苗军, 帅佳玫, 黄庆明. 一种图像显著区域检测方法. 中国: CN104463870A, 2015-03-25.
[12] 黄庆明, 胡方振, 苏荔, 齐洪钢. 视频编码方法. 中国: CN103096076A, 2013-05-08.
[13] 黄庆明, 朱琳, 苏荔, 齐洪钢. 视频质量获取方法. 中国: CN103067733A, 2013-04-24.

出版信息

发表论文

（1） Fine-Grained Feature Generation for Generalized Zero-Shot Video Classification, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 第 4 作者
（2） MaxMatch: Semi-Supervised Learning with Worst-Case Consistency, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 通讯作者
（3） Spatial-Temporal Graph Network for Video Crowd Counting, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 第 5 作者
（4） All in a Row: Compressed Convolution Networks for Graph, International Conference on Machine Learning, 2023, 第 5 作者
（5） Viewpoint-Adaptive Representation Disentanglement Network for Change Captioning, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 第 6 作者
（6） Entity-Enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 通讯作者
（7） Feature Directions Matters: Long-Tailed Learning via Rotated Balanced Representation, International Conference on Machine Learning, 2023, 通讯作者
（8） Rethinking Collaborative Metric Learning: Toward an Efficient Alternative Without Negative Sampling, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 通讯作者
（9） Optimizing Partial Area Under the Top-k Curve: Theory and Practice, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 通讯作者
（10） A Tale of HodgeRank and Spectral Method: Target Attack Against Rank Aggregation Is the Fixed Point of Adversarial Game, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 通讯作者
（11） The Minority Matters: A Diversity-Promoting Collaborative Metric Learning Algorithm, Annual Conference on Neural Information Processing Systems, 2022, 第 6 作者
（12） OTKGE: Multi-modal Knowledge Graph Embeddings via Optimal Transport, Annual Conference on Neural Information Processing Systems, 2022, 通讯作者
（13） Attribute Group Editing for Reliable Few-shot Image Generation, IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, 第 7 作者
（14） Inferential Visual Question Generation, ACM International Conference on Multimedia, 2022, 第 5 作者
（15） Not All Samples are Trustworthy: Towards Deep Robust SVP Prediction, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 通讯作者
（16） Long Short-Term Relation Transformer With Global Gating for Video Captioning, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 第 6 作者
（17） Fine-Grained Image Quality Assessment: A Revisit and Further Thinking, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 第 3 作者
（18） A Sparse-Motif Ensemble Graph Convolutional Network against Over-smoothing, International Joint Conference on Artificial Intelligence, 2022, 通讯作者
（19） Poisoning Attack Against Estimating From Pairwise Comparisons, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 通讯作者
（20） Concept Propagation via Attentional Knowledge Graph Reasoning for Video-Text Retrieval, ACM International Conference on Multimedia, 2022, 第 4 作者
（21） Span-based Audio-Visual Localization, ACM International Conference on Multimedia, 2022, 第 4 作者
（22） Pay Attention to Your Positive Pairs: Positive Pair Aware Contrastive Knowledge Distillation, ACM International Conference on Multimedia, 2022, 通讯作者
（23） Continuation Multiple Instance Learning for Weakly and Fully Supervised Object Detection, IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 第 4 作者
（24） A Unified Framework against Topology and Class Imbalance, ACM International Conference on Multimedia, 2022, 通讯作者
（25） AdAUC: End-to-end Adversarial AUC Optimization Against Long-tail Problems, International Conference on Machine Learning, 2022, 通讯作者
（26） Multi-Attention Network for Fast Compressed Referring Video Object Segmentation, ACM International Conference on Multimedia, 2022, 第 7 作者
（27） Geometry Interaction Knowledge Graph Embeddings, AAAI Conference on Artificial Intelligence, 2022, 通讯作者
（28） Dist-PU: Positive-Unlabeled Learning from a Label Distribution Perspective, IEEE / CVF Conference on Computer Vision and Pattern Recognition, 2022, 通讯作者
（29） Meta-Wrapper: Differentiable Wrapping Operator for User Interest Selection in CTR Prediction, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 通讯作者
（30） Hierarchical Modular Network for Video Captioning, IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, 第 5 作者
（31） Quaternion Ordinal Embedding, International Joint Conference on Artificial Intelligence, 2022, 通讯作者
（32） Syntax-Guided Hierarchical Attention Network for Video Captioning, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 第 6 作者
（33） Asymptotically Unbiased Instance-wise Regularized Partial AUC Optimization: Theory and Algorithm, Annual Conference on Neural Information Processing Systems, 2022, 通讯作者
（34） OpenAUC: Towards AUC-Oriented Open-Set Recognition, Annual Conference on Neural Information Processing Systems, 2022, 第 6 作者
（35） Few Shot Generative Model Adaption via Relaxed Spatial Structural Alignment, IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, 第 5 作者
（36） Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability, Annual Conference on Neural Information Processing Systems, 2022, 通讯作者
（37） Automatic Relation-aware Graph Network Proliferation, IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, 第 6 作者
（38） ER: Equivariance Regualizer for Knowledge Graph Completion, AAAI Conference on Artificial Intelligence, 2022, 通讯作者
（39） Learning With Multiclass AUC: Theory and Algorithms, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 通讯作者
（40） Weakly Supervised Anomaly Detection in Videos Considering the Openness of Events, IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 第 6 作者
（41） I2Transformer: Intra- and Inter-relation Embedding Transformer for TV Show Captioning, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 第 8 作者
（42） Recurrent Meta-Learning against Generalized Cold-start problem in CTR Prediction, ACM International Conference on Multimedia, 2022, 通讯作者
（43） Zero-shot Video Classification with Appropriate Web and Task Knowledge Transfer, ACM International Conference on Multimedia, 2022, 第 6 作者
（44） Confederated Learning: Going Beyond Centralization, ACM International Conference on Multimedia, 2022, 通讯作者
（45） Graph Regularized Encoder-Decoder Networks for Image Representation Learning, IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 第 5 作者
（46） Evaluating Visual Properties via Robust HodgeRank, INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 通讯作者
（47） Decomposition and Completion Network for Salient Object Detection, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 第 3 作者
（48） Embedding Perspective Analysis Into Multi-Column Convolutional Neural Network for Crowd Counting, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 第 4 作者
（49） Neural Collaborative Preference Learning With Pairwise Comparisons, IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 通讯作者
（50） Self-Supervised Deep TripleNet for Video Object Segmentation, IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 第 4 作者
（51） Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data, IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 第 3 作者
（52） Viewpoint and Scale Consistency Reinforcement for UAV Vehicle Re-Identification, INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 第 3 作者
（53） DPANet: Depth Potentiality-Aware Gated Attention Network for RGB-D Salient Object Detection, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 通讯作者
（54） Task-Feature Collaborative Learning with Application to Personalized Attribute Prediction, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 通讯作者
（55） Augmented Adversarial Training for Cross-Modal Retrieval, IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 第 4 作者
（56） Harmonized Multimodal Learning with Gaussian Process Latent Variable Models, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 第 3 作者
（57） Multi-View Spatial Attention Embedding for Vehicle Re-Identification, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 第 3 作者
（58） Spatial Pyramid-Enhanced NetVLAD With Weighted Triplet Loss for Place Recognition, IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 第 4 作者
（59） Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval, IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 第 3 作者
（60） Detecting Small Objects Using a Channel-Aware Deconvolutional Network, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 通讯作者
（61） The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline, INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 通讯作者
（62） Stereoscopic Image Stitching via Disparity-Constrained Warping and Blending, IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 第 4 作者
（63） Learning Coupled Convolutional Networks Fusion for Video Saliency Prediction, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 第 3 作者
（64） SkeletonNet: A Hybrid Network With a Skeleton-Embedding Process for Multi-View Image Representation Learning, IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 第 5 作者
（65） From Social to Individuals: A Parsimonious Path of Multi-Level Models for Crowdsourced Preference Aggregation., IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 通讯作者
（66） Multi-modal semantic autoencoder for cross-modal retrieval, NEUROCOMPUTING, 2019, 第 3 作者
（67） Online Asymmetric Metric Learning With Multi-Layer Similarity Aggregation for Cross-Modal Retrieval, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 第 4 作者
（68） Increasing Interpretation of Web Topic Detection via Prototype Learning From Sparse Poisson Deconvolution, IEEE TRANSACTIONS ON CYBERNETICS, 2019, 通讯作者
（69） Robust visual tracking via scale-and-state-awareness, NEUROCOMPUTING, 2019, 通讯作者
（70） Hedging Deep Features for Visual Tracking, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 通讯作者
（71） Two Birds With One Stone: A Coupled Poisson Deconvolution for Detecting and Describing Topics From Multimodal Web Data, IEEETRANSACTIONSONNEURALNETWORKSANDLEARNINGSYSTEMS, 2019, 通讯作者
（72） HSCS: Hierarchical Sparsity Based Co-saliency Detection for RGBD Images, IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 第 4 作者
（73） Image Class Prediction by Joint Object, Context, and Background Modeling, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 第 5 作者
（74） Iterative Graph Seeking for Object Tracking, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 通讯作者
（75） Structure-Aware Local Sparse Coding for Visual Tracking, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 通讯作者
（76） Bilevel Multiview Latent Space Learning, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 通讯作者
（77） A two-step approach to describing web topics via probable keywords and prototype images from background-removed similarities, NEUROCOMPUTING, 2018, 第 4 作者
（78） Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval, IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 第 4 作者
（79） Joint Feature Selection and Classification for Multilabel Learning, IEEE TRANSACTIONS ON CYBERNETICS, 2018, 通讯作者
（80） Multimodal Similarity Gaussian Process Latent Variable Model, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 第 3 作者
（81） Geometric Hypergraph Learning for Visual Tracking, IEEE TRANSACTIONS ON CYBERNETICS, 2017, 通讯作者
（82） A Bit-Plane Decomposition Matrix-Based VLSI Integer Transform Architecture for HEVC, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2017, 第 2 作者
（83） Image classification by search with explicitly and implicitly semantic representations, INFORMATION SCIENCES, 2017, 第 3 作者
（84） Fine-Grained Image Classification via Low-Rank Sparse Coding With General and Class-Specific Codebooks., IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 第 5 作者
（85） Contextual Exemplar Classifier-Based Image Representation for Classification, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 第 2 作者
（86） Hierarchical deep semantic representation for visual categorization, NEUROCOMPUTING, 2017, 通讯作者
（87） Bi-Level Multi-View Latent Space Learning, IEEETCSVTCCFB, 2017, 第 1 作者
（88） Contextual Exemplar Classifier Based Visual Representation for Visual Categorization, IEEE Transactions on Circuits and Systems for Video Technology, 2017, 第 1 作者
（89） Multi-label classification by exploiting local positive and negative pairwise label correlation, NEUROCOMPUTING, 2017, 通讯作者
（90） Cross-modal Retrieval using Multi-ordered Discriminative Structured Subspace Learning, IEEE Trans. on Multimedia, 2017, 第 1 作者
（91） Exploring Coherent Motion Patterns via Structured Trajectory Learning for Crowd Mood Modeling, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 第 5 作者
（92） Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation, IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 第 5 作者
（93） Online Deformable Object Tracking Based on Structure-Aware Hyper-Graph, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 第 5 作者
（94） Coupling Reranking and Structured Output SVM Co-Train for Multitarget Tracking, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 第 3 作者
（95） Hedged Deep Tracking, 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, 第 5 作者
（96） PL-ranking: A Novel Ranking Method for Cross-Modal Retrieval, MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, 第 4 作者
（97） Learning Label-Specific Features and Class-Dependent Labels for Multi-Label Classification, IEEETRANSACTIONSONKNOWLEDGEANDDATAENGINEERING, 2016, 第 3 作者
（98） Relay Backpropagation for Effective Learning of Deep Convolutional Neural Networks, COMPUTER VISION - ECCV 2016, PT VII, 2016, 第 3 作者
（99） Distributed image understanding with semantic dictionary and semantic expansion, NEUROCOMPUTING, 2016, 第 7 作者
（100） Robust latent poisson deconvolution from multiple features for web topic detection, IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 第 5 作者
（101） Boosted random contextual semantic space based representation for visual recognition, INFORMATION SCIENCES, 2016, 第 5 作者
（102） Effective Multimodality Fusion Framework for Cross-Media Topic Detection, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 第 6 作者
（103） Socio-mobile landmark recognition using local features with adaptive region selection, NEUROCOMPUTING, 2016, 第 6 作者
（104） Joint Multi-View Representation Learning and Image Tagging, THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 通讯作者
（105） LSH-based semantic dictionary learning for large scale image understanding, JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 第 6 作者
（106） Multi-level discriminative dictionary learning with application to large scale image classification, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 第 3 作者
（107） Joint image representation and classification in random semantic spaces, NEUROCOMPUTING, 2015, 第 6 作者
（108） Online learning affinity measure with CovBoost for multi-target tracking, NEUROCOMPUTING, 2015, 第 2 作者
（109） Polysemious visual representation based on feature aggregation for large scale image applications, MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 第 5 作者
（110） Beyond Explicit Codebook Generation: Visual Representation Using Implicitly Transferred Codebooks, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 第 5 作者
（111） Social Attribute-Aware Force Model: Exploiting Richness of Interaction for Abnormal Crowd Detection, IEEETRANSACTIONSONCIRCUITSANDSYSTEMSFORVIDEOTECHNOLOGY, 2015, 第 5 作者
（112） Multi-order visual phrase for scalable partial-duplicate visual search, MULTIMEDIA SYSTEMS, 2015, 第 3 作者
（113） USB: Ultrashort Binary Descriptor for Fast Visual Matching and Retrieval, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 第 3 作者
（114） Topic detection in cross-media: a semi-supervised co-clustering approach, International Journal of Multimedia Information Retrieval, 2014, 通讯作者
（115） Cascade Category-Aware Visual Search, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 第 3 作者
（116） Face Distortion Recovery Based on Online Learning Database for Conversational Video, IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 第 4 作者
（117） Relative image similarity learning with contextual information for Internet cross-media retrieval, MULTIMEDIA SYSTEMS, 2014, 第 3 作者
（118） A Simulation Analysis on the Existence of Network Traffic Flow Equilibria, IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 第 3 作者
（119） Web video thumbnail recommendation with content-aware analysis and query-sensitive matching, MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 第 5 作者
（120） Online HodgeRank on Random Graphs for Crowdsourceable QoE Evaluation, IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 第 3 作者
（121） Representing dense crowd patterns using bag of trajectory graphs, SIGNAL IMAGE AND VIDEO PROCESSING, 2014, 第 2 作者
（122） Embedding Multi-Order Spatial Clues for Scalable Visual Matching and Retrieval, IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2014, 第 3 作者
（123） ObjectPatchNet: Towards scalable and semantic image annotation and retrieval, COMPUTER VISION AND IMAGE UNDERSTANDING, 2014, 第 4 作者
（124） Online Discriminative Structured Output SVM Learning for Multi-Target Tracking, IEEE SIGNAL PROCESSING LETTERS, 2014, 第 4 作者
（125） Recognizing human group action by layered model with multiple cues, NEUROCOMPUTING, 2014, 第 3 作者
（126） Partial-Duplicate Image Retrieval via Saliency-Guided Visual Matching, IEEE MULTIMEDIA, 2013, 第 5 作者
（127） Image classification using spatial pyramid robust sparse coding, PATTERN RECOGNITION LETTERS, 2013, 第 3 作者
（128） Weighted visual vocabulary to balance the descriptive ability on general dataset, NEUROCOMPUTING, 2013, 第 3 作者
（129） Accurate and efficient cross-domain visual matching leveraging multiple feature representations, VISUAL COMPUTER, 2013, 第 4 作者
（130） Robust Spatial Consistency Graph Model for Partial Duplicate Image Retrieval, IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 第 5 作者
（131） Laplacian affine sparse coding with tilt and orientation consistency for image classification, JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2013, 第 3 作者
（132） Beyond visual features: A weak semantic image representation using exemplar classifiers for classification, NEUROCOMPUTING, 2013, 第 5 作者
（133） Edge-SIFT: Discriminative Binary Descriptor for Scalable Partial-Duplicate Mobile Search, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 第 4 作者
（134） Image classification using Harr-like transformation of local features with coding residuals, SIGNAL PROCESSING, 2013, 第 4 作者
（135） SSOCBT: A Robust Semi-Supervised Online CovBoost Tracker by Using Samples Differently, IEEE Transactions on. Circuits and Systems for Video Technology, 2013, 第 1 作者
（136） @ICT: attention-based virtual content insertion, MULTIMEDIA SYSTEMS, 2012, 第 2 作者
（137） Nearest-neighbor method using multiple neighborhood similarities for social media data mining, NEUROCOMPUTING, 2012, 第 2 作者
（138） Online selection of the best k-feature subset for object tracking, JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2012, 第 2 作者
（139） S3MKL: Scalable Semi-supervised Multiple Kernel Learning for Real World Image Data Mining, IEEE Transactions on Multimedia, 2012, 第 1 作者
（140） A Multiple Targets Appearance Tracker Based on Object Interaction Model, IEEE Transactions on Circuits and Systems for Video Technology, 2012, 第 1 作者
（141） A Generic Approach for Systematic Analysis of Sports Videos, ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2012, 第 4 作者
（142） HodgeRank on Random Graphs for Subjective Video Quality Assessment, IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 第 2 作者
（143） Learning Hierarchical Semantic Description Via Mixed-Norm Regularization for Image Understanding, IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 第 3 作者
（144） Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 第 4 作者
（145） Transferring Boosted Detectors Towards Viewpoint and Scene Adaptiveness, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 第 2 作者
（146） Modeling spatial and semantic cues for large-scale near-duplicated image retrieval, COMPUTER VISION AND IMAGE UNDERSTANDING, 2011, 第 5 作者
（147） A Novel Rate Control Technique for Multiview Video plus Depth based 3D Video Coding, IEEE TRANSACTIONS ON BROADCASTING, 2011, 第 2 作者
（148）一种基于奇异值分解的图像匹配算法, An Image Matching Algorithm Based on Singular Value Decomposition, 计算机研究与发展, 2010, 第 2 作者
（149） RD-optimized interactive streaming of multiview video with multiple encodings, JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2010, 第 2 作者
（150） Error-resistance and low-complexity integer inverse discrete cosine transform, JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2010, 第 2 作者
（151） A Low-Cost Very Large Scale Integration Architecture for Multistandard Inverse Transform, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2010, 第 2 作者
（152） Mocc: a fast and robust correlation-based method for interest point matching under large scale changes, EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010, 第 2 作者
（153） Affective visualization and retrieval for music video, IEEE TRANSACTIONS ON MULTIMEDIA, 2010, 第 2 作者
（154） A framework for flexible summarization of racquet sports video using multiple modalities, COMPUTER VISION AND IMAGE UNDERSTANDING, 2009, 通讯作者
（155） A configurable method for multi-style license plate recognition, PATTERN RECOGNITION, 2009, 第 3 作者
（156） Pornographic image detection based on multilevel representation, INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2009, 第 2 作者
（157） Event Tactic Analysis Based on Broadcast Sports Video, IEEE TRANSACTIONS ON MULTIMEDIA, 2009, 第 3 作者
（158） Joint video/depth rate allocation for 3D video coding based on view synthesis distortion model, SIGNAL PROCESSING-IMAGE COMMUNICATION, 2009, 通讯作者
（159）行为综合功能流水线中的资源约束LB-ACO算法, Resource-constrained LB-ACO algorithm for functional pipelines in behavioral synthesis, 系统工程与电子技术, 2009, 第 3 作者
（160） Personalized MTV Affective Analysis Using User Profile, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA, 2008, 第 2 作者
（161） Highlight Ranking for Broadcast Tennis Video Based on Multi-modality Analysis and Relevance Feedback, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA, 2008, 第 2 作者
（162） Detecting Violent Scenes in Movies by Auditory and Visual Cues, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA, 2008, 第 4 作者
（163）基于用户关注空间与注意力分析的视频精彩摘要与排序, User Attention Analysis Based Video Summarization and Highlight Ranking, 计算机学报, 2008, 第 1 作者
（164）支持空域随机访问功能的多视点视频编码方法, 计算机辅助设计与图形学学报, 2008,
（165） A Two-Stage Approach to Highlight Extraction in Sports Video by Using AdaBoost and Multi-modal, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA, 2008, 第 3 作者
（166） Unsupervised texture classification: Automatically discover and classify texture patterns, IMAGE AND VISION COMPUTING, 2008,
（167） Multiple Instance Boost Using Graph Embedding Based Decision Stump for Pedestrian Detection, COMPUTER VISION - ECCV 2008, PT IV, PROCEEDINGS, 2008, 第 2 作者
（168）行为综合功能流水线中的负载平衡蚁群调度算法, Load-Balanced Ant Colony Optimization Scheduling Algorithm in Behavioral Synthesis of Functional Pipelines, 微电子学, 2008, 第 3 作者
（169） Joint source-channel rate-distortion optimization for h.264 video coding over error-prone networks, IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 第 4 作者
（170） Video2cartoon: a system for converting broadcast soccer video into 3d cartoon animation, IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2007, 第 2 作者
（171）基于笔画特征的叠加文字检测方法, Text detection based on stroke features, 通信学报, 2007, 第 4 作者
（172）基于“bag of words”的视频匹配方法, Video matching method based on "bag of words", 通信学报, 2007, 第 4 作者
（173） Human behavior analysis for highlight ranking in broadcast racket sports video, IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 第 2 作者
（174）基于群服务器的AVS视频编码并行化实现方法, 哈尔滨工业大学学报, 2007,
（175） Effective algorithms for fast transcoding of AVS to H.264/AVC in the spatial. Multimedia Tools and Applications, Multimedia Tools and Applications, 2007,
（176） Statistical model, analysis and approximation of rate-distortion function in mpeg-4 fgs videos, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2006,
（177） Extracting 3D information from broadcast soccer video, IMAGE AND VISION COMPUTING, 2006, 第 3 作者
（178） Self-calibration based 3d information extraction and application in broadcast soccer video, COMPUTER VISION - ACCV 2006, PT II, 2006,
（179） Multi-view Video Coding with Flexible View-Temporal Prediction Structure for Fast Random Access., Lecture Notes in Computer Science, 2006,
（180） Online selection of discriminative features using Bayes error rate for visual tracking, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2006, PROCEEDINGS, 2006, 第 2 作者
（181） Image matching by multiscale oriented corner correlation, COMPUTER VISION - ACCV 2006, PT I, 2006,
（182）自适应高斯混合模型球场检测算法及其在体育视频分析中的应用. 计算机研究与发展, 计算机研究与发展, 2006,
（183） Sports Video Summarization and Adaptation for Application in Mobile Communication, Journal of Zhejiang University SCIENCE A, 2006,
（184） An effective method to detect and categorize digitized traditional Chinese paintings, PATTERN RECOGNITION LETTERS, 2006,
（185） Fast and robust text detection in images and video frames. Image and Vision Computing, Image and Vision Computing, 2005,
（186） Thresholding technique with adaptive window selection for uneven lighting image, PATTERN RECOGNITION LETTERS, 2005,
（187） Robust moving object segmentation on H.264/AVS compressed video using the block-based MRF model, Real-TimeImaging, 2005,
（188）可伸缩多媒体传输错误保护算法综述, Error Protection Algorithms for Scalable Multimedia Transmission: A Survey, 计算机研究与发展, 2005, 第 3 作者
（189） Visual Ontology Construction for Digitized Art Image Retrieval, JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2005,
（190） Robust real-time transmission of scalable multimedia for heterogeneous client bandwidths, REAL-TIME IMAGING, 2005,
（191） A scheme for ball detection and tracking in broadcast soccer video, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2005, PT 1, 2005,
（192） Optimum end-to-end distortion estimation for error resilient video coding, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 2, PROCEEDINGS, 2004,
（193） Key techniques of bit rate reduction for H.264 streams, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 2, PROCEEDINGS, 2004,
（194） MULTFRC-LERD: An improved rate control scheme for video streaming over wireless, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 3, PROCEEDINGS, 2004,
（195） Moving object segmentation: a block-based moving region detection approach, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 1, PROCEEDINGS, 2004,
（196） Multiview video coding based on global motion model, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 3, PROCEEDINGS, 2004,
（197） Embedded packetization framework for layered multiple description coding, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 3, PROCEEDINGS, 2004,
（198） A new text detection algorithm in images/video frames, ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 2, PROCEEDINGS, 2004,
（199） Real-time traffic information extraction based on compressed video with interframe motion vector, Real-time traffic information extraction based on compressed video with interframe motion vector, 哈尔滨工业大学学报：英文版, 2003, 第 1 作者
（200） Geometric Hypergraph Learning for Visual Tracking, 第 5 作者

发表著作

（1）监控视频高效编码与智能分析, 科学出版社, 2016-04, 第 3 作者
（2）机器学习从原理到应用, 人民邮电出版社, 2020-10, 第 2 作者

科研活动

承担了科技创新2030-新一代人工智能重大项目、国家自然科学基金（杰青、重点、重点国际合作和面上项目）、973计划、863计划、中科院****、中科院前沿科学重点研究计划、北京市自然科学基金、新加坡科技局资助项目等国内外众多课题的研究工作。

科研项目

（ 1 ）图像视频结构分析与表达, 参与, 国家级, 2015-01--2019-08
（ 2 ）跨媒体语义学习与内容理解, 主持, 国家级, 2012-01--2016-08
（ 3 ）面向网络事件的跨平台异质媒体语义协同与挖掘, 主持, 国家级, 2014-01--2018-12
（ 4 ）基于内容的视频处理与编码, 主持, 国家级, 2011-01--2014-12
（ 5 ）基于知识图谱的异构媒体大数据搜索与挖掘, 主持, 部委级, 2016-08--2020-12
（ 6 ）面向城市交通监控的无人机视频分析及其关键信息挖掘, 主持, 国家级, 2017-01--2021-12
（ 7 ）网络大数据环境下的主流媒体内容感知、交互与分析展示技术, 参与, 国家级, 2017-01--2020-12
（ 8 ）目标识别技术研制, 主持, 研究所（学校）, 2016-12--2018-12
（ 9 ）面向跨媒体内容管理的智能分析与推理, 主持, 国家级, 2019-12--2023-12
（ 10 ）端云协同视频智能计算方法与芯片架构研究, 参与, 国家级, 2020-01--2024-12

参与会议

（1）Some Stats from the PCs   2018-09-15
（2）Deep Unsupervised Convolutional Domain Adaptation   2017-10-24
（3）Multi-Networks Joint Learning for Large-Scale Cross-Modal Retrieval   2017-10-24
（4）From Seed Discovery to Deep Reconstruction: Predicting Saliency in Crowd via Deep Networks   2016-10-15
（5）Cross-modal Retrieval by Real Label Partial Least Squares   2016-10-15
（6）PL-ranking: A Novel Ranking Method for Cross-Modal Retrieval   2016-10-15
（7）Webpage Saliency Prediction with Multi-Features Fusion   2016-09-25
（8）Accelerate Convolutional Neural Networks for Binary Classification via Cascading Cost-Sensitive Feature   2016-09-25
（9）Tri-level Combination for Image Representation   2016-09-15
（10）Relay Backpropagation for Effective Learning of Deep Convolutional Neural Networks   2016-08-08
（11）Video Saliency Prediction with Optimized Optical Flow and Gravity Center Bias   2016-07-11
（12）Robust Latent Poission Deconvolution from Multiple Imperfect Features for Web Topic Detection   2016-07-11
（13）Crowd Video Retrieval via Deep Attribute-Embedding Graph Ranking   2016-07-11
（14）Hedged Deep Tracking   2016-06-27
（15）Joint Multi-View Representation Learning and Image Tagging   2016-02-12
（16）Similarity Gaussian Process Latent Variable Model for Multi-Modal Data Analysis   2015-12-11
（17）Learning Label Specific Features for Multi-Label Classification   2015-11-14
（18）Cross-media Topic Detection with Refined CNN Based Image-Dominant Topic Model   2015-10-26
（19）ALID: Scalable Dominant Cluster Detection   2015-08-31
（20）Adaptive Sharing for Image Classification   2015-07-25
（21）Semantic-aware Hashing for Social Image Retrieval   2015-06-29
（22）Group Sensitive Classifier Chains for Multi-Label Classification   2015-06-29
（23）Improving Cross-Modal Correlation Learning by Hyperlinks   2015-06-29
（24）GOMES: A Group-Aware Multi-View Fusion Approach towards Real-World Image Clustering   2015-06-29
（25）Formation Period Matters: Towards Socially Consistent Group Detection via Dense Subgraph Seeking   2015-06-23
（26）Location-Based Parallel Tag Completion for Geo-tagged Social Photo Retrieval   2015-06-23
（27）The Face Object Based HEVC System for Video Call   2015-05-25
（28）Cross-modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation   2014-12-14
（29）Large Scale Image Understanding with Non-convex Multi-task Learning   2014-11-25
（30）Coupling Multiple Alignments and Re-ranking for Low-Latency Online Multi-target Tracking   2014-11-01
（31）Sharing Model with Multi-level Feature Representations   2014-10-27
（32）Cross Modal Metric Learning with Multi-level Semantic Relevance   2014-10-27
（33）Weakly Supervised Cross-view Action Recognition via Sequential Motion Accumulation   2014-10-27
（34）DA-CCD: A Novel Action Representation by Deep Architecture of Local Depth Feature   2014-10-27
（35）Web Topic Detection Using A Ranked Clustering-like Pattern Across Similarity Cascades   2014-07-14
（36）Cross media Topic Analytics Based on Synergetic Content and User Behavior Modeling   2014-07-14
（37）Graph-Density-Based Visual Word Vocabulary for Image Retrieval   2014-07-14
（38）On-line Web-video Topic Detection and Tracking with Semi-supervised Learning   2013-12-13
（39）Fine-Grained Image Classification Using Color Exemplar Classifiers   2013-12-13
（40）Semantically-based Human Scanpath Estimation with HMMs   2013-12-03
（41）Undo the Codebook Bias by Linear Transformation for Visual Applications   2013-10-21
（42）Beyond Bag of Words: Image Representation in Sub-semantic Space   2013-10-21
（43）Robust Evaluation for Quality of Experience in Crowdsourcing   2013-10-21
（44）Stochastic Boosting for Large-scale Image Classification   2013-09-15
（45）Particle Flow: Bag of Trajectory Graphs for Dense Crowd Event Recognition   2013-09-15
（46）Set-Based Classification for Person Re-Identification Utilizing Mutural-Information   2013-09-15
（47）An Efficient Occlusion Detection Mehtod to Improve Object Trackers   2013-09-15
（48）Discriminative Spatial Codebook Generation for Image Classification   2013-07-27
（49）Abnormal Event Detection in Crowded Scenes Based on Structural Multi-scale Motion Interlaced Patterns   2013-07-15
（50）WIKI-CMR: A Web Cross Modality Database for Studying and Evaluation of Cross Modality Retrieval Methods   2013-07-15
（51）Cross-media Topic Detection: A Multi-modality Fusion Framework   2013-07-15
（52）Multi-Level Discriminative Dictionary Learning towards Hierarchical Visual Categorization   2013-06-23
（53）Online Learning Based Face Distortion Recovery for Conversational Video Coding   2013-03-20
（54）Cross Concept Local Fisher Discriminant Analysis for Image Classification   2013-01-07
（55）Visual Saliency and Distortion Weighting Based Video Quality Assessment   2012-12-04
（56）Spatio-temporal Visual Distortion and Rate Optimization for Video Coding   2012-12-04
（57）Improving Image Distance Metric Learning by Embedding Semantic Relations   2012-12-04
（58）Theoretical Analysis of Learning Local Anchors for Classification   2012-11-11
（59）Color Maximal-Dissimilarity Pattern for Pedestrian Detection   2012-11-11
（60）Abnormal Crowd Behavior Detection Based on Social Attribuye-Aware Force Model   2012-09-30
（61）Aesthetic Composition Representation for Portrait Photographing Recommendation   2012-09-30
（62）Online Crowdsourcing Subjective Image Quality Assessment   2012-09-29
（63）An Effective Multi-Clue Fusion Approach for Web Video Topic Detection   2012-09-29
（64）Motion Based Perceptual Distortion and Rate Optimization for Video Coding   2012-07-09
（65）Multi-feature Metric Learning with Knowledge Transfer among Semantics and Social Tagging   2012-06-18
（66）A Rotation Invariant Descriptor for Robust Video Copy Detection   Shuqiang Jiang, Li Su, Qingming Huang, Peng Cui, Zhipeng Wu   2011-12-22
（67）Coarse-to-Fine Dissolve Detection Based on Image Quality Assessment   Weigang Zhang, Chunxi Liu, Qingming Huang, Shuqiang Jiang, Wen Gao   2011-12-21
（68）Recognizing Realistic Action Using Contextual Feature Group   Yituo Ye, Lei Qin, Zhongwei Cheng, Qingming Huang   2011-12-21
（69）Learning-to-Share Based on Finding Groups for Large Scale Image Classification   Li Shen, Shuqiang Jiang, Shuhui Wang, Qingming Huang   2011-12-20
（70）Adaptive Moving Cast Shadow Detection   Guizhi Li, Lei Qin, Qingming Huang   2011-12-20
（71）Justifying the Importance of Color Cues in Object Detection: A Case Study on Pedestrian   Qingyuan Wang, Junbiao Pang, Lei Qin, Shuqiang Jiang, Qingming Huang   2011-12-20
（72）Random Partial Paired Comparison for Subjective Video Quality Assessment via HodgeRank   Qianqian Xu   2011-11-29
（73）Detection and Location of Near-Duplicate Video Sub-Clips by Finding Dense Subgraphs   Tianlong Chen   2011-11-29
（74）Human Group Activity Analysis with Fusion of Motion and Appearance Information   Zhongwei Cheng   2011-11-29
（75）Efficient lp-norm multiple feature metric learning for image categorization   Shuhui Wang   2011-10-26
（76）ObjectBook Construction for Large-Scale Semantic-Aware Image Retrieval   Shiliang Zhang   2011-10-18
（77）Visual Perception Based Lagrangian Rate Distortion Optimization for Video Coding   Xi Wang   2011-09-14
（78）Query Sensitive Dynamic Web Video Thumbnail Generation   Chunxi Liu   2011-09-13
（79）Online Vicept Learning for Web-scale Image Understanding   Liang Li   2011-09-13
（80）Human Tracking by Structured Body Parts   Xingkun Xu   2011-09-13
（81）Fast Common Visual Pattern Detection via Radiate Geometric Model   Lingyang Chu   2011-09-12
（82）Content-based intelligent video recorder with its implementation on sports video   Shuqiang Jiang, Qingming Huang, Zhao Zhao   2011-08-05
（83）Matching Content-Based Saliency Regions for Partial-Duplicate Image Retrieval   Liang Li   2011-07-12
（84）News Video Story Sentiment Classification and Ranking   Chunxi Liu   2011-07-11
（85）Learning Image Vicept Description via Mixed-Norm Regularization for Large Scale Semantic Image Search   Liang Li   2011-06-21
（86）Treat samples differently: Object tracking with semi-supervised online CovBoost   Guorong Li   2010-11-08
（87）Vicept: Link Visual Features to Concepts for Large-scale Image Understanding   Zhipeng Wu   2010-10-26
（88）Neighbor Classification Using Unlabeled Data for Large Scale Image Application   Shuhui Wang   2010-10-26
（89）Memory Matrix: A Novel User Experience for Home Video   Qianqian Xu   2010-10-26
（90）Building Contextual Visual Vocabulary for Large-scale Image Applications   Shiliang Zhang   2010-10-26
（91）Real-time Interactive Multi-Target Tracking Using Kernel-Based Trackers   Guorong Li   2010-09-27
（92）A Close-up Detection Method for Movies   Huiying Liu   2010-09-27
（93）Multi-Description of Local Interest Point for Partial-Duplicate Image Retrieval   Liang Li   2010-09-27
（94）Attention Based Album Slideshow   Huiying Liu    2010-09-13
（95）Adding Affine Invariant Geometric Constraint for Partial-Duplicate Image Retrieval    Zhipeng Wu   2010-08-25
（96）Action Recognition Using Spatial-Temporal Context   Qiong Hu   2010-08-25
（97）Multiple Kernel Learning with High Order Kernels   Shuhui Wang   2010-08-25
（98）Localized Image Matte Evaluation by Gradient Correlation   Guilin Yao   2010-08-25
（99）Group Activity Recognition by Gaussian Processes Estimation   Zhongwei Cheng   2010-08-24
（100）Bridging the Gap Between Objective Score and Subjective Preference in Video Quality Assessment   Qianqian Xu   2010-07-20
（101）Event Based News Video People Classification and Ranking Using Multimodality Features   Chunxi Liu   2010-07-20
（102）Fast Copy Detection Based on Slice Entropy Scattergraph   Peng Cui   2010-07-20
（103）Novel Observation Model for Probabilistic Object Tracking   Dawei Liang   2010-06-15
（104）Measuring Visual Saliency by Site Entropy Rate   Wei Wang   2010-06-15
（105）Pair-wise Visual Word Tree for Efficeint Image Re-ranking   Shiliang Zhang   2010-03-15
（106）Video Shrinking by Auditory and Visual Cues   Qianqian Xu   2009-12-17
（107）A Generic Approach to Classify Sports Video Shots and Its Application in Event Detection   Lingfang Li   2009-11-24
（108）Personalized Online Video Recommendation by Neighborhood Score Propagation Based Global Ranking   Chunxi Liu   2009-11-24
（109）Advertise Gently - In-image advertising with Low Intrusiveness   Huiying Liu, Xuekan Qiu, Qingming Huang, Shuqiang Jiang, Changsheng Xu   2009-11-07
（110）Utilizing Affective Analysis for Efficient Movie Browsing   Shiliang Zhang, Qi Tian, Qingming Huang, Wen Gao, Shipeng Li   2009-11-07
（111）Joint Learning for Side Information and Correlation Model Based on Linear Regression Model in Distributed Video Coding   Xianming Liu, Debin Zhao, Yongbing Zhang, Siwei Ma, Qingming Huang,Wen Gao   2009-11-07
（112）Visual ContextRank for Web Image Re-ranking   Shuhui Wang   2009-10-09
（113）Near-Duplicate Video Matching with Transformation Recognition   Zhipeng Wu   2009-10-09
（114）Descriptive Visual Words and Visual Phrases for Image Applications   Shiliang Zhang   2009-10-09
（115）Automatic Sports Genre Categorization and View-type Classification over Large-scale Dataset   Lingfang Li   2009-10-09
（116）Friend Recommendation According to Appearances on Photos   Zhipeng Wu   2009-10-09
（117）Spatial-temporal Video Browsing for Mobile Application Based on Visual Attention Analysis   Xuekan Qiu   2009-06-09
（118）A Hybrid Text Segmentation Approach   Xiaojun Li   2009-06-09
（119）Compression-Induced Rendering Distortion Analysis for Texture/Depth Rate Allocation in 3D Video Compression   Yanwei Liu   2009-03-09
（120）A Generic Virtual Content Insertion System Based on Visual Attention Model   Huiying Liu   2008-10-28
（121）Naming Faces in Broadcast News Video By Image Google   Chunxi Liu   2008-10-27
（122）iMTV - An Integrated System for MTV Affective Analysis   Shiliang Zhang   2008-10-27
（123）Pedestrian Detection via Logistic Multiple Instance Boosting   Junbiao Pang   2008-10-12
（124）Fast and Effective Text Detection   Xiaojun Li   2008-10-12
（125）People Re-Detection Using AdaBoost with Sift and Color Correlogram   Lei Hu   2008-10-12
（126）Object Tracking Using Incremental 2D-LDA Learning and Bayes Inference   Guorong Li   2008-10-12
（127）Shot Classification for Action Movies Based on Motion Characteristics   Shuhui Wang   2008-10-12
（128）Tactic Analysis Based on Player and Ball Trajectory in Broadcast Video   Guangyu Zhu   2008-07-07
（129）Lower Attentive Region Detection for Virtual Content Insertion in Broadcast Video   Huiying Liu   2008-06-26
（130）Spatial-Temporal Attentive Analysis for Home Video   Xuekan Qiu   2008-06-26
（131）Coarse-to-Fine Video Text Detection   Guangyi Miao   2008-06-26
（132）Affective MTV Analysis Based on Arousal and Valence Features   Shiliang Zhang   2008-06-26
（133）A Cooperative Framework for Background Modeling in Traffic Video Surveillance   ZHONG WEI   2007-12-15
（134）Depth Image Segmentation for Improved Virtual View Image Quality in 3-DTV   Wei Wang   2007-11-28
（135）Region-Based Visuall Attention Analysis with Its Application in Image Browsing on Small DisplaysProc. Of ACM MM 2007   Huiying Liu   2007-09-23
（136）Trajectory Based Event Tactics Analysis in Broadcast Sports Video   Guangyu Zhu   2007-09-23
（137）MEAN-SHIFT BLOB TRACKING WITH ADAPTIVE FEATURE SELECTION AND SCALE ADAPTATION   Dawei Liang   2007-09-17
（138）MONOCULAR TRACKING 3D PEOPLE BY GAUSSIAN PROCESS SPATIO-TEMPORAL VARIABLE MODEL   Junbiao Pang   2007-09-17
（139）MINING INFORMATION OF ATTACK-DEFENSE STATUS FROM SOCCER VIDEO BASED ON SCENE ANALYSIS   Shuqiang Jiang   2007-08-03
（140）A Real-time Score Detection and Recognition Approach for Broadcast Basketball Video   Guangyi Miao   2007-07-03
（141）HIGHLIGHT RANKING FOR RACQUET SPORTS VIDEO IN USER ATTENTION SUBSPACES BASED ON RELEVANCE FEEDBACK   Yijia Zheng   2007-07-03
（142）GENERATING VIDEO SEQUENCE FROM PHOTO IMAGE FOR MOBILE SCREENS BY CONTENT ANALYSIS   Shuqiang Jiang   2007-07-03
（143）A Fast Approach for Natural Image Matting Using Structure Information   Qianhui Ning   2007-07-03
（144）A Pixel-wise Local Information-based Background Subtraction Approach   Zhong Wei   2007-06-26
（145）Robust Copy Detection by Mining Temporal Self-Similarities   Zhipeng Wu   2007-06-09
（146）Macroblock-level Reduced Resolution Video Coding Allowing Adaptive DCT Coefficients Selection   Qiang Hao   2007-05-28
（147）Low-delay View Random Access for Multi-view Video Coding   Yanwei Liu   2007-05-28
（148）Drift Compensated Coding Optimization for Fast Bit-rate Reduction Transcoding   Peng Zhang   2007-01-29
（149）Low-Complexity Video Compression Using Down/Up Sampling at Low Bitrates   Qiang Hao   2006-11-06
（150）View Sequence Coding using Warping-based Image Alignment for Multi-view Video   Weiyan Liu   2006-11-06
（151）Highlight Summarization in Soccer Video Based on Goalmouth Detection   Zhao Zhao   2006-11-06
（152）Monocular Tracking 3D People with Back Constrained Scaled Gaussian Process Latent Variable Models   Junbiao Pang   2006-11-06
（153）Player Action Recognition in Broadcast Tennis Video with Applications to Semantic Analysis of Sports Game   Guangyu Zhu   2006-10-23
（154）Action Recognition in Broadcast Tennis Video   Guangyu Zhu   2006-08-21
（155）Unsupervised Texture Classification: Automatically Discover and Classify Texture Patterns   Lei Qin   2006-08-21
（156）An edge-based median filtering algorithm with consideration of motion vector reliability for adaptive video deinterlacing   Qian Huang   2006-07-17
（157）Automatic multi-player detection and tracking in broadcast sports video using support vector machine and particle filter   Guangyu Zhu   2006-07-10
（158）A fast intra mode decision algorithm for AVS to H.264 transcoding   Zhigang Wang   2006-07-10
（159）Extracting Story Units in Sports Video Based on Unsupervised Video Scene Clustering   Chunxi Liu   2006-07-10
（160）Highlight Summarization in Sports Video Based on Replay Detection   Zhao Zhao   2006-07-10
（161） Image Matching by Normalized Cross-Correlation   Feng Zhao,   2006-05-15
（162）Action Recognition in Broadcast Tennis Video Using Optical Flow and Support Vector Machine   Guangyu Zhu   2006-05-13
（163）A Simple Algorithm for Constant Quality Reconstruction of Scalable video Using a New Analytical R-D Model   Jun Sun,   2006-04-25
（164）Subjective Evaluation Criterion for Selecting Affective Features and Modeling Highlights   Liyuang Xing   2006-01-18
（165）Exciting Event Detection in Broadcast Soccer Video with Mid-level Description and Incremental Learning   Qixiang Ye   2005-11-07
（166）Video2Cartoon: Generating 3D Cartoon from Broadcast Soccer Video   Dawei Liang   2005-11-07
（167）Improving Particle Filter with Support Vector Regression for Efficient Visual Tracking   Guangyu Zhu   2005-09-12
（168） Jersey Number Detection in Sports Video for athlete Identification   Qixiang Ye   2005-07-13
（169）Statistical Model, Analysis and Approximation of Rate-Distortion Function in MPEG-4 FGS Videos   Jun SUn   2005-07-13
（170）A New Method to Calculate the Camera Focusing Area and Player Position on Playfield in Soccer Video   Yang Liu   2005-07-13
（171）Unsupervised Sports Video Scene Clustering and Its Applications to Story Units Detection   Weigang Zhang   2005-07-13
（172）A Scheme for Racquet Sports Video Analysis with the Combination of Audio-visual Information   Liyuang Xing   2005-07-13
（173）A System for Automatic Generation of Music Sports-video   Weigang Zhang   2005-07-07
（174）Linear Transform Based Motion Compensated Prediction for Lumiance Intensity Changes   Debin LI   2005-05-24
（175）Viewpoint Switching in Multiview Video Streaming   Xun Guo   2005-05-24
（176）Bandwidth Adaptive Quality Smoothing for Unequal Error Protected Scalable Video Streaming   Longshe Huo   2005-03-30
（177）Playfield Detection Using Adaptive GMM and Its Application   Yang Liu   2005-03-20
（178）Quality Smoothing for FEC-based Multiple Description Coding   Longshe Huo   2004-12-24
（179）FEC-based Multiple Description Coding for Heterogeneous Client Bandwidths   Longshe Huo   2004-12-01
（180） A Novel FGS Base-Layer Encoding Model And Weight-Based Rate Adaptation for Constant-Quanlity Streaming   Jun Sun   2004-12-01
（181）Error Resilience Video Coding in H.264 Encoder with Potential Distortion Tracking   Yuan Zhang   2004-10-25
（182）Automatic Text Segmentation from Complex Background   Qixiang Ye   2004-10-25
（183）Mode Mapping Method for H.264/AVC Spatial Downscaling Transcoding   Peng Zhang   2004-10-25
（184）Context-Based 2D-VLC for Video Coding   Qiang Wang    2004-06-28
（185）New Bi-Prediction Techniques for B Pictures Coding   Xiangyang Ji   2004-06-28
（186）Improved Error Concealment Algorithms Based on H.264/AVC Non-normative Decoder   Li Su   2004-06-28

合作情况

项目协作单位

北京大学

哈尔滨工业大学

哈尔滨工程大学

北京工业大学

NEC中国研究院

微软亚洲研究院

新加坡国立大学

新加坡南洋理工大学

指导学生

已指导学生

魏众硕士研究生 081203-计算机应用技术

宋德强博士研究生 081002-信号与信息处理

苗广艺硕士研究生 081203-计算机应用技术

郑轶佳硕士研究生 081203-计算机应用技术

蔡少婕硕士研究生 081203-计算机应用技术

邱学侃硕士研究生 081203-计算机应用技术

刘纯熙博士研究生 081203-计算机应用技术

吴志鹏硕士研究生 081203-计算机应用技术

刘慧颖博士研究生 081203-计算机应用技术

王威博士研究生 081203-计算机应用技术

刘延伟博士研究生 081203-计算机应用技术

赵照硕士研究生 081203-计算机应用技术

郝嫱硕士研究生 081203-计算机应用技术

郭瑞硕士研究生 081203-计算机应用技术

庞俊彪博士研究生 081203-计算机应用技术

王树徽博士研究生 081203-计算机应用技术

崔鹏硕士研究生 081203-计算机应用技术

胡琼硕士研究生 081203-计算机应用技术

李国荣博士研究生 081203-计算机应用技术

王清源硕士研究生 081203-计算机应用技术

许营坤博士研究生 081203-计算机应用技术

许倩倩博士研究生 081203-计算机应用技术

康国强硕士研究生 430112-计算机技术

叶宜拓硕士研究生 081203-计算机应用技术

胡方振硕士研究生 081203-计算机应用技术

王仿坤硕士研究生 430112-计算机技术

朱琳硕士研究生 081203-计算机应用技术

成仲炜博士研究生 081203-计算机应用技术

王茜博士研究生 081203-计算机应用技术

李亮博士研究生 081203-计算机应用技术

陈天龙硕士研究生 081203-计算机应用技术

李桂芝硕士研究生 430141-物流工程

张艳雁硕士研究生 081203-计算机应用技术

刘昊硕士研究生 085211-计算机技术

申丽博士研究生 081203-计算机应用技术

褚令洋博士研究生 081203-计算机应用技术

宋国利博士研究生 081203-计算机应用技术

薛哲博士研究生 081203-计算机应用技术

刘艺硕士研究生 081203-计算机应用技术

王祯骏硕士研究生 081203-计算机应用技术

吴益灵博士研究生 081203-计算机应用技术

李瑞英硕士研究生 081203-计算机应用技术

徐梓钧硕士研究生 081203-计算机应用技术

胡玲硕士研究生 081203-计算机应用技术

独大为博士研究生 081203-计算机应用技术

黄俊博士研究生 081203-计算机应用技术

周志达硕士研究生 085211-计算机技术

陈祖耀硕士研究生 081203-计算机应用技术

张知明硕士研究生 081203-计算机应用技术

杨智勇博士研究生 083900-网络空间安全

张家明硕士研究生 081203-计算机应用技术

杨士杰博士研究生 081203-计算机应用技术

李朝鹏硕士研究生 085211-计算机技术

滕尚志博士研究生 081203-计算机应用技术

卓君宝博士研究生 081203-计算机应用技术

吴哲博士研究生 081203-计算机应用技术

杨一帆博士研究生 081203-计算机应用技术

徐凯博士研究生 081203-计算机应用技术

现指导学生

刘雪静博士研究生 081203-计算机应用技术

王子泰博士研究生 081203-计算机应用技术

姜阳邦彦博士研究生 081203-计算机应用技术

温佩松博士研究生 081203-计算机应用技术

曹宗胜博士研究生 081203-计算机应用技术

陈俊宇硕士研究生 081203-计算机应用技术

赵昀睿硕士研究生 081203-计算机应用技术

孟德超博士研究生 081203-计算机应用技术

肖嘉瑜博士研究生 081203-计算机应用技术

刘心岩博士研究生 081203-计算机应用技术

张振铎博士研究生 081203-计算机应用技术

刘睿洲博士研究生 081203-计算机应用技术

张晨博士研究生 081203-计算机应用技术

包世龙博士研究生 081203-计算机应用技术

倪文鑫硕士研究生 081203-计算机应用技术

高培峰硕士研究生 081203-计算机应用技术

陈伟东博士研究生 081203-计算机应用技术

戚兆波博士研究生 081203-计算机应用技术

段凯文博士研究生 081203-计算机应用技术

韩歆哲博士研究生 081203-计算机应用技术

曹天伟博士研究生 081203-计算机应用技术