发表论文
(1) CLIP-TLNet: Canopy Light Interception Prediction with Transformer-LSTM Network Through 3D Complexity-Temporal Dynamics Modeling, Plant Phenomics, 2026, 第 6 作者(2) A Survey of Visual-Language Foundation Models for Enhancing Virtual and Augmented Reality Interactivity, The Visual Computer, 2026, 第 2 作者(3) Adaptive In Adapter: Boosting Open-Vocabulary Semantic Segmentation with Adaptive Dropout Adapter, IEEE Transactions on Multimedia, 2026, 第 8 作者(4) VRBT:轻量级骨骼识别与损伤预警VR羽毛球训方法, 系统仿真学报, 2026, 第 4 作者(5) PanoDiT: Panoramic Videos Generation with Diffusion Transformer, The 39th Annual AAAI Conference on Artificial Intelligence, 2025, 第 6 作者 通讯作者(6) DiffusionIMU: Diffusion-Based Inertial Navigation with Iterative Motion Refinement, The 34th International Joint Conference on Artificial Intelligence (IJCAI-25), 2025, 第 8 作者(7) MFCPopulus: A Point Cloud Completion Network Based on Multi-feature Fusion for 3D Reconstruction of Individual Populus Tomentosa, Forests, 2025, 第 7 作者(8) Enhanced Dual-Model Framework for Precision Player Tracking and Ball Detection in Soccer Videos, The Visual Computer, 2025, 第 6 作者(9) Mask-Guided Transformer with Hybrid Supervision for 3D Instance Segmentation, ICME, 2025, 第 5 作者 通讯作者(10) GLFAFormer: DeepFake forgery detection with adaptive feature extract and align, Digital Signal Processing, 2025, 第 4 作者(11) L2: Accurate Forestry Time-Series Completion and Growth Factor Inference, Forests, 2025, 第 4 作者(12) PDFT: Parameter-Diminish Fine-Tuning for Transformer-based Models, The Visual Computer, 2025, 第 2 作者 通讯作者(13) Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal Interaction, the Fortieth AAAI Conference on Artificial Intelligence (AAAI-26), 2025, 第 2 作者 通讯作者(14) 基于时空融合的多模态路面特征提取算法, 中国体视学与图像分析, 2025, 第 2 作者 通讯作者(15) Multimodal fusion and vision–language models: A survey for robot vision, Information Fusion, 2025, 第 9 作者 通讯作者(16) DeepWIO: A Lightweight Deep Wheel-Inertial Odometry for Mass-Produced Autonomous Driving Systems, CVCI 2025, 2025, 第 2 作者 通讯作者(17) Collaboration Wins More: Dual-Modal Collaborative Attention Reinforcement for Mitigating Large Vision Language Models Hallucination, ACM Multimedia 2025, 2025, 第 11 作者(18) DiffLane: Diffusion Model-Based Lane Mask Generation for Accurate Video Lane Detection, IEEE International Conference on Multimedia & Expo 2025, 2025, 第 3 作者(19) Shape-Preserving and Surface-Fitting Network for 3D Lane Detection, IEEE International Conference on Multimedia & Expo 2025, 2025, 第 5 作者(20) One-Shot Motion Talking Head Generation with Audio-Driven Model, Expert Systems with Applications, 2025, 第 3 作者(21) D3L: Curvature-Constrained Denoising Diffusion Model for 3D Lane Detection, MM '25: Proceedings of the 33rd ACM International Conference on Multimedia, 2025, 第 4 作者(22) AccidentX: A Large-Scale Multimodal BEV Dataset for Traffic Accident Analysis and Prevention, IROS2025 oral, 2025, 第 5 作者 通讯作者(23) GeoROS++: A Georeferenced Real-time Stitching Methods Suitable for Sparse Aerial Orthophoto Data, Siggraph Asia 2025 Technique Communication, 2025, 第 3 作者 通讯作者(24) Enhancing sonar image segmentation with random fusion in a diffusion model framework, The Visual Computer, 2025, 第 2 作者(25) Cross-lingual font style transfer with full-domain convolutional attention, Pattern Recognition, 2024, 第 5 作者(26) Accurate Lung Nodule Segmentation with Detailed Representation Transfer and Soft Mask Supervision, IEEE Transactions on Neural Networks and Learning Systems, 2024, 第 4 作者 通讯作者(27) ROMOT: Referring-expression-comprehension Open-set Multi-Object Tracking, The Visual Computer, 2024, 第 4 作者 通讯作者(28) Arbitrary style transfer via multi-feature correlation, Computers & Graphics, 2024, 第 5 作者(29) Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization, IEEE Transactions on Image Processing, 2024, 第 4 作者 通讯作者(30) FEKNN: A Wi-Fi Indoor Localization Method Based on Feature Enhancement and KNN, The International Conference on Wireless Artificial Intelligent Computing Systems and Applications (WASA), 2024, 第 4 作者 通讯作者(31) Diff-PCG: Diffusion Point Cloud Generation with Continuous Normalizing Flow, The Visual Computer, 2024, 第 2 作者(32) HIDE:Hierarchical Iterative Decoding Enhancement for Multi-view 3D Human Parameter Regression, Computer Animation and Virtual Worlds, 2024, 第 3 作者 通讯作者(33) A Survey on Soccer Player Detection and Tracking with Videos, The Visual Computer, 2024, 第 7 作者(34) Two-particle Debris Flow Simulation Based on SPH, Computer Animation and Virtual Worlds, 2024, 第 6 作者(35) 融合点云与图像的环境目标检测研究进展, 中国图象图形学报, 2024, 第 3 作者 通讯作者(36) De-NeRF: Ultra-High-Definition NeRF with Deformable Net Alignment, Computer Animation and Virtual Worlds, 2024, 第 4 作者(37) DefFusion: Deformable Multimodal Representation Fusion for 3D Semantic Segmentation, 2024 IEEE International Conference on Robotics and Automation, 2024, 第 6 作者 通讯作者(38) Soccer player tracking and data correction based on attention with full-field videos, The Visual Computer, 2024, 第 7 作者(39) RML: Efficient Representation Mutual Learning Framework for End-to-End Weakly-Supervised Semantic Segmentation, IEEE Transactions on Instrumentation & Measurement, 2024, 第 4 作者 通讯作者(40) Token Masking Transformer for Weakly Supervised Object Localization, IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 第 5 作者(41) Soccer Match Broadcast Video Analysis Method Based on Detection and Tracking, Computer Animation and Virtual Worlds, 2024, 第 6 作者 通讯作者(42) AG-SDM: Aquascape Generation based on Stable Diffusion Model with Low-Rank Adaptation, Computer Animation and Virtual Worlds, 2024, 第 6 作者 通讯作者(43) SocialVis: Dynamic Social Visualization in Dense Scenes via Real-time Multi-Object Tracking and Proximity Graph Construction, Computer Animation and Virtual Worlds, 2024, 第 4 作者 通讯作者(44) SkinFormer: Learning Statistical Texture Representation with Transformer for Skin Lesion Segmentation, Journal of Biomedical and Health Informatics (JBHI), 2024, 第 5 作者 通讯作者(45) DTTCNet: Time-to-Collision Estimation with Autonomous Emergency Braking Using Multi-Scale Transformer Network, Transactions on Mobile Computing, 2024, 第 5 作者(46) MSC-Net: Multi-Stage Colorization Network for Real-world Images with Specular Highlights, The Visual Computer, 2024, 第 3 作者(47) MRFTrans: Multimodal Representation Fusion Transformer for monocular 3D semantic scene completion, Information Fusion, 2024, 第 7 作者(48) Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation, IEEE Transactions on Multimedia, 2023, 第 4 作者 通讯作者(49) Self Correspondence Distillation For End-to-End Weakly-Supervised Semantic Segmentation, Association for the Advance of Artificial Intelligence (AAAI), 2023, 第 5 作者 通讯作者(50) RSSFormer: Foreground Saliency Enhancement for Remote Sensing Land-Cover Segmentation, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 第 5 作者(51) HTCViT: an effective network for image classification and segmentation based on natural disaster datasets, the Visual Computer( CGI2023 special issue), 2023, 第 4 作者 通讯作者(52) DomainFeat: Learning Local Features with Domain Adaptation, IEEE Transactions on Circuits and Systems for Video Technology, 2023, 第 4 作者(53) SCOOT: Self-supervised Centric Open-set Object Tracking, SIGGRAPH ASIA, 2023, 第 2 作者 通讯作者(54) Sand Painting Conversion based on Detail Preservation, Computer &Graphics, 2023, 第 3 作者(55) Towards accurate and efficient road extraction by leveraging the characteristics of road shapes, IEEE Transactions on Geoscience and Remote Sensing, 2023, 第 4 作者 通讯作者(56) Attention Weighted Local Descriptors, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 第 5 作者(57) Lightweight Semantic Architecture Modeling by 3D Feature Line Detection, Remote Sensing, 2023, 第 4 作者(58) Automatic polyp segmentation via image-level and surrounding-level context fusion deep neural network, ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 第 4 作者 通讯作者(59) An Efficient Non-iterative SPH Fluid Simulation Method with Variable Smooth Length, Visual Computing for Industry, Biomedicine, and Art, 2023, 第 3 作者 通讯作者(60) FeaCo: Reaching Robust Feature-Level Consensus in Noisy Pose Conditions, ACM Multimedia 2023, 2023, 第 4 作者 通讯作者(61) Triple Robustness Augmentation Local Features for multi-source image registration, ISPRS Journal of Photogrammetry and Remote Sensing, 2023, 第 5 作者(62) Physics-Based Modeling and Fluttering Dynamic Process Simulation for Catkins, Forests, 2023, 第 6 作者(63) Treating Pseudo-labels Generation as Image Matting for Weakly Supervised Semantic Segmentation, International Conference on Computer Vision 2023, 2023, 第 4 作者 通讯作者(64) Audio-Driven Lips and Expression on 3D Human Face, COMPUTER GRAPHICS INTERNATIONAL 2023, 2023, 第 3 作者 通讯作者(65) RC-Net: Row and Column Network with Text Feature for Parsing Floor Plan Images, Journal of Computer Science and Technology, 2023, 第 2 作者(66) Dual-stream Representation Fusion Learning for accurate medical image segmentation, Engineering Applications of Artificial Intelligence, 2023, 第 4 作者 通讯作者(67) CNDesc: Cross Normalization for Local Descriptors Learning, IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 第 4 作者 通讯作者(68) Instance segmentation of biological images using graph convolutional network, ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 第 5 作者 通讯作者(69) SoftGAN: Towards Accurate Lung Nodules Segmentation via Soft Mask Supervision, IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2022, 第 4 作者 通讯作者(70) MTLDesc: Looking Wider to Describe Better, AAAI Conference on Artificial Intelligence (AAAI), 2022, 第 5 作者 通讯作者(71) Multi-view 3D Human Physique Dataset Construction For Robust Digital Human Modeling of Natural Scenes, International Conference on Communication and Information Processing (ICCIP 2022), 2022, 第 5 作者(72) MFFNet: Multi-Receptive Field Fusion Net for Microscope Steel Grain Grading, 2022 the 8th International Conference on Communication and Information Processing (ICCIP 2022), 2022, 第 4 作者(73) GeoROS: Georeferenced Real-time Orthophoto Stitching with Unmanned Aerial Vehicle, The 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022), 2022, 第 5 作者 通讯作者(74) Triple-strip attention mechanism-based natural disaster images classification and segmentation, VISUAL COMPUTER, 2022, 第 4 作者 通讯作者(75) DomainDesc: Learning Local Descriptors with Domain Adaptation, 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, 第 6 作者 通讯作者(76) Driving EEG based multilayer dynamic brain network analysis for steering process, Expert Systems with Applications, 2022, 第 2 作者(77) DA-Net: Dual Branch Transformer and Adaptive Strip Upsampling for Retinal Vessels Segmentation, The 25th International Conference on Medical Image Computing and Computer Assisted Intervention(MICCAI 2022), 2022, 第 4 作者 通讯作者(78) 基于特征混合聚类和关键点检测的智能人脸搜索, Intelligent Face Search Based on Mixed Feature Clustering and Keypoint Detection, 集成技术, 2022, 第 4 作者(79) Towards Effective Adversarial Attack on Point Cloud for 3D Classification, IEEE International Conference on Multimedia and Expo (ICME) 2021, 2021, 第 2 作者(80) DC-Net: Dual Context Network for 2D Medical Image Segmentation, MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 第 4 作者(81) Intelligent annotation for image sequences and videos, 2021 IEEE International Conference on Virtual Reality and Visualization (ICVRV) Best Paper Award, 2021, 第 5 作者 通讯作者(82) 多目鱼眼相机系统的视觉里程计解决方案, 中国体视学与图像分析, 2021, 第 2 作者 通讯作者(83) A practical framework of multi-person 3D human pose estimation with a single RGB camera, IEEE VR 2021 Poster, 2021, 第 4 作者 通讯作者(84) Data-driven floor plan understanding in rural residential buildings via deep recognition, INFORMATION SCIENCES, 2021, 第 4 作者(85) 基于地面激光点云数据的单木三维重建方法, 南京林业大学学报( 自然科学版), 2021, 第 3 作者(86) 基于CNN的住宅平面图元素识别与布局语义分析, Floor plan recognition and semantic layout analysis based on a convolutional neural network, 中国体视学与图像分析, 2020, 第 5 作者(87) Efficient Joint Gradient Based Atack Against SOR Defense for 3D Point Cloud Classification, Proceedings of the 28th ACM International Conference on Multimedia (MM ’20), 2020, 第 2 作者(88) Unsupervised Multi-View Constrained Convolutional Network for Accurate Depth Estimation, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 第 5 作者 通讯作者(89) 3d point cloud analysis and classification in large-scale scene based on deep learning, IEEE ACCESS, 2019, 第 2 作者(90) Parameter optimization criteria guided 3D point cloud classification, MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 第 2 作者(91) Accurate 3D Locating and Tracking of Basketball Players from Multiple Videos, SA'18: SIGGRAPH ASIA 2018 TECHNICAL BRIEFS, 2018, 第 1 作者(92) Large-scale 3D Point Cloud Classification Based On Feature Description Matrix By CNN, PROCEEDINGS OF THE 31ST INTERNATIONAL CONFERENCE ON COMPUTER ANIMATION AND SOCIAL AGENTS (CASA 2018), 2018, 第 2 作者(93) Accurate blind deblurring using salientpatch-based prior for large-size images, MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 第 4 作者(94) 基于采样半径优化的最大化泊松圆盘采样, SCIENTIA SINICA INFORMATIONIS, 2017, 第 4 作者(95) 基于快速引导滤波的景深实时渲染方法, 中国体视学与图像分析, 2017, 第 3 作者(96) 基于改进RPN深度网络的端到端的监控场景行人检测研究, 中国体视学与图像分析, 2017, 第 3 作者(97) 基于采样半径优化的最大化Poisson圆盘采样, 中国科学:信息科学, 2017, 第 4 作者(98) Building Extraction from Remotely Sensed Images by Integrating Saliency Cue, IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 第 3 作者(99) Shape exploration of 3D heterogeneous models based on cages, MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 第 1 作者 通讯作者(100) Tree Branch Level of Detail Models for Forest Navigation, COMPUTER GRAPHICS FORUM, 2017, 第 3 作者(101) Visualization of Tomato Growth Based on Dry Matter Flow, INTERNATIONAL JOURNAL OF COMPUTER GAMES TECHNOLOGY, 2017, 第 3 作者(102) 3D Point Cloud Classification Based on Discrete Conditional Random Field, Edutainment 2017,Runner-up for Best Paper Award, 2017, 第 3 作者(103) Analyzing surface sampling patterns using the localized pair correlation function, COMPUTATIONAL VISUAL MEDIA, 2016, 第 4 作者(104) A Survey on Processing of Large-Scale 3D Point Cloud, EDUTAINMENT 2016, 2016, 第 2 作者(105) Algorithm of sand painting simulation based on Kinect, 2016 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV 2016), 2016, 第 6 作者(106) Maximal Poisson-disk Sampling via Sampling Radius Optimization, SIGGRAPH ASIA 2016 Posters, 2016, 第 5 作者(107) Optimized shape semantic graph representation for object understanding and recognition in point clouds, OPTICAL ENGINEERING, 2016, 第 3 作者(108) 3D shape retrieval using viewpoint information‐theoretic measures, COMPUTER ANIMATION AND VIRTUAL WORLDS, 2015, 第 3 作者(109) 基于二阶平滑先验的图像保边平滑快速算法, A fast algorithm for images’ edgepreserving smoothing using second order smoothness prior, 高技术通讯, 2014, 第 2 作者(110) The extraction of feature lines on 3D models: a survey, 2014 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV2014), 2014, 第 2 作者(111) Perception-motivated multiresolution rendering on sole-cube maps, MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 第 2 作者(112) A survey on recent Approaches of Mesh Compressions, ICVRV 2014 : INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION, 2014, 第 2 作者(113) Somatosensory Interaction for Real-Time Large Scale Roaming, 12TH ACM INTERNATIONAL CONFERENCE ON VIRTUAL REALITY CONTINUUM AND ITS APPLICATIONS IN INDUSTRY(VRCAI2013), 2013, 第 2 作者(114) Statistical learning based facial animation, JOURNALOFZHEJIANGUNIVERSITYSCIENCECCOMPUTERSELECTRONICS, 2013, 第 3 作者(115) Sketch-based design for green geometry and image deformation, MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 第 2 作者(116) Viewpoint Information-Theoretic Measures for 3D Shape Similarity, 12TH ACM INTERNATIONAL CONFERENCE ON VIRTUAL REALITY CONTINUUM AND ITS APPLICATIONS IN INDUSTRY(VRCAI2013), 2013, 第 3 作者(117) Rotational Invariant Face Detection On a Mobile Device, 2013 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION, 2013, 第 2 作者(118) An Interactive Warping Method for Multi-channel VR Projection Display Systems with Quadric Surface Screens, 2013 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION, 2013, 第 2 作者(119) Statistical learning based facial animation, JOURNAL OF ZHEJIANG UNIVERSITY SCIENCE C,, 2013, 第 3 作者(120) Cage-based tree deformation, EDUTAINMENT TECHNOLOGIES. EDUCATIONAL GAMES AND VIRTUAL REALITY/AUGMENTED REALITY APPLICATIONS LECTURE NOTES IN COMPUTER SCIENCE, 2011, 第 2 作者(121) Multicage image deformation on GPU, VRCAI 2011, 2011, 第 1 作者(122) MCGIM-Based Model Streaming for Realtime Progressive Rendering, MCGIM-Based Model Streaming for Realtime Progressive Rendering, JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2011, 第 2 作者(123) Hardware instancing for real-time realistic forest rendering, ACM SIGGRAPH ASIA, 2011, 第 2 作者(124) Differential geometry images: remeshing and morphing with local shape preservation, VISUAL COMPUTER, 2010, 第 1 作者 通讯作者(125) Robust discovery of partial rigid symmetries on 3D models, ACM SIGGRAPH ASIA, 2010, 第 3 作者(126) 形状空间中模型变化的插值生成, An Improved Approach for Generation of Intermediate Deformation Models in Shape Space, 计算机辅助设计与图形学学报, 2009, 第 1 作者(127) 基于GPU的近似软影实时绘制, Real-Time Approximate Soft Shadow Rendering on GPU, 计算机辅助设计与图形学学报, 2009, 第 2 作者(128) Interactive image deformation using cage coordinates on GPU, Proceedings of the 8th International Conference on Virtual Reality Continuum and its Applications in Industry, 2009, 第 1 作者 通讯作者(129) 基于两视图重建的平面纹理校正和映射, Planar Texture Rectification and Mapping Based on Reconstruction from Two Views, 中国图象图形学报, 2005, 第 1 作者