


IEEE Transactions on Multimedia, Volume 26
Volume 26, 2024
- Qinwei Xu
, Ruipeng Zhang
, Ya Zhang
, Yiyan Wu
, Yanfeng Wang
:
Federated Adversarial Domain Hallucination for Privacy-Preserving Domain Generalization. 1-14 - Xingxing Zhang
, Shupeng Gui, Jian Jin
, Zhenfeng Zhu
, Yao Zhao
:
ATZSL: Defensive Zero-Shot Recognition in the Presence of Adversaries. 15-27 - Weiqing Lu
, Hai-Miao Hu
, Jinzuo Yu, Yibo Zhou, Hanzi Wang
, Bo Li:
Orientation-Aware Pedestrian Attribute Recognition Based on Graph Convolution Network. 28-40 - Zhuang Li
, Leilei Cao
, Hongbin Wang, Lihong Xu
:
End-to-End Instance-Level Human Parsing by Segmenting Persons. 41-50 - Xun Cai, Qingjie Shi, Yanbo Gao
, Shuai Li
, Wei Hua
, Tian Xie:
A Structure-Preserving and Illumination-Consistent Cycle Framework for Image Harmonization. 51-64 - Saizhe Ding, Jinze Chen, Yang Wang
, Yu Kang
, Weiguo Song, Jie Cheng, Yang Cao
:
E-MLB: Multilevel Benchmark for Event-Based Camera Denoising. 65-76 - Jiang Li
, Xiaoping Wang
, Guoqing Lv
, Zhigang Zeng
:
GraphCFC: A Directed Graph Based Cross-Modal Feature Complementation Approach for Multimodal Conversational Emotion Recognition. 77-89 - Zhe Li
, Xinyu Wang
, Yuliang Liu
, Lianwen Jin
, Yichao Huang, Kai Ding:
Improving Handwritten Mathematical Expression Recognition via Similar Symbol Distinguishing. 90-102 - Zheng Li
, Caili Guo
, Zerun Feng
, Jenq-Neng Hwang
, Zhongtian Du:
Integrating Language Guidance Into Image-Text Matching for Correcting False Negatives. 103-116 - Muqing Deng
, Zhuyao Fan
, Peng Lin, Xiaoreng Feng:
Human Gait Recognition Based on Frontal-View Sequences Using Gait Dynamics and Deep Learning. 117-126 - Huimin Zeng
, Jie Huang
, Jiacheng Li
, Zhiwei Xiong
:
Region-Aware Portrait Retouching With Sparse Interactive Guidance. 127-140 - Jiawei Liu
, Weining Wang
, Sihan Chen, Xinxin Zhu, Jing Liu
:
Sounding Video Generator: A Unified Framework for Text-Guided Sounding Video Generation. 141-153 - Yan-Bo Liu
, Guo Cao
, Boshan Shi, Yingxiang Hu
:
CCANet: A Collaborative Cross-Modal Attention Network for RGB-D Crowd Counting. 154-165 - Wenhan Wu
, Wenfeng Yi, Jinghai Li, Maoyin Chen
, Xiaoping Zheng
:
Automatic Identification of Human Subgroups in Time-Dependent Pedestrian Flow Networks. 166-177 - Ali Köksal
, Kenan E. Ak, Ying Sun
, Deepu Rajan
, Joo Hwee Lim
:
Controllable Video Generation With Text-Based Instructions. 190-201 - Bosheng Ding, Ruiheng Zhang
, Lixin Xu
, Guanyu Liu
, Shuo Yang
, Yumeng Liu
, Qi Zhang:
U2D2Net: Unsupervised Unified Image Dehazing and Denoising Network for Single Hazy Image Enhancement. 202-217 - Zhiwu Qing
, Shiwei Zhang
, Ziyuan Huang
, Xiang Wang
, Yuehuan Wang
, Yiliang Lv, Changxin Gao
, Nong Sang
:
MAR: Masked Autoencoders for Efficient Action Recognition. 218-233 - Jinguang Wang
, Shengsheng Qian
, Jun Hu
, Richang Hong
:
Positive Unlabeled Fake News Detection via Multi-Modal Masked Transformer Network. 234-244 - Jianxun Lou
, Hanhe Lin
, Philippa Young, Richard White
, Zelei Yang, Susan Cheng Shelmerdine, David Marshall, Emiliano Spezi
, Marco Palombo
, Hantao Liu
:
Predicting Radiologists' Gaze With Computational Saliency Models in Mammogram Reading. 256-269 - Md. Moniruzzaman
, Zhaozheng Yin
:
Feature Weakening, Contextualization, and Discrimination for Weakly Supervised Temporal Action Localization. 270-283 - Zehui Chen
, Chenhongyi Yang, Jiahao Chang
, Feng Zhao
, Zheng-Jun Zha
, Feng Wu:
DDOD: Dive Deeper into the Disentanglement of Object Detector. 284-298 - Kai Zeng
, Kejiang Chen
, Weiming Zhang
, Yaofei Wang:
Upward Robust Steganography Based on Overflow Alleviation. 299-312 - Yukun Su
, Jingliang Deng, Ruizhou Sun, Guosheng Lin
, Hanjing Su, Qingyao Wu
:
A Unified Transformer Framework for Group-Based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection. 313-325 - Jun Wang
, Peng Yin, Yuanyun Wang
, Wenhui Yang:
CMAT: Integrating Convolution Mixer and Self-Attention for Visual Tracking. 326-338 - Lorenzo Agnolucci
, Leonardo Galteri
, Marco Bertini
, Alberto Del Bimbo
:
Perceptual Quality Improvement in Videoconferencing Using Keyframes-Based GAN. 339-352 - Wanting Zhou
, Longteng Kong
, Yushan Han, Jie Qin
, Zhenan Sun
:
Contextualized Relation Predictive Model for Self-Supervised Group Activity Representation Learning. 353-366 - Hua Li
, Junyan Liang, Ruiqi Wu, Runmin Cong
, Wenhui Wu
, Sam Tak-Wu Kwong
:
Stereo Superpixel Segmentation via Decoupled Dynamic Spatial-Embedding Fusion Network. 367-378 - Peipei Zhu
, Xiao Wang
, Lin Zhu
, Zhenglong Sun
, Wei-Shi Zheng
, Yaowei Wang
, Changwen Chen
:
Prompt-Based Learning for Unpaired Image Captioning. 379-393 - Zhiyu Wang
, Chao Yang
, Bin Jiang
, Junsong Yuan
:
A Dual Reinforcement Learning Framework for Weakly Supervised Phrase Grounding. 394-405 - Andong Lu
, Zhang Zhang
, Yan Huang
, Yifan Zhang
, Chenglong Li
, Jin Tang
, Liang Wang
:
Illumination Distillation Framework for Nighttime Person Re-Identification and a New Benchmark. 406-419 - Parham Hadikhani
, Daphne Teck Ching Lai
, Wee-Hong Ong
:
Human Activity Discovery With Automatic Multi-Objective Particle Swarm Optimization Clustering With Gaussian Mutation and Game Theory. 420-435 - Quanpeng Song
, Jiaxin Li
, Si Wu
, Hau-San Wong
:
A Graph-Based Discriminator Architecture for Multi-Attribute Facial Image Editing. 436-446 - Jianping Gou
, Xin He
, Lan Du
, Baosheng Yu, Wenbai Chen
, Zhang Yi
:
Hierarchical Locality-Aware Deep Dictionary Learning for Classification. 447-461 - Senmao Ye
, Huan Wang, Mingkui Tan
, Fei Liu
:
Recurrent Affine Transformation for Text-to-Image Synthesis. 462-473 - Weitao You
, Juntao Ji
, Lingyun Sun
, Changyuan Yang, Mi Yu, Shi Chen
, Jiayi Yao
:
Automatic Generation of Interactive Nonlinear Video for Online Apparel Shopping Navigation. 474-486 - Aijia Yang
, Sihao Lin
, Chung-Hsing Yeh
, Minglei Shu
, Yi Yang, Xiaojun Chang
:
Context Matters: Distilling Knowledge Graph for Enhanced Object Detection. 487-500 - Qinghua Ren
, Qirong Mao
, Shijian Lu
:
Prototypical Bidirectional Adaptation and Learning for Cross-Domain Semantic Segmentation. 501-513 - Weizhi Nie
, Yuru Bao, Yue Zhao
, Anan Liu
:
Long Dialogue Emotion Detection Based on Commonsense Knowledge Graph Guidance. 514-528 - Ziqi Yuan
, Yihe Liu, Hua Xu
, Kai Gao:
Noise Imitation Based Adversarial Training for Robust Multimodal Sentiment Analysis. 529-539 - Arbind Agrahari Baniya
, Tsz-Kwan Lee
, Peter W. Eklund
, Sunil Aryal
:
Omnidirectional Video Super-Resolution Using Deep Learning. 540-554 - Yufan Hu
, Junyu Gao
, Changsheng Xu
:
Learning Multi-Expert Distribution Calibration for Long-Tailed Video Classification. 555-567 - Yanshan Li
, Huajie Liang
, Rui Yu
:
BI-CAM: Generating Explanations for Deep Neural Networks Using Bipolar Information. 568-580 - Rongtao Xu
, Changwei Wang
, Shibiao Xu
, Weiliang Meng
, Xiaopeng Zhang
:
Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation. 581-592 - Hezhen Hu, Junfu Pu
, Wengang Zhou
, Hang Fang, Houqiang Li
:
Prior-Aware Cross Modality Augmentation Learning for Continuous Sign Language Recognition. 593-606 - Xiongli Chai
, Feng Shao
, Qiuping Jiang
, Xuejin Wang
, Long Xu
, Yo-Sung Ho
:
Blind Quality Evaluator of Light Field Images by Group-Based Representations and Multiple Plane-Oriented Perceptual Characteristics. 607-622 - Yuxuan Liu
, Hongwei Ge
, Zhen Wang
, Yaqing Hou
, Mingde Zhao
:
Discriminative Identity-Feature Exploring and Differential Aware Learning for Unsupervised Person Re-Identification. 623-636 - Chunyang Xie
, Dongheng Zhang
, Zhi Wu
, Cong Yu
, Yang Hu
, Yan Chen
:
RPM: RF-Based Pose Machines. 637-649 - Mingliang Zhou
, Xingtai Wu, Xuekai Wei
, Tao Xiang
, Bin Fang
, Sam Kwong
:
Low-Light Enhancement Method Based on a Retinex Model for Structure Preservation. 650-662 - Zilong Yu, Yunyun Yang
, Yongbin Zhu
, Bixue Guo, Chun Li
:
CS-IntroVAE: Cauchy-Schwarz Divergence-Based Introspective Variational Autoencoder. 663-672 - Shentong Mo
, Miao Xin
:
BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-Term Pose Forecasting. 673-686 - Songhan He
, Dawen Xu
, Lin Yang
, Weipeng Liang
:
Adaptive HEVC Video Steganography With High Performance Based on Attention-Net and PU Partition Modes. 687-700 - Xueping Wang
, Min Liu
, Fei Wang, Jianhua Dai
, An-An Liu
, Yaonan Wang
:
Relation-Preserving Feature Embedding for Unsupervised Person Re-Identification. 714-723 - Shiqi Lin, Tao Yu
, Ruoyu Feng, Xin Li
, Xiaoyuan Yu, Lei Xiao, Zhibo Chen
:
Local Patch AutoAugment With Multi-Agent Collaboration. 724-736 - Kaipeng Zhang
, Yoichi Sato
:
Semantic Image Segmentation by Dynamic Discriminative Prototypes. 737-749 - Shaowei Weng
, Tangguo Zhu, Tiancong Zhang
, Chunyu Zhang:
UCM-Net: A U-Net-Like Tampered-Region-Related Framework for Copy-Move Forgery Detection. 750-763 - Dandan Zhu
, Kaiwei Zhang
, Nana Zhang, Qiangqiang Zhou, Xiongkuo Min
, Guangtao Zhai, Xiaokang Yang:
Unified Audio-Visual Saliency Model for Omnidirectional Videos With Spatial Audio. 764-775 - Vladimir Frants
, Sos S. Agaian
, Karen Panetta
:
QSAM-Net: Rain Streak Removal by Quaternion Neural Network With Self-Attention Module. 789-798 - Renshuai Liu
, Yao Cheng, Sifei Huang, Chengyang Li, Xuan Cheng
:
Transformer-Based High-Fidelity Facial Displacement Completion for Detailed 3D Face Reconstruction. 799-810 - Jinfu Liu
, Xinshun Wang
, Can Wang, Yuan Gao
, Mengyuan Liu
:
Temporal Decoupling Graph Convolutional Network for Skeleton-Based Gesture Recognition. 811-823 - Yuan Sun
, Zhenwen Ren
, Peng Hu
, Dezhong Peng, Xu Wang
:
Hierarchical Consensus Hashing for Cross-Modal Retrieval. 824-836 - Yaguang Song
, Xiaoshan Yang, Yaowei Wang
, Changsheng Xu
:
Recovering Generalization via Pre-Training-Like Knowledge Distillation for Out-of-Distribution Visual Question Answering. 837-851 - Ruimin Li
, Jiajun Xiang, Feixiang Sun, Ye Yuan, Longwu Yuan, Shuiping Gou
:
Multiscale Cross-Modal Homogeneity Enhancement and Confidence-Aware Fusion for Multispectral Pedestrian Detection. 852-863 - Wenjie Li
, Juncheng Li
, Guangwei Gao
, Weihong Deng
, Jiantao Zhou
, Jian Yang
, Guo-Jun Qi
:
Cross-Receptive Focused Inference Network for Lightweight Image Super-Resolution. 864-877 - Kedeng Tong
, Xin Jin
, Yuqing Yang, Chen Wang, Jinshi Kang
, Fan Jiang
:
Learned Focused Plenoptic Image Compression With Microimage Preprocessing and Global Attention. 890-903 - Ke Zhang
, Hanliang Jiang, Jian Zhang
, Qingming Huang
, Jianping Fan
, Jun Yu
, Weidong Han
:
Semi-Supervised Medical Report Generation via Graph-Guided Hybrid Feature Consistency. 904-915 - Gangjian Zhang
, Shikui Wei
, Huaxin Pang
, Shuang Qiu
, Yao Zhao
:
Enhance Composed Image Retrieval via Multi-Level Collaborative Localization and Semantic Activeness Perception. 916-928 - Jianjun Xiang, Peng Chen
, Yuanjie Dang
, Ronghua Liang
, Gangyi Jiang
:
Pseudo Light Field Image and 4D Wavelet-Transform-Based Reduced-Reference Light Field Image Quality Assessment. 929-943 - Jinyu Wen
, Feiwei Qin
, Jiao Du
, Meie Fang
, Xinhua Wei, C. L. Philip Chen
, Ping Li
:
MsgFusion: Medical Semantic Guided Two-Branch Network for Multimodal Brain Image Fusion. 944-957 - Yuxin Xiang
, Dongjie Tang
, Rui Huang, Yong Yao
, Chao Xie, Qiming Shi, Randy Xu, Mohammad Reza Haghighat
, Cathy Bao, Yicheng Gu
, Zhengwei Qi
, Haibing Guan
:
CARE: Cloudified Android With Optimized Rendering Platform. 958-971 - Tuan T. Nguyen
, Hoang H. Nguyen
, Mina Sartipi, Marco Fisichella
:
Multi-Vehicle Multi-Camera Tracking With Graph-Based Tracklet Features. 972-983 - Geng Chen
, Huazhu Fu
, Tao Zhou
, Guobao Xiao
, Keren Fu
, Yong Xia
, Yanning Zhang
:
Fusion-Embedding Siamese Network for Light Field Salient Object Detection. 984-994 - Bing Cao
, Haifang Cao, Jiaxu Liu, Pengfei Zhu
, Changqing Zhang
, Qinghua Hu
:
Autoencoder-Based Collaborative Attention GAN for Multi-Modal Image Synthesis. 995-1010 - Jiesheng Wu
, Fangwei Hao, Weiyun Liang
, Jing Xu
:
Transformer Fusion and Pixel-Level Contrastive Learning for RGB-D Salient Object Detection. 1011-1026 - Tao Xie
, Li Wang, Ke Wang
, Ruifeng Li
, Xinyu Zhang
, Haoming Zhang
, Linqi Yang, Huaping Liu
, Jun Li
:
FARP-Net: Local-Global Feature Aggregation and Relation-Aware Proposals for 3D Object Detection. 1027-1040 - Shuyue Lan
, Zhilu Wang
, Ermin Wei
, Amit K. Roy-Chowdhury
, Qi Zhu
:
Collaborative Multi-Agent Video Fast-Forwarding. 1041-1054 - Shulei Ji
, Xinyu Yang
:
EmoMusicTV: Emotion-Conditioned Symbolic Music Generation With Hierarchical Transformer VAE. 1076-1088 - Yuqi Zhang, Qi Qian, Hongsong Wang
, Chong Liu
, Weihua Chen
, Fan Wang
:
Graph Convolution Based Efficient Re-Ranking for Visual Retrieval. 1089-1101 - Zhenguo Yang
, Zhuopan Yang, Zhiwei Guo, Zehang Lin, Haizhong Zhu, Qing Li
, Wenyin Liu
:
Towards Temporal Event Detection: A Dataset, Benchmarks and Challenges. 1102-1113 - Chengrui Zhang, Junxin Chen
, Dongming Chen
, Wei Wang
, Yushu Zhang
, Yicong Zhou
:
Exploiting Substitution Box for Cryptanalyzing Image Encryption Schemes With DNA Coding and Nonlinear Dynamics. 1114-1128 - Weizhi Nie
, Xin Wen
, Jing Liu, Jiawei Chen
, Jiancan Wu, Guoqing Jin
, Jing Lu, An-An Liu:
Knowledge-Enhanced Causal Reinforcement Learning Model for Interactive Recommendation. 1129-1142 - Wei Zhou
, Weitao Jiang
, Dihu Chen
, Haifeng Hu
, Tao Su
:
Mining Semantic Information With Dual Relation Graph Network for Multi-Label Image Classification. 1143-1157 - Lin Zhao
, Hui Zhou
, Xinge Zhu
, Xiao Song, Hongsheng Li
, Wenbing Tao
:
LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation. 1158-1168 - Zhaoyi Li
, Ping Zhong
, Jiawei Huang
, Feng Gao, Jian-Xin Wang
:
Achieving QoE Fairness in Bitrate Allocation of 360° Video Streaming. 1169-1178 - Feifei Ding
, Jianjun Li
, Wanyong Tian, Shanqing Zhang, Wenqiang Yuan:
Unsupervised Domain Adaptation via Risk-Consistent Estimators. 1179-1187 - Jian Xiao
, Xiaojun Bi
:
Model-Guided Generative Adversarial Networks for Unsupervised Fine-Grained Image Generation. 1188-1199 - Jiayuan Sun
, Luping Ji
, Jiewen Zhu
:
Shared Coupling-Bridge Scheme for Weakly Supervised Local Feature Learning. 1200-1212 - Kangle Wu
, Jun Huang
, Yong Ma
, Fan Fan
, Jiayi Ma
:
Cycle-Retinex: Unpaired Low-Light Image Enhancement via Retinex-Inline CycleGAN. 1213-1228 - Yuanyuan Shi, Xiaolong Fu, Yunan Li
, Kaibin Miao, Xiangzeng Liu
, Bocheng Zhao, Qiguang Miao
:
A Semi-Supervised Underexposed Image Enhancement Network With Supervised Context Attention and Multi-Exposure Fusion. 1229-1243 - Theyab A. Alotaibi
, Ishtiaq Rasool Khan
, Farid Bourennani:
Quality Assessment of Tone-Mapped Images Using Fundamental Color and Structural Features. 1244-1254 - Bowen Yuan
, Yefei Sheng
, Bing-Kun Bao
, Yi-Ping Phoebe Chen
, Changsheng Xu
:
Semantic Distance Adversarial Learning for Text-to-Image Synthesis. 1255-1266 - Weitao Feng
, Lei Bai
, Yongqiang Yao
, Weihao Gan, Wei Wu
, Wanli Ouyang
:
Similarity- and Quality-Guided Relation Learning for Joint Detection and Tracking. 1267-1280 - Inske Groenen
, Stevan Rudinac
, Marcel Worring
:
PanorAMS: Automatic Annotation for Detecting Objects in Urban Context. 1281-1294 - Jian Zhu
, Hanli Wang
, Bin He
:
Multi-Modal Structure-Embedding Graph Transformer for Visual Commonsense Reasoning. 1295-1305 - Lei Ma
, Hanyu Hong
, Fanman Meng
, Qingbo Wu
, Jinmeng Wu
:
Deep Progressive Asymmetric Quantization Based on Causal Intervention for Fine-Grained Image Retrieval. 1306-1318 - Jianping Gou
, Nannan Xie, Yunhao Yuan
, Lan Du
, Weihua Ou
, Zhang Yi
:
Reconstructed Graph Constrained Auto-Encoders for Multi-View Representation Learning. 1319-1332 - Shuai Xiao
, Guipeng Lan
, Jiachen Yang
, Wen Lu, Qinggang Meng
, Xinbo Gao
:
MCS-GAN: A Different Understanding for Generalization of Deep Forgery Detection. 1333-1345 - Yanxiong Li
, Wenchang Cao, Wei Xie, Jialong Li, Emmanouil Benetos
:
Few-Shot Class-Incremental Audio Classification Using Dynamically Expanded Classifier With Self-Attention Modified Prototypes. 1346-1360 - Jiamin Zhuang
, Jing Yu
, Yang Ding, Xiangyan Qu
, Yue Hu
:
Towards Fast and Accurate Image-Text Retrieval With Self-Supervised Fine-Grained Alignment. 1361-1372 - Quan Wang, Sheng Li
, Zichi Wang
, Xinpeng Zhang
, Guorui Feng
:
Multi-Source Style Transfer via Style Disentanglement Network. 1373-1383 - Yan Dai
, Xiaojia Chen
, Xuanhan Wang
, Minghui Pang
, Lianli Gao
, Heng Tao Shen
:
ReSParser: Fully Convolutional Multiple Human Parsing With Representative Sets. 1384-1394 - Tianli Sun
, Haonan Chen, Guosheng Hu
, Lianghua He
, Cairong Zhao
:
Explainability of Speech Recognition Transformers via Gradient-Based Attention Visualization. 1395-1406 - Zhongyu Bai
, Hongli Xu
, Xiangyue Zhang
, Qichuan Ding
:
GCSANet: Arbitrary Style Transfer With Global Context Self-Attentional Network. 1407-1420 - Ruixuan Cong
, Hao Sheng
, Da Yang
, Zhenglong Cui
, Rongshan Chen
:
Exploiting Spatial and Angular Correlations With Deep Efficient Transformers for Light Field Image Super-Resolution. 1421-1435 - Lei Jin
, Xiaojuan Wang
, Xuecheng Nie
, Wendong Wang
, Yandong Guo, Shuicheng Yan
, Jian Zhao
:
Rethinking the Person Localization for Single-Stage Multi-Person Pose Estimation. 1436-1447 - Lvlong Lai
, Jian Chen
, Qingyao Wu
:
Zero-Shot Single-View Point Cloud Reconstruction via Cross-Category Knowledge Transferring. 1448-1459 - Liyun Zuo
, Baoyan Wang, Lei Zhang
, Jun Xu
, Xiantong Zhen
:
Variational Neuron Shifting for Few-Shot Image Classification Across Domains. 1460-1473 - Qing Yu
, Go Irie
, Kiyoharu Aizawa
:
Self-Labeling Framework for Open-Set Domain Adaptation With Few Labeled Samples. 1474-1487 - Xiaotian Wu
, Xinjie Feng:
Size Invariant Visual Cryptography Schemes With Evolving Threshold Access Structures. 1488-1503 - Bo Jiang
, Shuxian Luo, Xiao Wang
, Chuanfu Li, Jin Tang
:
AMatFormer: Efficient Feature Matching via Anchor Matching Transformer. 1504-1515 - Shicai Wei
, Chunbo Luo
, Yang Luo
, Jialang Xu
:
Privileged Modality Learning via Multimodal Hallucination. 1516-1527 - Yuanjiang Cao
, Lina Yao
, Le Pan, Quan Z. Sheng
, Xiaojun Chang
:
Guided Image-to-Image Translation by Discriminator-Generator Communication. 1528-1538 - Yongle Zhang
, Yimin Liu
, Ruotong Hu
, Qiang Wu
, Jian Zhang
:
Mutual Dual-Task Generator With Adaptive Attention Fusion for Image Inpainting. 1539-1550 - Lei Zhang
, Leiting Chen, Chuan Zhou
, Xin Li
, Fan Yang
, Zhang Yi
:
Weighted Graph-Structured Semantics Constraint Network for Cross-Modal Retrieval. 1551-1564 - Hongchao Li
, Aihua Zheng
, Liping Sun
, Yonglong Luo
:
Camera Topology Graph Guided Vehicle Re-Identification. 1565-1577 - Xixi Wang
, Bo Jiang
, Xiao Wang
, Jinhui Tang
, Bin Luo
:
Rethinking Batch Sample Relationships for Data Representation: A Batch-Graph Transformer Based Approach. 1578-1588 - Haoran Qi
, Yuwei Qiu
, Xing Luo, Zhi Jin
:
An Efficient Latent Style Guided Transformer-CNN Framework for Face Super-Resolution. 1589-1599 - Wentao Tan, Changxing Ding
, Pengfei Wang
, Mingming Gong
, Kui Jia
:
Style Interleaved Learning for Generalizable Person Re-Identification. 1600-1612 - Yong Zhang
, Yingwei Pan
, Ting Yao
, Rui Huang
, Tao Mei
, Chang Wen Chen
:
End-to-End Video Scene Graph Generation With Temporal Propagation Transformer. 1613-1625 - Yue Wu
, Jiaming Liu
, Maoguo Gong
, Peiran Gong
, Xiaolong Fan
, A. Kai Qin
, Qiguang Miao
, Wenping Ma
:
Self-Supervised Intra-Modal and Cross-Modal Contrastive Learning for Point Cloud Understanding. 1626-1638 - An-An Liu
, Chenxi Huang, Ning Xu
, Hongshuo Tian
, Jing Liu
, Yongdong Zhang
:
Counterfactual Visual Dialog: Robust Commonsense Knowledge Learning From Unbiased Training. 1639-1651 - Shumin Zhu
, Xingxing Zou
, Jianjun Qian
, Wai Keung Wong
:
Learning Structured Relation Embeddings for Fine-Grained Fashion Attribute Recognition. 1652-1664 - Sheng Yu
, Di-Hua Zhai
, Yuanqing Xia
, Dong Li
, Shiqi Zhao
:
CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer. 1665-1680 - Jiangli Shi, Feng Shao
, Chongzhen Tian, Hangwei Chen
, Long Xu
, Yo-Sung Ho
:
Progressive Bidirectional Feature Extraction and Enhancement Network for Quality Evaluation of Night-Time Images. 1690-1705 - Yaosi Hu
, Chong Luo
, Zhenzhong Chen
:
A Benchmark for Controllable Text-Image-to-Video Generation. 1706-1719 - Huanlong Zhang
, Jingchao Wang
, Jianwei Zhang
, Tianzhu Zhang
, Bineng Zhong
:
One-Stream Vision-Language Memory Network for Object Tracking. 1720-1730 - Shengping Zhang
, Xiaoyu Han
, Weigang Zhang
, Xiangyuan Lan, Hongxun Yao
, Qingming Huang
:
Limb-Aware Virtual Try-On Network With Progressive Clothing Warping. 1731-1746 - Shi-Xue Zhang
, Chun Yang, Xiaobin Zhu
, Xu-Cheng Yin
:
Arbitrary Shape Text Detection via Boundary Transformer. 1747-1760 - Xudong Tan
, Menghan Hu
, Guangtao Zhai
, Yan Zhu, Wenfang Li, Xiao-Ping Zhang
:
Lightweight Video-Based Respiration Rate Detection Algorithm: An Application Case on Intensive Care. 1761-1775 - Zhenxi Zhao
, Xinting Yang
, Jintao Liu
, Chao Zhou
, Chunjiang Zhao
:
GCVC: Graph Convolution Vector Distribution Calibration for Fish Group Activity Recognition. 1776-1789 - Sen Wu, Guoshuai Zhao
, Xueming Qian
:
Resolving Zero-Shot and Fact-Based Visual Question Answering via Enhanced Fact Retrieval. 1790-1800 - Pengcheng Lei
, Faming Fang
, Tieyong Zeng
, Guixu Zhang
:
Flow Guidance Deformable Compensation Network for Video Frame Interpolation. 1801-1812 - Haoyang Zhang
, Guixi Liu
, Yi Zhang
, Zhaohui Hao:
Robust Multi-Model Visual Tracking With Distractor-Aware Template-Coupled Correlation Filters Joint Learning. 1813-1828 - Bo Li
, Xiao Lin
, Bin Liu
, Zhi-Fen He
, Yu-Kun Lai
:
Lightweight Text-Driven Image Editing With Disentangled Content and Attributes. 1829-1841 - Chi Chen
, Ang Jin, Zhiye Wang, Yongwei Zheng, Bisheng Yang
, Jian Zhou
, Yuhang Xu, Zhigang Tu
:
SGSR-Net: Structure Semantics Guided LiDAR Super-Resolution Network for Indoor LiDAR SLAM. 1842-1854 - Huairui Wang
, Zhenzhong Chen
, Chang Wen Chen
:
Learned Video Compression via Heterogeneous Deformable Compensation Network. 1855-1866 - Wenhui Li
, Song Yang
, Qiang Li
, Xuanya Li
, An-An Liu
:
Commonsense-Guided Semantic and Relational Consistencies for Image-Text Retrieval. 1867-1880 - Qibing Qin
, Kezhen Xie, Wenfeng Zhang
, Chengduan Wang
, Lei Huang
:
Deep Neighborhood Structure-Preserving Hashing for Large-Scale Image Retrieval. 1881-1893 - Yutong Luo
, Xinyue Zhong, Minchen Zeng, Jialan Xie
, Shiyuan Wang, Guangyuan Liu
:
CGLF-Net: Image Emotion Recognition Network by Combining Global Self-Attention Features and Local Multiscale Features. 1894-1908 - Yanhua Yang, Rui Pan, Xiangyu Li, Xu Yang
, Cheng Deng
:
Dual-Stream Contrastive Learning for Compositional Zero-Shot Recognition. 1909-1919 - Yu Jiang
, Yuehang Wang
, Siqi Li
, Yongji Zhang
, Minghao Zhao
, Yue Gao
:
Event-Based Low-Illumination Image Enhancement. 1920-1931 - Zhuoran Du
, Shikui Wei
, Ting Liu
, Shunli Zhang, Xiaotong Chen
, Shiyin Zhang, Yao Zhao
:
Exploring the Applicability of Spectral Recovery in Semantic Segmentation of RGB Images. 1932-1943 - Zhichao Yang
, Leida Li
, Yuzhe Yang
, Yaqian Li
, Weisi Lin
:
Multi-Level Transitional Contrast Learning for Personalized Image Aesthetics Assessment. 1944-1956 - Tingting Wu
, Xiao Ding
, Hao Zhang
, Jinglong Gao
, Minji Tang, Li Du, Bing Qin
, Ting Liu:
DiscrimLoss: A Universal Loss for Hard Samples and Incorrect Samples Discrimination. 1957-1968 - Yueming Lyu
, Peibin Chen, Jingna Sun, Bo Peng
, Xu Wang
, Jing Dong
:
DRAN: Detailed Region-Adaptive Normalization for Conditional Image Synthesis. 1969-1982 - Jin Huang, Yongshun Gong
, Lu Zhang
, Jian Zhang
, Liqiang Nie
, Yilong Yin
:
Modeling Multiple Aesthetic Views for Series Photo Selection. 1983-1995 - Jiangfeng Du
, Silin Zhou, Jie Yu, Peng Han
, Shuo Shang
:
Cross-Task Multimodal Reinforcement for Long Tail Next POI Recommendation. 1996-2005 - Yuanhui Wang
, Ben Ye
, Zhanchuan Cai
:
Dynamic Template Updating Using Spatial-Temporal Information in Siamese Trackers. 2006-2015 - Jianhong Pan
, Siyuan Yang
, Lin Geng Foo
, Qiuhong Ke
, Hossein Rahmani
, Zhipeng Fan, Jun Liu
:
Progressive Channel-Shrinking Network. 2016-2026 - Donghua Chen
, Runtong Zhang
:
Building Multimodal Knowledge Bases With Multimodal Computational Sequences and Generative Adversarial Networks. 2027-2040 - Xingzheng Wang
, Kaiqiang Chen
, Zixuan Wang
, Wenhao Huang
:
PMSNet: Parallel Multi-Scale Network for Accurate Low-Light Light-Field Image Enhancement. 2041-2055 - Yinghui Xing
, Qirui Wu
, De Cheng
, Shizhou Zhang
, Guoqiang Liang
, Peng Wang
, Yanning Zhang
:
Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model. 2056-2068 - Bin Wan
, Xiaofei Zhou
, Yaoqi Sun
, Tingyu Wang, Chengtao Lv
, Shuai Wang
, Haibing Yin
, Chenggang Yan
:
MFFNet: Multi-Modal Feature Fusion Network for V-D-T Salient Object Detection. 2069-2081 - Jie Xu
, Xiaoqian Zhang
, Changming Zhao
, Zili Geng
, Yuren Feng
, Ke Miao
, Yunji Li
:
Improving Fine-Grained Image Classification With Multimodal Information. 2082-2095 - Yanliang Jin
, Ze-Yu Ji
, Dan Zeng
, Xiao-Ping (Steven) Zhang
:
VWP: An Efficient DRL-Based Autonomous Driving Model. 2096-2108 - Qiang Qi
, Yan Yan
, Hanzi Wang
:
Class-Aware Dual-Supervised Aggregation Network for Video Object Detection. 2109-2123 - Peihao Wu, Wenqian Wang
, Faliang Chang
, Chunsheng Liu
, Bin Wang:
DSS-Net: Dynamic Self-Supervised Network for Video Anomaly Detection. 2124-2136 - Zhiquan Wen
, Shuaicheng Niu, Ge Li
, Qingyao Wu
, Mingkui Tan
, Qi Wu
:
Test-Time Model Adaptation for Visual Question Answering With Debiased Self-Supervisions. 2137-2147 - Dongjie Ye
, Zhangkai Ni
, Wenhan Yang, Hanli Wang
, Shiqi Wang
, Sam Kwong
:
Glow in the Dark: Low-Light Image Enhancement With External Memory. 2148-2163 - Chuan Qin
, Xiaomeng Li, Zhenyi Zhang, Fengyong Li
, Xinpeng Zhang
, Guorui Feng
:
Print-Camera Resistant Image Watermarking With Deep Noise Simulation and Constrained Learning. 2164-2177 - Ying Zeng, Sijie Mai
, Wenjun Yan
, Haifeng Hu
:
Multimodal Reaction: Information Modulation for Cross-Modal Representation Learning. 2178-2191 - Dongdong Ni
, Zhenhong Jia
, Jie Yang
, Nikola K. Kasabov
:
Online Low-Light Sand-Dust Video Enhancement Using Adaptive Dynamic Brightness Correction and a Rolling Guidance Filter. 2192-2206 - Hangzhi Jiang
, Xin Zhang
, Shiming Xiang
:
Non-Maximum Suppression Guided Label Assignment for Object Detection in Crowd Scenes. 2207-2218 - Weizhi Xian
, Mingliang Zhou
, Bin Fang
, Tao Xiang
, Weijia Jia
, Bin Chen
:
Perceptual Quality Analysis in Deep Domains Using Structure Separation and High-Order Moments. 2219-2234 - Ziyu Chen
, Hanli Wang
, Chang Wen Chen
:
Self-Supervised Video Representation Learning by Serial Restoration With Elastic Complexity. 2235-2248 - Fuming Sun
, Peng Ren
, Bowen Yin
, Fasheng Wang
, Haojie Li
:
CATNet: A Cascaded and Aggregated Transformer Network for RGB-D Salient Object Detection. 2249-2262 - Jianbing Wu
, Hong Liu
, Wei Shi, Mengyuan Liu
, Wenhao Li
:
Style-Agnostic Representation Learning for Visible-Infrared Person Re-Identification. 2263-2275 - Zhentao He, Feng Shao
, Gang Chen, Xiongli Chai
, Yo-Sung Ho
:
SCFANet: Semantics and Context Feature Aggregation Network for 360° Salient Object Detection. 2276-2288 - Yu Sun
, Lubing Xu, Qian Bao
, Wu Liu
, Wenpeng Gao
, Yili Fu
:
Learning Monocular Regression of 3D People in Crowds via Scene-Aware Blending and De-Occlusion. 2289-2302 - Zhong Zhang
, Di He
, Shuang Liu
, Baihua Xiao
, Tariq S. Durrani
:
Completed Part Transformer for Person Re-Identification. 2303-2313 - Yuanzhi Wang
, Tao Lu
, Yuan Yao, Yanduo Zhang
, Zixiang Xiong
:
Learning to Hallucinate Face in the Dark. 2314-2326 - Rui Shi
, Tianxing Li
, Liguo Zhang
, Yasushi Yamaguchi
:
Visualization Comparison of Vision Transformers and Convolutional Neural Networks. 2327-2339 - Yepeng Tang
, Weining Wang
, Chunjie Zhang
, Jing Liu
, Yao Zhao
:
Temporal Action Proposal Generation With Action Frequency Adaptive Network. 2340-2353 - Sitong Su
, Junchen Zhu
, Lianli Gao
, Jingkuan Song
:
Utilizing Greedy Nature for Multimodal Conditional Image Synthesis in Transformers. 2354-2366 - Shuaiqi Jing
, Haonan Zhang
, Pengpeng Zeng
, Lianli Gao
, Jingkuan Song
, Heng Tao Shen
:
Memory-Based Augmentation Network for Video Captioning. 2367-2379 - Xiaobin Tan
, Simin Li, Shunyi Wang
, Yangyang Liu, Quan Zheng
, Jian Yang
:
Cooperative Bargaining Game Based Adaptive Video Multicast Over Mobile Edge Networks. 2380-2394 - Gangyang Hou, Bo Ou
, Min Long
, Fei Peng
:
Separable Reversible Data Hiding for Encrypted 3D Mesh Models Based on Octree Subdivision and Multi-MSB Prediction. 2395-2407 - Mengya Han
, Yibing Zhan
, Yong Luo
, Han Hu
, Kehua Su
, Bo Du
:
Textual Enhanced Adaptive Meta-Fusion for Few-Shot Visual Recognition. 2408-2418 - Hao Tang
, Guoshuai Zhao
, Jing Gao
, Xueming Qian
:
Personalized Representation With Contrastive Loss for Recommendation Systems. 2419-2429 - Linfeng Xu
, Qingbo Wu
, Lili Pan
, Fanman Meng
, Hongliang Li
, Chiyuan He
, Hanxin Wang
, Shaoxu Cheng
, Yu Dai
:
Towards Continual Egocentric Activity Recognition: A Multi-Modal Egocentric Activity Dataset for Continual Learning. 2430-2443 - Jing Liu
, Zhiwei Fan
, Ziwen Yang
, Yuting Su, Xiaokang Yang
:
Multi-Stage Spatio-Temporal Fusion Network for Fast and Accurate Video Bit-Depth Enhancement. 2444-2455 - Yu Lu
, Feiyue Ni
, Haofan Wang
, Xiaofeng Guo
, Linchao Zhu
, Zongxin Yang
, Ruihua Song
, Lele Cheng
, Yi Yang:
Show Me a Video: A Large-Scale Narrated Video Dataset for Coherent Story Illustration. 2456-2466 - Wei Zhang, KangBin Zhou, Luyao Teng
, Feiyi Tang, NaiQi Wu
, Shaohua Teng
, Jian Li
:
Dynamic Confidence Sampling and Label Semantic Guidance Learning for Domain Adaptive Retrieval. 2467-2479 - Jingcheng Ke
, Jia Wang, Jun-Cheng Chen
, I-Hong Jhuo, Chia-Wen Lin
, Yen-Yu Lin
:
CLIPREC: Graph-Based Domain Adaptive Network for Zero-Shot Referring Expression Comprehension. 2480-2492 - Xi Yang
, Xiaoqi Wang
, Dong Yang
:
Improving Cross-Modal Constraints: Text Attribute Person Search With Graph Attention Networks. 2493-2503 - Jinghan Ru
, Jun Tian
, Chengwei Xiao
, Jingjing Li
, Heng Tao Shen
:
Imbalanced Open Set Domain Adaptation via Moving-Threshold Estimation and Gradual Alignment. 2504-2514 - Chen Hui
, Shengping Zhang
, Wenxue Cui
, Shaohui Liu
, Feng Jiang
, Debin Zhao
:
Rate-Adaptive Neural Network for Image Compressive Sensing. 2515-2530 - Yang Liu
, Xingming Zhang
, Janne Kauttonen
, Guoying Zhao
:
Uncertain Facial Expression Recognition via Multi-Task Assisted Correction. 2531-2543 - Yang Liu
, Yong Xu
, Peipei Wu
, Wenwu Wang
:
Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking. 2544-2559 - Yutao Liu
, Ke Gu
, Jingchao Cao
, Shiqi Wang
, Guangtao Zhai
, Junyu Dong
, Sam Kwong
:
UIQI: A Comprehensive Quality Evaluation Index for Underwater Images. 2560-2573 - Xinchen Ye
, Yanjun Guo
, Baoli Sun
, Rui Xu
, Zhihui Wang
, Haojie Li
:
C2ANet: Cross-Scale and Cross-Modality Aggregation Network for Scene Depth Super-Resolution. 2574-2584 - Fugui Fan
, Yuting Su, Liqiang Nie
, Peiguang Jing
, Daozheng Hong
, Yu Liu
:
Dual-Domain Aligned Deep Hierarchical Matrix Factorization Method for Micro-Video Multi-Label Classification. 2598-2607 - Jingang Shi
, Yusi Wang
, Zitong Yu
, Guanxin Li
, Xiaopeng Hong
, Fei Wang
, Yihong Gong
:
Exploiting Multi-Scale Parallel Self-Attention and Local Variation via Dual-Branch Transformer-CNN Structure for Face Super-Resolution. 2608-2620 - Fan Zhang
, Na Liu
, Fuqing Duan
:
Coarse-to-Fine Depth Super-Resolution With Adaptive RGB-D Feature Attention. 2621-2633 - Tongyao Jia
, Jiafeng Li
, Li Zhuo
, Tianjian Yu
:
Semi-Supervised Single-Image Dehazing Network via Disentangled Meta-Knowledge. 2634-2647 - Fen Xiao
, Zhengdong Pu
, Jiaqi Chen
, Xieping Gao
:
DGFNet: Depth-Guided Cross-Modality Fusion Network for RGB-D Salient Object Detection. 2648-2658 - Wentian Zhao
, Xinxiao Wu
:
Boosting Entity-Aware Image Captioning With Multi-Modal Knowledge Graph. 2659-2670 - Shaolin Su
, Hanhe Lin
, Vlad Hosu
, Oliver Wiedemann, Jinqiu Sun, Yu Zhu
, Hantao Liu
, Yanning Zhang
, Dietmar Saupe
:
Going the Extra Mile in Face Image Quality Assessment: A Novel Database and Model. 2671-2685 - Jian Wang
, Fan Li
, Xuchong Zhang
, Hongbin Sun
:
Adversarial Obstacle Generation Against LiDAR-Based 3D Object Detection. 2686-2699 - Zefeng Lu
, Ronghao Lin
, Haifeng Hu
:
Tri-Level Modality-Information Disentanglement for Visible-Infrared Person Re-Identification. 2700-2714 - Bo Hu
, Guang Zhu
, Leida Li
, Ji Gan
, Weisheng Li
, Xinbo Gao
:
Blind Image Quality Index With Cross-Domain Interaction and Cross-Scale Integration. 2729-2739 - Ronghao Lin
, Haifeng Hu
:
Dynamically Shifting Multimodal Representations via Hybrid-Modal Attention for Multimodal Sentiment Analysis. 2740-2755 - Zixi Wang, Fan Li, Yunfei Zhang, Yuan Zhang:
Low-Rate Feature Compression for Collaborative Intelligence: Reducing Redundancy in Spatial and Statistical Levels. 2756-2771 - Yanbiao Ma
, Licheng Jiao
, Fang Liu
, Shuyuan Yang
, Xu Liu
, Puhua Chen
:
Feature Distribution Representation Learning Based on Knowledge Transfer for Long-Tailed Classification. 2772-2784 - Mijanur Rahaman Palash, Bharat K. Bhargava
:
EMERSK - Explainable Multimodal Emotion Recognition With Situational Knowledge. 2785-2794 - Pei An
, Yucong Duan, Yuliang Huang, Jie Ma
, Yanfei Chen, Liheng Wang, You Yang
, Qiong Liu
:
SP-Det: Leveraging Saliency Prediction for Voxel-Based 3D Object Detection in Sparse Point Cloud. 2795-2808 - Xue Li
, Jiong Yu
, Shaochen Jiang
, Hongchun Lu
, Ziyang Li
:
MSViT: Training Multiscale Vision Transformers for Image Retrieval. 2809-2823 - Mengkun Liu
, Licheng Jiao
, Xu Liu
, Lingling Li
, Fang Liu
, Shuyuan Yang
, Xiangrong Zhang
:
Bio-Inspired Multi-Scale Contourlet Attention Networks. 2824-2837 - Ping Xu
, Lei Liu
, Haifeng Zheng
, Xin Yuan
, Chen Xu
, Lingyun Xue
:
Degradation-Aware Dynamic Fourier-Based Network for Spectral Compressive Imaging. 2838-2850 - Mingyi Yang
, Junyan Huo
, Xile Zhou
, Wenhan Qiao
, Shuai Wan
, Hao Wang
, Fuzheng Yang
:
Joint Rate-Distortion Optimization for Video Coding and Learning-Based In-Loop Filtering. 2851-2865 - Daizong Liu
, Wei Hu
, Xin Li
:
Robust Geometry-Dependent Attack for 3D Point Clouds. 2866-2877 - Li Wang, Tao Xie
, Xinyu Zhang
, Zhiqiang Jiang
, Linqi Yang
, Haoming Zhang
, Xiaoyu Li
, Yilong Ren
, Haiyang Yu
, Jun Li
, Huaping Liu
:
Auto-Points: Automatic Learning for Point Cloud Analysis With Neural Architecture Search. 2878-2893 - Decheng Liu
, Zeyang Zheng
, Chunlei Peng
, Yukai Wang
, Nannan Wang
, Xinbo Gao
:
Hierarchical Forgery Classifier on Multi-Modality Face Forgery Clues. 2894-2905 - Yunpeng Xiao
, Xuehong Li
, Qunqing Zhang
, Rui Lv
, Qian Li
, Rong Wang
:
Spreading Mosaic: An Image Restoration-Inspired Social Rumor Propagation Model. 2906-2917 - Honghao Dai
, Shanshan Gao
, Hong Huang
, Deqian Mao
, Chenhao Zhang
, Yuanfeng Zhou
:
An Adaptive Sample Assignment Network for Tiny Object Detection. 2918-2931 - Yan Zhang
, Yuning Su
, Xiaoying Sun
:
A QoE Physiological Measure of VR With Vibrotactile Feedback Based on Frontal Lobe Power Asymmetry. 2932-2942 - Shenjian Gong
, Jian Yang
, Shanshan Zhang
:
Adaptive Teaching for Cross-Domain Crowd Counting. 2943-2952 - Xiaoyu Kong
, Yongyong Chen
, Zhenyu He
:
When Channel Correlation Meets Sparse Prior: Keeping Interpretability in Image Compressive Sensing. 2953-2965 - Haixin Ding
, Shengchuan Zhang, Qiong Wu, Songlin Yu, Jie Hu, Liujuan Cao
, Rongrong Ji
:
Bilateral Knowledge Interaction Network for Referring Image Segmentation. 2966-2977 - Kaile Du
, Fan Lyu
, Linyan Li, Fuyuan Hu
, Wei Feng
, Fenglei Xu
, Xuefeng Xi
, Hanjing Cheng:
Multi-Label Continual Learning Using Augmented Graph Convolutional Network. 2978-2992 - Kejun Wu
, You Yang
, Qiong Liu
, Gangyi Jiang
, Xiao-Ping Zhang
:
Hierarchical Independent Coding Scheme for Varifocal Multiview Images Based on Angular-Focal Joint Prediction. 2993-3006 - Kaijie Zhao
, Haitao Zhao
, Zhongze Wang
, Jingchao Peng
, Zhengwei Hu
:
Object-Preserving Siamese Network for Single-Object Tracking on Point Clouds. 3007-3017 - Sijie Mai
, Ya Sun
, Aolin Xiong
, Ying Zeng
, Haifeng Hu
:
Multimodal Boosting: Addressing Noisy Modalities and Identifying Modality Contribution. 3018-3033 - Yongchao Du
, Min Wang
, Wengang Zhou
, Houqiang Li
:
Progressive Similarity Preservation Learning for Deep Scalable Product Quantization. 3034-3045 - Zhiqiang Bao
, Zihao Chen
, Chang-Dong Wang
, Wei-Shi Zheng
, Zhenhua Huang
, Yunwen Chen
:
Post-Distillation via Neural Resuscitation. 3046-3060 - Qin Yang
, Yuqi Li
, Chenglin Li
, Hao Wang
, Sa Yan
, Li Wei
, Wenrui Dai
, Junni Zou
, Hongkai Xiong
, Pascal Frossard
:
SVGC-AVA: 360-Degree Video Saliency Prediction With Spherical Vector-Based Graph Convolution and Audio-Visual Attention. 3061-3076 - Xin Ma
, Chang Liu
, Chunyu Xie
, Long Ye
, Yafeng Deng
, Xiangyang Ji
:
Disjoint Masking With Joint Distillation for Efficient Masked Image Modeling. 3077-3087 - Tianhao Qi
, Hongtao Xie
, Pandeng Li
, Jiannan Ge
, Yongdong Zhang
:
Balanced Classification: A Unified Framework for Long-Tailed Object Detection. 3088-3101 - Zhen Long
, Ce Zhu
, Jie Chen
, Zihan Li
, Yazhou Ren
, Yipeng Liu
:
Multi-View MERA Subspace Clustering. 3102-3112 - Chuanming Wang
, Huiyuan Fu
, Huadong Ma
:
Learning Mutually Exclusive Part Representations for Fine-Grained Image Classification. 3113-3124 - Shuman Fang
, Zhiwen Lin
, Ke Yan
, Jie Li
, Xianming Lin
, Rongrong Ji
:
HODN: Disentangling Human-Object Feature for HOI Detection. 3125-3136 - Fengyong Li
, Yang Sheng
, Xinpeng Zhang
, Chuan Qin
:
iSCMIS: Spatial-Channel Attention Based Deep Invertible Network for Multi-Image Steganography. 3137-3152 - Di Li
, Susanto Rahardja
:
Learning Deep Representations for Photo Retouching. 3153-3163 - Yulai Xie
, Jingjing Niu
, Yang Zhang
, Fang Ren
:
Global-Shared Text Representation Based Multi-Stage Fusion Transformer Network for Multi-Modal Dense Video Captioning. 3164-3179 - Yan Dai
, Beitao Chen
, Lianli Gao
, Jingkuan Song
, Heng Tao Shen
:
DMH-CL: Dynamic Model Hardness Based Curriculum Learning for Complex Pose Estimation. 3180-3193 - Ke Nai
, Shaomiao Chen
:
Learning a Novel Ensemble Tracker for Robust Visual Tracking. 3194-3206 - Mingdao Wang
, Xueming Li, Siqi Chen
, Xianlin Zhang
, Lei Ma, Yue Zhang
:
Learning Representations by Contrastive Spatio-Temporal Clustering for Skeleton-Based Action Recognition. 3207-3220 - Shuai Shen
, Wanhua Li
, Xiaoke Huang
, Zheng Zhu
, Jie Zhou
, Jiwen Lu
:
SD-NeRF: Towards Lifelike Talking Head Animation via Spatially-Adaptive Dual-Driven NeRFs. 3221-3234 - Shiwei Wang
, Liquan Shen
, Jingyue Liu
:
Spatial-Temporal Inter-Layer Reference Frame Generation Network for Spatial SHVC. 3235-3250 - Guosong Zhu
, Zhen Qin
, Yi Ding
, Yao Liu
, Zhiguang Qin
:
MFNet: Real-Time Motion Focus Network for Video Frame Interpolation. 3251-3262 - Xiang Fang
, Daizong Liu
, Pan Zhou
, Zichuan Xu
, Ruixuan Li
:
Hierarchical Local-Global Transformer for Temporal Sentence Grounding. 3263-3277 - Huiwen Ren
, Shanshe Wang
, Siwei Ma
, Wen Gao:
SVT-AVS3: An Open-Source High-Performance AVS3 Encoder With Scalable Video Technology. 3291-3301 - Ke Xian
, Juewen Peng
, Zhiguo Cao
, Jianming Zhang
, Guosheng Lin
:
ViTA: Video Transformer Adaptor for Robust Video Depth Estimation. 3302-3316 - Lei Wei
, Shuai Wan
, Zhecheng Wang
, Fuzheng Yang
:
Near-Lossless Compression of Point Cloud Attribute Using Quantization Parameter Cascading and Rate-Distortion Optimization. 3317-3330 - Xiaofeng Yang
, Fayao Liu
, Guosheng Lin
:
Neural Logic Vision Language Explainer. 3331-3340 - Shuwei Shao
, Zhongcai Pei
, Weihai Chen
, Ran Li
, Zhong Liu
, Zhengguo Li
:
URCDC-Depth: Uncertainty Rectified Cross-Distillation With CutFlip for Monocular Depth Estimation. 3341-3353 - Yu Zhou
, Weikang Gong
, Yanjing Sun
, Leida Li
, Ke Gu
, Jinjian Wu
:
Quality Assessment for Stitched Panoramic Images via Patch Registration and Bidimensional Feature Aggregation. 3354-3365 - Yawen Zeng
, Ning Han
, Keyu Pan
, Qin Jin
:
Temporally Language Grounding With Multi-Modal Multi-Prompt Tuning. 3366-3377 - Mingzheng Feng
, Jianbo Su
:
Learning Multi-Layer Attention Aggregation Siamese Network for Robust RGBT Tracking. 3378-3391 - Zhengyun Lu
, Lu Jin
, Zechao Li
, Jinhui Tang
:
Self-Paced Relational Contrastive Hashing for Large-Scale Image Retrieval. 3392-3404 - Yang Yu
, Rongrong Ni
, Siyuan Yang
, Yao Zhao
, Alex C. Kot
:
Narrowing Domain Gaps With Bridging Samples for Generalized Face Forgery Detection. 3405-3417 - Ping Li
, Chenhan Zhang, Xianghua Xu
:
Fast Fourier Inception Networks for Occluded Video Prediction. 3418-3429 - Baoliang Chen
, Lingyu Zhu
, Hanwei Zhu
, Wenhan Yang
, Linqi Song
, Shiqi Wang
:
Gap-Closing Matters: Perceptual Quality Evaluation and Optimization of Low-Light Image Enhancement. 3430-3443 - Junlong Gao
, Jiguo Li
, Chuanmin Jia
, Shanshe Wang
, Siwei Ma
, Wen Gao:
Cross Modal Compression With Variable Rate Prompt. 3444-3456 - Zhiwei Zhao
, Bin Liu
, Yan Lu
, Qi Chu
, Nenghai Yu
, Chang Wen Chen
:
Joint Identity-Aware Mixstyle and Graph-Enhanced Prototype for Clothes-Changing Person Re-Identification. 3457-3468 - Fang Peng
, Xiaoshan Yang
, Linhui Xiao
, Yaowei Wang
, Changsheng Xu
:
SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification. 3469-3480 - Haojie Ding
, Bin Wang
, Guoliang Kang
, Weijia Li
, Conghui He
, Yao Zhao
, Yunchao Wei
:
DropQueries: A Simple Way to Discover Comprehensive Segment Representations. 3481-3490 - Maregu Assefa
, Wei Jiang
, Jinyu Zhan
, Kumie Gedamu
, Getinet Yilma
, Melese Ayalew
, Deepak Adhikari
:
Audio-Visual Contrastive and Consistency Learning for Semi-Supervised Action Recognition. 3491-3504 - Yue Wu
, Jiaming Liu
, Maoguo Gong
, Zhixiao Liu
, Qiguang Miao
, Wenping Ma
:
MPCT: Multiscale Point Cloud Transformer With a Residual Network. 3505-3516 - Xiaogang Song
, Haoyue Hu, Li Liang
, Weiwei Shi, Guo Xie
, Xiaofeng Lu, Xinhong Hei
:
Unsupervised Monocular Estimation of Depth and Visual Odometry Using Attention and Depth-Pose Consistency Loss. 3517-3529 - Aoqi Li
, Saihui Hou
, Qingyuan Cai
, Yang Fu
, Yongzhen Huang
:
Gait Recognition With Drones: A Benchmark. 3530-3540 - Yanan Chen
, Ang Li
, Dan Wu
, Liang Zhou
:
Toward General Cross-Modal Signal Reconstruction for Robotic Teleoperation. 3541-3553 - Zining Chen
, Weiqiu Wang
, Zhicheng Zhao
, Fei Su
, Aidong Men, Yuan Dong
:
Cluster-Instance Normalization: A Statistical Relation-Aware Normalization for Generalizable Person Re-Identification. 3554-3566 - Tao Chen
, Yanrong Guo
, Shijie Hao
, Richang Hong
:
Semi-Supervised Domain Adaptation for Major Depressive Disorder Detection. 3567-3579 - Xiaoying Ding
, Zhao Chen
, Weisi Lin
, Zhenzhong Chen
:
Towards 3D Colored Mesh Saliency: Database and Benchmarks. 3580-3591 - Jiachen Yang
, Chen Cheng
, Shuai Xiao
, Guipeng Lan
, Jiabao Wen
:
High Fidelity Face-Swapping With Style ConvTransformer and Latent Space Selection. 3604-3615 - Yuan Gao
, Xin Li
, Hui Yan
:
Rethinking Graph Contrastive Learning: An Efficient Single-View Approach via Instance Discrimination. 3616-3625 - Siyu Liu
, Jian Cheng
, Ziying Xia
, Zhilong Xi
, Qin Hou
, Zhicheng Dong
:
HCM: Online Action Detection With Hard Video Clip Mining. 3626-3639 - Guipeng Lan
, Shuai Xiao
, Jiachen Yang
, Yanshuang Zhou
, Jiabao Wen
, Wen Lu
, Xinbo Gao
:
Image Aesthetics Assessment Based on Hypernetwork of Emotion Fusion. 3640-3650 - Binglu Wang
, Tianci Bu, Zaiyi Hu
, Le Yang
, Yongqiang Zhao
, Xuelong Li
:
Coarse-to-Fine Nutrition Prediction. 3651-3662 - Wenwu Yang
, Yeqing Zhao, Bailin Yang
, Jianbing Shen
:
Learning 3D Face Reconstruction From the Cycle-Consistency of Dynamic Faces. 3663-3675 - Kai Zhuang
, Qiang Li
, Yuan Yuan, Qi Wang
:
Multi-Domain Adaptation for Motion Deblurring. 3676-3688 - Gen Luo
, Yiyi Zhou
, Jiamu Sun
, Xiaoshuai Sun
, Rongrong Ji
:
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension. 3689-3700 - Ye Yao
, Ke Wang
, Qi Chang
, Shaowei Weng
:
Reversible Data Hiding in Encrypted Images Using Global Compression of Zero-Valued High Bit-Planes and Block Rearrangement. 3701-3714 - Cong Yu, Dongheng Zhang, Zhi Wu, Chunyang Xie, Zhi Lu, Yang Hu, Yan Chen:
MobiRFPose: Portable RF-Based 3D Human Pose Camera. 3715-3727 - Shili Zhou
, Weimin Tan
, Bo Yan
:
A Motion Distillation Framework for Video Frame Interpolation. 3728-3740 - Chunlei Peng
, Zimo Kong
, Decheng Liu
, Nannan Wang
, Xinbo Gao
:
Disguised Heterogeneous Face Generation With Iterative-Adversarial Style Unification. 3741-3753 - Zhengzhuo Xu
, Zenghao Chai
, Chengyin Xu
, Chun Yuan
, Haiqin Yang
:
Towards Effective Collaborative Learning in Long-Tailed Recognition. 3754-3764 - Jiaxu Leng
, Yiran Liu
, Xinbo Gao
, Zhihui Wang
:
CRNet: Context-guided Reasoning Network for Detecting Hard Objects. 3765-3777 - Xingxing Wei
, Shiji Zhao
:
Boosting Adversarial Transferability With Learnable Patch-Wise Masks. 3778-3787 - Dongliang Chen
, Guihua Wen
, Pengcheng Wen
, Pei Yang
, Rui Chen
, Cheng Li
:
Cross-Domain Sample Relationship Learning for Facial Expression Recognition. 3788-3798 - Tongbao Chen
, Wenmin Wang
, Zhe Jiang
, Ruochen Li
, Bingshu Wang
:
Cross-Modality Knowledge Calibration Network for Video Corpus Moment Retrieval. 3799-3813 - Haifeng Guo
, Sam Kwong
, Dongjie Ye
, Shiqi Wang
:
Enhanced Context Mining and Filtering for Learned Video Compression. 3814-3826 - Dongqing Wu
, Huihui Li
, Cang Gu
, Hang Liu, Cuili Xu
, Yinxuan Hou
, Lei Guo:
Feature First: Advancing Image-Text Retrieval Through Improved Visual Features. 3827-3841 - Huilin Zhu
, Jingling Yuan
, Xian Zhong
, Liang Liao
, Zheng Wang
:
Find Gold in Sand: Fine-Grained Similarity Mining for Domain-Adaptive Crowd Counting. 3842-3855 - Qiangqiang Shen
, Tingting Xu
, Yongsheng Liang
, Yongyong Chen
, Zhenyu He
:
Robust Tensor Recovery for Incomplete Multi-View Clustering. 3856-3870 - Zhenyu Weng
, Huiping Zhuang
, Fulin Luo
, Haizhou Li
, Zhiping Lin
:
Few-Shot Contrastive Transfer Learning With Pretrained Model for Masked Face Verification. 3871-3883 - Zerun Feng
, Zhimin Zeng
, Caili Guo
, Zheng Li
, Lin Hu
:
Learning From Noisy Correspondence With Tri-Partition for Cross-Modal Matching. 3884-3896 - Jiaming Liu
, Yue Wu
, Maoguo Gong
, Zhixiao Liu
, Qiguang Miao
, Wenping Ma
:
Inter-Modal Masked Autoencoder for Self-Supervised Learning on Point Clouds. 3897-3908 - Guangyong Gao
, Hui Zhang
, Zhihua Xia
, Xiangyang Luo
, Yun-Qing Shi:
Reversible Data Hiding-Based Contrast Enhancement With Multi-Group Stretching for ROI of Medical Image. 3909-3923 - Zhixuan Li
, Weining Ye
, Tingting Jiang
, Tie-Jun Huang
:
GIN: Generative INvariant Shape Prior for Amodal Instance Segmentation. 3924-3936 - Xiaochuan Li
, Baoyu Fan
, Runze Zhang
, Kun Zhao
, Zhenhua Guo
, Yaqian Zhao
, Rengang Li
:
Inexactly Matched Referring Expression Comprehension With Rationale. 3937-3950 - Jiale Cheng
, Dongzi Shi
, Chenyang Li
, Yu Li
, Hao Ni
, Lianwen Jin
, Xin Zhang
:
Skeleton-Based Gesture Recognition With Learnable Paths and Signature Features. 3951-3961 - Shidong Cao
, Wenhao Chai
, Shengyu Hao
, Yanting Zhang
, Hangyue Chen
, Gaoang Wang
:
DiffFashion: Reference-Based Fashion Design With Structure-Aware Transfer by Diffusion Models. 3962-3975 - Junyan Wang
, Yiqi Jiang
, Yang Long
, Xiuyu Sun
, Maurice Pagnucco
, Yang Song
:
Deconfounding Causal Inference for Zero-Shot Action Recognition. 3976-3986 - Duzhen Zhang
, Feilong Chen
, Jianlong Chang
, Xiuyi Chen
, Qi Tian
:
Structure Aware Multi-Graph Network for Multi-Modal Emotion Recognition in Conversations. 3987-3997 - Xu Wang
, Weifeng Kong, Qiudan Zhang
, You Yang
, Tiesong Zhao
, Jianmin Jiang
:
Distortion-Aware Self-Supervised Indoor 360° Depth Estimation via Hybrid Projection Fusion and Structural Regularities. 3998-4011 - Wenxue Cui
, Xiaopeng Fan
, Jian Zhang
, Debin Zhao
:
Deep Unfolding Network for Image Compressed Sensing by Content-Adaptive Gradient Updating and Deformation-Invariant Non-Local Modeling. 4012-4027 - Di Wang
, Changning Tian
, Xiao Liang
, Lin Zhao
, Lihuo He
, Quan Wang
:
Dual-Perspective Fusion Network for Aspect-Based Multimodal Sentiment Analysis. 4028-4038 - Yong Wang
, Hongbo Kang
, Doudou Wu
, Wenming Yang
, Longbin Zhang
:
Global and Local Spatio-Temporal Encoder for 3D Human Pose Estimation. 4039-4049 - Yixuan Lyu
, Hong Zhang
, Yan Li
, Hanyang Liu
, Yifan Yang
, Ding Yuan
:
UEDG: Uncertainty-Edge Dual Guided Camouflage Object Detection. 4050-4060 - Shao-Jie Zhang
, Jia-Hui Pan, Jibin Gao
, Wei-Shi Zheng
:
Adaptive Stage-Aware Assessment Skill Transfer for Skill Determination. 4061-4072 - Yan Ju
, Shan Jia
, Jialing Cai
, Haiying Guan
, Siwei Lyu
:
GLFF: Global and Local Feature Fusion for AI-Synthesized Image Detection. 4073-4085 - Yangyang Shu
, Qian Li
, Lingqiao Liu
, Guandong Xu
:
Semi-Supervised Adversarial Learning for Attribute-Aware Photo Aesthetic Assessment. 4086-4096 - Shulan Ruan
, Kun Zhang
, Le Wu
, Tong Xu
, Qi Liu
, Enhong Chen
:
Color Enhanced Cross Correlation Net for Image Sentiment Analysis. 4097-4109 - Harry Cheng
, Yangyang Guo
, Jianhua Yin
, Haonan Chen, Jiafang Wang
, Liqiang Nie
:
Audio-Driven Talking Video Frame Restoration. 4110-4122 - Song Tang
, Yuji Shi
, Zihao Song
, Mao Ye
, Changshui Zhang
, Jianwei Zhang
:
Progressive Source-Aware Transformer for Generalized Source-Free Domain Adaptation. 4138-4152 - Yuwu Lu
, Wai Keung Wong
, Chun Yuan
, Zhihui Lai
, Xuelong Li
:
Low-Rank Correlation Learning for Unsupervised Domain Adaptation. 4153-4167 - Xiaobin Tan
, Shunyi Wang
, Xiang Xu
, Quan Zheng
, Jian Yang
, Shuangwu Chen
:
DACOD360: Deadline-Aware Content Delivery for 360-Degree Video Streaming Over MEC Networks. 4168-4182 - Yunzuo Zhang
, Tian Zhang
, Cunyu Wu
, Ran Tao
:
Multi-Scale Spatiotemporal Feature Fusion Network for Video Saliency Prediction. 4183-4193 - Binwei Xu
, Haoran Liang
, Ronghua Liang
, Peng Chen
:
Synthesize Boundaries: A Boundary-Aware Self-Consistent Framework for Weakly Supervised Salient Object Detection. 4194-4205 - Zhengning Wu
, Tianyu He
, Xiaobo Xia
, Jun Yu
, Xu Shen, Tongliang Liu
:
Conditional Consistency Regularization for Semi-Supervised Multi-Label Image Classification. 4206-4216 - Zhiwei Ding
, Guilin Lan, Yanzhi Song
, Zhouwang Yang
:
SGIR: Star Graph-Based Interaction for Efficient and Robust Multimodal Representation. 4217-4229 - Jun Rao
, Xv Meng
, Liang Ding
, Shuhan Qi
, Xuebo Liu
, Min Zhang
, Dacheng Tao
:
Parameter-Efficient and Student-Friendly Knowledge Distillation. 4230-4241 - Ze Zhou
, Yinghui Sun
, Quansen Sun
, Chaobo Li
, Zhenwen Ren
:
Unit Correlation With Interactive Feature for Robust and Effective Tracking. 4242-4254 - Lin Yang
, Rangding Wang
, Dawen Xu
, Li Dong
, Songhan He
:
Centralized Error Distribution-Preserving Adaptive Steganography for HEVC. 4255-4270 - Liping Bao
, Longhui Wei
, Wengang Zhou
, Lin Liu
, Lingxi Xie
, Houqiang Li
, Qi Tian
:
Multi-Granularity Matching Transformer for Text-Based Person Search. 4281-4293 - Yuxuan Liu
, Hongwei Ge
, Zhen Wang
, Yaqing Hou
, Mingde Zhao
:
Clothes-Changing Person Re-Identification via Universal Framework With Association and Forgetting Learning. 4294-4307 - TianYu Ning
, Bineng Zhong
, Qihua Liang
, Zhenjun Tang
, Xianxian Li
:
Robust Tracking via Bidirectional Transduction With Mask Information. 4308-4319 - Zhejing Hu
, Xiao Ma
, Yan Liu, Gong Chen
, Yongxu Liu
, Roger B. Dannenberg
:
The Beauty of Repetition: An Algorithmic Composition Model With Motif-Level Repetition Generator and Outline-to-Music Generator in Symbolic Music Generation. 4320-4333 - Linhui Xiao
, Xiaoshan Yang
, Fang Peng
, Ming Yan
, Yaowei Wang
, Changsheng Xu
:
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding. 4334-4347 - Siran Chen
, Qinglin Xu
, Yue Ma
, Yu Qiao
, Yali Wang
:
Attentive Snippet Prompting for Video Retrieval. 4348-4359 - Yeqing Ren
, Haipeng Peng
, Lixiang Li
, Yixian Yang
:
Lightweight Voice Spoofing Detection Using Improved One-Class Learning and Knowledge Distillation. 4360-4374 - Jiacheng Wang
, Ping Liu
, Jingen Liu
, Wei Xu
:
Text-Guided Eyeglasses Manipulation With Spatial Constraints. 4375-4388 - Yuanzhi Liang
, Linchao Zhu
, Xiaohan Wang
, Yi Yang:
IcoCap: Improving Video Captioning by Compounding Images. 4389-4400 - Ping Ping
, Bobiao Guo
, Olano Teah Bloh
, Yingchi Mao
, Feng Xu
:
Hiding Multiple Images into a Single Image Using Up-Sampling. 4401-4415 - Xin Yang
, Chenyang Zhao
, Jinqi Yang
, Yong Song
, Yufei Zhao
:
Negative-Driven Training Pipeline for Siamese Visual Tracking. 4416-4429 - Hao Wu
, Lincong Fang
, Qian Yu
, Chengzhuan Yang
:
Learning Robust Point Representation for 3D Non-Rigid Shape Retrieval. 4430-4444 - Junjie Zhang
, Mingyan Wang
, Haoran Jiang
, Xinyu Zhang
, Chenggang Yan
, Dan Zeng
:
STAT: Multi-Object Tracking Based on Spatio-Temporal Topological Constraints. 4445-4457 - Tong Tang
, Zhiyang Yin
, Jie Li
, Honggang Wang, Dapeng Wu
, Ruyan Wang
:
End-to-End Distortion Modeling for Error-Resilient Screen Content Video Coding. 4458-4468 - Aoran Zhang
, Zhigang Ling
, Yaonan Wang
:
Multi-Layer Decoupling Attention Network for Weakly Supervised Object Localization. 4469-4479 - Xinyuan Qian
, Wei Xue
, Qiquan Zhang
, Ruijie Tao
, Haizhou Li
:
Deep Cross-Modal Retrieval Between Spatial Image and Acoustic Speech. 4480-4489 - Chaoyang Zhou, Zengmao Wang
, Xiaoping Zhang, Bo Du
:
Domain Complementary Adaptation by Leveraging Diversity and Discriminability From Multiple Sources. 4490-4501 - Han Zhang
, Yiding Li
, Xuelong Li
:
Constrained Bipartite Graph Learning for Imbalanced Multi-Modal Retrieval. 4502-4514 - Runsheng Wang
, Yuxuan Shi
, Hefei Ling
, Zongyi Li
, Chengxin Zhao, Bohao Wei
, He Li
, Ping Li
:
Gait Recognition With Multi-Level Skeleton-Guided Refinement. 4515-4526 - Ruibin Wang
, Xianghua Ying
, Bowei Xing
:
Exploiting Temporal Correlations for 3D Human Pose Estimation. 4527-4539 - Tian-Bao Li
, Yuting Su, Dan Song
, Wenhui Li
, Zhiqiang Wei
, An-An Liu
:
Progressive Fourier Adversarial Domain Adaptation for Object Classification and Retrieval. 4540-4553 - Tianwen Qian
, Ran Cui
, Jingjing Chen
, Pai Peng
, Xiaowei Guo
, Yu-Gang Jiang
:
Locate Before Answering: Answer Guided Question Localization for Video Question Answering. 4554-4563 - Wujie Zhou
, Yuqi Cai, Liting Zhang, Weiqing Yan
, Lu Yu:
UTLNet: Uncertainty-Aware Transformer Localization Network for RGB-Depth Mirror Segmentation. 4564-4574 - Guanhua Zheng
, Jitao Sang
, Changsheng Xu
:
TIF: Threshold Interception and Fusion for Compact and Fine-Grained Visual Attribution. 4575-4589 - Yuanhong Zhong
, Chenxu Zhang
, Xun Yang
, Shanshan Wang
:
Video Compressed Sensing Reconstruction via an Untrained Network with Low-Rank Regularization. 4590-4601 - Wenfeng Song
, Tangli Chu
, Shuai Li
, Nannan Li
, Aimin Hao
, Hong Qin
:
Joints-Centered Spatial-Temporal Features Fused Skeleton Convolution Network for Action Recognition. 4602-4616 - Yuanyuan Jiang
, Jianqin Yin
, Yonghao Dang
:
Leveraging the Video-Level Semantic Consistency of Event for Audio-Visual Event Localization. 4617-4627 - Yifei Zhang
, Chang Liu
, Yu Zhou
, Weiping Wang
, Qixiang Ye
, Xiangyang Ji
:
Beyond Instance Discrimination: Relation-Aware Contrastive Self-Supervised Learning. 4628-4640 - Jinsong Shi, Pan Gao
, Aljosa Smolic
:
Blind Image Quality Assessment via Transformer Predicted Error Map and Perceptual Quality Token. 4641-4651 - Xin Li
, Yiting Lu
, Zhibo Chen
:
FreqAlign: Excavating Perception-Oriented Transferability for Blind Image Quality Assessment From a Frequency Perspective. 4652-4666 - Yi Ke Yun
, Weisi Lin
:
Towards a Complete and Detail-Preserved Salient Object Detection. 4667-4680 - Fanzhao Lin
, Shiming Ge
, Kexin Bao
, Chenggang Yan
, Dan Zeng
:
Learning Shape-Biased Representations for Infrared Small Target Detection. 4681-4692 - Hui Wu
, Min Wang
, Wengang Zhou
, Houqiang Li
:
Structure Similarity Preservation Learning for Asymmetric Image Retrieval. 4693-4705 - Ke Zhang
, Yan Yang
, Jun Yu
, Hanliang Jiang
, Jianping Fan
, Qingming Huang
, Weidong Han
:
Multi-Task Paired Masking With Alignment Modeling for Medical Vision-Language Pre-Training. 4706-4721 - Ankur
, Rajeev Kumar
, Ajay K. Sharma
:
Bit-Plane Based Reversible Data Hiding in Encrypted Images Using Multi-Level Blocking With Quad-Tree. 4722-4735 - Zengbin Wang
, Saihui Hou
, Man Zhang
, Xu Liu
, Chunshui Cao
, Yongzhen Huang
:
GaitParsing: Human Semantic Parsing for Gait Recognition. 4736-4748 - Bo Qin
, Fanqing Meng
, Shijin Yuan
, Bin Mu
:
CAU: A Causality Attention Unit for Spatial-Temporal Sequence Forecast. 4749-4763 - Linfeng Tang
, Ziang Chen
, Jun Huang
, Jiayi Ma
:
CAMF: An Interpretable Infrared and Visible Image Fusion Network Based on Class Activation Mapping. 4776-4791 - Xuan Han
, Mingyu You
, Ping Lu
:
Improving the Conditional Fine-Grained Image Generation With Part Perception. 4792-4804 - Yue Lu
, Xingyu Chen
, Zhengxing Wu
, Min Tan
, Junzhi Yu
:
Binary Similarity Few-Shot Object Detection With Modeling of Hard Negative Samples. 4805-4818 - Jie Gui
, Xiaofeng Cong
, Lei He
, Yuan Yan Tang, James Tin-Yau Kwok
:
Illumination Controllable Dehazing Network based on Unsupervised Retinex Embedding. 4819-4830 - Mengkun Liu
, Licheng Jiao
, Xu Liu
, Lingling Li
, Fang Liu
, Shuyuan Yang
, Shuang Wang
, Biao Hou
:
Multi-Scale Contourlet Knowledge Guide Learning Segmentation. 4831-4845 - Xiaomeng Wang
, Honglong Chen
, Peng Sun
, Junjian Li
, Anqing Zhang
, Weifeng Liu
, Nan Jiang
:
AdvST: Generating Unrestricted Adversarial Images via Style Transfer. 4846-4858 - Tianrun Chen
, Chaotao Ding
, Lanyun Zhu
, Ying Zang
, Yiyi Liao
, Zejian Li
, Lingyun Sun
:
Reality3DSketch: Rapid 3D Modeling of Objects From Single Freehand Sketches. 4859-4870 - Kezhou Lin
, Xiaohan Wang
, Linchao Zhu
, Bang Zhang, Yi Yang:
SKIM: Skeleton-Based Isolated Sign Language Recognition With Part Mixing. 4271-4280 - Qiuping Jiang
, Yaozu Kang, Zhihua Wang
, Wenqi Ren
, Chongyi Li
:
Perception-Driven Deep Underwater Image Enhancement Without Paired Supervision. 4884-4897 - Renjie Pan
, Hua Yang
, Cunyan Li
, Jinhai Yang
:
Joint Intra & Inter-Grained Reasoning: A New Look Into Semantic Consistency of Image-Text Retrieval. 4912-4925 - Wei Lu
, Yujia Zhai
, Jiaze Han, Peiguang Jing
, Yu Liu
, Yuting Su:
VMemNet: A Deep Collaborative Spatial-Temporal Network With Attention Representation for Video Memorability Prediction. 4926-4937 - Huasheng Wang
, Jianxun Lou
, Xiaochang Liu
, Hongchen Tan
, Roger M. Whitaker
, Hantao Liu
:
SSPNet: Predicting Visual Saliency Shifts. 4938-4949 - Yingjiao Pei
, Zhongyuan Wang
, Na Li
, Heling Chen
, Baojin Huang
, Weiping Tu
:
Deep Hashing Network With Hybrid Attention and Adaptive Weighting for Image Retrieval. 4961-4973 - Lanxiao Wang
, Hongliang Li
, Minjian Zhang
, Heqian Qiu
, Fanman Meng
, Qingbo Wu
, Linfeng Xu
:
CrowdCaption++: Collective-Guided Crowd Scenes Captioning. 4974-4986 - Huihui Gong
, Minjing Dong
, Siqi Ma
, Seyit Camtepe
, Surya Nepal
, Chang Xu
:
Stealthy Physical Masked Face Recognition Attack via Adversarial Style Optimization. 5014-5025 - Lingzhi He
, Feng Li
, Runmin Cong
, Yao Zhao
:
Reflection Intensity Guided Single Image Reflection Removal and Transmission Recovery. 5026-5039 - Zijin Yang
, Kejiang Chen
, Kai Zeng
, Weiming Zhang
, Nenghai Yu
:
Provably Secure Robust Image Steganography. 5040-5053 - Xian Zhao
, Lei Huang
, Jie Nie
, Zhiqiang Wei
:
Towards Adaptive Multi-Scale Intermediate Domain via Progressive Training for Unsupervised Domain Adaptation. 5054-5064 - Wentao Ma
, Xinyi Wu
, Shan Zhao
, Tongqing Zhou
, Dan Guo
, Lichuan Gu
, Zhiping Cai
, Meng Wang
:
FedSH: Towards Privacy-Preserving Text-Based Person Re-Identification. 5065-5077 - Mu Wang
, Xingyan Chen
, Xu Yang, Shuai Peng, Yu Zhao
, Mingwei Xu
, Changqiao Xu
:
CoLive: Edge-Assisted Clustered Learning Framework for Viewport Prediction in 360° Live Streaming. 5078-5091 - Yueli Cui
, Gangyi Jiang
, Mei Yu
, Yeyao Chen
, Yo-Sung Ho
:
Stitched Wide Field of View Light Field Image Quality Assessment: Benchmark Database and Objective Metric. 5092-5107 - Hongbo Sun
, Xiangteng He
, Yuxin Peng
:
HCL: Hierarchical Consistency Learning for Webly Supervised Fine-Grained Recognition. 5108-5119 - Mengqi Yuan
, Gengyun Jia
, Bing-Kun Bao
:
GPT-Based Knowledge Guiding Network for Commonsense Video Captioning. 5147-5158 - Yiming Liu
, Mengxi Zhang
, Bo Jiang
, Bo Hou
, Dan Liu
, Jie Chen
, Heqing Lian
:
Flexible Alignment Super-Resolution Network for Multi-Contrast Magnetic Resonance Imaging. 5159-5169 - Linfei Wang
, Yibing Zhan
, Wei Liu
, Baosheng Yu
, Dapeng Tao
:
Bounding Box Vectorization for Oriented Object Detection With Tanimoto Coefficient Regression. 5181-5193 - Junyu Shi
, Jianqi Zhong
, Wenming Cao
:
Multi-Semantics Aggregation Network Based on the Dynamic-Attention Mechanism for 3D Human Motion Prediction. 5194-5206 - Zhenyu Wang
, Yunzhou Zhang
, Yan Liu
, Cao Qin
, Sonya A. Coleman
, Dermot Kerr
:
LARNet: Towards Lightweight, Accurate and Real-Time Salient Object Detection. 5207-5222 - Zehua Fu
, Wenhang Zuo
, Zhenghui Hu
, Qingjie Liu
, Yunhong Wang
:
Improving Multi-Person Pose Tracking With a Confidence Network. 5223-5233 - Jinhong Deng
, Xiaoyue Zhang
, Wen Li
, Lixin Duan
, Dong Xu
:
Cross-Domain Detection Transformer Based on Spatial-Aware and Semantic-Aware Token Alignment. 5234-5245 - Hong Liu
, Yongqing Sun
, Yukihiro Bandoh
, Masaki Kitahara, Shin'ichi Satoh
:
Deep Counterfactual Representation Learning for Visual Recognition Against Weather Corruptions. 5257-5272 - Yalan Qin
, Nan Pu
, Hanzhou Wu
:
EDMC: Efficient Multi-View Clustering via Cluster and Instance Space Learning. 5273-5283 - Yuan Bian
, Min Liu
, Xueping Wang
, Yi Tang
, Yaonan Wang
:
Occlusion-Aware Feature Recover Model for Occluded Person Re-Identification. 5284-5295 - Peifu Liu
, Tingfa Xu
, Huan Chen
, Shiyun Zhou
, Haolin Qin
, Jianan Li
:
Spectrum-Driven Mixed-Frequency Network for Hyperspectral Salient Object Detection. 5296-5310 - Yitao Peng
, Lianghua He
, Die Hu
, Yihang Liu
, Longzhen Yang
, Shaohua Shang
:
Hierarchical Dynamic Masks for Visual Explanation of Neural Networks. 5311-5325 - Junyi Wu
, Yan Huang
, Min Gao
, Zhipeng Gao
, Jianqiang Zhao
, Huiji Zhang
, Anguo Zhang
:
A Two-Stream Hybrid Convolution-Transformer Network Architecture for Clothing-Change Person Re-Identification. 5326-5339 - Kexin Tang
, Nuowen Kan
, Yuankun Jiang
, Chenglin Li
, Wenrui Dai
, Junni Zou
, Hongkai Xiong
:
Successor Feature-Based Transfer Reinforcement Learning for Video Rate Adaptation With Heterogeneous QoE Preferences. 5340-5357 - Wenwen Wei
, Ping Wei
, Jialu Qin
, Zhimin Liao
, Shuaijie Wang
, Xiang Cheng
, Meiqin Liu
, Nanning Zheng
:
3D Scene Graph Generation From Point Clouds. 5358-5368 - Yueheng Li
, Hao Chen
, Bowei Xu
, Zicheng Zhang
, Zhan Ma
:
Improving Adaptive Real-Time Video Communication via Cross-Layer Optimization. 5369-5382 - Keyan Ding, Rijin Zhong, Zhihua Wang
, Yang Yu
, Yuming Fang
:
Adaptive Structure and Texture Similarity Metric for Image Quality Assessment and Optimization. 5398-5409 - Yufan Hu
, Junyu Gao
, Jianfeng Dong
, Bin Fan
, Hongmin Liu
:
Exploring Rich Semantics for Open-Set Action Recognition. 5410-5421 - Bingyu Hu
, Jiawei Liu
, Kecheng Zheng
, Zheng-Jun Zha
:
Unleashing Knowledge Potential of Source Hypothesis for Source-Free Domain Adaptation. 5422-5434 - Mingze He
, Hongxia Wang, Fei Zhang
, Yuyuan Xiang
:
Exploring Accurate Invariants on Polar Harmonic Fourier Moments in Polar Coordinates for Robust Image Watermarking. 5435-5449 - Wenda Zhao
, Guang Hu, Fei Wei, Haipeng Wang
, You He
, Huchuan Lu
:
Attacking Defocus Detection With Blur-Aware Transformation for Defocus Deblurring. 5450-5460 - Ye Yao
, Linchao Huang
, Hui Wang
, Qi Chang
, Yizhi Ren
, Fengjun Xiao
:
Robust Adaptive Steganography Based on Adaptive STC-ECC. 5477-5489 - Xiangzeng Liu
, Kunpeng Liu
, Jianfeng Guo
, Peipei Zhao
, Yi-Ning Quan
, Qiguang Miao
:
Pose-Guided Attention Learning for Cloth-Changing Person Re-Identification. 5490-5498 - Kai Gao
, Ji-Hwei Horng
, Chin-Chen Chang
:
Reversible Data Hiding for Encrypted 3D Mesh Models With Secret Sharing Over Galois Field. 5499-5510 - Yuxia Wu
, Guoshuai Zhao
, Mingdi Li
, Zhuocheng Zhang
, Xueming Qian
:
Reason Generation for Point of Interest Recommendation Via a Hierarchical Attention-Based Transformer Model. 5511-5522 - Ge Zhu
, Jinbao Li
, Yahong Guo:
PriorNet: Two Deep Prior Cues for Salient Object Detection. 5523-5535 - Jingyang Lin
, Hang Hua
, Ming Chen
, Yikang Li
, Jenhao Hsiao
, Chiuman Ho
, Jiebo Luo
:
VideoXum: Cross-Modal Visual and Textural Summarization of Videos. 5548-5560 - Haoyue Shi
, Le Wang
, Sanping Zhou
, Gang Hua
, Wei Tang
:
Abnormal Ratios Guided Multi-Phase Self-Training for Weakly-Supervised Video Anomaly Detection. 5575-5587 - Hao Liu
, Jingjing Wu
, Feng Li
, Jianguo Jiang
, Richang Hong
:
SYRER: Synergistic Relational Reasoning for RGB-D Cross-Modal Re-Identification. 5600-5614 - Han Yan
, Haijun Zhang
, Zhao Zhang
:
Learning to Disentangle the Colors, Textures, and Shapes of Fashion Items: A Unified Framework. 5615-5629 - Yun Wang
, Lu Zhu
, Yuanyuan Liu
:
CFENet: Boosting Few-Shot Semantic Segmentation With Complementary Feature-Enhanced Network. 5630-5640 - Yan Hu, Xiaozhao Fang
, Peipei Kang
, Yonghao Chen
, Yuting Fang, Shengli Xie
:
Dual Noise Elimination and Dynamic Label Correlation Guided Partial Multi-Label Learning. 5641-5656 - Nanfeng Jiang
, Weiling Chen
, Jielian Lin
, Tiesong Zhao
, Chia-Wen Lin
:
Video Compression Artifacts Removal With Spatial-Temporal Attention-Guided Enhancement. 5657-5669 - An-An Liu
, Yingchen Zhai
, Ning Xu
, Hongshuo Tian
, Weizhi Nie
, Yongdong Zhang
:
Event-Aware Retrospective Learning for Knowledge-Based Image Captioning. 4898-4911 - Xi Yang
, Zihan Wang, Ziyu Wei, Dong Yang:
SCSP: An Unsupervised Image-to-Image Translation Network Based on Semantic Cooperative Shape Perception. 4950-4960 - Zexing Du
, Di He
, Xue Wang
, Qing Wang
:
Learning Semantics-Guided Representations for Scoring Figure Skating. 4987-4997 - Zhuo Zhang
, Hongfei Wang
, Jie Geng
, Xinyang Deng
, Wen Jiang
:
A New Data Augmentation Method Based on Mixup and Dempster-Shafer Theory. 4998-5013 - Jiahui Zhang
, Jinlong Shi
, Danping Zou
, Xin Shu
, Suqin Bai
, Jiawen Lu
, Haowei Zhu
, Jun Ni
, Yunhan Sun
:
EPM-Net: Efficient Feature Extraction, Point-Pair Feature Matching for Robust 6-D Pose Estimation. 5120-5130 - Yiqiao Mao
, Xiaoqiang Yan
, Jiaming Liu
, Yangdong Ye
:
ConGMC: Consistency-Guided Multimodal Clustering via Mutual Information Maximin. 5131-5146 - Jinguang Wang
, Shengsheng Qian
, Jun Hu
, Richang Hong
:
Comment-Context Dual Collaborative Masked Transformer Network for Fake News Detection. 5170-5180 - Jun Zhou
, Chi Xu
, Yuting Ge
, Li Cheng
:
Realistic Depth Image Synthesis for 3D Hand Pose Estimation. 5246-5256 - Yujie Fu
, Pengju Zhang
, Fulin Tang
, Yihong Wu
:
Covariant Peak Constraint for Accurate Keypoint Detection and Keypoint-Specific Descriptor Learning. 5383-5397 - Daizong Liu
, Jiahao Zhu
, Xiang Fang
, Zeyu Xiong
, Huan Wang
, Renfu Li
, Pan Zhou
:
Conditional Video Diffusion Network for Fine-Grained Temporal Sentence Grounding. 5461-5476 - Aoran Xiao
, Dayan Guan
, Xiaoqin Zhang
, Shijian Lu
:
Domain Adaptive LiDAR Point Cloud Segmentation With 3D Spatial Consistency. 5536-5547 - Wei Cong
, Yang Cong
, Jiahua Dong
, Gan Sun
, Henghui Ding
:
Gradient-Semantic Compensation for Incremental Semantic Segmentation. 5561-5574 - Nam Joon Kim
, Hyun Kim
:
Trunk Pruning: Highly Compatible Channel Pruning for Convolutional Neural Networks Without Fine-Tuning. 5588-5599 - Yajie Wang
, Mulin Chen
, Xuelong Li
:
Continuous Emotion-Based Image-to-Music Generation. 5670-5679 - Zhenghong Lin
, Qishan Yan
, Weiming Liu
, Shiping Wang
, Menghan Wang
, Yanchao Tan
, Carl Yang
:
Automatic Hypergraph Generation for Enhancing Recommendation With Sparse Optimization. 5680-5693 - Yuer Ma
, Yi Liu
, Limin Wang
, Wenxiong Kang
, Yu Qiao
, Yali Wang
:
Dual Masked Modeling for Weakly-Supervised Temporal Boundary Discovery. 5694-5704 - Jin Yang
, Ping Wei
, Ziyang Ren
, Nanning Zheng
:
Gated Multi-Scale Transformer for Temporal Action Localization. 5705-5717 - Wenqing Wang
, Yawei Luo
, Zhiqing Chen
, Tao Jiang
, Yi Yang, Jun Xiao
:
Taking a Closer Look At Visual Relation: Unbiased Video Scene Graph Generation With Decoupled Label Learning. 5718-5728 - Yihao Huang
, Felix Juefei-Xu
, Qing Guo
, Geguang Pu
, Yang Liu
:
Natural & Adversarial Bokeh Rendering via Circle-of-Confusion Predictive Network. 5729-5740 - Guanghui Yue
, Honglv Wu
, Qiuping Jiang
, Tianwei Zhou
, Weiqing Yan
, Tianfu Wang
:
Perceptual Quality Assessment of Retouched Face Images. 5741-5752 - Ruohong Huan
, Guowei Zhong
, Peng Chen
, Ronghua Liang
:
UniMF: A Unified Multimodal Framework for Multimodal Sentiment Analysis in Missing Modalities and Unaligned Multimodal Sequences. 5753-5768 - Yihong Chen
, Hao Zheng
, Yanchun Li
, Wanli Ouyang
, Jiang Zhu
:
Online Handwritten Chinese Character Recognition Based on 1-D Convolution and Two-Streams Transformers. 5769-5781 - Qing Ding
, Liquan Shen
, Liangwei Yu
, Hao Yang
, Mai Xu
:
Blind Quality Enhancement for Compressed Video. 5782-5794 - Wenying Wen
, Ziye Yuan
, Shuren Qi
, Yushu Zhang
, Yuming Fang
:
PPM-SEM: A Privacy-Preserving Mechanism for Sharing Electronic Patient Records and Medical Images in Telemedicine. 5795-5806 - Yubin Cho
, Hyunwoo Yu
, Suk-Ju Kang
:
Cross-Aware Early Fusion With Stage-Divided Vision and Language Transformer Encoders for Referring Image Segmentation. 5823-5833 - Jing Li
, Qianqian Wang
, Ming Yang
, Quanxue Gao
, Xinbo Gao
:
Efficient Anchor Graph Factorization for Multi-View Clustering. 5834-5845 - Jia-Wei Ma
, Min Liang
, Lei Chen
, Shu Tian
, Song-Lu Chen
, Jingyan Qin
, Xu-Cheng Yin
:
Sample Weighting with Hierarchical Equalization Loss for Dense Object Detection. 5846-5859 - Shibo Li
, Shuyuan Zhu
, Yao Ge
, Bing Zeng
, Muhammad Ali Imran
, Qammer H. Abbasi
, Jonathan M. Cooper
:
Depth-Guided Deep Video Inpainting. 5860-5871 - Ying Luo
, Guoliang Kang
, Kexin Liu
, Fuzhen Zhuang
, Jinhu Lü
:
Taking a Closer Look at Factor Disentanglement: Dual-Path Variational Autoencoder Learning for Domain Generalization. 5872-5883 - XiuYu Zhang
, Minrui Xu
, Rui Tan
, Dusit Niyato
:
Learning-Based Auction for Matching Demand and Supply of Holographic Digital Twin Over Immersive Communications. 5884-5896 - Yawen Cui
, Zitong Yu
, Wei Peng
, Qi Tian
, Li Liu
:
Rethinking Few-Shot Class-Incremental Learning With Open-Set Hypothesis in Hyperbolic Geometry. 5897-5910 - Yun Zhang
, Haoqin Lin
, Jing Sun
, Linwei Zhu
, Sam Kwong
:
Learning to Predict Object-Wise Just Recognizable Distortion for Image and Video Compression. 5925-5938 - Minda Zhao
, Xingqun Qi
, Zhipeng Hu
, Lincheng Li
, Yongqiang Zhang
, Zi Huang
, Xin Yu
:
Calligraphy Font Generation via Explicitly Modeling Location-Aware Glyph Component Deformations. 5939-5950 - Qide Wang
, Daxin Liu
, Zhenyu Liu
, Jiatong Xu
, Jianrong Tan
:
3D Object Segmentation Using Cross-Window Point Transformer With Latent Semantic Boundary Guidance. 5951-5961 - Chi Ji
, Guangyong Gao
, Yunqing Shi:
Reversible Data Hiding in Encrypted Images With Adaptive Huffman Code Based on Dynamic Prediction Axes. 5962-5975 - Chunyi Zhou
, Dekang Liu
, Tianlei Wang
, Jiangmin Tian
, Jiuwen Cao
:
M$^{3}$ANet: Multi-Modal and Multi-Attention Fusion Network for Ship License Plate Recognition. 5976-5986 - Zhizhe Liu
, Zhenfeng Zhu
, Shuai Zheng
, Yawei Zhao
, Kunlun He
, Yao Zhao
:
From Observation to Concept: A Flexible Multi-View Paradigm for Medical Report Generation. 5987-5995 - Laijin Meng
, Xinghao Jiang
, Tanfeng Sun
, Zeyu Zhao
, Qiang Xu
:
A Robust Coverless Video Steganography Based on the Similarity of Inter-Frames. 5996-6011 - Wang Tang
, Linbo Qing
, Lindong Li
, Yuchen Wang
, Ce Zhu
:
Progressive Graph Reasoning-Based Social Relation Recognition. 6012-6024 - Guang Han
, Min Lin
, Ziyang Li
, Haitao Zhao
, Sam Kwong
:
Text-to-Image Person Re-Identification Based on Multimodal Graph Convolutional Network. 6025-6036 - Jiyou Chen
, Gaobo Yang
, Shengchun Wang
, Dewang Wang
, Xin Liao
:
Image Dehazing Assessment: A Real-World Dataset and a Haze Density-Aware Criteria. 6037-6049 - Fawei Ge
, Yunzhou Zhang
, Li Wang
, Sonya Coleman
, Dermot Kerr
:
Double-Domain Adaptation Semantics for Retrieval-Based Long-Term Visual Localization. 6050-6064 - Wenju Xu
, Chengjiang Long
, Yongwei Nie
, Guanghui Wang
:
Disentangled Representation Learning for Controllable Person Image Generation. 6065-6077 - Junxia Li
, Deshuo Shi
, Ying Cui
, Dongyan Guo
, Qingshan Liu
:
Adaptive Activation Network for Weakly Supervised Semantic Segmentation. 6078-6089 - Lizhi Xiong
, Jianhua Xu
, Ching-Nung Yang
, Xinpeng Zhang
:
CMCF-Net: An End-to-End Context Multiscale Cross-Fusion Network for Robust Copy-Move Forgery Detection. 6090-6101 - Xiuli Chai
, Yakun Ma
, Yinjing Wang
, Zhihua Gan
, Yushu Zhang
:
TPE-ADE: Thumbnail-Preserving Encryption Based on Adaptive Deviation Embedding for JPEG Images. 6102-6116 - Youze Wang
, Wenbo Hu
, Richang Hong
:
Iterative Adversarial Attack on Image-Guided Story Ending Generation. 6117-6130 - Yi Cheng
, Hehe Fan
, Dongyun Lin
, Ying Sun
, Mohan S. Kankanhalli
, Joo-Hwee Lim
:
Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering. 6131-6141 - Hao Feng
, Shaokai Liu
, Jiajun Deng
, Wengang Zhou
, Houqiang Li
:
Deep Unrestricted Document Image Rectification. 6142-6154 - Jinyang Liu
, Shutao Li
, Renwei Dian
, Ze Song
:
Focus Relationship Perception for Unsupervised Multi-Focus Image Fusion. 6155-6165 - Jiajun Huang
, Chengbin Du
, Xinqi Zhu
, Siqi Ma
, Surya Nepal
, Chang Xu
:
Anti-Compression Contrastive Facial Forgery Detection. 6166-6177 - Yuanhong Zhong
, Guangxia Yang
, Daidi Zhong
, Xun Yang
, Shanshan Wang
:
Frame-Padded Multiscale Transformer for Monocular 3D Human Pose Estimation. 6191-6201 - Siyu Zhang
, Yeming Chen
, Yaoru Sun
, Fang Wang
, Haibo Shi
, Haoran Wang
:
LOIS: Looking Out of Instance Semantics for Visual Question Answering. 6202-6214 - Shankhanil Mitra
, Saiyam Jogani
, Rajiv Soundararajan
:
Semi-Supervised Learning of Perceptual Video Quality by Generating Consistent Pairwise Pseudo-Ranks. 6215-6227 - Zizheng Xun
, Shangzhe Di
, Yulu Gao, Zongheng Tang
, Gang Wang
, Si Liu
, Bo Li
:
Linker: Learning Long Short-term Associations for Robust Visual Tracking. 6228-6237 - Honglei Su
, Qi Liu
, Hui Yuan
, Qiang Cheng
, Raouf Hamzaoui
:
Support Vector Regression-Based Reduced-Reference Perceptual Quality Model for Compressed Point Clouds. 6238-6249 - Yabo Liu
, Jinghua Wang
, Weijia Wang
, Yu Hu
, Yaowei Wang
, Yong Xu
:
CRADA: Cross Domain Object Detection With Cyclic Reconstruction and Decoupling Adaptation. 6250-6261 - Bo Peng
, Guoting Lin
, Jianjun Lei
, Tianyi Qin
, Xiaochun Cao
, Nam Ling
:
Contrastive Multi-View Learning for 3D Shape Clustering. 6262-6272 - Xi Yang
, Menghui Tian, Meijie Li, Ziyu Wei
, Liu Yuan, Nannan Wang
, Xinbo Gao
:
SSRR: Structural Semantic Representation Reconstruction for Visible-Infrared Person Re-Identification. 6273-6284 - Yabo Liu
, Jinghua Wang
, Sheng-hua Zhong
, Lianyang Ma
, Yong Xu
:
Fine-Grained Representation Alignment for Zero-Shot Domain Adaptation. 6285-6296 - Ming Li
, Huazhu Fu
, Shengfeng He
, Hehe Fan
, Jun Liu
, Jussi Keppo
, Mike Zheng Shou
:
DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition. 6297-6309 - Nana Yu
, Hong Shi
, Yahong Han
:
Joint Correcting and Refinement for Balanced Low-Light Image Enhancement. 6310-6324 - Chenghao Xu
, Jiexi Yan
, Yanhua Yang
, Cheng Deng
:
Implicit Compositional Generative Network for Length-Variable Co-Speech Gesture Synthesis. 6325-6335 - Dayoung Chun
, Seungil Lee, Hyun Kim
:
USD: Uncertainty-Based One-Phase Learning to Enhance Pseudo-Label Reliability for Semi-Supervised Object Detection. 6336-6347 - Ying Lv
, Zhi Liu
, Gongyang Li
:
Context-Aware Interaction Network for RGB-T Semantic Segmentation. 6348-6360 - Qibing Qin
, Yadong Huo
, Lei Huang
, Jiangyan Dai
, Huihui Zhang
, Wenfeng Zhang
:
Deep Neighborhood-Preserving Hashing With Quadratic Spherical Mutual Information for Cross-Modal Retrieval. 6361-6374 - Yuxiang Lu
, Shalayiding Sirejiding
, Yue Ding
, Chunlin Wang
, Hongtao Lu
:
Prompt Guided Transformer for Multi-Task Dense Prediction. 6375-6385 - Lifang Wu
, Meng Tian
, Ye Xiang
, Ke Gu
, Ge Shi
:
Learning Label Semantics for Weakly Supervised Group Activity Recognition. 6386-6397 - Weiling Chen
, Boqin Cai
, Sumei Zheng
, Tiesong Zhao
, Ke Gu
:
Perception-and-Cognition-Inspired Quality Assessment for Sonar Image Super-Resolution. 6398-6410 - Juncheng Zhang
, Qingmin Liao
, Haoyu Ma
, Jing-Hao Xue
, Wenming Yang
, Shaojun Liu
:
Exploit the Best of Both End-to-End and Map-Based Methods for Multi-Focus Image Fusion. 6411-6423 - Long Peng
, Yang Cao
, Yuejin Sun
, Yang Wang
:
Lightweight Adaptive Feature De-Drifting for Compressed Image Classification. 6424-6436 - Jia-Nan Li
, Xiao-Qian Liu
, Xin Luo
, Xin-Shun Xu
:
VOLTER: Visual Collaboration and Dual-Stream Fusion for Scene Text Recognition. 6437-6448 - Chao Tian
, Zikun Zhou
, Yuqing Huang
, Gaojun Li
, Zhenyu He
:
Cross-Modality Proposal-Guided Feature Mining for Unregistered RGB-Thermal Pedestrian Detection. 6449-6461 - Jeong Hun Yeo
, Minsu Kim
, Jeongsoo Choi
, Dae Hoe Kim
, Yong Man Ro
:
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model. 6462-6474 - Xiaoqiang Zhou
, Huaibo Huang
, Zilei Wang
, Ran He
:
RISTRA: Recursive Image Super-Resolution Transformer With Relativistic Assessment. 6475-6487 - Xin Wen
, Weizhi Nie
, Jing Liu
, Yuting Su, Yongdong Zhang
, An-An Liu
:
CDCM: ChatGPT-Aided Diversity-Aware Causal Model for Interactive Recommendation. 6488-6500 - Runmin Cong
, Hang Xiong
, Jinpeng Chen
, Wei Zhang
, Qingming Huang
, Yao Zhao
:
Query-Guided Prototype Evolution Network for Few-Shot Segmentation. 6501-6512 - Runmin Wang
, Zhenlin Zhu
, Yanbin Zhu
, Hua Chen
, Yongzhong Liao
, Ziyu Zhu
, Yajun Ding
, Changxin Gao
, Nong Sang
:
DIMGNet: A Transformer-Based Network for Pedestrian Reidentification With Multi-Granularity Information Mutual Gain. 6513-6528 - Fei Hu
, Yibo Ma
, Wei Zhong
, Long Ye
, Xinyan Yang
, Li Fang
, Qin Zhang
:
A Dataset and Benchmark for 3D Scene Plausibility Assessment. 6529-6541 - Qi Cui
, Zhili Zhou
, Ruohan Meng
, Shaowei Wang
, Hongyang Yan
, Q. M. Jonathan Wu
:
ARES: On Adversarial Robustness Enhancement for Image Steganographic Cost Learning. 6542-6553 - Tianyi Wang
, Zian Li
, Ruixia Liu
, Yinglong Wang
, Liqiang Nie
:
An Efficient Attribute-Preserving Framework for Face Swapping. 6554-6565 - Yan Zhang
, Lu Zhang
, Xin Zhao
, Hongyong Fu
, Dequan Yu
:
Automatic Point Cloud Registration for 3D Virtual-to-Real Registration Using Macro and Micro Structures. 6566-6581 - Naiyu Fang
, Lemiao Qiu
, Shuyou Zhang
, Zili Wang
, Kerui Hu
:
PG-VTON: A Novel Image-Based Virtual Try-On Method via Progressive Inference Paradigm. 6595-6608 - Dixuan Lin
, Yi-Xing Peng
, Jingke Meng
, Wei-Shi Zheng
:
Cross-Modal Adaptive Dual Association for Text-to-Image Person Retrieval. 6609-6620 - Bing Cai
, Gui-Fu Lu
, Hua Li
, Weihong Song
:
Tensorized Scaled Simplex Representation for Multi-View Clustering. 6621-6631 - Kang Chen
, Lei Yu
:
Motion Deblur by Learning Residual From Events. 6632-6647 - Yonghua Pan
, Jing Liu
, Lu Jin
, Zechao Li
:
Unbiased Visual Question Answering by Leveraging Instrumental Variable. 6648-6662 - Yuezhou Li
, Rui Xu
, Yuzhen Niu
, Wenzhong Guo
, Tiesong Zhao
:
Perceptual Decoupling With Heterogeneous Auxiliary Tasks for Joint Low-Light Image Enhancement and Deblurring. 6663-6675 - Yuefang Gao
, Yuhao Xie
, Zeke Zexi Hu
, Tianshui Chen
, Liang Lin
:
Adaptive Global-Local Representation Learning and Selection for Cross-Domain Facial Expression Recognition. 6676-6688 - Ning Xu
, Zimu Lu
, Hongshuo Tian
, Rongbao Kang
, Jinbo Cao
, Yongdong Zhang
, An-An Liu
:
Learning to Supervise Knowledge Retrieval Over a Tree Structure for Visual Question Answering. 6689-6700 - Shuai Guo
, Jingchuan Hu
, Kai Zhou
, Jionghao Wang
, Li Song
, Rong Xie
, Wenjun Zhang
:
Real-Time Free Viewpoint Video Synthesis System Based on DIBR and a Depth Estimation Network. 6701-6716 - Yuxiang Shao
, Feifei Zhang
, Changsheng Xu
:
Snippet-to-Prototype Contrastive Consensus Network for Weakly Supervised Temporal Action Localization. 6717-6729 - Ali Ak
, Emin Zerman
, Maurice Quach
, Aladine Chetouani
, Aljosa Smolic
, Giuseppe Valenzise
, Patrick Le Callet
:
BASICS: Broad Quality Assessment of Static Point Clouds in a Compression Scenario. 6730-6742 - Junpeng Tan
, Xiaojun Yang
, Zhijing Yang
, Ruihan Chen
, Yongyi Lu
, Liang Lin
:
Extensible Max-Min Collaborative Retention for Online Mini-Batch Learning Hash Retrieval. 6743-6758 - Fanfan Ji
, Xiao-Tong Yuan
, Qingshan Liu
:
Soft Weight Pruning for Cross-Domain Few-Shot Learning With Unlabeled Target Data. 6759-6769 - Pei He
, Licheng Jiao
, Fang Liu
, Xu Liu
, Ronghua Shang
, Shuang Wang
:
Cross-Domain Scene Unsupervised Learning Segmentation With Dynamic Subdomains. 6770-6784 - Zixin Yin
, Jiakai Wang
, Yisong Xiao
, Hanqing Zhao
, Tianlin Li
, Wenbo Zhou
, Aishan Liu
, Xianglong Liu
:
Improving Deepfake Detection Generalization by Invariant Risk Minimization. 6785-6798 - Pei An
, Di Zhu, Siwen Quan
, Junfeng Ding
, Jie Ma
, You Yang
, Qiong Liu
:
ESC-Net: Alleviating Triple Sparsity on 3D LiDAR Point Clouds for Extreme Sparse Scene Completion. 6799-6810 - Myung Han Hyun
, Bumshik Lee
, Munchurl Kim
:
A VVC Intra Rate Control With Small Bit Fluctuations Using a Lagrange Multiplier Adjustment. 6811-6821 - Haofan Lu
, Shuiping Gou
, Ruimin Li
:
SPMHand: Segmentation-Guided Progressive Multi-Path 3D Hand Pose and Shape Estimation. 6822-6833 - Junqi Liao
, Li Li
, Dong Liu
, Houqiang Li
:
Content-Adaptive Rate-Distortion Modeling for Frame-Level Rate Control in Versatile Video Coding. 6864-6879 - Jianxin Lin
, Wei Zhao
, Yijun Wang
:
Visual Correspondence Learning and Spatially Attentive Synthesis via Transformer for Exemplar-Based Anime Line Art Colorization. 6880-6890 - Yuwu Lu
, Haoyu Huang
, Biqing Zeng
, Zhihui Lai
, Xuelong Li
:
Multi-Source and Multi-Target Domain Adaptation Based on Dynamic Generator with Attention. 6891-6905 - Wenxuan Wang
, Xingjian He
, Yisi Zhang
, Longteng Guo
, Jiachen Shen
, Jiangyun Li
, Jing Liu
:
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation. 6906-6916 - Jui-Chiu Chiang
, Yu-Tze Wu
, Hsin-Yun Hsieh
, Yun-Chang Tsai:
Enhanced Temporal Consistency for Global Patch Allocation in Video-Based Point Cloud Compression. 6917-6930 - Jianan Li
, Jie Wang
, Tingfa Xu
:
PointGL: A Simple Global-Local Framework for Efficient Point Cloud Analysis. 6931-6942 - Yaochi Zhao
, Sen Chen
, Shiguang Liu
, Zhuhua Hu
, Jingwen Xia
:
Hierarchical Equalization Loss for Long-Tailed Instance Segmentation. 6943-6955 - Bing Yang
, Xueqin Xiang
, Wanzeng Kong
, Jianhai Zhang
, Yong Peng
:
DMF-GAN: Deep Multimodal Fusion Generative Adversarial Networks for Text-to-Image Synthesis. 6956-6967 - Jiankai Li
, Yunhong Wang
, Weixin Li
:
MHRN: A Multimodal Hierarchical Reasoning Network for Topic Detection. 6968-6980 - Fu-Zhao Ou
, Xingyu Chen
, Kai Zhao
, Shiqi Wang
, Yuan-Gen Wang
, Sam Kwong
:
Refining Uncertain Features With Self-Distillation for Face Recognition and Person Re-Identification. 6981-6995 - Sihui Zhang
, Yi Tian
, Yilei Zhang
, Mei Tian
, Yaping Huang
:
Domain-Consistent and Uncertainty-Aware Network for Generalizable Gaze Estimation. 6996-7011 - Zizheng Yang
, Jie Huang
, Man Zhou
, Naishan Zheng
, Feng Zhao
:
IRVR: A General Image Restoration Framework for Visual Recognition. 7012-7026 - Ronghui Zhang
, Jiongze Yu
, Junzhou Chen
, Guofa Li
, Liang Lin
, Danwei Wang
:
A Prior Guided Wavelet-Spatial Dual Attention Transformer Framework for Heavy Rain Image Restoration. 7043-7057 - Yi Huang
, Jiancheng Huang
, Jianzhuang Liu
, Mingfu Yan
, Yu Dong
, Jiaxi Lv
, Chaoqi Chen
, Shifeng Chen
:
WaveDM: Wavelet-Based Diffusion Models for Image Restoration. 7058-7073 - Limin Zheng
, Yu Luo
, Zihan Zhou
, Jie Ling
, Guanghui Yue
:
CDINet: Content Distortion Interaction Network for Blind Image Quality Assessment. 7089-7100 - Jianwei Lu
, Guohua Wang, Yi Cai
, Xin Wu
:
Towards Automated Infographic Authoring From Natural Language Statement With Multiple Proportional Facts. 7101-7113 - Xiaofei Zhou
, Zhicong Wu
, Runmin Cong
:
Decoupling and Integration Network for Camouflaged Object Detection. 7114-7129 - Zhi Han
, Yanmei Wang
, Shaojie Zhang, Huijie Fan
, Yandong Tang
, Yao Wang
:
Online Video Sparse Noise Removing via Nonlocal Robust PCA. 7130-7145 - Xiaoyu Guo
, Wei Xiang
, Shunli Zhang
, Wei Lu
, Weiwei Xing
:
DCRP: Class-Aware Feature Diffusion Constraint and Reliable Pseudo-Labeling for Imbalanced Semi-Supervised Learning. 7146-7159 - Xiaoqian Zhang
, Chao Luo
, Xiao Wang, Jinghao Li, Shuai Zhao
, Daojian Jiang:
Learnable Tensor Graph Fusion Framework for Natural Image Segmentation. 7160-7173 - Wenjun Hui
, Zhenfeng Zhu
, Guanghua Gu
, Meiqin Liu
, Yao Zhao
:
Implicit-Explicit Motion Learning for Video Camouflaged Object Detection. 7188-7196 - Shuai Chen
, Fanman Meng
, Runtong Zhang
, Heqian Qiu
, Hongliang Li
, Qingbo Wu
, Linfeng Xu
:
Visual and Textual Prior Guided Mask Assemble for Few-Shot Segmentation and Beyond. 7197-7209 - Jin Yuan
, Feng Hou
, Ying Yang
, Yang Zhang
, Zhongchao Shi
, Xin Geng
, Jianping Fan
, Zhiqiang He
, Yong Rui
:
Domain-Aware Graph Network for Bridging Multi-Source Domain Adaptation. 7210-7224 - Chengyang Li
, Baoping Cheng
, Yao Cheng
, Haocheng Zhang
, Renshuai Liu
, Yinglin Zheng
, Jing Liao
, Xuan Cheng
:
FaceRefiner: High-Fidelity Facial Texture Refinement With Differentiable Rendering-Based Style Transfer. 7225-7236 - Xi Yang
, Xian Wang
, Liangchen Liu
, Nannan Wang
, Xinbo Gao
:
STFE: A Comprehensive Video-Based Person Re-Identification Network Based on Spatio-Temporal Feature Enhancement. 7237-7249 - Minglu Zhao
, Wenmin Wang
, Tongbao Chen
, Rui Zhang
, Ruochen Li
:
TA2V: Text-Audio Guided Video Generation. 7250-7264 - Ziqi Yuan
, Baozheng Zhang
, Hua Xu
, Kai Gao
:
Meta Noise Adaption Framework for Multimodal Sentiment Analysis With Feature Noise. 7265-7277 - Xin Liu
, Yuting Zhang
, Zitong Yu
, Hao Lu
, Huanjing Yue
, Jingyu Yang
:
rPPG-MAE: Self-Supervised Pretraining With Masked Autoencoders for Remote Physiological Measurements. 7278-7293 - Lingyun Song
, Siyu Chen
, Ziyang Meng
, Mingxuan Sun
, Xuequn Shang
:
FMSA-SC: A Fine-Grained Multimodal Sentiment Analysis Dataset Based on Stock Comment Videos. 7294-7306 - Xu Lu, Li Liu, Lixin Ning, Liang Zhang, Shaomin Mu, Huaxiang Zhang:
Multi-Facet Weighted Asymmetric Multi-Modal Hashing Based on Latent Semantic Distribution. 7307-7320 - Anqi Liu
, Sumei Li
, Yongli Chang
, Wenlin Zhang
, Yonghong Hou
:
Coarse-to-Fine Cross-View Interaction Based Accurate Stereo Image Super-Resolution Network. 7321-7334 - Ludan Sun
, Kai Zhang
, Feng Zhang
, Wenbo Wan
, Jiande Sun
:
Deep Rank-N Decomposition Network for Image Fusion. 7335-7348 - Jiwei Wei
, Yang Yang
, Xiang Guan
, Xing Xu
, Guoqing Wang
, Heng Tao Shen
:
Runge-Kutta Guided Feature Augmentation for Few-Sample Learning. 7349-7358 - Wenjie Zhu
, Bo Peng
, Wei Qi Yan
:
Dual Knowledge Distillation on Multiview Pseudo Labels for Unsupervised Person Re-Identification. 7359-7371 - Zhe Zhang
, Marc St-Hilaire
, Xin Wei
, Haiwei Dong
, Abdulmotaleb El-Saddik
:
How to Cache Important Contents for Multi-Modal Service in Dynamic Networks: A DRL-Based Caching Scheme. 7372-7385 - Lin Liu
, Junfeng An
, Shanxin Yuan
, Wengang Zhou
, Houqiang Li
, Yanfeng Wang
, Qi Tian
:
Video Demoiréing With Deep Temporal Color Embedding and Video-Image Invertible Consistency. 7386-7397 - Qiang Li
, Guang Zu
, Hui Xu
, Jun Kong
, Yanni Zhang
, Jianzhong Wang
:
An Adaptive Dual Selective Transformer for Temporal Action Localization. 7398-7412 - Wenhui Zhao
, Qin Li
, Huafu Xu, Quanxue Gao
, Qianqian Wang
, Xinbo Gao
:
Anchor Graph-Based Feature Selection for One-Step Multi-View Clustering. 7413-7425 - Huafeng Liu
, Mengmeng Sheng
, Zeren Sun
, Yazhou Yao
, Xian-Sheng Hua
, Heng Tao Shen
:
Learning With Imbalanced Noisy Data by Preventing Bias in Sample Selection. 7426-7437 - Jingru Duan
, Yanbin Hao
, Bin Zhu
, Lechao Cheng
, Pengyuan Zhou
, Xiang Wang
:
Efficient Unsupervised Video Hashing With Contextual Modeling and Structural Controlling. 7438-7450 - Shuo Yang
, Xinxiao Wu
, Zirui Shang
, Jiebo Luo
:
Dynamic Pathway for Query-Aware Feature Learning in Language-Driven Action Localization. 7451-7461 - Nan Gao
, Renyuan Yao
, Ronghua Liang
, Peng Chen
, Tianshuang Liu
, Yuanjie Dang
:
Multi-Level Objective Alignment Transformer for Fine-Grained Oral Panoramic X-Ray Report Generation. 7462-7474 - Zongyi Xu
, Xinqi Jiang
, Xinyu Gao, Rui Gao, Changjun Gu, Qianni Zhang
, Weisheng Li
, Xinbo Gao
:
IGReg: Image-Geometry-Assisted Point Cloud Registration via Selective Correlation Fusion. 7475-7489 - Junhu Wang
, Yanyan Wei
, Zhao Zhang
, Jicong Fan
, Yang Zhao
, Yi Yang, Meng Wang
:
Progressive Stereo Image Dehazing Network via Cross-View Region Interaction. 7490-7502 - Jiajia Xie
, Sheng Zhang
, Beihao Xia
, Zhu Xiao
, Hongbo Jiang
, Siwang Zhou
, Zheng Qin
, Hongyang Chen
:
Pedestrian Trajectory Prediction Based on Social Interactions Learning With Random Weights. 7503-7515 - Qingguo Liu
, Pan Gao
, Kang Han
, Ningzhong Liu, Wei Xiang
:
Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution. 7516-7528 - Haiqi Liu
, C. L. Philip Chen
, Xinrong Gong
, Tong Zhang
:
Robust Saliency-Aware Distillation for Few-Shot Fine-Grained Visual Recognition. 7529-7542 - Xin Zhou
, Chunyan Miao
:
Disentangled Graph Variational Auto-Encoder for Multimodal Recommendation With Interpretability. 7543-7554 - Shanmin Pang
, Yueyang Zeng
, Jiawei Zhao
, Jianru Xue
:
A Mutually Textual and Visual Refinement Network for Image-Text Matching. 7555-7566 - Jinyu Cai
, Yunhe Zhang
, Shiping Wang
, Jicong Fan
, Wenzhong Guo
:
Wasserstein Embedding Learning for Deep Clustering: A Generative Approach. 7567-7580 - Zhuang Shao
, Jungong Han
, Kurt Debattista
, Yanwei Pang
:
DCMSTRD: End-to-end Dense Captioning via Multi-Scale Transformer Decoding. 7581-7593 - Yixuan Zhu
, Wenliang Zhao
, Yansong Tang
, Yongming Rao
, Jie Zhou
, Jiwen Lu
:
StableSwap: Stable Face Swapping in a Shared and Controllable Latent Space. 7594-7607 - Yifan Wang
, Liyuan Liu
, Chun Yuan
, Minbo Li
, Jing Liu
:
Negative-Sensitive Framework With Semantic Enhancement for Composed Image Retrieval. 7608-7621 - Ruohao Guo
, Xianghua Ying
, Yanyu Qi
, Liao Qu
:
UniTR: A Unified TRansformer-Based Framework for Co-Object and Multi-Modal Saliency Detection. 7622-7635 - Xi Luo
, Min Jiang
, Jun Kong
, Xuefeng Tao
:
Hierarchical Camera-Aware Contrast Extension for Unsupervised Person Re-Identification. 7636-7648 - Shuang Chen
, Amir Atapour-Abarghouei
, Hubert P. H. Shum
:
HINT: High-Quality INpainting Transformer With Mask-Aware Encoding and Enhanced Attention. 7649-7660 - Siduo Pan
, Ziqi Zhang
, Kun Wei
, Xu Yang
, Cheng Deng
:
Few-Shot Generative Model Adaptation via Style-Guided Prompt. 7661-7672 - Zhongze Wang
, Haitao Zhao
, Lujian Yao
, Jingchao Peng
, Kaijie Zhao
:
DFR-Net: Density Feature Refinement Network for Image Dehazing Utilizing Haze Density Difference. 7673-7686 - Jinkun You
, Yicong Zhou
:
Two-Stage Watermark Removal Framework for Spread Spectrum Watermarking. 7687-7699 - Xiaokun Li
, Rumeng Yi
, Yaping Huang
:
Mutual Filter Teaching for Open-Set Semi-Supervised Learning. 7700-7708 - Yuntong Tian
, Jiaxi Li
, Huazhu Fu
, Lei Zhu
, Lequan Yu
, Liang Wan
:
Self-Mining the Confident Prototypes for Source-Free Unsupervised Domain Adaptation in Image Segmentation. 7709-7720 - Yutao Liu
, Baochao Zhang
, Runze Hu
, Ke Gu
, Guangtao Zhai
, Junyu Dong
:
Underwater Image Quality Assessment: Benchmark Database and Objective Method. 7734-7747 - Shiyuan He
, Jiwei Wei
, Chaoning Zhang
, Xing Xu
, Jingkuan Song
, Yang Yang
, Heng Tao Shen
:
Boosting Adversarial Training with Hardness-Guided Attack Strategy. 7748-7760 - Haochen Han
, Qinghua Zheng
, Minnan Luo
, Kaiyao Miao
, Feng Tian
, Yan Chen
:
Noise-Tolerant Learning for Audio-Visual Action Recognition. 7761-7774 - Ardian Umam
, Cheng-Kun Yang
, Jen-Hui Chuang
, Yen-Yu Lin
:
Unsupervised Point Cloud Co-Part Segmentation via Co-Attended Superpoint Generation and Aggregation. 7775-7786 - Zhuangzhuang Zhou
, Yingying Zhu
:
RaFPN: Relation-Aware Feature Pyramid Network for Dense Image Prediction. 7787-7800 - Mingqi Shao
, Chongkun Xia
, Dongxu Duan
, Xueqian Wang
:
Polarimetric Inverse Rendering for Transparent Shapes Reconstruction. 7801-7811 - Nengzhong Yin
, Chengxu Liu
, Ruhao Tian
, Xueming Qian
:
SDPDet: Learning Scale-Separated Dynamic Proposals for End-to-End Drone-View Detection. 7812-7822 - Junjie Zhang
, Yutao Rao
, Xiaoshui Huang
, Guanyi Li
, Xin Zhou
, Dan Zeng
:
Frequency-Aware Multi-Modal Fine-Tuning for Few-Shot Open-Set Remote Sensing Scene Classification. 7823-7837 - Jingchun Zhou
, Shiyin Wang
, Zifan Lin
, Qiuping Jiang
, Ferdous Sohel
:
A Pixel Distribution Remapping and Multi-Prior Retinex Variational Model for Underwater Image Enhancement. 7838-7849 - Xiaowen Wang
, Lanjun Wang
, Yuting Su, Yongdong Zhang
, An-An Liu
:
MCDAN: A Multi-Scale Context-Enhanced Dynamic Attention Network for Diffusion Prediction. 7850-7862 - Wufei Ma
, Jiahao Li
, Bin Li
, Yan Lu
:
Uncertainty-Aware Deep Video Compression With Ensembles. 7863-7872 - Deebha Mumtaz
, Sadbhawna
, Vinit Jakhetiya
, Badri N. Subudhi
, Weisi Lin
:
Non-Subsampled Contourlet Transform and Ground-Truth Score Generation Based Quality Assessment for DIBR-Synthesized Views. 7873-7886 - Quan Zhou
, Linjie Wang
, Guangwei Gao
, Bin Kang
, Weihua Ou
, Huimin Lu
:
Boundary-Guided Lightweight Semantic Segmentation With Multi-Scale Semantic Context. 7887-7900 - Jianping Gou
, Yu Chen
, Baosheng Yu
, Jinhua Liu
, Lan Du
, Shaohua Wan
, Zhang Yi
:
Reciprocal Teacher-Student Learning via Forward and Feedback Knowledge Distillation. 7901-7916 - Shaohua Teng
, Jiangbo Li
, Luyao Teng
, Lunke Fei
, Naiqi Wu
, Wei Zhang:
Scalable Discrete and Asymmetric Unequal Length Hashing Learning for Cross-Modal Retrieval. 7917-7932 - Zhuopan Yang
, Zhenguo Yang
, Xiaoping Li
, Yi Yu
, Qing Li
, Wenyin Liu
:
A Progressive Placeholder Learning Network for Multimodal Zero-Shot Learning. 7933-7945 - Yaoqian Zhao
, Qizhi Teng
, Honggang Chen
, Shujiang Zhang
, Xiaohai He
, Yi Li
, Ray E. Sheriff
:
Activating More Information in Arbitrary-Scale Image Super-Resolution. 7946-7961 - Ge Song
, Kai Huang
, Hanwen Su
, Fengyi Song
, Ming Yang
:
Deep Ranking Distribution Preserving Hashing for Robust Multi-Label Cross-Modal Retrieval. 7027-7042 - Qihao Liang
, Ye Wang
:
Drawlody: Sketch-Based Melody Creation With Enhanced Usability and Interpretability. 7074-7088 - Haoyu Wang
, Yuhu Cheng
, Xiaomin Liu
, Xuesong Wang
:
Reinforcement Learning Based Markov Edge Decoupled Fusion Network for Fusion Classification of Hyperspectral and LiDAR. 7174-7187 - Jiachen Yang
, Shukun Ma
, Zhuo Zhang
, Yang Li
, Shuai Xiao
, Jiabao Wen
, Wen Lu
, Xinbo Gao
:
Say No to Redundant Information: Unsupervised Redundant Feature Elimination for Active Learning. 7721-7733 - Yutong Gao
, Congyan Lang
, Fayao Liu
, Yuanzhouhan Cao
, Lijuan Sun, Yunchao Wei
:
Dynamic Interaction Dilation for Interactive Human Parsing. 178-189 - Ziwei Niu
, Junkun Yuan
, Xu Ma
, Yingying Xu
, Jing Liu
, Yen-Wei Chen
, Ruofeng Tong
, Lanfen Lin
:
Knowledge Distillation-Based Domain-Invariant Representation Learning for Domain Generalization. 245-255 - Fan Feng
, Yue Ming
, Nannan Hu, Hui Yu
, Yuanan Liu
:
CSS-Net: A Consistent Segment Selection Network for Audio-Visual Event Localization. 701-713 - Hui Ma
, Jian Wang
, Hongfei Lin
, Bo Zhang
, Yi-Jia Zhang
, Bo Xu
:
A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations. 776-788 - Xipeng Chen, Junzheng Zhang, Keze Wang
, Pengxu Wei
, Liang Lin
:
Multi-Person 3D Pose Estimation With Occlusion Reasoning. 878-889 - Qiushi Zhu
, Long Zhou
, Ziqiang Zhang
, Shujie Liu, Binxing Jiao
, Jie Zhang
, Li-Rong Dai
, Daxin Jiang, Jinyu Li
, Furu Wei:
VatLM: Visual-Audio-Text Pre-Training With Unified Masked Prediction for Speech Representation Learning. 1055-1064 - Zhenyang Li
, Yangyang Guo
, Kejie Wang
, Fan Liu
, Liqiang Nie
, Mohan S. Kankanhalli
:
Learning to Agree on Vision Attention for Visual Commonsense Reasoning. 1065-1075 - Chen Zhang
, Yi Ren, Kejun Zhang
, Shuicheng Yan
:
SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation. 1681-1689 - Congrui Li
, Ziqiang Zheng
, Yi Bin
, Guoqing Wang
, Yang Yang
, Xuesheng Li
, Heng Tao Shen
:
Pixel Bleach Network for Detecting Face Forgery Under Compression. 2585-2597 - Jiaming Zhou
, Kun-Yu Lin
, Yukun Qiu
, Wei-Shi Zheng
:
TwinFormer: Fine-to-Coarse Temporal Modeling for Long-Term Action Recognition. 2715-2728 - Yu Tian
, Baoliang Chen
, Shiqi Wang
, Sam Kwong
:
Towards Thousands to One Reference: Can We Trust the Reference Image for Quality Assessment? 3278-3290 - Ilwi Yun
, Hyuk-Jae Lee
, Chae-Eun Rhee
:
Adversarial Mixture Density Network and Uncertainty-Based Joint Learning for 360° Monocular Depth Estimation. 3592-3603 - Min Tan
, Yinfu Feng
, Lingqiang Chu
, Jingcheng Shi
, Rong Xiao
, Haihong Tang
, Jun Yu
:
FedSea: Federated Learning via Selective Feature Alignment for Non-IID Multimodal Data. 5807-5822 - Wujin Li
, Bin-Bin Gao
, Bizhong Xia, Jinbao Wang
, Jun Liu, Yong Liu, Chengjie Wang
, Feng Zheng
:
Cross-Modal Alternating Learning With Task-Aware Representations for Continual Learning. 5911-5924 - Zixiao Yu
, Xinyi Wu
, Haohong Wang
, Aggelos K. Katsaggelos, Jian Ren
:
Automated Adaptive Cinematography for User Interaction in Open World. 6178-6190 - Huan Zhang
, Dongsheng Zheng
, Yun Zhang
, Jiangzhong Cao
, Weisi Lin
, Wing-Kuen Ling
:
Quality Assessment for DIBR-Synthesized Views Based on Wavelet Transform and Gradient Magnitude Similarity. 6834-6847 - Daniel Becking
, Karsten Müller
, Paul Haase
, Heiner Kirchhoffer
, Gerhard Tech
, Wojciech Samek
, Heiko Schwarz
, Detlev Marpe
, Thomas Wiegand
:
Neural Network Coding of Difference Updates for Efficient Distributed Learning Communication. 6848-6863 - Changhong Fu
, Jin Jin, Fangqiang Ding
, Yiming Li
, Geng Lu:
Spatial Reliability Enhanced Correlation Filter: An Efficient Approach for Real-Time UAV Tracking. 4123-4137 - Boyang Li
, Fei Zhang
, Longguang Wang
, Yingqian Wang
, Ting Liu
, Zaiping Lin
, Wei An
, Yulan Guo
:
DDAug: Differentiable Data Augmentation for Weakly Supervised Semantic Segmentation. 4764-4775 - Yujian Feng
, Feng Chen
, Jian Yu
, Yimu Ji
, Fei Wu
, Tianliang Liu
, Shangdong Liu
, Xiao-Yuan Jing
, Jiebo Luo
:
Cross-Modality Spatial-Temporal Transformer for Video-Based Visible-Infrared Person Re-Identification. 6582-6594 - Peipei Li
, Xing Cui
, Yibo Hu
, Man Zhang
, Ting Yao
, Tao Mei
:
Bidirectional Knowledge Reconfiguration for Lightweight Point Cloud Analysis. 7962-7972 - Jing Nie
, Yanwei Pang
, Jin Xie
, Jungong Han
, Xuelong Li
:
Binocular Image Dehazing via a Plain Network Without Disparity Estimation. 7973-7986 - Jielong Lu
, Zhihao Wu
, Luying Zhong
, Zhaoliang Chen
, Hong Zhao
, Shiping Wang
:
Generative Essential Graph Convolutional Network for Multi-View Semi-Supervised Classification. 7987-7999 - Dong Liu
, Xiaofeng Wang
, Ruidong Han
, Ningning Bai
, Jianpeng Hou
, Shanmin Pang
:
CTE-Net: Contextual Texture Enhancement Network for Image Super-Resolution. 8000-8013 - Jing Liu
, Lele Sun
, Weizhi Nie
, Yuting Su, Yongdong Zhang
, Anan Liu
:
Inter- and Intra-Domain Potential User Preferences for Cross-Domain Recommendation. 8014-8025 - Quan Wang, Zichi Wang
, Xinpeng Zhang
, Guorui Feng
:
Art Image Inpainting With Style-Guided Dual-Branch Inpainting Network. 8026-8037 - Yuanling Lv, Guangyu Huang
, Yan Yan
, Jing-Hao Xue
, Si Chen
, Hanzi Wang
:
Visual-Textual Attribute Learning for Class-Incremental Facial Expression Recognition. 8038-8051 - Huang Zhang
, Changshuo Wang
, Long Yu
, Shengwei Tian
, Xin Ning
, Joel J. P. C. Rodrigues:
PointGT: A Method for Point-Cloud Classification and Segmentation Based on Local Geometric Transformation. 8052-8062 - Lei Xing
, Yawen Song
, Badong Chen
, Changyuan Yu
, Jing Qin
:
Incomplete Multi-View Clustering via Correntropy and Complement Consensus Learning. 8063-8076 - Hui Xiao
, Yuting Hong
, Li Dong
, Diqun Yan
, Junjie Xiong
, Jiayan Zhuang
, Dongtai Liang
, Chengbin Peng
:
Multi-Level Label Correction by Distilling Proximate Patterns for Semi-Supervised Semantic Segmentation. 8077-8087 - Peng Lin
, Yafei Wang
, Yuanyuan Li
, Zihao Fan
, Xianping Fu
:
Underwater Color Correction Network With Knowledge Transfer. 8088-8103 - Wandong Zhang
, Yimin Yang
, Zeng Li
, Q. M. Jonathan Wu
:
Progressive Learning Model for Big Data Analysis Using Subnetwork and Moore-Penrose Inverse. 8104-8118 - Hongliang Bi
, Wenbo Zhang
, Shuaihao Li
, Yanjiao Chen
, Chaoyang Zhou
, Tang Zhou
:
SmartSit: Sitting Posture Recognition Through Acoustic Sensing on Smartphones. 8119-8130 - Jiangtao Zhang
, Qingshan Wang
, Qi Wang
:
A Sign Language Recognition Framework Based on Cross-Modal Complementary Information Fusion. 8131-8144 - Mingyu Li
, Tao Zhou
, Bo Han
, Tongliang Liu
, Xinkai Liang, Jiajia Zhao, Chen Gong
:
Class-Wise Contrastive Prototype Learning for Semi-Supervised Classification Under Intersectional Class Mismatch. 8145-8156 - Zhiyu Pan
, Jiahao Cui
, Kewei Wang
, Yizheng Wu
, Zhiguo Cao
:
Pseudo Label Fusion With Uncertainty Estimation for Semi-Supervised Cropping Box Regression. 8157-8171 - Xi Yang
, Wenjiao Dong
, Meijie Li, Ziyu Wei
, Nannan Wang
, Xinbo Gao
:
Cooperative Separation of Modality Shared-Specific Features for Visible-Infrared Person Re-Identification. 8172-8183 - De Cheng
, Yanling Ji
, Dong Gong
, Yan Li
, Nannan Wang
, Junwei Han
, Dingwen Zhang
:
Continual All-in-One Adverse Weather Removal With Knowledge Replay on a Unified Network Structure. 8184-8196 - Shenghai Yuan
, Jijia Chen
, Wenchao Jiang
, Zhiming Zhao
, Song Guo
:
LHNetV2: A Balanced Low-Cost Hybrid Network for Single Image Dehazing. 8197-8209 - Bingshan Zhu
, Yi Cai
, Jiexin Wang
:
Graph-Based Multimodal Topic Modeling With Word Relations and Object Relations. 8210-8225 - Ayan Banerjee
, Shivakumara Palaiahnakote
, Umapada Pal
, Apostolos Antonacopoulos
, Tong Lu
, Josep Lladós
:
TTS: Hilbert Transform-Based Generative Adversarial Network for Tattoo and Scene Text Spotting. 8226-8241 - Yingdong Ma
, Xiaoyu Hu
:
TFRNet: Semantic Segmentation Network with Token Filtration and Refinement Method. 8242-8254 - Xiaofeng Qu
, Huaxiang Zhang
, Lei Zhu
, Liqiang Nie
, Li Liu
:
AAMT: Adversarial Attack-Driven Mutual Teaching for Source-Free Domain-Adaptive Person Reidentification. 8255-8267 - Xiaoli Wang
, Yongli Wang
, Yupeng Wang
, Anqi Huang
, Jun Liu
:
Trusted Semi-Supervised Multi-View Classification With Contrastive Learning. 8268-8278 - Min-Jung Shin
, Woojune Park, Minji Cho
, Kyeongbo Kong, Hoseong Son
, Joonsoo Kim
, Kugjin Yun, Gwangsoon Lee, Suk-Ju Kang
:
MosaicMVS: Mosaic-Based Omnidirectional Multi-View Stereo for Indoor Scenes. 8279-8290 - Wengang Zhou
, Jiajun Deng, Niculae Sebe, Qi Tian, Alan L. Yuille, Concetto Spampinato, Zakia Hammal:
Guest Editorial Introduction to the Issue on Pre-Trained Models for Multi-Modality Understanding. 8291-8296 - Tong Zhang
, Xiankai Lu
, Hao Zhang
, Xiushan Nie
, Yilong Yin
, Jianbing Shen
:
Relational Network via Cascade CRF for Video Language Grounding. 8297-8311 - Xingyu Gao
, Xi Wang
, Zhenyu Chen
, Wei Zhou
, Steven C. H. Hoi
:
Knowledge Enhanced Vision and Language Model for Multi-Modal Fake News Detection. 8312-8322 - Liang Zhao
, Pingda Huang
, Tengtuo Chen
, Chunjiang Fu
, Qinghao Hu
, Yangqianhui Zhang
:
Multi-Sentence Complementarily Generation for Text-to-Image Synthesis. 8323-8332 - Xin Jin
, Cuiling Lan
, Wenjun Zeng
, Zhibo Chen
:
Domain Prompt Tuning via Meta Relabeling for Unsupervised Adversarial Adaptation. 8333-8347 - Shun Qian
, Bingquan Liu
, Chengjie Sun
, Zhen Xu
, Lin Ma
, Baoxun Wang
:
CroMIC-QA: The Cross-Modal Information Complementation Based Question Answering. 8348-8359 - Yuan Tang
, Xianzhi Li
, Jinfeng Xu, Qiao Yu
, Long Hu
, Yixue Hao
, Min Chen
:
Point-LGMask: Local and Global Contexts Embedding for Point Cloud Pre-Training With Multi-Ratio Masking. 8360-8370 - Kaijian Liu
, Shixiang Tang
, Ziyue Li
, Zhishuai Li
, Lei Bai
, Feng Zhu
, Rui Zhao
:
Relation-Aware Distribution Representation Network for Person Clustering With Multiple Modalities. 8371-8382 - Jian Huang
, Yanli Ji
, Zhen Qin
, Yang Yang
, Heng Tao Shen
:
Dominant SIngle-Modal SUpplementary Fusion (SIMSUF) for Multimodal Sentiment Analysis. 8383-8394 - Yuheng Shi
, Xinxiao Wu
, Hanxi Lin
, Jiebo Luo
:
Commonsense Knowledge Prompting for Few-Shot Action Recognition in Videos. 8395-8405 - Siying Wu
, Xueyang Fu
, Feng Wu
, Zheng-Jun Zha
:
Vision-and-Language Navigation via Latent Semantic Alignment Learning. 8406-8418 - Hairui Ren
, Fan Tang
, Xingjia Pan
, Juan Cao
, Weiming Dong
, Zhiwen Lin
, Ke Yan
, Changsheng Xu
:
A²Pt: Anti-Associative Prompt Tuning for Open Set Visual Recognition. 8419-8431 - Chengzhi Wu
, Julius Pfrommer
, Mingyuan Zhou
, Jürgen Beyerer
:
Self-Supervised Generative-Contrastive Learning of Multi-Modal Euclidean Input for 3D Shape Latent Representations: A Dynamic Switching Approach. 8432-8441 - Son Duy Dao
, Hengcan Shi
, Dinh Q. Phung
, Jianfei Cai
:
Class Enhancement Losses With Pseudo Labels for Open-Vocabulary Semantic Segmentation. 8442-8453 - Jiashuo Yu
, Junfu Pu
, Ying Cheng
, Rui Feng
, Ying Shan
:
Learning Music-Dance Representations Through Explicit-Implicit Rhythm Synchronization. 8454-8463 - Hai Liu
, Tingting Liu
, Yu Chen
, Zhaoli Zhang
, You-Fu Li
:
EHPE: Skeleton Cues-Based Gaussian Coordinate Encoding for Efficient Human Pose Estimation. 8464-8475 - Tianyi Zhang
, Ronglu Li
, Pengming Feng
, Rubo Zhang
:
Integration of Global and Local Knowledge for Foreground Enhancing in Weakly Supervised Temporal Action Localization. 8476-8487 - Nianzhen Gao
, Guanghua Liu
, Mingjie Feng
, Xinhai Hua
, Tao Jiang
:
Non-Orthogonal Multiple Access Enhanced Scalable 360-Degree Video Multicast. 8488-8503 - Yoonchan Nam
, JoonKyu Kim
, Jae-hun Shim
, Suk-Ju Kang
:
Deep Conditional HDRI: Inverse Tone Mapping via Dual Encoder-Decoder Conditioning Method. 8504-8515 - Yingchen Yu
, Rongliang Wu
, Yifang Men
, Shijian Lu
, Miaomiao Cui, Xuansong Xie
, Chunyan Miao
:
MorphNeRF: Text-Guided 3D-Aware Editing via Morphing Generative Neural Radiance Fields. 8516-8528 - Tao Wang
, Mengyuan Liu
, Hong Liu
, Wenhao Li
, Miaoju Ban
, Tianyu Guo
, Yidi Li
:
Feature Completion Transformer for Occluded Person Re-Identification. 8529-8542 - Qianhao Wu
, Jiaxin Qi
, Dong Zhang
, Hanwang Zhang
, Jinhui Tang
:
Fine-Tuning for Few-Shot Image Classification by Multimodal Prototype Regularization. 8543-8556 - Tao Xiang
, Hongyan Pan
, Zhixiong Nan
:
Video Violence Rating: A Large-Scale Public Database and A Multimodal Rating Model. 8557-8568 - Likun Gao
, Hai-Miao Hu
, Xinhui Xue
, Haoxin Hu
:
From Appearance to Inherence: A Hyperspectral Image Dataset and Benchmark of Material Classification for Surveillance. 8569-8580 - Wenhui Hong
, Hao Zhang
, Jiayi Ma
:
OFPF-MEF: An Optical Flow Guided Dynamic Multi-Exposure Image Fusion Network With Progressive Frequencies Learning. 8581-8595 - Yixuan Li, Bolin Chen, Baoliang Chen, Meng Wang, Shiqi Wang, Weisi Lin:
Perceptual Quality Assessment of Face Video Compression: A Benchmark and An Effective Method. 8596-8608 - Ali Vosoughi
, Shijian Deng
, Songyang Zhang
, Yapeng Tian
, Chenliang Xu
, Jiebo Luo
:
Cross Modality Bias in Visual Question Answering: A Causal View With Possible Worlds VQA. 8609-8624 - Lin Yang
, Dawen Xu
, Jiangbo Qian
, Rangding Wang
:
Quad-Tree Structure-Preserving Adaptive Steganography for HEVC. 8625-8638 - Bochen Xie
, Yongjian Deng
, Zhanpeng Shao
, Youfu Li
:
EISNet: A Multi-Modal Fusion Network for Semantic Segmentation With Events and Images. 8639-8650 - Yuxin Zhou
, Chenguang Liu
, Yulong Ding
, Diping Yuan
, Jiyao Yin, Shuang-Hua Yang
:
Crowd Descriptors and Interpretable Gathering Understanding. 8651-8664 - Zeyu Ma
, Yuqi Li
, Yizhi Luo
, Xiao Luo
, Jinxing Li
, Chong Chen
, Xian-Sheng Hua
, Guangming Lu
:
Discrepancy and Structure-Based Contrast for Test-Time Adaptive Retrieval. 8665-8677 - Xiangyang Li
, Shiguo Chen
, Chunna Tian
, Heng Zhou
, Zhenxi Zhang
:
M2FNet: Mask-Guided Multi-Level Fusion for RGB-T Pedestrian Detection. 8678-8690 - Yun Gao
, Dan Wu
, Liang Zhou
:
How to Improve Immersive Experience? 8691-8703 - Hengcan Shi
, Munawar Hayat
, Jianfei Cai
:
Unified Open-Vocabulary Dense Visual Prediction. 8704-8716 - Zhikai Chen
, Fuchen Long
, Zhaofan Qiu
, Ting Yao
, Wengang Zhou
, Jiebo Luo
, Tao Mei
:
Learning 3D Shape Latent for Point Cloud Completion. 8717-8729 - Junfeng Tu
, Xueliang Liu
, Yanbin Hao
, Richang Hong
, Meng Wang
:
Two-Step Discrete Hashing for Cross-Modal Retrieval. 8730-8741 - Lohic Fotio Tiotsop
, Antonio Servetti
, Marcus Barkowsky
, Enrico Masala
:
Modeling Subject Scoring Behaviors in Subjective Experiments Based on a Discrete Quality Scale. 8742-8757 - Jin Huang
, Yongshun Gong
, Yang Shi
, Xinxin Zhang
, Jian Zhang
, Yilong Yin
:
Focusing on Subtle Differences: A Feature Disentanglement Model for Series Photo Selection. 8758-8770 - Jiaxin Huang
, Kecheng Chen
, Yazhou Ren
, Jiayu Sun, Xiaorong Pu
, Ce Zhu
:
Cross-Domain Low-Dose CT Image Denoising With Semantic Preservation and Noise Alignment. 8771-8782 - De Cheng
, Yan Li
, Dingwen Zhang
, Nannan Wang
, Jiande Sun
, Xinbo Gao
:
Progressive Negative Enhancing Contrastive Learning for Image Dehazing and Beyond. 8783-8798 - Mingye Xu
, Zhipeng Zhou
, Hongbin Xu
, Yu Qiao
, Yali Wang
:
CP-Net: Contour-Perturbed Reconstruction Network for Self-Supervised Point Cloud Learning. 8799-8810 - Cong Zhang
, Wenxia Yang
, Xin Li
, Huan Han
:
MMGInpainting: Multi-Modality Guided Image Inpainting Based on Diffusion Models. 8811-8823 - Xingsen Huang
, Deshui Miao
, Hongpeng Wang
, Yaowei Wang
, Xin Li
:
Context-Guided Black-Box Attack for Visual Tracking. 8824-8835 - Ran Ran
, Liang-Jian Deng
, Tian-Jing Zhang
, Jianlong Chang
, Xiao Wu
, Qi Tian
:
KNLConv: Kernel-Space Non-Local Convolution for Hyperspectral Image Super-Resolution. 8836-8848 - Jie Guo
, Longyu Wen
, Yan Zhou
, Bin Song
, Yuhao Chi
, Fei Richard Yu
:
SPACE: Self-Supervised Dual Preference Enhancing Network for Multimodal Recommendation. 8849-8859 - Jacob Chakareski
, Mahmudur Khan
:
Live 360° Video Streaming to Heterogeneous Clients in 5G Networks. 8860-8873 - Chunlin Wen
, Hui Huang
, Yan Ma
, Feiniu Yuan
, Hongqing Zhu
:
Dual-Guided Frequency Prototype Network for Few-Shot Semantic Segmentation. 8874-8888 - Zihan Fang
, Shide Du
, Zhiling Cai
, Shiyang Lan
, Chunming Wu
, Yanchao Tan
, Shiping Wang
:
Representation Learning Meets Optimization-Derived Networks: From Single-View to Multi-View. 8889-8901 - Anan Du
, Tianfei Zhou
, Shuchao Pang
, Qiang Wu
, Jian Zhang
:
PCL: Point Contrast and Labeling for Weakly Supervised Point Cloud Semantic Segmentation. 8902-8914 - Yan Shu
, Zhaofan Qiu
, Fuchen Long
, Ting Yao
, Chong-Wah Ngo
, Tao Mei
:
Learning Temporal Dynamics in Videos With Image Transformer. 8915-8927 - Jun Kong
, Jin Wang
, Liang-Chih Yu
, Xuejie Zhang
:
Multimodality Self-distillation for Fast Inference of Vision and Language Pretrained Models. 8928-8940 - Jing Yi
, Zhenzhong Chen
:
Variational Mixture of Stochastic Experts Auto-Encoder for Multi-Modal Recommendation. 8941-8954 - Junyu Lai
, Lianqiang Gan
, Junhong Zhu
, Huashuo Liu
, Lianli Gao
:
Exploring Spatial Frequency Information for Enhanced Video Prediction Quality. 8955-8968 - Wenxiang Shen
, Baoye Zhang
, Hao Xu
, XiaoHan Li
, Jun Wu
:
Multi-Space Point Geometry Compression With Progressive Relation-Aware Transformer. 8969-8980 - Huiyuan Fu
, Kuilong Cui
, Chuanming Wang
, Mengshi Qi
, Huadong Ma
:
Mutual Distillation Learning for Person Re-Identification. 8981-8995 - Wenhui Li
, Houran Zhou
, Chenyu Zhang
, Weizhi Nie
, Xuanya Li
, An-An Liu
:
Dual-Stage Uncertainty Modeling for Unsupervised Cross-Domain 3D Model Retrieval. 8996-9007 - Zijie Song
, Zhenzhen Hu
, Yuanen Zhou
, Ye Zhao
, Richang Hong
, Meng Wang
:
Embedded Heterogeneous Attention Transformer for Cross-Lingual Image Captioning. 9008-9020 - Qihua Li
, Xing Tian
, Wing W. Y. Ng
:
Self-Supervised Temporal Sensitive Hashing for Video Retrieval. 9021-9035 - Teng Sun
, Yinwei Wei
, Juntong Ni
, Zixin Liu
, Xuemeng Song
, Yaowei Wang
, Liqiang Nie
:
Muti-Modal Emotion Recognition via Hierarchical Knowledge Distillation. 9036-9046 - Tong Qiao
, Shichuang Xie
, Yanli Chen
, Florent Retraint
, Ran Shi
, Xiangyang Luo
:
Deepfake Detection Fighting Against Noisy Label Attack. 9047-9059 - Miaojie Feng
, Hao Jia
, Zengqiang Yan
, Xin Yang
:
APCAFlow: All-Pairs Cost Volume Aggregation for Optical Flow Estimation. 9060-9069 - Ming Jin
, Changde Du
, Huiguang He
, Ting Cai
, Jinpeng Li
:
PGCN: Pyramidal Graph Convolutional Network for EEG Emotion Recognition. 9070-9082 - Guanyu Gao
, Yuqi Dong
, Ran Wang
, Xin Zhou
:
EdgeVision: Towards Collaborative Video Analytics on Distributed Edges for Performance Maximization. 9083-9094 - Shanshan Du
, Hanli Wang
, Tengpeng Li
, Chang Wen Chen
:
Hybrid Graph Reasoning With Dynamic Interaction for Visual Dialog. 9095-9108 - Hongmin Cai
, Bin Zhang
, Junyu Li
, Bin Hu
, Jiazhou Chen
:
Unsupervised Dual Hashing Coding (UDC) on Semantic Tagging and Sample Content for Cross-Modal Retrieval. 9109-9120 - Luntian Mou
, Haitao Xie
, Shasha Mao
, Dandan Yan
, Nan Ma
, Baocai Yin
, Wen Gao:
Image-Based Structured Vehicle Behavior Analysis Inspired by Interactive Cognition. 9121-9134 - Yang Liu
, Fang Liu
, Licheng Jiao
, Qianyue Bao
, Lingling Li
, Yuwei Guo
, Puhua Chen
:
A Knowledge-Based Hierarchical Causal Inference Network for Video Action Recognition. 9135-9149 - Song Wu
, Yan Zheng
, Yazhou Ren
, Jing He
, Xiaorong Pu
, Shudong Huang
, Zhifeng Hao
, Lifang He
:
Self-Weighted Contrastive Fusion for Deep Multi-View Clustering. 9150-9162 - Yuan Zhou
, Zhongqi Sun
, Shuwei Huo
, Sun-Yuan Kung
:
Dynamic View Aggregation for Multi-View 3D Shape Recognition. 9163-9174 - Yuling Su
, Xueliang Liu
, Ye Zhao
, Richang Hong
, Meng Wang
:
Partial-Tuning Based Mixed-Modal Prototypes for Few-Shot Classification. 9175-9186 - Chao Ding
, Mingyuan Lin
, Haijian Zhang
, Jianzhuang Liu
, Lei Yu
:
Video Frame Interpolation With Stereo Event and Intensity Cameras. 9187-9202 - Cidan Shi
, Lihuang Fang
, Han Wu
, Xiaoyu Xian
, Yukai Shi
, Liang Lin
:
NiteDR: Nighttime Image De-Raining With Cross-View Sensor Cooperative Learning for Dynamic Driving Scenes. 9203-9215 - Mingzhe Li
, Yiping Duan
, Xiaoming Tao
, Changwen Chen
:
OARNet: Object-Attribute-Relation Network for Predicting Soccer Events. 9216-9227 - Zhenkun Fan
, Zhuoxu Huang
, Zhixiang Chen
, Tao Xu
, Jungong Han
, Josef Kittler
:
Lightweight Multiperson Pose Estimation With Staggered Alignment Self-Distillation. 9228-9240 - Lvlong Lai
, Jian Chen
, Guosheng Lin
, Qingyao Wu
:
CMNet: Component-Aware Matching Network for Few-Shot Point Cloud Classification. 9241-9251 - Zhengyong Wang
, Liquan Shen
, Yihan Yu
, Hui Yuan
:
UIERL: Internal-External Representation Learning Network for Underwater Image Enhancement. 9252-9267 - Tong Zhang
, Hao Fang
, Hao Zhang
, Jialin Gao
, Xiankai Lu
, Xiushan Nie
, Yilong Yin
:
Learning Feature Semantic Matching for Spatio-Temporal Video Grounding. 9268-9279 - Hao Wu
, Jun Sun
:
Robust Image Classification With Noisy Labels by Negative Learning and Feature Space Renormalization. 9280-9291 - Silvano A. Bernabel
, Sos S. Agaian
:
NDELS: A Novel Approach for Nighttime Dehazing, Low-Light Enhancement, and Light Suppression. 9292-9303 - Hongyu Fu
, Xin Yu
, Lincheng Li
, Li Zhang
:
CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses. 9304-9315 - Yipo Huang
, Leida Li
, Pengfei Chen
, Jinjian Wu
, Yuzhe Yang
, Yaqian Li
, Guangming Shi
:
Coarse-to-Fine Image Aesthetics Assessment With Dynamic Attribute Selection. 9316-9329 - Shuhong Lin
, Moshe Zukerman
, Hong Yan
:
Music-Driven Choreography Based on Music Feature Clusters and Dynamic Programming. 9330-9341 - Andrés Altieri
, Lohic Fotio Tiotsop
, Giuseppe Valenzise
:
Subjective Media Quality Recovery From Noisy Raw Opinion Scores: A Non-Parametric Perspective. 9342-9357 - Hang Chen
, Qing Wang
, Jun Du
, Genshun Wan
, Shifu Xiong
, Baocai Yin
, Jia Pan
, Chin-Hui Lee
:
Collaborative Viseme Subword and End-to-End Modeling for Word-Level Lip Reading. 9358-9371 - Chao Jiao
, Huanqiang Zeng
, Jing Chen
, Chih-Hsien Hsia
, Tianlei Wang
, Kai-Kuang Ma
:
Width-Adaptive CNN: Fast CU Partition Prediction for VVC Screen Content Coding. 9372-9382 - Qingyu Xu, Longguang Wang, Weidong Sheng, Yingqian Wang, Chao Xiao, Chao Ma, Wei An:
Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T Videos. 9383-9397 - Yan Luo
, Muming Zhao
, Jun Sun
, Guangtao Zhai
, Chongyang Zhang
:
Consistent GT-Proposal Assignment for Challenging Pedestrian Detection. 9398-9409 - Bo Han
, Lihuo He
, Ying Yu
, Wen Lu
, Xinbo Gao
:
General Deformable RoI Pooling and Semi-Decoupled Head for Object Detection. 9410-9422 - Miao Xu
, Xiangyu Zhu
, Yueying Kao
, Zhiwen Chen
, Jiangjing Lyu
, Zhen Lei
:
Multi-Level Pixel-Wise Correspondence Learning for 6DoF Face Pose Estimation. 9423-9435 - Xuejin Wang
, Leilei Huang, Hangwei Chen
, Qiuping Jiang
, Shaowei Weng
, Feng Shao
:
Benchmark Dataset and Pair-Wise Ranking Method for Quality Evaluation of Night-Time Image Enhancement. 9436-9449 - Lixiang Xu
, Qingzhe Cui
, Richang Hong
, Wei Xu
, Enhong Chen
, Xin Yuan
, Chenglong Li
, Yuanyan Tang
:
Group Multi-View Transformer for 3D Shape Analysis With Spatial Encoding. 9450-9463 - Xianyun Wang
, Linhong Wang
, Zhenchen Yang
, Jiacong Zhou
, Yuchen Zheng
, Feng Chen
, Richang Hong
, Jun Yu
, Fan Yang
:
DSIS-DPR: Structured Instance Segmentation and Diffusion Prior Refinement for Dental Anatomy Learning. 9464-9476 - Peng-Fei Zhang
, Zi Huang
, Xin-Shun Xu
, Guangdong Bai
:
Effective and Robust Adversarial Training Against Data and Label Corruptions. 9477-9488 - Sen Xu
, Shikui Wei
, Tao Ruan
, Lixin Liao
, Yao Zhao
:
Each Performs Its Functions: Task Decomposition and Feature Assignment for Audio-Visual Segmentation. 9489-9498 - Hongtao Xie
, Yan Jiang
, Lei Zhang
, Pandeng Li
, Dongming Zhang
, Yongdong Zhang
:
Semantic-Enhanced Proxy-Guided Hashing for Long-Tailed Image Retrieval. 9499-9514 - Yizhe Li
, Sanping Zhou
, Zheng Qin
, Le Wang
, Jinjun Wang
, Nanning Zheng
:
Single-Shot and Multi-Shot Feature Learning for Multi-Object Tracking. 9515-9526 - Ruiheng Zhang
, Jinyu Tan
, Zhe Cao
, Lixin Xu
, Yumeng Liu
, Lingyu Si
, Fuchun Sun
:
Part-Aware Correlation Networks for Few-Shot Learning. 9527-9538 - Guoshuai Zhao
, Xiaolong Zhang
, Hao Tang
, Jialie Shen
, Xueming Qian
:
Domain-Oriented Knowledge Transfer for Cross-Domain Recommendation. 9539-9550 - Yongshan Zhang
, Guozhu Jiang
, Zhihua Cai
, Yicong Zhou
:
Bipartite Graph-Based Projected Clustering With Local Region Guidance for Hyperspectral Imagery. 9551-9563 - Lin Zhang
, Yifan Wang
, Ran Song
, Mingxin Zhang
, Xiaolei Li
, Wei Zhang
:
Neighborhood-Aware Mutual Information Maximization for Source-Free Domain Adaptation. 9564-9574 - Ruihan Chen
, Junpeng Tan
, Zhijing Yang
, Xiaojun Yang
, Qingyun Dai
, Yongqiang Cheng
, Liang Lin
:
DPHANet: Discriminative Parallel and Hierarchical Attention Network for Natural Language Video Localization. 9575-9590 - Tao Pu
, Qianru Lao, Hefeng Wu
, Tianshui Chen
, Ling Tian
, Jie Liu, Liang Lin
:
Category-Adaptive Label Discovery and Noise Rejection for Multi-Label Recognition With Partial Positive Labels. 9591-9602 - Xiufang Li
, Licheng Jiao
, Qigong Sun
, Fang Liu
, Xu Liu
, Lingling Li
, Puhua Chen
, Shuyuan Yang
:
A Category-Aware Curriculum Learning for Data-Free Knowledge Distillation. 9603-9618 - Yuan Sun
, Yang Qin
, Dezhong Peng
, Zhenwen Ren
, Chao Yang
, Peng Hu
:
Dual Self-Paced Hashing for Image Retrieval. 9619-9629 - Zipeng Qin
, Jianbo Liu
, Xiaolin Zhang
, Maoqing Tian
, Aojun Zhou
, Shuai Yi
, Hongsheng Li
:
Pyramid Fusion Transformer for Semantic Segmentation. 9630-9643 - Shuai Zeng
, Wenzhao Zheng
, Jiwen Lu
, Haibin Yan
:
Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection. 9644-9656 - Xun Jiang
, Xing Xu
, Zailei Zhou
, Yang Yang
, Fumin Shen
, Heng Tao Shen
:
Zero-Shot Video Moment Retrieval With Angular Reconstructive Text Embeddings. 9657-9670 - Aobo Li
, Jinjian Wu
, Yongxu Liu, Leida Li
, Weisheng Dong
, Guangming Shi
:
Blind Image Quality Assessment Based on Perceptual Comparison. 9671-9682 - Quanziang Wang
, Renzhen Wang
, Yuexiang Li
, Dong Wei
, Hong Wang
, Kai Ma
, Yefeng Zheng
, Deyu Meng
:
Relational Experience Replay: Continual Learning by Adaptively Tuning Task-Wise Relationship. 9683-9698 - Wenhua Dong
, Xiaojun Wu
, Zhenhua Feng
, Sara Atito Ali Ahmed
, Muhammad Awais
, Josef Kittler
:
One-pass View-unaligned Clustering. 9699-9709 - Hosung Son
, Min-Jung Shin
, Minji Cho
, Joonsoo Kim
, Kugjin Yun
, Suk-Ju Kang
:
CMVDE: Consistent Multi-View Video Depth Estimation via Geometric-Temporal Coupling Approach. 9710-9721 - Zhe Yuan
, Dan Wu
, Liang Zhou
:
Achieving the Optimum Rate for Cross-Modal Source Coding. 9722-9735 - Chenlu Zhan
, Yufei Zhang
, Yu Lin
, Gaoang Wang
, Hongwei Wang
:
UniDCP: Unifying Multiple Medical Vision-Language Tasks via Dynamic Cross-Modal Learnable Prompts. 9736-9748 - Yue Zhan
, Xin Wang
, Lang Nie
, Yang Zhao
, Tangwen Yang
, Qiuqi Ruan
:
TG-Pose: Delving Into Topology and Geometry for Category-Level Object Pose Estimation. 9749-9762 - Minghua Wan
, Jingyu Zhu
, Chengli Sun
, Zhangjing Yang
, Jun Yin
, Guowei Yang
:
Tensor Low-Rank Graph Embedding and Learning for One-Step Incomplete Multi-View Clustering. 9763-9775 - Xueli Geng
, Licheng Jiao
, Lingling Li
, Xu Liu
, Fang Liu
, Shuyuan Yang
:
Fast and Effective: Progressive Hierarchical Fusion Classification for Remote Sensing Images. 9776-9789 - Jun Huang
, Honglin Li
, Yijia Gong
, Fan Fan
, Yong Ma
, Qinglei Du
, Jiayi Ma
:
Robust Feature Matching via Graph Neighborhood Motion Consensus. 9790-9803 - Yifei Xu
, Xiaolong Xu
, Honghao Gao
, Fu Xiao
:
SGDM: An Adaptive Style-Guided Diffusion Model for Personalized Text to Image Generation. 9804-9813 - Fangbin Xu
, Dongyue Chen
, Tong Jia
, Shizhuo Deng
, Hao Wang
:
CBDMoE: Consistent-but-Diverse Mixture of Experts for Domain Generalization. 9814-9824 - Yangbo Feng
, Junyu Gao
, Changsheng Xu
:
Spatiotemporal Orthogonal Projection Capsule Network for Incremental Few-Shot Action Recognition. 9825-9838 - Xingyue Liu
, Jiahao Qi, Chen Chen, Kangcheng Bin, Ping Zhong
:
Relation-Aware Weight Sharing in Decoupling Feature Learning Network for UAV RGB-Infrared Vehicle Re-Identification. 9839-9853 - Xiao-Qian Liu
, Peng-Fei Zhang
, Xin Luo
, Zi Huang
, Xin-Shun Xu
:
TextAdapter: Self-Supervised Domain Adaptation for Cross-Domain Text Recognition. 9854-9865 - Weiwei Cai
, Huaidong Zhang
, Xuemiao Xu
, Chenshu Xu
, Kun Zhang
, Shengfeng He
:
Delving Into Important Samples of Semi-Supervised Old Photo Restoration: A New Dataset and Method. 9866-9879 - Dongjie Ye
, Baoliang Chen
, Shiqi Wang
, Sam Kwong
:
CodedBGT: Code Bank-Guided Transformer for Low-Light Image Enhancement. 9880-9891 - Hong Ding
, Haimin Zhang
, Gang Fu
, Caoqing Jiang
, Fei Luo
, Chunxia Xiao
, Min Xu
:
Towards High-Quality Photorealistic Image Style Transfer. 9892-9905 - Gaojie Wu
, Ling-An Zeng
, Jingke Meng
, Wei-Shi Zheng
:
Adaptive Weight Generator for Multi-Task Image Recognition by Task Grouping Prompt. 9906-9919 - Jingyu Zhong
, Ronghua Shang
, Feng Zhao
, Weitong Zhang
, Songhua Xu
:
Negative Label and Noise Information Guided Disambiguation for Partial Multi-Label Learning. 9920-9935 - Yahui Xu
, Yi Bin
, Jiwei Wei
, Yang Yang
, Guoqing Wang
, Heng Tao Shen
:
Align and Retrieve: Composition and Decomposition Learning in Image Retrieval With Text Feedback. 9936-9948 - Baozheng Zhang
, Ziqi Yuan
, Hua Xu
, Kai Gao
:
Crossmodal Translation Based Meta Weight Adaption for Robust Image-Text Sentiment Analysis. 9949-9961 - Fangxun Shu
, Biaolong Chen
, Yue Liao
, Jinqiao Wang
, Si Liu
:
MAC: Masked Contrastive Pre-Training for Efficient Video-Text Retrieval. 9962-9972 - Chang Liu
, Shunxin Xu
, Jialun Peng
, Kaidong Zhang
, Dong Liu
:
Toward Interactive Image Inpainting via Robust Sketch Refinement. 9973-9987 - Zhuo Su
, Yilin Chen
, Fuwei Zhang
, Ruomei Wang
, Fan Zhou
, Ge Lin
:
DMAP: Decoupling-Driven Multi-Level Attribute Parsing for Interpretable Outfit Collocation. 9988-10000 - Huibing Wang
, Mingze Yao
, Yawei Chen
, Yunqiu Xu
, Haipeng Liu
, Wei Jia
, Xianping Fu, Yang Wang
:
Manifold-Based Incomplete Multi-View Clustering via Bi-Consistency Guidance. 10001-10014 - Chen Liu
, Peike Li, Hu Zhang
, Lincheng Li
, Zi Huang
, Dadong Wang
, Xin Yu
:
BAVS: Bootstrapping Audio-Visual Segmentation by Integrating Foundation Knowledge. 10015-10028 - Hezhen Hu
, Xiaoyi Dong
, Jianmin Bao, Dongdong Chen
, Lu Yuan
, Dong Chen
, Houqiang Li
:
PersonMAE: Person Re-Identification Pre-Training With Masked AutoEncoders. 10029-10040 - Xinran Li
, Zichi Wang
, Guorui Feng
, Xinpeng Zhang
, Chuan Qin
:
Perceptual Image Hashing Using Feature Fusion of Orthogonal Moments. 10041-10054 - Guotian Zeng
, Bi Zeng
, Qingmao Wei
, Huiting Hu
, Hong Zhang
:
Visual Object Tracking With Mutual Affinity Aligned to Human Intuition. 10055-10068 - Jianping Gou
, Xiabin Zhou
, Lan Du
, Yibing Zhan
, Wu Chen
, Zhang Yi
:
Difference-Aware Distillation for Semantic Segmentation. 10069-10080 - Wei Zhuo
, Yuan Wang
, Junliang Chen
, Songhe Deng
, Zhi Wang
, Linlin Shen
, Wenwu Zhu
:
Enhancing Unsupervised Semantic Segmentation Through Context-Aware Clustering. 10081-10093 - Jianwen Song
, Arcot Sowmya
, Changming Sun
:
Efficient Hybrid Feature Interaction Network for Stereo Image Super-Resolution. 10094-10105 - Leyuan Liu
, Xu Liu
, Jianchi Sun
, Changxin Gao
, Jingying Chen
:
SeIF: Semantic-Constrained Deep Implicit Function for Single-Image 3D Head Reconstruction. 10106-10120 - Zhongzheng Yuan
, Samyak Rawlekar
, Siddharth Garg
, Elza Erkip
, Yao Wang
:
Split Computing With Scalable Feature Compression for Visual Analytics on the Edge. 10121-10133 - Peiguang Jing
, Xuan Zhao, Fugui Fan
, Fan Yang
, Yun Li
, Yuting Su:
Multimodal Progressive Modulation Network for Micro-Video Multi-Label Classification. 10134-10144 - Jun Chen
, Jianfeng Ding
, Jiayi Ma
:
HitFusion: Infrared and Visible Image Fusion for High-Level Vision Tasks Using Transformer. 10145-10159 - Chenchen Tao
, Chong Wang
, Sunqi Lin
, Suhang Cai
, Di Li
, Jiangbo Qian
:
Feature Reconstruction With Disruption for Unsupervised Video Anomaly Detection. 10160-10173 - Xianquan Zhang
, Feiyi He
, Chunqiang Yu
, Xinpeng Zhang
, Ching-Nung Yang
, Zhenjun Tang
:
Reversible Data Hiding in Encrypted Images With Asymmetric Coding and Bit-Plane Block Compression. 10174-10188 - Xinghan Wang
, Yadong Mu
:
Localized Linear Temporal Dynamics for Self-Supervised Skeleton Action Recognition. 10189-10199 - Rui Ding
, Kehua Guo
, Xiangyuan Zhu
, Zheng Wu
, Hui Fang
:
Progressive Diversity Generation for Single Domain Generalization. 10200-10210 - Zhangkai Ni
, Yue Liu
, Keyan Ding
, Wenhan Yang
, Hanli Wang
, Shiqi Wang
:
Opinion-Unaware Blind Image Quality Assessment Using Multi-Scale Deep Feature Statistics. 10211-10224 - Shufang Zhang
, Minxue Ni
, Shuai Chen, Lei Wang
, Wenxin Ding
, Yuhong Liu
:
A Two-Stage Personalized Virtual Try-On Framework With Shape Control and Texture Guidance. 10225-10236 - Xiao Liang
, Wensheng Li
, Lifeng Huang
, Chengying Gao
:
DanceComposer: Dance-to-Music Generation Using a Progressive Conditional Music Generator. 10237-10250 - Zheng Wang
, Wei Zhang
, Long Ye
, Dan Zeng
, Tao Mei
:
Cross-Modal Quantization for Co-Speech Gesture Generation. 10251-10263 - Hegui Zhu
, Zhan Gao
, Jiayi Wang
, Yange Zhou, Chengqing Li
:
Few-Shot Fine-Grained Image Classification via Multi-Frequency Neighborhood and Double-Cross Modulation. 10264-10278 - Yu Wang
, Shengjie Zhao
, Shiwei Chen
:
Action-Semantic Consistent Knowledge for Weakly-Supervised Action Localization. 10279-10289 - Xinyu Zhang
, Weiyu Sun
, Hao Lu
, Ying Chen
, Yun Ge, Xiaolin Huang
, Jie Yuan
, Yingcong Chen
:
Self-Similarity Prior Distillation for Unsupervised Remote Physiological Measurement. 10290-10305 - Guoguang Hua, Dalian Zheng, Shishun Tian
, Wenbin Zou
, Shenglan Liu
, Xia Li
:
"Where Does the Devil Lie?": Multimodal Multitask Collaborative Revision Network for Trusted Road Segmentation. 10306-10317 - Zaidao Wen
, Jinhui Wu, Yafei Lv
, Qian Wu
:
Cross-Modality Vessel Re-Identification With Deep Alignment Decomposition Network. 10318-10330 - Fugui Fan
, Peiguang Jing
, Liqiang Nie
, Haoyu Gu
, Yuting Su
:
SADCMF: Self-Attentive Deep Consistent Matrix Factorization for Micro-Video Multi-Label Classification. 10331-10341 - Zheng Wang
, Zhenwei Gao, Mengqun Han, Yang Yang
, Heng Tao Shen
:
Estimating the Semantics via Sector Embedding for Image-Text Retrieval. 10342-10353 - Meihuizi Jia
, Lei Shen
, Luu Anh Tuan
, Meng Chen
, Jing Xu
, Lejian Liao
, Shaozu Yuan
, Xiaodong He
:
MuJo-SF: Multimodal Joint Slot Filling for Attribute Value Prediction of E-Commerce Commodities. 10354-10366 - Jiwei Wei
, Chen Pan
, Shiyuan He
, Guoqing Wang
, Yang Yang
, Heng Tao Shen
:
Towards Robust Person Re-Identification by Adversarial Training With Dynamic Attack Strategy. 10367-10380 - Zhijian Wu
, Jun Li
, Chang Xu
, Dingjiang Huang
, Steven C. H. Hoi
:
RUN: Rethinking the UNet Architecture for Efficient Image Restoration. 10381-10394 - Shan Li
, Lu Yang
, Pu Cao, Liulei Li
, Huadong Ma
:
Frequency-Based Matcher for Long-Tailed Semantic Segmentation. 10395-10405 - Lei Ma
, Xin Luo
, Hanyu Hong
, Fanman Meng
, Qingbo Wu
:
Logit Variated Product Quantization Based on Parts Interaction and Metric Learning With Knowledge Distillation for Fine-Grained Image Retrieval. 10406-10419 - Xingqun Qi
, Chen Liu
, Lincheng Li
, Jie Hou, Haoran Xin, Xin Yu
:
EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation. 10420-10430 - Hao Luo, Zhiqiang Tian
, Kaibing Zhang
, Guofa Wang, Shaoyi Du
:
Semi-Supervised Domain Adaptation via Joint Transductive and Inductive Subspace Learning. 10431-10445 - Yue Jiang
, Kejiang Chen
, Wei Yan
, Xuehu Yan
, Guozheng Yang
, Kai Zeng
:
Robust Secret Image Sharing Resistant to JPEG Recompression Based on Stable Block Condition. 10446-10461 - Hongyu Deng, Yushan Xie, Qi Wang
, Jianjun Wang, Weijian Ruan
, Wu Liu
, Yong-Jin Liu
:
CDKM: Common and Distinct Knowledge Mining Network With Content Interaction for Dense Captioning. 10462-10473 - Dan Zhang
, Zhekai Du
, Jingjing Li
, Lei Zhu
, Heng Tao Shen
:
Domain-Adaptive Energy-Based Models for Generalizable Face Anti-Spoofing. 10474-10488 - Linxia Zhu
, Jun Cheng
, Xu Wang
, Honglei Su
, Huan Yang
, Hui Yuan
, Jari Korhonen
:
3DTA: No-Reference 3D Point Cloud Quality Assessment With Twin Attention. 10489-10502 - Ziyang Zhang
, Xiang Tian, Yuan Zhang
, Kailing Guo
, Xiangmin Xu
:
Label-Guided Dynamic Spatial-Temporal Fusion for Video-Based Facial Expression Recognition. 10503-10513 - Jiannan Ge
, Hongtao Xie
, Pandeng Li
, Lingxi Xie
, Shaobo Min
, Yongdong Zhang
:
Towards Discriminative Feature Generation for Generalized Zero-Shot Learning. 10514-10529 - Xi Yang
, Shaoyi Li
, Saisai Niu
, Binbin Yan
, Zhongjie Meng
:
Graph-Based Spatio-Temporal Semantic Reasoning Model for Anti-Occlusion Infrared Aerial Target Recognition. 10530-10544 - Xichu Ma, Yuchen Wang
, Ye Wang
:
Symbolic Music Generation From Graph-Learning-Based Preference Modeling and Textual Queries. 10545-10558 - Yuqi Jiang
, Qiankun Liu
, Dongdong Chen
, Lu Yuan
, Ying Fu
:
AnimeDiff: Customized Image Generation of Anime Characters Using Diffusion Model. 10559-10572 - Marco Cotogni
, Marco Arazzi
, Claudio Cusano
:
PhotoStyle60: A Photographic Style Dataset for Photo Authorship Attribution and Photographic Style Transfer. 10573-10584 - Beichen Zhang
, Liang Li
, Zheng-Jun Zha
, Jiebo Luo
, Qingming Huang
:
Downstream-Pretext Domain Knowledge Traceback for Active Learning. 10585-10596 - Lin Wang
, Shiliang Sun
, Jing Zhao
:
VirPNet: A Multimodal Virtual Point Generation Network for 3D Object Detection. 10597-10609 - Yang Bai, Meijing Gao
, Shiyu Li, Ping Wang, Ning Guan, Haozheng Yin, Yonghao Yan:
IBFusion: An Infrared and Visible Image Fusion Method Based on Infrared Target Mask and Bimodal Feature Extraction Strategy. 10610-10622 - Ke Liu
, Jiwei Wei
, Jie Zou
, Peng Wang
, Yang Yang
, Heng Tao Shen
:
Improving Pre-Trained Model-Based Speech Emotion Recognition From a Low-Level Speech Feature Perspective. 10623-10636 - Jun Yu
, Zhongpeng Cai
, Yihao Li
, Lei Wang, Fang Gao
, Ye Yu
:
Language-Guided Dual-Modal Local Correspondence for Single Object Tracking. 10637-10650 - Zhenglong Cui
, Da Yang
, Hao Sheng
, Sizhe Wang
, Rongshan Chen
, Ruixuan Cong
, Wei Ke
:
Triple Consistency for Transparent Cheating Problem in Light Field Depth Estimation. 10651-10664 - Xinguang Xiang
, Xinhao Ding
, Lu Jin
, Zechao Li
, Jinhui Tang
, Ramesh C. Jain
:
Alleviating Over-Fitting in Hashing-Based Fine-Grained Image Retrieval: From Causal Feature Learning to Binary-Injected Hash Learning. 10665-10677 - Jiayi Li, Min Jiang
, Jun Kong
, Xuefeng Tao
, Xi Luo
:
Learning Semantic Polymorphic Mapping for Text-Based Person Retrieval. 10678-10691 - Kunpeng Wang
, Danying Lin
, Chenglong Li
, Zhengzheng Tu
, Bin Luo
:
Alignment-Free RGBT Salient Object Detection: Semantics-Guided Asymmetric Correlation Network and a Unified Benchmark. 10692-10707 - Zhuo Chen
, Xiaoyue Wan
, Yiming Bao
, Xu Zhao
:
Joint-Limb Compound Triangulation With Co-Fixing for Stereoscopic Human Pose Estimation. 10708-10719 - Mingsheng Li
, Lin Zhang
, Mingzhen Zhu, Zilong Huang
, Gang Yu
, Jiayuan Fan
, Tao Chen
:
Lightweight Model Pre-Training via Language Guided Knowledge Distillation. 10720-10730 - Junyu Chen
, Jie An
, Hanjia Lyu
, Christopher Kanan
, Jiebo Luo
:
Learning to Evaluate the Artness of AI-Generated Images. 10731-10740 - Wai Keung Wong
, Dewei Lin, Yuwu Lu
, Jiajun Wen
, Zhihui Lai
, Xuelong Li
:
Correlation-Guided Distribution and Geometry Alignments for Heterogeneous Domain Adaptation. 10741-10754 - Jiachen Kang
, Wenjing Jia
, Xiangjian He
, Kin-Man Lam:
Point Clouds are Specialized Images: A Knowledge Transfer Approach for 3D Understanding. 10755-10765 - Bo Wang
, Fei Yu
, Fei Wei
, Yi Li
, Wei Wang
:
Invisible Intruders: Label-Consistent Backdoor Attack Using Re-Parameterized Noise Trigger. 10766-10778 - Liang Zhang
, Jiangwei Zhao
, Qingbo Wu
, Lili Pan
, Hongliang Li
:
InfoUCL: Learning Informative Representations for Unsupervised Continual Learning. 10779-10791 - Rui Xu
, Yuezhou Li
, Yuzhen Niu
, Huangbiao Xu
, Yuzhong Chen
, Tiesong Zhao
:
Bilateral Interaction for Local-Global Collaborative Perception in Low-Light Image Enhancement. 10792-10804 - Liangchen Liu
, Nannan Wang
, Decheng Liu
, Xi Yang
, Xinbo Gao
, Tongliang Liu
:
Towards Specific Domain Prompt Learning via Improved Text Label Optimization. 10805-10815 - Liqun Lin
, Mingxing Wang
, Jing Yang, Keke Zhang
, Tiesong Zhao
:
Toward Efficient Video Compression Artifact Detection and Removal: A Benchmark Dataset. 10816-10827 - Xiaofei Zhou
, Kunye Shen
, Zhi Liu
:
ADMNet: Attention-Guided Densely Multi-Scale Network for Lightweight Salient Object Detection. 10828-10841 - Yuxuan Luo
, Runmin Cong
, Xialei Liu
, Horace Ho-Shing Ip
, Sam Kwong
:
Modeling Inner- and Cross-Task Contrastive Relations for Continual Image Classification. 10842-10853 - Guanlin Li
, Bin Zhao
, Xuelong Li
:
Low-Light Image Enhancement With SAM-Based Structure Priors and Guidance. 10854-10866 - Lingru Zhou, Yiqi Gao, Manqing Zhang, Peng Wu, Peng Wang, Yanning Zhang:
Human-Centric Behavior Description in Videos: New Benchmark and Model. 10867-10878 - Demin Gao
, Liyuan Ou, Ye Liu
, Qing Yang
, Honggang Wang:
DeepSpoof: Deep Reinforcement Learning-Based Spoofing Attack in Cross-Technology Multimedia Communication. 10879-10891 - Yu Qiu
, Yuhang Sun
, Jie Mei
, Jing Xu
:
Deeply Hybrid Contrastive Learning Based on Semantic Pseudo-Label for Salient Object Detection in Optical Remote Sensing Images. 10892-10907 - Yiyi Li
, Xin Liao
, Xiaoshuai Wu
:
Screen-Shooting Resistant Watermarking With Grayscale Deviation Simulation. 10908-10923 - Ning Han
, Xun Yang
, Ee-Peng Lim
, Hao Chen
, Qianru Sun
:
Efficient Cross-Modal Video Retrieval With Meta-Optimized Frames. 10924-10936 - Wei Jiang
, Peirong Ning
, Jiayu Yang
, Yongqi Zhai
, Feng Gao
, Ronggang Wang
:
LLIC: Large Receptive Field Transform Coding With Adaptive Weights for Learned Image Compression. 10937-10951 - Yunxin Li
, Baotian Hu
, Xinyu Chen
, Lin Ma
, Yong Xu
, Min Zhang
:
LMEye: An Interactive Perception Network for Large Language Models. 10952-10964 - Wenfeng Song
, Xuan Wang
, Yuting Guo
, Shuai Li
, Bin Xia, Aimin Hao
:
CenterFormer: A Novel Cluster Center Enhanced Transformer for Unconstrained Dental Plaque Segmentation. 10965-10978 - Ran Ran
, Jiwei Wei
, Chaoning Zhang
, Guoqing Wang
, Yang Yang
, Heng Tao Shen
:
Adaptive Multi-scale Degradation-Based Attack for Boosting the Adversarial Transferability. 10979-10990 - Aming Wu
, Jiaping Yu, Yuxuan Wang, Cheng Deng
:
Prototype-Decomposed Knowledge Distillation for Learning Generalized Federated Representation. 10991-11002 - Pei Geng
, Xuequan Lu
, Wanqing Li
, Lei Lyu
:
Hierarchical Aggregated Graph Neural Network for Skeleton-Based Action Recognition. 11003-11017 - Shuoyao Wang
, Jiawei Lin, Yu Dai:
MMVS: Enabling Robust Adaptive Video Streaming for Wildly Fluctuating and Heterogeneous Networks. 11018-11030 - Housheng Xie
, Meng Sang
, Yukuan Zhang
, Yang Yang
, Shan Zhao
, Jianbo Zhong
:
RCVS: A Unified Registration and Fusion Framework for Video Streams. 11031-11043 - Di Wang
, Xiantao Lu, Quan Wang
, Yumin Tian
, Bo Wan
, Lihuo He
:
Gist, Content, Target-Oriented: A 3-Level Human-Like Framework for Video Moment Retrieval. 11044-11056 - Zhaoda Ye, Yang Liu
, Yuxin Peng
:
MAAN: Memory-Augmented Auto-Regressive Network for Text-Driven 3D Indoor Scene Generation. 11057-11069 - Yonghao Dong
, Le Wang
, Sanping Zhou
, Gang Hua
, Changyin Sun
:
Sparse Pedestrian Character Learning for Trajectory Prediction. 11070-11082 - Zhe Zhang
, Yi Yu
, Atsuhiro Takasu
:
Controllable Syllable-Level Lyrics Generation From Melody With Prior Attention. 11083-11094 - Yuqi Jiang, Jing Li, Haidong Qin, Yanran Dai, Jing Liu, Guodong Zhang, Canbin Zhang, Tao Yang:
GS-SFS: Joint Gaussian Splatting and Shape-From-Silhouette for Multiple Human Reconstruction in Large-Scale Sports Scenes. 11095-11110 - Mao Cui, Yun Zhang
, Chunling Fan
, Raouf Hamzaoui
, Qinglan Li
:
Colored Point Cloud Quality Assessment Using Complementary Features in 3D and 2D Spaces. 11111-11125 - Xuenan Xu
, Ziyang Ma, Mengyue Wu
, Kai Yu
:
Towards Weakly Supervised Text-to-Audio Grounding. 11126-11138 - Xiruo Jiang
, Yazhou Yao
, Xili Dai
, Fumin Shen
, Liqiang Nie
, Heng Tao Shen
:
Anti-Collapse Loss for Deep Metric Learning. 11139-11150 - Chao Cai
, Weide Liu
, Xue Xia
, Zhenghua Chen
, Yuming Fang
:
Bayesian Uncertainty Calibration for Federated Time Series Analysis. 11151-11163 - Xu Wang
, Yifan Li
, Qiudan Zhang
, Wenhui Wu
, Mark Junjie Li, Lin Ma
, Jianmin Jiang
:
Weakly-Supervised 3D Scene Graph Generation via Visual-Linguistic Assisted Pseudo-Labeling. 11164-11175 - Ruomei Wang
, Yuanmao Luo
, Fuwei Zhang
, Mingyang Liu
, Xiaonan Luo
:
HSSHG: Heuristic Semantics-Constrained Spatio-Temporal Heterogeneous Graph for VideoQA. 11176-11190 - Heng Huang
, Lin Zhao
, Haixing Dai
, Lu Zhang, Xintao Hu
, Dajiang Zhu
, Tianming Liu
:
BI-AVAN: A Brain-Inspired Adversarial Visual Attention Network for Characterizing Human Visual Attention From Neural Activity. 11191-11203 - Zeyu Xiong
, Daizong Liu
, Xiang Fang
, Xiaoye Qu
, Jianfeng Dong
, Jiahao Zhu
, Keke Tang
, Pan Zhou
:
Rethinking Video Sentence Grounding From a Tracking Perspective With Memory Network and Masked Attention. 11204-11218 - Jing Liu
, Qingying Li
, Xiongkuo Min
, Yuting Su
, Guangtao Zhai
, Xiaokang Yang
:
Pixel-Learnable 3DLUT With Saturation-Aware Compensation for Image Enhancement. 11219-11231 - Mingqi Fang
, Lingyun Yu
, Yun Song
, Yongdong Zhang
, Hongtao Xie
:
IEIRNet: Inconsistency Exploiting Based Identity Rectification for Face Forgery Detection. 11232-11245 - Hantao Yao
, Jifei Luo
, Lu Yu
, Changsheng Xu
:
Camera-Incremental Object Re-Identification With Identity Knowledge Evolution. 11246-11260 - Hefeng Wu
, Hao Jiang, Keze Wang
, Ziyi Tang, Xianghuan He, Liang Lin
:
Improving Network Interpretability via Explanation Consistency Evaluation. 11261-11273 - Peiying Wu, Shiwei Wang
, Liquan Shen
, Feifeng Wang, Zhaoyi Tian
, Xia Hua
:
Multi-Prior Driven Resolution Rescaling Blocks for Intra Frame Coding. 11274-11289 - Wandong Zhang
, Yimin Yang
, Tianlong Liu
:
Coarse-to-Fine Target Detection for HFSWR With Spatial-Frequency Analysis and Subnet Structure. 11290-11301 - Hefeng Wu
, Guangzhi Ye, Ziyang Zhou
, Ling Tian
, Qing Wang, Liang Lin
:
Dual-View Data Hallucination With Semantic Relation Guidance for Few-Shot Image Recognition. 11302-11315 - Junjie Ke
, Lihuo He
, Bo Han
, Jie Li
, Di Wang
, Xinbo Gao
:
VLDadaptor: Domain Adaptive Object Detection With Vision-Language Model Distillation. 11316-11331 - Tongtong Zhao
, Gehui Li
, Shanshan Zhao:
End-to-End Image Colorization With Multiscale Pyramid Transformer. 11332-11344 - Xi Yang
, Qiubai Zhou, Ziyu Wei
, Hong Liu, Nannan Wang
, Xinbo Gao
:
Elaborate Teacher: Improved Semi-Supervised Object Detection With Rich Image Exploiting. 11345-11357 - Defu Qiu
, Yuhu Cheng
, Kelvin Kian Loong Wong
, Wen-Jun Zhang
, Zhang Yi
, Xuesong Wang
:
DBSR: Quadratic Conditional Diffusion Model for Blind Cardiac MRI Super-Resolution. 11358-11371 - Yi-Fan Li
, Hong-Bing Ji
, Wenbo Zhang
, Yu-Kun Lai
:
Learning Discriminative Motion Models for Multiple Object Tracking. 11372-11385 - Feixiang Zhou
, Zheheng Jiang
, Huiyu Zhou
, Xuelong Li
:
SMC-NCA: Semantic-Guided Multi-Level Contrast for Semi-Supervised Temporal Action Segmentation. 11386-11401 - Yifei Deng, Guohao Wang, Chenglong Li
, Wei Wang, Cheng Zhang
, Jin Tang
:
Collaborative License Plate Recognition via Association Enhancement Network With Auxiliary Learning and a Unified Benchmark. 11402-11414 - Rao Fu
, Kai Hormann
, Pierre Alliez
:
LFS-Aware Surface Reconstruction From Unoriented 3D Point Clouds. 11415-11427
