List of Accepted Regular and Special Session Papers

Background

Title Authors
Clip-level Uncertainty and Temporal-aware Active Learning for End-to-End Multi-Object Tracking Riku Inoue, Shogo Sato, Kazuhiko Murasaki, Tomoyasu Shimada, Toshihiko Nishimura, Ryuichi Tanida
Collision-Resistant Single-Pass Method for Unsupervised Fine-Grained Image Hashing Anh Kiet Duong, Petra Gomez-Krämer, Jean-Michel CAROZZA
Enhancing Zero-shot Personalized Image Aesthetics Assessment with Profile-aware Multimodal LLM Chun Wang, Chenfeng Wei, Chenyang Liu, Weihong Deng
EMARS: Event-based Motion-Aware Correction, Deblurring and Interpolation of Rolling Shutter Images Weixiang Hu, Bohan Huang, Shigeaki Namiki, Yuka Ogino, Takahiro Toizumi, ATSUSHI ITO, Yoshimitsu Aoki
Learning an Elastomer Simulator for Hand-Object Interaction Xinguo He, Yixin Shen, Rahul Chaudhari
ITPLUT: Inverse Tone Mapping based on Lookup Tables with Luma and Chroma Mapping Qiuling He, Yuanfan Huang, Zhanyu Tu, Wenhui Wu, Fei Zhou
RETHINKING DIFFUSION FOR 3D HUMAN POSE ESTIMATION: SPATIOTEMPORAL PATCHIFICATION AND ADAPTIVE MODULATION Shuo Yang, Bart Jansen, Hichem Sahli, Xuan-son Nguyen, Aymeric Histace
LINDE: A Lightweight Neural Network for Remote Sensing Image Denoising Rafael Pires, Daniel F. Silva Santos, Denis Silva Moretto, Yasmin Sobrinho, Pedro Henrique Crespan Ribeiro, Kelton Costa, Khan Muhammad, Joao P. Papa
Loss Functions Matter: A Systematic Study of Class Imbalance in Flood Forecasting Nicolas To Van Trang, Van Linh Nguyen
AdaCorrection: Adaptive Offset Cache Correction for Fast and Accurate Diffusion Transformers Dong Liu, Yanxuan Yu, Ben Lengerich, Yingnian Wu
M2RETINEXFORMER: MULTI-MODAL RETINEXFORMER FOR LOW-LIGHT IMAGE ENHANCEMENT Youssef Aboelwafa, Hicham G. Elmongui, Marwan Torki
D^2-VR: Degradation-Robust and Distilled Video Restoration with Synergistic Optimization Strategy Jianfeng Liang, Shaocheng Shen, Botao Xu, Qiang Hu, xiaoyun zhang
Learning MRI Translation with Explicit Dynamic Texture and Structure Priors Runyu Xiao, Junze Zhu, Zhangkai Ni, Hanli Wang
Beyond Pixel Fidelity: Minimizing Perceptual Distortion and Color Bias in Night Photography Rendering Furkan Kınlı
Noise-Aware Latent Verification for Step-Efficient Diffusion Sampling Vishwajeet Shukla, Himanshu Baurai, Ajay Bedi
STEGANOGRAPHIC APPROACH BASED ON HOMOMORPHIC ENCRYPTION Norman Hutte, William, Puech,
DYNAMIC DISTILLATION AND GRADIENT CONSISTENCY FOR ROBUST LONG-TAILED INCREMENTAL LEARNING Taigo Sakai, Kazuhiro Hotta
3D Gaussian Splatting for Indoor Scene Reconstruction with Photometric and Geometric Consistency Constraints dongsheng xie
DON’T LAG, RAG: TRAINING-FREE ADVERSARIAL DETECTION USING RAG Roie Kazoom, Raz Lapid, Moshe Sipper, Ofer Hadar
Integrating Point Cloud-Based Non-Photorealistic Semi-Transparency into Gaussian Splatting Kento Yamazaki, Jun Minagawa, Takuya Matsuda, Kohei Okahara
QuatGAN: Efficient Spatio-Spectral Synthesis via Quaternion Transformers Ashutosh Gupta, Aarsh Wankar, Siddhartha Hrishikesha Voleti, Nitant Dube, Shanmuganathan Raman
RST-SNN: Robust Spatial Temporal Attention for Spiking Neural Networks Shuo Zhang, Bo Zhang, Zhiyuan Fu, Kuo Pang
Incremental Implicitly-Refined Classification via Class Knowledge Capacity Constrained Optimal Transport Qianna Ye, Shaofan Wang, Yanfeng Sun, Jinghua Li, Baocai Yin
Defence Against Byzantine Attacks in Semi-Supervised Federated Learning Nafisa Parvin, Sayanta Sen, Saumik Bhattacharya
IC-4DGS: Illumination-Compensated 4D Gaussian Splatting Under Photometric Variations He-Bi Yang, Ming-Zhe He, Cheng-Wei Yang, Jui-Chiu Chiang, Yu-Lun Liu, Wen-Hsiao Peng
MULTI-SCALE LATENT PREDICTION VIA LEARNABLE ITERATED FUNCTION SYSTEMS Kamel Belloulata, Amina BELALIA
Focus on the Fog: Leveraging Student Uncertainty for Guided Knowledge Distillation in Semantic Segmentation Emil Mededovic, Fabian Gülhan, Rüveyda Yilmaz, Johannes Stegmaier
EasyControlEdge: A Foundation-Model Fine-Tuning for Edge Detection Hiroki Nakamura, Hiroto Iino, Masashi Okada, Tadahiro Taniguchi
Making Fisher Work: Train-Time Compression for 3D Gaussian Splatting Arun Madhav, Chandra Sekhar Seelamantula
SILVA-Mamba: Spatial-Integrity For Landslide Segmentation Via Vectorized HILBERT Scanning and Adaptive Mamba Yi Tang Hsieh, Chih-Chung Hsu, Xin Li, Ming-Ching Chang, Jun-Wei Hsieh
M2UR: Meta-Guided Multi-Expert with Uncertainty-Aware Refinement Framework for Video Summarization Yupeng Wu, Xiaoran Xu, Xiaoshan Yang, Changsheng Xu
Adapting Pre-trained Diffusion Model for Blind Image Denoising via Noise Compensation and Timestep Prediction Kenan Zou, Qianjun Huang, Jiaqing Wang, Kai Zhang
Evaluating Demographic Fairness in Histopathology Foundation Models Natalia Lourdes Pérez García de la Puente, Miguel López Pérez, Valery Naranjo
Video Quality Evaluation Methodology and Result of AV2 Compression Performance Zhijun Lei, Vibhoothi Vibhoothi, Dzung Hoang, Yixin Du, Ramzi Khsib
Multi-User Multi-Key Image Steganography with Key Isolation Tzu-Ti Wei, Yu-Han Tseng, Jun-Yi Lin, Yu-Chee Tseng, Jen-Jee Chen
Split, Skip and Play: Variance-Reduced ProxSkip for Tomography Reconstruction is Extremely Fast Evangelos Papoutsellis, Zeljko Kereta, Kostas papafitsoros
SAFEGUARDED ANDERSON ACCELERATION FOR PRIMAL-DUAL HYBRID GRADIENT IN CONVEX VARIATIONAL IMAGING Hossein Javidnia
A MULTIMODAL INTRINSICS-GUIDED THERMAL-AWARE FRAMEWORK FOR RGB LOW-LIGHT IMAGE ENHANCEMENT Simone Melcarne, jean-luc DUGELAY
SBP-Net: Learning Thin Structure Reconstruction with Sliding-Box Projections Ofir Gilad, andrei sharf
THINKING LIKE A FORENSIC EXPERT: A MULTIMODAL REASONING CHAIN FOR TRAINING-FREE IMAGE MANIPULATION LOCALIZATION Rui Chen, Bin Liu, Changtao Miao, Xinghao Wang, Yi li, Tao Gong, Qi Chu, Nenghai Yu
Generative 6D pose estimation via conditional flow matching Amir Hamza, Davide Boscaini, Weihang Li, Benjamin Busam, Fabio Poiesi
Evidence-Invariance for Auditable Pseudo-Label Selection under Domain Shift in Semi-Supervised Segmentation Hongkang Zhang, Shao-Lun Huang, Ercan Engin Kuruoglu
Bayer Convolution for Raw Image Processing Jaeseong Yu, Hongjae Lee, Myungjun Son, Seung-Won Jung
CHROMOSIS: A SHAPE-CONSTRAINED AND SPATIALLY-AWARE FRAMEWORK FOR CHROMOSOME INSTANCE SEGMENTATION Weixiao Fang, Yixiong Liang, Shichao Kan, Jianfeng Liu
Cross-Modal Knowledge Transfer from RGB Latent Diffusion Model to Spectral-Spatial Joint Distribution for Spectral Reconstruction Keli Deng, Qipeng Qian, YANG CHU, Yuntao Qian
T2M4AR: Text to Motion Generation for Skeleton-based Action Recognition Jun-Sang Yoo, Hongjae Lee, Sangmin Lee, Chunfei Ma, Byeongwon Lee, Seung-Won Jung
Few-Shot Unseen Gestures Recognition Via Enhancing Multimodal ProtoNet with Dive Before Fly Fusion Yongmeng Yan, Nianzu Lv, Wenchao Du, Hu Chen, Yi Zhang, Hongyu Yang
Enhancing Spike-driven Transformers with Multi-Scale Features and High-Rank Interactions Ka Chen, Ziliang Ren, Hui Zhao, Qieshi Zhang, Xiangyang Gao
CORE-NET: CONSENSUS-BASED SELECTION AND RECIPROCAL RELIABILITY FOR MULTI-MODAL OBJECT RE-IDENTIFICATION Xingan Ma, Jinhui Yi, Juergen Gall
ROI-Focused Geometry-Aware Adaptation for Accurate Small-Structure Segmentation in Medical SAM Tianqi Wang, Jianuo Li, Chenhao Yang, Jinyi Xu, Mian Zhou, Kang Dang, Linxue Zhang
Rotation-Aware Dense Neural Network for Multi-Modal SAR-Optical Image Registration Simon Bertrand, Cornelia Vacar, Lionel BOMBRUN
KNOWLEDGE-GUIDED MULTI-TASK LEARNING FOR ORAL CANCER CLASSIFICATION Jérôme de Chauveron, Chenyu Zha, Youssef Assis, Pauline Le Gatt, Margaux Vinant, Géraldine Lescaille, Caroline Shaar-Chneker, Laurent Wendling, Camille Kurtz, Juliette Rochefort
Unequal by Design: Instance-Aware and Cluster-Differentiated Universum Construction for Multi-View Contrastive Clustering Ghaith Chrit, Shan Du
PRIOR-GUIDED FLEXIBLE FINE-SCALE BRAIN PARCELLATION ON DIFFUSION MRI IN EXTREMELY LABEL-SCARCE SCENARIOS Qingwei You, Zhonghua Wan, Jiahao Yu, Yifei He, Xiangxue Wang, Ye Wu
Adaptive Cross-component Prediction based on Chroma Sample Position Estimation for NGVC Haruhisa Kato, Yoshitaka Kidani, Takeshi Chujoh
Preprocessor-Enhanced Image Compression for Joint Machine and Human Vision yaqian luo, Chao Yang, Xinpeng Huang, Ping An
The Impact of Intrinsic Scene Cues on Perceived Color Transfer Quality Herbert Potechius, Thomas Sikora, Sebastian Knorr
Sliding DCT Wiener Denoising with Lightweight Residual Refinement Karen Eguiazarian, Alla Ghazaryan, Sergey Abrahamyan
REAL-TIMEASTRONOMICALIMAGEPREPROCESSINGFOREDGEDEPLOYMENT VIA PHYSICS-GUIDEDSTRIPECORRECTIONANDSTAR-PRESERVINGFILTERING Qiankai Tong
A CRITIC-FREE APPROACH FOR LDR TO HDR CONVERSION Chansoon Heo, Byeungwoo Jeon
SF-DIFF: JOINT SPATIAL-FREQUENCY DIFFUSION MODEL FOR PROBING HUMAN BRAIN TISSUE MICROARCHITECTURE Shuxin Cao, Chengzhe Zhang, Peng Wang, Zhonghua Wan, Jiaolong Qin, Ye Wu
Towards Coherent Video Colorization: When Optical Flow Meets Image Diffusion Models Wen Si, Yifan Li, Luyao Zhang, Shuai Yang, Jiaying Liu
DETECTION OF SPLICING IN DIGITAL IMAGE FORGERY Zuzana Pitsmausová, David Svoboda
A Hard Negative-Aware Optimization for Multilingual Text-Based Person Search Tung Lam Pham, Hoai Thi-Phan, Thuy-Binh Nguyen, Thanh-Hai Tran, Thi-Ngoc-Diep Do, Hong-Quan Nguyen, Thi-Lan Le
SCALE-ADAPTIVE FEATURE EXTRACTION FOR REAL-TIME INDUSTRIAL SURFACE DEFECT DETECTION JO SEONGJUN, Xian Xu
TENS-LLM: Text-guided Neuron Segmentation using Large Language Models Chengda Mo, qiufu li, Xinle Dai, Linlin Shen
GameScope: A Multi-Attribute, Multi-Codec Benchmark Dataset for Gaming Video Quality Assessment Rajesh Sureddi, shreshth saini, Avinab Saha, Alan Bovik
SYNERGY BETWEEN TRAJECTORIES AND HUMAN POSE FOR SOCCER Marc Peral, Guillem Capellera, Luis Ferraz, Antonio Rubio, Stella Grasshof, Dan Witzner Hansen, Antonio Agudo
Breaking Camera Frame-Rate Limits: A Multi-View Dataset and Baseline for High-Frequency 3D Pose Reconstruction Yuxuan Liu, Zixuan Wang, Junliang Xing, Haizhou Ai
Transfer Anyone: High-Fidelity Human Transfer on Motion Video via Diffusion-Based Reconstruction Haocheng Tang, Ruoke Yan, Xuanyi Liu, Xiaolong Zhang, Bin Zhao, Siwei Ma, Chuanmin Jia
Few-shot Source-Free Domain Adaptation for Surface Defect Detection Qianyu Zhou
Individual Prompt Tuning: A Single-Model Multi-User Framework for Personalized Image Aesthetic Assessment Jiaqi Shi, Xinying Yang, Zhang xiaodan, Zhenxing Niu, Fei Gao
MULTI-SCALE LARGE KERNEL ATTENTION FOR SINGLE-IMAGE DERAINING Congcong Zeng, Dan Xu, Yinghui Zhu, Jiangang Pan, Kangjian He, Hongzhen Shi
DAT: DUAL ATTENTION TRANSFER TO BRIDGE THE SEMANTIC GAP FROM VISION FOUNDATION MODELS TO CNNS Yingbin Wang, Jielei Wang, Qianxin Xia, Xuewan He, Zihan Cheng, Guoming Lu
Pathological Image Diagnosis under Label Noise Conditions Using Bias-Aware Adaptive Knowledge Distillation Masato Watanabe, Wonjik Kim, Kazuki Uehara, Shuta Tsuchio, Hirokazu Nosato, Hidenori Sakanashi
Frequency-Adaptive Depth-Haze Consensus with Semantic Priors for Single Image Dehazing Ahmed Sakr, Hicham G. Elmongui, Marwan Torki
Robust Knowledge Distillation Powered Lightweight Semantic Communication Method for Remote Sensing Image Zhongqiang Zhang, Zeyang Meng, Guangming Shi, Fanyang Meng, Ye Wang, Shuhang Zhang, Lin Mei
ATTRIBUTE-ENHANCED PROMPT LEARNING FOR ZERO-SHOT CROSS-MODAL RETRIEVAL Jiyan Wang, Yuanbo Zhu, Ge Song, Wanqi Yang
Screen-Shooting Resilient Watermarking based on Long-Range Modeling and High-Frequency Enhancement Jun-Zhuo Zou, NanRun Zhou, Jane Wang, Zhihua Xia, Xiangui Kang
Event-Based Batting Impact Estimation Ryotaro Ishida, Wataru Ikeda, Ryosei Hara, Akemi Kobayashi, Toshitaka Kimura, Mariko Isogawa
Contextual Copy-Paste Sample Augmentation for Multi-Class Remote Sensing Object Detection Xue Zhang, Yanxia Wu, Dan Lin, Ruoyu Wang, Guoyin Zhang, Nebojsa Bacanin
A Video Semantic Coding Framework Using Shared Prior Knowledge and Latent Feature Residuals lyx liuYuxiang, Yiping Duan, Qiyuan Du, Ning Ge, Xiaoming Tao, Guangyi Liu
PHOTOMETRIC STEREO PRIOR BOOSTED SPARSE MULTI-VIEW STEREO jilong zhang, songyun yang, Yufei Han, Zhanyu Ma, Heng Guo
Deep Structure-Texture Guided Image Compression via Cross-Scale Interaction and Dual-Domain Enhancement Lijun Zhao, Jiaxin Wang, Kunkun Tu, Kailong Cao, Jinjing Zhang
Stable-NAE: Stabilizing Natural Adversarial Example Generation Using Adaptive Control and Momentum Hui Kuurila-Zhang, Haoyu Chen, Guoying Zhao
IDAG-Edit: Multi-Object Video Editing via Instance-Decoupled Attention and Guidance Yuan-Zhih Lin, Thang Nguyen Huu, Huu-Phu Do, Hong-Han Shuai, Ching-Chun Huang
LEARNING INTERPRETABLE INTERIOR STYLE SEMANTICS VIA LARGE MULTIMODAL MODEL REPRESENTATIONS Junya Yamamori, Ren Togo, Teruhisa Yamashiro, Takahiro Ogawa, Miki Haseyama
A TWO-STAGE IMAGE CROPPING METHOD BASED ON COMPOSITIONAL CONSISTENCY Ran Shi, Lu Feng, Penghao Wang, Tong Qiao
Optimal Neural Architecture Search for Kolmogorov-Arnold Network-based Image Classification Anurag Dutta, Sweta Dey, Rajat Subhra Chakraborty
U^2Mamba: A Two-level Nested U-structure Mamba for Salient Object Detection Junhui Li, Jialu Li, Youshan Zhang
LoREnc: Low-Rank Encryption for Securing Foundation Models and LoRA Adapters Beomjin Ahn, Jungmin Kwon, Chanyong Jung, Jaewook Chung
PROMPT-FREE AND EFFICIENT SAM2 ADAPTATION FOR BIOMEDICAL SEMANTIC SEGMENTATION VIA DUAL ADAPTERS Hinako Mitsuoka, Kazuhiro Hotta
Secret Geometric Deformation for 3D Object Protection Khélian Larvet, William, Puech,
LTOP-Net:Lightweight Transformer Occupancy Prediction Net for Octree-Based Point Cloud Geometry Compression 小俏 张, Anhong Wang, Tillo Tammam, Donghan Bu, Hao Jing, Jing Zhang
Is SAM3 Ready for Pathology Segmentation? qiuyu kong, Shakiba Sharifi, Yiming Wang, Marco Cristani, Zanxi Ruan
Stream-FCGS: Fast Compression and Streaming for 4D Gaussian Splatting Mingjia Yang, Haocheng Tang, Zheng Wang, Xueying Chang, Siwei Ma, Jiaqi Zhang
Multiple Scale Latents for Learned Image Compression Jonas Brenig, Radu Timofte
Mixture-of-Experts-based Entropy Model for Learned Image Compression Jonas Brenig, Radu Timofte
MattenIR: Efficient Image Restoration with Local Attention and Global-Aware State Space Duality Qiwei Dong, Siyu Zhang, Weichao Wang, Wendong Mao, Zhongfeng Wang
Multimodal Confidence Modeling in Audio-Visual Quality Assessment Mayesha Maliha R. Mithila, Mylene Farias
CDMesh: High-Fidelity Sparse-View Mesh Reconstruction with Consensus Diffusion Priors Haoyang Wang, Liming Liu, Xinggong Zhang
Structure-Aware Blind Inpainting for Electron Microscopy Images Zhicheng Wang, Jiateng Shou, Haiqun Jin, Zhiwei Xiong
MBHNet: Multimodal Brain Hallucination Network for Fluid Intelligence Prediction under Missing Structural Connectivity Chong Cheng, Gang Yang, Yu Li, Xun Chen, Aiping Liu
Feature Optimized Dynamic Spectral Correlation Subspace Clustering for Hyperspectral Band Selection Yingying Chu, Xiaodi Shang, Jiahua Zhang, Xudong Sun
ARE FACIAL ACTION UNITS DISCRIMINATIVE FEATURES TO DETECT DEEPFAKES? Paul Chaurand, Ewa KIJAK
DYNAMIC CROSS-MODAL COMPRESSION AND CYCLIC FUSION FOR MULTI-SPECTRAL VEHICLE RE-IDENTIFICATION UNDER SEVERE FLARE CONDITIONS Zhongzheng Liu, di wu
YawDD+: Frame-level Annotations for Accurate Yawn Recognition on Edge Platforms Ahmed Mujtaba, Gleb Radchenko, Marc Masana, Radu Prodan
SIMI: Self-information Mining Network for Low-light Image Enhancement Xuanshuo Fu, Lei Kang, Javier Vazquez-Corral
Towards Multi-Modal Forgery Representation Learning for AI-Generated Video Detection and Localization Dat Le, Khoa Nguyen, Xin Wang, Shu Hu
RING-SHAPED PEPS TENSOR NETWORK DECOMPOSITION FOR HIGH DIMENSIONAL DATA IMPUTATION Rongfeng Huang
DINO-Detector: Leveraging Pre-trained DINO Features for One-Shot 3D Craniofacial Landmark Localization Kaichen Nie, Tianmin Xu, Yuru Pei
Controllable Medical Anomaly Synthesis via Image Editing Yuxin Yang, Haimiao Zhang, Ligen Shi, Di He, Chang Liu, Jun Qiu
SynTeX: Data-Efficient LaTeX OCR via Synthetic Pretraining and Limited Fine-Tuning Yuhan Xu, Yijun Zhao, Renqing Luo, Gary Weiss
Scene-Action Prompt Fusion for Coherent Text-to-Video Storytelling Taewon Kang, Divya Kothandaraman, Ming C. Lin
Improving color fidelity on color E-Paper displays using curve-based transforms Dounia Hammou
GRACE:Estimating Geometry-level 3D Human-Scene Contact Chengfeng Wang, Wei Zhai, Yuhang Yang, Yang Cao, Zheng-Jun Zha
AGREEMENT-DRIVEN MULTI-VIEW 3D RECONSTRUCTION FOR LIVE CATTLE WEIGHT ESTIMATION Rabin Dulal, Wenfeng Jia, Lihong Zheng, Jane Quinn
Retrieval-Driven Knowledge Injection for Context-Aware Video Captioning Karina Abubakirova, Waseem Ullah, Mohsen Guizani
Model-Aware Rate–Distortion Limits for Task–Oriented Source Coding Andriy Enttsel, Vincent Corlay
DepthFix3D: Depth-Guided Diffusion for Artifact Removal in 3D Gaussian Splatting Haoshuai Fu, Junlin Hao, Peiheng Wang, Haoyang Wang, Xinggong Zhang
Semantic-conditioned latent diffusion for low-field brain MRI enhancement Dong Zhang, Jiaxun Gao, Caohui Duan, Xin Lou, Jane Wang
Panoptic3D: Leveraging 3D Pseudo Supervision for Panoptic Occupancy Prediction Dian Jia, Pei Yu, Xiaoqian Ruan, Hyeonjeong Park, Wei Tang
Efficient Unsupervised Metric Learning with UMAP-Based Pseudo-Labeling Dhanunjaya Varma Devalraju, Chandra Sekhar
Respiration modulates pathological brain cardiovascular pulsation propagation in Alzheimer’s disease Youssef Hosni
STQFORMER: SPATIO-TEMPORAL QUATERNION TRANSFORMER FOR VIDEO FRAME DENOISING Aoqing Jin, Shiming Zhang, Shuihua Wang
TriGaze: Camera-Guided 3D Representations for Robust In-Vehicle Gaze Estimation Cao Boxiang, Ming Cao, Gu Qingfeng, Pengfei Huang, Chi Jiannan, Liu Jiahui
Seeing Through the Glare: Robust Nighttime Stereo Depth Estimation via Physics-Guided Synthesis Yuanfan Guo, Zhaolin Xiao, Haonan Su, Qiyuan Zhang
Context and Pixel Aware Large Language Model for Video Quality Assessment Wen Wen, Yaohong Wu, Yue Sheng, Neil Birkbeck, Balu Adsumilli, Yilin Wang
Facial Feature-guided Adaptation for Talking Head Video Compression Riku Takahashi, Ryugo Morita, Jinjia Zhou
AdaFusion: Adaptive Degradation-Aware Infrared and Visible Image Fusion with Cross-Modal Mixture of Experts Lihao Lai, Jiangtao Nie, Lei Zhang, Xiaoguang Guo, Sen Peng, Wei Wei
Multi-Light Relightable Gaussian Splatting with Phong Reflectance from in-the-wild images Duy Khanh Ngo, Huu-Phu Do, Ching-Chun Huang
Label Noise Detection via Loss Dynamics and Predictive Stability: A KL-Divergence and Statistical Feature Guided Approach Zhipeng Zhang, Wenting Ma
Map-Mono-Ego: Map-Grounded Global Human Pose Estimation from Monocular Egocentric Video Hiroyuki Deguchi, Ryosuke Hori, Kotaro Amaya, Tsubasa Maruyama, Mitsunori Tada, Hideo Saito
REGION-OF-INTEREST AND UPSAMPLING-ENHANCED POINT CLOUD TRANSMISSION FOR 3D MACHINE VISION Ao Luo, Weishuai Song, Diego Fujii, Keisuke Nonaka, Linxin Song, Heming Sun, Xuelian Cheng, Jiro Katto
TEXT-PILOT: INTELLIGENT VISUAL TEXT PLANNING AND MANIPULATION VIA MULTI-MODAL LLM AS AGENT Yuan Kang Kuo, Quang-Thang Le, Ngoc-Phu Doan, Ching-Chun Huang
Out-of-Distribution Detection with Angular-Magnitude Likelihood and Targeted Feature Refinement Atik Garg, Yu-Shuen Wang
SplatShield: Adversarial Protection for 3D Gaussian Splatting Against Instruction-Guided Editing Sejin Oh, Suhyeon Ha, Joonsung Jeon, Sung-Eui Yoon
UNSUPERVISED DATA-EFFICIENT CROSS-MODAL RETRIEVAL WITH GLOBAL-NEIGHBORHOOD ALIGNMENT HASHING Runhao Li, Xiaoxu Ma, Zhenyu Weng, Yue Zhang, Guibo Luo, Huiping Zhuang, Zhiping Lin, Yap-Peng Tan
Fractional Fourier Near-Field Ptychography Haoyuan Liu, Zhiyi Zhang, Yixiao Yang, Ran Tao
Selective Global to Local Alignment for Vision Language Retrieval Wei Li, Jiale Chen, Yuanpeng Wang, Jiaxun Li, Yuehai Wang
FRÉCHET WAVELET STYLE DISTANCE: AN INTERPRETABLE IMAGE STYLE SIMILARITY METRIC Abhijat Bharadwaj, Animesh Kumar, Deekshant Kumar, Vikram M. Gadre
Motion-Guided Latent Diffusion for Full-Frame Video Stabilization Huyue Zhu, Dachun Kai, Jiaxiao Wang, Jie Chen, Zhangchi Hu, Quanquan Hu, Xiaoyan Sun
MetricDepth-VLM: Internalizing Metric Spatial Reasoning in VLMs via Depth Discretization and Geometry-Semantic Alignment KO CHIHI, Hao-Chiang Shao, Chih-Tsung Shen
Few-Shot Domain Adaptation with Temporal References and Static Priors for Glacier Calving Front Delineation Marcel Dreier, Nora Gourmelon, Dakota Pyles, Thorsten Seehaus, Matthias H. Braun, Andreas Maier, Vincent Christlein
DEEP IMAGE SEGMENTATION VIA DISCRIMINANT FEATURE LEARNING Adam Sztamborski, Raül Pérez-Gonzalo, Antonio Agudo
Appearance-Routed Fusion for Egocentric Activity Recognition with Synthetic Audio and Depth Cagri Gungor, Adriana Kovashka
CPDDNet: Color-Polarization Denoising and Demosaicking Network Qihang Zhang, Yusuke Monno, Masayuki Tanaka, Masatoshi Okutomi
DISTILLING NOISELESS FEATURES FOR NOISE-ROBUST MONOCULAR 3D POSE ESTIMATION Asuka Ishii, Hiroo Ikeda
Rate-Distortion Optimized LoRA for Efficient Post-Filtering in AV2 Kequan Mao, Xin Yang, Urvang Joshi, Debargha Mukherjee, Dandan Ding
WDFG: Wavelet-Based Dual Frequency Guidance via Foundation Model Priors for Depth from Focus Jeongho Park, Sungmin Woo, Wonjoon Lee, Inseok Jeon, Sangyoun Lee
SAG: Spatial-Attention-Guided Motion Customization for Text-to-Video Diffusion Models Cheng Lei, Delong Liu, Fei Su, Zhicheng Zhao
FD-DIFF: FREQUENCY DECOUPLING AND DUAL-STREAM COLLABORATIVE DIFFUSION FOR 3D FACE RECONSTRUCTION AND ALIGNMENT Xiangzheng Li, Peng Han, Xiaoli Luo, Baiying Dong
GLCV: A Generalized Learnable Cost Volume for Per-pixel Visual Correspondence Sehoon Oh, Ue-Hwan Kim
COHESION: CONSENSUS-BASED HALLUCINATION SUBSPACE ESTIMATION FOR MULTIMODAL LARGE LANGUAGE MODELS Wei-Han Chen, Yu-Feng Chen, Jun-Cheng Chen
Tuning-free Instruction-based Video Editing Via Structural Noise Initialization and Guidance Song Wu, Xinyu Chen, Qian Wang, Liang Li, Junlan Feng, Zili Yi
Real-Time Image Restoration via Adaptive Decoupled Knowledge Distillation jiahui liu, Haoran Bai, Ying Chen, sibin deng
Novel Low Operation Point In-Loop Filter for VVC Using Learning Rate Scheduling and SIMD Acceleration Jiang Han, Cheolkon Jung, Qipu Qin
VT-JRD: TASK-AWARE VIDEO CODING FOR MACHINES USING VISION TRANSFORMER AND JUST RECOGNIZABLE DISTORTION Sanaz Nami, Farhad Pakdaman, Moncef Gabbouj
DYNAMIC WEIGHT-BASED TEMPORAL AGGREGATION FOR LOW-LIGHT VIDEO ENHANCEMENT UNDER EXTREME NOISE Ruirui Lin, Guoxi Huang, Nantheera Anantrasirichai
LEARNING SPATIALLY ADAPTIVE SPARSITY LEVEL MAPS FOR ARBITRARY CONVOLUTIONAL DICTIONARIES Joshua Schulz, David Schote, Christoph Kolbitsch, Kostas papafitsoros, Andreas Kofler
AN OVERVIEW OF WARP PREDICTION IN AV2 Mohammed Sarwer, Yeqing Wu, Rachel Barker, Yunqing Wang, Debargha Mukherjee, Han Gao, Jayasingam Adhuran
Leveraging NeRF-Rendered Images for 3D Gaussian Splatting Mizuki Morikawa, Yuta Shimizu, Chunyu Li, Yusuke Monno, Masatoshi Okutomi
Person–Object Relationship Consistency Learning for Zero-Shot Spatio-Temporal Action Detection Yasunori Babazaki, Takashi Shibata, Toru Takahashi
MSCDF: MULTI-SCALE CROSS-DOMAIN FUSION NETWORK FOR UNDERWATER IMAGE ENHANCEMENT Xing Li, Yanfang Wang, Ziqi Dong
TDF-NET : A FREQUENCY-AWARE REPRESENTATION LEARNING GUIDED FUSION NETWORK FOR INFREAD AND VISIBLE IMAGES Hang Xu, Hao Wang
INTERFERENCE-RESISTANT FINE-GRAINED CLASSIFICATION OF HIGHLY SIMILAR NASOPHARYNGEAL ENDOSCOPIC STRUCTURES Pengcheng Wang, Hao Wang, Yiping Wang, Zhichao Zhang
PSCA-NET: INTEGRATING PHYSICAL TRACES AND SEMANTIC CONTEXT FOR AI-GENERATED IMAGE FORGERY DETECTION AND LOCALIZATION Yi Zhang, Qiang Xu, Wenpeng Mu, Jianhao Fu, Tanfeng Sun, Xinghao Jiang
Non-uniform Structured Pruning for Efficient Diffusion-based Real-world Image Super-resolution Le Khang Nguyen, Kevin Ho Man Cheng
Granulo-10k: A Large-Scale Benchmark Dataset for Multiple-View Industrial Granulometry Pasquale Coscia, Angelo Genovese, Vincenzo Piuri, Fabio Scotti
HIERARCHICAL FILTER BAND SELECTION FOR MULTISPECTRAL OBJECT CLASSIFICATION Katja Kossira, Jürgen Seiler, Andre Kaup
HPGN: Hybrid Priors-Guided Network for Compressed Low-Light Image Enhancement hantang li, qiang zhu, xiandong meng, lei xiong, Shuyuan Zhu, Xiaopeng Fan
Color Constancy in Hyperspectral Imaging via Reduced Spectral Spaces Gunnar Dofri Vidarsson, Liying Lu, Sabine Süsstrunk
EFFICIENT VARIABLE-RATE STATE-SPACE MODEL FOR IMAGE COMPRESSION WITH CHANNEL-WISE ENTROPY Bouzid AREZKI, Anissa Mokraoui, Fangchen FENG
Hierarchical Prompt-Aware Zero-Shot Out-of-Distribution Detection Marouane HADJ-ALI, Florence Alberge
Deep Learning-based Compressed Domain Event Data Classification Abdelrahman Seleem, André F. R. Guarda, Nuno Rodrigues, Fernando Pereira
DM-QPMNET: DUAL-MODALITY FUSION NETWORK FOR CELL SEGMENTATION IN SINGLE SHOT QUANTITATIVE PHASE MICROSCOPY Rajatsubhra Chakraborty, Anna Espinosa–Momox, Riley Haskin, Depeng Xu, Rosario Porras-Aguilar
MPS-RETNET: MULTI-SCALE PROTOTYPE-GUIDED SEMI-SUPERVISED LEARNING WITH QUALITY-AWARE SUPERVISION FOR RETINAL DISEASE CLASSIFICATION Maisam Abbas, Ran-Zan Wang
ZERO-SHOT 3D ANOMALY DETECTION USING PRE-TRAINED MODELS Lukun Hu, Hengyi Chen, Yiguo Lou, Zhaocheng Yang, Junwen Ji, Dan Li
Generalizable 3D Gaussian Splatting Guided by a Vision Foundation Model Jie Liang, Cheolkon Jung
SLT: A LAPLACIAN-GUIDED TRANSFORMER FOR MULTI-SCALE SPECTRAL-CHANNEL MODELING IN HYPERSPECTRAL IMAGE CLASSIFICATION Chi Zhang, Jungkwon Kim, Jihun Kim, Jeonghyeon Park, Kwangsun Yoo, Seok-Joo Byun
LOGOFLOW: VISUAL SALIENCY-AWARE ADVERSARIAL ATTACK ON LOGO-BASED PHISHING DETECTORS Yena Cho, Heesung Jeong, Sukyeong Bang, Doowon Kim, Hyoungshick Kim
CA3-GS:COMPLEXITY-AWAREADAPTIVEANCHORALLOCATIONFOR3DGAUSSIAN SPLATTING Genqiang Shi, Qiuming Liu, Changjian Zhu
HYBRID ISP: COMBINING SPARSE CNN WITH DENSE INTERPOLATION FOR ON-SENSOR REAL-TIME 4×4 BAYER IMAGE RECONSTRUCTION Oren Girshkin, Tamar Dreifuss, Tal Bernstein
VLM-DREAMER: VLM-IMAGINED BI-DIRECTIONAL INPAINTING FOR SINGLE-IMAGE 360 SCENE GENERATION TingWei Huang, Fu-En Yang, Min-Hung Chen, Yen-Yu Lin, Yu-Lun Liu
Neuro-Symbolic Video Anomaly Detection via Attribute-Based Reasoning Sofya Filippova, Steven Korevaar, Son Hoang Dau, Trung Pham, Tam Cao, Ruwan Tennakoon
SPATIO-TEMPORAL BIFURCATE-FUSION SPIKE TRANSFORMER ZeFeng Chen, Ziliang Ren, Yangyang Chen, Xiangyang Gao, Qieshi Zhang
The DARRL dataset: Demonstrations for Action Recognition and Robot Learning, extended with gaze data and scene graphs Badr Tahri Joutei, Mathieu Riand, Patrick Le Callet, Alexandre Bruckert, Laurent Dollé
A histogram-based method to extract tag-image from film shot Benjamin Serva, Frédéric Comby, Olivier Strauss, Loig Le Bihan, William, Puech,
Low-Delay Dynamic Point Cloud Attribute Compression via Cross-Coordinate Attention Xiangzuo Liu, Ruishan Huang, Zhikai Liu, Fan Liang
Cross-Modal Slot Alignment for Data-Efficient Multiclass Defect Classification selen pehlivan
A Mixture of Measurement Strategies Framework for Monocular Mobile Rebar Spacing Inspection Cheng-En Li, Jue-Yu Lai, Hung-Kai Hsiao, Chang-Yuan Hsiao, Peggy Joy Lu
A Clinically Relevant and Interpretable Scoring Protocol for Medical Image Enhancement Dong Zhang, Caohui Duan, Xin Lou, Jane Wang
Task-Adaptive Sparse Update for Efficient Continual Learning Takuma Ishibashi, Hikari Otsuka, Junnosuke Suzuki, Daichi Fujiki, Masato Motomura
PGM-Net: Prior-Guided Mamba Network for Pancreas Segmentation Chongshang Zhong, Jun Chen, Qiaoying Teng, Kai Han, Yi Liu, Zhe Liu
MuCALD-SplitFed: Causal-Latent Diffusion for Privacy-Preserving Multi-Task Split-Federated Medical Image Segmentation Chamani Shiranthika, Hadi Hadizadeh, Parvaneh Saeedi
Egocentric Whole-Body Human Mesh Recovery with Prior-Guided Learning Soyeon Na, Seung Young Noh, Ju Yong Chang
IAFE: ILLUMINATION AWARE FREQUENCY ENHANCEMENT NETWORK FOR LOW-LIGHT IMAGE DEBLURRING Chenyuan Jiao, Xun Yang, Yaoru Sun, Xuejie Yang
Robust Bridge Defect Detection via Dynamic Snake Convolution and Hierarchical Feature Fusion Mingdi Hu, SiChen Chen, Bing Yi Jing
Depth from Defocus via Direct Optimization Holly Jackson, Caleb Adams, Ignacio Lopez-Francos, Benjamin Recht
SCIPS: SINGLE-SHOT PHOTOMETRIC STEREO FROM A SNAPSHOT COMPRESSED MULTISPECTRAL IMAGE Yunhao Li, Yanan Hu, Xiaodong Wang, Yuze Yang, Ziyi Meng, Xin Yuan, Peidong Liu
REC-RL: Referring Expression Counting via Gaussian and Range-Based Reward Optimization Hui Liu, Yunlai Teng, Kunlong Bai, Pengfei Qi, Yan Haotian, Liang Li, Junlan Feng
ALF: Sharpness-Aware Adaptive Layer Fusion for Training-Free Anomaly Detection Deyu Yang, Ziliang Ren, Yu Zou, Hongchao Gao, Ying Liu
UniGeoDiff: Unified Geometry-Aware Diffusion for Single-Image 3D Furniture Generation Wen Li, Zongjie Tan, Yuan Liu
When Simplicity Wins: Bottleneck-Aware Context Modeling For Lightweight Semantic Segmentation Mian Muhammad Naeem Abid, Nancy Mehta, Zongwei Wu, Radu Timofte
VolHuMe: a High-Resolution Large Scale Dataset of Volumetric Human Meshes Giulia Martinelli, Niccolò Bisagno, Nicola Garau, Esa Rahtu, Nicola Conci
Aitchison geometry on the simplex for uncertainty quantification in bayesian hyperspectral image unmixing Hector Blondel, Lucas Drumetz, Thierry Chonavel
Controllable blind deblurring with diffusion models Imane SI SALAH, Emile Cribelier, Thomas Veit, Wolf Hauser, Arthur Leclaire
Enhancing Sparse-View 3D Gaussian Splatting with Guidance of Normals Priors and Dense Point Initialization Yi-Huang Hsieh, I-Chen Lin
SPATIAL COMPETITION FOR LOW-COMPLEXITY LEARNED IMAGE COMPRESSION Théophile Blard, Pierrick Philippe, Théo Ladune, Xiaoran Jiang, Olivier Deforges
FM-OVD: Towards Fast Open-Vocabulary Object Detection with Feature-wise Modulation Rahul Singh Maharjan, Angelo Cangelosi
Uncertainty-Guided Latent Diffusion Models for Faithful Super Resolution Ren Wang, Yung-Yu Chuang
MACHINE LEARNING BASED AV2 ENCODER/DECODER EFFORTS Shan Li, In Suk Chong, Stan Vitvitskyy, Raul Blazquez, Joe Young, Conor McCullough, Akshaya Purohit, Urvang Joshi
Repurposing Image Diffusion Models for Training-Free Music Style Transfer on Mel-Spectrograms Heehwan Wang, Joonwoo Kwon, Sooyoung Kim, Jungwoo Seo, Shinjae Yoo, Yuewei Lin, Jiook Cha
Character-Centered Dialogue Generation from Scene-Level Prompts Taewon Kang, Ming C. Lin
A LIGHTWEIGHT THERMAL DENOISING AND OCCLUSION-ROBUST INFRARED DETECTION MODEL FOR SUBSTATION EQUIPMENT Junyi Wu, liqin tian, Weizheng Wang, niu yong, shi xiaoan
SGCLIP: SEMANTIC-GEOMETRIC FUSION FOR TRAINING-FREE OPEN-VOCABULARY SEGMENTATION Rui Xu, Junpu Wang, Huanyu Li, Zhenduo Guo, Chunlei Li
YOLOv8-HRP2: An Efficient Framework for High-Resolution Surface Defect Detection on Shipping Containers Chenchu Huang, Shan Gao, Huinan Shi, Yi Ma, YUHAN LIU, Mårten Sjöström
PaTSeg: A Patch-Level Text Supervision Paradigm for Mask-Free Remote Sensing Segmentation Chi Han Chen, Wen-Huang Cheng, Ching-Chun Huang
CONSTRAINED DENSE CORRESPONDENCE GRAPHS FOR ROBUST STRUCTURE-FROM-MOTION TARGETING ENDOSCOPIC VIDEOS Yu-Chun Lin, Ming Lun Han, Kuang-Chen Yen, Homer Chen
MR-Mono3D: Multi-Resolution Monocular 3D Mesh Reconstruction for Embodied Spatial Perception Hongyi Huang, Jingyi Wu, Han Yu, Juncen Guo, Peng Sun, Liang Song
Zero-Shot Color Constancy by estimating albedo Gabriele Canesi, Marco Buzzelli, Simone Bianco, Raimondo Schettini
A Dual-Branch RGB-Infrared Multimodal Framework for Real-Time Vehicle Detection in Remote Sensing jin Zhang, Yucheng Xia, Kaize Shi, Pengfei Yuan, Xiaoge Li, Mengjiao Wang, jianbo Zheng
A Hybrid CNN–Swin Temporal Attention Network for Nighttime Video Anomaly Detection Yuxuan Jiang, Yunhui Zeng, Yanlei Cui, Xin Jin
EMVMAMBA: A HYBRID CNN-MAMBA MODEL FOR ADDRESSING RARE CLASS CHALLENGES IN FACIAL EXPRESSION RECOGNITION Jun Foo Kui, Lai-Kuan Wong, Yuen Peng Loh, Patrick Le Callet
Test-time image adaptation for semantic segmentation from noisy images via logit refinement and region-constrained activation maximization Myungjun Son, Jaeseong Yu, Hongjae Lee, Seung-Won Jung
3D Unsupervised Sparse Gravimetry Imaging Guided by a Physics-Consistent Neural Field Ana Gabriela Mantilla Dulcey, Yeganeh Gharedaghi, Daniela Quintero Madariaga, Antonio Ortega, Henry Arguello
EFFICIENT HYBRID ADAPTER FOR VISIBLE-THERMAL TRACKING He Wang
Beyond Visual Perception: Mitigating Multimodal Hallucination via Hybrid Preference Optimization Kun Yang, Yuxuan Liu, Jingyi Wu, Lang Qian, Han Yu
Enhanced Detection of Tiny Objects in Aerial Images Kihyun Kim, Michalis Lazarou, Tania Stathaki
Channel-Aware Tensor Nuclear Norm: A Self-Supervised Approach for HSI Inpainting Yunshan Li, Qianqian Wang, Lili Yang, Wenwu Gong
QuadBox: Accelerating 3D Gaussian Splatting with Geometry-Aware Boxes Xinze Li, Bohan Yang, Pengxu Chen, Yiyuan Wang, Hongcheng Luo, CHENG WENTAO, Weifeng Su
Estimation of instrument and noise parameters for inverse problem based on prior diffusion model Jean-François Giovannelli
Towards Quantitative Deep Learning for Image Steganalysis Shijie Zhang, Mingyang Shen, Jane Wang, Xiangui Kang, Dong Wei
Equiangular Prototype Alignment for Unsupervised Domain-Adaptive Medical Image Segmentation Akash Sharma, Arunima Sarkar, Mohanasankar Sivaprakasam
LOCAL SOFT ALIGNMENT FOR HARD-AWARE MULTI-VIEW TEXT-TO-IMAGE PLACE RECOGNITION Lei Ma, Wen Liu, Xuanshun Zhang
Alleviating Hallucination in Large Vision-Language Models via Structure-Aware Adaptive Contrastive Decoding Shunya Shimomura, Haruhiko Murata, Kazuhiro Hotta
SAMba-UNet: SAM2–Mamba UNet for Cardiac MRI in Medical Robotic Perception Guohao Huo, Ruiting Dai, Ling Shao, Hao Tang
Towards More Transferable Architectures for Dense Pose Estimation Shuhei Tarashima, Norio Tagawa
MASAM: Zero-Shot Identity-Consistent Multi-Object Tracking and Segmentation with Memory-Augmented Segment Anything Model Wei-Jie Mu, Cheng-Yu Ho, Shang-Hong Lai
DiRA-Net: Suppressing Cross-Channel Correlated Noise via Differential-Regulated Attention for Low-Light Image Enhancement Qiyuan Zhang, Haonan Su, Zhaolin Xiao, Yuanfan Guo
TaiChiNet: Dual-branch Densely Connected CNN-Transformer with Color-Intensity Factorization for Low-light Image Enhancement Yanan Hu, Mengjie Qin, Yuchao Feng, Yunhao Li, Xin Yuan
OVDM: A training-free open-vocabulary segmentation framework based on diffusion models Haiyang Liu, Shuai Jia, xiaoming huang, Zhiguo Wang
SPIM-Fuse: Sparsely-Coded Image Fusion for Low-Light Enhancement Sena Yagmur Sen, Gazihan Alankus, Mehmet Turkan
Towards Efficient Vision State Space Models via Token Merging Jinyoung Park, Minseok Son, Changick Kim
LFA: Layer Feature Attention for Run-time Introspection of 2D Object Detectors in Automated Driving Mert Keser, Alois Christian Knoll
CLOTH-HUGS: CLOTH AWARE HUMAN GAUSSIAN SPLATTING Sadia Mubashshira, Nazanin Amini, Kevin Desai
MDE-VIO: ENHANCING VISUAL-INERTIAL ODOMETRY USING LEARNED DEPTH PRIORS Arda ALNIAK, Sinan Kalkan, Mert Ankaralı, Afsar Saranli, Aydin Alatan
Invariants to Blur and Channel Mixing of Color Images Václav Košík, Jan Flusser, Filip Sroubek
GAFSEG: GRADIENT-AWARE FEDERATED LEARNING FOR MEDICAL IMAGE SEGMENTATION Sayanta Sen, Gargi Panda, Saumik Bhattacharya
LIFTING-BASED GEOMETRY OPTIMIZATION FOR 3D DYNAMIC MESHES IN THE V-DMC FRAMEWORK Wenjie Zou, Xuanrui Zhang, Fuzheng Yang
Deep Unfolding with Hybrid Mamba-Convolutional Transformers for Hyperspectral Image Reconstruction ZHOU XINGYU, Xian-Hua Han
TAE: Target-Aware Enhancer for Nighttime UAV Tracking Yanyan Chen, Ruigang Fu, Yu Song, Ping Zhong
UGHM-Net: Uncertainty-Aware Dynamic Confidence and Backtracking for Visual-Semantic Hierarchical Image Classification YANG CHU, Keli Deng, Xiaomeng Yang, Yuntao Qian
DASR-NET: UNSUPERVISED FINE-GRAINED ANOMALY SEGMENTATION VIA DISTRIBUTION ALIGNMENT AND SELECTIVE FEATURE RECONSTRUCTION Xin Yuan, Huanyu Li, Junpu Wang, Miao Yu, Chunlei Li
COBI-CLIP: ENHANCING CLIP WITH CONVOLUTIONAL ADAPTERS AND BIDIRECTIONAL ALIGNMENT FOR ZERO-SHOT ANOMALY DETECTION Xiangshuai Zhao, Junpu Wang, Huanyu Li, Miao Yu, Chunlei Li
FULL-REFERENCE POINT CLOUD QUALITY ASSESSMENT USING GRAPH NEURAL NETWORK-BASED REGRESSION Ryosuke Watanabe, Hiromu Yoshida, Tomoaki Konno
THE IN-LOOP FILTERING PIPELINE IN AV2 Debargha Mukherjee, Jianle Chen, Onur Guleryuz, Lin Zheng, In Suk Chong, Andrey Norkin, Yixin Du, Yunfei Zheng, Tianqi Liu, Khanh Quoc Dinh, Yangwoo Kim, Kwang Pyo Choi
Exposing and Erasing Identity in Skeleton Motion: A New Evaluation Protocol and Adversarial Anonymization Framework Ying-Shuo Lee, Pei-Yuan Wu
MYST: Benchmarking Ecological and Cross-Medium Generalization in Sea Turtle Re-Identification Wan Jun Nah, Juanita Joseph, Shier Nee Saw, Wai Lam Hoo
HSSP: Training-Free Visual Token Pruning with Head Selection and Spatial Constraints Yuan-Hsi Lo, Hsi-Ren Hung, Huei-Fang Yang
G-MASt3R-SfM: Graph-based View Pruning and Multi-stage Optimization for Robust SfM Toshiki Watanabe, Shintaro Ito, Natsuki Takama, Koichi Ito, Takafumi Aoki
Implicit Subblock Transform for Versatile Video Coding jiazhen wang, Zhuoyuan Li, Yao Li, Jialin Li, Li Li, Dong Liu
Adapt2Hide: Leveraging Off-the-shelf Autoencoder for Reversible Visual Processing Ernie Chu, I-Sheng Fang, Tai-Ming Huang, Pin-Yen Chiu, Vishal Patel, Jun-Cheng Chen
MICROVITV2: BEYOND THE FLOPS FOR EDGE ENERGY-FRIENDLY VISION TRANSFORMERS Novendra Setyawan, Chi-Chia Sun, Mao-Hsiu Hsu, Wen-Kai Kuo, Jun-Wei Hsieh
CFE-PPAR: Compression-friendly encryption for privacy-preserving action recognition leveraging video transformers Haiwei Lin, Shoko Imaizumi, Hitoshi Kiya
ChestR1: Ground-Truth Augmented Reinforcement Learning for Chest X-Ray Analysis Yichi Cai, Ying Yu, Yubin Wang, Huimin Yu
VVC film grain synthesis in video coding for machines Rudolf Kortelahti, Tero Partanen, Alexandre Mercat, Jarno Vanne, Miska M. Hannuksela, Honglei Zhang
Bridging the Modality Gap via CLIP-Driven RoI-Level Semantic Alignment for Infrared Object Detection Minju Baek, Hyeongseok Oh, Joonki Paik
QUALITY-SMOOTHING RATE CONTROL FOR LEARNED VIDEO TRANSCODING VIA RATE-QUALITY PREDICTION Nianxiang Fu, Daiqin Yang, Zhenan Lin, Chao Zhou
TIME-VARYING RPPG SIGNAL SEPARATION VIA BLOCK-SPARSE SIGNAL MODEL Kosuke Kurihara, Yoshihiro Maeda, Daisuke Sugimura, Takayuki Hamamoto
IMPROVING REFERENCE PICTURE RESAMPLING FOR VVC WITH SCALE AND UPSAMPLING GUIDED DOWNSAMPLING Junying Su, Wenzhuo Zhang, Nianxiang Fu, Daiqin Yang, Zhenan Lin, Chao Zhou
GALA: GAUSSIAN LAYERS, AN EFFICIENT OVERLAP-AWARE 3D GAUSSIAN SPLATTING Matthieu Gendrin, Stephane Pateux, Théo Ladune, Xiaoran Jiang, Luce Morin
Balancing Stability and Plasticity in Sequentially Trained Early-Exiting Neural Networks Alaa Zniber, Ouassim Karrakchou, Mounir Ghogho
From Division to Decision: Leveraging Temporal Cell-Stage Segmentation for Embryo Transferability Prediction Yasmine HACHANI, Patrick Bouthemy, Elisa Fromont, Véronique Duranthon, Ludivine Laffont, Alline de Reis
Scale-Invariant Geometric Regularization for 3D Gaussian Splatting via Pearson Correlation ZhaoYang Wang, Zonghua Yu, Huaijun Wang, Junhuai Li, Shuai Hu, Haiyan Jin
DECODER-DERIVED ACTIVATION MECHANISM FOR NEURAL NETWORK IN-LOOP FILTERS IN VIDEO CODING Maria Santamaria, Done Bugdayci Sansli, Francesco Cricri
MedSAE: Dissecting MedCLIP Representations with Sparse Autoencoders Riccardo Renzulli, Colas LEPOUTRE, Enrico Cassano, Marco Grangetto
Training-Free Stimulus Encoding for Retinal Implants with Sparse Projected Gradient Descent Henning Konermann, Yuli Wu, Emil Mededovic, Volkmar Schulz, Peter Walter, Johannes Stegmaier
M-COLOR: MLLM-GUIDED DIFFUSION MODELS FOR IMAGE COLORIZATION Frank Lin, Wen-Jiin Tsai
An Efficient and Accurate Registration for Structured Light based Intraoral Scanning Xinzhou Du, Yuping Ye, Yixin Zhuang, Juan Zhao, Zhan Song
Improving zero-shot industrial defect detection exploiting LMMs as inverse reasoners Maria Tzelepi, Nikolaos Dimitriou, Christos Gkogkos, Dimitrios Tzovaras
Context-Aware Multimodal Depression Detection via LLM-Derived PHQ-9 Personal Feature Injection Huiyu Yang, Puneet Kumar, Xiaobai Li
TaperVLMs: TASK-AWARE PRUNING OF VISION-LANGUAGE MODELS USING INFORMATION BOTTLENECK PRINCIPLE Peyman Rostami, Dan Pineau, Nassim Ali Ousalah, Anis Kacem, Djamila Aouada
RATE-DISTORTION OPTIMIZED NONLINEAR TRANSFORM CODING FOR VVC RESIDUAL BLOCKS Antoine Monier, Pierre Hellier, Fabrice Le Léannec, Karam Naser, Aline Roumy
Event-Image Deep Stereo Using Multi-Scale Cross Modal Attention Jie Liang, Cheolkon Jung
HiPerViT: A Hybrid Multi-Scale Encoder for Hierarchical Patch Representation on Imbalanced Low-Resolution Data Sakib Ahammed, Xia Cui, Xinqi Fan, Wenqi Lu, Moi Hoon Yap
Federated SVDD Prototype Exchange for Decentralized Object Detection in Natural Disasters Evgenios Vlachos, Vasileios Mygdalis, Ioannis Pitas
LATREF: CONTROLLABLE ILLUMINATION GENERATION AND REFLECTANCE ESTIMATION FROM A SINGLE IMAGE WITH LATENT DIFFUSION MODELS Li Luo, Daljit Singh Dhillon
Refining Acoustic-Based 3D Human Pose Estimation via Vision Pretraining Han-Hsin Lin, Zhong-Wei Lin, Pei-Yuan Wu
PLESS: Pseudo-Label Enhancement with Spreading Scribbles for Weakly Supervised Segmentation Yeva Gabrielyan, Varduhi Yeghiazaryan, Irina Voiculescu
EMOVIS: EMOTION-OPTIMIZED IMAGE PROCESSING Dor Barber, Rony Zatzarinni, Hava Matichin, Noam Levy
Overview of the block-partitioning framework in AV2 Urvang Joshi, Chi Yo Tsai, Yue Chen, Jayasingam Adhuran, Leo Zhao
DISTRIBUTIONAL MODELING OF EVENT-CAMERA STREAMS VIA HIERARCHICAL INTERACTION LEARNING Wessim Omezzine, Lionel Fillatre
4D Neural Voxel Splatting: Dynamic Scene Rendering with Voxelized Guassian Splatting Chun Tin Wu, Jun-Cheng Chen
Revealing details in adaptation process: Perceptual Sensitivity Adaptive Volumetric Mamba for Low-light Image Enhancement Huiling Zhou, Qingbo Wu, Fanman Meng, Hongliang Li
Label Free Change Detection: A Statistical Shift for Zero-Shot Change Detection Haolin Huang, Peiyao Guo, Zhizhuo Jiang, Yu Liu
Exploring fine-grained UGC compression quality in a no-reference based approach Yilin Wang, Andreas Pastor, Yaohong Wu, Suriya Prakash Jambunathan, Neil Birkbeck, Balu Adsumilli
Task-adaptive Local Rate Control for Neural Video Coding for Machines Marc Windsheimer, Simon Deniffel, Andre Kaup
Refining the Anatomical Representation of Autism: A Comparative sMRI Study of ROI, and Vertex-Level Features for ASD Communication Severity Grading Mostafa Abdelrahim, Mohamed Khudri, Moumen El-Melegy, Ali Mahmoud, Ahmed Shalaby, Asem Ali, Mohammed Ghazal, Fatma Taher, Ashraf Khalil, Sohail Contractor, Gregory N. Barnes, Ayman El-Baz
Direct Kernel Optimization: Efficient Design for Opto-Electronic Convolutional Neural Networks Ali Almuallem, Harshana Weligampola, Abhiram Gnanasambanbdam, Wei Xu, Dilshan Godaliyadda, Hamid Sheikh, Stanley Chan, Qi Guo
Proximal Vision Transformer: Geometry-Inspired Feature Enhancement Haoyu Yun, Hamid Krim, Emilie Chouzenoux, Jean-Christophe Pesquet, Bo Jiang
A Comprehensive Analysis of Lightweight Design Strategies for Camouflaged Crop Detection Mari Salvador Lapuz, Audrea Arjaemi Tabadero, Charles Joseph Hinolan, Mac Andre Javellana, Arren Matthew Antioquia
Confidence-Gated Training for Efficient Early-Exit Neural Networks Saad Mokssit, Ouassim Karrakchou, Alejandro Mousist, Mounir Ghogho
SAR-NEUS: NEURAL SURFACE INVERSE RENDERING FOR 3D RECONSTRUCTION AND NOVEL VIEW SYNTHESIS FROM SAR IMAGERY Miguel Andres Alonso, Jose Delgado, Alejandro Betancor del Rosario, Giorgia Gobbi, Vicent Gilabert Maño, mario alfonso arsuaga, Andrea Castiella Aguirrezabala, Francescopaolo Sica, Orlando Ávila García
PT-GS: Prompt Tuning based Generalizable 3D Synthesis towards Real-Time Cross-Domain Adaptation Qingyuan Hou, Chunshu Wu, Sushant Kondguli, Tong Geng, Michael Huang
Overview of Intra prediction and intra mode coding in AV2 Leo Zhao, Tianqi Liu, Shan Liu, Jayasingam Adhuran, Qingyang Zhou, Jianle Chen, Cheng Chen, Raul Blazquez, Yixin Du, Mei Guo, Xin Zhao, Van Luong Pham, Mariana Afonso
WiSDet: A Windshield-Guided Few-Shot Framework for Detecting Car Stickers in the Wild Ma. Ysabel Bondoc, Jonaviene Capunitan, Matthew James Villarica, Arren Matthew Antioquia
Residual and Entropy Coding in AV2 Alican Nalci, Hilmi E. Egilmez, Madhu Peringassery Krishnan, Joe Young, Joel Sole, Xiaoqing Zhu, Zhijun Lei, Kruthika Koratti Sivakumar, Qingyang Zhou, Minhua Zhou
Beyond Average FPS: Assessing ACR-HR vs. DCR for Frame Drop Severity and a Novel No-Reference Metric Suriya Prakash Jambunathan, Andreas Pastor, Xujin Zhang, Neil Birkbeck
Is Haar Enough? Exploring Symlets and Coiflets for Wavelet Convolution Layers Md Rifat Ur Rahman
CoreView: Compact Yet Complete Video Representation Susim Roy, Arjun Ramesh Kaushik, Nalini Ratha, Venu Govindaraju
Caption-Guided Graph-Structured Action Segmentation for Weakly Supervised egocentric Dense Video Captioning Takuya Kobayashi, Tatsuya Sasaki, Yoshiki Ito, Takayuki Akiyama
Understanding Domain-Shift Immunity in Deep Deformable Registration mingzhen shao, Sarang Joshi
Action Recognition in Virtual Reality: Real-Time Detection of Sports Gestures using Shallow Learning for Image-Encoded Kinematics Jaime Gallego, David Bernal-Casas
Physics-constrained Diffusion Attack Against SAR Target Recognition Yanjing Ma, Jifang Pei, Weibo Huo, WENJING WANG, Yin Zhang, Yulin Huang, Jianyu Yang
Establishing Robust Retinal Eye Tracking: A Weakly Supervised Algorithmic Framework Bo Wen, Dillon Lohr, Yatong An, Pushkar Anand, Alexander Fix, Ruobing Qian, Catherine Fromm, Yimin Ding, Truong Nguyen, Mohamed El-Haddad, Francesco La Rocca
DYNAMIC RESOLUTION SWITCHING FOR LIVE STREAMING Xin Xiong, Yixu Chen, Hai Wei, Yongjun Wu, Sriram Sethuraman
Compressing Feed-forward 3D Gaussian Splatting via Feature Sorting Xinrui Ju, Shanzhi Yin, Xinju Wu, Bolin Chen, Ru-Ling Liao, Shiqi Wang, Yan Ye
Learning Across Content-Disparate Modalities: Cross-Modality and Semantic Guided Keypoint Matching for Optical-SAR Alignment Yu Wang, yepeng liu, Zaiwang Gu, Shengkai Chen, Wee Siong Ng, Ha Linh Trinh, Hieu Kieu, Wing Keung, Adrian LAW, Jun Cheng
Pseudo-label Induced Subspace Representation Learning for Robust Out-of-Distribution Detection Tarhib Al Azad, Faizul Rakib Sayem, Shahana Ibrahim
Large Vision–Language Models with Object Structure Alignment for Image Matching Nguyen Xuan Nam, Hidetomo Sakaino
End-to-End Learning of Metalens-Based Compressive Sensing and Differentiable Coding for Hyperspectral Image Transmission Takayuki Sasaki, Yoko Sogabe, Kazuya Hayase, Masaki Kitahara, Yukihiro Bandoh
LPConv: Laplacian Pyramid Convolutions for Parameter-Efficient Receptive Field Expansion Naoki Nishiya, Akira Kubota
A Rank-Based Wasserstein Distance for Comparing the Contrastive Power of Post-Hoc XAI Techniques Tamara Lenhard, Nicole Wagner, Horst Zisgen
Manifold-Guided Unified Learning for Partial-Label Domain Adaptation Yifan Pan, Guibo Luo, Yuesheng Zhu
Beyond Detection: Analyzing and Classifying Global and Local Memorization in Diffusion Models Jiyoon Kim, Junha Park, Jaehui Hwang, Jong-Seok Lee
Coarse-to-Fine: Progressive Image Compression for Semantically Hierarchical Classification Jungwoo Kim, Jun-Hyuk Kim, Jong-Seok Lee
R-OVAR: Robust Open-Vocabulary Action Recognition for Practicality Chen Ju, Xu Chen, shuai xiao
An Environment-Adaptive Camouflage Pattern Generator Ciheng Wu, Min Liu, Yinghui Gao, Chule Yang, Zhaoyuan Wu, Naiyang Guan
ACTION DIFFERENCE IDENTIFICATION VIA MULTI-VIEW RELIABILITY RANKING Kotone Mutsuna, Ryota Goka, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
OUT-OF-DISTRIBUTION DETECTION VIA UNCERTAINTY DISTINCTION WITH DIRICHLET GAUSSIAN PROCESS Ryusei Mikami, Koshi Watanabe, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
TCSEG: TOPOLOGY-CONSISTENT SEMANTIC SEGMENTATION OF CORONARY ARTERIES USING INVASIVE ANGIOGRAPHY Zhenchang Liu, Tao Wan, jiexu cui, Lei Cao, Zengchang Qin
REMOTE SENSING CHANGE DETECTION WITH CROSS MLSTM Elman Ghazaei, Erchan Aptoula
CLOE: A CONFIDENCE-BASED LOCAL-TO-GLOBAL ESTIMATION FRAMEWORK FOR MULTISPECTRAL ILLUMINANT RECOVERY Matteo Kolyszko, Alessio Mognato, Marco Buzzelli, Simone Bianco, Raimondo Schettini
Swin-Control-LDM: Structure-Preserving 3D Cross-Modality MRI Synthesis via Disentangled Global-Local Modulation Lifang Zhou, Xiaoqing Li, Jun Hu
3D Semantic Gaussians Compression for Occupancy Prediction Yi-Chen Chiu, Jui-Chiu Chiang
t-APML: A Motion-Gated Loss for Dynamic 3D Point Cloud Generation Tasks Sasan Sharifipour, Constantino Alvarez Casado, Manuel Lage Cañellas, Daniel Herrera Castro, Miguel Bordallo
Manifold Optimization on the Magnitude Torus for Fourier Phase Retrieval Wen Perng, Po-Hung Cheng, Homer Chen
TriFIRNet: A Tri-Stage Frequency-Domain Interactive Restoration Network for Adaptive Low-Light Image Enhancement Yaxing Zhang, Pang Jia, Hua Li, Fei Zhou, Wenhui Wu
HTGBNet: A Hybrid Transformer Graph Network with Boundary Awareness for Brain Image Segmentation Aiman Solyman, Ahmed Elazab, Mohamed Rahouti, Ali Alfatemi, Zeinab Mahmoud
SSM-UNet: Structure-Aware Cross-Line Laser Detection for Robust Underwater 3D Reconstruction heyang gao, Yuki Nishida, Takafumi Iwaguchi, Hiroshi Kawasaki
YOLOv8-Leaf-Pose: Keypoint-Driven Localization of Weed Apical Meristem for Laser Weeding Haoyue Han, Ruixin Wei, Fan Wu, Yuxing Han
HYBIC: A HYBRID BINARY-REAL GRID AND CONTEXTUAL MODELING FRAMEWORK FOR NERF COMPRESSIOM Pu-Hsueh Yen, You-Cheng Chen, Dian-Xuan Yang, Jui-Chiu Chiang
Cell Phantom Video Generation in Elliptical Fourier Descriptor Domain Francesco Benedetto, Roberto Basla, Luca Magri, Giacomo Boracchi
Hardware-Accelerated Implementation of Multi-Resolution Motion Estimation for HEVC Akseli Epäilys, Jesse Smedberg, Panu Sjövall, Alexandre Mercat, Jarno Vanne
ROBUST WATERMARKING WITH LATENT ADAPTER ON RECTIFIED FLOW MODELS Hongfei Wu, Xiaodan Lin, Gewei Tan
Benchmarking Conventional and Learning-based Image Codecs for the Future JPEG DNA Standard Claire Couvreur, Michela Testolina, Théo Ladune, Guillaume Lorand, Pierrick Philippe, Marc Antonini
DyD-DETR: Dynamic Gated Fusion and Dual-Domain Interaction Network for Small Object Detection in UAV Imagery Xin Cong, Yuan Bai, Siyu Qiu
Target-aware training set search for HDR video dataset via shallow feature matching and motion-exposure cues Fengshan ZHAO, Qin Liu, Takeshi Ikenaga
XAI or Attention: improving performance of object detectors with XBL Alexey Zhukov, Jenny Benois-Pineau, Amira Youssef, Akka Zemmari, Mohamed Mosbah, Virginie Taillandier
Data-Parallel CUDA Implementation of the SNIC Super-pixel Algorithm Kıvanç Taş, Toygar Akgün
A$^2$SR: Any-Resolution and Any-Step Diffusion Image Super Resolution with Pure ConvNets Ruiqing Wang, Kai Zhang
SIGMA-Based RGB-Hyperspectral Fusion for Semantic Segmentation in Autonomous Driving Sai Kiran Kocherla, Srinivas Aditya Abbaraju, Adduru U G Sankararao, Duswanth reddy, Rajalakshmi P
WREN: Low Light Image Enhancement Using Retinex theory-based Double U-Net-like Structures Reina Kaneko, Junya Hara, Hiroshi Higashi, Yuichi Tanaka
HALLUCINATION MITIGATION IN LARGE VISION-LANGUAGE MODELS VIA CONTRASTIVE DECODING WITH ATTENTION ENHANCEMENT AND MASKING Chen-Yang Huang, Pin-Zhen Chen, Huei-Fang Yang
Bridging Spectral and Spatial Signatures: A Dense Bag-of-Words Approach for Multispectral and Hyperspectral Image Analysis Mihail-Gabriel Botezatu, Ioana Voica, Daniela-Iulia Calota, Mihai Datcu, Andrei Anghel
Squeeze Out Tokens from Sample for Finer-Grained Data Governance Weixiong Lin, Chen Ju, shuai xiao, Haicheng Wang, Yuheng Jiao
Unsupervised Domain Adaptation for Enhanced Radiometer Image Precipitation Estimation using Conditional Flow Matching Victor Enescu, Assaad Zeghina, Matthieu Meignin, Nicolas Viltard, Cécile Mallet
Sens-VisualNews: A Benchmark Dataset for Sensational Image Detection Andreas Goulas, Damianos Galanopoulos, Evlampios Apostolidis, Vasileios Mezaris
FLASH: Real-Time LiDAR Super-Resolution via Frequency-Aware Multi-Scale Fusion June Moh Goo, Zichao Zeng, Jan Boehm
MDSeg: Enhancing Text Segmentation via Detection-Guided Multi-Task Learning Xing Shicong, Yan Li, Yan Shu, Yaru Zhao, Binyang Li
UNmix: A dual decoder U-Net for regression-based unmixing of subcellular structures in 3D confocal images Hajar Hakkoum, Sandrine Lefranc, ayoub ouddah, magalie uyttewaal, david bouchez, martine pastuglia, Philippe Andrey
DeSal: Detail-Enhanced Network for High-Resolution Video Saliency Prediction Jiongzhi Lin, Jiankai Xu, Wenhui Wu, Fei Zhou
TSOG: A format for Temporally and Spatially Ordered Gaussians Shady Gmira, Evangelos Alexiou, Emmanouil Potetsianakis, Emmanuel Thomas
FLASH: Efficient Impact Fall Detection with Unified Hypergraph State-Space Model Youssef Mourchid
Sampling High-Dimensional Constrained Gaussian Distributions Using Circulant Gibbs Pierre Minier, Jean-François Giovannelli, François Orieux, Marcelo Pereyra
SCONE: Sketch-Guided Spatial Gating for Multi-Object 3D Scene Reconstruction Ruyi Li, Zhihan Yin
MACDet: A MisAligned Multispectral Vehicle Detection Network Based on Deformable Cross-Attention si Huang, shangping zhong, Kaizhi Chen, Wu YunBing
IMPROVING SIMILARITY-BASED KNOWLEDGE TRANSFER USING PROTOTYPE REPRESENTATIONS Dimitrios Spanos, Nikolaos Passalis, Anastsios Tefas
Anchored Reliability: Decoupling Estimation from Adaptation for Noisy Test-Time CLIP Malavika Hariprasad, Soma Biswas
Correlation-Aware Knowledge Distillation for Deep Discriminative Feature Learning in Image Retrieval Ioanna Valsamara, Ioannis Pitas
Improving Privacy-Utility Trade-off with Learnable Privacy Mechanism in Machine Learning Tasks Savas Ozkan, Sinan Mutlu, Mete Ozay
Viewpoint-Aware Bitrate Optimization for Multi-Asset 3D Scenes Tomás Borges, Yago Sánchez, Cornelius Hellge, Ricardo de Queiroz
Analysis of the Impact of Training Data Distribution for Neural Reference Frame Generation Qipu Qin, Cheolkon Jung
CAMERA PARAMETER SEARCH (CPS): A SYNTHETIC DATASET FOR SINGLE-IMAGE CAMERA CALIBRATION WITH GROUND-TRUTH INTRINSICS AND BROWN-CONRADY DISTORTION Faiz Muhammad Chaudhry, Jarno Ralli, Jerome Leudet, Fahad Sohrab, Farhad Pakdaman, Pierre Corbani, Moncef Gabbouj
HGS: Head-of-Queue Slack-Aware Generation Scheduling for Generative Real-Time Interaction Systems Bo Peng, Lianchen Jia, Chaoyang Li, Lifeng Sun
SelfVTON: Enhancing Virtual Try-On with Self-Supervised Cloth Detailing and Body Alignment Shengyi Wu, Lingxiao Lu, Xianbing Sun, Zheng Wang, Jianlou Si, Liqing Zhang, Jianfu Zhang
GaLe: memory-efficient Global Approximate and Local Exact features Alberto Ancilotto, Elisabetta Farella
HPA-Seg: Correlation-Gated Fusion and Reliability-Weighted Prototypes for Multi-Modal Remote Sensing Segmentation Hongkang Zhang, Shao-Lun Huang, Yanlong Wang, Ercan Engin Kuruoglu
HYPERNEST-TTA: HYPERBOLIC NESTED LEARNING WITH TEST-TIME ADAPTATION FOR DIABETIC RETINOPATHY ASSESSMENT Francesco Rundo, Massimo, Spata,, Andrea Calvagna, Andrea Orazio Caruso, Emiliano Tramontana, Sebastiano Battiato
Information Router for Mitigating Modality Dominance in Vision-Language Models Seulgi Kim, Mohit Prabhushankar, Ghassan AlRegib
Boosting Point Transformer Segmentation with Self-Supervised Pretrained Point Encoders for Forest Point Clouds Cosmin-Ioan Grigoruta, Mihai-Sorin Stupariu, Ileana Patru-Stupariu
HIGH-FIDELITY FUNDUS IMAGE RESTORATION BEYOND GROUND TRUTH Ozer Can Devecioglu, Serkan Kiranyaz, Uyen Phan, Ilke Adalioglu, Moncef Gabbouj
Deep-Unfolded Autofocus Imaging for Distributed MIMO Radar Tsubasa Terada, Hassan Mansour, Petros Boufounos, Ryuhei Takahashi
SSDOC-DET: EFFICIENT DOCUMENT LAYOUT DETECTION VIA SELECTIVE STATE-SPACE MODELING Jing-Ming Guo, Chun-Wei Huang, Yi-Chong Zeng, Cheng-Yen Hsiao
HMS3FORMER-A Hyperspectral Image Restoration Model Based on Multi-Stage Spatial-Spectral Transformer and Endmember Attention Zheng-Yang Wu, Chia-Ming Lee, Yu-Fan Lin, Chih-Chung Hsu, Shih-Yu Chen, Li-Wei Kang
LiPS: Lightweight Panoptic Segmentation for Resource-Constrained Robotics Calvin GALAGAIN, Martyna POREBA, François GOULETTE, Cyrill STACHNISS
EWMA-CDT for adaptive temporal aggregation in online QIS filtering Francesco Scroccarello, Edoardo Peretti, Aleksi Suonsivu, Leevi Uosukainen, Lauri Salmela, Stefano Bertolasi, Ionut Schiopu, Giacomo Boracchi
CADS: Conformal Adaptive Decision System for Cost-Efficient Image Classification Mikael Turkoglu, Tim Bary, Vincent Thielens, Manon Dausort, Benoit Macq
CETNET: A CONTRAST ENHANCEMENT TWIN-BRANCH NETWORK FOR LOW-LIGHT ENHANCEMENT Yifu He, DongJin Huang, Jiantao Qu, Yiyan Fan
ATTENTION-AWARE TRANSFORMER-BASED AGGREGATION NETWORK FOR VIDEO PERIOCULAR RECOGNITION Luiz Guilherme Fonseca Carreira, Breno A Mariano, Victor H C de Melo, David Menotti, William Robson Schwartz
LAYERNORM-AWARE COMPRESSION OF VISION TRANSFORMERS VIA QUANTIZATION AND PRUNING Soumya Sharma, Archita ., Parimala Kancharla
APCSEG: ADAPTIVE PROMPT COORDINATION TOWARD ROBUST ABDOMINAL VOLUMETRIC SEGMENTATION jiexu cui, Lei Cao, Tao Wan, Zhenchang Liu, Jiankun Xu, Zengchang Qin
Bayesian Image Reconstruction With Local Linear Regressors Colas Schretter
ILLUMINATION-DECOUPLED DUAL-UNET FOR SINGLE IMAGE DEVIGNETTING Mariam Hossam, Hicham G. Elmongui, Marwan Torki
AI FOR 3D CHARACTERIZATION OF BUILDING INVENTORIES FOR VULNERABILITY AND EARTHQUAKE RISK MAPPING FROM SAR DATA Pegah Moradpour, Mahila Hosseini, Luigi Russo, Babak Memar, Paolo Ettore Gamba
FMI2P-Loc: Using Foundation Models for Large-Scale Image-to-Point Cloud Visual Localization Chin-Wei Kuo, PEI-I WU, Kuan-Wen Chen
Parallel Context Modeling for Sliding Window Attention in Neural Video Coding Alexander Kopte, Andre Kaup
Correlation-Based Spectral Fidelity-Guided Unrolled Tensor Rank Minimization for Pansharpening Dung Viet Phan, Chuong Hoang Vo, Chul Lee
Toward Semantic-Agnostic and Shape-Aware Vision-Language Segmentation Models Corentin Seutin, Mohamed Amine Ettaki, Michaël Clément, Pierrick Coupé, Rémi Giraud
Bridging 2D Efficiency and 3D Context: A Memory-Guided Framework for Knee MRI Multi-label Classification Huy Nguyen, Khang Le Minh, Cuong Nguyen
An Overview of Inter Coding Tools in AV2 Yeqing Wu, Keng-Shih Lu, Mohammed Sarwer
AGLDM: ATTRIBUTE-GUIDED ZERO-SHOT TEXT-TO-IMAGE SYNTHESIS USING DATA-EFFICIENT LATENT DIFFUSION MODEL WITH SELF-CONSISTENCY LOSS Sougata Moi, Angshuman Paul
[ICIP 2026] ADAPT: Any-codec Diffusion-based Adaptation for Image Perception-Distortion Tradeoff Yen-Kuan Ho, Feng Chu Lin, Ting-Han Lin, Huu-Tai Phung, Ching-Chun Huang, Alessandro Gnutti, Wen-Hsiao Peng
QUALITY CONSISTENCY SCORE (QCS): A SURVIVAL-BASED RELIABILITY DESCRIPTOR FOR VIDEO QUALITY ASSESSMENT Sergio Sanz-Rodríguez, Jon Frydensbjerg
Multimodal Analysis of T2-Weighted MRI and Clinical Data for Recurrence Prediction in Non–Muscle-Invasive Bladder Cancer Israa Sharaby, Ahmed Alksas
SCENE-SPECIFIC MESH-GUIDED SUPERVISION FOR MONOCULAR 3D OBJECT DETECTION Yash Patel, Ryosuke Kawamura, Mose Sakashita, Yusuke Hida, Laszlo Jeni, Koichiro Niinuma
AUTHENTICATION OF COPY DETECTION PATTERNS VIA CROSS-CAMERA DUAL-SYNTHETIC REFERENCING Ivan Oleksiyuk, Roman Chaban, Slava Voloshynovskiy
Task-Oriented Source Coding Using LDPC Codes for Compressed-Domain Image Retrieval Ahcene Aliouet, Yann Miguet, Elsa Dupraz, Aline Roumy
Faithful Grounded Visual Reasoning via Learned Proxy-Tokens Tom Hodemon, Mohamed Chaouch, Aboubacar Tuo, Angelique Loesch
ADNET: ANISOTROPIC DEFORMABLE NETWORK FOR ENHANCED BOUNDARY-AWARE POLYP SEGMENTATION Federico Urli, Luca Zaccagna, Andrea Salfinger, Francesca Incitti, Lauro Snidaro
FaSST: Fast Sparsifying Secondary Transform Darukeesan Pakiyarajah, Samuel Fernandez, Eduardo Pavez, Antonio Ortega, Debargha Mukherjee
Noisy MRI Reconstruction via MAP Estimation with an Implicit Deep-Denoiser Prior Nikola Janjusevic, Amirhossein Khalilian-Gourtani, Yao Wang, Li Feng
Beyond Frontal: A Renference Model for Joint Multi-view Blind Face Restoration Marcelo Sanchez Ortega, Lara Raad, Coloma Ballester
Unrolled neural mapping schemes based on variational representations for satellite ocean remote sensing Paul de Nailly, Ronan Fablet, Daniel Zhu, Maxime Beauchamp
RELIABLE SEMANTIC IMAGE TRANSMISSION VIA JOINT DJSCC DIFFUSION FRAMEWORK Nimesh Pollwaththage, Yasith Ganearachchi, Prabhath Samarathunga, Joseph El Gemayel, Anil Fernando
Learning from Ambiguity: Uncertainty-Weighted Consistency and Structure-Aware Contrastive Objectives for Medical Image Segmentation Maregu Assefa, Divya Velayudhan, Muzammal Naseer, Kumie Gedamu, Iyyakutti Iyappan Ganapathi, Naoufel Werghi
SPACE: Semantic Projection and Alignment of CLIP Embeddings for Domain Adaptation João Renato Ribeiro Manesco, Danilo Jodas, Douglas Rodrigues, Leandro Aparecido Passos, Joao P. Papa
You Only Step Once: A Single-Pass Zero-Order Sharpness-Aware Minimization for Sparse Training Jie Ji, Gen Li, Kaiyuan Deng, Fatemeh Afghah, Xiaolong Ma
OVERVIEW OF TRANSFORM CODING IN AV2 Madhu Peringassery Krishnan, Xin Zhao, Alican Nalci, Keng-Shih Lu, Hilmi E. Egilmez, Aki Kuusela, Urvang Joshi, Van Luong Pham, Lin Zheng, Jingning Han, Kruthika Koratti Sivakumar
NEXT2FORMER-CD: EFFICIENT REMOTE SENSING CHANGE DETECTION WITH MODERN VISION ARCHITECTURES Yufan Wang, Sokratis Makrogiannis, Chandra Kambhamettu
Hierarchical Motion Estimation and Compensation for Learning-based Dynamic Point Cloud Compression Junghyun Ahn, André F. R. Guarda, Dong Tian
An overview of screen-content coding tools in AV2 Qingyang Zhou, Van Luong Pham, Cheng Chen, Mohammed Sarwer, Yingbin Wang, Aki Kuusela, Dzung Hoang, Guichun Li, Shan Liu
HMDER-AttnNet: A Hybrid Attention-Based Deep Learning Framework for Noise-Aware Brain MRI Image Enhancement and Restoration Shankar Tiwari, Subham Pramanik
Learning Geometry-Consistent Graphs for Multi-Modal Geophysical Data Interpolation Kevin Arias, Paul Goyes, Antonio Ortega, Henry Arguello
Sparse Attention to Emotion: Efficient Facial Emotion Recognition via Token Reduction Aya Zitouni, Aicha Zenakhri, Karim Haroun, Larbi Boubchir
ViPo-MLLM: Visual-Pose Multimodal LLM for Gloss-Free Sign Language Translation Ahmed Abul Hasanaath, Bicheng Xu, Mir Rayat Imtiaz Hossain, Leonid Sigal, Hamzah Luqman
Denoising of Two-Phase Optically Sectioned Structured Illumination Reconstructions Using Encoder-Decoder Networks Allison Davis, Yezhi Shen, Xiaoyu Ji, Fengqing Maggie Zhu
Rate Distortion Optimization for Mesh Geometry Compression Qingyang Zhou, Pranav Kadam, Shan Liu, C.-C. Jay Kuo
RATE-DISTORTION-COMPLEXITY ANALYSIS OF PARAMETRIC VIDEO CODECS Ricardo de Queiroz, Diogo Garcia, Yi-Hsin Chen, Ruhan Conceição, Luciano Agostini, Wen-Hsiao Peng
MAE-UNETR++: Masked Autoencoder Pretraining for 3-D Lung Nodule Segmentation Vinayak Savant, Jianhua Xuan
RAW Image Compression with ISP Priors and Side Information Zixu Chen, Yuqi Li, Li Li, Xiangji Wu, Dong Liu
F2-OWOD: Frequency-Domain Feature Decoupling with Foundation Models for Open-World Object Detection Weilong Zhu, Hualei Shen
ROBUST PRIOR-GUIDED SEGMENTATION FOR EDITABLE 3D GAUSSIAN SPLATTING Raushan Joshi, Jean-Yves Guillemaut
RPO: Training-Free Flow Matching Refinement via Regional Preference Optimization Dejiao Xue, Yiwei Tang, He Wang, Longquan Dai
Spatial-Frequency Cooperative Fusion Network for Multimodal Medical Image Fusion Guanghui Yue, Wentao Li, Siqi Xiao, Cheng Zhao, Tianfu Wang, Tianwei Zhou
Uncertainty-Guided Hybrid CNN-Transformer Architecture for Aircraft Surface Defect Detection Victor Wu, Jichi Ge, Jieling Gong, Eric Saczuk, Michal Aibin
Pixel to Geocoordinate Mapping in Oblique and Nadir UAV Imagery Michal Aibin, Suchang Cao, Victor Wu, Zhiyuan Yang
RADMI: LATENT INFORMATION AGGREGATION AS A PROXY FOR MODEL UNCERTAINTY William Stevens, Mohit Prabhushankar, Ghassan AlRegib
ZERO-SHOT MEMORABILITY CONTROL IN DIFFUSION MODELS Ren Togo, Ryo Shichida, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
Lossless Image Coding Using Context-driven Neural Distribution Estimation Victor Fabre Figueiredo, Lucas Lopes, Ricardo de Queiroz, Philip Chou
Anatomical Region Powered Laryngoscopic Report System via Vision-Language Model Kaiwen Xiong, Yi Liu, Jiayue Xiao, Ruixin Li, Sisi Zheng, Binbin Wang, Ting Xiang, Feng Wang, Xiaomao Fan, Dan Lu, Yumeng Liu
DeSD: A Depth-Aware Diffusion Framework for Small Object Detection in Grassland Rat Holes Lei Xu, Ru Li
Learning Perceptual Representations for Gaming NR-VQA with Multi-Task FR Signals Yu-Chih Chen, Michael Wang, Chieh-Dun Wen, Kai-Siang Ma, Avinab Saha, Li-Heng Chen, Alan Bovik
X-SSLext: Semantic Prototype Self-Distillation for Proposal-Based X-ray Threat Representation Yonathan Michael, Mohamad Alansari, Andreas Henschel, Naoufel Werghi
Low-Light Image Enhancement with Structural Contrast Prior, LAB Noise Modeling and Multi-Path Image Fusion Xinchen Ma, Kailiang Ye, Zheng Lu
MG-NET: A NEW MULTI-AXIAL GUIDANCE NETWORK FOR ABDOMINAL MULTI-ORGAN SEGMENTAION YiYang Chen, Qian Huang, Yulin Chen, Hexuan Hu, Ziyang Yin, Meng Geng
Accelerating Learned SAR Image Compression via Selective Channel-Group Encoding Bypass Zach Button, Paras Maharjan, Zhu Li
Perceptually Optimized LOP In-Loop Filter for VVC Based on Multi-Scale Head Alignment Shaochong Wu, Cheolkon Jung
Learning Dual-Attribute Prompts with Progressive Tuning for AI-Generated Image Quality Assessment Chenyang Zhang, Pengyu Wang, Yiping Duan, Xiaoming Tao
Effective Degree-wise Scalability of Spherical Harmonic Coefficients for 3D Gaussian Splatting Compression Jianfeng Xu, Ryosuke Watanabe, Keisuke Nonaka
Efficient Coreset Generation for Chest X-ray Imaging Using Compressed Sensing Pradyumna Pradhan, Soham Mukherjee, Ramunaidu Randhi, Pradip Sasmal
A Stable Neural Statistical Dependence Estimator for Autoencoder Feature Analysis Bo Hu, José C. Principe
COCO-Inpaint: A Benchmark for Detecting and Localizing Inpainting-Based Image Manipulations Haozhen Yan, Yan Hong, Jiahui Zhan, Suning Lang, Yikun Ji, Yujie Gao, Huijia Zhu, Jun Lan, Jianfu Zhang
CLOUD-ROBUST SPATIOTEMPORAL FUSION OF SATELLITE IMAGES: A CONSTRAINED CONVEX OPTIMIZATION APPROACH Ryosuke Isono, Shunsuke Ono, Antonio Ortega
HDRDCL: AN HDR OBJECT DETECTION AND SEGMENTATION DATASET FOR EVALUATION IN CHALLENGING LIGHTING CONDITIONS Juan Merlos, Andre Harrison, Darius Jefferson, Velibor Adzic, Hari Kalva
Multimodal Attention Framework for Context-Aware and Semantically Rich Image Captioning Nasser Gawfan, Waseem Ullah, Latif U. Khan, Mohsen Guizani
PAND: Prompt-Aware Neighborhood Distillation for Lightweight Fine-Grained Visual Classification Qiuming Luo, Yuebing Li, Feng Li, Chang Kong
PhysUNeXt: Physics-aware Lightweight ConvNeXt-inspired U-Net for Hyperspectral Image Reconstruction Xian-Hua Han, Jian Wang
Uncertainty-Aware Knowledge Distillation for Semantic Segmentation in Autonomous Driving Armaghan Butt, Qing Tian
Secure Graph Filtering based on Graph Fourier Transform in Encrypted Domain Yukihiro Bandoh
An Attention-Enhanced Network with Joint Dehazing and Retinex-Based Enhancement for Underwater Images Sahana Ray, Bibhabasu Debnath, Sanjay Ghosh
FreeInstance: Training-Free Instance-level Customization fengming liu, Tat-Jen Cham
PARAMETER-EFFICIENT FLEXIBLE EXPANSION AND MERGING WITH DUAL-STAGE MODULE RETRIEVAL FOR TASK-FREE ONLINE CONTINUAL LEARNING Pin-Zhen Chen, Huei-Fang Yang
FEW-SHOT LEARNING OF UNCONDITIONAL LATENT DIFFUSION MODELS BASED ON DOMAIN ADAPTATION AND DOMAIN-INDEPENDENT LATENT SPACE Katsumi Yamada, Kazuaki Nakamura
Scene-Aware Physics-Informed Neural Networks for Adaptive Car-Following Modeling Hengyu Zhang, Chonghao Gao, Xin Yang, Xuyang Zhu, Yu Liao, Shijie Zhou
Leveraging motion estimation for Efficient Bayer-Domain Video Convolutional Networks Haichao Wang, Jiangtao Wen, Yuxing Han
Listening without Looking: Modality Bias in Audio-Visual Captioning Yuchi Ishikawa, Toranosuke Manabe, Tatsuya Komatsu, Yoshimitsu Aoki
TeSO: Representing and Compressing 3D Point Cloud Scenes with Textured Surfel Octree Yueyu Hu, Ran Gong, Tingyu Fan, Yao Wang
DIAMOND SHAPE FILTER IN LOW COMPLEXITY NEURAL NETWORK-BASED IN-LOOP FILTERING FOR VIDEO CODING Tong Shao, Jay N. Shingala, Ajay Shyam, Ajat Suneja, Siddarth P Badya, Peng Yin
HyperICM: Hyperspectral Image Compression for Machines with Task-Agnostic Semantics from Foundation Models Jiayao Xu, Yujie Chen, Dingquan Li, Wenhan Yang
Predictive Label Consistency for Mitigating Robust Overfitting in Adversarial Training Hanqi Zhang, Ke Xu, Xinghao Jiang, Tanfeng Sun
ID-Pruner: Disentangling Importance and Diversity for Training-Free Visual Token Pruning Jie Ji, Gen Li, Fatemeh Afghah
Leveraging Vision-Language Models as Weak Annotators in Active Learning Phuong Ngoc Nguyen, Kaito Shiku, Bise Ryoma, Seiichi Uchida, Shinnosuke Matsuo
Accelerated Blur Kernel Estimation with Local Boosting and Subimage Usage KuanChung Ting, Chun-Wei Chang, Sheng-Jyh Wang, Ruey-Bing Hwang
TIME-AWARE SEMANTIC PROTOTYPES FOR WEAKLY-SUPERVISED ENDOMICROSCOPY VIDEO CLASSIFICATION Ilán Carretero, Pablo Meseguer, Irene Zammarchi, Cecilia Pugliano, Giovanni Santacroce, Bisi Bode Kolawole, Ujwala Chaudhari, Rocío del Amor, Enrico Grisan, Marietta Iacucci, Valery Naranjo
HYPERDISTILL: ENABLING TEXT-FREE INFERENCE IN HYPERGRAPH-BASED MEDICAL IMAGE SEGMENTATION VIA KNOWLEDGE DISTILLATION Afrouz Sheikholeslami, Sahar Moradizeyveh, Mohammad Hossein Ahmadi, Yuankai Qi, Amin Beheshti
Generating Topologically Sound and Geometrically Smooth Meshes Daixi Jia, Haiyue Zhang, Cui Wang
Bilateral Kernel Regularization for Few-Shot Adaptation of Large Vision-Language Models Omar Arif, Aizah Arif
CG-Track: Dual-Adaptive Temporal Enhancement and Cue-Gated Fusion for Robust Multi-Object Tracking Jian Li, Fei Gu, Qian Zhou, Jing Wu
Rotation-Equivariant Multi-Scale Convolution via Adaptive Magnitude LBP Peihong Lei, Yuying Ren, Siqi Chen, Fan Bai, Hanlin Mo
HOMOAD:LEVERAGINGHIERARCHICALHOMOGENIZATIONANDSYNERGISTIC SYNTHESIS FORINDUSTRIALANOMALYDETECTION Fang Chih-Heng, Jou Jie-Deng, Yu-Hsuan Chiu, Jison Hsu
HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification Valentina Vadori, Jean-Marie Graïc, Antonella Peruffo, Ujwala Chaudhari, Enrico Grisan
TeRIF:Region-Aware Image Fusion Conditioned on Textual Dynamics ying luo, yanyin guo, chuiyi deng, zhuoyi zhao, junwei li
Uncertainty-Aware DualU-Net: Integrating Calibration and Uncertainty Fusion from Dual Decoders for Cell Analysis David Anglada-Rotger, Ferran Marques, Montse Pardas
Gradient Loss for Spectral Reconstruction Sona Bezirganyan, Lusine Davtyan, Aram Butavyan, Varduhi Yeghiazaryan
Energy Consumption Analysis of FPGA-Accelerated 2D HEVC Encoding in a Practical V-PCC Encoder Louis Fréneau, Nhan Nguyen, Guillaume Gautier, Panu Sjövall, Maxime Pelcat, Alexandre Mercat, Jarno Vanne
Nix and Fix: Targeting 1000× Compression of 3D Gaussian Splatting with Diffusion Models Cem Eteke, Enzo Tartaglione
SURGMLPS: UNLOCKING THE POTENTIAL OF CLIP WITH AN MLP-LIKE ARCHITECTURE FOR SURGICAL PHASE RECOGNITION Hao Xie, Xutao Chen, Bonnie Law, Yuk Hee Chan, Kin-Man Lam, Kenneth K.W. Li, Tracy H.T. Lai, Victor S.C. Chu
Physics-Guided Single-Image Dehazing with Learned Transmission and Atmospheric Light Estimation Koyyada Dinesh Kumar, Sujit Kumar Sahoo
When Restoration Becomes the Reference: Reusing Full-Reference IQA in Blind Settings Aymen Sekhri, Abderrezzaq Sendjasni, Seyed Ali Amirshahi, Chaker Larabi
Efficient Remote Sensing Image Segmentation With Learnable Constrained Convolutional Enhancements Mengmeng Zhang, Hongyuan Jing, Bo Ding, Tianxu Cui
Optimized three-component quaternion coding of 3D Gaussian splats in MPEG V-PCC standard Adrian Dziembowski, Błażej Szydełko, Dawid Mieloch, Kwan-Jung Oh, Gwagnsoon Lee, Jun Young Jeong
STMGaze: Spatiotemporal Modeling with Orthogonal Mamba Scanning for Video-based Gaze Estimation Jingzhi Jiang, Sirui Zhao, Xiaohao Wang, Fangyuan Liu, Tong Xu, Enhong Chen
FOREST CANOPY HEIGHT MAPPING USING DUAL-ENCODER ATTENTION U-NET AND MULTISEASON OPTICAL–SAR FUSION Soma Satya Praveen Mutyala, Suraj Reddy Rodda, Rajashekar Gopalakrishnan
EVDI: Exposure-Aware Joint Video Deblurring and Interpolation under Unknown Exposure Haodong Fan, Yingming Li
LEARNING UNCERTAIN BOUNDARIES: INTERACTION-FUSED MULTI-DECODER CONVOLUTIONAL NEURAL NETWORKS FOR PERINEURAL INVASION DETECTION IN HISTOPATHOLOGICAL IMAGES VIJAY SANKAR BABU P, Faouzi Alaya Cheikh, Madhu S. Nair
ALADIN: Attention-based Lightweight Architecture for Drowsiness Identification Mayuk Sarkar, Swarnava Dey, Arijit Mukherjee
Multi-view Consistency and Frequency-aware Modeling for Scattering Scene Reconstruction Renrong Hu, Qianyue He, Dongyu Du, Yihui Fan, Zhiheng Li, Xin Jin
SEEING ROADS THROUGH WORDS: A LANGUAGE-GUIDED FRAMEWORK FOR RGB-T DRIVING SCENE SEGMENTATION Ruturaj Reddy, Hrishav Bakul Barua, Junn Yong Loo, Thanh Thi Nguyen, Ganesh Krishnasamy
EF-ViMGaze: Dual-Branch Eye-Face Feature Learning Based on Vision Mamba for Gaze Estimation Qinghe Li, Zhaonian Sun, Benying Tan
GATA2Floor: Graph Attention for Floor Counting in Street-View Facades Ngoc Tan Le, Tzoulio Chamiti, Eirini Papagiannopoulou, Nikos Deligiannis
CogniORPO: Complexity-Adaptive Reinforcement Learning for Transparent Image Captioning Junxin Wang, Yuchao Wang, Hongkai Zhang
SAM-Guided Unified Weakly-Supervised 3D Salient Object Detection Network Le Hui, guohang li, Chen Wang, Qi Liu, Yuchao Dai
Physics-Informed Blind Adaptive Degradation Guided Network for Unsupervised Hyperspectral and RGB Image Fusion Linxuan Huang, Song Liu, CONGXUAN ZHANG, Zhen Chen
See the past: Time-Reversed Scene Reconstruction from Thermal Traces Using Visual Language Models Kebin Contreras, Luis Toscano, Mauro Dalla Mura, Jorge Bacca
SOCIAL GROUP ACTIVITY RECOGNITION FROM STILL IMAGES USING CONDITIONAL TOKEN SEQUENCE GENERATION Shota Orihashi, Taiga Yamane, Naoki Makishima, Mana Ihori, Satoshi Suzuki, Tomohiro Tanaka, Ryo Masumura
GAUSSIAN SPLATTING WITH REFLECTIONS GUIDED BY WHAT IS SEEN Yiming Liang, Tianyu Xiao, Hiroshi Ishikawa
DeKAHT: Data-efficient Kolmogorov-Arnold Hierarchical Transformer Rajib Kumar Jha, Gurram Harshamanya Thilak, Saloni Kumari
PHYSICS-GUIDED DENOISING DIFFUSION FOR COMPRESSIVE X-RAY COMPTON BACKSCATTERING IMAGING Abdullah Alrushud, Sarah Aranibar, Edgar Eduardo Salazar, Gonzalo Arce
VISION WITHOUT IMAGES: END-TO-END COMPUTER VISION FROM SINGLE COMPRESSIVE MEASUREMENTS Fengpu Pan, Heting Gao, Jiangtao Wen, Yuxing Han
SIGHTA-AI: A TWO-STAGE ON-DEVICE VISION-LANGUAGE ARCHITECTURE FOR REAL-TIME VISUAL ASSISTANCE Junwoo Lee, Yi Fang
Stop Denoising your blurs Sasidhar Parvathireddy, Sree Rama Vamsidhar Saraswathula, Rama Krishna Sai S Gorthi
TC-UNet: Detection of Faint Star Spots Based on Time Consensus Feature Fusion Network Yuheng Wei, Lixin Zhang, Xinguo Wei
CONVOLUTIONAL KOLMOGOROV-ARNOLD NETWORKS AND CONDITIONAL RANDOM FIELDS FOR REMOTE SENSING IMAGE SEMANTIC SEGMENTATION Paola Grotti, Martina Pastorino, Gabriele Moser
RCFL: Recursive Clustered Federated Learning for Distributed Concept Drift Konark Jaishy, Saumik Bhattacharya, Prabir Kumar Biswas
FastInstShadow: A Simple Query-Based Model for Instance Shadow Detection Marin Wada, Takeru Inoue, Ryusuke Miyamoto
ENHANCING DIABETIC RETINOPATHY GRADING VIA ENTROPY-DRIVEN KNOWLEDGE DISTILLATION WITH MAMBA FUSED FREQUENCY CROSS-ATTENTION DEV RISHI VERMA, Dipankar Das, DEEPAK RANJAN NAYAK, TAPAN KUMAR GANDHI
Network Quantization in Neural Video Coding: A Comparative Study across Coding Frameworks and Temporal Buffering Strategies Huu-Tai Phung, Yu-Hsiang Lin, Chun-Hung Wu, Ruhan Conceição, Tzu-Hsiang Chou, Marcelo Porto, Luciano Agostini, Wen-Hsiao Peng
PATCH ENSEMBLES FOR ROBUST SALMON RE-IDENTIFICATION WITH WEAK TRAJECTORY LABELS Espen Uri Høgstedt, Christian Schellewald, Annette Stahl, Rudolf Mester
LEVERAGING POINT CLOUD NORMALS FOR PRACTICAL V-PCC CODING Amar Tious, Louis Fréneau, Guillaume Gautier, Toinon Vigier, Alexandre Mercat, Vincent Ricordel
Photon-Statistics-Driven Learning for Underwater Imaging Zaichang Lu, Dongyu Du, Zhiheng Li, Xin Jin
Contribution-Aware Spatial Recalibration for Training-Free Image Classification Haruhiro Takahashi, Ryuto Ishibashi, Lin Meng
Text image inpainting BY EXPLORING CONTEXTUAL SEMANTICS AND STRUCTURE PRIORS Wangchuk Tsering, Qijun Zhao
FedKPer: Tackling Generalization and Personalization in Medical Federated Learning via Knowledge Personalization Zoe Fowler, Ghassan AlRegib
KD-Ex: A Benchmark for Evaluating Explainability Transfer in Knowledge Distillation Malaika Mushtaq, Michael Madden, Ihsan Ullah
The RealDefocus Benchmark for Defocus Deblurring Tim Seizinger, Zhuyun Zhou, Radu Timofte
Graph-based feature learning for image classification Isabela Borlido Barcelos, Zenilton Patrocínio, Alexandre Falcao, Ewa KIJAK, Silvio GUIMARAES
VLIGM-MoE: VISION-LANGUAGE INDIVIDUAL GRAPH MATCHING WITH INSTRUCTION-TUNED MIXTURE-OF-EXPERTS FOR ASD PREDICTION Juliana Mantebea Danso, Enoch Opanin Gyamfi, Mylene Farias
REPA: Random-Order Embedding Predictive Autoregression for Sparse Vector Field Reconstruction Bilginer Oral, Erdem Koyuncu
PHOTOSAR: SYNTHESIZING OBJECT-LEVEL NEAR-FIELD SAR RAW MEASUREMENTS FROM A SINGLE RGB IMAGE Yuhuan Mo, Tingkai Hu, Chuandong Li, Hailing Xiong, Zhen Luo
Scale-Floor Constrained Fourier Basis Density Models for Transformer-Based Learned Image Compression Yizhi Cao, Wen Tan, Fanyang Meng, Genhong Wang, Yongsheng Liang
Height-Aware Feature-Scale Adaptive RT-DETR for UAV Maritime Object Detection Xuhang Wang, Zheng Lu
LUMEN: LOW-LIGHT UNIFIED MULTI-STAGE ENHANCEMENT NETWORK USING DEPTH-GUIDED FLASH, CLUSTERING, AND ATTENTION-BASED TRANSFORMERS Bibhabasu Debnath, Sahana Ray, Sanjay Ghosh
Diversity Sampling via Maximum Dispersion Batch Selection Nikita Kovalenko, Peter Eisert, Anna Hilsmann, Sebastian Bosse
Efficient Dense Matching for Enhanced Gaussian Splatting using AV1 Motion Vectors Julien Zouein, Vibhoothi Vibhoothi, François Pitié, Anil Kokaram
Leveraging Pretrained RGB Denoisers for Hyperspectral Image Restoration Daniele Picone, Mohamad JOUNI, Mauro Dalla Mura
Representation Compensation of SAM2 for Segmenting Objects under Transformation in Videos Marco Cocco, Matteo Dunnhofer, Christian Micheloni
Window-based Linear Attention for Unified Local-to-Global Context in Image Super-Resolution Nai-Jen Hsueh, Wen-Jiin Tsai
MODIFICATIONS TO BLOCK IMPORTANCE MAPPING AND ALIGNMENT TO GOP-BASED RPR Kenneth Andersson, Per Wennersten, Jacob Ström
A Text-Aware Layered Compression Framework for Game Videos Yanzhuo Ma, Lu Wang, Junyan Huo, Fuzheng Yang
Unified Spatio-Temporal BEV Attention for Omniscient Autonomous Driving with Multi-Sensor Fusion Firas Jendoubi, Redouane Khemmar, Romain Rossi, Madjid Haddad
USPDet3D: Hybrid Uncertainty-Aware Dynamic Spatial Pruning for Efficient 3D Small Object Detection Lin Qian, Mengyuan Ma, Lintao Xiang, Hongpei Zheng, Zhenghao Li, Hujun Yin
BUDGET-AWARE ADAPTIVE ADVERSARIAL PATCHES FOR BLACK-BOX OBJECT DETECTION Pedram Mohajeransari, Amir Salarpour, David Fernandez, Mert D. Pesé
A Preliminary Numerical Feasibility Study of Radar Tomography for the Rubble-Pile Asteroid Dimorphos Topi Pajala, Sampsa Pursiainen, Alexandra Koulouri, Christelle Eyraud
Unsupervised Defect Detection for Surgical Instruments Joseph Huang, Yichi Zhang, Xiaoyu Ji, Jingxi Yu, Wei Chen, Seunghyun Hwang, Qiang Qiu, Amy Reibman, Edward J, Delp,, Fengqing Maggie Zhu
How Sampling Strategy Affects Imbalance Mitigation in LiDAR Segmentation: A Study of Structured vs. Random Point-Based Architectures Antonis Savva, Christos Kyrkou, Theocharis Theocharides
From Universal Segmentation to Cell Quantification: A Hierarchical Image Processing Pipeline for Histological Images Letícia Bianca Oliveira, Gabriel Barbosa da Fonseca, Zenilton Patrocínio, Silvio GUIMARAES
LASOD-YOLO: A Lightweight Global Context Modeling for Aerial Small-Object Detection Zhizhang Wang, Xiangji Huang
Batch Perfect: BSS via Structured Local Covariance Yaorong Xiao, Rogers Silva, Brad Baker, Vince Calhoun, Sergey Plis
TOWARD QUALITY ASSESSMENT OF 3D GAUSSIAN SPLATTING CODING Joao Prazeres, Saeed Mahmoudpour, Stuart Perry, Manuela Pereira, Antonio M. G. Pinheiro
TAUSS: TEMPORALLY ALIGNED UNSUPERVISED 3D LIDAR SEMANTIC SEGMENTATION IN DRIVING SCENES So Minesawa, Hiroshi Ishikawa
UNetv2-Lite: Lightweight Residual Attention U-Net for Medical Image Segmentation Abhin P T, Arun Kumar Sivapuram, Madhu S. Nair, Rama Krishna Sai S Gorthi
Invertible Factorization and Prompt Tuning for Long Term Person Re-identification Lenat Thomas, Nirmala Murali, Madhu S. Nair, Deepak Mishra
Structurally Regularized Self-Supervised Graph Learning For Geochemical Mapping From Hyperspectral Images Ioana Voica, Mihail-Gabriel Botezatu, Daniela-Iulia Calota, Andrei Anghel, Mihai Datcu, Florian Bodescu, Aurora Neagoe, Virgil Alexandru Iordache
Reliability-Aware Weighted Multi-Scale Spatio-Temporal Maps for Heart Rate Monitoring Arpan Bairagi, Rakesh Dey, Siladittya Manna, Umapada Pal
Certified-Progressive Secret Image Sharing via XOR and Counting for Fast Lossless Recovery Meijuan Li, Ziwen Wei, Wang Yidong, Cui Zhe
UV-Guided Match Verification for Animal Re-identification Aleksandr Algasov, Ekaterina Nepovinnykh, Fedor Zolotarev, Tuomas Eerola, Heikki Kälviäinen, Pavel Zemcik, Charles Stewart
PhysGasFluid: Physics-Guided Gaseous Fluid Flow Reconstruction Keyi Wu, Shan Du
Multiclass Subtyping of Renal Tumors from Whole-Slide Images Using a Hybrid CNN-Transformer with Optimized Texture Features Mohamed Azam, Hossam Magdy Balaha, Ahmed Aboudessouki, Asem Ali, Moumen El-Melegy, Muhammad Idrees, Mohammed Ghazal, Ashraf Khalil, Dibson Gondim, Ayman El-Baz
MTS-CSNet: Multiscale Tensor Factorization for Deep Compressive Sensing on RGB Images Mehmet Yamac, Lei Xu, Serkan Kiranyaz, Moncef Gabbouj
Density-Adaptive LiDAR Point Cloud Compression Nuno Martins, Luis Cruz, Fernando Lopes
ESCAN: Enhanced Self-Attention-Driven Multi-Level Adaptive Complementary Fusion Network for CT-MRI Imaging Munish Daroch, Alan Saldanha, Ranjeet Ranjan Jha, Aditya Nigam
Light Field Area ReSTIR: Real-Time Depth-of-Field Guided Light Field Rendering Kamran Akbar, Robert Bregovic
CLIP-PET: High-Fidelity Low-Dose PET Reconstruction via CLIP Guided Cascaded Framework Rihui Xia, Zhuodong Chai, Yongzhou Liu, Liwen Wang, Zhe Jin, Xingbo Dong
HyQuant: A Unified Quantization Framework for Hybrid Mamba-Transformer Vision Models Jui-Chiang Wei, Bo-Yun Shi, An-Yeu Wu
PREDICT WITH UNCERTAINTY, DECIDE WITH CONFIDENCE: CONSISTENT DISTRIBUTION LEARNING FOR BONE AGE ESTIMATION Avinaash A, Bhadresh L, Parth Pandey, Umarani Jayaraman
IndoNav: A Benchmark Dataset of Indonesian Pedestrian Scenes for Assistive Navigation of Vision-Impaired People Dien Rahmawati, Son Lam Phung, Hoang Thanh Le, Yang Di, Ly Bui, Husneni Mukhtar, Abdesselam Bouzerdoum
Exploring Easy Boosts For Lidar Semantic Scene Completion Tetiana Martyniuk, Jonathan Seele, Alexandre Boulch, Gilles Puy, Renaud Marlet, Raoul de Charette
Physics-Informed Self-Supervised Despeckling of Sonar Images via Residual Modeling Swapna Pillai, Siddharth Singh Savner, Sujit Kumar Sahoo
SB-BEVFusion: Enhancing the Robustness against Sensor Malfunction and Corruptions markus essl, Marta Moscati, Mubashir Noman, Muhammad Zaigham Zaheer, Usman Naseem, Shah Nawaz, Markus Schedl
TAFA-GSGC: Group-wise Scalable Point Cloud Geometry Compression with Progressive Residual Refinement Xiumei Li, Alexander Kopte, Andre Kaup
SELF-SUPERVISED PERCEPTUALLY INTERPRETABLE MONOCULAR DEPTH ESTIMATION Zain Ul Abidin, George Dimas, Dimitris Iakovidis
Neural Watermarking: Lack of a Secret Key is still Lack of Security Jan Butora, Hussein Tarhini, Aurélien Noirault, Patrick Bas
Patch-Level Cross-Modal Learning for Multimodal Estrogen Receptor Status Classification in Breast Cancer Histopathology Mohamed Azam, Walid [email protected], Khadiga Ali, Ahmed Aboudessouki, Hossam Magdy Balaha, Moumen El-Melegy, Asem Ali, Mohammed Ghazal, Ashraf Khalil, Dibson Gondim, Ayman El-Baz
GEOMETRY MEETS GAUSSIANS IN BEV: UNCERTAINTY-AWARE LATE FUSION FOR MULTI-VIEW PEDESTRIAN DETECTION Vinicius Avena, Rodrigo S. Couto, Luis Henrique M. K. Costa, Eduardo A. B. da Silva
SEGMENTING WOUNDS IN MULTI-MODAL IMAGES USING GENERATIVE ADVERSARIAL NETWORKS Agata Wijata, Jacek Andrzejewski, Maria Bienkowska, Jakub Nalepa
SceneVGGT: VGGT-based online 3D semantic SLAM for indoor scene understanding and navigation Anna Gelencsér-Horváth, Gergely Dinya, Péter Halász, Dorka Boglárka Erős, Muhammad Muqsit Islam, Kristóf Karacs
Towards a Standard for Gaussian Splat scene Coding with V3C/V‑PCC Patrice Rondao Alface, Lukasz Kondrad, Lauri Ilola, Emre Aksu
SplitFed-CL: A Split Federated Co-Learning Framework for Medical Image Segmentation with Inaccurate Labels Zahra Hafezi, Hadi Hadizadeh, Parvaneh Saeedi
PatchCompressor: A Lightweight Region based Video Streaming Framework at the Edge Shaijal Tripathi, Amitangshu Pal, KOTESWAR RAO JERRIPOTHULA
Improving Viewpoint-Invariance and Temporal Consistency for Action Detection Yannick Porto, Renato Martins, Thomas Chalumeau, Cédric Demonceaux
Feature Space Generative Models For One-Shot Class-Incremental Learning Jack Foster, Kirill Paramonov, Mete Ozay, Umberto Michieli
INNER PART DISCOVERY BASED ON PARTIAL LABEL PROPORTIONS Guillaume PICAUD, Marc Chaumont, Gérard Subsol, Luc TEOT
BENCHMARKING ATTRIBUTE DISCRIMINATION IN INFANT-SCALE VISION-LANGUAGE MODELS James Batsell, Tsutsui Satoshi, Bihan Wen
Towards reconstructing experimental sparse-view X-ray CT data with diffusion models Nelas Jarno Thomsen, Xinyuan Wang, Felix Lucka, Ezgi Demircan-Tureyen
FAST PSF SYNTHESIS WITH DEFOCUSED AND SPHERICAL ABERRATION Nicholas Ganino, Qi Guo
PROTEIN GRAPH NEURAL NETWORKS FOR HETEROGENEOUS CRYO-EM RECONSTRUCTION Jonathan Krook, Axel Janson, Joakim Andén, Melanie Weber, Ozan Öktem
On the Possible Detectability of Image-in-Image Steganography Antoine Mallet, Patrick Bas
Moe-driven Modality-invariant Feature Learning for Visible-Infrared Person Re-Identification Yupeng Chen, Shuli Cheng, Anyu Du, Li Wang, Zirui Jiang, Mingsheng Zheng
ZERO-CLICK BRAIN TUMOR SEGMENTATION USING SEGMENT ANYTHING MODEL 2 Daniel Pasierb, Agata Wijata, Jakub Nalepa
ReDyPrompt: Residual-Guided Dynamic Prompts for Robust Anomaly Detection in Fuel Rod Cladding Surface Inspection Xinwei Lyu, Haiyong Chen, Zhaoyang Wang
Parts-Mamba: Augmenting Joint Context with Part-Level Scanning for Occluded Human Skeleton Tianyi Shen, Huijuan Xu, Nilesh Ahuja, Philip Shin, Vijaykrishnan Narayanan
Dithering Defense: Adversarial Robustness of Vision Foundation Models via Multi-Level Floyd–Steinberg Dithering Yury Belousov, Brian Pulfer, Vitaliy Kinakh, Slava Voloshynovskiy
Adapting SAM Without Labels: Uncertainty-Aware Source-Free Medical Image Segmentation Quang-Khai Bui-Tran, Thanh-Huy Nguyen, Bac LE, Min Xu
Diagnosing and Explaining Failures of Perturbation-Based Fidelity Metrics Revoti Prasad Bora, Philipp Terhörst, Raymond Veldhuis, Raghavendra Ramachandra, Kiran Raja
Texture-Aware Vision Transformers for Robust Diagnosis of Dehiscence and Fenestration in 2D CBCT Cross-Sections Hossam Magdy Balaha, Alaa Mohamed, Rahma Hussein, Reza Farimani, Toru Deguchi, Mohammed Ghazal, Ayman El-Baz
MEANINGFUL LEVEL SETS FOR SMALL SPOT DETECTION Axel Davy
Leveraging Error-Tolerance Asymmetry in Electrical Grid Automated Visual Inspection with a Semi-Supervised Annotation Pipeline Pedro Daniel Rocha, Luis Cruz, Igor Vilela, André Coelho, Fernando Lopes
SPECTRAL REFLECTANCE ESTIMATION OF FACIAL SKIN FROM A SINGLE RGB IMAGE VIA EM-BASED PHYSICAL RECONSTRUCTION Yoshihito Tanaka, Shugo Yamaguchi, Akira Kubota
Reduced-complexity Adaptive Loop Filtering via Input-dependent Graph Filters Wen-Yang Lu, Eduardo Pavez, Antonio Ortega, Roman Chernyak, Shan Liu
Topology-Prompted Spatio-Temporal TransUNet: A Geometry-Aware Framework for Consistent Dental Plaque Assessment Botao Xu, Junhao Gu, Haoyan Cui, Ning Luo, Yu Qiao
SRANK: TOWARDS SEMANTIC-AWARE RANKING-BASED EVALUATION FOR CONTINUAL LEARNING OF VISION-LANGUAGE MODELS Suvam Dey, Debarshi Brahma, Soma Biswas
Exploring Rate, Distortion and Cross-Entropy Tradeoffs with Variational Autoencoders Huy LE, Anissa Mokraoui, Pierre DUHAMEL
NON-LEARNING LOW-LIGHT STEREO VISION Jason Wang, Lucas Nguyen, Hyunseung Eom, Wei Xu, Qi Guo
DYNAMIC MODE DECOMPOSITION-BASED FMRI ANALYSIS FOR PARKINSON’S DISEASE DETECTION Yuji Lin, Yifu Huang, Xuanting Wang, Jiayue Cai, Yuheng Wang, Martin McKeown, Chunqi Chang
TRACKNETV5: ROBUST SHUTTLECOCK TRACKING VIA MOTION PROMPTS AND SPATIOTEMPORAL ATTENTIVE FUSION Run-Lin Chang, Yu-Shuen Wang, Jiun-Long Huang
Hierarchical Feedback for No-Reference Video Quality Assessment Using a Spatiotemporal Feature Pyramid Reshu bansal, Parimala Kancharla
PINCurve-S: Physics-Informed Neural Curves with Spatial Attention for Efficient Low-Light Image Enhancement Anubhav Jain, Nikhil Panwar, Vivek Kumar, Ali Reza Alaei, Parthapratim Roy
MULTI-LABEL OBJECT CLASSIFICATION IN POINT CLOUDS USING GRAPH CONVOLUTIONAL NETWORK Md. Nahid Hasan, Md Sohag Mia, Nafis Sadeq, Muhammad Abdullah Adnan
EFFICIENT AND SECURE CONVOLUTIONS ON ENCRYPTED DATA Susim Roy, Bharat Chandra Yalavarthi, Nalini Ratha
RATE-DISTORTION OPTIMIZATION FOR ENSEMBLES OF NON-REFERENCE METRICS Xin Xiong, Samuel Fernandez, Eduardo Pavez, Antonio Ortega, Neil Birkbeck, Balu Adsumilli
MARS-CLIP: Multi-Resolution and Attention Refined Zero-Shot Image Segmentation Nagito Saito, Shintaro Ito, Koichi Ito, Takafumi Aoki
Multi-Site Brain MRI Harmonization via Learned Probability Flows Saeed Moazami, Neda Jahanshad
HOW TO TRAIN YOUR GENERATIVE DIFFUSION MODEL, WHEN ALL TRAINING IMAGES ARE DEGRADED, TO MODEL A DISTRIBUTION ON HIGH-QUALITY IMAGES Subhankar Nag, Suyash Awate
Robust Semantic 3D Mapping from Monocular 360-degree Image Sequences for Intelligent Indoor Street View Jonoshin Shiino, Naomichi Asaki, Sarthak Pathak, Kazushige Yasutake, Junji Furuno, Kazunori Umeda
Perception-based Image Denoising via Generative Compression Nam Nguyen, Thinh Nguyen, Bella Bose
END-TO-END CROSS-MODAL CORRESPONDENCE LEARNING FOR MISALIGNMENT-ROBUST INFRARED-VISIBLE OBJECT DETECTION Jihun Park, Injae Lee, Joonki Paik
A DECOUPLED COARSE-TO-FINE FRAMEWORK WITH SPATIALLY-ADAPTIVE HIGH-PASS FUSION FOR POLYP SEGMENTATION Po-Yi Ke, Kuan-Hsien Liu, Tsung-Jung Liu
CID-LIE: Controlled Illumination Dataset for Low-Light Image Enhancement Felipe Oliveira, Gabrielly Rodrigues, Jade Santos, Alternei Brito, Joao Cavalcanti, José Luiz de Souza Pio
GRAPH-BASED ANALYSIS OF ATTENTIONAL FIDELITY IN BRAIN-TO-IMAGE RECONSTRUCTION Mohammad Moradi, Morteza Moradi, Marco Grassia, Giuseppe Mangioni
GeoScaffold: Implicit Geometric Scaffolding and Directional Distillation for Missing-Modality Multi-Modal MRI Segmentation Jiayang Xu
Machine Learning–Based Control of Local Warped Motion Compensation in the SVT-AV1 Encoder Khouloud Missaoui, Damak taheni, Hassene Tmar, Mohamed Ali Ben Ayed
DFSI: A LiDAR Distance-Field Safety Plug-in with Reliability-Aware Refresh for Diffusion-Based Visual Navigation Shimin Yu, Shiqi Sun, Yantao Lu, Junjie Zuo, Chenglie Du
Pois-DON: Poison Dataset-driven Activation Optimisation-based Novelty Detection Ahmed Gabr, Mahmoud Rady, Pola Qulta, Youssef Abou Eita, Youssef ElKady, Youssef Fayed, Marwan Torki
GPS-denied Drone Navigation via Cross-Domain Keypoint Matching Jun-Wei Hsieh, Hong-Kai Chen
Energy and Compression Efficiency in Large-Scale Video Streaming MOHAMMAD GHASEMPOUR, Hadi Amirpour, Christian Timmererer
The Nonlocal Heat Equation: Bridging PDE Modeling and Physics-Informed Convolutional Neural Networks Bartomeu Garau, Catalina Sbert, Joan Duran
Gaussian Surrogates for Poisson Imaging: Some Theoretical and Empirical Results Alexandra Spitzer, Lorenzo Baldassari, Valentin Debarnot, Ivan Dokmanić
XSA-MAD: Cross-modal Semantic Alignment for Morphing Attack Detection Jie Jin, Mahiro Tokumasu, Yu Makino, Masakatsu Nishigaki, Tetsushi Ohki
Multi-Task Partially Supervised Learning for Super-Resolution and Semantic Segmentation on Earth Observation data Hoàng-Ân Lê, Minh-Tan Pham, Solange Lemai-Chenevier, Daniel Greslou
BLIND X-RAY BRAGG PTYCHOGRAPHY WITH AUTOMATIC DIFFERENTIATION Tingyou Li, Jizhou Li
Few-Shot Anomaly Detection and Localization via Robustly Adaptive Feature Matching Jiangpeng Zhu, Hanxi Li, Bo Li, Shaodi You, Zehui Xie
ASSESSING MEDIA AUTHENTICITY THROUGH WATERMARKING IN THE CONTEXT OF THE JPEG TRUST STANDARD Deepayan Bhowmik, May Alotaibi, Jessie Smith, Lamyaa Aljuaid, Enes Eray Demirtas, Touradj Ebrahimi, Sabrina Caldwell, Frederik Temmermans
Latent graph encoding of multimodal neuroimaging features with generative AI architectures Ishaan Batta, Meenu Ajith, Vince Calhoun
Blind Reconstruction of Low Dose Computed Tomography with Latent Space Non-Local Filtering and Structural Consistency Constraints Angkon Deb, Celia Shahnaz
Anomaly-Aware Vision-Language Adapters for Zero-Shot Anomaly Detection Muhammad Aqeel, Maham Nazir, Uzair Khan, Marco Cristani, Francesco setti
SPATIO-TEMPORAL TENSOR RECONSTRUCTION FOR QUANTA IMAGE SENSORS VIA BINARY TENSOR DECOMPOSITION Yoshihiro Maeda, Kosuke Kurihara, Takayuki Hamamoto
FOURIER SOFT IN 2D (FS2D) REGISTRATION FOR FORWARD-LOOKING SONAR WITH QUASI-PLANAR VALIDITY ANALYSIS Arturo Gomez Chavez, Tim Hansen, Maria Saleem, Andreas Birk
ADAPTIVE ZONE MERGING: GRAPH-BASED ALGORITHM FOR HIERARCHICAL OVER-SEGMENTATION Petar Kotsev, Joao Mota, Robert Nicol, Alex Serb
Efficient Object Detection on JPEG-AI Pre-Reconstruction Latents Mahyar Gohari, Alessandro Gnutti, Fabrizio Guerrini, Nicola Adami, Riccardo Leonardi
Test Model and Optimization for MPEG Lenslet Video Coding Hao Peng, Lili Zhao, Luyuan Zhu, Xun Tang, Lei Yang, Xin Jin
BWCA-Net: Bidirectional Wavelet Cross-Attention Unfolding Network for Image Compressive Sensing Reconstruction Zhidi Yao, Jinjia Zhou
Unsupervised Nighttime Dehazing via Layer Decomposition and Fusion Tianyi Jiang, Shunli Zhang
SDMVSC: A Scalable Plug-in Framework for Deep Multi-View Subspace Clustering on Large-Scale Data Minghua Tang, Yuxuan Sun, Qingwang Wang
Text-Centric Dual-Layer Attention for Multimodal Emotion Analysis with Missing Modalities Xianxun Zhu, Imad Rida, Erik Cambria, Rui Wang, Hui Chen
PRIVACY-PRESERVING FEDERATED ACTION RECOGNITION VIA DIFFERENTIALLY PRIVATE SELECTIVE TUNING AND EFFICIENT COMMUNICATION Idris Zakariyya, Pai Chet Ng, Kaushik Bhargav, Seyed Mohammad Sheikholeslami, Konstantinos Plataniotis, Fani Deligianni
Composite Stability of Graph-Convnets for Label-Efficient Skeleton-based Recognition Hichem Sahbi
Evolution of NVENC Efficiency: A Longitudinal Analysis of HQ and UHQ Tuning Efficiency, Latency and Energy Trade-offs Kasidis Arunruangsirilert, Jiro Katto