| Title | Authors |
| Clip-level Uncertainty and Temporal-aware Active Learning for End-to-End Multi-Object Tracking | Riku Inoue, Shogo Sato, Kazuhiko Murasaki, Tomoyasu Shimada, Toshihiko Nishimura, Ryuichi Tanida |
| Collision-Resistant Single-Pass Method for Unsupervised Fine-Grained Image Hashing | Anh Kiet Duong, Petra Gomez-Krämer, Jean-Michel CAROZZA |
| Enhancing Zero-shot Personalized Image Aesthetics Assessment with Profile-aware Multimodal LLM | Chun Wang, Chenfeng Wei, Chenyang Liu, Weihong Deng |
| EMARS: Event-based Motion-Aware Correction, Deblurring and Interpolation of Rolling Shutter Images | Weixiang Hu, Bohan Huang, Shigeaki Namiki, Yuka Ogino, Takahiro Toizumi, ATSUSHI ITO, Yoshimitsu Aoki |
| Learning an Elastomer Simulator for Hand-Object Interaction | Xinguo He, Yixin Shen, Rahul Chaudhari |
| ITPLUT: Inverse Tone Mapping based on Lookup Tables with Luma and Chroma Mapping | Qiuling He, Yuanfan Huang, Zhanyu Tu, Wenhui Wu, Fei Zhou |
| RETHINKING DIFFUSION FOR 3D HUMAN POSE ESTIMATION: SPATIOTEMPORAL PATCHIFICATION AND ADAPTIVE MODULATION | Shuo Yang, Bart Jansen, Hichem Sahli, Xuan-son Nguyen, Aymeric Histace |
| LINDE: A Lightweight Neural Network for Remote Sensing Image Denoising | Rafael Pires, Daniel F. Silva Santos, Denis Silva Moretto, Yasmin Sobrinho, Pedro Henrique Crespan Ribeiro, Kelton Costa, Khan Muhammad, Joao P. Papa |
| Loss Functions Matter: A Systematic Study of Class Imbalance in Flood Forecasting | Nicolas To Van Trang, Van Linh Nguyen |
| AdaCorrection: Adaptive Offset Cache Correction for Fast and Accurate Diffusion Transformers | Dong Liu, Yanxuan Yu, Ben Lengerich, Yingnian Wu |
| M2RETINEXFORMER: MULTI-MODAL RETINEXFORMER FOR LOW-LIGHT IMAGE ENHANCEMENT | Youssef Aboelwafa, Hicham G. Elmongui, Marwan Torki |
| D^2-VR: Degradation-Robust and Distilled Video Restoration with Synergistic Optimization Strategy | Jianfeng Liang, Shaocheng Shen, Botao Xu, Qiang Hu, xiaoyun zhang |
| Learning MRI Translation with Explicit Dynamic Texture and Structure Priors | Runyu Xiao, Junze Zhu, Zhangkai Ni, Hanli Wang |
| Beyond Pixel Fidelity: Minimizing Perceptual Distortion and Color Bias in Night Photography Rendering | Furkan Kınlı |
| Noise-Aware Latent Verification for Step-Efficient Diffusion Sampling | Vishwajeet Shukla, Himanshu Baurai, Ajay Bedi |
| STEGANOGRAPHIC APPROACH BASED ON HOMOMORPHIC ENCRYPTION | Norman Hutte, William, Puech, |
| DYNAMIC DISTILLATION AND GRADIENT CONSISTENCY FOR ROBUST LONG-TAILED INCREMENTAL LEARNING | Taigo Sakai, Kazuhiro Hotta |
| 3D Gaussian Splatting for Indoor Scene Reconstruction with Photometric and Geometric Consistency Constraints | dongsheng xie |
| DON’T LAG, RAG: TRAINING-FREE ADVERSARIAL DETECTION USING RAG | Roie Kazoom, Raz Lapid, Moshe Sipper, Ofer Hadar |
| Integrating Point Cloud-Based Non-Photorealistic Semi-Transparency into Gaussian Splatting | Kento Yamazaki, Jun Minagawa, Takuya Matsuda, Kohei Okahara |
| QuatGAN: Efficient Spatio-Spectral Synthesis via Quaternion Transformers | Ashutosh Gupta, Aarsh Wankar, Siddhartha Hrishikesha Voleti, Nitant Dube, Shanmuganathan Raman |
| RST-SNN: Robust Spatial Temporal Attention for Spiking Neural Networks | Shuo Zhang, Bo Zhang, Zhiyuan Fu, Kuo Pang |
| Incremental Implicitly-Refined Classification via Class Knowledge Capacity Constrained Optimal Transport | Qianna Ye, Shaofan Wang, Yanfeng Sun, Jinghua Li, Baocai Yin |
| Defence Against Byzantine Attacks in Semi-Supervised Federated Learning | Nafisa Parvin, Sayanta Sen, Saumik Bhattacharya |
| IC-4DGS: Illumination-Compensated 4D Gaussian Splatting Under Photometric Variations | He-Bi Yang, Ming-Zhe He, Cheng-Wei Yang, Jui-Chiu Chiang, Yu-Lun Liu, Wen-Hsiao Peng |
| MULTI-SCALE LATENT PREDICTION VIA LEARNABLE ITERATED FUNCTION SYSTEMS | Kamel Belloulata, Amina BELALIA |
| Focus on the Fog: Leveraging Student Uncertainty for Guided Knowledge Distillation in Semantic Segmentation | Emil Mededovic, Fabian Gülhan, Rüveyda Yilmaz, Johannes Stegmaier |
| EasyControlEdge: A Foundation-Model Fine-Tuning for Edge Detection | Hiroki Nakamura, Hiroto Iino, Masashi Okada, Tadahiro Taniguchi |
| Making Fisher Work: Train-Time Compression for 3D Gaussian Splatting | Arun Madhav, Chandra Sekhar Seelamantula |
| SILVA-Mamba: Spatial-Integrity For Landslide Segmentation Via Vectorized HILBERT Scanning and Adaptive Mamba | Yi Tang Hsieh, Chih-Chung Hsu, Xin Li, Ming-Ching Chang, Jun-Wei Hsieh |
| M2UR: Meta-Guided Multi-Expert with Uncertainty-Aware Refinement Framework for Video Summarization | Yupeng Wu, Xiaoran Xu, Xiaoshan Yang, Changsheng Xu |
| Adapting Pre-trained Diffusion Model for Blind Image Denoising via Noise Compensation and Timestep Prediction | Kenan Zou, Qianjun Huang, Jiaqing Wang, Kai Zhang |
| Evaluating Demographic Fairness in Histopathology Foundation Models | Natalia Lourdes Pérez García de la Puente, Miguel López Pérez, Valery Naranjo |
| Video Quality Evaluation Methodology and Result of AV2 Compression Performance | Zhijun Lei, Vibhoothi Vibhoothi, Dzung Hoang, Yixin Du, Ramzi Khsib |
| Multi-User Multi-Key Image Steganography with Key Isolation | Tzu-Ti Wei, Yu-Han Tseng, Jun-Yi Lin, Yu-Chee Tseng, Jen-Jee Chen |
| Split, Skip and Play: Variance-Reduced ProxSkip for Tomography Reconstruction is Extremely Fast | Evangelos Papoutsellis, Zeljko Kereta, Kostas papafitsoros |
| SAFEGUARDED ANDERSON ACCELERATION FOR PRIMAL-DUAL HYBRID GRADIENT IN CONVEX VARIATIONAL IMAGING | Hossein Javidnia |
| A MULTIMODAL INTRINSICS-GUIDED THERMAL-AWARE FRAMEWORK FOR RGB LOW-LIGHT IMAGE ENHANCEMENT | Simone Melcarne, jean-luc DUGELAY |
| SBP-Net: Learning Thin Structure Reconstruction with Sliding-Box Projections | Ofir Gilad, andrei sharf |
| THINKING LIKE A FORENSIC EXPERT: A MULTIMODAL REASONING CHAIN FOR TRAINING-FREE IMAGE MANIPULATION LOCALIZATION | Rui Chen, Bin Liu, Changtao Miao, Xinghao Wang, Yi li, Tao Gong, Qi Chu, Nenghai Yu |
| Generative 6D pose estimation via conditional flow matching | Amir Hamza, Davide Boscaini, Weihang Li, Benjamin Busam, Fabio Poiesi |
| Evidence-Invariance for Auditable Pseudo-Label Selection under Domain Shift in Semi-Supervised Segmentation | Hongkang Zhang, Shao-Lun Huang, Ercan Engin Kuruoglu |
| Bayer Convolution for Raw Image Processing | Jaeseong Yu, Hongjae Lee, Myungjun Son, Seung-Won Jung |
| CHROMOSIS: A SHAPE-CONSTRAINED AND SPATIALLY-AWARE FRAMEWORK FOR CHROMOSOME INSTANCE SEGMENTATION | Weixiao Fang, Yixiong Liang, Shichao Kan, Jianfeng Liu |
| Cross-Modal Knowledge Transfer from RGB Latent Diffusion Model to Spectral-Spatial Joint Distribution for Spectral Reconstruction | Keli Deng, Qipeng Qian, YANG CHU, Yuntao Qian |
| T2M4AR: Text to Motion Generation for Skeleton-based Action Recognition | Jun-Sang Yoo, Hongjae Lee, Sangmin Lee, Chunfei Ma, Byeongwon Lee, Seung-Won Jung |
| Few-Shot Unseen Gestures Recognition Via Enhancing Multimodal ProtoNet with Dive Before Fly Fusion | Yongmeng Yan, Nianzu Lv, Wenchao Du, Hu Chen, Yi Zhang, Hongyu Yang |
| Enhancing Spike-driven Transformers with Multi-Scale Features and High-Rank Interactions | Ka Chen, Ziliang Ren, Hui Zhao, Qieshi Zhang, Xiangyang Gao |
| CORE-NET: CONSENSUS-BASED SELECTION AND RECIPROCAL RELIABILITY FOR MULTI-MODAL OBJECT RE-IDENTIFICATION | Xingan Ma, Jinhui Yi, Juergen Gall |
| ROI-Focused Geometry-Aware Adaptation for Accurate Small-Structure Segmentation in Medical SAM | Tianqi Wang, Jianuo Li, Chenhao Yang, Jinyi Xu, Mian Zhou, Kang Dang, Linxue Zhang |
| Rotation-Aware Dense Neural Network for Multi-Modal SAR-Optical Image Registration | Simon Bertrand, Cornelia Vacar, Lionel BOMBRUN |
| KNOWLEDGE-GUIDED MULTI-TASK LEARNING FOR ORAL CANCER CLASSIFICATION | Jérôme de Chauveron, Chenyu Zha, Youssef Assis, Pauline Le Gatt, Margaux Vinant, Géraldine Lescaille, Caroline Shaar-Chneker, Laurent Wendling, Camille Kurtz, Juliette Rochefort |
| Unequal by Design: Instance-Aware and Cluster-Differentiated Universum Construction for Multi-View Contrastive Clustering | Ghaith Chrit, Shan Du |
| PRIOR-GUIDED FLEXIBLE FINE-SCALE BRAIN PARCELLATION ON DIFFUSION MRI IN EXTREMELY LABEL-SCARCE SCENARIOS | Qingwei You, Zhonghua Wan, Jiahao Yu, Yifei He, Xiangxue Wang, Ye Wu |
| Adaptive Cross-component Prediction based on Chroma Sample Position Estimation for NGVC | Haruhisa Kato, Yoshitaka Kidani, Takeshi Chujoh |
| Preprocessor-Enhanced Image Compression for Joint Machine and Human Vision | yaqian luo, Chao Yang, Xinpeng Huang, Ping An |
| The Impact of Intrinsic Scene Cues on Perceived Color Transfer Quality | Herbert Potechius, Thomas Sikora, Sebastian Knorr |
| Sliding DCT Wiener Denoising with Lightweight Residual Refinement | Karen Eguiazarian, Alla Ghazaryan, Sergey Abrahamyan |
| REAL-TIMEASTRONOMICALIMAGEPREPROCESSINGFOREDGEDEPLOYMENT VIA PHYSICS-GUIDEDSTRIPECORRECTIONANDSTAR-PRESERVINGFILTERING | Qiankai Tong |
| A CRITIC-FREE APPROACH FOR LDR TO HDR CONVERSION | Chansoon Heo, Byeungwoo Jeon |
| SF-DIFF: JOINT SPATIAL-FREQUENCY DIFFUSION MODEL FOR PROBING HUMAN BRAIN TISSUE MICROARCHITECTURE | Shuxin Cao, Chengzhe Zhang, Peng Wang, Zhonghua Wan, Jiaolong Qin, Ye Wu |
| Towards Coherent Video Colorization: When Optical Flow Meets Image Diffusion Models | Wen Si, Yifan Li, Luyao Zhang, Shuai Yang, Jiaying Liu |
| DETECTION OF SPLICING IN DIGITAL IMAGE FORGERY | Zuzana Pitsmausová, David Svoboda |
| A Hard Negative-Aware Optimization for Multilingual Text-Based Person Search | Tung Lam Pham, Hoai Thi-Phan, Thuy-Binh Nguyen, Thanh-Hai Tran, Thi-Ngoc-Diep Do, Hong-Quan Nguyen, Thi-Lan Le |
| SCALE-ADAPTIVE FEATURE EXTRACTION FOR REAL-TIME INDUSTRIAL SURFACE DEFECT DETECTION | JO SEONGJUN, Xian Xu |
| TENS-LLM: Text-guided Neuron Segmentation using Large Language Models | Chengda Mo, qiufu li, Xinle Dai, Linlin Shen |
| GameScope: A Multi-Attribute, Multi-Codec Benchmark Dataset for Gaming Video Quality Assessment | Rajesh Sureddi, shreshth saini, Avinab Saha, Alan Bovik |
| SYNERGY BETWEEN TRAJECTORIES AND HUMAN POSE FOR SOCCER | Marc Peral, Guillem Capellera, Luis Ferraz, Antonio Rubio, Stella Grasshof, Dan Witzner Hansen, Antonio Agudo |
| Breaking Camera Frame-Rate Limits: A Multi-View Dataset and Baseline for High-Frequency 3D Pose Reconstruction | Yuxuan Liu, Zixuan Wang, Junliang Xing, Haizhou Ai |
| Transfer Anyone: High-Fidelity Human Transfer on Motion Video via Diffusion-Based Reconstruction | Haocheng Tang, Ruoke Yan, Xuanyi Liu, Xiaolong Zhang, Bin Zhao, Siwei Ma, Chuanmin Jia |
| Few-shot Source-Free Domain Adaptation for Surface Defect Detection | Qianyu Zhou |
| Individual Prompt Tuning: A Single-Model Multi-User Framework for Personalized Image Aesthetic Assessment | Jiaqi Shi, Xinying Yang, Zhang xiaodan, Zhenxing Niu, Fei Gao |
| MULTI-SCALE LARGE KERNEL ATTENTION FOR SINGLE-IMAGE DERAINING | Congcong Zeng, Dan Xu, Yinghui Zhu, Jiangang Pan, Kangjian He, Hongzhen Shi |
| DAT: DUAL ATTENTION TRANSFER TO BRIDGE THE SEMANTIC GAP FROM VISION FOUNDATION MODELS TO CNNS | Yingbin Wang, Jielei Wang, Qianxin Xia, Xuewan He, Zihan Cheng, Guoming Lu |
| Pathological Image Diagnosis under Label Noise Conditions Using Bias-Aware Adaptive Knowledge Distillation | Masato Watanabe, Wonjik Kim, Kazuki Uehara, Shuta Tsuchio, Hirokazu Nosato, Hidenori Sakanashi |
| Frequency-Adaptive Depth-Haze Consensus with Semantic Priors for Single Image Dehazing | Ahmed Sakr, Hicham G. Elmongui, Marwan Torki |
| Robust Knowledge Distillation Powered Lightweight Semantic Communication Method for Remote Sensing Image | Zhongqiang Zhang, Zeyang Meng, Guangming Shi, Fanyang Meng, Ye Wang, Shuhang Zhang, Lin Mei |
| ATTRIBUTE-ENHANCED PROMPT LEARNING FOR ZERO-SHOT CROSS-MODAL RETRIEVAL | Jiyan Wang, Yuanbo Zhu, Ge Song, Wanqi Yang |
| Screen-Shooting Resilient Watermarking based on Long-Range Modeling and High-Frequency Enhancement | Jun-Zhuo Zou, NanRun Zhou, Jane Wang, Zhihua Xia, Xiangui Kang |
| Event-Based Batting Impact Estimation | Ryotaro Ishida, Wataru Ikeda, Ryosei Hara, Akemi Kobayashi, Toshitaka Kimura, Mariko Isogawa |
| Contextual Copy-Paste Sample Augmentation for Multi-Class Remote Sensing Object Detection | Xue Zhang, Yanxia Wu, Dan Lin, Ruoyu Wang, Guoyin Zhang, Nebojsa Bacanin |
| A Video Semantic Coding Framework Using Shared Prior Knowledge and Latent Feature Residuals | lyx liuYuxiang, Yiping Duan, Qiyuan Du, Ning Ge, Xiaoming Tao, Guangyi Liu |
| PHOTOMETRIC STEREO PRIOR BOOSTED SPARSE MULTI-VIEW STEREO | jilong zhang, songyun yang, Yufei Han, Zhanyu Ma, Heng Guo |
| Deep Structure-Texture Guided Image Compression via Cross-Scale Interaction and Dual-Domain Enhancement | Lijun Zhao, Jiaxin Wang, Kunkun Tu, Kailong Cao, Jinjing Zhang |
| Stable-NAE: Stabilizing Natural Adversarial Example Generation Using Adaptive Control and Momentum | Hui Kuurila-Zhang, Haoyu Chen, Guoying Zhao |
| IDAG-Edit: Multi-Object Video Editing via Instance-Decoupled Attention and Guidance | Yuan-Zhih Lin, Thang Nguyen Huu, Huu-Phu Do, Hong-Han Shuai, Ching-Chun Huang |
| LEARNING INTERPRETABLE INTERIOR STYLE SEMANTICS VIA LARGE MULTIMODAL MODEL REPRESENTATIONS | Junya Yamamori, Ren Togo, Teruhisa Yamashiro, Takahiro Ogawa, Miki Haseyama |
| A TWO-STAGE IMAGE CROPPING METHOD BASED ON COMPOSITIONAL CONSISTENCY | Ran Shi, Lu Feng, Penghao Wang, Tong Qiao |
| Optimal Neural Architecture Search for Kolmogorov-Arnold Network-based Image Classification | Anurag Dutta, Sweta Dey, Rajat Subhra Chakraborty |
| U^2Mamba: A Two-level Nested U-structure Mamba for Salient Object Detection | Junhui Li, Jialu Li, Youshan Zhang |
| LoREnc: Low-Rank Encryption for Securing Foundation Models and LoRA Adapters | Beomjin Ahn, Jungmin Kwon, Chanyong Jung, Jaewook Chung |
| PROMPT-FREE AND EFFICIENT SAM2 ADAPTATION FOR BIOMEDICAL SEMANTIC SEGMENTATION VIA DUAL ADAPTERS | Hinako Mitsuoka, Kazuhiro Hotta |
| Secret Geometric Deformation for 3D Object Protection | Khélian Larvet, William, Puech, |
| LTOP-Net:Lightweight Transformer Occupancy Prediction Net for Octree-Based Point Cloud Geometry Compression | 小俏 张, Anhong Wang, Tillo Tammam, Donghan Bu, Hao Jing, Jing Zhang |
| Is SAM3 Ready for Pathology Segmentation? | qiuyu kong, Shakiba Sharifi, Yiming Wang, Marco Cristani, Zanxi Ruan |
| Stream-FCGS: Fast Compression and Streaming for 4D Gaussian Splatting | Mingjia Yang, Haocheng Tang, Zheng Wang, Xueying Chang, Siwei Ma, Jiaqi Zhang |
| Multiple Scale Latents for Learned Image Compression | Jonas Brenig, Radu Timofte |
| Mixture-of-Experts-based Entropy Model for Learned Image Compression | Jonas Brenig, Radu Timofte |
| MattenIR: Efficient Image Restoration with Local Attention and Global-Aware State Space Duality | Qiwei Dong, Siyu Zhang, Weichao Wang, Wendong Mao, Zhongfeng Wang |
| Multimodal Confidence Modeling in Audio-Visual Quality Assessment | Mayesha Maliha R. Mithila, Mylene Farias |
| CDMesh: High-Fidelity Sparse-View Mesh Reconstruction with Consensus Diffusion Priors | Haoyang Wang, Liming Liu, Xinggong Zhang |
| Structure-Aware Blind Inpainting for Electron Microscopy Images | Zhicheng Wang, Jiateng Shou, Haiqun Jin, Zhiwei Xiong |
| MBHNet: Multimodal Brain Hallucination Network for Fluid Intelligence Prediction under Missing Structural Connectivity | Chong Cheng, Gang Yang, Yu Li, Xun Chen, Aiping Liu |
| Feature Optimized Dynamic Spectral Correlation Subspace Clustering for Hyperspectral Band Selection | Yingying Chu, Xiaodi Shang, Jiahua Zhang, Xudong Sun |
| ARE FACIAL ACTION UNITS DISCRIMINATIVE FEATURES TO DETECT DEEPFAKES? | Paul Chaurand, Ewa KIJAK |
| DYNAMIC CROSS-MODAL COMPRESSION AND CYCLIC FUSION FOR MULTI-SPECTRAL VEHICLE RE-IDENTIFICATION UNDER SEVERE FLARE CONDITIONS | Zhongzheng Liu, di wu |
| YawDD+: Frame-level Annotations for Accurate Yawn Recognition on Edge Platforms | Ahmed Mujtaba, Gleb Radchenko, Marc Masana, Radu Prodan |
| SIMI: Self-information Mining Network for Low-light Image Enhancement | Xuanshuo Fu, Lei Kang, Javier Vazquez-Corral |
| Towards Multi-Modal Forgery Representation Learning for AI-Generated Video Detection and Localization | Dat Le, Khoa Nguyen, Xin Wang, Shu Hu |
| RING-SHAPED PEPS TENSOR NETWORK DECOMPOSITION FOR HIGH DIMENSIONAL DATA IMPUTATION | Rongfeng Huang |
| DINO-Detector: Leveraging Pre-trained DINO Features for One-Shot 3D Craniofacial Landmark Localization | Kaichen Nie, Tianmin Xu, Yuru Pei |
| Controllable Medical Anomaly Synthesis via Image Editing | Yuxin Yang, Haimiao Zhang, Ligen Shi, Di He, Chang Liu, Jun Qiu |
| SynTeX: Data-Efficient LaTeX OCR via Synthetic Pretraining and Limited Fine-Tuning | Yuhan Xu, Yijun Zhao, Renqing Luo, Gary Weiss |
| Scene-Action Prompt Fusion for Coherent Text-to-Video Storytelling | Taewon Kang, Divya Kothandaraman, Ming C. Lin |
| Improving color fidelity on color E-Paper displays using curve-based transforms | Dounia Hammou |
| GRACE:Estimating Geometry-level 3D Human-Scene Contact | Chengfeng Wang, Wei Zhai, Yuhang Yang, Yang Cao, Zheng-Jun Zha |
| AGREEMENT-DRIVEN MULTI-VIEW 3D RECONSTRUCTION FOR LIVE CATTLE WEIGHT ESTIMATION | Rabin Dulal, Wenfeng Jia, Lihong Zheng, Jane Quinn |
| Retrieval-Driven Knowledge Injection for Context-Aware Video Captioning | Karina Abubakirova, Waseem Ullah, Mohsen Guizani |
| Model-Aware Rate–Distortion Limits for Task–Oriented Source Coding | Andriy Enttsel, Vincent Corlay |
| DepthFix3D: Depth-Guided Diffusion for Artifact Removal in 3D Gaussian Splatting | Haoshuai Fu, Junlin Hao, Peiheng Wang, Haoyang Wang, Xinggong Zhang |
| Semantic-conditioned latent diffusion for low-field brain MRI enhancement | Dong Zhang, Jiaxun Gao, Caohui Duan, Xin Lou, Jane Wang |
| Panoptic3D: Leveraging 3D Pseudo Supervision for Panoptic Occupancy Prediction | Dian Jia, Pei Yu, Xiaoqian Ruan, Hyeonjeong Park, Wei Tang |
| Efficient Unsupervised Metric Learning with UMAP-Based Pseudo-Labeling | Dhanunjaya Varma Devalraju, Chandra Sekhar |
| Respiration modulates pathological brain cardiovascular pulsation propagation in Alzheimer’s disease | Youssef Hosni |
| STQFORMER: SPATIO-TEMPORAL QUATERNION TRANSFORMER FOR VIDEO FRAME DENOISING | Aoqing Jin, Shiming Zhang, Shuihua Wang |
| TriGaze: Camera-Guided 3D Representations for Robust In-Vehicle Gaze Estimation | Cao Boxiang, Ming Cao, Gu Qingfeng, Pengfei Huang, Chi Jiannan, Liu Jiahui |
| Seeing Through the Glare: Robust Nighttime Stereo Depth Estimation via Physics-Guided Synthesis | Yuanfan Guo, Zhaolin Xiao, Haonan Su, Qiyuan Zhang |
| Context and Pixel Aware Large Language Model for Video Quality Assessment | Wen Wen, Yaohong Wu, Yue Sheng, Neil Birkbeck, Balu Adsumilli, Yilin Wang |
| Facial Feature-guided Adaptation for Talking Head Video Compression | Riku Takahashi, Ryugo Morita, Jinjia Zhou |
| AdaFusion: Adaptive Degradation-Aware Infrared and Visible Image Fusion with Cross-Modal Mixture of Experts | Lihao Lai, Jiangtao Nie, Lei Zhang, Xiaoguang Guo, Sen Peng, Wei Wei |
| Multi-Light Relightable Gaussian Splatting with Phong Reflectance from in-the-wild images | Duy Khanh Ngo, Huu-Phu Do, Ching-Chun Huang |
| Label Noise Detection via Loss Dynamics and Predictive Stability: A KL-Divergence and Statistical Feature Guided Approach | Zhipeng Zhang, Wenting Ma |
| Map-Mono-Ego: Map-Grounded Global Human Pose Estimation from Monocular Egocentric Video | Hiroyuki Deguchi, Ryosuke Hori, Kotaro Amaya, Tsubasa Maruyama, Mitsunori Tada, Hideo Saito |
| REGION-OF-INTEREST AND UPSAMPLING-ENHANCED POINT CLOUD TRANSMISSION FOR 3D MACHINE VISION | Ao Luo, Weishuai Song, Diego Fujii, Keisuke Nonaka, Linxin Song, Heming Sun, Xuelian Cheng, Jiro Katto |
| TEXT-PILOT: INTELLIGENT VISUAL TEXT PLANNING AND MANIPULATION VIA MULTI-MODAL LLM AS AGENT | Yuan Kang Kuo, Quang-Thang Le, Ngoc-Phu Doan, Ching-Chun Huang |
| Out-of-Distribution Detection with Angular-Magnitude Likelihood and Targeted Feature Refinement | Atik Garg, Yu-Shuen Wang |
| SplatShield: Adversarial Protection for 3D Gaussian Splatting Against Instruction-Guided Editing | Sejin Oh, Suhyeon Ha, Joonsung Jeon, Sung-Eui Yoon |
| UNSUPERVISED DATA-EFFICIENT CROSS-MODAL RETRIEVAL WITH GLOBAL-NEIGHBORHOOD ALIGNMENT HASHING | Runhao Li, Xiaoxu Ma, Zhenyu Weng, Yue Zhang, Guibo Luo, Huiping Zhuang, Zhiping Lin, Yap-Peng Tan |
| Fractional Fourier Near-Field Ptychography | Haoyuan Liu, Zhiyi Zhang, Yixiao Yang, Ran Tao |
| Selective Global to Local Alignment for Vision Language Retrieval | Wei Li, Jiale Chen, Yuanpeng Wang, Jiaxun Li, Yuehai Wang |
| FRÉCHET WAVELET STYLE DISTANCE: AN INTERPRETABLE IMAGE STYLE SIMILARITY METRIC | Abhijat Bharadwaj, Animesh Kumar, Deekshant Kumar, Vikram M. Gadre |
| Motion-Guided Latent Diffusion for Full-Frame Video Stabilization | Huyue Zhu, Dachun Kai, Jiaxiao Wang, Jie Chen, Zhangchi Hu, Quanquan Hu, Xiaoyan Sun |
| MetricDepth-VLM: Internalizing Metric Spatial Reasoning in VLMs via Depth Discretization and Geometry-Semantic Alignment | KO CHIHI, Hao-Chiang Shao, Chih-Tsung Shen |
| Few-Shot Domain Adaptation with Temporal References and Static Priors for Glacier Calving Front Delineation | Marcel Dreier, Nora Gourmelon, Dakota Pyles, Thorsten Seehaus, Matthias H. Braun, Andreas Maier, Vincent Christlein |
| DEEP IMAGE SEGMENTATION VIA DISCRIMINANT FEATURE LEARNING | Adam Sztamborski, Raül Pérez-Gonzalo, Antonio Agudo |
| Appearance-Routed Fusion for Egocentric Activity Recognition with Synthetic Audio and Depth | Cagri Gungor, Adriana Kovashka |
| CPDDNet: Color-Polarization Denoising and Demosaicking Network | Qihang Zhang, Yusuke Monno, Masayuki Tanaka, Masatoshi Okutomi |
| DISTILLING NOISELESS FEATURES FOR NOISE-ROBUST MONOCULAR 3D POSE ESTIMATION | Asuka Ishii, Hiroo Ikeda |
| Rate-Distortion Optimized LoRA for Efficient Post-Filtering in AV2 | Kequan Mao, Xin Yang, Urvang Joshi, Debargha Mukherjee, Dandan Ding |
| WDFG: Wavelet-Based Dual Frequency Guidance via Foundation Model Priors for Depth from Focus | Jeongho Park, Sungmin Woo, Wonjoon Lee, Inseok Jeon, Sangyoun Lee |
| SAG: Spatial-Attention-Guided Motion Customization for Text-to-Video Diffusion Models | Cheng Lei, Delong Liu, Fei Su, Zhicheng Zhao |
| FD-DIFF: FREQUENCY DECOUPLING AND DUAL-STREAM COLLABORATIVE DIFFUSION FOR 3D FACE RECONSTRUCTION AND ALIGNMENT | Xiangzheng Li, Peng Han, Xiaoli Luo, Baiying Dong |
| GLCV: A Generalized Learnable Cost Volume for Per-pixel Visual Correspondence | Sehoon Oh, Ue-Hwan Kim |
| COHESION: CONSENSUS-BASED HALLUCINATION SUBSPACE ESTIMATION FOR MULTIMODAL LARGE LANGUAGE MODELS | Wei-Han Chen, Yu-Feng Chen, Jun-Cheng Chen |
| Tuning-free Instruction-based Video Editing Via Structural Noise Initialization and Guidance | Song Wu, Xinyu Chen, Qian Wang, Liang Li, Junlan Feng, Zili Yi |
| Real-Time Image Restoration via Adaptive Decoupled Knowledge Distillation | jiahui liu, Haoran Bai, Ying Chen, sibin deng |
| Novel Low Operation Point In-Loop Filter for VVC Using Learning Rate Scheduling and SIMD Acceleration | Jiang Han, Cheolkon Jung, Qipu Qin |
| VT-JRD: TASK-AWARE VIDEO CODING FOR MACHINES USING VISION TRANSFORMER AND JUST RECOGNIZABLE DISTORTION | Sanaz Nami, Farhad Pakdaman, Moncef Gabbouj |
| DYNAMIC WEIGHT-BASED TEMPORAL AGGREGATION FOR LOW-LIGHT VIDEO ENHANCEMENT UNDER EXTREME NOISE | Ruirui Lin, Guoxi Huang, Nantheera Anantrasirichai |
| LEARNING SPATIALLY ADAPTIVE SPARSITY LEVEL MAPS FOR ARBITRARY CONVOLUTIONAL DICTIONARIES | Joshua Schulz, David Schote, Christoph Kolbitsch, Kostas papafitsoros, Andreas Kofler |
| AN OVERVIEW OF WARP PREDICTION IN AV2 | Mohammed Sarwer, Yeqing Wu, Rachel Barker, Yunqing Wang, Debargha Mukherjee, Han Gao, Jayasingam Adhuran |
| Leveraging NeRF-Rendered Images for 3D Gaussian Splatting | Mizuki Morikawa, Yuta Shimizu, Chunyu Li, Yusuke Monno, Masatoshi Okutomi |
| Person–Object Relationship Consistency Learning for Zero-Shot Spatio-Temporal Action Detection | Yasunori Babazaki, Takashi Shibata, Toru Takahashi |
| MSCDF: MULTI-SCALE CROSS-DOMAIN FUSION NETWORK FOR UNDERWATER IMAGE ENHANCEMENT | Xing Li, Yanfang Wang, Ziqi Dong |
| TDF-NET : A FREQUENCY-AWARE REPRESENTATION LEARNING GUIDED FUSION NETWORK FOR INFREAD AND VISIBLE IMAGES | Hang Xu, Hao Wang |
| INTERFERENCE-RESISTANT FINE-GRAINED CLASSIFICATION OF HIGHLY SIMILAR NASOPHARYNGEAL ENDOSCOPIC STRUCTURES | Pengcheng Wang, Hao Wang, Yiping Wang, Zhichao Zhang |
| PSCA-NET: INTEGRATING PHYSICAL TRACES AND SEMANTIC CONTEXT FOR AI-GENERATED IMAGE FORGERY DETECTION AND LOCALIZATION | Yi Zhang, Qiang Xu, Wenpeng Mu, Jianhao Fu, Tanfeng Sun, Xinghao Jiang |
| Non-uniform Structured Pruning for Efficient Diffusion-based Real-world Image Super-resolution | Le Khang Nguyen, Kevin Ho Man Cheng |
| Granulo-10k: A Large-Scale Benchmark Dataset for Multiple-View Industrial Granulometry | Pasquale Coscia, Angelo Genovese, Vincenzo Piuri, Fabio Scotti |
| HIERARCHICAL FILTER BAND SELECTION FOR MULTISPECTRAL OBJECT CLASSIFICATION | Katja Kossira, Jürgen Seiler, Andre Kaup |
| HPGN: Hybrid Priors-Guided Network for Compressed Low-Light Image Enhancement | hantang li, qiang zhu, xiandong meng, lei xiong, Shuyuan Zhu, Xiaopeng Fan |
| Color Constancy in Hyperspectral Imaging via Reduced Spectral Spaces | Gunnar Dofri Vidarsson, Liying Lu, Sabine Süsstrunk |
| EFFICIENT VARIABLE-RATE STATE-SPACE MODEL FOR IMAGE COMPRESSION WITH CHANNEL-WISE ENTROPY | Bouzid AREZKI, Anissa Mokraoui, Fangchen FENG |
| Hierarchical Prompt-Aware Zero-Shot Out-of-Distribution Detection | Marouane HADJ-ALI, Florence Alberge |
| Deep Learning-based Compressed Domain Event Data Classification | Abdelrahman Seleem, André F. R. Guarda, Nuno Rodrigues, Fernando Pereira |
| DM-QPMNET: DUAL-MODALITY FUSION NETWORK FOR CELL SEGMENTATION IN SINGLE SHOT QUANTITATIVE PHASE MICROSCOPY | Rajatsubhra Chakraborty, Anna Espinosa–Momox, Riley Haskin, Depeng Xu, Rosario Porras-Aguilar |
| MPS-RETNET: MULTI-SCALE PROTOTYPE-GUIDED SEMI-SUPERVISED LEARNING WITH QUALITY-AWARE SUPERVISION FOR RETINAL DISEASE CLASSIFICATION | Maisam Abbas, Ran-Zan Wang |
| ZERO-SHOT 3D ANOMALY DETECTION USING PRE-TRAINED MODELS | Lukun Hu, Hengyi Chen, Yiguo Lou, Zhaocheng Yang, Junwen Ji, Dan Li |
| Generalizable 3D Gaussian Splatting Guided by a Vision Foundation Model | Jie Liang, Cheolkon Jung |
| SLT: A LAPLACIAN-GUIDED TRANSFORMER FOR MULTI-SCALE SPECTRAL-CHANNEL MODELING IN HYPERSPECTRAL IMAGE CLASSIFICATION | Chi Zhang, Jungkwon Kim, Jihun Kim, Jeonghyeon Park, Kwangsun Yoo, Seok-Joo Byun |
| LOGOFLOW: VISUAL SALIENCY-AWARE ADVERSARIAL ATTACK ON LOGO-BASED PHISHING DETECTORS | Yena Cho, Heesung Jeong, Sukyeong Bang, Doowon Kim, Hyoungshick Kim |
| CA3-GS:COMPLEXITY-AWAREADAPTIVEANCHORALLOCATIONFOR3DGAUSSIAN SPLATTING | Genqiang Shi, Qiuming Liu, Changjian Zhu |
| HYBRID ISP: COMBINING SPARSE CNN WITH DENSE INTERPOLATION FOR ON-SENSOR REAL-TIME 4×4 BAYER IMAGE RECONSTRUCTION | Oren Girshkin, Tamar Dreifuss, Tal Bernstein |
| VLM-DREAMER: VLM-IMAGINED BI-DIRECTIONAL INPAINTING FOR SINGLE-IMAGE 360 SCENE GENERATION | TingWei Huang, Fu-En Yang, Min-Hung Chen, Yen-Yu Lin, Yu-Lun Liu |
| Neuro-Symbolic Video Anomaly Detection via Attribute-Based Reasoning | Sofya Filippova, Steven Korevaar, Son Hoang Dau, Trung Pham, Tam Cao, Ruwan Tennakoon |
| SPATIO-TEMPORAL BIFURCATE-FUSION SPIKE TRANSFORMER | ZeFeng Chen, Ziliang Ren, Yangyang Chen, Xiangyang Gao, Qieshi Zhang |
| The DARRL dataset: Demonstrations for Action Recognition and Robot Learning, extended with gaze data and scene graphs | Badr Tahri Joutei, Mathieu Riand, Patrick Le Callet, Alexandre Bruckert, Laurent Dollé |
| A histogram-based method to extract tag-image from film shot | Benjamin Serva, Frédéric Comby, Olivier Strauss, Loig Le Bihan, William, Puech, |
| Low-Delay Dynamic Point Cloud Attribute Compression via Cross-Coordinate Attention | Xiangzuo Liu, Ruishan Huang, Zhikai Liu, Fan Liang |
| Cross-Modal Slot Alignment for Data-Efficient Multiclass Defect Classification | selen pehlivan |
| A Mixture of Measurement Strategies Framework for Monocular Mobile Rebar Spacing Inspection | Cheng-En Li, Jue-Yu Lai, Hung-Kai Hsiao, Chang-Yuan Hsiao, Peggy Joy Lu |
| A Clinically Relevant and Interpretable Scoring Protocol for Medical Image Enhancement | Dong Zhang, Caohui Duan, Xin Lou, Jane Wang |
| Task-Adaptive Sparse Update for Efficient Continual Learning | Takuma Ishibashi, Hikari Otsuka, Junnosuke Suzuki, Daichi Fujiki, Masato Motomura |
| PGM-Net: Prior-Guided Mamba Network for Pancreas Segmentation | Chongshang Zhong, Jun Chen, Qiaoying Teng, Kai Han, Yi Liu, Zhe Liu |
| MuCALD-SplitFed: Causal-Latent Diffusion for Privacy-Preserving Multi-Task Split-Federated Medical Image Segmentation | Chamani Shiranthika, Hadi Hadizadeh, Parvaneh Saeedi |
| Egocentric Whole-Body Human Mesh Recovery with Prior-Guided Learning | Soyeon Na, Seung Young Noh, Ju Yong Chang |
| IAFE: ILLUMINATION AWARE FREQUENCY ENHANCEMENT NETWORK FOR LOW-LIGHT IMAGE DEBLURRING | Chenyuan Jiao, Xun Yang, Yaoru Sun, Xuejie Yang |
| Robust Bridge Defect Detection via Dynamic Snake Convolution and Hierarchical Feature Fusion | Mingdi Hu, SiChen Chen, Bing Yi Jing |
| Depth from Defocus via Direct Optimization | Holly Jackson, Caleb Adams, Ignacio Lopez-Francos, Benjamin Recht |
| SCIPS: SINGLE-SHOT PHOTOMETRIC STEREO FROM A SNAPSHOT COMPRESSED MULTISPECTRAL IMAGE | Yunhao Li, Yanan Hu, Xiaodong Wang, Yuze Yang, Ziyi Meng, Xin Yuan, Peidong Liu |
| REC-RL: Referring Expression Counting via Gaussian and Range-Based Reward Optimization | Hui Liu, Yunlai Teng, Kunlong Bai, Pengfei Qi, Yan Haotian, Liang Li, Junlan Feng |
| ALF: Sharpness-Aware Adaptive Layer Fusion for Training-Free Anomaly Detection | Deyu Yang, Ziliang Ren, Yu Zou, Hongchao Gao, Ying Liu |
| UniGeoDiff: Unified Geometry-Aware Diffusion for Single-Image 3D Furniture Generation | Wen Li, Zongjie Tan, Yuan Liu |
| When Simplicity Wins: Bottleneck-Aware Context Modeling For Lightweight Semantic Segmentation | Mian Muhammad Naeem Abid, Nancy Mehta, Zongwei Wu, Radu Timofte |
| VolHuMe: a High-Resolution Large Scale Dataset of Volumetric Human Meshes | Giulia Martinelli, Niccolò Bisagno, Nicola Garau, Esa Rahtu, Nicola Conci |
| Aitchison geometry on the simplex for uncertainty quantification in bayesian hyperspectral image unmixing | Hector Blondel, Lucas Drumetz, Thierry Chonavel |
| Controllable blind deblurring with diffusion models | Imane SI SALAH, Emile Cribelier, Thomas Veit, Wolf Hauser, Arthur Leclaire |
| Enhancing Sparse-View 3D Gaussian Splatting with Guidance of Normals Priors and Dense Point Initialization | Yi-Huang Hsieh, I-Chen Lin |
| SPATIAL COMPETITION FOR LOW-COMPLEXITY LEARNED IMAGE COMPRESSION | Théophile Blard, Pierrick Philippe, Théo Ladune, Xiaoran Jiang, Olivier Deforges |
| FM-OVD: Towards Fast Open-Vocabulary Object Detection with Feature-wise Modulation | Rahul Singh Maharjan, Angelo Cangelosi |
| Uncertainty-Guided Latent Diffusion Models for Faithful Super Resolution | Ren Wang, Yung-Yu Chuang |
| MACHINE LEARNING BASED AV2 ENCODER/DECODER EFFORTS | Shan Li, In Suk Chong, Stan Vitvitskyy, Raul Blazquez, Joe Young, Conor McCullough, Akshaya Purohit, Urvang Joshi |
| Repurposing Image Diffusion Models for Training-Free Music Style Transfer on Mel-Spectrograms | Heehwan Wang, Joonwoo Kwon, Sooyoung Kim, Jungwoo Seo, Shinjae Yoo, Yuewei Lin, Jiook Cha |
| Character-Centered Dialogue Generation from Scene-Level Prompts | Taewon Kang, Ming C. Lin |
| A LIGHTWEIGHT THERMAL DENOISING AND OCCLUSION-ROBUST INFRARED DETECTION MODEL FOR SUBSTATION EQUIPMENT | Junyi Wu, liqin tian, Weizheng Wang, niu yong, shi xiaoan |
| SGCLIP: SEMANTIC-GEOMETRIC FUSION FOR TRAINING-FREE OPEN-VOCABULARY SEGMENTATION | Rui Xu, Junpu Wang, Huanyu Li, Zhenduo Guo, Chunlei Li |
| YOLOv8-HRP2: An Efficient Framework for High-Resolution Surface Defect Detection on Shipping Containers | Chenchu Huang, Shan Gao, Huinan Shi, Yi Ma, YUHAN LIU, Mårten Sjöström |
| PaTSeg: A Patch-Level Text Supervision Paradigm for Mask-Free Remote Sensing Segmentation | Chi Han Chen, Wen-Huang Cheng, Ching-Chun Huang |
| CONSTRAINED DENSE CORRESPONDENCE GRAPHS FOR ROBUST STRUCTURE-FROM-MOTION TARGETING ENDOSCOPIC VIDEOS | Yu-Chun Lin, Ming Lun Han, Kuang-Chen Yen, Homer Chen |
| MR-Mono3D: Multi-Resolution Monocular 3D Mesh Reconstruction for Embodied Spatial Perception | Hongyi Huang, Jingyi Wu, Han Yu, Juncen Guo, Peng Sun, Liang Song |
| Zero-Shot Color Constancy by estimating albedo | Gabriele Canesi, Marco Buzzelli, Simone Bianco, Raimondo Schettini |
| A Dual-Branch RGB-Infrared Multimodal Framework for Real-Time Vehicle Detection in Remote Sensing | jin Zhang, Yucheng Xia, Kaize Shi, Pengfei Yuan, Xiaoge Li, Mengjiao Wang, jianbo Zheng |
| A Hybrid CNN–Swin Temporal Attention Network for Nighttime Video Anomaly Detection | Yuxuan Jiang, Yunhui Zeng, Yanlei Cui, Xin Jin |
| EMVMAMBA: A HYBRID CNN-MAMBA MODEL FOR ADDRESSING RARE CLASS CHALLENGES IN FACIAL EXPRESSION RECOGNITION | Jun Foo Kui, Lai-Kuan Wong, Yuen Peng Loh, Patrick Le Callet |
| Test-time image adaptation for semantic segmentation from noisy images via logit refinement and region-constrained activation maximization | Myungjun Son, Jaeseong Yu, Hongjae Lee, Seung-Won Jung |
| 3D Unsupervised Sparse Gravimetry Imaging Guided by a Physics-Consistent Neural Field | Ana Gabriela Mantilla Dulcey, Yeganeh Gharedaghi, Daniela Quintero Madariaga, Antonio Ortega, Henry Arguello |
| EFFICIENT HYBRID ADAPTER FOR VISIBLE-THERMAL TRACKING | He Wang |
| Beyond Visual Perception: Mitigating Multimodal Hallucination via Hybrid Preference Optimization | Kun Yang, Yuxuan Liu, Jingyi Wu, Lang Qian, Han Yu |
| Enhanced Detection of Tiny Objects in Aerial Images | Kihyun Kim, Michalis Lazarou, Tania Stathaki |
| Channel-Aware Tensor Nuclear Norm: A Self-Supervised Approach for HSI Inpainting | Yunshan Li, Qianqian Wang, Lili Yang, Wenwu Gong |
| QuadBox: Accelerating 3D Gaussian Splatting with Geometry-Aware Boxes | Xinze Li, Bohan Yang, Pengxu Chen, Yiyuan Wang, Hongcheng Luo, CHENG WENTAO, Weifeng Su |
| Estimation of instrument and noise parameters for inverse problem based on prior diffusion model | Jean-François Giovannelli |
| Towards Quantitative Deep Learning for Image Steganalysis | Shijie Zhang, Mingyang Shen, Jane Wang, Xiangui Kang, Dong Wei |
| Equiangular Prototype Alignment for Unsupervised Domain-Adaptive Medical Image Segmentation | Akash Sharma, Arunima Sarkar, Mohanasankar Sivaprakasam |
| LOCAL SOFT ALIGNMENT FOR HARD-AWARE MULTI-VIEW TEXT-TO-IMAGE PLACE RECOGNITION | Lei Ma, Wen Liu, Xuanshun Zhang |
| Alleviating Hallucination in Large Vision-Language Models via Structure-Aware Adaptive Contrastive Decoding | Shunya Shimomura, Haruhiko Murata, Kazuhiro Hotta |
| SAMba-UNet: SAM2–Mamba UNet for Cardiac MRI in Medical Robotic Perception | Guohao Huo, Ruiting Dai, Ling Shao, Hao Tang |
| Towards More Transferable Architectures for Dense Pose Estimation | Shuhei Tarashima, Norio Tagawa |
| MASAM: Zero-Shot Identity-Consistent Multi-Object Tracking and Segmentation with Memory-Augmented Segment Anything Model | Wei-Jie Mu, Cheng-Yu Ho, Shang-Hong Lai |
| DiRA-Net: Suppressing Cross-Channel Correlated Noise via Differential-Regulated Attention for Low-Light Image Enhancement | Qiyuan Zhang, Haonan Su, Zhaolin Xiao, Yuanfan Guo |
| TaiChiNet: Dual-branch Densely Connected CNN-Transformer with Color-Intensity Factorization for Low-light Image Enhancement | Yanan Hu, Mengjie Qin, Yuchao Feng, Yunhao Li, Xin Yuan |
| OVDM: A training-free open-vocabulary segmentation framework based on diffusion models | Haiyang Liu, Shuai Jia, xiaoming huang, Zhiguo Wang |
| SPIM-Fuse: Sparsely-Coded Image Fusion for Low-Light Enhancement | Sena Yagmur Sen, Gazihan Alankus, Mehmet Turkan |
| Towards Efficient Vision State Space Models via Token Merging | Jinyoung Park, Minseok Son, Changick Kim |
| LFA: Layer Feature Attention for Run-time Introspection of 2D Object Detectors in Automated Driving | Mert Keser, Alois Christian Knoll |
| CLOTH-HUGS: CLOTH AWARE HUMAN GAUSSIAN SPLATTING | Sadia Mubashshira, Nazanin Amini, Kevin Desai |
| MDE-VIO: ENHANCING VISUAL-INERTIAL ODOMETRY USING LEARNED DEPTH PRIORS | Arda ALNIAK, Sinan Kalkan, Mert Ankaralı, Afsar Saranli, Aydin Alatan |
| Invariants to Blur and Channel Mixing of Color Images | Václav Košík, Jan Flusser, Filip Sroubek |
| GAFSEG: GRADIENT-AWARE FEDERATED LEARNING FOR MEDICAL IMAGE SEGMENTATION | Sayanta Sen, Gargi Panda, Saumik Bhattacharya |
| LIFTING-BASED GEOMETRY OPTIMIZATION FOR 3D DYNAMIC MESHES IN THE V-DMC FRAMEWORK | Wenjie Zou, Xuanrui Zhang, Fuzheng Yang |
| Deep Unfolding with Hybrid Mamba-Convolutional Transformers for Hyperspectral Image Reconstruction | ZHOU XINGYU, Xian-Hua Han |
| TAE: Target-Aware Enhancer for Nighttime UAV Tracking | Yanyan Chen, Ruigang Fu, Yu Song, Ping Zhong |
| UGHM-Net: Uncertainty-Aware Dynamic Confidence and Backtracking for Visual-Semantic Hierarchical Image Classification | YANG CHU, Keli Deng, Xiaomeng Yang, Yuntao Qian |
| DASR-NET: UNSUPERVISED FINE-GRAINED ANOMALY SEGMENTATION VIA DISTRIBUTION ALIGNMENT AND SELECTIVE FEATURE RECONSTRUCTION | Xin Yuan, Huanyu Li, Junpu Wang, Miao Yu, Chunlei Li |
| COBI-CLIP: ENHANCING CLIP WITH CONVOLUTIONAL ADAPTERS AND BIDIRECTIONAL ALIGNMENT FOR ZERO-SHOT ANOMALY DETECTION | Xiangshuai Zhao, Junpu Wang, Huanyu Li, Miao Yu, Chunlei Li |
| FULL-REFERENCE POINT CLOUD QUALITY ASSESSMENT USING GRAPH NEURAL NETWORK-BASED REGRESSION | Ryosuke Watanabe, Hiromu Yoshida, Tomoaki Konno |
| THE IN-LOOP FILTERING PIPELINE IN AV2 | Debargha Mukherjee, Jianle Chen, Onur Guleryuz, Lin Zheng, In Suk Chong, Andrey Norkin, Yixin Du, Yunfei Zheng, Tianqi Liu, Khanh Quoc Dinh, Yangwoo Kim, Kwang Pyo Choi |
| Exposing and Erasing Identity in Skeleton Motion: A New Evaluation Protocol and Adversarial Anonymization Framework | Ying-Shuo Lee, Pei-Yuan Wu |
| MYST: Benchmarking Ecological and Cross-Medium Generalization in Sea Turtle Re-Identification | Wan Jun Nah, Juanita Joseph, Shier Nee Saw, Wai Lam Hoo |
| HSSP: Training-Free Visual Token Pruning with Head Selection and Spatial Constraints | Yuan-Hsi Lo, Hsi-Ren Hung, Huei-Fang Yang |
| G-MASt3R-SfM: Graph-based View Pruning and Multi-stage Optimization for Robust SfM | Toshiki Watanabe, Shintaro Ito, Natsuki Takama, Koichi Ito, Takafumi Aoki |
| Implicit Subblock Transform for Versatile Video Coding | jiazhen wang, Zhuoyuan Li, Yao Li, Jialin Li, Li Li, Dong Liu |
| Adapt2Hide: Leveraging Off-the-shelf Autoencoder for Reversible Visual Processing | Ernie Chu, I-Sheng Fang, Tai-Ming Huang, Pin-Yen Chiu, Vishal Patel, Jun-Cheng Chen |
| MICROVITV2: BEYOND THE FLOPS FOR EDGE ENERGY-FRIENDLY VISION TRANSFORMERS | Novendra Setyawan, Chi-Chia Sun, Mao-Hsiu Hsu, Wen-Kai Kuo, Jun-Wei Hsieh |
| CFE-PPAR: Compression-friendly encryption for privacy-preserving action recognition leveraging video transformers | Haiwei Lin, Shoko Imaizumi, Hitoshi Kiya |
| ChestR1: Ground-Truth Augmented Reinforcement Learning for Chest X-Ray Analysis | Yichi Cai, Ying Yu, Yubin Wang, Huimin Yu |
| VVC film grain synthesis in video coding for machines | Rudolf Kortelahti, Tero Partanen, Alexandre Mercat, Jarno Vanne, Miska M. Hannuksela, Honglei Zhang |
| Bridging the Modality Gap via CLIP-Driven RoI-Level Semantic Alignment for Infrared Object Detection | Minju Baek, Hyeongseok Oh, Joonki Paik |
| QUALITY-SMOOTHING RATE CONTROL FOR LEARNED VIDEO TRANSCODING VIA RATE-QUALITY PREDICTION | Nianxiang Fu, Daiqin Yang, Zhenan Lin, Chao Zhou |
| TIME-VARYING RPPG SIGNAL SEPARATION VIA BLOCK-SPARSE SIGNAL MODEL | Kosuke Kurihara, Yoshihiro Maeda, Daisuke Sugimura, Takayuki Hamamoto |
| IMPROVING REFERENCE PICTURE RESAMPLING FOR VVC WITH SCALE AND UPSAMPLING GUIDED DOWNSAMPLING | Junying Su, Wenzhuo Zhang, Nianxiang Fu, Daiqin Yang, Zhenan Lin, Chao Zhou |
| GALA: GAUSSIAN LAYERS, AN EFFICIENT OVERLAP-AWARE 3D GAUSSIAN SPLATTING | Matthieu Gendrin, Stephane Pateux, Théo Ladune, Xiaoran Jiang, Luce Morin |
| Balancing Stability and Plasticity in Sequentially Trained Early-Exiting Neural Networks | Alaa Zniber, Ouassim Karrakchou, Mounir Ghogho |
| From Division to Decision: Leveraging Temporal Cell-Stage Segmentation for Embryo Transferability Prediction | Yasmine HACHANI, Patrick Bouthemy, Elisa Fromont, Véronique Duranthon, Ludivine Laffont, Alline de Reis |
| Scale-Invariant Geometric Regularization for 3D Gaussian Splatting via Pearson Correlation | ZhaoYang Wang, Zonghua Yu, Huaijun Wang, Junhuai Li, Shuai Hu, Haiyan Jin |
| DECODER-DERIVED ACTIVATION MECHANISM FOR NEURAL NETWORK IN-LOOP FILTERS IN VIDEO CODING | Maria Santamaria, Done Bugdayci Sansli, Francesco Cricri |
| MedSAE: Dissecting MedCLIP Representations with Sparse Autoencoders | Riccardo Renzulli, Colas LEPOUTRE, Enrico Cassano, Marco Grangetto |
| Training-Free Stimulus Encoding for Retinal Implants with Sparse Projected Gradient Descent | Henning Konermann, Yuli Wu, Emil Mededovic, Volkmar Schulz, Peter Walter, Johannes Stegmaier |
| M-COLOR: MLLM-GUIDED DIFFUSION MODELS FOR IMAGE COLORIZATION | Frank Lin, Wen-Jiin Tsai |
| An Efficient and Accurate Registration for Structured Light based Intraoral Scanning | Xinzhou Du, Yuping Ye, Yixin Zhuang, Juan Zhao, Zhan Song |
| Improving zero-shot industrial defect detection exploiting LMMs as inverse reasoners | Maria Tzelepi, Nikolaos Dimitriou, Christos Gkogkos, Dimitrios Tzovaras |
| Context-Aware Multimodal Depression Detection via LLM-Derived PHQ-9 Personal Feature Injection | Huiyu Yang, Puneet Kumar, Xiaobai Li |
| TaperVLMs: TASK-AWARE PRUNING OF VISION-LANGUAGE MODELS USING INFORMATION BOTTLENECK PRINCIPLE | Peyman Rostami, Dan Pineau, Nassim Ali Ousalah, Anis Kacem, Djamila Aouada |
| RATE-DISTORTION OPTIMIZED NONLINEAR TRANSFORM CODING FOR VVC RESIDUAL BLOCKS | Antoine Monier, Pierre Hellier, Fabrice Le Léannec, Karam Naser, Aline Roumy |
| Event-Image Deep Stereo Using Multi-Scale Cross Modal Attention | Jie Liang, Cheolkon Jung |
| HiPerViT: A Hybrid Multi-Scale Encoder for Hierarchical Patch Representation on Imbalanced Low-Resolution Data | Sakib Ahammed, Xia Cui, Xinqi Fan, Wenqi Lu, Moi Hoon Yap |
| Federated SVDD Prototype Exchange for Decentralized Object Detection in Natural Disasters | Evgenios Vlachos, Vasileios Mygdalis, Ioannis Pitas |
| LATREF: CONTROLLABLE ILLUMINATION GENERATION AND REFLECTANCE ESTIMATION FROM A SINGLE IMAGE WITH LATENT DIFFUSION MODELS | Li Luo, Daljit Singh Dhillon |
| Refining Acoustic-Based 3D Human Pose Estimation via Vision Pretraining | Han-Hsin Lin, Zhong-Wei Lin, Pei-Yuan Wu |
| PLESS: Pseudo-Label Enhancement with Spreading Scribbles for Weakly Supervised Segmentation | Yeva Gabrielyan, Varduhi Yeghiazaryan, Irina Voiculescu |
| EMOVIS: EMOTION-OPTIMIZED IMAGE PROCESSING | Dor Barber, Rony Zatzarinni, Hava Matichin, Noam Levy |
| Overview of the block-partitioning framework in AV2 | Urvang Joshi, Chi Yo Tsai, Yue Chen, Jayasingam Adhuran, Leo Zhao |
| DISTRIBUTIONAL MODELING OF EVENT-CAMERA STREAMS VIA HIERARCHICAL INTERACTION LEARNING | Wessim Omezzine, Lionel Fillatre |
| 4D Neural Voxel Splatting: Dynamic Scene Rendering with Voxelized Guassian Splatting | Chun Tin Wu, Jun-Cheng Chen |
| Revealing details in adaptation process: Perceptual Sensitivity Adaptive Volumetric Mamba for Low-light Image Enhancement | Huiling Zhou, Qingbo Wu, Fanman Meng, Hongliang Li |
| Label Free Change Detection: A Statistical Shift for Zero-Shot Change Detection | Haolin Huang, Peiyao Guo, Zhizhuo Jiang, Yu Liu |
| Exploring fine-grained UGC compression quality in a no-reference based approach | Yilin Wang, Andreas Pastor, Yaohong Wu, Suriya Prakash Jambunathan, Neil Birkbeck, Balu Adsumilli |
| Task-adaptive Local Rate Control for Neural Video Coding for Machines | Marc Windsheimer, Simon Deniffel, Andre Kaup |
| Refining the Anatomical Representation of Autism: A Comparative sMRI Study of ROI, and Vertex-Level Features for ASD Communication Severity Grading | Mostafa Abdelrahim, Mohamed Khudri, Moumen El-Melegy, Ali Mahmoud, Ahmed Shalaby, Asem Ali, Mohammed Ghazal, Fatma Taher, Ashraf Khalil, Sohail Contractor, Gregory N. Barnes, Ayman El-Baz |
| Direct Kernel Optimization: Efficient Design for Opto-Electronic Convolutional Neural Networks | Ali Almuallem, Harshana Weligampola, Abhiram Gnanasambanbdam, Wei Xu, Dilshan Godaliyadda, Hamid Sheikh, Stanley Chan, Qi Guo |
| Proximal Vision Transformer: Geometry-Inspired Feature Enhancement | Haoyu Yun, Hamid Krim, Emilie Chouzenoux, Jean-Christophe Pesquet, Bo Jiang |
| A Comprehensive Analysis of Lightweight Design Strategies for Camouflaged Crop Detection | Mari Salvador Lapuz, Audrea Arjaemi Tabadero, Charles Joseph Hinolan, Mac Andre Javellana, Arren Matthew Antioquia |
| Confidence-Gated Training for Efficient Early-Exit Neural Networks | Saad Mokssit, Ouassim Karrakchou, Alejandro Mousist, Mounir Ghogho |
| SAR-NEUS: NEURAL SURFACE INVERSE RENDERING FOR 3D RECONSTRUCTION AND NOVEL VIEW SYNTHESIS FROM SAR IMAGERY | Miguel Andres Alonso, Jose Delgado, Alejandro Betancor del Rosario, Giorgia Gobbi, Vicent Gilabert Maño, mario alfonso arsuaga, Andrea Castiella Aguirrezabala, Francescopaolo Sica, Orlando Ávila García |
| PT-GS: Prompt Tuning based Generalizable 3D Synthesis towards Real-Time Cross-Domain Adaptation | Qingyuan Hou, Chunshu Wu, Sushant Kondguli, Tong Geng, Michael Huang |
| Overview of Intra prediction and intra mode coding in AV2 | Leo Zhao, Tianqi Liu, Shan Liu, Jayasingam Adhuran, Qingyang Zhou, Jianle Chen, Cheng Chen, Raul Blazquez, Yixin Du, Mei Guo, Xin Zhao, Van Luong Pham, Mariana Afonso |
| WiSDet: A Windshield-Guided Few-Shot Framework for Detecting Car Stickers in the Wild | Ma. Ysabel Bondoc, Jonaviene Capunitan, Matthew James Villarica, Arren Matthew Antioquia |
| Residual and Entropy Coding in AV2 | Alican Nalci, Hilmi E. Egilmez, Madhu Peringassery Krishnan, Joe Young, Joel Sole, Xiaoqing Zhu, Zhijun Lei, Kruthika Koratti Sivakumar, Qingyang Zhou, Minhua Zhou |
| Beyond Average FPS: Assessing ACR-HR vs. DCR for Frame Drop Severity and a Novel No-Reference Metric | Suriya Prakash Jambunathan, Andreas Pastor, Xujin Zhang, Neil Birkbeck |
| Is Haar Enough? Exploring Symlets and Coiflets for Wavelet Convolution Layers | Md Rifat Ur Rahman |
| CoreView: Compact Yet Complete Video Representation | Susim Roy, Arjun Ramesh Kaushik, Nalini Ratha, Venu Govindaraju |
| Caption-Guided Graph-Structured Action Segmentation for Weakly Supervised egocentric Dense Video Captioning | Takuya Kobayashi, Tatsuya Sasaki, Yoshiki Ito, Takayuki Akiyama |
| Understanding Domain-Shift Immunity in Deep Deformable Registration | mingzhen shao, Sarang Joshi |
| Action Recognition in Virtual Reality: Real-Time Detection of Sports Gestures using Shallow Learning for Image-Encoded Kinematics | Jaime Gallego, David Bernal-Casas |
| Physics-constrained Diffusion Attack Against SAR Target Recognition | Yanjing Ma, Jifang Pei, Weibo Huo, WENJING WANG, Yin Zhang, Yulin Huang, Jianyu Yang |
| Establishing Robust Retinal Eye Tracking: A Weakly Supervised Algorithmic Framework | Bo Wen, Dillon Lohr, Yatong An, Pushkar Anand, Alexander Fix, Ruobing Qian, Catherine Fromm, Yimin Ding, Truong Nguyen, Mohamed El-Haddad, Francesco La Rocca |
| DYNAMIC RESOLUTION SWITCHING FOR LIVE STREAMING | Xin Xiong, Yixu Chen, Hai Wei, Yongjun Wu, Sriram Sethuraman |
| Compressing Feed-forward 3D Gaussian Splatting via Feature Sorting | Xinrui Ju, Shanzhi Yin, Xinju Wu, Bolin Chen, Ru-Ling Liao, Shiqi Wang, Yan Ye |
| Learning Across Content-Disparate Modalities: Cross-Modality and Semantic Guided Keypoint Matching for Optical-SAR Alignment | Yu Wang, yepeng liu, Zaiwang Gu, Shengkai Chen, Wee Siong Ng, Ha Linh Trinh, Hieu Kieu, Wing Keung, Adrian LAW, Jun Cheng |
| Pseudo-label Induced Subspace Representation Learning for Robust Out-of-Distribution Detection | Tarhib Al Azad, Faizul Rakib Sayem, Shahana Ibrahim |
| Large Vision–Language Models with Object Structure Alignment for Image Matching | Nguyen Xuan Nam, Hidetomo Sakaino |
| End-to-End Learning of Metalens-Based Compressive Sensing and Differentiable Coding for Hyperspectral Image Transmission | Takayuki Sasaki, Yoko Sogabe, Kazuya Hayase, Masaki Kitahara, Yukihiro Bandoh |
| LPConv: Laplacian Pyramid Convolutions for Parameter-Efficient Receptive Field Expansion | Naoki Nishiya, Akira Kubota |
| A Rank-Based Wasserstein Distance for Comparing the Contrastive Power of Post-Hoc XAI Techniques | Tamara Lenhard, Nicole Wagner, Horst Zisgen |
| Manifold-Guided Unified Learning for Partial-Label Domain Adaptation | Yifan Pan, Guibo Luo, Yuesheng Zhu |
| Beyond Detection: Analyzing and Classifying Global and Local Memorization in Diffusion Models | Jiyoon Kim, Junha Park, Jaehui Hwang, Jong-Seok Lee |
| Coarse-to-Fine: Progressive Image Compression for Semantically Hierarchical Classification | Jungwoo Kim, Jun-Hyuk Kim, Jong-Seok Lee |
| R-OVAR: Robust Open-Vocabulary Action Recognition for Practicality | Chen Ju, Xu Chen, shuai xiao |
| An Environment-Adaptive Camouflage Pattern Generator | Ciheng Wu, Min Liu, Yinghui Gao, Chule Yang, Zhaoyuan Wu, Naiyang Guan |
| ACTION DIFFERENCE IDENTIFICATION VIA MULTI-VIEW RELIABILITY RANKING | Kotone Mutsuna, Ryota Goka, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama |
| OUT-OF-DISTRIBUTION DETECTION VIA UNCERTAINTY DISTINCTION WITH DIRICHLET GAUSSIAN PROCESS | Ryusei Mikami, Koshi Watanabe, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama |
| TCSEG: TOPOLOGY-CONSISTENT SEMANTIC SEGMENTATION OF CORONARY ARTERIES USING INVASIVE ANGIOGRAPHY | Zhenchang Liu, Tao Wan, jiexu cui, Lei Cao, Zengchang Qin |
| REMOTE SENSING CHANGE DETECTION WITH CROSS MLSTM | Elman Ghazaei, Erchan Aptoula |
| CLOE: A CONFIDENCE-BASED LOCAL-TO-GLOBAL ESTIMATION FRAMEWORK FOR MULTISPECTRAL ILLUMINANT RECOVERY | Matteo Kolyszko, Alessio Mognato, Marco Buzzelli, Simone Bianco, Raimondo Schettini |
| Swin-Control-LDM: Structure-Preserving 3D Cross-Modality MRI Synthesis via Disentangled Global-Local Modulation | Lifang Zhou, Xiaoqing Li, Jun Hu |
| 3D Semantic Gaussians Compression for Occupancy Prediction | Yi-Chen Chiu, Jui-Chiu Chiang |
| t-APML: A Motion-Gated Loss for Dynamic 3D Point Cloud Generation Tasks | Sasan Sharifipour, Constantino Alvarez Casado, Manuel Lage Cañellas, Daniel Herrera Castro, Miguel Bordallo |
| Manifold Optimization on the Magnitude Torus for Fourier Phase Retrieval | Wen Perng, Po-Hung Cheng, Homer Chen |
| TriFIRNet: A Tri-Stage Frequency-Domain Interactive Restoration Network for Adaptive Low-Light Image Enhancement | Yaxing Zhang, Pang Jia, Hua Li, Fei Zhou, Wenhui Wu |
| HTGBNet: A Hybrid Transformer Graph Network with Boundary Awareness for Brain Image Segmentation | Aiman Solyman, Ahmed Elazab, Mohamed Rahouti, Ali Alfatemi, Zeinab Mahmoud |
| SSM-UNet: Structure-Aware Cross-Line Laser Detection for Robust Underwater 3D Reconstruction | heyang gao, Yuki Nishida, Takafumi Iwaguchi, Hiroshi Kawasaki |
| YOLOv8-Leaf-Pose: Keypoint-Driven Localization of Weed Apical Meristem for Laser Weeding | Haoyue Han, Ruixin Wei, Fan Wu, Yuxing Han |
| HYBIC: A HYBRID BINARY-REAL GRID AND CONTEXTUAL MODELING FRAMEWORK FOR NERF COMPRESSIOM | Pu-Hsueh Yen, You-Cheng Chen, Dian-Xuan Yang, Jui-Chiu Chiang |
| Cell Phantom Video Generation in Elliptical Fourier Descriptor Domain | Francesco Benedetto, Roberto Basla, Luca Magri, Giacomo Boracchi |
| Hardware-Accelerated Implementation of Multi-Resolution Motion Estimation for HEVC | Akseli Epäilys, Jesse Smedberg, Panu Sjövall, Alexandre Mercat, Jarno Vanne |
| ROBUST WATERMARKING WITH LATENT ADAPTER ON RECTIFIED FLOW MODELS | Hongfei Wu, Xiaodan Lin, Gewei Tan |
| Benchmarking Conventional and Learning-based Image Codecs for the Future JPEG DNA Standard | Claire Couvreur, Michela Testolina, Théo Ladune, Guillaume Lorand, Pierrick Philippe, Marc Antonini |
| DyD-DETR: Dynamic Gated Fusion and Dual-Domain Interaction Network for Small Object Detection in UAV Imagery | Xin Cong, Yuan Bai, Siyu Qiu |
| Target-aware training set search for HDR video dataset via shallow feature matching and motion-exposure cues | Fengshan ZHAO, Qin Liu, Takeshi Ikenaga |
| XAI or Attention: improving performance of object detectors with XBL | Alexey Zhukov, Jenny Benois-Pineau, Amira Youssef, Akka Zemmari, Mohamed Mosbah, Virginie Taillandier |
| Data-Parallel CUDA Implementation of the SNIC Super-pixel Algorithm | Kıvanç Taş, Toygar Akgün |
| A$^2$SR: Any-Resolution and Any-Step Diffusion Image Super Resolution with Pure ConvNets | Ruiqing Wang, Kai Zhang |
| SIGMA-Based RGB-Hyperspectral Fusion for Semantic Segmentation in Autonomous Driving | Sai Kiran Kocherla, Srinivas Aditya Abbaraju, Adduru U G Sankararao, Duswanth reddy, Rajalakshmi P |
| WREN: Low Light Image Enhancement Using Retinex theory-based Double U-Net-like Structures | Reina Kaneko, Junya Hara, Hiroshi Higashi, Yuichi Tanaka |
| HALLUCINATION MITIGATION IN LARGE VISION-LANGUAGE MODELS VIA CONTRASTIVE DECODING WITH ATTENTION ENHANCEMENT AND MASKING | Chen-Yang Huang, Pin-Zhen Chen, Huei-Fang Yang |
| Bridging Spectral and Spatial Signatures: A Dense Bag-of-Words Approach for Multispectral and Hyperspectral Image Analysis | Mihail-Gabriel Botezatu, Ioana Voica, Daniela-Iulia Calota, Mihai Datcu, Andrei Anghel |
| Squeeze Out Tokens from Sample for Finer-Grained Data Governance | Weixiong Lin, Chen Ju, shuai xiao, Haicheng Wang, Yuheng Jiao |
| Unsupervised Domain Adaptation for Enhanced Radiometer Image Precipitation Estimation using Conditional Flow Matching | Victor Enescu, Assaad Zeghina, Matthieu Meignin, Nicolas Viltard, Cécile Mallet |
| Sens-VisualNews: A Benchmark Dataset for Sensational Image Detection | Andreas Goulas, Damianos Galanopoulos, Evlampios Apostolidis, Vasileios Mezaris |
| FLASH: Real-Time LiDAR Super-Resolution via Frequency-Aware Multi-Scale Fusion | June Moh Goo, Zichao Zeng, Jan Boehm |
| MDSeg: Enhancing Text Segmentation via Detection-Guided Multi-Task Learning | Xing Shicong, Yan Li, Yan Shu, Yaru Zhao, Binyang Li |
| UNmix: A dual decoder U-Net for regression-based unmixing of subcellular structures in 3D confocal images | Hajar Hakkoum, Sandrine Lefranc, ayoub ouddah, magalie uyttewaal, david bouchez, martine pastuglia, Philippe Andrey |
| DeSal: Detail-Enhanced Network for High-Resolution Video Saliency Prediction | Jiongzhi Lin, Jiankai Xu, Wenhui Wu, Fei Zhou |
| TSOG: A format for Temporally and Spatially Ordered Gaussians | Shady Gmira, Evangelos Alexiou, Emmanouil Potetsianakis, Emmanuel Thomas |
| FLASH: Efficient Impact Fall Detection with Unified Hypergraph State-Space Model | Youssef Mourchid |
| Sampling High-Dimensional Constrained Gaussian Distributions Using Circulant Gibbs | Pierre Minier, Jean-François Giovannelli, François Orieux, Marcelo Pereyra |
| SCONE: Sketch-Guided Spatial Gating for Multi-Object 3D Scene Reconstruction | Ruyi Li, Zhihan Yin |
| MACDet: A MisAligned Multispectral Vehicle Detection Network Based on Deformable Cross-Attention | si Huang, shangping zhong, Kaizhi Chen, Wu YunBing |
| IMPROVING SIMILARITY-BASED KNOWLEDGE TRANSFER USING PROTOTYPE REPRESENTATIONS | Dimitrios Spanos, Nikolaos Passalis, Anastsios Tefas |
| Anchored Reliability: Decoupling Estimation from Adaptation for Noisy Test-Time CLIP | Malavika Hariprasad, Soma Biswas |
| Correlation-Aware Knowledge Distillation for Deep Discriminative Feature Learning in Image Retrieval | Ioanna Valsamara, Ioannis Pitas |
| Improving Privacy-Utility Trade-off with Learnable Privacy Mechanism in Machine Learning Tasks | Savas Ozkan, Sinan Mutlu, Mete Ozay |
| Viewpoint-Aware Bitrate Optimization for Multi-Asset 3D Scenes | Tomás Borges, Yago Sánchez, Cornelius Hellge, Ricardo de Queiroz |
| Analysis of the Impact of Training Data Distribution for Neural Reference Frame Generation | Qipu Qin, Cheolkon Jung |
| CAMERA PARAMETER SEARCH (CPS): A SYNTHETIC DATASET FOR SINGLE-IMAGE CAMERA CALIBRATION WITH GROUND-TRUTH INTRINSICS AND BROWN-CONRADY DISTORTION | Faiz Muhammad Chaudhry, Jarno Ralli, Jerome Leudet, Fahad Sohrab, Farhad Pakdaman, Pierre Corbani, Moncef Gabbouj |
| HGS: Head-of-Queue Slack-Aware Generation Scheduling for Generative Real-Time Interaction Systems | Bo Peng, Lianchen Jia, Chaoyang Li, Lifeng Sun |
| SelfVTON: Enhancing Virtual Try-On with Self-Supervised Cloth Detailing and Body Alignment | Shengyi Wu, Lingxiao Lu, Xianbing Sun, Zheng Wang, Jianlou Si, Liqing Zhang, Jianfu Zhang |
| GaLe: memory-efficient Global Approximate and Local Exact features | Alberto Ancilotto, Elisabetta Farella |
| HPA-Seg: Correlation-Gated Fusion and Reliability-Weighted Prototypes for Multi-Modal Remote Sensing Segmentation | Hongkang Zhang, Shao-Lun Huang, Yanlong Wang, Ercan Engin Kuruoglu |
| HYPERNEST-TTA: HYPERBOLIC NESTED LEARNING WITH TEST-TIME ADAPTATION FOR DIABETIC RETINOPATHY ASSESSMENT | Francesco Rundo, Massimo, Spata,, Andrea Calvagna, Andrea Orazio Caruso, Emiliano Tramontana, Sebastiano Battiato |
| Information Router for Mitigating Modality Dominance in Vision-Language Models | Seulgi Kim, Mohit Prabhushankar, Ghassan AlRegib |
| Boosting Point Transformer Segmentation with Self-Supervised Pretrained Point Encoders for Forest Point Clouds | Cosmin-Ioan Grigoruta, Mihai-Sorin Stupariu, Ileana Patru-Stupariu |
| HIGH-FIDELITY FUNDUS IMAGE RESTORATION BEYOND GROUND TRUTH | Ozer Can Devecioglu, Serkan Kiranyaz, Uyen Phan, Ilke Adalioglu, Moncef Gabbouj |
| Deep-Unfolded Autofocus Imaging for Distributed MIMO Radar | Tsubasa Terada, Hassan Mansour, Petros Boufounos, Ryuhei Takahashi |
| SSDOC-DET: EFFICIENT DOCUMENT LAYOUT DETECTION VIA SELECTIVE STATE-SPACE MODELING | Jing-Ming Guo, Chun-Wei Huang, Yi-Chong Zeng, Cheng-Yen Hsiao |
| HMS3FORMER-A Hyperspectral Image Restoration Model Based on Multi-Stage Spatial-Spectral Transformer and Endmember Attention | Zheng-Yang Wu, Chia-Ming Lee, Yu-Fan Lin, Chih-Chung Hsu, Shih-Yu Chen, Li-Wei Kang |
| LiPS: Lightweight Panoptic Segmentation for Resource-Constrained Robotics | Calvin GALAGAIN, Martyna POREBA, François GOULETTE, Cyrill STACHNISS |
| EWMA-CDT for adaptive temporal aggregation in online QIS filtering | Francesco Scroccarello, Edoardo Peretti, Aleksi Suonsivu, Leevi Uosukainen, Lauri Salmela, Stefano Bertolasi, Ionut Schiopu, Giacomo Boracchi |
| CADS: Conformal Adaptive Decision System for Cost-Efficient Image Classification | Mikael Turkoglu, Tim Bary, Vincent Thielens, Manon Dausort, Benoit Macq |
| CETNET: A CONTRAST ENHANCEMENT TWIN-BRANCH NETWORK FOR LOW-LIGHT ENHANCEMENT | Yifu He, DongJin Huang, Jiantao Qu, Yiyan Fan |
| ATTENTION-AWARE TRANSFORMER-BASED AGGREGATION NETWORK FOR VIDEO PERIOCULAR RECOGNITION | Luiz Guilherme Fonseca Carreira, Breno A Mariano, Victor H C de Melo, David Menotti, William Robson Schwartz |
| LAYERNORM-AWARE COMPRESSION OF VISION TRANSFORMERS VIA QUANTIZATION AND PRUNING | Soumya Sharma, Archita ., Parimala Kancharla |
| APCSEG: ADAPTIVE PROMPT COORDINATION TOWARD ROBUST ABDOMINAL VOLUMETRIC SEGMENTATION | jiexu cui, Lei Cao, Tao Wan, Zhenchang Liu, Jiankun Xu, Zengchang Qin |
| Bayesian Image Reconstruction With Local Linear Regressors | Colas Schretter |
| ILLUMINATION-DECOUPLED DUAL-UNET FOR SINGLE IMAGE DEVIGNETTING | Mariam Hossam, Hicham G. Elmongui, Marwan Torki |
| AI FOR 3D CHARACTERIZATION OF BUILDING INVENTORIES FOR VULNERABILITY AND EARTHQUAKE RISK MAPPING FROM SAR DATA | Pegah Moradpour, Mahila Hosseini, Luigi Russo, Babak Memar, Paolo Ettore Gamba |
| FMI2P-Loc: Using Foundation Models for Large-Scale Image-to-Point Cloud Visual Localization | Chin-Wei Kuo, PEI-I WU, Kuan-Wen Chen |
| Parallel Context Modeling for Sliding Window Attention in Neural Video Coding | Alexander Kopte, Andre Kaup |
| Correlation-Based Spectral Fidelity-Guided Unrolled Tensor Rank Minimization for Pansharpening | Dung Viet Phan, Chuong Hoang Vo, Chul Lee |
| Toward Semantic-Agnostic and Shape-Aware Vision-Language Segmentation Models | Corentin Seutin, Mohamed Amine Ettaki, Michaël Clément, Pierrick Coupé, Rémi Giraud |
| Bridging 2D Efficiency and 3D Context: A Memory-Guided Framework for Knee MRI Multi-label Classification | Huy Nguyen, Khang Le Minh, Cuong Nguyen |
| An Overview of Inter Coding Tools in AV2 | Yeqing Wu, Keng-Shih Lu, Mohammed Sarwer |
| AGLDM: ATTRIBUTE-GUIDED ZERO-SHOT TEXT-TO-IMAGE SYNTHESIS USING DATA-EFFICIENT LATENT DIFFUSION MODEL WITH SELF-CONSISTENCY LOSS | Sougata Moi, Angshuman Paul |
| [ICIP 2026] ADAPT: Any-codec Diffusion-based Adaptation for Image Perception-Distortion Tradeoff | Yen-Kuan Ho, Feng Chu Lin, Ting-Han Lin, Huu-Tai Phung, Ching-Chun Huang, Alessandro Gnutti, Wen-Hsiao Peng |
| QUALITY CONSISTENCY SCORE (QCS): A SURVIVAL-BASED RELIABILITY DESCRIPTOR FOR VIDEO QUALITY ASSESSMENT | Sergio Sanz-Rodríguez, Jon Frydensbjerg |
| Multimodal Analysis of T2-Weighted MRI and Clinical Data for Recurrence Prediction in Non–Muscle-Invasive Bladder Cancer | Israa Sharaby, Ahmed Alksas |
| SCENE-SPECIFIC MESH-GUIDED SUPERVISION FOR MONOCULAR 3D OBJECT DETECTION | Yash Patel, Ryosuke Kawamura, Mose Sakashita, Yusuke Hida, Laszlo Jeni, Koichiro Niinuma |
| AUTHENTICATION OF COPY DETECTION PATTERNS VIA CROSS-CAMERA DUAL-SYNTHETIC REFERENCING | Ivan Oleksiyuk, Roman Chaban, Slava Voloshynovskiy |
| Task-Oriented Source Coding Using LDPC Codes for Compressed-Domain Image Retrieval | Ahcene Aliouet, Yann Miguet, Elsa Dupraz, Aline Roumy |
| Faithful Grounded Visual Reasoning via Learned Proxy-Tokens | Tom Hodemon, Mohamed Chaouch, Aboubacar Tuo, Angelique Loesch |
| ADNET: ANISOTROPIC DEFORMABLE NETWORK FOR ENHANCED BOUNDARY-AWARE POLYP SEGMENTATION | Federico Urli, Luca Zaccagna, Andrea Salfinger, Francesca Incitti, Lauro Snidaro |
| FaSST: Fast Sparsifying Secondary Transform | Darukeesan Pakiyarajah, Samuel Fernandez, Eduardo Pavez, Antonio Ortega, Debargha Mukherjee |
| Noisy MRI Reconstruction via MAP Estimation with an Implicit Deep-Denoiser Prior | Nikola Janjusevic, Amirhossein Khalilian-Gourtani, Yao Wang, Li Feng |
| Beyond Frontal: A Renference Model for Joint Multi-view Blind Face Restoration | Marcelo Sanchez Ortega, Lara Raad, Coloma Ballester |
| Unrolled neural mapping schemes based on variational representations for satellite ocean remote sensing | Paul de Nailly, Ronan Fablet, Daniel Zhu, Maxime Beauchamp |
| RELIABLE SEMANTIC IMAGE TRANSMISSION VIA JOINT DJSCC DIFFUSION FRAMEWORK | Nimesh Pollwaththage, Yasith Ganearachchi, Prabhath Samarathunga, Joseph El Gemayel, Anil Fernando |
| Learning from Ambiguity: Uncertainty-Weighted Consistency and Structure-Aware Contrastive Objectives for Medical Image Segmentation | Maregu Assefa, Divya Velayudhan, Muzammal Naseer, Kumie Gedamu, Iyyakutti Iyappan Ganapathi, Naoufel Werghi |
| SPACE: Semantic Projection and Alignment of CLIP Embeddings for Domain Adaptation | João Renato Ribeiro Manesco, Danilo Jodas, Douglas Rodrigues, Leandro Aparecido Passos, Joao P. Papa |
| You Only Step Once: A Single-Pass Zero-Order Sharpness-Aware Minimization for Sparse Training | Jie Ji, Gen Li, Kaiyuan Deng, Fatemeh Afghah, Xiaolong Ma |
| OVERVIEW OF TRANSFORM CODING IN AV2 | Madhu Peringassery Krishnan, Xin Zhao, Alican Nalci, Keng-Shih Lu, Hilmi E. Egilmez, Aki Kuusela, Urvang Joshi, Van Luong Pham, Lin Zheng, Jingning Han, Kruthika Koratti Sivakumar |
| NEXT2FORMER-CD: EFFICIENT REMOTE SENSING CHANGE DETECTION WITH MODERN VISION ARCHITECTURES | Yufan Wang, Sokratis Makrogiannis, Chandra Kambhamettu |
| Hierarchical Motion Estimation and Compensation for Learning-based Dynamic Point Cloud Compression | Junghyun Ahn, André F. R. Guarda, Dong Tian |
| An overview of screen-content coding tools in AV2 | Qingyang Zhou, Van Luong Pham, Cheng Chen, Mohammed Sarwer, Yingbin Wang, Aki Kuusela, Dzung Hoang, Guichun Li, Shan Liu |
| HMDER-AttnNet: A Hybrid Attention-Based Deep Learning Framework for Noise-Aware Brain MRI Image Enhancement and Restoration | Shankar Tiwari, Subham Pramanik |
| Learning Geometry-Consistent Graphs for Multi-Modal Geophysical Data Interpolation | Kevin Arias, Paul Goyes, Antonio Ortega, Henry Arguello |
| Sparse Attention to Emotion: Efficient Facial Emotion Recognition via Token Reduction | Aya Zitouni, Aicha Zenakhri, Karim Haroun, Larbi Boubchir |
| ViPo-MLLM: Visual-Pose Multimodal LLM for Gloss-Free Sign Language Translation | Ahmed Abul Hasanaath, Bicheng Xu, Mir Rayat Imtiaz Hossain, Leonid Sigal, Hamzah Luqman |
| Denoising of Two-Phase Optically Sectioned Structured Illumination Reconstructions Using Encoder-Decoder Networks | Allison Davis, Yezhi Shen, Xiaoyu Ji, Fengqing Maggie Zhu |
| Rate Distortion Optimization for Mesh Geometry Compression | Qingyang Zhou, Pranav Kadam, Shan Liu, C.-C. Jay Kuo |
| RATE-DISTORTION-COMPLEXITY ANALYSIS OF PARAMETRIC VIDEO CODECS | Ricardo de Queiroz, Diogo Garcia, Yi-Hsin Chen, Ruhan Conceição, Luciano Agostini, Wen-Hsiao Peng |
| MAE-UNETR++: Masked Autoencoder Pretraining for 3-D Lung Nodule Segmentation | Vinayak Savant, Jianhua Xuan |
| RAW Image Compression with ISP Priors and Side Information | Zixu Chen, Yuqi Li, Li Li, Xiangji Wu, Dong Liu |
| F2-OWOD: Frequency-Domain Feature Decoupling with Foundation Models for Open-World Object Detection | Weilong Zhu, Hualei Shen |
| ROBUST PRIOR-GUIDED SEGMENTATION FOR EDITABLE 3D GAUSSIAN SPLATTING | Raushan Joshi, Jean-Yves Guillemaut |
| RPO: Training-Free Flow Matching Refinement via Regional Preference Optimization | Dejiao Xue, Yiwei Tang, He Wang, Longquan Dai |
| Spatial-Frequency Cooperative Fusion Network for Multimodal Medical Image Fusion | Guanghui Yue, Wentao Li, Siqi Xiao, Cheng Zhao, Tianfu Wang, Tianwei Zhou |
| Uncertainty-Guided Hybrid CNN-Transformer Architecture for Aircraft Surface Defect Detection | Victor Wu, Jichi Ge, Jieling Gong, Eric Saczuk, Michal Aibin |
| Pixel to Geocoordinate Mapping in Oblique and Nadir UAV Imagery | Michal Aibin, Suchang Cao, Victor Wu, Zhiyuan Yang |
| RADMI: LATENT INFORMATION AGGREGATION AS A PROXY FOR MODEL UNCERTAINTY | William Stevens, Mohit Prabhushankar, Ghassan AlRegib |
| ZERO-SHOT MEMORABILITY CONTROL IN DIFFUSION MODELS | Ren Togo, Ryo Shichida, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama |
| Lossless Image Coding Using Context-driven Neural Distribution Estimation | Victor Fabre Figueiredo, Lucas Lopes, Ricardo de Queiroz, Philip Chou |
| Anatomical Region Powered Laryngoscopic Report System via Vision-Language Model | Kaiwen Xiong, Yi Liu, Jiayue Xiao, Ruixin Li, Sisi Zheng, Binbin Wang, Ting Xiang, Feng Wang, Xiaomao Fan, Dan Lu, Yumeng Liu |
| DeSD: A Depth-Aware Diffusion Framework for Small Object Detection in Grassland Rat Holes | Lei Xu, Ru Li |
| Learning Perceptual Representations for Gaming NR-VQA with Multi-Task FR Signals | Yu-Chih Chen, Michael Wang, Chieh-Dun Wen, Kai-Siang Ma, Avinab Saha, Li-Heng Chen, Alan Bovik |
| X-SSLext: Semantic Prototype Self-Distillation for Proposal-Based X-ray Threat Representation | Yonathan Michael, Mohamad Alansari, Andreas Henschel, Naoufel Werghi |
| Low-Light Image Enhancement with Structural Contrast Prior, LAB Noise Modeling and Multi-Path Image Fusion | Xinchen Ma, Kailiang Ye, Zheng Lu |
| MG-NET: A NEW MULTI-AXIAL GUIDANCE NETWORK FOR ABDOMINAL MULTI-ORGAN SEGMENTAION | YiYang Chen, Qian Huang, Yulin Chen, Hexuan Hu, Ziyang Yin, Meng Geng |
| Accelerating Learned SAR Image Compression via Selective Channel-Group Encoding Bypass | Zach Button, Paras Maharjan, Zhu Li |
| Perceptually Optimized LOP In-Loop Filter for VVC Based on Multi-Scale Head Alignment | Shaochong Wu, Cheolkon Jung |
| Learning Dual-Attribute Prompts with Progressive Tuning for AI-Generated Image Quality Assessment | Chenyang Zhang, Pengyu Wang, Yiping Duan, Xiaoming Tao |
| Effective Degree-wise Scalability of Spherical Harmonic Coefficients for 3D Gaussian Splatting Compression | Jianfeng Xu, Ryosuke Watanabe, Keisuke Nonaka |
| Efficient Coreset Generation for Chest X-ray Imaging Using Compressed Sensing | Pradyumna Pradhan, Soham Mukherjee, Ramunaidu Randhi, Pradip Sasmal |
| A Stable Neural Statistical Dependence Estimator for Autoencoder Feature Analysis | Bo Hu, José C. Principe |
| COCO-Inpaint: A Benchmark for Detecting and Localizing Inpainting-Based Image Manipulations | Haozhen Yan, Yan Hong, Jiahui Zhan, Suning Lang, Yikun Ji, Yujie Gao, Huijia Zhu, Jun Lan, Jianfu Zhang |
| CLOUD-ROBUST SPATIOTEMPORAL FUSION OF SATELLITE IMAGES: A CONSTRAINED CONVEX OPTIMIZATION APPROACH | Ryosuke Isono, Shunsuke Ono, Antonio Ortega |
| HDRDCL: AN HDR OBJECT DETECTION AND SEGMENTATION DATASET FOR EVALUATION IN CHALLENGING LIGHTING CONDITIONS | Juan Merlos, Andre Harrison, Darius Jefferson, Velibor Adzic, Hari Kalva |
| Multimodal Attention Framework for Context-Aware and Semantically Rich Image Captioning | Nasser Gawfan, Waseem Ullah, Latif U. Khan, Mohsen Guizani |
| PAND: Prompt-Aware Neighborhood Distillation for Lightweight Fine-Grained Visual Classification | Qiuming Luo, Yuebing Li, Feng Li, Chang Kong |
| PhysUNeXt: Physics-aware Lightweight ConvNeXt-inspired U-Net for Hyperspectral Image Reconstruction | Xian-Hua Han, Jian Wang |
| Uncertainty-Aware Knowledge Distillation for Semantic Segmentation in Autonomous Driving | Armaghan Butt, Qing Tian |
| Secure Graph Filtering based on Graph Fourier Transform in Encrypted Domain | Yukihiro Bandoh |
| An Attention-Enhanced Network with Joint Dehazing and Retinex-Based Enhancement for Underwater Images | Sahana Ray, Bibhabasu Debnath, Sanjay Ghosh |
| FreeInstance: Training-Free Instance-level Customization | fengming liu, Tat-Jen Cham |
| PARAMETER-EFFICIENT FLEXIBLE EXPANSION AND MERGING WITH DUAL-STAGE MODULE RETRIEVAL FOR TASK-FREE ONLINE CONTINUAL LEARNING | Pin-Zhen Chen, Huei-Fang Yang |
| FEW-SHOT LEARNING OF UNCONDITIONAL LATENT DIFFUSION MODELS BASED ON DOMAIN ADAPTATION AND DOMAIN-INDEPENDENT LATENT SPACE | Katsumi Yamada, Kazuaki Nakamura |
| Scene-Aware Physics-Informed Neural Networks for Adaptive Car-Following Modeling | Hengyu Zhang, Chonghao Gao, Xin Yang, Xuyang Zhu, Yu Liao, Shijie Zhou |
| Leveraging motion estimation for Efficient Bayer-Domain Video Convolutional Networks | Haichao Wang, Jiangtao Wen, Yuxing Han |
| Listening without Looking: Modality Bias in Audio-Visual Captioning | Yuchi Ishikawa, Toranosuke Manabe, Tatsuya Komatsu, Yoshimitsu Aoki |
| TeSO: Representing and Compressing 3D Point Cloud Scenes with Textured Surfel Octree | Yueyu Hu, Ran Gong, Tingyu Fan, Yao Wang |
| DIAMOND SHAPE FILTER IN LOW COMPLEXITY NEURAL NETWORK-BASED IN-LOOP FILTERING FOR VIDEO CODING | Tong Shao, Jay N. Shingala, Ajay Shyam, Ajat Suneja, Siddarth P Badya, Peng Yin |
| HyperICM: Hyperspectral Image Compression for Machines with Task-Agnostic Semantics from Foundation Models | Jiayao Xu, Yujie Chen, Dingquan Li, Wenhan Yang |
| Predictive Label Consistency for Mitigating Robust Overfitting in Adversarial Training | Hanqi Zhang, Ke Xu, Xinghao Jiang, Tanfeng Sun |
| ID-Pruner: Disentangling Importance and Diversity for Training-Free Visual Token Pruning | Jie Ji, Gen Li, Fatemeh Afghah |
| Leveraging Vision-Language Models as Weak Annotators in Active Learning | Phuong Ngoc Nguyen, Kaito Shiku, Bise Ryoma, Seiichi Uchida, Shinnosuke Matsuo |
| Accelerated Blur Kernel Estimation with Local Boosting and Subimage Usage | KuanChung Ting, Chun-Wei Chang, Sheng-Jyh Wang, Ruey-Bing Hwang |
| TIME-AWARE SEMANTIC PROTOTYPES FOR WEAKLY-SUPERVISED ENDOMICROSCOPY VIDEO CLASSIFICATION | Ilán Carretero, Pablo Meseguer, Irene Zammarchi, Cecilia Pugliano, Giovanni Santacroce, Bisi Bode Kolawole, Ujwala Chaudhari, Rocío del Amor, Enrico Grisan, Marietta Iacucci, Valery Naranjo |
| HYPERDISTILL: ENABLING TEXT-FREE INFERENCE IN HYPERGRAPH-BASED MEDICAL IMAGE SEGMENTATION VIA KNOWLEDGE DISTILLATION | Afrouz Sheikholeslami, Sahar Moradizeyveh, Mohammad Hossein Ahmadi, Yuankai Qi, Amin Beheshti |
| Generating Topologically Sound and Geometrically Smooth Meshes | Daixi Jia, Haiyue Zhang, Cui Wang |
| Bilateral Kernel Regularization for Few-Shot Adaptation of Large Vision-Language Models | Omar Arif, Aizah Arif |
| CG-Track: Dual-Adaptive Temporal Enhancement and Cue-Gated Fusion for Robust Multi-Object Tracking | Jian Li, Fei Gu, Qian Zhou, Jing Wu |
| Rotation-Equivariant Multi-Scale Convolution via Adaptive Magnitude LBP | Peihong Lei, Yuying Ren, Siqi Chen, Fan Bai, Hanlin Mo |
| HOMOAD:LEVERAGINGHIERARCHICALHOMOGENIZATIONANDSYNERGISTIC SYNTHESIS FORINDUSTRIALANOMALYDETECTION | Fang Chih-Heng, Jou Jie-Deng, Yu-Hsuan Chiu, Jison Hsu |
| HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification | Valentina Vadori, Jean-Marie Graïc, Antonella Peruffo, Ujwala Chaudhari, Enrico Grisan |
| TeRIF:Region-Aware Image Fusion Conditioned on Textual Dynamics | ying luo, yanyin guo, chuiyi deng, zhuoyi zhao, junwei li |
| Uncertainty-Aware DualU-Net: Integrating Calibration and Uncertainty Fusion from Dual Decoders for Cell Analysis | David Anglada-Rotger, Ferran Marques, Montse Pardas |
| Gradient Loss for Spectral Reconstruction | Sona Bezirganyan, Lusine Davtyan, Aram Butavyan, Varduhi Yeghiazaryan |
| Energy Consumption Analysis of FPGA-Accelerated 2D HEVC Encoding in a Practical V-PCC Encoder | Louis Fréneau, Nhan Nguyen, Guillaume Gautier, Panu Sjövall, Maxime Pelcat, Alexandre Mercat, Jarno Vanne |
| Nix and Fix: Targeting 1000× Compression of 3D Gaussian Splatting with Diffusion Models | Cem Eteke, Enzo Tartaglione |
| SURGMLPS: UNLOCKING THE POTENTIAL OF CLIP WITH AN MLP-LIKE ARCHITECTURE FOR SURGICAL PHASE RECOGNITION | Hao Xie, Xutao Chen, Bonnie Law, Yuk Hee Chan, Kin-Man Lam, Kenneth K.W. Li, Tracy H.T. Lai, Victor S.C. Chu |
| Physics-Guided Single-Image Dehazing with Learned Transmission and Atmospheric Light Estimation | Koyyada Dinesh Kumar, Sujit Kumar Sahoo |
| When Restoration Becomes the Reference: Reusing Full-Reference IQA in Blind Settings | Aymen Sekhri, Abderrezzaq Sendjasni, Seyed Ali Amirshahi, Chaker Larabi |
| Efficient Remote Sensing Image Segmentation With Learnable Constrained Convolutional Enhancements | Mengmeng Zhang, Hongyuan Jing, Bo Ding, Tianxu Cui |
| Optimized three-component quaternion coding of 3D Gaussian splats in MPEG V-PCC standard | Adrian Dziembowski, Błażej Szydełko, Dawid Mieloch, Kwan-Jung Oh, Gwagnsoon Lee, Jun Young Jeong |
| STMGaze: Spatiotemporal Modeling with Orthogonal Mamba Scanning for Video-based Gaze Estimation | Jingzhi Jiang, Sirui Zhao, Xiaohao Wang, Fangyuan Liu, Tong Xu, Enhong Chen |
| FOREST CANOPY HEIGHT MAPPING USING DUAL-ENCODER ATTENTION U-NET AND MULTISEASON OPTICAL–SAR FUSION | Soma Satya Praveen Mutyala, Suraj Reddy Rodda, Rajashekar Gopalakrishnan |
| EVDI: Exposure-Aware Joint Video Deblurring and Interpolation under Unknown Exposure | Haodong Fan, Yingming Li |
| LEARNING UNCERTAIN BOUNDARIES: INTERACTION-FUSED MULTI-DECODER CONVOLUTIONAL NEURAL NETWORKS FOR PERINEURAL INVASION DETECTION IN HISTOPATHOLOGICAL IMAGES | VIJAY SANKAR BABU P, Faouzi Alaya Cheikh, Madhu S. Nair |
| ALADIN: Attention-based Lightweight Architecture for Drowsiness Identification | Mayuk Sarkar, Swarnava Dey, Arijit Mukherjee |
| Multi-view Consistency and Frequency-aware Modeling for Scattering Scene Reconstruction | Renrong Hu, Qianyue He, Dongyu Du, Yihui Fan, Zhiheng Li, Xin Jin |
| SEEING ROADS THROUGH WORDS: A LANGUAGE-GUIDED FRAMEWORK FOR RGB-T DRIVING SCENE SEGMENTATION | Ruturaj Reddy, Hrishav Bakul Barua, Junn Yong Loo, Thanh Thi Nguyen, Ganesh Krishnasamy |
| EF-ViMGaze: Dual-Branch Eye-Face Feature Learning Based on Vision Mamba for Gaze Estimation | Qinghe Li, Zhaonian Sun, Benying Tan |
| GATA2Floor: Graph Attention for Floor Counting in Street-View Facades | Ngoc Tan Le, Tzoulio Chamiti, Eirini Papagiannopoulou, Nikos Deligiannis |
| CogniORPO: Complexity-Adaptive Reinforcement Learning for Transparent Image Captioning | Junxin Wang, Yuchao Wang, Hongkai Zhang |
| SAM-Guided Unified Weakly-Supervised 3D Salient Object Detection Network | Le Hui, guohang li, Chen Wang, Qi Liu, Yuchao Dai |
| Physics-Informed Blind Adaptive Degradation Guided Network for Unsupervised Hyperspectral and RGB Image Fusion | Linxuan Huang, Song Liu, CONGXUAN ZHANG, Zhen Chen |
| See the past: Time-Reversed Scene Reconstruction from Thermal Traces Using Visual Language Models | Kebin Contreras, Luis Toscano, Mauro Dalla Mura, Jorge Bacca |
| SOCIAL GROUP ACTIVITY RECOGNITION FROM STILL IMAGES USING CONDITIONAL TOKEN SEQUENCE GENERATION | Shota Orihashi, Taiga Yamane, Naoki Makishima, Mana Ihori, Satoshi Suzuki, Tomohiro Tanaka, Ryo Masumura |
| GAUSSIAN SPLATTING WITH REFLECTIONS GUIDED BY WHAT IS SEEN | Yiming Liang, Tianyu Xiao, Hiroshi Ishikawa |
| DeKAHT: Data-efficient Kolmogorov-Arnold Hierarchical Transformer | Rajib Kumar Jha, Gurram Harshamanya Thilak, Saloni Kumari |
| PHYSICS-GUIDED DENOISING DIFFUSION FOR COMPRESSIVE X-RAY COMPTON BACKSCATTERING IMAGING | Abdullah Alrushud, Sarah Aranibar, Edgar Eduardo Salazar, Gonzalo Arce |
| VISION WITHOUT IMAGES: END-TO-END COMPUTER VISION FROM SINGLE COMPRESSIVE MEASUREMENTS | Fengpu Pan, Heting Gao, Jiangtao Wen, Yuxing Han |
| SIGHTA-AI: A TWO-STAGE ON-DEVICE VISION-LANGUAGE ARCHITECTURE FOR REAL-TIME VISUAL ASSISTANCE | Junwoo Lee, Yi Fang |
| Stop Denoising your blurs | Sasidhar Parvathireddy, Sree Rama Vamsidhar Saraswathula, Rama Krishna Sai S Gorthi |
| TC-UNet: Detection of Faint Star Spots Based on Time Consensus Feature Fusion Network | Yuheng Wei, Lixin Zhang, Xinguo Wei |
| CONVOLUTIONAL KOLMOGOROV-ARNOLD NETWORKS AND CONDITIONAL RANDOM FIELDS FOR REMOTE SENSING IMAGE SEMANTIC SEGMENTATION | Paola Grotti, Martina Pastorino, Gabriele Moser |
| RCFL: Recursive Clustered Federated Learning for Distributed Concept Drift | Konark Jaishy, Saumik Bhattacharya, Prabir Kumar Biswas |
| FastInstShadow: A Simple Query-Based Model for Instance Shadow Detection | Marin Wada, Takeru Inoue, Ryusuke Miyamoto |
| ENHANCING DIABETIC RETINOPATHY GRADING VIA ENTROPY-DRIVEN KNOWLEDGE DISTILLATION WITH MAMBA FUSED FREQUENCY CROSS-ATTENTION | DEV RISHI VERMA, Dipankar Das, DEEPAK RANJAN NAYAK, TAPAN KUMAR GANDHI |
| Network Quantization in Neural Video Coding: A Comparative Study across Coding Frameworks and Temporal Buffering Strategies | Huu-Tai Phung, Yu-Hsiang Lin, Chun-Hung Wu, Ruhan Conceição, Tzu-Hsiang Chou, Marcelo Porto, Luciano Agostini, Wen-Hsiao Peng |
| PATCH ENSEMBLES FOR ROBUST SALMON RE-IDENTIFICATION WITH WEAK TRAJECTORY LABELS | Espen Uri Høgstedt, Christian Schellewald, Annette Stahl, Rudolf Mester |
| LEVERAGING POINT CLOUD NORMALS FOR PRACTICAL V-PCC CODING | Amar Tious, Louis Fréneau, Guillaume Gautier, Toinon Vigier, Alexandre Mercat, Vincent Ricordel |
| Photon-Statistics-Driven Learning for Underwater Imaging | Zaichang Lu, Dongyu Du, Zhiheng Li, Xin Jin |
| Contribution-Aware Spatial Recalibration for Training-Free Image Classification | Haruhiro Takahashi, Ryuto Ishibashi, Lin Meng |
| Text image inpainting BY EXPLORING CONTEXTUAL SEMANTICS AND STRUCTURE PRIORS | Wangchuk Tsering, Qijun Zhao |
| FedKPer: Tackling Generalization and Personalization in Medical Federated Learning via Knowledge Personalization | Zoe Fowler, Ghassan AlRegib |
| KD-Ex: A Benchmark for Evaluating Explainability Transfer in Knowledge Distillation | Malaika Mushtaq, Michael Madden, Ihsan Ullah |
| The RealDefocus Benchmark for Defocus Deblurring | Tim Seizinger, Zhuyun Zhou, Radu Timofte |
| Graph-based feature learning for image classification | Isabela Borlido Barcelos, Zenilton Patrocínio, Alexandre Falcao, Ewa KIJAK, Silvio GUIMARAES |
| VLIGM-MoE: VISION-LANGUAGE INDIVIDUAL GRAPH MATCHING WITH INSTRUCTION-TUNED MIXTURE-OF-EXPERTS FOR ASD PREDICTION | Juliana Mantebea Danso, Enoch Opanin Gyamfi, Mylene Farias |
| REPA: Random-Order Embedding Predictive Autoregression for Sparse Vector Field Reconstruction | Bilginer Oral, Erdem Koyuncu |
| PHOTOSAR: SYNTHESIZING OBJECT-LEVEL NEAR-FIELD SAR RAW MEASUREMENTS FROM A SINGLE RGB IMAGE | Yuhuan Mo, Tingkai Hu, Chuandong Li, Hailing Xiong, Zhen Luo |
| Scale-Floor Constrained Fourier Basis Density Models for Transformer-Based Learned Image Compression | Yizhi Cao, Wen Tan, Fanyang Meng, Genhong Wang, Yongsheng Liang |
| Height-Aware Feature-Scale Adaptive RT-DETR for UAV Maritime Object Detection | Xuhang Wang, Zheng Lu |
| LUMEN: LOW-LIGHT UNIFIED MULTI-STAGE ENHANCEMENT NETWORK USING DEPTH-GUIDED FLASH, CLUSTERING, AND ATTENTION-BASED TRANSFORMERS | Bibhabasu Debnath, Sahana Ray, Sanjay Ghosh |
| Diversity Sampling via Maximum Dispersion Batch Selection | Nikita Kovalenko, Peter Eisert, Anna Hilsmann, Sebastian Bosse |
| Efficient Dense Matching for Enhanced Gaussian Splatting using AV1 Motion Vectors | Julien Zouein, Vibhoothi Vibhoothi, François Pitié, Anil Kokaram |
| Leveraging Pretrained RGB Denoisers for Hyperspectral Image Restoration | Daniele Picone, Mohamad JOUNI, Mauro Dalla Mura |
| Representation Compensation of SAM2 for Segmenting Objects under Transformation in Videos | Marco Cocco, Matteo Dunnhofer, Christian Micheloni |
| Window-based Linear Attention for Unified Local-to-Global Context in Image Super-Resolution | Nai-Jen Hsueh, Wen-Jiin Tsai |
| MODIFICATIONS TO BLOCK IMPORTANCE MAPPING AND ALIGNMENT TO GOP-BASED RPR | Kenneth Andersson, Per Wennersten, Jacob Ström |
| A Text-Aware Layered Compression Framework for Game Videos | Yanzhuo Ma, Lu Wang, Junyan Huo, Fuzheng Yang |
| Unified Spatio-Temporal BEV Attention for Omniscient Autonomous Driving with Multi-Sensor Fusion | Firas Jendoubi, Redouane Khemmar, Romain Rossi, Madjid Haddad |
| USPDet3D: Hybrid Uncertainty-Aware Dynamic Spatial Pruning for Efficient 3D Small Object Detection | Lin Qian, Mengyuan Ma, Lintao Xiang, Hongpei Zheng, Zhenghao Li, Hujun Yin |
| BUDGET-AWARE ADAPTIVE ADVERSARIAL PATCHES FOR BLACK-BOX OBJECT DETECTION | Pedram Mohajeransari, Amir Salarpour, David Fernandez, Mert D. Pesé |
| A Preliminary Numerical Feasibility Study of Radar Tomography for the Rubble-Pile Asteroid Dimorphos | Topi Pajala, Sampsa Pursiainen, Alexandra Koulouri, Christelle Eyraud |
| Unsupervised Defect Detection for Surgical Instruments | Joseph Huang, Yichi Zhang, Xiaoyu Ji, Jingxi Yu, Wei Chen, Seunghyun Hwang, Qiang Qiu, Amy Reibman, Edward J, Delp,, Fengqing Maggie Zhu |
| How Sampling Strategy Affects Imbalance Mitigation in LiDAR Segmentation: A Study of Structured vs. Random Point-Based Architectures | Antonis Savva, Christos Kyrkou, Theocharis Theocharides |
| From Universal Segmentation to Cell Quantification: A Hierarchical Image Processing Pipeline for Histological Images | Letícia Bianca Oliveira, Gabriel Barbosa da Fonseca, Zenilton Patrocínio, Silvio GUIMARAES |
| LASOD-YOLO: A Lightweight Global Context Modeling for Aerial Small-Object Detection | Zhizhang Wang, Xiangji Huang |
| Batch Perfect: BSS via Structured Local Covariance | Yaorong Xiao, Rogers Silva, Brad Baker, Vince Calhoun, Sergey Plis |
| TOWARD QUALITY ASSESSMENT OF 3D GAUSSIAN SPLATTING CODING | Joao Prazeres, Saeed Mahmoudpour, Stuart Perry, Manuela Pereira, Antonio M. G. Pinheiro |
| TAUSS: TEMPORALLY ALIGNED UNSUPERVISED 3D LIDAR SEMANTIC SEGMENTATION IN DRIVING SCENES | So Minesawa, Hiroshi Ishikawa |
| UNetv2-Lite: Lightweight Residual Attention U-Net for Medical Image Segmentation | Abhin P T, Arun Kumar Sivapuram, Madhu S. Nair, Rama Krishna Sai S Gorthi |
| Invertible Factorization and Prompt Tuning for Long Term Person Re-identification | Lenat Thomas, Nirmala Murali, Madhu S. Nair, Deepak Mishra |
| Structurally Regularized Self-Supervised Graph Learning For Geochemical Mapping From Hyperspectral Images | Ioana Voica, Mihail-Gabriel Botezatu, Daniela-Iulia Calota, Andrei Anghel, Mihai Datcu, Florian Bodescu, Aurora Neagoe, Virgil Alexandru Iordache |
| Reliability-Aware Weighted Multi-Scale Spatio-Temporal Maps for Heart Rate Monitoring | Arpan Bairagi, Rakesh Dey, Siladittya Manna, Umapada Pal |
| Certified-Progressive Secret Image Sharing via XOR and Counting for Fast Lossless Recovery | Meijuan Li, Ziwen Wei, Wang Yidong, Cui Zhe |
| UV-Guided Match Verification for Animal Re-identification | Aleksandr Algasov, Ekaterina Nepovinnykh, Fedor Zolotarev, Tuomas Eerola, Heikki Kälviäinen, Pavel Zemcik, Charles Stewart |
| PhysGasFluid: Physics-Guided Gaseous Fluid Flow Reconstruction | Keyi Wu, Shan Du |
| Multiclass Subtyping of Renal Tumors from Whole-Slide Images Using a Hybrid CNN-Transformer with Optimized Texture Features | Mohamed Azam, Hossam Magdy Balaha, Ahmed Aboudessouki, Asem Ali, Moumen El-Melegy, Muhammad Idrees, Mohammed Ghazal, Ashraf Khalil, Dibson Gondim, Ayman El-Baz |
| MTS-CSNet: Multiscale Tensor Factorization for Deep Compressive Sensing on RGB Images | Mehmet Yamac, Lei Xu, Serkan Kiranyaz, Moncef Gabbouj |
| Density-Adaptive LiDAR Point Cloud Compression | Nuno Martins, Luis Cruz, Fernando Lopes |
| ESCAN: Enhanced Self-Attention-Driven Multi-Level Adaptive Complementary Fusion Network for CT-MRI Imaging | Munish Daroch, Alan Saldanha, Ranjeet Ranjan Jha, Aditya Nigam |
| Light Field Area ReSTIR: Real-Time Depth-of-Field Guided Light Field Rendering | Kamran Akbar, Robert Bregovic |
| CLIP-PET: High-Fidelity Low-Dose PET Reconstruction via CLIP Guided Cascaded Framework | Rihui Xia, Zhuodong Chai, Yongzhou Liu, Liwen Wang, Zhe Jin, Xingbo Dong |
| HyQuant: A Unified Quantization Framework for Hybrid Mamba-Transformer Vision Models | Jui-Chiang Wei, Bo-Yun Shi, An-Yeu Wu |
| PREDICT WITH UNCERTAINTY, DECIDE WITH CONFIDENCE: CONSISTENT DISTRIBUTION LEARNING FOR BONE AGE ESTIMATION | Avinaash A, Bhadresh L, Parth Pandey, Umarani Jayaraman |
| IndoNav: A Benchmark Dataset of Indonesian Pedestrian Scenes for Assistive Navigation of Vision-Impaired People | Dien Rahmawati, Son Lam Phung, Hoang Thanh Le, Yang Di, Ly Bui, Husneni Mukhtar, Abdesselam Bouzerdoum |
| Exploring Easy Boosts For Lidar Semantic Scene Completion | Tetiana Martyniuk, Jonathan Seele, Alexandre Boulch, Gilles Puy, Renaud Marlet, Raoul de Charette |
| Physics-Informed Self-Supervised Despeckling of Sonar Images via Residual Modeling | Swapna Pillai, Siddharth Singh Savner, Sujit Kumar Sahoo |
| SB-BEVFusion: Enhancing the Robustness against Sensor Malfunction and Corruptions | markus essl, Marta Moscati, Mubashir Noman, Muhammad Zaigham Zaheer, Usman Naseem, Shah Nawaz, Markus Schedl |
| TAFA-GSGC: Group-wise Scalable Point Cloud Geometry Compression with Progressive Residual Refinement | Xiumei Li, Alexander Kopte, Andre Kaup |
| SELF-SUPERVISED PERCEPTUALLY INTERPRETABLE MONOCULAR DEPTH ESTIMATION | Zain Ul Abidin, George Dimas, Dimitris Iakovidis |
| Neural Watermarking: Lack of a Secret Key is still Lack of Security | Jan Butora, Hussein Tarhini, Aurélien Noirault, Patrick Bas |
| Patch-Level Cross-Modal Learning for Multimodal Estrogen Receptor Status Classification in Breast Cancer Histopathology | Mohamed Azam, Walid [email protected], Khadiga Ali, Ahmed Aboudessouki, Hossam Magdy Balaha, Moumen El-Melegy, Asem Ali, Mohammed Ghazal, Ashraf Khalil, Dibson Gondim, Ayman El-Baz |
| GEOMETRY MEETS GAUSSIANS IN BEV: UNCERTAINTY-AWARE LATE FUSION FOR MULTI-VIEW PEDESTRIAN DETECTION | Vinicius Avena, Rodrigo S. Couto, Luis Henrique M. K. Costa, Eduardo A. B. da Silva |
| SEGMENTING WOUNDS IN MULTI-MODAL IMAGES USING GENERATIVE ADVERSARIAL NETWORKS | Agata Wijata, Jacek Andrzejewski, Maria Bienkowska, Jakub Nalepa |
| SceneVGGT: VGGT-based online 3D semantic SLAM for indoor scene understanding and navigation | Anna Gelencsér-Horváth, Gergely Dinya, Péter Halász, Dorka Boglárka Erős, Muhammad Muqsit Islam, Kristóf Karacs |
| Towards a Standard for Gaussian Splat scene Coding with V3C/V‑PCC | Patrice Rondao Alface, Lukasz Kondrad, Lauri Ilola, Emre Aksu |
| SplitFed-CL: A Split Federated Co-Learning Framework for Medical Image Segmentation with Inaccurate Labels | Zahra Hafezi, Hadi Hadizadeh, Parvaneh Saeedi |
| PatchCompressor: A Lightweight Region based Video Streaming Framework at the Edge | Shaijal Tripathi, Amitangshu Pal, KOTESWAR RAO JERRIPOTHULA |
| Improving Viewpoint-Invariance and Temporal Consistency for Action Detection | Yannick Porto, Renato Martins, Thomas Chalumeau, Cédric Demonceaux |
| Feature Space Generative Models For One-Shot Class-Incremental Learning | Jack Foster, Kirill Paramonov, Mete Ozay, Umberto Michieli |
| INNER PART DISCOVERY BASED ON PARTIAL LABEL PROPORTIONS | Guillaume PICAUD, Marc Chaumont, Gérard Subsol, Luc TEOT |
| BENCHMARKING ATTRIBUTE DISCRIMINATION IN INFANT-SCALE VISION-LANGUAGE MODELS | James Batsell, Tsutsui Satoshi, Bihan Wen |
| Towards reconstructing experimental sparse-view X-ray CT data with diffusion models | Nelas Jarno Thomsen, Xinyuan Wang, Felix Lucka, Ezgi Demircan-Tureyen |
| FAST PSF SYNTHESIS WITH DEFOCUSED AND SPHERICAL ABERRATION | Nicholas Ganino, Qi Guo |
| PROTEIN GRAPH NEURAL NETWORKS FOR HETEROGENEOUS CRYO-EM RECONSTRUCTION | Jonathan Krook, Axel Janson, Joakim Andén, Melanie Weber, Ozan Öktem |
| On the Possible Detectability of Image-in-Image Steganography | Antoine Mallet, Patrick Bas |
| Moe-driven Modality-invariant Feature Learning for Visible-Infrared Person Re-Identification | Yupeng Chen, Shuli Cheng, Anyu Du, Li Wang, Zirui Jiang, Mingsheng Zheng |
| ZERO-CLICK BRAIN TUMOR SEGMENTATION USING SEGMENT ANYTHING MODEL 2 | Daniel Pasierb, Agata Wijata, Jakub Nalepa |
| ReDyPrompt: Residual-Guided Dynamic Prompts for Robust Anomaly Detection in Fuel Rod Cladding Surface Inspection | Xinwei Lyu, Haiyong Chen, Zhaoyang Wang |
| Parts-Mamba: Augmenting Joint Context with Part-Level Scanning for Occluded Human Skeleton | Tianyi Shen, Huijuan Xu, Nilesh Ahuja, Philip Shin, Vijaykrishnan Narayanan |
| Dithering Defense: Adversarial Robustness of Vision Foundation Models via Multi-Level Floyd–Steinberg Dithering | Yury Belousov, Brian Pulfer, Vitaliy Kinakh, Slava Voloshynovskiy |
| Adapting SAM Without Labels: Uncertainty-Aware Source-Free Medical Image Segmentation | Quang-Khai Bui-Tran, Thanh-Huy Nguyen, Bac LE, Min Xu |
| Diagnosing and Explaining Failures of Perturbation-Based Fidelity Metrics | Revoti Prasad Bora, Philipp Terhörst, Raymond Veldhuis, Raghavendra Ramachandra, Kiran Raja |
| Texture-Aware Vision Transformers for Robust Diagnosis of Dehiscence and Fenestration in 2D CBCT Cross-Sections | Hossam Magdy Balaha, Alaa Mohamed, Rahma Hussein, Reza Farimani, Toru Deguchi, Mohammed Ghazal, Ayman El-Baz |
| MEANINGFUL LEVEL SETS FOR SMALL SPOT DETECTION | Axel Davy |
| Leveraging Error-Tolerance Asymmetry in Electrical Grid Automated Visual Inspection with a Semi-Supervised Annotation Pipeline | Pedro Daniel Rocha, Luis Cruz, Igor Vilela, André Coelho, Fernando Lopes |
| SPECTRAL REFLECTANCE ESTIMATION OF FACIAL SKIN FROM A SINGLE RGB IMAGE VIA EM-BASED PHYSICAL RECONSTRUCTION | Yoshihito Tanaka, Shugo Yamaguchi, Akira Kubota |
| Reduced-complexity Adaptive Loop Filtering via Input-dependent Graph Filters | Wen-Yang Lu, Eduardo Pavez, Antonio Ortega, Roman Chernyak, Shan Liu |
| Topology-Prompted Spatio-Temporal TransUNet: A Geometry-Aware Framework for Consistent Dental Plaque Assessment | Botao Xu, Junhao Gu, Haoyan Cui, Ning Luo, Yu Qiao |
| SRANK: TOWARDS SEMANTIC-AWARE RANKING-BASED EVALUATION FOR CONTINUAL LEARNING OF VISION-LANGUAGE MODELS | Suvam Dey, Debarshi Brahma, Soma Biswas |
| Exploring Rate, Distortion and Cross-Entropy Tradeoffs with Variational Autoencoders | Huy LE, Anissa Mokraoui, Pierre DUHAMEL |
| NON-LEARNING LOW-LIGHT STEREO VISION | Jason Wang, Lucas Nguyen, Hyunseung Eom, Wei Xu, Qi Guo |
| DYNAMIC MODE DECOMPOSITION-BASED FMRI ANALYSIS FOR PARKINSON’S DISEASE DETECTION | Yuji Lin, Yifu Huang, Xuanting Wang, Jiayue Cai, Yuheng Wang, Martin McKeown, Chunqi Chang |
| TRACKNETV5: ROBUST SHUTTLECOCK TRACKING VIA MOTION PROMPTS AND SPATIOTEMPORAL ATTENTIVE FUSION | Run-Lin Chang, Yu-Shuen Wang, Jiun-Long Huang |
| Hierarchical Feedback for No-Reference Video Quality Assessment Using a Spatiotemporal Feature Pyramid | Reshu bansal, Parimala Kancharla |
| PINCurve-S: Physics-Informed Neural Curves with Spatial Attention for Efficient Low-Light Image Enhancement | Anubhav Jain, Nikhil Panwar, Vivek Kumar, Ali Reza Alaei, Parthapratim Roy |
| MULTI-LABEL OBJECT CLASSIFICATION IN POINT CLOUDS USING GRAPH CONVOLUTIONAL NETWORK | Md. Nahid Hasan, Md Sohag Mia, Nafis Sadeq, Muhammad Abdullah Adnan |
| EFFICIENT AND SECURE CONVOLUTIONS ON ENCRYPTED DATA | Susim Roy, Bharat Chandra Yalavarthi, Nalini Ratha |
| RATE-DISTORTION OPTIMIZATION FOR ENSEMBLES OF NON-REFERENCE METRICS | Xin Xiong, Samuel Fernandez, Eduardo Pavez, Antonio Ortega, Neil Birkbeck, Balu Adsumilli |
| MARS-CLIP: Multi-Resolution and Attention Refined Zero-Shot Image Segmentation | Nagito Saito, Shintaro Ito, Koichi Ito, Takafumi Aoki |
| Multi-Site Brain MRI Harmonization via Learned Probability Flows | Saeed Moazami, Neda Jahanshad |
| HOW TO TRAIN YOUR GENERATIVE DIFFUSION MODEL, WHEN ALL TRAINING IMAGES ARE DEGRADED, TO MODEL A DISTRIBUTION ON HIGH-QUALITY IMAGES | Subhankar Nag, Suyash Awate |
| Robust Semantic 3D Mapping from Monocular 360-degree Image Sequences for Intelligent Indoor Street View | Jonoshin Shiino, Naomichi Asaki, Sarthak Pathak, Kazushige Yasutake, Junji Furuno, Kazunori Umeda |
| Perception-based Image Denoising via Generative Compression | Nam Nguyen, Thinh Nguyen, Bella Bose |
| END-TO-END CROSS-MODAL CORRESPONDENCE LEARNING FOR MISALIGNMENT-ROBUST INFRARED-VISIBLE OBJECT DETECTION | Jihun Park, Injae Lee, Joonki Paik |
| A DECOUPLED COARSE-TO-FINE FRAMEWORK WITH SPATIALLY-ADAPTIVE HIGH-PASS FUSION FOR POLYP SEGMENTATION | Po-Yi Ke, Kuan-Hsien Liu, Tsung-Jung Liu |
| CID-LIE: Controlled Illumination Dataset for Low-Light Image Enhancement | Felipe Oliveira, Gabrielly Rodrigues, Jade Santos, Alternei Brito, Joao Cavalcanti, José Luiz de Souza Pio |
| GRAPH-BASED ANALYSIS OF ATTENTIONAL FIDELITY IN BRAIN-TO-IMAGE RECONSTRUCTION | Mohammad Moradi, Morteza Moradi, Marco Grassia, Giuseppe Mangioni |
| GeoScaffold: Implicit Geometric Scaffolding and Directional Distillation for Missing-Modality Multi-Modal MRI Segmentation | Jiayang Xu |
| Machine Learning–Based Control of Local Warped Motion Compensation in the SVT-AV1 Encoder | Khouloud Missaoui, Damak taheni, Hassene Tmar, Mohamed Ali Ben Ayed |
| DFSI: A LiDAR Distance-Field Safety Plug-in with Reliability-Aware Refresh for Diffusion-Based Visual Navigation | Shimin Yu, Shiqi Sun, Yantao Lu, Junjie Zuo, Chenglie Du |
| Pois-DON: Poison Dataset-driven Activation Optimisation-based Novelty Detection | Ahmed Gabr, Mahmoud Rady, Pola Qulta, Youssef Abou Eita, Youssef ElKady, Youssef Fayed, Marwan Torki |
| GPS-denied Drone Navigation via Cross-Domain Keypoint Matching | Jun-Wei Hsieh, Hong-Kai Chen |
| Energy and Compression Efficiency in Large-Scale Video Streaming | MOHAMMAD GHASEMPOUR, Hadi Amirpour, Christian Timmererer |
| The Nonlocal Heat Equation: Bridging PDE Modeling and Physics-Informed Convolutional Neural Networks | Bartomeu Garau, Catalina Sbert, Joan Duran |
| Gaussian Surrogates for Poisson Imaging: Some Theoretical and Empirical Results | Alexandra Spitzer, Lorenzo Baldassari, Valentin Debarnot, Ivan Dokmanić |
| XSA-MAD: Cross-modal Semantic Alignment for Morphing Attack Detection | Jie Jin, Mahiro Tokumasu, Yu Makino, Masakatsu Nishigaki, Tetsushi Ohki |
| Multi-Task Partially Supervised Learning for Super-Resolution and Semantic Segmentation on Earth Observation data | Hoàng-Ân Lê, Minh-Tan Pham, Solange Lemai-Chenevier, Daniel Greslou |
| BLIND X-RAY BRAGG PTYCHOGRAPHY WITH AUTOMATIC DIFFERENTIATION | Tingyou Li, Jizhou Li |
| Few-Shot Anomaly Detection and Localization via Robustly Adaptive Feature Matching | Jiangpeng Zhu, Hanxi Li, Bo Li, Shaodi You, Zehui Xie |
| ASSESSING MEDIA AUTHENTICITY THROUGH WATERMARKING IN THE CONTEXT OF THE JPEG TRUST STANDARD | Deepayan Bhowmik, May Alotaibi, Jessie Smith, Lamyaa Aljuaid, Enes Eray Demirtas, Touradj Ebrahimi, Sabrina Caldwell, Frederik Temmermans |
| Latent graph encoding of multimodal neuroimaging features with generative AI architectures | Ishaan Batta, Meenu Ajith, Vince Calhoun |
| Blind Reconstruction of Low Dose Computed Tomography with Latent Space Non-Local Filtering and Structural Consistency Constraints | Angkon Deb, Celia Shahnaz |
| Anomaly-Aware Vision-Language Adapters for Zero-Shot Anomaly Detection | Muhammad Aqeel, Maham Nazir, Uzair Khan, Marco Cristani, Francesco setti |
| SPATIO-TEMPORAL TENSOR RECONSTRUCTION FOR QUANTA IMAGE SENSORS VIA BINARY TENSOR DECOMPOSITION | Yoshihiro Maeda, Kosuke Kurihara, Takayuki Hamamoto |
| FOURIER SOFT IN 2D (FS2D) REGISTRATION FOR FORWARD-LOOKING SONAR WITH QUASI-PLANAR VALIDITY ANALYSIS | Arturo Gomez Chavez, Tim Hansen, Maria Saleem, Andreas Birk |
| ADAPTIVE ZONE MERGING: GRAPH-BASED ALGORITHM FOR HIERARCHICAL OVER-SEGMENTATION | Petar Kotsev, Joao Mota, Robert Nicol, Alex Serb |
| Efficient Object Detection on JPEG-AI Pre-Reconstruction Latents | Mahyar Gohari, Alessandro Gnutti, Fabrizio Guerrini, Nicola Adami, Riccardo Leonardi |
| Test Model and Optimization for MPEG Lenslet Video Coding | Hao Peng, Lili Zhao, Luyuan Zhu, Xun Tang, Lei Yang, Xin Jin |
| BWCA-Net: Bidirectional Wavelet Cross-Attention Unfolding Network for Image Compressive Sensing Reconstruction | Zhidi Yao, Jinjia Zhou |
| Unsupervised Nighttime Dehazing via Layer Decomposition and Fusion | Tianyi Jiang, Shunli Zhang |
| SDMVSC: A Scalable Plug-in Framework for Deep Multi-View Subspace Clustering on Large-Scale Data | Minghua Tang, Yuxuan Sun, Qingwang Wang |
| Text-Centric Dual-Layer Attention for Multimodal Emotion Analysis with Missing Modalities | Xianxun Zhu, Imad Rida, Erik Cambria, Rui Wang, Hui Chen |
| PRIVACY-PRESERVING FEDERATED ACTION RECOGNITION VIA DIFFERENTIALLY PRIVATE SELECTIVE TUNING AND EFFICIENT COMMUNICATION | Idris Zakariyya, Pai Chet Ng, Kaushik Bhargav, Seyed Mohammad Sheikholeslami, Konstantinos Plataniotis, Fani Deligianni |
| Composite Stability of Graph-Convnets for Label-Efficient Skeleton-based Recognition | Hichem Sahbi |
| Evolution of NVENC Efficiency: A Longitudinal Analysis of HQ and UHQ Tuning Efficiency, Latency and Energy Trade-offs | Kasidis Arunruangsirilert, Jiro Katto |