[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]
Usage instructions: here
Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-19 | Color Enhancement for V-PCC Compressed Point Cloud via 2D Attribute Map Optimization | Jingwei Bao et.al. | 2412.14449 | null |
2024-12-16 | EGP3D: Edge-guided Geometric Preserving 3D Point Cloud Super-resolution for RGB-D camera | Zheng Fang et.al. | 2412.11680 | null |
2024-12-11 | Implicit Neural Compression of Point Clouds | Hongning Ruan et.al. | 2412.10433 | null |
2024-12-07 | Rate-Distortion Optimized Skip Coding of Region Adaptive Hierarchical Transform Coefficients for MPEG G-PCC | Zehan Wang et.al. | 2412.05574 | null |
2024-11-18 | Rendering-Oriented 3D Point Cloud Attribute Compression using Sparse Tensor-based Transformer | Xiao Huo et.al. | 2411.07899 | null |
2024-11-09 | Linear Spherical Sliced Optimal Transport: A Fast Metric for Comparing Spherical Data | Xinran Liu et.al. | 2411.06055 | null |
2024-11-01 | PLATYPUS: Progressive Local Surface Estimator for Arbitrary-Scale Point Cloud Upsampling | Donghyun Kim et.al. | 2411.00432 | null |
2024-10-28 | Quality Analysis of the Coding Bitrate Tradeoff Between Geometry and Attributes for Colored Point Clouds | Joao Prazeres et.al. | 2410.21613 | null |
2024-10-09 | Point Cloud Compression with Bits-back Coding | Nguyen Quang Hieu et.al. | 2410.18115 | null |
2024-10-23 | Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds | Kai Liu et.al. | 2410.17823 | link |
2024-10-22 | Joint Point Cloud Upsampling and Cleaning with Octree-based CNNs | Jihe Li et.al. | 2410.17001 | link |
2024-10-21 | MBPU: A Plug-and-Play State Space Model for Point Cloud Upsamping with Fast Point Rendering | Jiayi Song et.al. | 2410.15941 | null |
2024-10-13 | Towards Reproducible Learning-based Compression | Jiahao Pang et.al. | 2410.09872 | null |
2024-10-06 | Tensor-Train Point Cloud Compression and Efficient Approximate Nearest-Neighbor Search | Georgii Novikov et.al. | 2410.04462 | null |
2024-10-01 | Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection | Pengxi Zeng et.al. | 2410.00582 | null |
2024-09-19 | PVContext: Hybrid Context Model for Point Cloud Compression | Guoqing Zhang et.al. | 2409.12724 | null |
2024-09-12 | The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine | André F. R. Guarda et.al. | 2409.08130 | null |
2024-09-08 | GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling | Huawei Sun et.al. | 2409.02720 | link |
2024-09-03 | GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Zixuan Guo et.al. | 2409.01581 | null |
2024-08-20 | End-to-end learned Lossy Dynamic Point Cloud Attribute Compression | Dat Thanh Nguyen et.al. | 2408.10665 | null |
2024-08-20 | Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds | Kai Liu et.al. | 2408.10543 | null |
2024-08-16 | LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression | Yuqi Ye et.al. | 2408.08682 | null |
2024-08-06 | Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement | Hao Xu et.al. | 2408.02966 | null |
2024-08-01 | Learned Compression of Point Cloud Geometry and Attributes in a Single Model through Multimodal Rate-Control | Michael Rudolph et.al. | 2408.00599 | null |
2024-07-22 | Double Deep Learning-based Event Data Coding and Classification | Abdelrahman Seleem et.al. | 2407.15531 | null |
2024-07-11 | Enhancing octree-based context models for point cloud geometry compression with attention-based child node number prediction | Chang Sun et.al. | 2407.08528 | null |
2024-07-11 | Enhancing context models for point cloud geometry compression with context feature residuals and multi-loss | Chang Sun et.al. | 2407.08520 | null |
2024-07-19 | PCAC-GAN: A Sparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression | Xiaolong Mao et.al. | 2407.05677 | null |
2024-07-05 | Rethinking Data Input for Point Cloud Upsampling | Tongxu Zhang et.al. | 2407.04476 | null |
2024-08-26 | TSC-PCAC: Voxel Transformer and Sparse Convolution Based Point Cloud Attribute Compression for 3D Broadcasting | Zixi Guo et.al. | 2407.04284 | link |
2024-06-15 | Full reference point cloud quality assessment using support vector regression | Ryosuke Watanabe et.al. | 2406.10520 | link |
2024-09-25 | Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering | Yueyu Hu et.al. | 2406.05915 | null |
2024-06-02 | Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor | Lei Liu et.al. | 2406.00791 | null |
2024-05-23 | NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud Interpolation | Chaokang Jiang et.al. | 2405.14241 | link |
2024-05-19 | Point Cloud Compression with Implicit Neural Representations: A Unified Framework | Hongning Ruan et.al. | 2405.11493 | null |
2024-05-02 | PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems | Walter Zimmer et.al. | 2405.01750 | null |
2024-04-21 | Pointsoup: High-Performance and Extremely Low-Decoding-Latency Learned Geometry Codec for Large-Scale Point Cloud Scenes | Kang You et.al. | 2404.13550 | link |
2024-04-16 | Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery | Zohre Karimi et.al. | 2404.07185 | null |
2024-04-10 | Efficient and Generic Point Model for Lossless Point Cloud Attribute Compression | Kang You et.al. | 2404.06936 | link |
2024-04-09 | Diffusion-Based Point Cloud Super-Resolution for mmWave Radar Data | Kai Luan et.al. | 2404.06012 | null |
2024-03-13 | Point Cloud Compression via Constrained Optimal Transport | Zezeng Li et.al. | 2403.08236 | link |
2024-03-08 | Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning | Hang Du et.al. | 2403.05117 | link |
2024-03-01 | Assessing objective quality metrics for JPEG and MPEG point cloud coding | Davi Lazzarotto et.al. | 2403.00410 | null |
2024-02-23 | Scalable Human-Machine Point Cloud Compression | Mateen Ulhaq et.al. | 2402.12532 | link |
2024-02-18 | 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods | Till Beemelmanns et.al. | 2402.11680 | link |
2024-02-17 | Hierarchical Prior-based Super Resolution for Point Cloud Geometry Compression | Dingquan Li et.al. | 2402.11250 | link |
2024-02-11 | PIVOT-Net: Heterogeneous Point-Voxel-Tree-based Framework for Point Cloud Compression | Jiahao Pang et.al. | 2402.07243 | null |
2024-02-07 | Performance analysis of Deep Learning-based Lossy Point Cloud Geometry Compression Coding Solutions | Joao Prazeres et.al. | 2402.05192 | null |
2024-02-08 | Subjective performance evaluation of bitrate allocation strategies for MPEG and JPEG Pleno point cloud compression | Davi Lazzarotto et.al. | 2402.04760 | null |
2024-02-15 | LiDAR-Forest Dataset: LiDAR Point Cloud Simulation Dataset for Forestry Application | Yawen Lu et.al. | 2402.04546 | null |
2023-12-23 | Learning Continuous Implicit Field with Local Distance Indicator for Arbitrary-Scale Point Cloud Upsampling | Shujuan Li et.al. | 2312.15133 | null |
2024-03-13 | DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction | Yanlong Li et.al. | 2312.03298 | link |
2023-12-03 | A Conditional Denoising Diffusion Probabilistic Model for Point Cloud Upsampling | Wentao Qu et.al. | 2312.02719 | link |
2023-11-22 | Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression | Tam Thuc Do et.al. | 2311.13539 | null |
2023-11-22 | Volumetric 3D Point Cloud Attribute Compression: Learned polynomial bilateral filter for prediction | Tam Thuc Do et.al. | 2311.13533 | null |
2023-11-22 | Test-Time Augmentation for 3D Point Cloud Classification and Segmentation | Tuan-Anh Vu et.al. | 2311.13152 | null |
2023-11-03 | PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation | Yuhan Ding et.al. | 2311.01773 | null |
2023-11-02 | Lightweight super resolution network for point cloud geometry compression | Wei Zhang et.al. | 2311.00970 | link |
2023-11-17 | Deep Learning-based Compressed Domain Multimedia for Man and Machine: A Taxonomy and Application to Point Cloud Classification | Abdelrahman Seleem et.al. | 2310.18849 | null |
2023-10-13 | iPUNet:Iterative Cross Field Guided Point Cloud Upsampling | Guangshun Wei et.al. | 2310.09092 | link |
2024-03-15 | PU-Ray: Domain-Independent Point Cloud Upsampling via Ray Marching on Neural Implicit Surface | Sangwon Lim et.al. | 2310.08755 | link |
2024-02-16 | Quasi-Monte Carlo for 3D Sliced Wasserstein | Khai Nguyen et.al. | 2309.11713 | link |
2023-09-08 | Poster: Making Edge-assisted LiDAR Perceptions Robust to Lossy Point Cloud Compression | Jin Heo et.al. | 2309.04549 | null |
2023-09-01 | Test-Time Adaptation for Point Cloud Upsampling Using Meta-Learning | Ahmed Hatem et.al. | 2308.16484 | null |
2024-02-08 | SCP: Spherical-Coordinate-based Learned Point Cloud Compression | Ao Luo et.al. | 2308.12535 | null |
2023-08-22 | Learning a More Continuous Zero Level Set in Unsigned Distance Fields through Level Set Projection | Junsheng Zhou et.al. | 2308.11441 | link |
2023-08-11 | Learned Point Cloud Compression for Classification | Mateen Ulhaq et.al. | 2308.05959 | link |
2023-07-27 | FLiCR: A Fast and Lightweight LiDAR Point Cloud Compression Based on Lossy RI | Jin Heo et.al. | 2307.15005 | null |
2023-07-20 | Aggressive saliency-aware point cloud compression | Eleftheria Psatha et.al. | 2307.10741 | null |
2023-07-18 | Arbitrary point cloud upsampling via Dual Back-Projection Network | Zhi-Song Liu et.al. | 2307.08992 | null |
2023-06-01 | 4DSR-GCN: 4D Video Point Cloud Upsampling using Graph Convolutional Networks | Lorenzo Berlincioni et.al. | 2306.01081 | null |
2023-05-16 | Learning Dynamic Point Cloud Compression via Hierarchical Inter-frame Block Matching | Shuting Xia et.al. | 2305.05356 | null |
2023-05-02 | Geometric Prior Based Deep Human Point Cloud Geometry Compression | Xinju Wu et.al. | 2305.01309 | null |
2023-05-02 | PU-EdgeFormer: Edge Transformer for Dense Prediction in Point Cloud Upsampling | Dohoon Kim et.al. | 2305.01148 | link |
2023-04-24 | Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions | Yun He et.al. | 2304.11846 | link |
2023-04-01 | Volumetric Attribute Compression for 3D Point Clouds using Feedforward Network with Geometric Attention | Tam Thuc Do et.al. | 2304.00335 | null |
2023-03-27 | NeuralPCI: Spatio-temporal Neural Field for 3D Point Cloud Multi-frame Non-linear Interpolation | Zehan Zheng et.al. | 2303.15126 | link |
2023-11-07 | GQE-Net: A Graph-based Quality Enhancement Network for Point Cloud Color Attribute | Jinrui Xing et.al. | 2303.13764 | link |
2023-03-22 | Lossless Point Cloud Attribute Compression Using Cross-scale, Cross-group, and Cross-color Prediction | Jianqiang Wang et.al. | 2303.12917 | null |
2023-12-28 | Progressive Frame Patching for FoV-based Point Cloud Video Streaming | Tongyu Zong et.al. | 2303.08336 | null |
2023-12-03 | Parametric Surface Constrained Upsampler Network for Point Cloud | Pingping Cai et.al. | 2303.08240 | link |
2024-03-20 | Lossless Point Cloud Geometry and Attribute Compression Using a Learned Conditional Probability Model | Dat Thanh Nguyen et.al. | 2303.06519 | link |
2023-03-11 | Deep probabilistic model for lossless scalable point cloud attribute compression | Dat Thanh Nguyen et.al. | 2303.06517 | null |
2023-03-09 | BIRD-PCC: Bi-directional Range Image-based Deep LiDAR Point Cloud Compression | Chia-Sheng Liu et.al. | 2303.04027 | null |
2023-02-13 | gpcgc: a green point cloud geometry coding method | Qingyang Zhou et.al. | 2302.06062 | null |
2023-02-09 | BASICS: Broad quality Assessment of Static point clouds In Compression Scenarios | Ali Ak et.al. | 2302.04796 | null |
2023-04-27 | Linear Optimal Partial Transport Embedding | Yikun Bai et.al. | 2302.03232 | link |
2023-01-31 | Lidar Upsampling with Sliced Wasserstein Distance | Artem Savkin et.al. | 2301.13558 | null |
2023-01-28 | Dynamic Point Cloud Geometry Compression Using Multiscale Inter Conditional Coding | Jianqiang Wang et.al. | 2301.12165 | null |
2023-01-27 | Joint Geometry and Attribute Upsampling of Point Clouds Using Frequency-Selective Models with Overlapped Support | Viktoria Heimann et.al. | 2301.11630 | null |
2023-01-03 | Reduced Reference Quality Assessment for Point Cloud Compression | Yipeng Liu et.al. | 2301.01009 | null |
2023-04-06 | Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program | Tiange Luo et.al. | 2212.12952 | null |
2022-12-11 | Learning Neural Volumetric Field for Point Cloud Geometry Compression | Yueyu Hu et.al. | 2212.05589 | link |
2022-12-01 | Low-Rank Tensor Function Representation for Multi-Dimensional Data Recovery | Yisi Luo et.al. | 2212.00262 | null |
2023-12-09 | ECM-OPCC: Efficient Context Model for Octree-based Point Cloud Compression | Yiqi Jin et.al. | 2211.10916 | null |
2022-11-19 | Rate-Distortion Modeling for Bit Rate Constrained Point Cloud Compression | Pan Gao et.al. | 2211.10646 | null |
2022-10-21 | Motion Policy Networks | Adam Fishman et.al. | 2210.12209 | link |
2022-10-28 | Motion estimation and filtered prediction for dynamic point cloud attribute compression | Haoran Hong et.al. | 2210.08262 | null |
2022-10-08 | Point Cloud Upsampling via Cascaded Refinement Network | Hang Du et.al. | 2210.03942 | link |
2023-02-14 | Multiscale Latent-Guided Entropy Model for LiDAR Point Cloud Compression | Tingyu Fan et.al. | 2209.12512 | null |
2022-09-17 | CARNet:Compression Artifact Reduction for Point Cloud Attribute | Dandan Ding et.al. | 2209.08276 | null |
2022-11-16 | CU-Net: Real-Time High-Fidelity Color Upsampling for Point Clouds | Lingdong Wang et.al. | 2209.06112 | link |
2022-09-09 | GRASP-Net: Geometric Residual Analysis and Synthesis for Point Cloud Compression | Jiahao Pang et.al. | 2209.04401 | link |
2022-09-06 | Learning to Predict on Octree for Scalable Point Cloud Geometry Coding | Yixiang Mao et.al. | 2209.02226 | null |
2022-08-26 | Efficient LiDAR Point Cloud Geometry Compression Through Neighborhood Point Attention | Ruixiang Xue et.al. | 2208.12573 | null |
2022-08-17 | Efficient dynamic point cloud coding using Slice-Wise Segmentation | Faranak Tohidi et.al. | 2208.08061 | null |
2023-01-10 | Arbitrary Point Cloud Upsampling with Spherical Mixture of Gaussians | Anthony Dell'Eva et.al. | 2208.05274 | link |
2022-08-04 | IT/IST/IPLeiria Response to the Call for Proposals on JPEG Pleno Point Cloud Coding | André F. R. Guarda et.al. | 2208.02716 | null |
2022-08-04 | IPDAE: Improved Patch-Based Deep Autoencoder for Lossy Point Cloud Geometry Compression | Kang You et.al. | 2208.02519 | link |
2022-07-25 | Inter-Frame Compression for Dynamic Point Cloud Geometry Coding | Anique Akhtar et.al. | 2207.12554 | null |
2022-07-20 | GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation | Cristiano Saltori et.al. | 2207.09763 | link |
2022-06-25 | BIMS-PU: Bi-Directional and Multi-Scale Point Cloud Upsampling | Yechao Bai et.al. | 2206.12648 | null |
2022-06-24 | Rate-Distortion Optimal Transform Coefficient Selection for Unoccupied Regions in Video-Based Point Cloud Compression | Christian Herglotz et.al. | 2206.12186 | null |
2022-05-24 | A Rate Control Algorithm for Video-based Point Cloud Compression | Fangyu Shen et.al. | 2205.11825 | null |
2022-05-19 | A Comparative Study of Feature Expansion Unit for 3D Point Cloud Upsampling | Qiang Li et.al. | 2205.09594 | null |
2022-05-02 | D-DPCC: Deep Dynamic Point Cloud Compression via 3D Motion Prediction | Tingyu Fan et.al. | 2205.01135 | link |
2022-05-02 | Point Cloud Compression with Sibling Context and Surface Priors | Zhili Chen et.al. | 2205.00760 | link |
2022-04-29 | Deep Geometry Post-Processing for Decompressed Point Clouds | Xiaoqing Fan et.al. | 2204.13952 | link |
2022-04-27 | Density-preserving Deep Point Cloud Compression | Yun He et.al. | 2204.12684 | null |
2022-04-25 | 4DAC: Learning Attribute Compression for Dynamic Point Clouds | Guangchi Fang et.al. | 2204.11723 | null |
2022-04-25 | Dynamic Point Cloud Compression with Cross-Sectional Approach | Faranak Tohidi et.al. | 2204.11409 | null |
2022-04-22 | PU-EVA: An Edge Vector based Approximation Solution for Flexible-scale Point Cloud Upsampling | Luqing Luo et.al. | 2204.10750 | null |
2022-04-18 | Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation | Wenbo Zhao et.al. | 2204.08196 | link |
2022-06-22 | Learning-based Lossless Point Cloud Geometry Coding using Sparse Tensors | Dat Thanh Nguyen et.al. | 2204.05043 | null |
2022-04-03 | Sparse Tensor-based Point Cloud Attribute Compression | Jianqiang Wang et.al. | 2204.01023 | link |
2022-03-22 | IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding Alignment | Yiming Zeng et.al. | 2203.11590 | link |
2022-03-21 | Upsampling Autoencoder for Self-Supervised Point Cloud Learning | Cheng Zhang et.al. | 2203.10768 | null |
2022-05-03 | Frequency-Selective Mesh-to-Mesh Resampling for Color Upsampling of Point Clouds | Viktoria Heimann et.al. | 2203.09224 | null |
2022-03-02 | PUFA-GAN: A Frequency-Aware Generative Adversarial Network for 3D Point Cloud Upsampling | Hao Liu et.al. | 2203.00914 | null |
2022-05-16 | Variable Rate Compression for Raw 3D Point Clouds | Md Ahmed Al Muzaddid et.al. | 2202.13862 | link |
2022-09-14 | Point cloud completion via structured feature maps using a feedback network | Zejia Su et.al. | 2202.08583 | null |
2022-05-08 | OctAttention: Octree-Based Large-Scale Contexts Model for Point Cloud Compression | Chunyang Fu et.al. | 2202.06028 | link |
2022-02-01 | Point Cloud Compression for Efficient Data Broadcasting: A Performance Comparison | Francesco Nardo et.al. | 2202.00719 | null |
2022-02-01 | Fractional Motion Estimation for Point Cloud Compression | Haoran Hong et.al. | 2202.00172 | null |
2022-01-17 | SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations | Zhenyu Li et.al. | 2112.04680 | link |
2022-03-31 | Neural Points: Point Cloud Representation with Neural Fields for Arbitrary Upsampling | Wanquan Feng et.al. | 2112.04148 | link |
2022-03-01 | Attribute Artifacts Removal for Geometry-based Point Cloud Compression | Xihua Sheng et.al. | 2112.00560 | null |
2022-10-03 | PU-Transformer: Point Cloud Upsampling Transformer | Shi Qiu et.al. | 2111.12242 | link |
2022-10-21 | Sparse Tensor-based Multiscale Representation for Point Cloud Geometry Compression | Jianqiang Wang et.al. | 2111.10633 | link |
2021-10-18 | Patch-Based Deep Autoencoder for Point Cloud Geometry Compression | Kang You et.al. | 2110.09109 | link |
2022-07-12 | PC |
Chen Long et.al. | 2109.09337 | link |
2021-09-16 | R-PCC: A Baseline for Range Image-based Point Cloud Compression | Sukai Wang et.al. | 2109.07717 | link |
2021-09-15 | Which One is Better: Assessing Objective Metrics for Point Cloud Compression | Yipeng Liu et.al. | 2109.07158 | null |
2021-08-05 | Joint Geometry and Color Projection-based Point Cloud Quality Metric | Alireza Javaheri et.al. | 2108.02481 | link |
2021-08-03 | SSPU-Net: Self-Supervised Point Cloud Upsampling via Differentiable Rendering | Yifan Zhao et.al. | 2108.00454 | link |
2021-07-29 | Video-based Point Cloud Compression Artifact Removal | Anique Akhtar et.al. | 2107.14179 | null |
2024-02-28 | Score-Based Point Cloud Denoising | Shitong Luo et.al. | 2107.10981 | link |
2022-06-08 | PU-Flow: a Point Cloud Upsampling Network with Normalizing Flows | Aihua Mao et.al. | 2107.05893 | link |
2022-04-18 | "Zero-Shot" Point Cloud Upsampling | Kaiyue Zhou et.al. | 2106.13765 | link |
2021-06-23 | Lossless Point Cloud Attribute Compression with Normal-based Intra Prediction | Qian Yin et.al. | 2106.12236 | null |
2021-06-21 | Cylindrical coordinates for LiDAR point cloud compression | Shashank N. Sridhara et.al. | 2106.11237 | null |
2021-10-11 | Neural Network Modeling of Probabilities for Coding the Octree Representation of Point Clouds | Emre Can Kaya et.al. | 2106.06482 | link |
2021-06-09 | Point Cloud Upsampling via Disentangled Refinement | Ruihui Li et.al. | 2106.04779 | link |
2021-06-02 | DeepCompress: Efficient Point Cloud Geometry Compression | Ryan Killea et.al. | 2106.01504 | null |
2021-06-01 | RAI-Net: Range-Adaptive LiDAR Point Cloud Frame Interpolation Network | Lili Zhao et.al. | 2106.00496 | null |
2021-05-28 | An Unsupervised Optical Flow Estimation For LiDAR Image Sequences | Xuezhou Guo et.al. | 2105.13879 | null |
2021-05-05 | VoxelContext-Net: An Octree based Framework for Point Cloud Compression | Zizheng Que et.al. | 2105.02158 | null |
2021-04-20 | Multiscale deep context modeling for lossless point cloud geometry compression | Dat Thanh Nguyen et.al. | 2104.09859 | link |
2021-04-12 | Towards Efficient Graph Convolutional Networks for Point Cloud Handling | Yawei Li et.al. | 2104.05706 | null |
2021-03-11 | Advanced Geometry Surface Coding for Dynamic Point Cloud Compression | Jian Xiong et.al. | 2103.06549 | null |
2021-03-05 | Hybrid Point Cloud Semantic Compression for Automotive Sensors: A Performance Evaluation | Andrea Varischio et.al. | 2103.03819 | null |
2021-02-26 | Point Cloud Upsampling and Normal Estimation using Deep Learning for Robust Surface Reconstruction | Rajat Sharma et.al. | 2102.13391 | link |
2021-02-25 | A deep perceptual metric for 3D point clouds | Maurice Quach et.al. | 2102.12839 | link |
2021-02-08 | Meta-PU: An Arbitrary-Scale Upsampling Network for Point Cloud | Shuquan Ye et.al. | 2102.04317 | null |
2020-12-15 | NeuralQAAD: An Efficient Differentiable Framework for High Resolution Point Cloud Compression | Nicolas Wagner et.al. | 2012.08143 | null |
2022-06-11 | SPU-Net: Self-Supervised Point Cloud Upsampling by Coarse-to-Fine Reconstruction with Self-Projection Optimization | Xinhai Liu et.al. | 2012.04439 | link |
2021-11-18 | Vehicular Cooperative Perception Through Action Branching and Federated Reinforcement Learning | Mohamed K. Abdel-Aziz et.al. | 2012.03414 | null |
2020-12-05 | ParaNet: Deep Regular Representation for 3D Point Clouds | Qijian Zhang et.al. | 2012.03028 | null |
2020-11-27 | Spherical Interpolated Convolutional Network with Distance-Feature Density for 3D Semantic Segmentation of Point Clouds | Guangming Wang et.al. | 2011.13784 | null |
2020-11-25 | Reduced Reference Perceptual Quality Model and Application to Rate Control for 3D Point Cloud Compression | Qi Liu et.al. | 2011.12688 | null |
2020-11-07 | Multiscale Point Cloud Geometry Compression | Jianqiang Wang et.al. | 2011.03799 | link |
2020-10-29 | Point Cloud Attribute Compression via Successive Subspace Graph Transform | Yueru Chen et.al. | 2010.15302 | null |
2020-08-16 | Real-Time Spatio-Temporal LiDAR Point Cloud Compression | Yu Feng et.al. | 2008.06972 | link |
2021-08-03 | Subjective Quality Database and Objective Study of Compressed Point Clouds With 6DoF Head-Mounted Display | Xinju Wu et.al. | 2008.02501 | null |
2020-06-20 | Pseudo-LiDAR Point Cloud Interpolation Based on 3D Motion Representation and Spatial Supervision | Haojie Liu et.al. | 2006.11481 | null |
2020-06-24 | Improved Deep Point Cloud Geometry Compression | Maurice Quach et.al. | 2006.09043 | link |
2020-04-03 | Intrinsic Point Cloud Interpolation via Dual Latent Space Navigation | Marie-Julie Rakotosaona et.al. | 2004.01661 | link |
2020-03-30 | A generalized Hausdorff distance based quality metric for point cloud geometry | Alireza Javaheri et.al. | 2003.13669 | null |
2020-03-30 | Optimizing Geometry Compression using Quantum Annealing | Sebastian Feld et.al. | 2003.13253 | null |
2020-03-27 | Model-based Joint Bit Allocation between Geometry and Color for Video-based 3D Point Cloud Compression | Qi Liu et.al. | 2002.10798 | null |
2020-03-07 | PUGeo-Net: A Geometry-centric Network for 3D Point Cloud Upsampling | Yue Qian et.al. | 2002.10277 | null |
2020-06-22 | Folding-based compression of point cloud attributes | Maurice Quach et.al. | 2002.04439 | null |
2020-01-13 | Efficient 3D Road Map Data Exchange for Intelligent Vehicles in Vehicular Fog Networks | Ivan Wang-Hei Ho et.al. | 2001.04057 | null |
2020-01-12 | Linear Model based Geometry Coding for Lidar Acquired Point Clouds | Xiang Zhang et.al. | 2001.03871 | null |
2021-04-09 | PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection | Shaoshuai Shi et.al. | 1912.13192 | link |
2019-12-20 | A Comprehensive Study and Comparison of Core Technologies for MPEG 3D Point Cloud Compression | Hao Liu et.al. | 1912.09674 | null |
2020-10-15 | Point Cloud Rendering after Coding: Impacts on Subjective and Objective Quality | Alireza Javaheri et.al. | 1912.09137 | null |
2021-03-29 | PU-GCN: Point Cloud Upsampling using Graph Convolutional Networks | Guocheng Qian et.al. | 1912.03264 | link |
2019-11-04 | Video-based compression for plenoptic point clouds | Li Li et.al. | 1911.01355 | null |
2019-09-26 | Learned Point Cloud Geometry Compression | Jianqiang Wang et.al. | 1909.12037 | link |
2019-09-16 | PLIN: A Network for Pseudo-LiDAR Point Cloud Interpolation | Haojie Liu et.al. | 1909.07137 | null |
2019-08-17 | 3D Point Cloud Super-Resolution via Graph Total Variation on Surface Normals | Chinthaka Dinesh et.al. | 1908.06261 | null |
2019-08-06 | Point Cloud Super Resolution with Adversarial Residual Graph Networks | Huikai Wu et.al. | 1908.02111 | link |
2020-08-10 | Predictive Generalized Graph Fourier Transform for Attribute Compression of Dynamic Point Clouds | Yiqun Xu et.al. | 1908.01970 | null |
2019-07-25 | PU-GAN: a Point Cloud Upsampling Adversarial Network | Ruihui Li et.al. | 1907.10844 | null |
2019-06-27 | A Convolutional Decoder for Point Clouds using Adaptive Instance Normalization | Isaak Lim et.al. | 1906.11478 | null |
2019-04-18 | Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds | Wei Yan et.al. | 1905.03691 | null |
2019-05-22 | Learning Convolutional Transforms for Lossy Point Cloud Geometry Compression | Maurice Quach et.al. | 1903.08548 | link |
2019-09-30 | Variational Graph Methods for Efficient Point Cloud Sparsification | Daniel Tenbrinck et.al. | 1903.02858 | null |
2019-03-05 | Pose Estimation of Vehicles Over Uneven Terrain | Yingchong Ma et.al. | 1903.02052 | null |
2019-02-11 | Occupancy-map-based rate distortion optimization for video-based point cloud compression | Li Li et.al. | 1902.04169 | null |
2018-09-30 | A Volumetric Approach to Point Cloud Compression | Maja Krivokuća et.al. | 1810.00484 | null |
2018-05-29 | Surface Light Field Compression using a Point Cloud Codec | Xiang Zhang et.al. | 1805.11203 | null |
2018-05-23 | Comments on "Compression of 3D Point Clouds Using a Region-Adaptive Hierarchical Transform" | Gustavo Sandri et.al. | 1805.09146 | null |
2018-04-28 | Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction | Yiting Shao et.al. | 1804.10783 | null |
2018-03-26 | PU-Net: Point Cloud Upsampling Network | Lequan Yu et.al. | 1801.06761 | link |
2017-10-10 | Attribute Compression of 3D Point Clouds Using Laplacian Sparsity Optimized Graph Transform | Yiting Shao et.al. | 1710.03532 | null |
2017-03-08 | Dynamic Polygon Clouds: Representation and Compression for VR/AR | Philip A. Chou et.al. | 1610.00402 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-18 | Super-Resolution Generative Adversarial Network for Data Compression of Direct Numerical Simulations | Ludovico Nista et.al. | 2412.14150 | null |
2024-12-18 | Efficient high performance computing with the ALICE Event Processing Nodes GPU-based farm | Federico Ronchetti et.al. | 2412.13755 | null |
2024-12-18 | Robust UAV Jittering and Task Scheduling in Mobile Edge Computing with Data Compression | Bin Li et.al. | 2412.13676 | null |
2024-12-18 | DarkIR: Robust Low-Light Image Restoration | Daniel Feijoo et.al. | 2412.13443 | null |
2024-12-17 | Identifying Bias in Deep Neural Networks Using Image Transforms | Sai Teja Erukude et.al. | 2412.13079 | link |
2024-12-17 | Stable Diffusion is a Natural Cross-Modal Decoder for Layered AI-generated Image Compression | Ruijie Chen et.al. | 2412.12982 | null |
2024-12-17 | Invisible Watermarks: Attacks and Robustness | Dongjun Hwang et.al. | 2412.12511 | link |
2024-12-16 | Representation learning for fast radio burst dynamic spectra | Dirk Kuiper et.al. | 2412.12394 | link |
2024-12-16 | Point Cloud-Assisted Neural Image Compression | Ziqun Li et.al. | 2412.11771 | null |
2024-12-16 | Whisper-GPT: A Hybrid Representation Audio Large Language Model | Prateek Verma et.al. | 2412.11449 | null |
2024-12-16 | Controllable Distortion-Perception Tradeoff Through Latent Diffusion for Neural Image Compression | Chuqin Zhou et.al. | 2412.11379 | null |
2024-12-16 | VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression | Qiang Hu et.al. | 2412.11362 | null |
2024-12-14 | Progressive Compression with Universally Quantized Diffusion Models | Yibo Yang et.al. | 2412.10935 | null |
2024-12-14 | Learned Data Compression: Challenges and Opportunities for the Future | Qiyu Liu et.al. | 2412.10770 | null |
2024-12-11 | Implicit Neural Compression of Point Clouds | Hongning Ruan et.al. | 2412.10433 | null |
2024-12-12 | Video Seal: Open and Efficient Video Watermarking | Pierre Fernandez et.al. | 2412.09492 | link |
2024-12-12 | Learned Compression for Compressed Learning | Dan Jacobellis et.al. | 2412.09405 | link |
2024-12-12 | Versatile Volumetric Medical Image Coding for Human-Machine Vision | Jietao Chen et.al. | 2412.09231 | null |
2024-12-11 | Unicorn: Unified Neural Image Compression with One Number Reconstruction | Qi Zheng et.al. | 2412.08210 | null |
2024-12-09 | Splatter-360: Generalizable 360 |
Zheng Chen et.al. | 2412.06250 | link |
2024-12-08 | Vision Transformer-based Semantic Communications With Importance-Aware Quantization | Joohyuk Park et.al. | 2412.06038 | null |
2024-12-08 | Matrix Pre-orthogonal-Matching Pursuit as a Fundamental AI Algorithm | Wei Qu et.al. | 2412.05878 | null |
2024-12-09 | UniMIC: Towards Universal Multi-modality Perceptual Image Compression | Yixin Gao et.al. | 2412.04912 | null |
2024-12-05 | Solving High-dimensional Inverse Problems Using Amortized Likelihood-free Inference with Noisy and Incomplete Data | Jice Zeng et.al. | 2412.04565 | null |
2024-12-05 | Diagnosing Systematic Effects Using the Inferred Initial Power Spectrum | Tristan Hoellinger et.al. | 2412.04443 | null |
2024-12-05 | Multi-Scale Node Embeddings for Graph Modeling and Generation | Riccardo Milocco et.al. | 2412.04354 | null |
2024-12-05 | Feature Coding in the Era of Large Models: Dataset, Test Conditions, and Benchmark | Changsheng Gao et.al. | 2412.04307 | null |
2024-12-05 | LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model | Yuan Xue et.al. | 2412.03841 | null |
2024-12-04 | Electrocardiogram-based diagnosis of liver diseases: an externally validated and explainable machine learning approach | Juan Miguel Lopez Alcaraz et.al. | 2412.03717 | link |
2024-12-04 | Is JPEG AI going to change image forensics? | Edoardo Daniele Cannas et.al. | 2412.03261 | null |
2024-12-03 | Efficient Algorithms for Low Tubal Rank Tensor Approximation with Applications to Image Compression, Super-Resolution and Deep Learning | Salman Ahmadi-Asl et.al. | 2412.02598 | null |
2024-12-03 | Randomized algorithms for Kroncecker tensor decomposition and applications | Salman Ahmadi-Asl et.al. | 2412.02597 | null |
2024-12-03 | Efficient Model Compression Techniques with FishLeg | Jamie McGowan et.al. | 2412.02328 | null |
2024-12-02 | Efficient Compression of Sparse Accelerator Data Using Implicit Neural Representations and Importance Sampling | Xihaier Luo et.al. | 2412.01754 | link |
2024-12-02 | Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior | Yi Yu et.al. | 2412.01646 | null |
2024-12-01 | Construction of generalized samplets in Banach spaces | Peter Balazs et.al. | 2412.00954 | null |
2024-11-30 | Good, Cheap, and Fast: Overfitted Image Compression with Wasserstein Distortion | Jona Ballé et.al. | 2412.00505 | null |
2024-11-30 | Hybrid Local-Global Context Learning for Neural Video Compression | Yongqi Zhai et.al. | 2412.00446 | null |
2024-11-30 | DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression | Yongqi Zhai et.al. | 2412.00437 | null |
2024-11-29 | AIDetx: a compression-based method for identification of machine-learning generated text | Leonardo Almeida et.al. | 2411.19869 | link |
2024-11-29 | Memristive Nanowire Network for Energy Efficient Audio Classification: Pre-Processing-Free Reservoir Computing with Reduced Latency | Akshaya Rajesh et.al. | 2411.19611 | null |
2024-11-29 | MCUCoder: Adaptive Bitrate Learned Video Compression for IoT Devices | Ali Hojjat et.al. | 2411.19442 | link |
2024-11-28 | Generalized Gaussian Model for Learned Image Compression | Haotian Zhang et.al. | 2411.19320 | null |
2024-11-28 | Upsampling Improvement for Overfitted Neural Coding | Pierrick Philippe et.al. | 2411.19249 | null |
2024-11-27 | Learning Optimal Linear Block Transform by Rate Distortion Minimization | Alessandro Gnutti et.al. | 2411.18494 | null |
2024-11-27 | HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression | Lei Liu et.al. | 2411.18473 | null |
2024-11-26 | Evaluating the Overhead of the Performance Profiler Cloudprofiler With MooBench | Shinhyung Yang et.al. | 2411.17413 | null |
2024-11-26 | Motion Free B-frame Coding for Neural Video Compression | Van Thang Nguyen et.al. | 2411.17160 | null |
2024-11-30 | An Information-Theoretic Regularizer for Lossy Neural Image Compression | Yingwen Zhang et.al. | 2411.16727 | null |
2024-11-25 | WTDUN: Wavelet Tree-Structured Sampling and Deep Unfolding Network for Image Compressed Sensing | Kai Han et.al. | 2411.16336 | null |
2024-11-25 | Learning Optimal Lattice Vector Quantizers for End-to-end Neural Image Compression | Xi Zhang et.al. | 2411.16119 | null |
2024-11-25 | TransCompressor: LLM-Powered Multimodal Data Compression for Smart Transportation | Huanqi Yang et.al. | 2411.16020 | null |
2024-11-24 | Variable-size Symmetry-based Graph Fourier Transforms for image compression | Alessandro Gnutti et.al. | 2411.15824 | null |
2024-11-24 | M3-CVC: Controllable Video Compression with Multimodal Generative Models | Rui Wan et.al. | 2411.15798 | null |
2024-11-24 | Advanced Learning-Based Inter Prediction for Future Video Coding | Yanchen Zhao et.al. | 2411.15759 | null |
2024-11-24 | PEnG: Pose-Enhanced Geo-Localisation | Tavis Shore et.al. | 2411.15742 | null |
2024-11-21 | U-Motion: Learned Point Cloud Video Compression with U-Structured Motion Estimation | Tingyu Fan et.al. | 2411.14501 | null |
2024-11-21 | Differentiable SVD based on Moore-Penrose Pseudoinverse for Inverse Imaging Problems | Yinghao Zhang et.al. | 2411.14141 | link |
2024-11-21 | Compact Visual Data Representation for Green Multimedia -- A Human Visual System Perspective | Peilin Chen et.al. | 2411.14135 | null |
2024-11-27 | Image Compression Using Novel View Synthesis Priors | Luyuan Peng et.al. | 2411.13862 | null |
2024-11-20 | Sparse Input View Synthesis: 3D Representations and Reliable Priors | Nagabhushan Somraj et.al. | 2411.13631 | null |
2024-11-20 | Benchmarking Quantum Convolutional Neural Networks for Classification and Data Compression Tasks | Jun Yong Khoo et.al. | 2411.13468 | null |
2024-11-20 | Practical Compact Deep Compressed Sensing | Bin Chen et.al. | 2411.13081 | link |
2024-11-20 | LMM-driven Semantic Image-Text Coding for Ultra Low-bitrate Learned Image Compression | Shimon Murai et.al. | 2411.13033 | link |
2024-11-22 | Large Language Models for Lossless Image Compression: Next-Pixel Prediction in Language Space is All You Need | Kecheng Chen et.al. | 2411.12448 | null |
2024-11-19 | Breathless: An 8-hour Performance Contrasting Human and Robot Expressiveness | Catie Cuan et.al. | 2411.12361 | null |
2024-11-18 | Variable Rate Neural Compression for Sparse Detector Data | Yi Huang et.al. | 2411.11942 | link |
2024-11-18 | Exploring adversarial robustness of JPEG AI: methodology, comparison and new methods | Egor Kovalev et.al. | 2411.11795 | null |
2024-11-18 | Additional Tests for TV 3.0 | Eduardo Peixoto et.al. | 2411.11755 | null |
2024-11-18 | Towards fast DBSCAN via Spectrum-Preserving Data Compression | Yongyu Wang et.al. | 2411.11421 | null |
2024-11-17 | BVI-CR: A Multi-View Human Dataset for Volumetric Video Compression | Ge Gao et.al. | 2411.11199 | null |
2024-11-16 | An End-to-End Real-World Camera Imaging Pipeline | Kepeng Xu et.al. | 2411.10773 | null |
2024-11-16 | Deep Learning-Based Image Compression for Wireless Communications: Impacts on Reliability,Throughput, and Latency | Mostafa Naseri et.al. | 2411.10650 | null |
2024-11-15 | Efficient Progressive Image Compression with Variance-aware Masking | Alberto Presta et.al. | 2411.10185 | link |
2024-11-15 | A Multi-Scale Spatial-Temporal Network for Wireless Video Transmission | Xinyi Zhou et.al. | 2411.09936 | null |
2024-11-14 | Application of signal separation to diffraction image compression and serial crystallography | Jérôme Kieffer et.al. | 2411.09515 | link |
2024-11-14 | DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines | Junqi Liu et.al. | 2411.09308 | null |
2024-11-14 | Towards efficient compression and communication for prototype-based decentralized learning | Pablo Fernández-Piñeiro et.al. | 2411.09267 | null |
2024-11-13 | Learning Optimal and Interpretable Summary Statistics of Galaxy Catalogs with SBI | Kai Lehman et.al. | 2411.08957 | null |
2024-11-13 | LSH-MoE: Communication-efficient MoE Training via Locality-Sensitive Hashing | Xiaonan Nie et.al. | 2411.08446 | null |
2024-11-18 | Rendering-Oriented 3D Point Cloud Attribute Compression using Sparse Tensor-based Transformer | Xiao Huo et.al. | 2411.07899 | null |
2024-11-11 | Accelerating radio astronomy imaging with RICK | Emanuele De Rubeis et.al. | 2411.07321 | link |
2024-11-11 | Low Complexity Learning-based Lossless Event-based Compression | Ahmadreza Sezavar et.al. | 2411.07155 | null |
2024-11-11 | JPEG AI Image Compression Visual Artifacts: Detection Methods and Dataset | Daria Tsereh et.al. | 2411.06810 | null |
2024-11-11 | Machine vision-aware quality metrics for compressed image and video assessment | Mikhail Dremin et.al. | 2411.06776 | null |
2024-11-11 | High-Frequency Enhanced Hybrid Neural Representation for Video Compression | Li Yu et.al. | 2411.06685 | null |
2024-11-09 | HiHa: Introducing Hierarchical Harmonic Decomposition to Implicit Neural Compression for Atmospheric Data | Zhewen Xu et.al. | 2411.06155 | null |
2024-11-08 | A method based on Generative Adversarial Networks for disentangling physical and chemical properties of stars in astronomical spectra | Raúl Santoveña et.al. | 2411.05960 | null |
2024-11-07 | Don't Look Twice: Faster Video Transformers with Run-Length Tokenization | Rohan Choudhury et.al. | 2411.05222 | null |
2024-11-05 | Tuning into spatial frequency space: Satellite and space debris detection in the ZTF alert stream | J. P. Carvajal et.al. | 2411.03258 | null |
2024-11-15 | ZipCache: A DRAM/SSD Cache with Built-in Transparent Compression | Rui Xie et.al. | 2411.03174 | null |
2024-11-05 | Learning-based Lossless Event Data Compression | Ahmadreza Sezavar et.al. | 2411.03010 | null |
2024-11-04 | Neural optical flow for planar and stereo PIV | Andrew I. Masker et.al. | 2411.02373 | null |
2024-11-04 | The evolution of volumetric video: A survey of smart transcoding and compression approaches | Preetish Kakkar et.al. | 2411.02095 | null |
2024-11-03 | Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision | Xiangzhong Luo et.al. | 2411.01431 | null |
2024-11-02 | Autoencoders for At-Source Data Reduction and Anomaly Detection in High Energy Particle Detectors | Alexander Yue et.al. | 2411.01118 | null |
2024-11-01 | SANN-PSZ: Spatially Adaptive Neural Network for Head-Tracked Personal Sound Zones | Yue Qiao et.al. | 2411.00772 | null |
2024-10-28 | MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compression | Noel Elias et.al. | 2410.21548 | link |
2024-10-29 | Enhancing Learned Image Compression via Cross Window-based Attention | Priyanka Mudgal et.al. | 2410.21144 | null |
2024-10-26 | Cross-Platform Neural Video Coding: A Case Study | Ruhan Conceição et.al. | 2410.20145 | null |
2024-10-25 | Conditional Hallucinations for Image Compression | Till Aczel et.al. | 2410.19493 | null |
2024-10-29 | Integration of Communication and Computational Imaging | Zhenming Yu et.al. | 2410.19415 | null |
2024-10-24 | DMVC: Multi-Camera Video Compression Network aimed at Improving Deep Learning Accuracy | Huan Cui et.al. | 2410.18400 | null |
2024-10-23 | Predicting total time to compress a video corpus using online inference systems | Xin Shu et.al. | 2410.18260 | null |
2024-10-23 | FIPER: Generalizable Factorized Fields for Joint Image Compression and Super-Resolution | Yang-Che Sun et.al. | 2410.18083 | null |
2024-10-23 | Learning Lossless Compression for High Bit-Depth Volumetric Medical Image | Kai Wang et.al. | 2410.17814 | null |
2024-10-21 | Variable Rate Learned Wavelet Video Coding with Temporal Layer Adaptivity | Anna Meyer et.al. | 2410.15873 | link |
2024-10-20 | Extensions on low-complexity DCT approximations for larger blocklengths based on minimal angle similarity | A. P. Radünz et.al. | 2410.15244 | null |
2024-10-19 | Standardizing Generative Face Video Compression using Supplemental Enhancement Information | Bolin Chen et.al. | 2410.15105 | null |
2024-10-16 | MatryoshkaKV: Adaptive KV Compression via Trainable Orthogonal Projection | Bokai Lin et.al. | 2410.14731 | null |
2024-10-18 | Design and Prototype of a Unified Framework for Error-robust Compression and Encryption in IoT | Gajraj Kuldeep et.al. | 2410.14396 | null |
2024-10-18 | Compression using Discrete Multi-Level Divisor Transform for Heterogeneous Sensor Data | Gajraj Kuldeep et.al. | 2410.14287 | null |
2024-10-17 | In-context learning and Occam's razor | Eric Elmoznino et.al. | 2410.14086 | link |
2024-10-17 | Co-Segmentation without any Pixel-level Supervision with Application to Large-Scale Sketch Classification | Nikolaos-Antonios Ypsilantis et.al. | 2410.13582 | null |
2024-10-16 | Test-time adaptation for image compression with distribution regularization | Kecheng Chen et.al. | 2410.12191 | null |
2024-10-16 | Joint Data Compression, Secure Multi-Part Collaborative Task Offloading and Resource Assignment in Ultra-Dense Networks | Tianqing Zhou et.al. | 2410.12186 | null |
2024-10-14 | Large Language Model Evaluation via Matrix Nuclear-Norm | Yahan Li et.al. | 2410.10672 | link |
2024-10-14 | QIANets: Quantum-Integrated Adaptive Networks for Reduced Latency and Improved Inference Times in CNN Models | Zhumazhan Balapanov et.al. | 2410.10318 | link |
2024-10-14 | Generative Human Video Compression with Multi-granularity Temporal Trajectory Factorization | Shanzhi Yin et.al. | 2410.10171 | null |
2024-10-13 | Towards Reproducible Learning-based Compression | Jiahao Pang et.al. | 2410.09872 | null |
2024-10-13 | Compressing Scene Dynamics: A Generative Approach | Shanzhi Yin et.al. | 2410.09768 | link |
2024-10-13 | ECVC: Exploiting Non-Local Correlations in Multiple Frames for Contextual Video Compression | Wei Jiang et.al. | 2410.09706 | link |
2024-10-12 | Fine-grained subjective visual quality assessment for high-fidelity compressed images | Michela Testolina et.al. | 2410.09501 | link |
2024-10-11 | Fast Data-independent KLT Approximations Based on Integer Functions | A. P. Radünz et.al. | 2410.09227 | null |
2024-10-10 | Compressing high-resolution data through latent representation encoding for downscaling large-scale AI weather forecast model | Qian Liu et.al. | 2410.09109 | null |
2024-10-11 | Data-Driven Neural Estimation of Indirect Rate-Distortion Function | Zichao Yu et.al. | 2410.09018 | null |
2024-10-11 | Compressing regularised dynamics improves link prediction in sparse networks | Maja Lindström et.al. | 2410.08777 | link |
2024-10-11 | Beyond GFVC: A Progressive Face Video Compression Framework with Adaptive Visual Tokens | Bolin Chen et.al. | 2410.08485 | link |
2024-10-10 | What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | Aida Mohammadshahi et.al. | 2410.08407 | null |
2024-10-16 | Delta-ICM: Entropy Modeling with Delta Function for Learned Image Compression | Takahiro Shindo et.al. | 2410.07669 | null |
2024-10-10 | MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion | Onkar Susladkar et.al. | 2410.07659 | null |
2024-10-10 | R-Adaptive Mesh Optimization to Enhance Finite Element Basis Compression | Graham Harper et.al. | 2410.07646 | null |
2024-10-09 | JPEG Inspired Deep Learning | Ahmed H. Salamah et.al. | 2410.07081 | link |
2024-10-09 | SHRINK: Data Compression by Semantic Extraction and Residuals Encoding | Guoyou Sun et.al. | 2410.06713 | null |
2024-10-09 | Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization | Prateek Varshney et.al. | 2410.06567 | null |
2024-10-09 | Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching | Wenqi Niu et.al. | 2410.06561 | null |
2024-10-08 | Covering Numbers for Deep ReLU Networks with Applications to Function Approximation and Nonparametric Regression | Weigutian Ou et.al. | 2410.06378 | null |
2024-10-08 | Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach | Sha Guo et.al. | 2410.06149 | null |
2024-10-08 | Resolution limit of the eye: how many pixels can we see? | Maliha Ashraf et.al. | 2410.06068 | null |
2024-10-07 | Transformers learn variable-order Markov chains in-context | Ruida Zhou et.al. | 2410.05493 | null |
2024-10-07 | Salient Store: Enabling Smart Storage for Continuous Learning Edge Servers | Cyan Subhra Mishra et.al. | 2410.05435 | null |
2024-10-07 | Causal Context Adjustment Loss for Learned Image Compression | Minghao Han et.al. | 2410.04847 | link |
2024-10-06 | Channel-Aware Throughput Maximization for Cooperative Data Fusion in CAV | Haonan An et.al. | 2410.04320 | null |
2024-10-05 | Robust Task-Oriented Communication Framework for Real-Time Collaborative Vision Perception | Zhengru Fang et.al. | 2410.04168 | null |
2024-10-04 | On the Rate-Distortion-Complexity Trade-offs of Neural Video Coding | Yi-Hsin Chen et.al. | 2410.03898 | null |
2024-10-04 | A Framework for Automatic Validation and Application of Lossy Data Compression in Ensemble Data Assimilation | Kai Keller et.al. | 2410.03184 | null |
2024-10-03 | GABIC: Graph-based Attention Block for Image Compression | Gabriele Spadaro et.al. | 2410.02981 | link |
2024-10-03 | Diffusion-based Extreme Image Compression with Compressed Feature Initialization | Zhiyuan Li et.al. | 2410.02640 | link |
2024-10-03 | High-Efficiency Neural Video Compression via Hierarchical Predictive Learning | Ming Lu et.al. | 2410.02598 | link |
2024-10-02 | A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation | Liang Chen et.al. | 2410.01912 | link |
2024-10-02 | COSMIC: Compress Satellite Images Efficiently via Diffusion Compensation | Ziyuan Zhang et.al. | 2410.01698 | link |
2024-10-03 | Releasing the Parameter Latency of Neural Representation for High-Efficiency Video Compression | Gai Zhang et.al. | 2410.01654 | null |
2024-10-02 | Task-Oriented Edge-Assisted Cooperative Data Compression, Communications and Computing for UGV-Enhanced Warehouse Logistics | Jiaming Yang et.al. | 2410.01515 | null |
2024-10-01 | STanH : Parametric Quantization for Variable Rate Learned Image Compression | Alberto Presta et.al. | 2410.00557 | null |
2024-09-30 | LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner | Xiaopan Zhang et.al. | 2409.20560 | null |
2024-09-30 | PerCo (SD): Open Perceptual Compression | Nikolai Körber et.al. | 2409.20255 | link |
2024-09-29 | All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation | Xu Zhang et.al. | 2409.19660 | link |
2024-09-28 | Fast Encoding and Decoding for Implicit Video Representation | Hao Chen et.al. | 2409.19429 | null |
2024-09-27 | Learning-Based Image Compression for Machines | Kartik Gupta et.al. | 2409.19184 | link |
2024-09-27 | Effectiveness of learning-based image codecs on fingerprint storage | Daniele Mari et.al. | 2409.18730 | link |
2024-09-27 | Decoding Complexity-Rate-Quality Pareto-Front for Adaptive VVC Streaming | Angeliki Katsenou et.al. | 2409.18713 | null |
2024-09-27 | Neural Video Representation for Redundancy Reduction and Consistency Preservation | Taiga Hayami et.al. | 2409.18497 | null |
2024-09-20 | Blockchain-Enabled Variational Information Bottleneck for Data Extraction Based on Mutual Information in Internet of Vehicles | Cui Zhang et.al. | 2409.17287 | null |
2024-09-25 | Streaming Neural Images | Marcos V. Conde et.al. | 2409.17134 | null |
2024-09-25 | PhD Forum: Efficient Privacy-Preserving Processing via Memory-Centric Computing | Mpoki Mwaisela et.al. | 2409.16777 | null |
2024-09-25 | The Effect of Lossy Compression on 3D Medical Images Segmentation with Deep Learning | Anvar Kurmukov et.al. | 2409.16733 | null |
2024-09-24 | AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Vlad Hosu et.al. | 2409.16271 | null |
2024-09-25 | COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models | Kehui Liu et.al. | 2409.15146 | link |
2024-09-23 | AlphaZip: Neural Network-Enhanced Lossless Text Compression | Swathi Shree Narashiman et.al. | 2409.15046 | link |
2024-09-23 | Anomaly Detection from a Tensor Train Perspective | Alejandro Mata Ali et.al. | 2409.15030 | null |
2024-09-23 | AIM 2024 Challenge on Video Saliency Prediction: Methods and Results | Andrey Moskalenko et.al. | 2409.14827 | link |
2024-09-21 | Window-based Channel Attention for Wavelet-enhanced Learned Image Compression | Heng Xu et.al. | 2409.14090 | null |
2024-09-20 | Reduced bit median quantization: A middle process for Efficient Image Compression | Fikresilase Wondmeneh Abebayew et.al. | 2409.13789 | null |
2024-09-20 | Data Compression using Rank-1 Lattices for Parameter Estimation in Machine Learning | Michael Gnewuch et.al. | 2409.13453 | null |
2024-09-19 | Breaking the Barriers of One-to-One Usage of Implicit Neural Representation in Image Compression: A Linear Combination Approach with Performance Guarantees | Sai Sanjeet et.al. | 2409.13117 | link |
2024-09-19 | Optimal Coding for Randomized Kolmogorov Complexity and Its Applications | Shuichi Hirahara et.al. | 2409.12744 | null |
2024-09-19 | Multi-Scale Feature Prediction with Auxiliary-Info for Neural Image Compression | Chajin Shin et.al. | 2409.12719 | null |
2024-09-18 | One Map to Find Them All: Real-time Open-Vocabulary Mapping for Zero-shot Multi-Object Navigation | Finn Lukas Busch et.al. | 2409.11764 | null |
2024-09-18 | LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution | Shiyu Feng et.al. | 2409.11711 | null |
2024-09-18 | k-mer-based approaches to bridging pangenomics and population genetics | Miles D. Roberts et.al. | 2409.11683 | null |
2024-09-17 | Few-Shot Domain Adaptation for Learned Image Compression | Tianyu Zhang et.al. | 2409.11111 | null |
2024-09-17 | Edge-based Denoising Image Compression | Ryugo Morita et.al. | 2409.10978 | null |
2024-09-16 | Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning | Amin Karimi Monsefi et.al. | 2409.10362 | null |
2024-09-14 | Lossy Image Compression with Stochastic Quantization | Anton Kozyriev et.al. | 2409.09488 | null |
2024-09-13 | Fast DCT+: A Family of Fast Transforms Based on Rank-One Updates of the Path Graph | Samuel Fernández-Menduiña et.al. | 2409.08970 | null |
2024-09-13 | On the Computation of BD-Rate over a Set of Videos for Fair Assessment of Performance of Learned Video Codecs | M. Akin Yilmaz et.al. | 2409.08772 | null |
2024-09-13 | USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s | Zhuoyuan Li et.al. | 2409.08481 | null |
2024-09-12 | Learned Compression for Images and Point Clouds | Mateen Ulhaq et.al. | 2409.08376 | link |
2024-09-11 | NVRC: Neural Video Representation Compression | Ho Man Kwan et.al. | 2409.07414 | null |
2024-09-11 | Dynamic Error-Bounded Hierarchical Matrices in Neural Network Compression | John Mango et.al. | 2409.07028 | null |
2024-09-10 | Universal End-to-End Neural Network for Lossy Image Compression | Bouzid Arezki et.al. | 2409.06586 | null |
2024-09-10 | Rate-Constrained Quantization for Communication-Efficient Federated Learning | Shayan Mohajer Hamidi et.al. | 2409.06319 | null |
2024-09-09 | Design and Implementation of TAO DAQ System | Shuihan Zhang et.al. | 2409.05522 | null |
2024-09-09 | A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression | Nora Hofer et.al. | 2409.05490 | null |
2024-09-09 | Attention Based Machine Learning Methods for Data Reduction with Guaranteed Error Bounds | Xiao Li et.al. | 2409.05357 | null |
2024-09-06 | Convolutional Transformer-Based Image Compression | Bouzid Arezki et.al. | 2409.04118 | null |
2024-09-06 | 3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors | Yujun Huang et.al. | 2409.04013 | link |
2024-09-05 | TropNNC: Structured Neural Network Compression Using Tropical Geometry | Konstantinos Fotopoulos et.al. | 2409.03945 | null |
2024-09-05 | Unified Framework for Neural Network Compression via Decomposition and Optimal Rank Selection | Ali Aghababaei-Harandi et.al. | 2409.03555 | null |
2024-09-05 | Efficient Image Compression Using Advanced State Space Models | Bouzid Arezki et.al. | 2409.02743 | null |
2024-09-10 | FrameCorr: Adaptive, Autoencoder-based Neural Compression for Video Reconstruction in Resource and Timing Constrained Network Settings | John Li et.al. | 2409.02453 | null |
2024-09-03 | Compressed learning based onboard semantic compression for remote sensing platforms | Protim Bhattacharjee et.al. | 2409.01988 | link |
2024-09-03 | Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates | Yixuan Ye et.al. | 2409.01935 | link |
2024-09-03 | Privacy-Preserving Multimedia Mobile Cloud Computing Using Protective Perturbation | Zhongze Tang et.al. | 2409.01710 | null |
2024-09-02 | Multi-Reference Generative Face Video Compression with Contrastive Learning | Goluck Konuko et.al. | 2409.01029 | null |
2024-09-02 | Accelerating block-level rate control for learned image compression | Muchen Dong et.al. | 2409.01009 | null |
2024-09-02 | PNVC: Towards Practical INR-based Video Compression | Ge Gao et.al. | 2409.00953 | null |
2024-09-01 | BWT construction and search at the terabase scale | Heng Li et.al. | 2409.00613 | link |
2024-08-30 | Prioritized Information Bottleneck Theoretic Framework with Distributed Online Learning for Edge Video Analytics | Zhengru Fang et.al. | 2409.00146 | link |
2024-08-28 | Quantum Kernel Principal Components Analysis for Compact Readout of Chemiresistive Sensor Arrays | Zeheng Wang et.al. | 2409.00115 | null |
2024-08-30 | NDP: Next Distribution Prediction as a More Broad Target | Junhao Ruan et.al. | 2408.17377 | null |
2024-08-30 | Approximately Invertible Neural Network for Learned Image Compression | Yanbo Gao et.al. | 2408.17073 | null |
2024-08-29 | UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation | Piotr Rudol et.al. | 2408.16501 | null |
2024-08-29 | Convolutional Neural Network Compression Based on Low-Rank Decomposition | Yaping He et.al. | 2408.16289 | null |
2024-08-27 | Bandwidth-Aware and Overlap-Weighted Compression for Communication-Efficient Federated Learning | Zichen Tang et.al. | 2408.14736 | null |
2024-08-25 | Condensed Sample-Guided Model Inversion for Knowledge Distillation | Kuluhan Binici et.al. | 2408.13850 | null |
2024-08-12 | Semantic Variational Bayes Based on a Semantic Information Theory for Solving Latent Variables | Chenguang Lu et.al. | 2408.13122 | null |
2024-08-22 | Quantization-free Lossy Image Compression Using Integer Matrix Factorization | Pooya Ashtari et.al. | 2408.12691 | link |
2024-08-22 | DeepHQ: Learned Hierarchical Quantizer for Progressive Deep Image Coding | Jooyoung Lee et.al. | 2408.12150 | null |
2024-08-28 | AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results | Maksim Smirnov et.al. | 2408.11982 | link |
2024-08-20 | Trustworthy Compression? Impact of AI-based Codecs on Biometrics for Law Enforcement | Sandra Bergmann et.al. | 2408.10823 | null |
2024-08-20 | Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds | Kai Liu et.al. | 2408.10543 | null |
2024-08-16 | LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression | Yuqi Ye et.al. | 2408.08682 | null |
2024-08-16 | Bi-Directional Deep Contextual Video Compression | Xihua Sheng et.al. | 2408.08604 | null |
2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | null |
2024-08-15 | Algebraic Vertex Ordering of a Sparse Graph for Adjacency Access Locality and Graph Compression | Dimitris Floros et.al. | 2408.08439 | null |
2024-08-15 | When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding | Pingping Zhang et.al. | 2408.08093 | null |
2024-08-15 | DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions | Ryosuke Korekata et.al. | 2408.07910 | null |
2024-08-14 | Towards Real-time Video Compressive Sensing on Mobile Devices | Miao Cao et.al. | 2408.07530 | link |
2024-08-14 | Encoding and Decoding Algorithms of ANS Variants and Evaluation of Their Average Code Lengths | Hirosuke Yamamoto et.al. | 2408.07322 | null |
2024-08-13 | Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality | Yu-Chih Chen et.al. | 2408.07041 | null |
2024-08-13 | Feature-Preserving Rate-Distortion Optimization in Image Coding for Machines | Samuel Fernández Menduiña et.al. | 2408.07028 | null |
2024-08-19 | Joint Source-Channel Optimization for UAV Video Coding and Transmission | Kesong Wu et.al. | 2408.06667 | null |
2024-08-08 | Flow-Lenia.png: Evolving Multi-Scale Complexity by Means of Compression | Tadashi Adachi et.al. | 2408.06374 | null |
2024-08-09 | Benchmarking Conventional and Learned Video Codecs with a Low-Delay Configuration | Siyue Teng et.al. | 2408.05042 | null |
2024-08-08 | SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression | Linhan Cao et.al. | 2408.04273 | null |
2024-08-07 | Bi-Level Spatial and Channel-aware Transformer for Learned Image Compression | Hamidreza Soltani et.al. | 2408.03842 | null |
2024-08-07 | BVI-AOM: A New Training Dataset for Deep Video Compression Optimization | Jakub Nawała et.al. | 2408.03265 | link |
2024-08-06 | Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring | Jeremy J. Williams et.al. | 2408.02869 | null |
2024-08-05 | Dimensionality Reduction and Nearest Neighbors for Improving Out-of-Distribution Detection in Medical Image Segmentation | McKell Woodland et.al. | 2408.02761 | link |
2024-08-04 | CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event Localization | Xiang He et.al. | 2408.01952 | link |
2024-08-03 | Channel-Aware Distributed Transmission Control and Video Streaming in UAV Networks | Masoud Ghazikor et.al. | 2408.01885 | null |
2024-08-02 | An Adaptive Tensor-Train Decomposition Approach for Efficient Deep Neural Network Compression | Shiyi Luo et.al. | 2408.01534 | null |
2024-07-31 | Exploiting Change Blindness for Video Coding: Perspectives from a Less Promising User Study | Mitra Amiri et.al. | 2408.00052 | null |
2024-07-31 | Tora: Trajectory-oriented Diffusion Transformer for Video Generation | Zhenghao Zhang et.al. | 2407.21705 | link |
2024-07-30 | Edge Learning Based Collaborative Automatic Modulation Classification for Hierarchical Cognitive Radio Networks | Peihao Dong et.al. | 2407.20772 | link |
2024-07-30 | Understanding the Impact of Synchronous, Asynchronous, and Hybrid In-Situ Techniques in Computational Fluid Dynamics Applications | Yi Ju et.al. | 2407.20717 | null |
2024-07-29 | Homomorphic data compression for real time photon correlation analysis | Sebastian Strempfer et.al. | 2407.20356 | null |
2024-07-24 | Accelerating the Low-Rank Decomposed Models | Habib Hajimolahoseini et.al. | 2407.20266 | null |
2024-07-29 | ComNeck: Bridging Compressed Image Latents and Multimodal LLMs via Universal Transform-Neck | Chia-Hao Kao et.al. | 2407.19651 | null |
2024-07-28 | NVC-1B: A Large Neural Video Coding Model | Xihua Sheng et.al. | 2407.19402 | null |
2024-07-18 | Generative AI Augmented Induction-based Formal Verification | Aman Kumar et.al. | 2407.18965 | null |
2024-07-25 | The seismic purifier: An unsupervised approach to seismic signal detection via representation learning | Onur Efe et.al. | 2407.18402 | link |
2024-07-25 | Adaptable Deep Joint Source-and-Channel Coding for Small Satellite Applications | Olga Kondrateva et.al. | 2407.18146 | null |
2024-07-25 | Scaling Training Data with Lossy Image Compression | Katherine L. Mentzer et.al. | 2407.17954 | link |
2024-07-25 | Towards the Spectral bias Alleviation by Normalizations in Coordinate Networks | Zhicheng Cai et.al. | 2407.17834 | link |
2024-07-24 | Lossy Data Compression By Adaptive Mesh Coarsening | N. Böing et.al. | 2407.17316 | null |
2024-07-24 | High Efficiency Image Compression for Large Visual-Language Models | Binzhe Li et.al. | 2407.17060 | null |
2024-07-23 | Accelerating Learned Video Compression via Low-Resolution Representation Learning | Zidian Qiu et.al. | 2407.16418 | null |
2024-07-24 | FCNR: Fast Compressive Neural Representation of Visualization Images | Yunfei Lu et.al. | 2407.16369 | link |
2024-07-19 | Shapley Pruning for Neural Network Compression | Kamil Adamczewski et.al. | 2407.15875 | null |
2024-07-18 | CIC: Circular Image Compression | Honggui Li et.al. | 2407.15870 | null |
2024-07-22 | Online String Attractors | Philip Whittington et.al. | 2407.15599 | null |
2024-07-22 | Spectral properties of bright deposits in permanently shadowed craters on Ceres | Stefan Schröder et.al. | 2407.15327 | null |
2024-07-21 | Lessons Learned on the Path to Guaranteeing the Error Bound in Lossy Quantizers | Alex Fallin et.al. | 2407.15037 | null |
2024-07-19 | A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Qi Yang et.al. | 2407.14197 | link |
2024-07-18 | Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law | Giorgio Franceschelli et.al. | 2407.13493 | null |
2024-07-18 | Learned HDR Image Compression for Perceptually Optimal Storage and Display | Peibei Cao et.al. | 2407.13179 | null |
2024-07-17 | High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion | Juan Song et.al. | 2407.12538 | link |
2024-07-17 | Enhancing Film Grain Coding in VVC: Improving Encoding Quality and Efficiency | Vignesh V Menon et.al. | 2407.12465 | null |
2024-07-17 | Reliability Function of Classical-Quantum Channels | Ke Li et.al. | 2407.12403 | null |
2024-07-17 | Exploiting Inter-Image Similarity Prior for Low-Bitrate Remote Sensing Image Compression | Junhui Li et.al. | 2407.12295 | null |
2024-07-16 | Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors | Matt Gorbett et.al. | 2407.12075 | null |
2024-07-17 | Rate-Distortion-Cognition Controllable Versatile Neural Image Compression | Jinming Liu et.al. | 2407.11700 | null |
2024-07-16 | MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models | Hongrong Cheng et.al. | 2407.11681 | null |
2024-07-17 | Neural Compression of Atmospheric States | Piotr Mirowski et.al. | 2407.11666 | null |
2024-07-16 | Rethinking Learned Image Compression: Context is All You Need | Jixiang Luo et.al. | 2407.11590 | null |
2024-07-16 | The impact of lossy data compression on the power spectrum of the high redshift 21-cm signal with LOFAR | J. K. Chege et.al. | 2407.11557 | null |
2024-07-21 | Uniformly Accelerated Motion Model for Inter Prediction | Zhuoyuan Li et.al. | 2407.11541 | null |
2024-07-15 | M18K: A Comprehensive RGB-D Dataset and Benchmark for Mushroom Detection and Instance Segmentation | Abdollah Zakeri et.al. | 2407.11275 | link |
2024-07-15 | Enhancing Electrocardiogram Signal Analysis Using NLP-Inspired Techniques: A Novel Approach with Embedding and Self-Attention | Prapti Ganguly et.al. | 2407.11102 | null |
2024-07-15 | In-Loop Filtering via Trained Look-Up Tables | Zhuoyuan Li et.al. | 2407.10926 | null |
2024-07-15 | Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model | Zhening Liu et.al. | 2407.10632 | link |
2024-07-14 | UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers | Huy Ha et.al. | 2407.10353 | null |
2024-07-13 | WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model | Haisheng Fu et.al. | 2407.09983 | null |
2024-07-13 | Zero-Shot Image Compression with Diffusion-Based Posterior Sampling | Noam Elata et.al. | 2407.09896 | link |
2024-07-13 | Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation | Han Li et.al. | 2407.09853 | link |
2024-07-13 | Infinite families of optimal and minimal codes over rings using simplicial complexes | Yanan Wu et.al. | 2407.09783 | null |
2024-07-12 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-12 | Hybrid Temporal Computing for Lower Power Hardware Accelerators | Maliha Tasnim et.al. | 2407.08975 | null |
2024-07-11 | Manipulating a Tetris-Inspired 3D Video Representation | Mihir Godbole et.al. | 2407.08885 | null |
2024-07-11 | OMR-NET: a two-stage octave multi-scale residual network for screen content image compression | Shiqi Jiang et.al. | 2407.08545 | null |
2024-07-11 | CADC: Encoding User-Item Interactions for Compressing Recommendation Model Training Data | Hossein Entezari Zarch et.al. | 2407.08108 | null |
2024-07-10 | Using Low-Discrepancy Points for Data Compression in Machine Learning: An Experimental Comparison | Simone Göttlich et.al. | 2407.07450 | null |
2024-07-10 | Standard compliant video coding using low complexity, switchable neural wrappers | Yueyu Hu et.al. | 2407.07395 | null |
2024-07-10 | MNeRV: A Multilayer Neural Representation for Videos | Qingling Chang et.al. | 2407.07347 | link |
2024-07-11 | Entropy Law: The Story Behind Data Compression and LLM Performance | Mingjia Yin et.al. | 2407.06645 | link |
2024-07-08 | A Hybrid Algorithm for Computing a Partial Singular Value Decomposition Satisfying a Given Threshold | James Baglama et.al. | 2407.06306 | link |
2024-07-08 | TAPVid-3D: A Benchmark for Tracking Any Point in 3D | Skanda Koppula et.al. | 2407.05921 | link |
2024-07-05 | The Impact of Quantization and Pruning on Deep Reinforcement Learning Models | Heng Lu et.al. | 2407.04803 | null |
2024-07-05 | An autoencoder for compressing angle-resolved photoemission spectroscopy data | Steinn Ymir Agustsson et.al. | 2407.04631 | link |
2024-07-05 | Rethinking Image Compression on the Web with Generative AI | Shayan Ali Hassan et.al. | 2407.04542 | null |
2024-07-11 | A High-Quality Workflow for Multi-Resolution Scientific Data Reduction and Visualization | Daoce Wang et.al. | 2407.04267 | null |
2024-07-04 | Autoencoded Image Compression for Secure and Fast Transmission | Aryan Kashyap Naveen et.al. | 2407.03990 | link |
2024-07-03 | Value-Penalized Auxiliary Control from Examples for Learning without Rewards or Demonstrations | Trevor Ablett et.al. | 2407.03311 | link |
2024-07-03 | KeyVideoLLM: Towards Large-scale Video Keyframe Selection | Hao Liang et.al. | 2407.03104 | null |
2024-07-01 | Statistical Analysis of ZFP: Understanding Bias | Alyson Fox et.al. | 2407.01826 | null |
2024-07-01 | An AI-based, Error-bounded Compression Scheme for High-frequency Power Quality Disturbance Data | Markus Stroot et.al. | 2407.01112 | null |
2024-06-28 | Wavelets Are All You Need for Autoregressive Image Generation | Wael Mattar et.al. | 2406.19997 | null |
2024-06-28 | Optimal Video Compression using Pixel Shift Tracking | Hitesh Saai Mananchery Panneerselvam et.al. | 2406.19630 | link |
2024-06-27 | MCNC: Manifold Constrained Network Compression | Chayne Thrash et.al. | 2406.19301 | null |
2024-06-27 | Staggered Quantizers for Perfect Perceptual Quality: A Connection between Quantizers with Common Randomness and Without | Ruida Zhou et.al. | 2406.19248 | null |
2024-06-25 | Asymptotically Minimax Regret by Bayes Mixtures | Jun'ichi Takeuchi et.al. | 2406.17929 | null |
2024-06-24 | Hierarchical B-frame Video Coding for Long Group of Pictures | Ivan Kirillov et.al. | 2406.16544 | null |
2024-06-20 | Ranking LLMs by compression | Peijia Guo et.al. | 2406.14171 | null |
2024-06-21 | Measuring Sample Importance in Data Pruning for Training LLMs from a Data Compression Perspective | Minsang Kim et.al. | 2406.14124 | null |
2024-06-20 | Prediction and Reference Quality Adaptation for Learned Video Compression | Xihua Sheng et.al. | 2406.14118 | null |
2024-06-19 | Convex-hull Estimation using XPSNR for Versatile Video Coding | Vignesh V Menon et.al. | 2406.13712 | null |
2024-06-19 | A Study on the Effect of Color Spaces in Learned Image Compression | Srivatsa Prativadibhayankaram et.al. | 2406.13709 | null |
2024-06-19 | Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics | Weitong Zhang et.al. | 2406.13652 | null |
2024-06-18 | Learned Image Compression for HE-stained Histopathological Images via Stain Deconvolution | Maximilian Fischer et.al. | 2406.12623 | null |
2024-06-18 | Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines | Honglei Zhang et.al. | 2406.12367 | null |
2024-06-15 | How Should We Extract Discrete Audio Tokens from Self-Supervised Models? | Pooneh Mousavi et.al. | 2406.10735 | null |
2024-06-15 | Object-Attribute-Relation Representation based Video Semantic Communication | Qiyuan Du et.al. | 2406.10469 | null |
2024-06-14 | On Efficient Neural Network Architectures for Image Compression | Yichi Zhang et.al. | 2406.10361 | link |
2024-06-14 | Information Compression in the AI Era: Recent Advances and Future Challenges | Jun Chen et.al. | 2406.10036 | null |
2024-06-13 | CMC-Bench: Towards a New Paradigm of Visual Signal Compression | Chunyi Li et.al. | 2406.09356 | link |
2024-06-13 | Neural NeRF Compression | Tuan Pham et.al. | 2406.08943 | null |
2024-06-14 | Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models | Yi-Fan Zhang et.al. | 2406.08487 | link |
2024-06-12 | On Annotation-free Optimization of Video Coding for Machines | Marc Windsheimer et.al. | 2406.07938 | null |
2024-06-11 | SSNVC: Single Stream Neural Video Compression with Implicit Temporal Information | Feng Wang et.al. | 2406.07645 | null |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548 | link |
2024-06-11 | Optimal Matrix-Mimetic Tensor Algebras via Variable Projection | Elizabeth Newman et.al. | 2406.06942 | link |
2024-06-10 | Deep Generative Modeling Reshapes Compression and Transmission: From Efficiency to Resiliency | Jincheng Dai et.al. | 2406.06446 | null |
2024-06-10 | Image Compression with Isotropic and Anisotropic Shepard Inpainting | Rahul Mohideen Kaja Mohideen et.al. | 2406.06247 | null |
2024-06-10 | Efficient Neural Compression with Inference-time Decoding | C. Metz et.al. | 2406.06237 | null |
2024-06-10 | Fiducial-Cosmology-dependent systematics for the DESI 2024 BAO Analysis | A. Pérez-Fernández et.al. | 2406.06085 | null |
2024-06-10 | Quantum Sparse Coding and Decoding Based on Quantum Network | Xun Ji et.al. | 2406.06012 | null |
2024-06-09 | Region of Interest Loss for Anonymizing Learned Image Compression | Christoph Liebender et.al. | 2406.05726 | link |
2024-06-08 | Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models | Minho Park et.al. | 2406.05432 | link |
2024-06-07 | PatchSVD: A Non-uniform SVD-based Image Compression Algorithm | Zahra Golpayegani et.al. | 2406.05129 | link |
2024-06-07 | SMC++: Masked Learning of Unsupervised Video Semantic Compression | Yuan Tian et.al. | 2406.04765 | link |
2024-06-06 | LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression | Junhui Li et.al. | 2406.03961 | link |
2024-06-05 | Lossless Image Compression Using Multi-level Dictionaries: Binary Images | Samar Agnihotri et.al. | 2406.03087 | null |
2024-06-05 | On Jacob Ziv's Individual-Sequence Approach to Information Theory | Neri Merhav et.al. | 2406.02904 | null |
2024-06-04 | Towards AI-Assisted Sustainable Adaptive Video Streaming Systems: Tutorial and Survey | Reza Farahani et.al. | 2406.02302 | null |
2024-06-03 | Video Coding with Cross-Component Sample Offset | Han Gao et.al. | 2406.01795 | null |
2024-06-05 | Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption | Anqi Li et.al. | 2406.00758 | link |
2024-06-01 | Efficient Massive Black Hole Binary parameter estimation for LISA using Sequential Neural Likelihood | Iván Martín Vílchez et.al. | 2406.00565 | null |
2024-06-01 | A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing | Nurul Rafi et.al. | 2406.00239 | null |
2024-05-31 | ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model | Yufei Wang et.al. | 2405.20721 | link |
2024-05-30 | Quantum encoder for fixed Hamming-weight subspaces | Renato M. S. Farias et.al. | 2405.20408 | null |
2024-05-29 | Implicit Neural Image Field for Biological Microscopy Image Compression | Gaole Dai et.al. | 2405.19012 | link |
2024-05-28 | Deep Network Pruning: A Comparative Study on CNNs in Face Recognition | Fernando Alonso-Fernandez et.al. | 2405.18302 | null |
2024-05-28 | Channel Reciprocity Based Attack Detection for Securing UWB Ranging by Autoencoder | Wenlong Gou et.al. | 2405.18255 | null |
2024-05-27 | Evaluation of Resource-Efficient Crater Detectors on Embedded Systems | Simon Vellas et.al. | 2405.16953 | link |
2024-05-27 | UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation | Runzhao Yang et.al. | 2405.16850 | null |
2024-05-27 | Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model | Shoma Iwai et.al. | 2405.16817 | link |
2024-05-25 | N-BVH: Neural ray queries with bounding volume hierarchies | Philippe Weier et.al. | 2405.16237 | link |
2024-05-25 | A 7K Parameter Model for Underwater Image Enhancement based on Transmission Map Prior | Fuheng Zhou et.al. | 2405.16197 | link |
2024-05-24 | Analytical proxy to families of numerical solutions: the case study of spherical mini-boson stars | Jianzhi Yang et.al. | 2405.15651 | null |
2024-05-24 | SATSense: Multi-Satellite Collaborative Framework for Spectrum Sensing | Haoxuan Yuan et.al. | 2405.15542 | null |
2024-05-24 | Meta-meshing and triangulating lattice structures at a large scale | Qiang Zou et.al. | 2405.15197 | null |
2024-05-23 | NeCGS: Neural Compression for 3D Geometry Sets | Siyu Ren et.al. | 2405.15034 | link |
2024-05-23 | An augmented Lagrangian trust-region method with inexact gradient evaluations to accelerate constrained optimization problems using model hyperreduction | Tianshu Wen et.al. | 2405.14827 | null |
2024-05-23 | Motion-based video compression for resource-constrained camera traps | Malika Nisal Ratnayake et.al. | 2405.14419 | null |
2024-06-01 | I |
Meiqin Liu et.al. | 2405.14336 | link |
2024-05-23 | Sparse |
Matthias Chung et.al. | 2405.14270 | null |
2024-05-22 | "Turing Tests" For An AI Scientist | Xiaoxin Yin et.al. | 2405.13352 | null |
2024-05-21 | Efficient Learned Wavelet Image and Video Coding | Anna Meyer et.al. | 2405.12631 | null |
2024-05-24 | Accelerating Relative Entropy Coding with Space Partitioning | Jiajun He et.al. | 2405.12203 | null |
2024-05-20 | Refining Coded Image in Human Vision Layer Using CNN-Based Post-Processing | Takahiro Shindo et.al. | 2405.11894 | null |
2024-05-19 | Effective In-Context Example Selection through Data Compression | Zhongxiang Sun et.al. | 2405.11465 | null |
2024-05-18 | InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images | Wuzhou Li et.al. | 2405.11293 | link |
2024-05-17 | Dark Energy Survey Year 3 results: simulation-based cosmological inference with wavelet harmonics, scattering transforms, and moments of weak lensing mass maps II. Cosmological results | M. Gatti et.al. | 2405.10881 | null |
2024-05-17 | Reduced storage direct tensor ring decomposition for convolutional neural networks compression | Mateusz Gabor et.al. | 2405.10802 | link |
2024-05-17 | Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network | Junhui Li et.al. | 2405.10518 | null |
2024-05-15 | Properties that allow or prohibit transferability of adversarial attacks among quantized networks | Abhishek Shrestha et.al. | 2405.09598 | link |
2024-05-15 | Sensitivity Decouple Learning for Image Compression Artifacts Reduction | Li Ma et.al. | 2405.09291 | null |
2024-05-18 | Scalable Image Coding for Humans and Machines Using Feature Fusion Network | Takahiro Shindo et.al. | 2405.09152 | link |
2024-05-14 | Parameter-Efficient Instance-Adaptive Neural Video Compression | Hyunmo Yang et.al. | 2405.08530 | link |
2024-05-13 | Goal-oriented compression for |
Yifei Sun et.al. | 2405.07808 | null |
2024-05-13 | Neural Network Compression for Reinforcement Learning Tasks | Dmitry A. Ivanov et.al. | 2405.07748 | null |
2024-05-13 | On the Adversarial Robustness of Learning-based Image Compression Against Rate-Distortion Attacks | Chenhao Wu et.al. | 2405.07717 | null |
2024-05-21 | An Efficient Compression Method for Sign Information of DCT Coefficients via Sign Retrieval | Chihiro Tsutake et.al. | 2405.07487 | link |
2024-05-10 | Time-of-arrival Estimation and Phase Unwrapping of Head-related Transfer Functions With Integer Linear Programming | Chin-Yun Yu et.al. | 2405.06804 | link |
2024-05-08 | Urban Boundary Delineation from Commuting Data with Bayesian Stochastic Blockmodeling: Scale, Contiguity, and Hierarchy | Sebastian Morel-Balbi et.al. | 2405.04911 | link |
2024-05-14 | Some Notes on the Sample Complexity of Approximate Channel Simulation | Gergely Flamich et.al. | 2405.04363 | null |
2024-05-07 | Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression | Zhenghao Chen et.al. | 2405.04274 | null |
2024-05-08 | Verified Neural Compressed Sensing | Rudy Bunel et.al. | 2405.04260 | null |
2024-05-15 | Lossy Compression with Data, Perception, and Classification Constraints | Yuhan Wang et.al. | 2405.04144 | null |
2024-05-07 | DMOFC: Discrimination Metric-Optimized Feature Compression | Changsheng Gao et.al. | 2405.04044 | null |
2024-05-06 | Computational ghost imaging with hybrid transforms by integrating Hadamard, discrete cosine, and Haar matrices | Yi-Ning Zhao et.al. | 2405.03729 | null |
2024-05-06 | A Rate-Distortion-Classification Approach for Lossy Image Compression | Yuefeng Zhang et.al. | 2405.03500 | null |
2024-05-06 | Structure-Preserving Network Compression Via Low-Rank Induced Training Through Linear Layers Composition | Xitong Zhang et.al. | 2405.03089 | link |
2024-05-04 | Deep Pulse-Signal Magnification for remote Heart Rate Estimation in Compressed Videos | Joaquim Comas et.al. | 2405.02652 | null |
2024-05-06 | Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design | Jian Meng et.al. | 2405.01775 | link |
2024-05-02 | PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems | Walter Zimmer et.al. | 2405.01750 | null |
2024-04-28 | Lightweight Conceptual Dictionary Learning for Text Classification Using Information Compression | Li Wan et.al. | 2405.01584 | null |
2024-05-02 | GroupedMixer: An Entropy Model with Group-wise Token-Mixers for Learned Image Compression | Daxin Li et.al. | 2405.01170 | null |
2024-04-30 | Analysis and Enhancement of Lossless Image Compression in JPEG-XL | Rustam Mamedov et.al. | 2404.19755 | null |
2024-04-30 | EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization | Jianzong Wang et.al. | 2404.19214 | null |
2024-04-29 | Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior | Zhiyuan Li et.al. | 2404.18820 | link |
2024-04-28 | Joint Reference Frame Synthesis and Post Filter Enhancement for Versatile Video Coding | Weijie Bao et.al. | 2404.18058 | null |
2024-04-25 | Learning Visuotactile Skills with Two Multifingered Hands | Toru Lin et.al. | 2404.16823 | link |
2024-04-24 | Domain Adaptation for Learned Image Compression with Supervised Adapters | Alberto Presta et.al. | 2404.15591 | link |
2024-04-23 | One-Pass Randomized Algorithm with Practical Rangefinder for Low-Rank Approximation to Quaternion Matrices | Chao Chang et.al. | 2404.14783 | link |
2024-04-22 | Neural Compress-and-Forward for the Relay Channel | Ezgi Ozyilkan et.al. | 2404.14594 | null |
2024-04-22 | Taming Server Memory TCO with Multiple Software-Defined Compressed Tiers | Sandeep Kumar et.al. | 2404.13886 | null |
2024-04-20 | HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression | Lei Lu et.al. | 2404.13372 | null |
2024-04-18 | Image Compression and Reconstruction Based on Quantum Network | Xun Ji et.al. | 2404.11994 | null |
2024-04-17 | Spatio-Temporal Motion Retargeting for Quadruped Robots | Taerim Yoon et.al. | 2404.11557 | null |
2024-04-17 | Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems | Luca Bompani et.al. | 2404.11488 | link |
2024-04-17 | Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks | Eri Hosonuma et.al. | 2404.11280 | null |
2024-04-16 | Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning | Kyle Hsu et.al. | 2404.10282 | link |
2024-04-16 | Compressible and Searchable: AI-native Multi-Modal Retrieval System with Learned Image Compression | Jixiang Luo et.al. | 2404.10234 | null |
2024-04-15 | One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing | Yueyu Hu et.al. | 2404.09979 | null |
2024-04-15 | Quantization of Large Language Models with an Overdetermined Basis | Daniil Merkulov et.al. | 2404.09737 | null |
2024-04-18 | Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition | Tobias Weber et.al. | 2404.09683 | link |
2024-04-15 | MarsQE: Semantic-Informed Quality Enhancement for Compressed Martian Image | Chengfeng Liu et.al. | 2404.09433 | null |
2024-04-17 | Incremental data compression for PDE-constrained optimization with a data assimilation application | Xuejian Li et.al. | 2404.09323 | null |
2024-04-14 | A Joint Data Compression and Time-Delay Estimation Method For Distributed Systems via Extremum Encoding | Amir Weiss et.al. | 2404.09244 | null |
2024-04-12 | Lossy Image Compression with Foundation Diffusion Models | Lucas Relic et.al. | 2404.08580 | null |
2024-04-12 | Mitigating Challenges of the Space Environment for Onboard Artificial Intelligence: Design Overview of the Imaging Payload on SpIRIT | Miguel Ortiz del Castillo et.al. | 2404.08399 | null |
2024-04-11 | Video Compression Beyond VVC: Quantitative Analysis of Intra Coding Tools in Enhanced Compression Model (ECM) | Mohsen Abdoli et.al. | 2404.07872 | null |
2024-04-11 | Learning to Classify New Foods Incrementally Via Compressed Exemplars | Justin Yang et.al. | 2404.07507 | null |
2024-04-14 | A comparison between Shapefit compression and Full-Modelling method with PyBird for DESI 2024 and beyond | Y. Lai et.al. | 2404.07283 | link |
2024-04-10 | Exploring Repetitiveness Measures for Two-Dimensional Strings | Giuseppe Romana et.al. | 2404.07030 | null |
2024-04-10 | Fine color guidance in diffusion models and its application to image compression at extremely low bitrates | Tom Bordin et.al. | 2404.06865 | null |
2024-04-09 | Encoder-Quantization-Motion-based Video Quality Metrics | Yixu Chen et.al. | 2404.06620 | null |
2024-04-09 | DiffHarmony: Latent Diffusion Model Meets Image Harmonization | Pengfei Zhou et.al. | 2404.06139 | link |
2024-04-09 | Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive Survey | Feng Liang et.al. | 2404.06114 | null |
2024-04-09 | Image and Video Compression using Generative Sparse Representation with Fidelity Controls | Wei Jiang et.al. | 2404.06076 | null |
2024-04-07 | Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder | Yiyang Ma et.al. | 2404.04916 | null |
2024-04-07 | Task-Aware Encoder Control for Deep Video Compression | Xingtong Ge et.al. | 2404.04848 | null |
2024-04-06 | Power-Efficient Image Storage: Leveraging Super Resolution Generative Adversarial Network for Sustainable Compression and Reduced Carbon Footprint | Ashok Mondal et.al. | 2404.04642 | null |
2024-04-05 | ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing | Alec Helbling et.al. | 2404.04376 | link |
2024-04-03 | Convolutional variational autoencoders for secure lossy image compression in remote sensing | Alessandro Giuliano et.al. | 2404.03696 | null |
2024-03-25 | RL for Consistency Models: Faster Reward Guided Text-to-Image Generation | Owen Oertell et.al. | 2404.03673 | link |
2024-04-04 | Training LLMs over Neurally Compressed Text | Brian Lester et.al. | 2404.03626 | null |
2024-04-04 | Leveraging Interpolation Models and Error Bounds for Verifiable Scientific Machine Learning | Tyler Chang et.al. | 2404.03586 | link |
2024-04-04 | Semantic Compression with Information Lattice Learning | Haizi Yu et.al. | 2404.03131 | null |
2024-04-01 | Accounting for contact network uncertainty in epidemic inferences with Approximate Bayesian Computation | Maxwell H. Wang et.al. | 2404.02924 | null |
2024-04-03 | Building test batteries based on analysing random number generator tests within the framework of algorithmic information theory | Boris Ryabko et.al. | 2404.02708 | null |
2024-04-03 | Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression | I. Dror et.al. | 2404.02481 | null |
2024-04-03 | MOPAR: A Model Partitioning Framework for Deep Learning Inference Services on Serverless Platforms | Jiaang Duan et.al. | 2404.02445 | null |
2024-04-02 | NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Sicheng Li et.al. | 2404.02185 | null |
2024-04-01 | The Rate-Distortion-Perception Trade-off: The Role of Private Randomness | Yassine Hamdi et.al. | 2404.01111 | null |
2024-03-31 | Metric dimensions of generalized Sierpiński graphs over squares | Savari Prabhu et.al. | 2404.00771 | null |
2024-03-27 | Computationally and Memory-Efficient Robust Predictive Analytics Using Big Data | Daniel Menges et.al. | 2403.19721 | null |
2024-03-28 | RootInteractive tool for multidimensional statistical analysis, machine learning and analytical model validation | Marian Invanov et.al. | 2403.19330 | null |
2024-03-28 | Uncertainty-Aware Deep Video Compression with Ensembles | Wufei Ma et.al. | 2403.19158 | null |
2024-04-08 | Neural Embedding Compression For Efficient Multi-Task Earth Observation Modelling | Carlos Gomes et.al. | 2403.17886 | link |
2024-03-26 | Low-Latency Neural Stereo Streaming | Qiqi Hou et.al. | 2403.17879 | null |
2024-03-26 | Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs | Kai Yuan et.al. | 2403.17607 | link |
2024-03-25 | Neural Image Compression with Quantization Rectifier | Wei Luo et.al. | 2403.17236 | null |
2024-03-25 | Invertible Diffusion Models for Compressed Sensing | Bin Chen et.al. | 2403.17006 | null |
2024-03-25 | Virtual Cylindrical PET for Efficient DOI Image Reconstruction with Sub-millimetre Resolution | Francisco E Enríquez-Mier-y-Terán et.al. | 2403.16465 | null |
2024-03-25 | Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks | Madhumitha Sakthi et.al. | 2403.16338 | null |
2024-03-24 | Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis | Atefeh Khoshkhahtinat et.al. | 2403.16258 | null |
2024-03-23 | Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets | Robert Underwood et.al. | 2403.15953 | null |
2024-03-23 | Droplet shape representation using Fourier series and autoencoders | Mihir Durve et.al. | 2403.15797 | null |
2024-03-21 | S2LIC: Learned Image Compression with the SwinV2 Block, Adaptive Channel-wise and Global-inter Attention Context | Yongqiang Wang et.al. | 2403.14471 | link |
2024-03-21 | Tensor network compressibility of convolutional models | Sukhbinder Singh et.al. | 2403.14379 | null |
2024-03-26 | Powerful Lossy Compression for Noisy Images | Shilv Cai et.al. | 2403.14135 | null |
2024-03-20 | String attractors and bi-infinite words | Pierre Béaur et.al. | 2403.13449 | null |
2024-03-19 | Super-High-Fidelity Image Compression via Hierarchical-ROI and Adaptive Quantization | Jixiang Luo et.al. | 2403.13030 | null |
2024-03-19 | Privacy-Preserving Face Recognition Using Trainable Feature Subtraction | Yuxi Mi et.al. | 2403.12457 | link |
2024-03-19 | VQ-NeRV: A Vector Quantized Neural Representation for Videos | Yunjie Xu et.al. | 2403.12401 | link |
2024-03-18 | Encoding of linear kinetic plasma problems in quantum circuits via data compression | Ivan Novikau et.al. | 2403.11989 | null |
2024-03-18 | Object Segmentation-Assisted Inter Prediction for Versatile Video Coding | Zhuoyuan Li et.al. | 2403.11694 | null |
2024-03-18 | Overfitted image coding at reduced complexity | Théophile Blard et.al. | 2403.11651 | link |
2024-03-18 | Hierarchical Frequency-based Upsampling and Refining for Compressed Video Quality Enhancement | Qianyu Zhang et.al. | 2403.11556 | null |
2024-03-18 | Earth+: on-board satellite imagery compression leveraging historical earth observations | Kuntai Du et.al. | 2403.11434 | null |
2024-03-17 | Fidelity-preserving Learning-Based Image Compression: Loss Function and Subjective Evaluation Methodology | Shima Mohammadi et.al. | 2403.11241 | link |
2024-03-16 | Channel-wise Feature Decorrelation for Enhanced Learned Image Compression | Farhad Pakdaman et.al. | 2403.10936 | null |
2024-03-16 | NARRATE: Versatile Language Architecture for Optimal Control in Robotics | Seif Ismail et.al. | 2403.10762 | link |
2024-03-15 | Process-and-Forward: Deep Joint Source-Channel Coding Over Cooperative Relay Networks | Chenghong Bian et.al. | 2403.10613 | null |
2024-03-15 | CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement | Qiang Zhu et.al. | 2403.10362 | link |
2024-03-15 | Interactive Distance Field Mapping and Planning to Enable Human-Robot Collaboration | Usama Ali et.al. | 2403.09988 | link |
2024-03-14 | SketchINR: A First Look into Sketches as Implicit Neural Representations | Hmrishav Bandyopadhyay et.al. | 2403.09344 | link |
2024-03-14 | Noise Dimension of GAN: An Image Compression Perspective | Ziran Zhu et.al. | 2403.09196 | null |
2024-03-20 | Content-aware Masked Image Modeling Transformer for Stereo Image Compression | Xinjie Zhang et.al. | 2403.08505 | null |
2024-03-12 | Approaching Rate-Distortion Limits in Neural Compression with Lattice Transform Coding | Eric Lei et.al. | 2403.07320 | null |
2024-03-11 | Grid Monitoring and Protection with Continuous Point-on-Wave Measurements and Generative AI | Lang Tong et.al. | 2403.06942 | null |
2024-03-16 | Enhancing Adversarial Training with Prior Knowledge Distillation for Robust Image Compression | Zhi Cao et.al. | 2403.06700 | null |
2024-03-13 | FSViewFusion: Few-Shots View Generation of Novel Objects | Rukhshanda Hussain et.al. | 2403.06394 | null |
2024-03-10 | Probing Image Compression For Class-Incremental Learning | Justin Yang et.al. | 2403.06288 | null |
2024-03-10 | Blockchain-Enabled Variational Information Bottleneck for IoT Networks | Qiong Wu et.al. | 2403.06129 | link |
2024-03-09 | Wavelet-Like Transform-Based Technology in Response to the Call for Proposals on Neural Network-Based Image Coding | Cunhui Dong et.al. | 2403.05937 | null |
2024-03-07 | Complexity-constrained quantum thermodynamics | Anthony Munson et.al. | 2403.04828 | null |
2024-03-07 | Image Coding for Machines with Edge Information Learning Using Segment Anything | Takahiro Shindo et.al. | 2403.04173 | link |
2024-03-06 | 3D Diffusion Policy | Yanjie Ze et.al. | 2403.03954 | link |
2024-03-06 | Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer | Naifu Xue et.al. | 2403.03736 | null |
2024-03-06 | ZF Beamforming Tensor Compression for Massive MIMO Fronthaul | Libin Zheng et.al. | 2403.03675 | null |
2024-03-06 | Space Complexity of Euclidean Clustering | Xiaoyi Zhu et.al. | 2403.02971 | null |
2024-03-05 | Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity | Hagyeong Lee et.al. | 2403.02944 | link |
2024-03-05 | Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders | Daniele Mari et.al. | 2403.02887 | null |
2024-03-04 | Dark Energy Survey Year 3 results: likelihood-free, simulation-based |
N. Jeffrey et.al. | 2403.02314 | null |
2024-03-04 | Neural Network Assisted Lifting Steps For Improved Fully Scalable Lossy Image Compression in JPEG 2000 | Xinyue Li et.al. | 2403.01647 | link |
2024-03-03 | On the Compressibility of Quantized Large Language Models | Yu Mao et.al. | 2403.01384 | null |
2024-03-02 | Towards Accurate Lip-to-Speech Synthesis in-the-Wild | Sindhu Hegde et.al. | 2403.01087 | null |
2024-03-01 | Region-Adaptive Transform with Segmentation Prior for Image Compression | Yuxi Liu et.al. | 2403.00628 | link |
2024-03-07 | ODVista: An Omnidirectional Video Dataset for super-resolution and Quality Enhancement Tasks | Ahmed Telili et.al. | 2403.00604 | link |
2024-02-29 | Towards Explaining Deep Neural Network Compression Through a Probabilistic Latent Space | Mahsa Mozafari-Nia et.al. | 2403.00155 | null |
2024-02-29 | Deep Network for Image Compressed Sensing Coding Using Local Structural Sampling | Wenxue Cui et.al. | 2402.19111 | null |
2024-02-29 | Variable-Rate Learned Image Compression with Multi-Objective Optimization and Quantization-Reconstruction Offsets | Fatih Kamisli et.al. | 2402.18930 | link |
2024-02-29 | Towards Backward-Compatible Continual Learning of Image Compression | Zhihao Duan et.al. | 2402.18862 | link |
2024-02-29 | Exploration of Learned Lifting-Based Transform Structures for Fully Scalable and Accessible Wavelet-Like Image Compression | Xinyue Li et.al. | 2402.18761 | null |
2024-01-10 | Motion Guided Token Compression for Efficient Masked Video Modeling | Yukun Feng et.al. | 2402.18577 | null |
2024-02-28 | Tokenization Is More Than Compression | Craig W. Schmidt et.al. | 2402.18376 | link |
2024-02-28 | NERV++: An Enhanced Implicit Neural Video Representation | Ahmed Ghorbel et.al. | 2402.18305 | null |
2024-02-28 | Computing Minimal Absent Words and Extended Bispecial Factors with CDAWG Space | Shunsuke Inenaga et.al. | 2402.18090 | null |
2024-03-03 | Towards Optimal Learning of Language Models | Yuxian Gu et.al. | 2402.17759 | null |
2024-02-27 | Gaoyuan Wang et.al. | 2402.17749 | null | |
2024-02-27 | Bit Rate Matching Algorithm Optimization in JPEG-AI Verification Model | Panqi Jia et.al. | 2402.17487 | null |
2024-02-27 | Bit Distribution Study and Implementation of Spatial Quality Map in the JPEG-AI Standardization | Panqi Jia et.al. | 2402.17470 | null |
2024-02-29 | Neural Video Compression with Feature Modulation | Jiahao Li et.al. | 2402.17414 | link |
2024-01-19 | MB-RACS: Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network | Yujun Huang et.al. | 2402.16855 | null |
2024-02-29 | MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model | Chunyi Li et.al. | 2402.16749 | link |
2024-02-26 | Enabling robust sensor network design with data processing and optimization making use of local beehive image and video files | Ephrance Eunice Namugenyi et.al. | 2402.16655 | null |
2024-02-26 | Resolution-Agnostic Neural Compression for High-Fidelity Portrait Video Conferencing via Implicit Radiance Fields | Yifei Li et.al. | 2402.16599 | null |
2024-02-26 | Distortion-Controlled Dithering with Reduced Recompression Rate | Morriel Kasher et.al. | 2402.16447 | null |
2024-02-26 | Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction | Wen-Yang Lu et.al. | 2402.16371 | null |
2024-02-26 | SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field | Zetian Song et.al. | 2402.16366 | null |
2024-02-24 | Traditional Transformation Theory Guided Model for Learned Image Compression | Zhiyuan Li et.al. | 2402.15744 | null |
2024-02-22 | Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving | Eugen Šlapak et.al. | 2402.14642 | null |
2024-02-21 | Exploring the Limits of Semantic Image Compression at Micro-bits per Pixel | Jordan Dotzel et.al. | 2402.13536 | null |
2024-02-20 | Compressing the two-particle Green's function using wavelets: Theory and application to the Hubbard atom | Emin Moghadas et.al. | 2402.13030 | null |
2024-02-20 | RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models | Xinchen Zhang et.al. | 2402.12908 | link |
2024-02-20 | Transformer-based Learned Image Compression for Joint Decoding and Denoising | Yi-Hsin Chen et.al. | 2402.12888 | null |
2024-02-19 | Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI Pooling | Philip Müller et.al. | 2402.11985 | link |
2024-02-18 | 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods | Till Beemelmanns et.al. | 2402.11680 | link |
2024-02-18 | Learning to Learn Faster from Human Feedback with Language Model Predictive Control | Jacky Liang et.al. | 2402.11450 | null |
2024-02-17 | TinyLIC-High efficiency lossy image compression method | Gaocheng Ma et.al. | 2402.11164 | null |
2024-02-15 | Analysis of Neural Video Compression Networks for 360-Degree Video Coding | Andy Regensky et.al. | 2402.10257 | null |
2024-02-14 | Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion | Edgar Heinert et.al. | 2402.09530 | link |
2024-02-14 | A Comprehensive Review of Software and Hardware Energy Efficiency of Video Decoders | Matthias Kränzler et.al. | 2402.09001 | null |
2024-02-14 | Extreme Video Compression with Pre-trained Diffusion Models | Bohan Li et.al. | 2402.08934 | link |
2024-02-14 | Saliency-aware End-to-end Learned Variable-Bitrate 360-degree Image Compression | Oguzhan Gungordu et.al. | 2402.08862 | null |
2024-02-13 | Learned Image Compression with Text Quality Enhancement | Chih-Yu Lai et.al. | 2402.08643 | null |
2024-02-13 | Motion-Adaptive Inference for Flexible Learned B-Frame Compression | M. Akin Yilmaz et.al. | 2402.08550 | null |
2024-02-21 | A Neural-network Enhanced Video Coding Framework beyond ECM | Yanchen Zhao et.al. | 2402.08397 | null |
2024-02-13 | Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss | Kei Iino et.al. | 2402.08267 | null |
2024-02-12 | Distributed Compression in the Era of Machine Learning: A Review of Recent Advances | Ezgi Ozyilkan et.al. | 2402.07997 | null |
2024-02-13 | Towards Meta-Pruning via Optimal Transport | Alexander Theus et.al. | 2402.07839 | link |
2024-02-09 | Parameter estimation for quantum jump unraveling | Marco Radaelli et.al. | 2402.06556 | link |
2024-02-07 | RAGE for the Machine: Image Compression with Low-Cost Random Access for Embedded Applications | Christian D. Rask et.al. | 2402.05974 | null |
2024-02-08 | Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers | Onur G. Guleryuz et.al. | 2402.05887 | link |
2024-02-08 | Joint End-to-End Image Compression and Denoising: Leveraging Contrastive Learning and Multi-Scale Self-ONNs | Yuxin Xie et.al. | 2402.05582 | null |
2024-02-05 | TexShape: Information Theoretic Sentence Embedding for Language Models | H. Kaan Kale et.al. | 2402.05132 | link |
2024-02-07 | Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth | Kevin Kögler et.al. | 2402.05013 | null |
2024-02-06 | A Novel Local and Hyper-Local Multicast Services Transmission Scheme for Beyond 5G Networks | Sweta Singh et.al. | 2402.03963 | null |
2024-02-06 | Cool-chic video: Learned video coding with 800 parameters | Thomas Leguay et.al. | 2402.03179 | link |
2024-02-05 | Perceptual Learned Image Compression via End-to-End JND-Based Optimization | Farhad Pakdaman et.al. | 2402.02836 | null |
2024-02-04 | Discovering More Effective Tensor Network Structure Search Algorithms via Large Language Models (LLMs) | Junhua Zeng et.al. | 2402.02456 | link |
2024-03-04 | RecNet: An Invertible Point Cloud Encoding through Range Image Embeddings for Multi-Robot Map Sharing and Reconstruction | Nikolaos Stathoulopoulos et.al. | 2402.02192 | null |
2024-02-03 | Generative Visual Compression: A Review | Bolin Chen et.al. | 2402.02140 | null |
2024-02-23 | Immersive Video Compression using Implicit Neural Representations | Ho Man Kwan et.al. | 2402.01596 | link |
2024-02-02 | Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization | Zhiyu Zhang et.al. | 2402.01380 | null |
2024-02-02 | UCVC: A Unified Contextual Video Compression Framework with Joint P-frame and B-frame Coding | Jiayu Yang et.al. | 2402.01289 | null |
2024-02-02 | Flexible Variational Information Bottleneck: Achieving Diverse Compression with a Single Training | Sota Kudo et.al. | 2402.01238 | link |
2024-02-02 | The O2 software framework and GPU usage in ALICE online and offline reconstruction in Run 3 | Giulio Eulisse et.al. | 2402.01205 | null |
2024-02-01 | Compressed image quality assessment using stacking | S. Farhad Hosseini-Benvidi et.al. | 2402.00993 | null |
2024-02-04 | Evaluating Large Language Models for Generalization and Robustness via Data Compression | Yucheng Li et.al. | 2402.00861 | link |
2024-03-11 | LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression | Wei Jiang et.al. | 2402.00680 | null |
2024-02-01 | Gain of Grain: A Film Grain Handling Toolchain for VVC-based Open Implementations | Vignesh V Menon et.al. | 2402.00622 | null |
2024-01-31 | EPSD: Early Pruning with Self-Distillation for Efficient Model Compression | Dong Chen et.al. | 2402.00084 | null |
2024-01-31 | A Neural Enhancement Post-Processor with a Dynamic AV1 Encoder Configuration Strategy for CLIC 2024 | Darren Ramsook et.al. | 2401.18021 | null |
2024-01-31 | Robustly overfitting latents for flexible neural image compression | Yura Perugachi-Diaz et.al. | 2401.17789 | null |
2024-01-30 | A Group Theoretic Metric for Robot State Estimation Leveraging Chebyshev Interpolation | Varun Agrawal et.al. | 2401.17463 | null |
2024-01-30 | SLIC: A Learned Image Codec Using Structure and Color | Srivatsa Prativadibhayankaram et.al. | 2401.17246 | link |
2024-01-30 | Large Language Model Evaluation via Matrix Entropy | Lai Wei et.al. | 2401.17139 | link |
2024-01-30 | Local integrals of motion in dipole-conserving models with Hilbert space fragmentation | Patrycja Łydżba et.al. | 2401.17097 | null |
2024-01-29 | On Channel Simulation with Causal Rejection Samplers | Daniel Goc et.al. | 2401.16579 | null |
2024-01-29 | Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression | Xihua Sheng et.al. | 2401.15864 | null |
2024-01-29 | Bayesian one- and two-sided inference on the local effective dimension | Eduard Belitser et.al. | 2401.15816 | null |
2024-01-28 | Towards Arbitrary-Scale Histopathology Image Super-resolution: An Efficient Dual-branch Framework via Implicit Self-texture Enhancement | Minghong Duan et.al. | 2401.15613 | null |
2024-01-26 | Shadow simulation of quantum processes | Xuanqiang Zhao et.al. | 2401.14934 | null |
2024-01-26 | Study of the gOMP Algorithm for Recovery of Compressed Sensed Hyperspectral Images | Jon Alvarez Justo et.al. | 2401.14786 | null |
2024-01-26 | A Comparative Study of Compressive Sensing Algorithms for Hyperspectral Imaging Reconstruction | Jon Alvarez Justo et.al. | 2401.14762 | null |
2024-01-26 | Residual Quantization with Implicit Neural Codebooks | Iris Huijben et.al. | 2401.14732 | link |
2024-01-25 | Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression | Daxin Li et.al. | 2401.14007 | null |
2024-02-07 | Perceptual-oriented Learned Image Compression with Dynamic Kernel | Nianxiang Fu et.al. | 2401.13967 | null |
2024-01-25 | Conditional Neural Video Coding with Spatial-Temporal Super-Resolution | Henan Wang et.al. | 2401.13959 | null |
2024-01-24 | FLLIC: Functionally Lossless Image Compression | Xi Zhang et.al. | 2401.13616 | null |
2024-01-23 | Fast Implicit Neural Representation Image Codec in Resource-limited Devices | Xiang Liu et.al. | 2401.12587 | null |
2024-01-22 | PairwiseHist: Fast, Accurate and Space-Efficient Approximate Query Processing with Data Compression | Aaron Hurst et.al. | 2401.12018 | null |
2024-01-22 | A Training-Free Defense Framework for Robust Learned Image Compression | Myungseo Song et.al. | 2401.11902 | null |
2024-01-21 | Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding | Yichi Zhang et.al. | 2401.11615 | null |
2024-01-21 | ColorVideoVDP: A visual difference predictor for image, video and display distortions | Rafal K. Mantiuk et.al. | 2401.11485 | link |
2024-01-21 | Data-driven compression of electron-phonon interactions | Yao Luo et.al. | 2401.11393 | null |
2024-01-20 | Learned Image Compression with Dual-Branch Encoder and Conditional Information Coding | Haisheng Fu et.al. | 2401.11093 | null |
2024-01-19 | NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines | Jukka I. Ahonen et.al. | 2401.10761 | null |
2024-01-19 | Bridging the gap between image coding for machines and humans | Nam Le et.al. | 2401.10732 | null |
2024-01-18 | Attack and Defense Analysis of Learned Image Compression | Tianyu Zhu et.al. | 2401.10345 | null |
2024-01-18 | Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions | Namitha Padmanabhan et.al. | 2401.10217 | null |
2024-01-18 | Depth Over RGB: Automatic Evaluation of Open Surgery Skills Using Depth Camera | Ido Zuckerman et.al. | 2401.10037 | null |
2024-01-18 | Memory Efficient Corner Detection for Event-driven Dynamic Vision Sensors | Pao-Sheng Vincent Sun et.al. | 2401.09797 | null |
2024-01-18 | Compressing MIMO Channel Submatrices with Tucker Decomposition: Enabling Efficient Storage and Reducing SINR Computation Overhead | Yuanwei Zhang et.al. | 2401.09792 | null |
2024-01-17 | Idempotence and Perceptual Image Compression | Tongda Xu et.al. | 2401.08920 | link |
2024-01-16 | End-to-End Optimized Image Compression with the Frequency-Oriented Transform | Yuefeng Zhang et.al. | 2401.08194 | null |
2024-01-17 | Learned Image Compression with ROI-Weighted Distortion and Bit Allocation | Wei Jiang et.al. | 2401.08154 | null |
2024-01-15 | Convolutional Neural Network Compression via Dynamic Parameter Rank Pruning | Manish Sharma et.al. | 2401.08014 | null |
2024-01-15 | Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models | Dan Jacobellis et.al. | 2401.07957 | link |
2024-01-14 | Exploring Compressed Image Representation as a Perceptual Proxy: A Study | Chen-Hsiu Huang et.al. | 2401.07200 | link |
2024-01-13 | Progressive Feature Fusion Network for Enhancing Image Quality Assessment | Kaiqun Wu et.al. | 2401.06992 | null |
2024-01-12 | Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part II: Spatial and Tonal Data Optimization | Niklas Kämper et.al. | 2401.06747 | null |
2024-03-18 | LiDAR Depth Map Guided Image Compression Model | Alessandro Gnutti et.al. | 2401.06517 | null |
2024-01-11 | Transformer Masked Autoencoders for Next-Generation Wireless Communications: Architecture and Opportunities | Abdullah Zayat et.al. | 2401.06274 | null |
2024-01-11 | MGARD: A multigrid framework for high-performance, error-controlled data compression and refactoring | Qian Gong et.al. | 2401.05994 | null |
2024-01-10 | SnapCap: Efficient Snapshot Compressive Video Captioning | Jianqiao Sun et.al. | 2401.04903 | null |
2024-01-09 | Modified Levenberg-Marquardt Algorithm For Tensor CP Decomposition in Image Compression | Ramin Goudarzi Karim et.al. | 2401.04670 | null |
2024-01-09 | Optimal Transcoding Resolution Prediction for Efficient Per-Title Bitrate Ladder Estimation | Jinhai Yang et.al. | 2401.04405 | null |
2024-01-08 | Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion | Minglong Xue et.al. | 2401.03788 | link |
2024-01-08 | A Video Coding Method Based on Neural Network for CLIC2024 | Zhengang Li et.al. | 2401.03623 | null |
2024-01-06 | Spatiotemporally adaptive compression for scientific dataset with feature preservation -- a case study on simulation data with extreme climate events analysis | Qian Gong et.al. | 2401.03317 | null |
2024-01-06 | Comparison of spectrum models as applied to single-particle |
Thomas A. Trainor et.al. | 2401.03290 | null |
2024-01-06 | Transferable Learned Image Compression-Resistant Adversarial Perturbations | Yang Sui et.al. | 2401.03115 | null |
2024-01-05 | MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multi-scale Dilated Convolution for Image Compressive Sensing (CS) | Youhao Yu et.al. | 2401.02884 | null |
2024-03-08 | Importance Matching Lemma for Lossy Compression with Side Information | Buu Phan et.al. | 2401.02609 | null |
2024-01-04 | Cool-Chic: Perceptually Tuned Low Complexity Overfitted Image Coder | Théo Ladune et.al. | 2401.02156 | link |
2024-01-04 | ED: Perceptually tuned Enhanced Compression Model | Pierrick Philippe et.al. | 2401.02145 | null |
2024-01-02 | NU-Class Net: A Novel Deep Learning-based Approach for Video Quality Enhancement | Parham Zilouchian Moghaddam et.al. | 2401.01163 | null |
2024-01-28 | Higher-Order Cellular Automata Generated Symmetry-Protected Topological Phases and Detection Through Multi-Point Strange Correlators | Jie-Yu Zhang et.al. | 2401.00505 | null |
2023-12-28 | Selective Run-Length Encoding | Xutan Peng et.al. | 2312.17024 | null |
2023-12-29 | FFCA-Net: Stereo Image Compression via Fast Cascade Alignment of Side Information | Yichong Xia et.al. | 2312.16963 | null |
2023-12-26 | Range Entropy Queries and Partitioning | Sanjay Krishnan et.al. | 2312.15959 | null |
2023-12-25 | MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression | Yi-Hsin Chen et.al. | 2312.15829 | null |
2023-12-25 | On Robust Wasserstein Barycenter: The Model and Algorithm | Xu Wang et.al. | 2312.15762 | null |
2023-12-25 | Scalable Face Image Coding via StyleGAN Prior: Towards Compression for Human-Machine Collaborative Vision | Qi Mao et.al. | 2312.15622 | null |
2023-12-22 | The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs | Junli Fang et.al. | 2312.14792 | null |
2024-01-09 | Enhanced Color Palette Modeling for Lossless Screen Content Compression | Hannah Och et.al. | 2312.14491 | null |
2023-12-30 | Efficient Communication in Federated Learning Using Floating-Point Lossy Compression | Grant Wilkins et.al. | 2312.13461 | null |
2023-12-19 | A Huffman based short message service compression technique using adjacent distance array | Pranta Sarker et.al. | 2312.12495 | null |
2023-12-19 | Full-reference Video Quality Assessment for User Generated Content Transcoding | Zihao Qi et.al. | 2312.12317 | null |
2023-12-19 | Low-Consumption Partial Transcoding by HEVC | Mohsen Abdoli et.al. | 2312.12174 | link |
2023-12-19 | Comparative Study of Hardware and Software Power Measurements in Video Compression | Angeliki Katsenou et.al. | 2312.12150 | null |
2023-12-18 | Blind-Touch: Homomorphic Encryption-Based Distributed Neural Network Inference for Privacy-Preserving Fingerprint Authentication | Hyunmin Choi et.al. | 2312.11575 | link |
2024-01-11 | Quantized Decoder in Learned Image Compression for Deterministic Reconstruction | Esin Koyuncu et.al. | 2312.11209 | null |
2023-12-19 | A Computationally Efficient Neural Video Compression Accelerator Based on a Sparse CNN-Transformer Hybrid Network | Siyu Zhang et.al. | 2312.10716 | null |
2023-12-17 | IntraSeismic: a coordinate-based learning approach to seismic inversion | Juan Romero et.al. | 2312.10568 | null |
2023-12-17 | Light-weight CNN-based VVC Inter Partitioning Acceleration | Yiqun Liu et.al. | 2312.10567 | null |
2023-12-16 | Statistical Analysis of Inter Coding in VVC Test Model (VTM) | Yiqun Liu et.al. | 2312.10406 | null |
2023-12-15 | IQNet: Image Quality Assessment Guided Just Noticeable Difference Prefiltering For Versatile Video Coding | Yu-Han Sun et.al. | 2312.09799 | null |
2023-12-15 | Towards Neuromorphic Compression based Neural Sensing for Next-Generation Wireless Implantable Brain Machine Interface | Vivek Mohan et.al. | 2312.09503 | null |
2023-12-14 | Geometry-Corrected Geodesic Motion Modeling with Per-Frame Camera Motion for 360-Degree Video Compression | Andy Regensky et.al. | 2312.09266 | link |
2023-12-14 | Efficient Online Learning of Contact Force Models for Connector Insertion | Kevin Tracy et.al. | 2312.09190 | null |
2023-12-13 | Balanced and Deterministic Weight-sharing Helps Network Performance | Oscar Chang et.al. | 2312.08401 | null |
2023-12-13 | Preparing VVC for Streaming: A Fast Multi-Rate Encoding Approach | Yiqun Liu et.al. | 2312.08330 | null |
2023-12-13 | CenterGrasp: Object-Aware Implicit Representation Learning for Simultaneous Shape Reconstruction and 6-DoF Grasp Estimation | Eugenio Chisari et.al. | 2312.08240 | null |
2023-12-13 | Explainable Trajectory Representation through Dictionary Learning | Yuanbo Tang et.al. | 2312.08052 | null |
2023-12-12 | Deep Hierarchical Video Compression | Ming Lu et.al. | 2312.07126 | null |
2023-12-12 | Communication Cost Reduction for Subgraph Counting under Local Differential Privacy via Hash Functions | Quentin Hillebrand et.al. | 2312.07055 | link |
2023-12-11 | RAFIC: Retrieval-Augmented Few-shot Image Classification | Hangfei Lin et.al. | 2312.06868 | link |
2023-12-11 | A New Projection Pursuit Index for Big Data | Yajie Duan et.al. | 2312.06465 | null |
2023-12-11 | Variational Auto-Encoder Based Deep Learning Technique For Filling Gaps in Reacting PIV Data | Shashank Yellapantula et.al. | 2312.06461 | null |
2023-12-07 | Analysis of Coding Gain Due to In-Loop Reshaping | Chau-Wai Wong et.al. | 2312.04022 | null |
2023-12-05 | C3: High-performance and low-complexity neural compression from a single image or video | Hyunjik Kim et.al. | 2312.02753 | null |
2023-12-05 | Unified learning-based lossy and lossless JPEG recompression | Jianghui Zhang et.al. | 2312.02705 | null |
2023-12-05 | Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation | Tianhao Peng et.al. | 2312.02605 | null |
2023-12-04 | Hyperspectral Image Compression Using Sampling and Implicit Neural Representations | Shima Rezasoltani et.al. | 2312.01558 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-19 | SqueezeMe: Efficient Gaussian Avatars for VR | Shunsuke Saito et.al. | 2412.15171 | null |
2024-12-19 | OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization | Jiacheng Zhang et.al. | 2412.15159 | null |
2024-12-19 | Jet: A Modern Transformer-Based Normalizing Flow | Alexander Kolesnikov et.al. | 2412.15129 | null |
2024-12-19 | Joint estimation of activity, attenuation and motion in respiratory-self-gated time-of-flight PET | Masoud Elhamiasl et.al. | 2412.15018 | null |
2024-12-19 | Unified Image Restoration and Enhancement: Degradation Calibrated Cycle Reconstruction Diffusion Model | Minglong Xue et.al. | 2412.14630 | null |
2024-12-19 | Qua |
Keith G. Mills et.al. | 2412.14628 | null |
2024-12-19 | Successive optimization of optics and post-processing with differentiable coherent PSF operator and field information | Zheng Ren et.al. | 2412.14603 | link |
2024-12-19 | Enhancing Diffusion Models for High-Quality Image Generation | Jaineet Shah et.al. | 2412.14422 | null |
2024-12-18 | Improving diabetic retinopathy screening using Artificial Intelligence: design, evaluation and before-and-after study of a custom development | Imanol Pinto et.al. | 2412.14221 | null |
2024-12-19 | E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling | Zhihang Yuan et.al. | 2412.14170 | null |
2024-12-18 | VideoDPO: Omni-Preference Alignment for Video Diffusion Generation | Runtao Liu et.al. | 2412.14167 | null |
2024-12-18 | AKiRa: Augmentation Kit on Rays for optical video generation | Xi Wang et.al. | 2412.14158 | null |
2024-12-18 | Real-Time Position-Aware View Synthesis from Single-View Input | Manu Gond et.al. | 2412.14005 | null |
2024-12-18 | Data-Efficient Inference of Neural Fluid Fields via SciML Foundation Model | Yuqiu Liu et.al. | 2412.13897 | null |
2024-12-18 | VIIS: Visible and Infrared Information Synthesis for Severe Low-light Image Enhancement | Chen Zhao et.al. | 2412.13655 | link |
2024-12-18 | PASCO (PArallel Structured COarsening): an overlay to speed up graph clustering algorithms | Etienne Lasalle et.al. | 2412.13592 | link |
2024-12-18 | T |
Zhenhong Sun et.al. | 2412.13486 | null |
2024-12-18 | Real-time One-Step Diffusion-based Expressive Portrait Videos Generation | Hanzhong Guo et.al. | 2412.13479 | link |
2024-12-17 | Optimisation of Magnetic Field Sensing with Optically Pumped Magnetometers for Magnetic Detection Electrical Impedance Tomography | Kai Mason et.al. | 2412.13354 | null |
2024-12-17 | Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures | Guoxing Sun et.al. | 2412.13183 | null |
2024-12-17 | F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration | Lu Liu et.al. | 2412.13155 | null |
2024-12-17 | Unlocking the Potential of Digital Pathology: Novel Baselines for Compression | Maximilian Fischer et.al. | 2412.13137 | null |
2024-12-18 | AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark | Jianlyu Chen et.al. | 2412.13102 | link |
2024-12-17 | Smartphone-based Iris Recognition through High-Quality Visible Spectrum Iris Capture | Naveenkumar G Venkataswamy et.al. | 2412.13063 | null |
2024-12-17 | Experimental Study of Low-Latency Video Streaming in an ORAN Setup with Generative AI | Andreas Casparsen et.al. | 2412.12751 | null |
2024-12-17 | Subspace Implicit Neural Representations for Real-Time Cardiac Cine MR Imaging | Wenqi Huang et.al. | 2412.12742 | link |
2024-12-17 | Complex extension of optical flow and its practical evaluation for undersampled dynamic MRI | Matthias J. Ehrhardt et.al. | 2412.12711 | null |
2024-12-17 | A Two-Fold Patch Selection Approach for Improved 360-Degree Image Quality Assessment | Abderrezzaq Sendjasni et.al. | 2412.12667 | link |
2024-12-17 | RDPI: A Refine Diffusion Probability Generation Method for Spatiotemporal Data Imputation | Zijin Liu et.al. | 2412.12642 | link |
2024-12-17 | Consistent Diffusion: Denoising Diffusion Model with Data-Consistent Training for Image Restoration | Xinlong Cheng et.al. | 2412.12550 | null |
2024-12-17 | Invisible Watermarks: Attacks and Robustness | Dongjun Hwang et.al. | 2412.12511 | link |
2024-12-16 | PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Cheng Zhang et.al. | 2412.12096 | link |
2024-12-16 | Wonderland: Navigating 3D Scenes from a Single Image | Hanwen Liang et.al. | 2412.12091 | null |
2024-12-16 | SPADE: Spectroscopic Photoacoustic Denoising using an Analytical and Data-free Enhancement Framework | Fangzhou Lin et.al. | 2412.12068 | null |
2024-12-16 | Industrial-scale Prediction of Cement Clinker Phases using Machine Learning | Sheikh Junaid Fayaz et.al. | 2412.11981 | null |
2024-12-16 | Towards Physically-Based Sky-Modeling | Ian J. Maquignaz et.al. | 2412.11883 | null |
2024-12-16 | Impact of Face Alignment on Face Image Quality | Eren Onaran et.al. | 2412.11779 | null |
2024-12-16 | Formal Quality Measures for Predictors in Markov Decision Processes | Christel Baier et.al. | 2412.11754 | null |
2024-12-16 | Comparison of three reconstruction algorithms for low-dose phase-contrast computed tomography of the breast with synchrotron radiation | Sandro Donato et.al. | 2412.11641 | null |
2024-12-16 | MT-LENS: An all-in-one Toolkit for Better Machine Translation Evaluation | Javier García Gilabert et.al. | 2412.11615 | null |
2024-12-16 | Block-Based Multi-Scale Image Rescaling | Jian Li et.al. | 2412.11468 | null |
2024-12-16 | Controllable Distortion-Perception Tradeoff Through Latent Diffusion for Neural Image Compression | Chuqin Zhou et.al. | 2412.11379 | null |
2024-12-15 | VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping | Hao Shao et.al. | 2412.11279 | null |
2024-12-15 | CATER: Leveraging LLM to Pioneer a Multidimensional, Reference-Independent Paradigm in Translation Quality Evaluation | Kurando IIDA et.al. | 2412.11261 | null |
2024-12-15 | Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation | Yujie Zhang et.al. | 2412.11170 | null |
2024-12-15 | A Comprehensive Survey of Action Quality Assessment: Method and Benchmark | Kanglei Zhou et.al. | 2412.11149 | null |
2024-12-14 | Zigzag Diffusion Sampling: The Path to Success Is Zigzag | Lichen Bai et.al. | 2412.10891 | link |
2024-12-14 | Unbiased General Annotated Dataset Generation | Dengyang Jiang et.al. | 2412.10831 | null |
2024-12-14 | Rapid Reconstruction of Extremely Accelerated Liver 4D MRI via Chained Iterative Refinement | Di Xu et.al. | 2412.10629 | null |
2024-12-13 | RAID-Database: human Responses to Affine Image Distortions | Paula Daudén-Oliver et.al. | 2412.10211 | null |
2024-12-13 | GT23D-Bench: A Comprehensive General Text-to-3D Generation Benchmark | Sitong Su et.al. | 2412.09997 | null |
2024-12-13 | EP-CFG: Energy-Preserving Classifier-Free Guidance | Kai Zhang et.al. | 2412.09966 | null |
2024-12-13 | Jiawei Li et.al. | 2412.09954 | null | |
2024-12-13 | Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attacks on Breast Ultrasound Images | Yasamin Medghalchi et.al. | 2412.09910 | link |
2024-12-13 | LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity | Hongjie Wang et.al. | 2412.09856 | null |
2024-12-13 | A Single-Frame and Multi-Frame Cascaded Image Super-Resolution Method | Jing Sun et.al. | 2412.09846 | null |
2024-12-13 | Super-Resolution for Remote Sensing Imagery via the Coupling of a Variational Model and Deep Learning | Jing Sun et.al. | 2412.09841 | null |
2024-12-13 | Prospects for Systematic Planetary Nebulae Detection with the Census of the Local Universe Narrowband Survey | Rong Du et.al. | 2412.09836 | null |
2024-12-13 | Speech-based Multimodel Pipeline for Vietnamese Services Quality Assessment | Quang-Anh N. D. et.al. | 2412.09829 | null |
2024-12-12 | OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs | Yuanzhi Zhu et.al. | 2412.09465 | link |
2024-12-12 | UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer | Delong Liu et.al. | 2412.09389 | link |
2024-12-13 | Are Conditional Latent Diffusion Models Effective for Image Restoration? | Yunchen Yuan et.al. | 2412.09324 | null |
2024-12-12 | Towards Understanding the Robustness of LLM-based Evaluations under Perturbations | Manav Chaudhary et.al. | 2412.09269 | null |
2024-12-12 | Elevating Flow-Guided Video Inpainting with Reference Generation | Suhwan Cho et.al. | 2412.08975 | link |
2024-12-12 | Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Ali Mollaahmadi Dehaghi et.al. | 2412.08912 | link |
2024-12-11 | DeepNose: An Equivariant Convolutional Neural Network Predictive Of Human Olfactory Percepts | Sergey Shuvaev et.al. | 2412.08747 | null |
2024-12-13 | Utilizing Multi-step Loss for Single Image Reflection Removal | Abdelrahman Elnenaey et.al. | 2412.08582 | link |
2024-12-11 | PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis | Yifan Xie et.al. | 2412.08504 | null |
2024-12-12 | Learning Flow Fields in Attention for Controllable Person Image Generation | Zijian Zhou et.al. | 2412.08486 | link |
2024-12-11 | Visible and Infrared Image Fusion Using Encoder-Decoder Network | Ferhat Can Ataman et.al. | 2412.08073 | link |
2024-12-11 | NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods | Qiang Qu et.al. | 2412.08029 | link |
2024-12-10 | Graph convolutional networks enable fast hemorrhagic stroke monitoring with electrical impedance tomography | J. Toivanen et.al. | 2412.07888 | null |
2024-12-10 | PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition | Kartik Narayan et.al. | 2412.07771 | null |
2024-12-10 | 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation | Xiao Fu et.al. | 2412.07759 | null |
2024-12-10 | PortraitTalk: Towards Customizable One-Shot Audio-to-Talking Face Generation | Fatemeh Nazarieh et.al. | 2412.07754 | null |
2024-12-10 | Multi-Shot Character Consistency for Text-to-Video Generation | Yuval Atzmon et.al. | 2412.07750 | null |
2024-12-11 | Direct Low-Dose CT Image Reconstruction on GPU using Out-Of-Core: Precision and Quality Study | M. Chillarón et.al. | 2412.07631 | null |
2024-12-10 | OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations | Linke Ouyang et.al. | 2412.07626 | link |
2024-12-10 | CoMA: Compositional Human Motion Generation with Multi-modal Agents | Shanlin Sun et.al. | 2412.07320 | null |
2024-12-10 | Backdoor Attacks against No-Reference Image Quality Assessment Models via A Scalable Trigger | Yi Yu et.al. | 2412.07277 | null |
2024-12-10 | Moderating the Generalization of Score-based Generative Model | Wan Jiang et.al. | 2412.07229 | null |
2024-12-11 | Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation | Tal Zeevi et.al. | 2412.07169 | link |
2024-12-10 | QCResUNet: Joint Subject-level and Voxel-level Segmentation Quality Prediction | Peijie Qiu et.al. | 2412.07156 | link |
2024-12-10 | Light Field Image Quality Assessment With Auxiliary Learning Based on Depthwise and Anglewise Separable Convolutions | Qiang Qu et.al. | 2412.07079 | null |
2024-12-11 | Diff-GO |
Suchinthaka Wanninayaka et.al. | 2412.06980 | null |
2024-12-09 | Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning | Mehdi Noroozi et.al. | 2412.06978 | null |
2024-12-09 | Ranking-aware adapter for text-driven image ordering with CLIP | Wei-Hsiang Yu et.al. | 2412.06760 | link |
2024-12-09 | AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark | Lan Li et.al. | 2412.06724 | link |
2024-12-10 | A No-Reference Medical Image Quality Assessment Method Based on Automated Distortion Recognition Technology: Application to Preprocessing in MRI-guided Radiotherapy | Zilin Wang et.al. | 2412.06599 | null |
2024-12-09 | How Certain are Uncertainty Estimates? Three Novel Earth Observation Datasets for Benchmarking Uncertainty Quantification in Machine Learning | Yuanyuan Wang et.al. | 2412.06451 | null |
2024-12-09 | Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment | Kim Sung-Bin et.al. | 2412.06209 | null |
2024-12-09 | One-shot Human Motion Transfer via Occlusion-Robust Flow Prediction and Neural Texturing | Yuzhu Ji et.al. | 2412.06174 | null |
2024-12-09 | A CT Image Denoising Method Based on Projection Domain Feature | Mengyu Sun et.al. | 2412.06135 | null |
2024-12-08 | Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training | Zhenghong Zhou et.al. | 2412.06029 | null |
2024-12-08 | Enhancing Content Representation for AR Image Quality Assessment Using Knowledge Distillation | Aymen Sekhri et.al. | 2412.06003 | null |
2024-12-08 | Nested Diffusion Models Using Hierarchical Latent Priors | Xiao Zhang et.al. | 2412.05984 | null |
2024-12-08 | Unsupervised Multi-Parameter Inverse Solving for Reducing Ring Artifacts in 3D X-Ray CBCT | Qing Wu et.al. | 2412.05853 | null |
2024-12-08 | SizeGS: Size-aware Compression of 3D Gaussians with Hierarchical Mixed Precision Quantization | Shuzhao Xie et.al. | 2412.05808 | null |
2024-12-07 | Emulating Clinical Quality Muscle B-mode Ultrasound Images from Plane Wave Images Using a Two-Stage Machine Learning Model | Reed Chen et.al. | 2412.05758 | link |
2024-12-07 | A Tiered GAN Approach for Monet-Style Image Generation | FNU Neha et.al. | 2412.05724 | null |
2024-12-07 | Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes | Saqib Javed et.al. | 2412.05700 | null |
2024-12-07 | Enhancing Research Methodology and Academic Publishing: A Structured Framework for Quality and Integrity | Md. Jalil Piran et.al. | 2412.05683 | null |
2024-12-07 | Deep Reinforcement Learning-Based Resource Allocation for Hybrid Bit and Generative Semantic Communications in Space-Air-Ground Integrated Networks | Chong Huang et.al. | 2412.05647 | null |
2024-12-06 | LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation | Donald Shenaj et.al. | 2412.05148 | null |
2024-12-06 | Comprehensive Analysis and Improvements in Pansharpening Using Deep Learning | Mahek Kantharia et.al. | 2412.04896 | null |
2024-12-06 | Building a Family of Data Augmentation Models for Low-cost LLM Fine-tuning on the Cloud | Yuanhao Yue et.al. | 2412.04871 | null |
2024-12-05 | Motion-Guided Deep Image Prior for Cardiac MRI | Marc Vornehm et.al. | 2412.04639 | null |
2024-12-05 | MetaFormer: High-fidelity Metalens Imaging via Aberration Correcting Transformers | Byeonghyeon Lee et.al. | 2412.04591 | null |
2024-12-05 | 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion | Chaoyang Wang et.al. | 2412.04462 | null |
2024-12-05 | LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors | Yusuf Dalva et.al. | 2412.04460 | null |
2024-12-05 | Multi-Subject Image Synthesis as a Generative Prior for Single-Subject PET Image Reconstruction | George Webber et.al. | 2412.04324 | null |
2024-12-05 | T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts | Ziwei Huang et.al. | 2412.04300 | null |
2024-12-05 | IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation | Sejong Yang et.al. | 2412.04000 | null |
2024-12-05 | Blind Underwater Image Restoration using Co-Operational Regressor Networks | Ozer Can Devecioglu et.al. | 2412.03995 | null |
2024-12-05 | LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model | Yuan Xue et.al. | 2412.03841 | null |
2024-12-04 | Advancing Auto-Regressive Continuation for Video Frames | Ruibo Ming et.al. | 2412.03758 | null |
2024-12-04 | MV-Adapter: Multi-view Consistent Image Generation Made Easy | Zehuan Huang et.al. | 2412.03632 | null |
2024-12-04 | Style3D: Attention-guided Multi-view Style Transfer for 3D Object Generation | Bingjie Song et.al. | 2412.03571 | null |
2024-12-04 | NODE-AdvGAN: Improving the transferability and perceptual similarity of adversarial examples by dynamic-system-driven adversarial generative model | Xinheng Xie et.al. | 2412.03539 | null |
2024-12-04 | SGSST: Scaling Gaussian Splatting StyleTransfer | Bruno Galerne et.al. | 2412.03371 | link |
2024-12-04 | Is JPEG AI going to change image forensics? | Edoardo Daniele Cannas et.al. | 2412.03261 | null |
2024-12-04 | Task-driven Image Fusion with Learnable Fusion Loss | Haowen Bai et.al. | 2412.03240 | null |
2024-12-04 | Parametric Enhancement of PerceptNet: A Human-Inspired Approach for Image Quality Assessment | Jorge Vila-Tomás et.al. | 2412.03210 | link |
2024-12-04 | Unsupervised Network for Single Image Raindrop Removal | Huijiao Wang et.al. | 2412.03019 | null |
2024-12-04 | Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach | Lingchen Sun et.al. | 2412.03017 | link |
2024-12-04 | Partially Conditioned Patch Parallelism for Accelerated Diffusion Model Inference | XiuYu Zhang et.al. | 2412.02962 | null |
2024-12-04 | Surrogate distributed radiological sources III: quantitative distributed source reconstructions | Jayson R. Vavrek et.al. | 2412.02926 | null |
2024-12-04 | Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection | Prabhat Kc et.al. | 2412.02920 | null |
2024-12-03 | Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Hiroki Furuta et.al. | 2412.02617 | null |
2024-12-03 | High-Quality Passive Acoustic Mapping with the Cross-Correlated Angular Spectrum Method | Yi Zeng et.al. | 2412.02413 | null |
2024-12-03 | Switchable deep beamformer for high-quality and real-time passive acoustic mapping | Yi Zeng et.al. | 2412.02327 | null |
2024-12-03 | Initial Study On Improving Segmentation By Combining Preoperative CT And Intraoperative CBCT Using Synthetic Data | Maximilian E. Tschuchnig et.al. | 2412.02294 | null |
2024-12-02 | NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training | Dar-Yen Chen et.al. | 2412.02030 | null |
2024-12-02 | HybridMQA: Exploring Geometry-Texture Interactions for Colored Mesh Quality Assessment | Armin Shafiee Sarvestani et.al. | 2412.01986 | null |
2024-12-02 | IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models | Khaled Abud et.al. | 2412.01794 | link |
2024-12-02 | OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking | Xuanyu Zhang et.al. | 2412.01615 | null |
2024-12-02 | Negative Token Merging: Image-based Adversarial Feature Guidance | Jaskirat Singh et.al. | 2412.01339 | null |
2024-12-02 | Data Uncertainty-Aware Learning for Multimodal Aspect-based Sentiment Analysis | Hao Yang et.al. | 2412.01249 | null |
2024-12-02 | Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation | Zilyu Ye et.al. | 2412.01243 | null |
2024-12-02 | PainterNet: Adaptive Image Inpainting with Actual-Token Attention and Diverse Mask Control | Ruichen Wang et.al. | 2412.01223 | null |
2024-12-02 | Assessing GPT Model Uncertainty in Mathematical OCR Tasks via Entropy Analysis | Alexei Kaltchenko et.al. | 2412.01221 | null |
2024-12-02 | LoyalDiffusion: A Diffusion Model Guarding Against Data Replication | Chenghao Li et.al. | 2412.01118 | null |
2024-12-02 | FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait | Taekyung Ki et.al. | 2412.01064 | null |
2024-12-02 | Evaluating Automated Radiology Report Quality through Fine-Grained Phrasal Grounding of Clinical Findings | Razi Mahmood et.al. | 2412.01031 | null |
2024-12-01 | Optimal Algorithms for Augmented Testing of Discrete Distributions | Maryam Aliakbarpour et.al. | 2412.00974 | null |
2024-12-01 | Generating AI Literacy MCQs: A Multi-Agent LLM Approach | Jiayi Wang et.al. | 2412.00970 | null |
2024-12-01 | Playable Game Generation | Mingyu Yang et.al. | 2412.00887 | link |
2024-11-30 | Multi-resolution Guided 3D GANs for Medical Image Translation | Juhyung Ha et.al. | 2412.00575 | null |
2024-11-29 | INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge | Angelika Romanou et.al. | 2411.19799 | null |
2024-11-29 | ChineseWebText 2.0: Large-Scale High-quality Chinese Web Text with Multi-dimensional and fine-grained information | Wanyue Zhang et.al. | 2411.19668 | link |
2024-11-29 | Tortho-Gaussian: Splatting True Digital Orthophoto Maps | Xin Wang et.al. | 2411.19594 | null |
2024-11-29 | Self-Supervised Denoiser Framework | Emilien Valat et.al. | 2411.19593 | null |
2024-11-29 | Contextual Checkerboard Denoise -- A Novel Neural Network-Based Approach for Classification-Aware OCT Image Denoising | Md. Touhidul Islam et.al. | 2411.19549 | link |
2024-11-29 | Subjective and Objective Quality Assessment Methods of Stereoscopic Videos with Visibility Affecting Distortions | Sria Biswas et.al. | 2411.19522 | null |
2024-11-29 | Retrieval-guided Cross-view Image Synthesis | Hongji Yang et.al. | 2411.19510 | null |
2024-11-29 | Fleximo: Towards Flexible Text-to-Human Motion Video Generation | Yuhang Zhang et.al. | 2411.19459 | null |
2024-11-28 | AMO Sampler: Enhancing Text Rendering with Overshooting | Xixi Hu et.al. | 2411.19415 | null |
2024-11-28 | 3D Wasserstein generative adversarial network with dense U-Net based discriminator for preclinical fMRI denoising | Sima Soltanpour et.al. | 2411.19345 | null |
2024-11-28 | Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model | Feng Liu et.al. | 2411.19108 | null |
2024-11-28 | SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing | Rong-Cheng Tu et.al. | 2411.18983 | null |
2024-11-28 | Deep Plug-and-Play HIO Approach for Phase Retrieval | Cagatay Isil et.al. | 2411.18967 | null |
2024-12-02 | AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers | Sherwin Bahmani et.al. | 2411.18673 | null |
2024-11-27 | HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior | Li-Yuan Tsao et.al. | 2411.18662 | link |
2024-11-27 | Textured Gaussians for Enhanced 3D Scene Appearance Modeling | Brian Chao et.al. | 2411.18625 | null |
2024-11-27 | Uncertainty-driven Sampling for Efficient Pairwise Comparison Subjective Assessment | Shima Mohammadi et.al. | 2411.18372 | link |
2024-11-29 | HUPE: Heuristic Underwater Perceptual Enhancement with Semantic Collaborative Learning | Zengxi Zhang et.al. | 2411.18296 | link |
2024-11-27 | Deep End-to-end Adaptive k-Space Sampling, Reconstruction, and Registration for Dynamic MRI | George Yiasemis et.al. | 2411.18249 | null |
2024-11-27 | Towards Improved Objective Perceptual Audio Quality Assessment -- Part 1: A Novel Data-Driven Cognitive Model | Pablo M. Delgado et.al. | 2411.18222 | null |
2024-11-27 | KAN See Your Face | Dong Han et.al. | 2411.18165 | null |
2024-11-27 | Type-R: Automatically Retouching Typos for Text-to-Image Generation | Wataru Shimoda et.al. | 2411.18159 | null |
2024-11-26 | MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework | Xiangcheng Hu et.al. | 2411.17928 | link |
2024-11-26 | SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation | Ximing Xing et.al. | 2411.17832 | null |
2024-11-26 | Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient | Zigeng Chen et.al. | 2411.17787 | link |
2024-11-27 | Diffusion Autoencoders for Few-shot Image Generation in Hyperbolic Space | Lingxiao Li et.al. | 2411.17784 | null |
2024-11-26 | Perceptually Optimized Super Resolution | Volodymyr Karpenko et.al. | 2411.17513 | null |
2024-11-26 | Puzzle Similarity: A Perceptually-guided No-Reference Metric for Artifact Detection in 3D Scene Reconstructions | Nicolai Hermann et.al. | 2411.17489 | null |
2024-11-26 | Structure-Guided MR-to-CT Synthesis with Spatial and Semantic Alignments for Attenuation Correction of Whole-Body PET/MR Imaging | Jiaxu Zheng et.al. | 2411.17488 | null |
2024-11-26 | Dual-Representation Interaction Driven Image Quality Assessment with Restoration Assistance | Jingtong Yue et.al. | 2411.17390 | link |
2024-11-26 | InsightEdit: Towards Better Instruction Following for Image Editing | Yingjing Xu et.al. | 2411.17323 | null |
2024-11-26 | Reward Incremental Learning in Text-to-Image Generation | Maorong Wang et.al. | 2411.17310 | null |
2024-11-26 | Grounding-IQA: Multimodal Language Grounding Model for Image Quality Assessment | Zheng Chen et.al. | 2411.17237 | link |
2024-11-26 | AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM | Jiarui Wang et.al. | 2411.17221 | link |
2024-11-26 | ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting | Chengyou Jia et.al. | 2411.17176 | null |
2024-11-26 | OSDFace: One-Step Diffusion Model for Face Restoration | Jingkai Wang et.al. | 2411.17163 | link |
2024-11-26 | Motion Free B-frame Coding for Neural Video Compression | Van Thang Nguyen et.al. | 2411.17160 | null |
2024-11-26 | 4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction | Woong Oh Cho et.al. | 2411.17044 | null |
2024-11-26 | TED-VITON: Transformer-Empowered Diffusion Models for Virtual Try-On | Zhenchen Wan et.al. | 2411.17017 | link |
2024-11-25 | G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs | Kunyi Li et.al. | 2411.16898 | null |
2024-11-25 | Fully Automatic Deep Learning Pipeline for Whole Slide Image Quality Assessment | Falah Jabar et.al. | 2411.16885 | null |
2024-11-25 | LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction | Yiran Sun et.al. | 2411.16629 | null |
2024-11-25 | Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric | Zhichao Zhang et.al. | 2411.16619 | null |
2024-11-25 | Coherence Based Sound Speed Aberration Correction -- with clinical validation in obstetric ultrasound | Anders Emil Vrålstad et.al. | 2411.16551 | null |
2024-11-25 | Synthesising Handwritten Music with GANs: A Comprehensive Evaluation of CycleWGAN, ProGAN, and DCGAN | Elona Shatri et.al. | 2411.16405 | null |
2024-11-25 | Human-Calibrated Automated Testing and Validation of Generative Language Models | Agus Sudjianto et.al. | 2411.16391 | null |
2024-11-25 | Bounds for the maximum modulus of polynomial roots with nearly optimal worst-case overestimation | Prashant Batra et.al. | 2411.16385 | null |
2024-11-25 | Privacy-Preserving Federated Foundation Model for Generalist Ultrasound Artificial Intelligence | Yuncheng Jiang et.al. | 2411.16380 | null |
2024-11-25 | Sonic: Shifting Focus to Global Audio Perception in Portrait Animation | Xiaozhong Ji et.al. | 2411.16331 | null |
2024-11-25 | EPS: Efficient Patch Sampling for Video Overfitting in Deep Super-Resolution Model Training | Yiying Wei et.al. | 2411.16312 | null |
2024-11-25 | Weakly supervised image segmentation for defect-based grading of fresh produce | Manuel Knott et.al. | 2411.16219 | link |
2024-11-25 | VIRES: Video Instance Repainting with Sketch and Text Guidance | Shuchen Weng et.al. | 2411.16199 | null |
2024-11-25 | Image Generation Diversity Issues and How to Tame Them | Mischa Dombrowski et.al. | 2411.16171 | link |
2024-11-25 | ENCLIP: Ensembling and Clustering-Based Contrastive Language-Image Pretraining for Fashion Multimodal Search with Limited Data and Low-Quality Images | Prithviraj Purushottam Naik et.al. | 2411.16096 | null |
2024-11-25 | AI-Generated Image Quality Assessment Based on Task-Specific Prompt and Multi-Granularity Similarity | Jili Xia et.al. | 2411.16087 | null |
2024-11-24 | Distribution models of antennas in radio astronomy: Efficiency comparison of the golden spiral interferometry | Elio Quiroga Rodriguez et.al. | 2411.15904 | null |
2024-11-24 | A review on Machine Learning based User-Centric Multimedia Streaming Techniques | Monalisa Ghosh et.al. | 2411.15801 | null |
2024-11-24 | LTCF-Net: A Transformer-Enhanced Dual-Channel Fourier Framework for Low-Light Image Restoration | Gaojing Zhang et.al. | 2411.15740 | null |
2024-11-23 | SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation | Jiayuan Zhu et.al. | 2411.15513 | null |
2024-11-23 | Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation Benchmark | Rong-Cheng Tu et.al. | 2411.15488 | link |
2024-11-22 | HeadRouter: A Training-free Image Editing Framework for MM-DiTs by Adaptively Routing Attention Heads | Yu Xu et.al. | 2411.15034 | null |
2024-11-22 | FloAt: Flow Warping of Self-Attention for Clothing Animation Generation | Swasti Shreya Mishra et.al. | 2411.15028 | null |
2024-11-22 | Information Extraction from Heterogenous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation | Aniket Bhattacharyya et.al. | 2411.14957 | null |
2024-11-22 | Evaluating Vision Transformer Models for Visual Quality Control in Industrial Manufacturing | Miriam Alber et.al. | 2411.14953 | link |
2024-11-22 | Fast High-Quality Enhanced Imaging Algorithm for Layered Dielectric Targets Based on MMW MIMO-SAR System | Xu Chen et.al. | 2411.14837 | null |
2024-11-22 | BrightVAE: Luminosity Enhancement in Underexposed Endoscopic Images | Farzaneh Koohestani et.al. | 2411.14663 | null |
2024-11-22 | VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space | Armani Rodriguez et.al. | 2411.14642 | null |
2024-11-21 | Unveiling the Hidden: A Comprehensive Evaluation of Underwater Image Enhancement and Its Impact on Object Detection | Ali Awad et.al. | 2411.14626 | null |
2024-11-21 | Optimal Transcoding Preset Selection for Live Video Streaming | Zahra Nabizadeh et.al. | 2411.14613 | null |
2024-11-21 | Roadmap on Advances in Visual and Physiological Optics | Jesús E. Gómez-Correa et.al. | 2411.14606 | null |
2024-11-21 | Night-to-Day Translation via Illumination Degradation Disentanglement | Guanzhou Lan et.al. | 2411.14504 | null |
2024-11-21 | Regional Attention for Shadow Removal | Hengxing Liu et.al. | 2411.14201 | link |
2024-11-21 | Image Compression Using Novel View Synthesis Priors | Luyuan Peng et.al. | 2411.13862 | null |
2024-11-21 | Detecting Human Artifacts from Text-to-Image Models | Kaihong Wang et.al. | 2411.13842 | link |
2024-11-21 | Robust Steganography with Boundary-Preserving Overflow Alleviation and Adaptive Error Correction | Yu Cheng et.al. | 2411.13819 | null |
2024-11-21 | Edge-Cloud Routing for Text-to-Image Model with Token-Level Multi-Metric Prediction | Zewei Xin et.al. | 2411.13787 | null |
2024-11-20 | What You See Is What Matters: A Novel Visual and Physics-Based Metric for Evaluating Video Generation Quality | Zihan Wang et.al. | 2411.13609 | null |
2024-11-20 | HF-Diff: High-Frequency Perceptual Loss and Distribution Matching for One-Step Diffusion-Based Image Super-Resolution | Shoaib Meraj Sami et.al. | 2411.13548 | null |
2024-11-20 | RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content | Yuxuan Jiang et.al. | 2411.13362 | null |
2024-11-20 | OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging | Rajini Makam et.al. | 2411.13230 | link |
2024-11-20 | ESARM: 3D Emotional Speech-to-Animation via Reward Model from Automatically-Ranked Demonstrations | Xulong Zhang et.al. | 2411.13089 | null |
2024-11-20 | LMM-driven Semantic Image-Text Coding for Ultra Low-bitrate Learned Image Compression | Shimon Murai et.al. | 2411.13033 | link |
2024-11-19 | HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and Manipulation | Abdul Basit Anees et.al. | 2411.12832 | link |
2024-11-19 | Mitigating Perception Bias: A Training-Free Approach to Enhance LMM for Image Quality Assessment | Siyi Pan et.al. | 2411.12791 | null |
2024-11-19 | Stochastic BIQA: Median Randomized Smoothing for Certified Blind Image Quality Assessment | Ekaterina Shumitskaya et.al. | 2411.12575 | null |
2024-11-19 | PR-ENDO: Physically Based Relightable Gaussian Splatting for Endoscopy | Joanna Kaleta et.al. | 2411.12510 | link |
2024-11-19 | A |
Abdul Halim et.al. | 2411.12457 | null |
2024-11-19 | Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models | Jun Xiao et.al. | 2411.12450 | null |
2024-11-19 | Acquire Precise and Comparable Fundus Image Quality Score: FTHNet and FQS Dataset | Zheng Gong et.al. | 2411.12273 | null |
2024-11-19 | Performance of Large Language Models in Technical MRI Question Answering: A Comparative Study | Alan B McMillan et.al. | 2411.12238 | null |
2024-11-19 | Tangential Randomization in Linear Bandits (TRAiL): Guaranteed Inference and Regret Bounds | Arda Güçlü et.al. | 2411.12154 | null |
2024-11-18 | FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting | Fangyu Wu et.al. | 2411.12089 | null |
2024-11-18 | Edge-Enhanced Dilated Residual Attention Network for Multimodal Medical Image Fusion | Meng Zhou et.al. | 2411.11799 | link |
2024-11-18 | Additional Tests for TV 3.0 | Eduardo Peixoto et.al. | 2411.11755 | null |
2024-11-18 | Towards Degradation-Robust Reconstruction in Generalizable NeRF | Chan Ho Park et.al. | 2411.11691 | null |
2024-11-18 | CLUE-MARK: Watermarking Diffusion Models using CLWE | Kareem Shehata et.al. | 2411.11434 | null |
2024-11-17 | BVI-CR: A Multi-View Human Dataset for Volumetric Video Compression | Ge Gao et.al. | 2411.11199 | null |
2024-11-17 | Enhanced Anime Image Generation Using USE-CMHSA-GAN | J. Lu et.al. | 2411.11179 | null |
2024-11-17 | Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion | Yu-Fei Shi et.al. | 2411.11123 | null |
2024-11-17 | MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild | Xi Fang et.al. | 2411.11098 | null |
2024-11-17 | Spectral Subspace Clustering for Attributed Graphs | Xiaoyang Lin et.al. | 2411.11074 | link |
2024-11-17 | Skeleton-Guided Spatial-Temporal Feature Learning for Video-Based Visible-Infrared Person Re-Identification | Wenjia Jiang et.al. | 2411.11069 | null |
2024-11-17 | Hyperspectral Imaging-Based Grain Quality Assessment With Limited Labelled Data | Priyabrata Karmakar et.al. | 2411.10924 | null |
2024-11-16 | HJ-Ky-0.1: an Evaluation Dataset for Kyrgyz Word Embeddings | Anton Alekseev et.al. | 2411.10724 | null |
2024-11-15 | M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation | Sucheng Ren et.al. | 2411.10433 | link |
2024-11-15 | On the Foundation Model for Cardiac MRI Reconstruction | Chi Zhang et.al. | 2411.10403 | null |
2024-11-15 | Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting | Ziqi Xie et.al. | 2411.10309 | link |
2024-11-15 | The Unreasonable Effectiveness of Guidance for Diffusion Models | Tim Kaiser et.al. | 2411.10257 | null |
2024-11-15 | Block based Adaptive Compressive Sensing with Sampling Rate Control | Kosuke Iwama et.al. | 2411.10200 | null |
2024-11-15 | Visual question answering based evaluation metrics for text-to-image generation | Mizuki Miyamoto et.al. | 2411.10183 | null |
2024-11-15 | SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning | Zewen Chen et.al. | 2411.10161 | link |
2024-11-15 | Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning | Yushen Zuo et.al. | 2411.10130 | null |
2024-11-15 | EveGuard: Defeating Vibration-based Side-Channel Eavesdropping with Audio Adversarial Perturbations | Jung-Woo Chang et.al. | 2411.10034 | null |
2024-11-14 | Video Denoising in Fluorescence Guided Surgery | Trevor Seets et.al. | 2411.09798 | null |
2024-11-14 | Research evaluation with ChatGPT: Is it age, country, length, or field biased? | Mike Thelwall et.al. | 2411.09768 | null |
2024-11-14 | Evaluating the Predictive Capacity of ChatGPT for Academic Peer Review Outcomes Across Multiple Platforms | Mike Thelwall et.al. | 2411.09763 | null |
2024-11-14 | MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation | Jonas Serych et.al. | 2411.09551 | link |
2024-11-14 | GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising | Yunuo Wang et.al. | 2411.09512 | null |
2024-11-14 | Iterative tomographic reconstruction with TV prior for low-dose CBCT dental imaging | Louise Friot-Giroux et.al. | 2411.09306 | null |
2024-11-14 | LLV-FSR: Exploiting Large Language-Vision Prior for Face Super-resolution | Chenyang Wang et.al. | 2411.09293 | null |
2024-11-14 | LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space | Guanwen Feng et.al. | 2411.09268 | null |
2024-11-14 | JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation | Xuyang Cao et.al. | 2411.09209 | link |
2024-11-14 | Orthogonal Linear Array based Product Beamforming for Real Time Underwater 3D Acoustical Imaging | Mimisha M Menakath et.al. | 2411.09197 | null |
2024-11-14 | Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance | Md Fahim Anjum et.al. | 2411.09174 | null |
2024-11-13 | Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment | Zihao Huang et.al. | 2411.09007 | null |
2024-11-13 | Causal Explanations for Image Classifiers | Hana Chockler et.al. | 2411.08875 | link |
2024-11-13 | A novel imaging setup for hybrid radiotherapy tailored PET/MR in patients with head and neck cancer | R. M. Winter et.al. | 2411.08783 | null |
2024-11-13 | Robust Divergence Learning for Missing-Modality Segmentation | Runze Cheng et.al. | 2411.08305 | null |
2024-11-13 | Numerical Analysis of Lensless Imaging with Active Metasurfaces and Single-Pixel Detectors | Julie Belleville et.al. | 2411.08282 | null |
2024-11-12 | DuoLift-GAN:Reconstructing CT from Single-view and Biplanar X-Rays with Generative Adversarial Networks | Zhaoxi Zhang et.al. | 2411.07941 | null |
2024-11-12 | Learning Disentangled Representations for Perceptual Point Cloud Quality Assessment via Mutual Information Minimization | Ziyu Shan et.al. | 2411.07936 | null |
2024-11-12 | CT-Mamba: A Hybrid Convolutional State Space Model for Low-Dose CT Denoising | Linxuan Li et.al. | 2411.07930 | link |
2024-11-12 | Joint multi-dimensional dynamic attention and transformer for general image restoration | Huan Zhang et.al. | 2411.07893 | link |
2024-11-12 | No-Reference Point Cloud Quality Assessment via Graph Convolutional Network | Wu Chen et.al. | 2411.07728 | null |
2024-11-12 | SegQC: a segmentation network-based framework for multi-metric segmentation quality control and segmentation error detection in volumetric medical images | Bella Specktor-Fadida et.al. | 2411.07601 | null |
2024-11-12 | IR image databases generation under target intrinsic thermal variability constraints | Jerome Gilles et.al. | 2411.07577 | null |
2024-11-12 | Multi-task Feature Enhancement Network for No-Reference Image Quality Assessment | Li Yu et.al. | 2411.07556 | null |
2024-11-12 | A Novel Automatic Real-time Motion Tracking Method for Magnetic Resonance Imaging-guided Radiotherapy: Leveraging the Enhanced Tracking-Learning-Detection Framework with Automatic Segmentation | Shengqi Chen et.al. | 2411.07503 | null |
2024-11-12 | An Exploration of Parallel Imaging System for Very-low Field (50mT) MRI Scanner | Lei Yang et.al. | 2411.07489 | null |
2024-11-11 | Evaluating Detection Thresholds: The Impact of False Positives and Negatives on Super-Resolution Ultrasound Localization Microscopy | Sepideh K. Gharamaleki et.al. | 2411.07426 | null |
2024-11-11 | Exploring Variational Autoencoders for Medical Image Generation: A Comprehensive Study | Khadija Rais et.al. | 2411.07348 | null |
2024-11-11 | Artificial Intelligence-Informed Handheld Breast Ultrasound for Screening: A Systematic Review of Diagnostic Test Accuracy | Arianna Bunnell et.al. | 2411.07322 | null |
2024-11-11 | GPU-Accelerated Inverse Lithography Towards High Quality Curvy Mask Generation | Haoyu Yang et.al. | 2411.07311 | null |
2024-11-11 | A Hierarchical Compression Technique for 3D Gaussian Splatting Compression | He Huang et.al. | 2411.06976 | null |
2024-11-11 | Multi-scale Frequency Enhancement Network for Blind Image Deblurring | Yawen Xiang et.al. | 2411.06893 | null |
2024-11-11 | Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation | Reo Yoneyama et.al. | 2411.06807 | null |
2024-11-11 | Machine vision-aware quality metrics for compressed image and video assessment | Mikhail Dremin et.al. | 2411.06776 | null |
2024-11-11 | Loss-tolerant neural video codec aware congestion control for real time video communication | Zhengxu Xia et.al. | 2411.06742 | null |
2024-11-11 | 360-Degree Video Super Resolution and Quality Enhancement Challenge: Methods and Results | Ahmed Telili et.al. | 2411.06738 | null |
2024-11-11 | Accelerating Low-field MRI: Compressed Sensing and AI for fast noise-robust imaging | Efrat Shimron et.al. | 2411.06704 | link |
2024-11-10 | CASC: Condition-Aware Semantic Communication with Latent Diffusion Models | Weixuan Chen et.al. | 2411.06552 | null |
2024-11-08 | A Modular Conditional Diffusion Framework for Image Reconstruction | Magauiya Zhussip et.al. | 2411.05993 | null |
2024-11-08 | Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings | Miguel Moura Ramos et.al. | 2411.05986 | null |
2024-11-08 | Dictionary Learning with Convolutional Structure for Seismic Data Denoising and Interpolation | Murad Almadani et.al. | 2411.05956 | null |
2024-11-08 | Alternative Learning Paradigms for Image Quality Transfer | Ahmed Karam Eldaly et.al. | 2411.05885 | null |
2024-11-08 | Benchmarking 3D multi-coil NC-PDNet MRI reconstruction | Asma Tanabene et.al. | 2411.05883 | null |
2024-11-08 | Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation | Long Truong To et.al. | 2411.05641 | null |
2024-11-08 | DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions | Rafael Berral-Soler et.al. | 2411.05552 | link |
2024-11-08 | Improving image synthesis with diffusion-negative sampling | Alakh Desai et.al. | 2411.05473 | null |
2024-11-08 | RED: Residual Estimation Diffusion for Low-Dose PET Sinogram Reconstruction | Xingyu Ai et.al. | 2411.05354 | null |
2024-11-08 | Enhancing Depth Image Estimation for Underwater Robots by Combining Image Processing and Machine Learning | Quang Truong Nguyen et.al. | 2411.05344 | null |
2024-11-08 | A Quality-Centric Framework for Generic Deepfake Detection | Wentang Song et.al. | 2411.05335 | null |
2024-11-08 | Adaptive Whole-Body PET Image Denoising Using 3D Diffusion Models with ControlNet | Boxiao Yu et.al. | 2411.05302 | null |
2024-11-07 | Quantum Imaging and Metrology with Undetected squeezed Photons: Noise Canceling and Noise Based Imaging | S. Samimi et.al. | 2411.05175 | null |
2024-11-08 | SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007 | link |
2024-11-07 | Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models | Weixin Liang et.al. | 2411.04996 | null |
2024-11-07 | SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation | Koichi Namekata et.al. | 2411.04989 | null |
2024-11-07 | Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification | Mischa Dombrowski et.al. | 2411.04956 | null |
2024-11-07 | MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views | Yuedong Chen et.al. | 2411.04924 | link |
2024-11-07 | Differentiable Gaussian Representation for Incomplete CT Reconstruction | Shaokai Wu et.al. | 2411.04844 | null |
2024-11-07 | Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation | Benito Buchheim et.al. | 2411.04724 | null |
2024-11-06 | Multi-Reward as Condition for Instruction-based Image Editing | Xin Gu et.al. | 2411.04713 | null |
2024-11-06 | SEE-DPO: Self Entropy Enhanced Direct Preference Optimization | Shivanshu Shekhar et.al. | 2411.04712 | null |
2024-11-07 | Generative Semantic Communications with Foundation Models: Perception-Error Analysis and Semantic-Aware Power Allocation | Chunmei Xu et.al. | 2411.04575 | null |
2024-11-07 | Bayesian Calibration of Win Rate Estimation with LLM Evaluators | Yicheng Gao et.al. | 2411.04424 | link |
2024-11-07 | A Pre-training Framework that Encodes Noise Information for Speech Quality Assessment | Subrina Sultana et.al. | 2411.04379 | null |
2024-11-06 | X-ray Single-Pixel Imaging with MPGD-based detectors | M. Simões et.al. | 2411.03907 | null |
2024-11-06 | VQA |
Ziheng Jia et.al. | 2411.03795 | link |
2024-11-06 | MOS-Bench: Benchmarking Generalization Abilities of Subjective Speech Quality Assessment Models | Wen-Chin Huang et.al. | 2411.03715 | link |
2024-11-06 | Evaluating Eye Tracking Signal Quality with Real-time Gaze Interaction Simulation | Mehedi Hasan Raju et.al. | 2411.03708 | null |
2024-11-06 | Investigation of Inward-Outward Ring Permanent Magnet Array for Portable Magnetic Resonance Imaging (MRI) | Ting-Ou Liang et.al. | 2411.03249 | null |
2024-11-05 | The Impact of Medicaid Expansion on Medicare Quality Measures | Hala Algrain et.al. | 2411.03140 | null |
2024-11-05 | Investigating the Applicability of a Snapshot Computed Tomography Imaging Spectrometer for the Prediction of Brix and pH of Grapes | Mads Svanborg Peters et.al. | 2411.03114 | null |
2024-11-05 | Advances in Photoacoustic Imaging Reconstruction and Quantitative Analysis for Biomedical Applications | Lei Wang et.al. | 2411.02843 | null |
2024-11-04 | Interaction Design with Generative AI: An Empirical Study of Emerging Strategies Across the Four Phases of Design | Marie Muehlhaus et.al. | 2411.02662 | null |
2024-11-04 | Euclid: High-precision imaging astrometry and photometry from Early Release Observations. I. Internal kinematics of NGC 6397 by combining Euclid and Gaia data | M. Libralato et.al. | 2411.02487 | null |
2024-11-02 | Cross-D Conv: Cross-Dimensional Transferable Knowledge Base via Fourier Shifting Operation | Mehmet Can Yavuz et.al. | 2411.02441 | link |
2024-11-04 | Physically Based Neural Bidirectional Reflectance Distribution Function | Chenliang Zhou et.al. | 2411.02347 | null |
2024-11-04 | Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition | Xinkai Liu et.al. | 2411.02334 | null |
2024-11-03 | Degradation-Aware Residual-Conditioned Optimal Transport for Unified Image Restoration | Xiaole Tang et.al. | 2411.01656 | link |
2024-11-03 | Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation | Zhenbin Wang et.al. | 2411.01647 | null |
2024-11-03 | TPOT: Topology Preserving Optimal Transport in Retinal Fundus Image Enhancement | Xuanzhao Dong et.al. | 2411.01403 | null |
2024-11-02 | Interacting Large Language Model Agents. Interpretable Models and Social Learning | Adit Jain et.al. | 2411.01271 | null |
2024-11-02 | The impact of MRI image quality on statistical and predictive analysis on voxel based morphology | Felix Hoffstaedter et.al. | 2411.01268 | link |
2024-11-02 | Enhancing Diabetic Retinopathy Detection with CNN-Based Models: A Comparative Study of UNET and Stacked UNET Architectures | Ameya Uppina et.al. | 2411.01251 | null |
2024-11-02 | Real-Time Spatio-Temporal Reconstruction of Dynamic Endoscopic Scenes with 4D Gaussian Splatting | Fengze Li et.al. | 2411.01218 | null |
2024-11-01 | Evaluation Metric for Quality Control and Generative Models in Histopathology Images | Pranav Jeevan et.al. | 2411.01034 | null |
2024-11-01 | Re-thinking Richardson-Lucy without Iteration Cutoffs: Physically Motivated Bayesian Deconvolution | Zachary H. Hendrix et.al. | 2411.00991 | null |
2024-11-01 | Inter-Feature-Map Differential Coding of Surveillance Video | Kei Iino et.al. | 2411.00984 | null |
2024-11-01 | Scalable AI Framework for Defect Detection in Metal Additive Manufacturing | Duy Nhat Phan et.al. | 2411.00960 | null |
2024-11-01 | Intensity Field Decomposition for Tissue-Guided Neural Tomography | Meng-Xun Li et.al. | 2411.00900 | null |
2024-11-01 | CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes | Yang Liu et.al. | 2411.00771 | null |
2024-11-01 | Face Anonymization Made Simple | Han-Wei Kung et.al. | 2411.00762 | link |
2024-11-01 | Demystifying the use of Compression in Virtual Production | Anil Kokaram et.al. | 2411.00547 | null |
2024-11-01 | MV-Adapter: Enhancing Underwater Instance Segmentation via Adaptive Channel Attention | Lianjun Liu et.al. | 2411.00472 | null |
2024-10-31 | IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision | Maxwell Meyer et.al. | 2411.00252 | null |
2024-10-31 | Denoising study of Fluoroscopic Images in real time tumor tracking System based on Statistical model of noise | Yongxuan Yan et.al. | 2411.00199 | null |
2024-10-31 | Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning | Penghui Ruan et.al. | 2410.24219 | link |
2024-10-31 | AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization | Amir Kazemi et.al. | 2410.24116 | null |
2024-10-31 | Parameter choices in HaarPSI for IQA with medical images | Clemens Karner et.al. | 2410.24098 | link |
2024-10-31 | Advanced Predictive Quality Assessment for Ultrasonic Additive Manufacturing with Deep Learning Model | Lokendra Poudel et.al. | 2410.24055 | null |
2024-10-31 | Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation | Yihang Zhou et.al. | 2410.23962 | null |
2024-10-29 | Temporal and Spatial Super Resolution with Latent Diffusion Model in Medical MRI images | Vishal Dubey et.al. | 2410.23898 | null |
2024-10-31 | Cycle-Constrained Adversarial Denoising Convolutional Network for PET Image Denoising: Multi-Dimensional Validation on Large Datasets with Reader Study and Real Low-Dose Data | Yucun Hou et.al. | 2410.23628 | null |
2024-10-31 | LBurst: Learning-Based Robotic Burst Feature Extraction for 3D Reconstruction in Low Light | Ahalya Ravendran et.al. | 2410.23522 | null |
2024-10-30 | Plug-and-play superiorization | Jon Henshaw et.al. | 2410.23401 | null |
2024-10-30 | Redundant Cross-Correlation for Drift Correction in SEM Nanoparticle Imaging | Iago Bischoff Montenegro et.al. | 2410.23390 | link |
2024-10-30 | Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants | Azadeh Sharafi et.al. | 2410.23329 | null |
2024-10-30 | AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection | Yujin Wang et.al. | 2410.22939 | null |
2024-10-30 | Prune and Repaint: Content-Aware Image Retargeting for any Ratio | Feihong Shen et.al. | 2410.22865 | link |
2024-10-30 | Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images | Hanlin Wu et.al. | 2410.22830 | null |
2024-10-30 | Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models | Arash Marioriyad et.al. | 2410.22775 | null |
2024-10-30 | st-DTPM: Spatial-Temporal Guided Diffusion Transformer Probabilistic Model for Delayed Scan PET Image Prediction | Ran Hong et.al. | 2410.22732 | null |
2024-10-30 | FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution | Shuai Wang et.al. | 2410.22655 | null |
2024-10-31 | Consistency Diffusion Bridge Models | Guande He et.al. | 2410.22637 | null |
2024-10-29 | Deep Priors for Video Quality Prediction | Siddharath Narayan Shakya et.al. | 2410.22566 | null |
2024-10-29 | Enhancing Code Annotation Reliability: Generative AI's Role in Comment Quality Assessment Models | Seetharam Killivalavan et.al. | 2410.22323 | null |
2024-10-29 | Multimodal Semantic Communication for Generative Audio-Driven Video Conferencing | Haonan Tong et.al. | 2410.22112 | null |
2024-10-29 | Data Generation for Hardware-Friendly Post-Training Quantization | Lior Dikstein et.al. | 2410.22110 | link |
2024-10-29 | Adapting Diffusion Models for Improved Prompt Compliance and Controllable Image Synthesis | Deepak Sridhar et.al. | 2410.21638 | null |
2024-10-28 | Exploring the Design Space of Diffusion Bridge Models via Stochasticity Control | Shaorong Zhang et.al. | 2410.21553 | null |
2024-10-28 | SpeechQE: Estimating the Quality of Direct Speech Translation | HyoJung Han et.al. | 2410.21485 | link |
2024-10-28 | Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework | Vladimir Arkhipkin et.al. | 2410.21061 | link |
2024-10-28 | A Simple Yet Effective Corpus Construction Framework for Indonesian Grammatical Error Correction | Nankai Lin et.al. | 2410.20838 | link |
2024-10-28 | FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent Space | Yiyang Guo et.al. | 2410.20824 | null |
2024-10-28 | Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting | Jiawei Xu et.al. | 2410.20815 | null |
2024-10-28 | LoDAvatar: Hierarchical Embedding and Adaptive Levels of Detail with Gaussian Splatting for Enhanced Human Avatars | Xiaonuo Dongye et.al. | 2410.20789 | null |
2024-10-28 | CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians | Chongjian Ge et.al. | 2410.20723 | null |
2024-10-28 | ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings | Suyoung Lee et.al. | 2410.20686 | link |
2024-10-27 | Normal-GS: 3D Gaussian Splatting with Normal-Involved Rendering | Meng Wei et.al. | 2410.20593 | null |
2024-10-27 | Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network | Chongxiao Liu et.al. | 2410.20546 | link |
2024-10-27 | Enhancing Community Vision Screening -- AI Driven Retinal Photography for Early Disease Detection and Patient Trust | Xiaofeng Lei et.al. | 2410.20309 | null |
2024-10-27 | GUMBEL-NERF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields | Yusuke Sekikawa et.al. | 2410.20306 | null |
2024-10-26 | OAR-Weighted Dice Score: A spatially aware, radiosensitivity aware metric for target structure contour quality assessment | Lucas McCullum et.al. | 2410.20243 | null |
2024-10-26 | Cross-Platform Neural Video Coding: A Case Study | Ruhan Conceição et.al. | 2410.20145 | null |
2024-10-26 | Super-resolved virtual staining of label-free tissue using diffusion models | Yijie Zhang et.al. | 2410.20073 | null |
2024-10-25 | The Galaxy Zoo Catalogs for the Galaxy And Mass Assembly (GAMA) Survey | Benne W. Holwerda et.al. | 2410.19985 | null |
2024-10-25 | FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality | Zhengyao Lv et.al. | 2410.19355 | null |
2024-10-25 | Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion | Emiel Hoogeboom et.al. | 2410.19324 | null |
2024-10-24 | Optimising image capture for low-light widefield quantitative fluorescence microscopy | Zane Peterkovic et.al. | 2410.19210 | null |
2024-10-24 | Sort-free Gaussian Splatting via Weighted Sum Rendering | Qiqi Hou et.al. | 2410.18931 | null |
2024-10-24 | SafeBench: A Safety Evaluation Framework for Multimodal Large Language Models | Zonghao Ying et.al. | 2410.18927 | null |
2024-10-24 | Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances | Shilin Lu et.al. | 2410.18775 | link |
2024-10-24 | Advancements in Image Resolution: Super-Resolution Algorithm for Enhanced EOS-06 OCM-3 Data | Ankur Garg et.al. | 2410.18690 | null |
2024-10-24 | ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks | Renshuai Tao et.al. | 2410.18687 | null |
2024-10-24 | Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data | Anup Shirgaonkar et.al. | 2410.18588 | null |
2024-10-24 | ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis | Zezhong Wang et.al. | 2410.18447 | null |
2024-10-24 | FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling | Zhengqiang Zhang et.al. | 2410.18410 | link |
2024-10-23 | Neural Cover Selection for Image Steganography | Karl Chahine et.al. | 2410.18216 | link |
2024-10-23 | In-Pixel Foreground and Contrast Enhancement Circuits with Customizable Mapping | Md Rahatul Islam Udoy et.al. | 2410.18052 | null |
2024-10-23 | Scalable Ranked Preference Optimization for Text-to-Image Generation | Shyamgopal Karthik et.al. | 2410.18013 | null |
2024-10-23 | Together We Can: Multilingual Automatic Post-Editing for Low-Resource Languages | Sourabh Deoghare et.al. | 2410.17973 | null |
2024-10-23 | Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech | Danilo de Oliveira et.al. | 2410.17834 | null |
2024-10-23 | TopoQA: a topological deep learning-based approach for protein complex structure interface quality assessment | Bingqing Han et.al. | 2410.17815 | null |
2024-10-23 | An Intelligent Agentic System for Complex Image Restoration Problems | Kaiwen Zhu et.al. | 2410.17809 | link |
2024-10-24 | Testing Deep Learning Recommender Systems Models on Synthetic GAN-Generated Datasets | Jesús Bobadilla et.al. | 2410.17651 | null |
2024-10-25 | Comprehensive Evaluation of Matrix Factorization Models for Collaborative Filtering Recommender Systems | Jesús Bobadilla et.al. | 2410.17644 | null |
2024-10-23 | Bilateral Hippocampi Segmentation in Low Field MRIs Using Mutual Feature Learning via Dual-Views | Himashi Peiris et.al. | 2410.17502 | link |
2024-10-21 | MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors | Honghua Chen et.al. | 2410.16272 | null |
2024-10-21 | Multispectral Texture Synthesis using RGB Convolutional Neural Networks | Sélim Ollivier et.al. | 2410.16019 | null |
2024-10-22 | Wireless Link Quality Estimation Using LSTM Model | Yuki Kanto et.al. | 2410.15357 | null |
2024-10-19 | A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends | Junjun Jiang et.al. | 2410.15067 | link |
2024-10-18 | DRACO: Differentiable Reconstruction for Arbitrary CBCT Orbits | Chengze Ye et.al. | 2410.14900 | link |
2024-10-18 | Dynamic Negative Guidance of Diffusion Models | Felix Koulischer et.al. | 2410.14398 | null |
2024-10-18 | Gaia Data Release 3: spectroscopic binary-star orbital solutions and the SB1 processing chain | E. Gosset et.al. | 2410.14372 | null |
2024-10-18 | 2D-3D Deformable Image Registration of Histology Slide and Micro-CT with ML-based Initialization | Junan Chen et.al. | 2410.14343 | null |
2024-10-18 | Advanced Underwater Image Quality Enhancement via Hybrid Super-Resolution Convolutional Neural Networks and Multi-Scale Retinex-Based Defogging Techniques | Yugandhar Reddy Gogireddy et.al. | 2410.14285 | null |
2024-10-18 | Takin-ADA: Emotion Controllable Audio-Driven Animation with Canonical and Landmark Loss Optimization | Bin Lin et.al. | 2410.14283 | null |
2024-10-18 | Combining Hough Transform and Deep Learning Approaches to Reconstruct ECG Signals From Printouts | Felix Krones et.al. | 2410.14185 | null |
2024-10-18 | Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping | Renguang Chen et.al. | 2410.14161 | null |
2024-10-17 | Generating Signed Language Instructions in Large-Scale Dialogue Systems | Mert İnan et.al. | 2410.14026 | null |
2024-10-17 | Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens | Lijie Fan et.al. | 2410.13863 | null |
2024-10-15 | Comparison of Image Preprocessing Techniques for Vehicle License Plate Recognition Using OCR: Performance and Accuracy Evaluation | Renato Augusto Tavares et.al. | 2410.13622 | null |
2024-10-17 | L3DG: Latent 3D Gaussian Diffusion | Barbara Roessle et.al. | 2410.13530 | null |
2024-10-17 | Enhancing Crowdsourced Audio for Text-to-Speech Models | José Giraldo et.al. | 2410.13357 | null |
2024-10-17 | Active inference and deep generative modeling for cognitive ultrasound | Ruud JG van Sloun et.al. | 2410.13310 | null |
2024-10-17 | Latent Image and Video Resolution Prediction using Convolutional Neural Networks | Rittwika Kansabanik et.al. | 2410.13227 | null |
2024-10-17 | Anchored Alignment for Self-Explanations Enhancement | Luis Felipe Villa-Arenas et.al. | 2410.13216 | null |
2024-10-17 | Using RLHF to align speech enhancement approaches to mean-opinion quality scores | Anurag Kumar et.al. | 2410.13182 | null |
2024-10-16 | Super-resolving Real-world Image Illumination Enhancement: A New Dataset and A Conditional Diffusion Model | Yang Liu et.al. | 2410.12961 | null |
2024-10-16 | Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization | Xingqi Wang et.al. | 2410.12700 | link |
2024-10-16 | SWIM: An Attention-Only Model for Speech Quality Assessment Under Subjective Variance | Imran E Kibria et.al. | 2410.12675 | null |
2024-10-16 | MambaPainter: Neural Stroke-Based Rendering in a Single Step | Tomoya Sawada et.al. | 2410.12524 | link |
2024-10-16 | Conditional Outcome Equivalence: A Quantile Alternative to CATE | Josh Givens et.al. | 2410.12454 | link |
2024-10-16 | Triplet: Triangle Patchlet for Mesh-Based Inverse Rendering and Scene Parameters Approximation | Jiajie Yang et.al. | 2410.12414 | link |
2024-10-14 | Learnable Optimization-Based Algorithms for Low-Dose CT Reconstruction | Daisy Chen et.al. | 2410.11903 | null |
2024-10-15 | Generative Image Steganography Based on Point Cloud | Zhong Yangjie et.al. | 2410.11673 | null |
2024-10-15 | Fast Local Neural Regression for Low-Cost, Path Traced Lambertian Global Illumination | Arturo Salmi et.al. | 2410.11625 | null |
2024-10-15 | Rician Denoising Diffusion Probabilistic Models For Sodium Breast MRI Enhancement | Shuaiyu Yuan et.al. | 2410.11511 | null |
2024-10-15 | Visual-Geometric Collaborative Guidance for Affordance Learning | Hongchen Luo et.al. | 2410.11363 | link |
2024-10-15 | Evolutionary Retrofitting | Mathurin Videau et.al. | 2410.11330 | null |
2024-10-14 | Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation | Emmanouil Zaranis et.al. | 2410.10995 | null |
2024-10-14 | LVD-2M: A Long-take Video Dataset with Temporally Dense Captions | Tianwei Xiong et.al. | 2410.10816 | link |
2024-10-14 | Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention | Dejia Xu et.al. | 2410.10774 | null |
2024-10-14 | LISAC: Learned Coded Waveform Design for ISAC with OFDM | Chenghong Bian et.al. | 2410.10711 | null |
2024-10-14 | A Novel No-Reference Image Quality Metric For Assessing Sharpness In Satellite Imagery | Lucas Gonzalo Antonel et.al. | 2410.10488 | null |
2024-10-14 | Two-Stage Approach for Brain MR Image Synthesis: 2D Image Synthesis and 3D Refinement | Jihoon Cho et.al. | 2410.10269 | null |
2024-10-14 | Saliency Guided Optimization of Diffusion Latents | Xiwen Wang et.al. | 2410.10257 | null |
2024-10-14 | QE-EBM: Using Quality Estimators as Energy Loss for Machine Translation | Gahyun Yoo et.al. | 2410.10228 | null |
2024-10-14 | Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models | Yongjin Yang et.al. | 2410.10166 | null |
2024-10-14 | StegaINR4MIH: steganography by implicit neural representation for multi-image hiding | Weina Dong et.al. | 2410.10117 | link |
2024-10-13 | Crowd IQ -- Aggregating Opinions to Boost Performance | Michal Kosinski et.al. | 2410.10004 | null |
2024-10-13 | Combining Generative and Geometry Priors for Wide-Angle Portrait Correction | Lan Yao et.al. | 2410.09911 | link |
2024-10-13 | Two-Stage Human Verification using HandCAPTCHA and Anti-Spoofed Finger Biometrics with Feature Selection | Asish Bera et.al. | 2410.09866 | null |
2024-10-12 | Preserving Old Memories in Vivid Detail: Human-Interactive Photo Restoration Framework | Seung-Yeon Back et.al. | 2410.09529 | null |
2024-10-12 | Fine-grained subjective visual quality assessment for high-fidelity compressed images | Michela Testolina et.al. | 2410.09501 | link |
2024-10-12 | Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors | Hritam Basak et.al. | 2410.09467 | null |
2024-10-11 | TD-Paint: Faster Diffusion Inpainting Through Time Aware Pixel Conditioning | Tsiry Mayet et.al. | 2410.09306 | null |
2024-10-11 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | link |
2024-10-11 | Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars | Xuan Huang et.al. | 2410.08840 | link |
2024-10-11 | Towards virtual painting recolouring using Vision Transformer on X-Ray Fluorescence datacubes | Alessandro Bombini et.al. | 2410.08826 | null |
2024-10-11 | A Theoretical Framework for AI-driven data quality monitoring in high-volume data environments | Nikhil Bangad et.al. | 2410.08576 | null |
2024-10-11 | Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models | Pascl Zwick et.al. | 2410.08551 | link |
2024-10-11 | Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities | Abhijay Ghildyal et.al. | 2410.08534 | null |
2024-10-10 | Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content | Qiuheng Wang et.al. | 2410.08260 | null |
2024-10-10 | Exploring ASR-Based Wav2Vec2 for Automated Speech Disorder Assessment: Insights and Analysis | Tuan Nguyen et.al. | 2410.08250 | null |
2024-10-10 | ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion | Zitian Zhang et.al. | 2410.08168 | null |
2024-10-10 | Efficient Perspective-Correct 3D Gaussian Splatting Using Hybrid Transparency | Florian Hahlbohm et.al. | 2410.08129 | null |
2024-10-10 | Medical Image Quality Assessment based on Probability of Necessity and Sufficiency | Boyu Chen et.al. | 2410.08118 | null |
2024-10-10 | High-redshift LBG selection from broadband and wide photometric surveys using a Random Forest algorithm | C. Payerne et.al. | 2410.08062 | null |
2024-10-10 | Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation | Sweta Agrawal et.al. | 2410.07779 | null |
2024-10-10 | Synthesizing Multi-Class Surgical Datasets with Anatomy-Aware Diffusion Models | Danush Kumar Venkatesh et.al. | 2410.07753 | link |
2024-10-10 | Multi-Facet Counterfactual Learning for Content Quality Evaluation | Jiasheng Zheng et.al. | 2410.07693 | null |
2024-10-10 | DPL: Cross-quality DeepFake Detection via Dual Progressive Learning | Dongliang Zhang et.al. | 2410.07633 | null |
2024-10-10 | Rank Aggregation in Crowdsourcing for Listwise Annotations | Wenshui Luo et.al. | 2410.07538 | null |
2024-10-10 | A 3D-Printed Table for Hybrid X-ray CT and Optical Imaging of a Live Mouse | Wenxuan Xue et.al. | 2410.07517 | null |
2024-10-09 | An undetectable watermark for generative image models | Sam Gunn et.al. | 2410.07369 | link |
2024-10-09 | Secure Video Quality Assessment Resisting Adversarial Attacks | Ao-Xiang Zhang et.al. | 2410.06866 | null |
2024-10-09 | Diff-FMT: Diffusion Models for Fluorescence Molecular Tomography | Qianqian Xue et.al. | 2410.06757 | null |
2024-10-09 | MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes | Zhenhui Ye et.al. | 2410.06734 | null |
2024-10-09 | Perceptual Quality Assessment of Octree-RAHT Encoded 3D Point Clouds | Dongshuai Duan et.al. | 2410.06729 | link |
2024-10-09 | Perceptual Quality Assessment of Trisoup-Lifting Encoded 3D Point Clouds | Juncheng Long et.al. | 2410.06689 | link |
2024-10-09 | SCOREQ: Speech Quality Assessment with Contrastive Regression | Alessandro Ragano et.al. | 2410.06675 | link |
2024-10-09 | InstantIR: Blind Image Restoration with Instant Generative Reference | Jen-Yuan Huang et.al. | 2410.06551 | null |
2024-10-08 | Are Large Language Models State-of-the-art Quality Estimators for Machine Translation of User-generated Content? | Shenbin Qian et.al. | 2410.06338 | link |
2024-10-08 | Automated quality assessment using appearance-based simulations and hippocampus segmentation on low-field paediatric brain MR images | Vaanathi Sundaresan et.al. | 2410.06161 | link |
2024-10-08 | Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach | Sha Guo et.al. | 2410.06149 | null |
2024-10-08 | AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation | Boyuan Cao et.al. | 2410.06055 | link |
2024-10-08 | Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization | Wei Liu et.al. | 2410.06003 | link |
2024-10-08 | Integrating Online Learning and Connectivity Maintenance for Communication-Aware Multi-Robot Coordination | Yupeng Yang et.al. | 2410.05798 | link |
2024-10-08 | T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design | Jiachen Li et.al. | 2410.05677 | null |
2024-10-08 | Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning | Saemi Moon et.al. | 2410.05664 | null |
2024-10-08 | Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree? | Xueru Wen et.al. | 2410.05584 | null |
2024-10-07 | Image Watermarks are Removable Using Controllable Regeneration from Clean Noise | Yepeng Liu et.al. | 2410.05470 | null |
2024-10-07 | SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones | Denis Davletshin et.al. | 2410.05405 | null |
2024-10-07 | Towards a Modern and Lightweight Rendering Engine for Dynamic Robotic Simulations | Christopher John Allison et.al. | 2410.05095 | null |
2024-10-07 | Real-time cardiac cine MRI -- A comparison of a diffusion probabilistic model with alternative state-of-the-art image reconstruction techniques for undersampled spiral acquisitions | Oliver Schad et.al. | 2410.04843 | null |
2024-10-07 | Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration | Zhiyu Zhu et.al. | 2410.04811 | link |
2024-10-07 | Transforming Color: A Novel Image Colorization Method | Hamza Shafiq et.al. | 2410.04799 | null |
2024-10-07 | CAR: Controllable Autoregressive Modeling for Visual Generation | Ziyu Yao et.al. | 2410.04671 | link |
2024-10-07 | Federated Learning Nodes Can Reconstruct Peers' Image Data | Ethan Wilson et.al. | 2410.04661 | null |
2024-10-06 | Towards Unsupervised Blind Face Restoration using Diffusion Prior | Tianshu Kuai et.al. | 2410.04618 | null |
2024-10-06 | How Does the Disclosure of AI Assistance Affect the Perceptions of Writing? | Zhuoyan Li et.al. | 2410.04545 | null |
2024-10-06 | VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide | Dohun Lee et.al. | 2410.04364 | null |
2024-10-05 | Persona Knowledge-Aligned Prompt Tuning Method for Online Debate | Chunkit Chan et.al. | 2410.04239 | link |
2024-10-05 | AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results | Ivan Molodetskikh et.al. | 2410.04225 | null |
2024-10-05 | Deep Transfer Learning Based Peer Review Aggregation and Meta-review Generation for Scientific Articles | Md. Tarek Hasan et.al. | 2410.04202 | null |
2024-10-05 | Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model | Keda Tao et.al. | 2410.04161 | null |
2024-10-05 | Can the Variation of Model Weights be used as a Criterion for Self-Paced Multilingual NMT? | Àlex R. Atrio et.al. | 2410.04147 | null |
2024-10-05 | Beyond Imperfections: A Conditional Inpainting Approach for End-to-End Artifact Removal in VTON and Pose Transfer | Aref Tabatabaei et.al. | 2410.04052 | null |
2024-10-04 | LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding | Doohyuk Jang et.al. | 2410.03355 | null |
2024-10-04 | CLOVE: Travelling Salesman's approach to hyperbolic embeddings of complex networks with communities | Sámuel G. Balogh et.al. | 2410.03270 | null |
2024-10-04 | Parallel Corpus Augmentation using Masked Language Models | Vibhuti Kumari et.al. | 2410.03194 | null |
2024-10-04 | ECHOPulse: ECG controlled echocardio-grams video generation | Yiwei Li et.al. | 2410.03143 | link |
2024-10-03 | Diffusion-based Extreme Image Compression with Compressed Feature Initialization | Zhiyuan Li et.al. | 2410.02640 | link |
2024-10-03 | An Improved Variational Method for Image Denoising | Jing-En Huang et.al. | 2410.02587 | null |
2024-10-03 | Combining Pre- and Post-Demosaicking Noise Removal for RAW Video | Marco Sánchez-Beeckman et.al. | 2410.02572 | null |
2024-10-03 | Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grained Image Quality Assessment | Kai Liu et.al. | 2410.02505 | link |
2024-10-03 | Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models | Seyedmorteza Sadat et.al. | 2410.02416 | null |
2024-10-03 | Morphological evaluation of subwords vocabulary used by BETO language model | Óscar García-Sierra et.al. | 2410.02283 | null |
2024-10-03 | SC-CDM: Enhancing Quality of Image Semantic Communication with a Compact Diffusion Model | Kexin Zhang et.al. | 2410.02121 | null |
2024-10-02 | DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation | Jing He et.al. | 2410.02067 | null |
2024-10-02 | Impact of White-Box Adversarial Attacks on Convolutional Neural Networks | Rakesh Podder et.al. | 2410.02043 | null |
2024-10-02 | Social Media Authentication and Combating Deepfakes using Semi-fragile Invisible Image Watermarking | Aakash Varma Nadimpalli et.al. | 2410.01906 | null |
2024-10-02 | Enhancing LLM Fine-tuning for Text-to-SQLs by SQL Quality Measurement | Shouvon Sarker et.al. | 2410.01869 | null |
2024-10-02 | ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation | Rinon Gal et.al. | 2410.01731 | null |
2024-10-04 | HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration | Yushi Huang et.al. | 2410.01723 | null |
2024-10-02 | Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding | Yao Teng et.al. | 2410.01699 | link |
2024-10-02 | SAFE: Semantic Adaptive Feature Extraction with Rate Control for 6G Wireless Communications | Yuna Yan et.al. | 2410.01597 | null |
2024-10-02 | Frequency-Dependent F-Numbers Suppress Grating Lobes and Improve the Lateral Resolution in Line-by-Line Scanning | Martin F. Schiffner et.al. | 2410.01593 | null |
2024-10-02 | Imaging foundation model for universal enhancement of non-ideal measurement CT | Yuxin Liu et.al. | 2410.01591 | link |
2024-10-02 | HARMONI at ELT: tolerance analysis and expected as-build imaging performance of the infrared spectrograph | Eduard Muslimov et.al. | 2410.01581 | null |
2024-10-02 | Adaptive Radiofrequency Shimming in MRI using Reconfigurable Dielectric Materials | Paulina Šiurytė et.al. | 2410.01501 | null |
2024-10-02 | Quo Vadis RankList-based System in Face Recognition? | Xinyi Zhang et.al. | 2410.01498 | null |
2024-10-02 | Design of a custom wideband camera for MISTRAL imager-spectrograph | Eduard Muslimov et.al. | 2410.01414 | null |
2024-10-02 | CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment | Safouane El Ghazouali et.al. | 2410.01411 | link |
2024-10-01 | Generating Seamless Virtual Immunohistochemical Whole Slide Images with Content and Color Consistency | Sitong Liu et.al. | 2410.01072 | null |
2024-10-01 | LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details | Jian Yang et.al. | 2410.00990 | null |
2024-10-01 | Energy-Quality-aware Variable Framerate Pareto-Front for Adaptive Video Streaming | Prajit T Rajendran et.al. | 2410.00849 | null |
2024-10-01 | Maximum entropy and quantized metric models for absolute category ratings | Dietmar Saupe et.al. | 2410.00817 | null |
2024-10-01 | Basis function compression for field probe monitoring | Paul Dubovan et.al. | 2410.00754 | null |
2024-10-01 | Development of the normalization method for the first large field-of-view plastic-based PET Modular scanner | A. Coussat et.al. | 2410.00669 | null |
2024-10-01 | Contribution of soundscape appropriateness to soundscape quality assessment in space: a mediating variable affecting acoustic comfort | Xinhao Yang et.al. | 2410.00667 | null |
2024-10-01 | AutoTM 2.0: Automatic Topic Modeling Framework for Documents Analysis | Maria Khodorchenko et.al. | 2410.00655 | null |
2024-10-01 | Dynamic and Scalable Data Preparation for Object-Centric Process Mining | Lien Bosmans et.al. | 2410.00596 | null |
2024-09-30 | UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation | Cheng Zhang et.al. | 2409.20197 | link |
2024-09-30 | Segmenting Wood Rot using Computer Vision Models | Roland Kammerbauer et.al. | 2409.20137 | null |
2024-09-30 | Machine Learning in Industrial Quality Control of Glass Bottle Prints | Maximilian Bundscherer et.al. | 2409.20132 | null |
2024-09-30 | Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs | Zicheng Zhang et.al. | 2409.20063 | null |
2024-09-30 | Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis | Hippolyte Gisserot-Boukhlef et.al. | 2409.20059 | null |
2024-10-01 | UniSumEval: Towards Unified, Fine-Grained, Multi-Dimensional Summarization Evaluation for LLMs | Yuho Lee et.al. | 2409.19898 | link |
2024-09-29 | OrganiQ: Mitigating Classical Resource Bottlenecks of Quantum Generative Adversarial Networks on NISQ-Era Machines | Daniel Silver et.al. | 2409.19823 | null |
2024-09-29 | SemiDDM-Weather: A Semi-supervised Learning Framework for All-in-one Adverse Weather Removal | Fang Long et.al. | 2409.19679 | link |
2024-09-29 | Effective Diffusion Transformer Architecture for Image Super-Resolution | Kun Cheng et.al. | 2409.19589 | link |
2024-09-29 | High Quality Human Image Animation using Regional Supervision and Motion Blur Condition | Zhongcong Xu et.al. | 2409.19580 | null |
2024-09-27 | A comprehensive review and new taxonomy on superpixel segmentation | I. B. Barcelos et.al. | 2409.19179 | link |
2024-09-27 | Multimodal Pragmatic Jailbreak on Text-to-image Models | Tong Liu et.al. | 2409.19149 | null |
2024-09-27 | ReviveDiff: A Universal Diffusion Model for Restoring Images in Adverse Weather Conditions | Wenfeng Huang et.al. | 2409.18932 | null |
2024-09-27 | Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors | Yunlong Lin et.al. | 2409.18899 | null |
2024-09-27 | Effectiveness of learning-based image codecs on fingerprint storage | Daniele Mari et.al. | 2409.18730 | link |
2024-09-27 | Decoding Complexity-Rate-Quality Pareto-Front for Adaptive VVC Streaming | Angeliki Katsenou et.al. | 2409.18713 | null |
2024-09-27 | Align |
Hongzhe Huang et.al. | 2409.18541 | link |
2024-09-27 | Underwater Image Enhancement with Physical-based Denoising Diffusion Implicit Models | Nguyen Gia Bach et.al. | 2409.18476 | link |
2024-09-27 | GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation | Jiawei Lu et.al. | 2409.18401 | null |
2024-09-27 | SinoSynth: A Physics-based Domain Randomization Approach for Generalizable CBCT Image Enhancement | Yunkui Pang et.al. | 2409.18355 | link |
2024-09-26 | FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner | Wenliang Zhao et.al. | 2409.18128 | link |
2024-09-26 | Low Photon Number Non-Invasive Imaging Through Time-Varying Diffusers | Adrian Makowski et.al. | 2409.18072 | null |
2024-09-26 | LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field | Huan Wang et.al. | 2409.18057 | link |
2024-09-26 | MARS: Multi-radio Architecture with Radio Selection using Decision Trees for emerging mesoscale CPS/IoT applications | Jothi Prasanna Shanmuga Sundaram et.al. | 2409.18043 | null |
2024-09-26 | PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging | Xin Cai et.al. | 2409.17996 | null |
2024-09-26 | Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation | Qihan Huang et.al. | 2409.17920 | link |
2024-09-26 | Cross-lingual Human-Preference Alignment for Neural Machine Translation with Direct Quality Optimization | Kaden Uhlig et.al. | 2409.17673 | null |
2024-09-26 | FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates | Nicola Pia et.al. | 2409.17635 | null |
2024-09-26 | Pixel-Space Post-Training of Latent Diffusion Models | Christina Zhang et.al. | 2409.17565 | null |
2024-09-26 | Study of Subjective and Objective Quality in Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset | Yongrok Kim et.al. | 2409.17451 | null |
2024-09-25 | DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion | Yukun Huang et.al. | 2409.17145 | link |
2024-09-25 | Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts | Mohammad Sadil Khan et.al. | 2409.17106 | null |
2024-09-25 | Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model | Xinfeng Wei et.al. | 2409.17104 | null |
2024-09-25 | The effect of image quality on galaxy merger identification with deep learning | Robert W. Bickley et.al. | 2409.17081 | null |
2024-09-25 | Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors | Aiping Zhang et.al. | 2409.17058 | link |
2024-09-25 | MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features | Katharina Anderer et.al. | 2409.16765 | link |
2024-09-25 | Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image Translation | Youngwan Jin et.al. | 2409.16706 | null |
2024-09-25 | In which fields can ChatGPT detect journal article quality? An evaluation of REF2021 results | Mike Thelwall et.al. | 2409.16695 | null |
2024-09-25 | Morphological-consistent Diffusion Network for Ultrasound Coronal Image Enhancement | Yihao Zhou et.al. | 2409.16661 | null |
2024-09-25 | Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts | Taehun Cha et.al. | 2409.16658 | link |
2024-09-25 | Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation | Siyin Wang et.al. | 2409.16644 | null |
2024-09-25 | DeformStream: Deformation-based Adaptive Volumetric Video Streaming | Boyan Li et.al. | 2409.16615 | null |
2024-09-25 | Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models | Deepak Sridhar et.al. | 2409.16535 | link |
2024-09-24 | Low Latency Point Cloud Rendering with Learned Splatting | Yueyu Hu et.al. | 2409.16504 | link |
2024-09-24 | A Unified Hallucination Mitigation Framework for Large Vision-Language Models | Yue Chang et.al. | 2409.16494 | link |
2024-09-24 | AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Vlad Hosu et.al. | 2409.16271 | null |
2024-09-26 | Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients | Wanchen Zhao et.al. | 2409.16042 | null |
2024-09-24 | Deep chroma compression of tone-mapped images | Xenios Milidonis et.al. | 2409.16032 | link |
2024-09-24 | VascX Models: Model Ensembles for Retinal Vascular Analysis from Color Fundus Images | Jose Vargas Quiros et.al. | 2409.16016 | link |
2024-09-24 | Semantics-Controlled Gaussian Splatting for Outdoor Scene Reconstruction and Rendering in Virtual Reality | Hannah Schieber et.al. | 2409.15959 | null |
2024-09-24 | Unsupervised dMRI Artifact Detection via Angular Resolution Enhancement and Cycle Consistency Learning | Sheng Chen et.al. | 2409.15883 | null |
2024-09-25 | Ring Artifacts Removal Based on Implicit Neural Representation of Sinogram Data | Ligen Shi et.al. | 2409.15731 | null |
2024-09-23 | Blind Localization of Early Room Reflections with Arbitrary Microphone Array | Yogev Hadadi et.al. | 2409.15484 | null |
2024-09-23 | Simplifying Triangle Meshes in the Wild | Hsueh-Ti Derek Liu et.al. | 2409.15458 | null |
2024-09-23 | MIMAFace: Face Animation via Motion-Identity Modulated Appearance Feature Learning | Yue Han et.al. | 2409.15179 | null |
2024-09-23 | Advancing Video Quality Assessment for AIGC | Xinli Yue et.al. | 2409.14888 | null |
2024-09-23 | Revisiting Video Quality Assessment from the Perspective of Generalization | Xinli Yue et.al. | 2409.14847 | link |
2024-09-23 | AIM 2024 Challenge on Video Saliency Prediction: Methods and Results | Andrey Moskalenko et.al. | 2409.14827 | link |
2024-09-23 | HiFi-Glot: Neural Formant Synthesis with Differentiable Resonant Filters | Lauri Juvela et.al. | 2409.14823 | null |
2024-09-22 | Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments with Advanced Post-Processing | Wenze Ren et.al. | 2409.14554 | null |
2024-09-22 | Improved direction of arrival estimations with a wearable microphone array for dynamic environments by reliability weighting | Daniel A. Mitchell et.al. | 2409.14346 | null |
2024-09-22 | MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators | Qingyu Lu et.al. | 2409.14335 | null |
2024-09-22 | Quantitative and Qualitative Evaluation of NLM and Wavelet Methods in Image Enhancement | Cameron Khanpour et.al. | 2409.14334 | null |
2024-09-21 | JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation | Hadrien Reynaud et.al. | 2409.14149 | null |
2024-09-21 | N-Version Assessment and Enhancement of Generative AI | Marcus Kessel et.al. | 2409.14071 | null |
2024-09-18 | An Efficient Projection-Based Next-best-view Planning Framework for Reconstruction of Unknown Objects | Zhizhou Jia et.al. | 2409.12096 | null |
2024-09-18 | Dense-TSNet: Dense Connected Two-Stage Structure for Ultra-Lightweight Speech Enhancement | Zizhen Lin et.al. | 2409.11725 | null |
2024-09-18 | DAF-Net: A Dual-Branch Feature Decomposition Fusion Network with Domain Adaptive for Infrared and Visible Image Fusion | Jian Xu et.al. | 2409.11642 | link |
2024-09-17 | Noise-aware Dynamic Image Denoising and Positron Range Correction for Rubidium-82 Cardiac PET Imaging via Self-supervision | Huidong Xie et.al. | 2409.11543 | null |
2024-09-17 | Online 4D Ultrasound-Guided Robotic Tracking Enables 3D Ultrasound Localisation Microscopy with Large Tissue Displacements | Jipeng Yan et.al. | 2409.11391 | null |
2024-09-17 | Ultrasound Image Enhancement with the Variance of Diffusion Models | Yuxin Zhang et.al. | 2409.11380 | link |
2024-09-17 | Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks | Edgar Heinert et.al. | 2409.11373 | link |
2024-09-17 | Edge-based Denoising Image Compression | Ryugo Morita et.al. | 2409.10978 | null |
2024-09-17 | CUNSB-RFIE: Context-aware Unpaired Neural Schrödinger Bridge in Retinal Fundus Image Enhancement | Xuanzhao Dong et.al. | 2409.10966 | link |
2024-09-17 | Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending | Yongyang Pan et.al. | 2409.10958 | null |
2024-09-17 | Neural Fields for Adaptive Photoacoustic Computed Tomography | Tianao Li et.al. | 2409.10876 | null |
2024-09-16 | Investigating Training Objectives for Generative Speech Enhancement | Julius Richter et.al. | 2409.10753 | link |
2024-09-16 | Taming Diffusion Models for Image Restoration: A Review | Ziwei Luo et.al. | 2409.10353 | null |
2024-09-16 | FGR-Net:Interpretable fundus imagegradeability classification based on deepreconstruction learning | Saif Khalid et.al. | 2409.10246 | null |
2024-09-16 | RF-GML: Reference-Free Generative Machine Listener | Arijit Biswas et.al. | 2409.10210 | null |
2024-09-16 | Towards Explainable Automated Data Quality Enhancement without Domain Knowledge | Djibril Sarr et.al. | 2409.10139 | null |
2024-09-16 | 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction | Atsuya Nakata et.al. | 2409.09969 | link |
2024-09-15 | A Global Perspective on the Past, Present, and Future of Video Streaming over Starlink | Liz Izhikevich et.al. | 2409.09846 | null |
2024-09-15 | Underwater Image Enhancement via Dehazing and Color Restoration | Chengqin Wu et.al. | 2409.09779 | null |
2024-09-15 | High Definition Map Mapping and Update: A General Overview and Future Directions | Benny Wijaya et.al. | 2409.09726 | null |
2024-09-15 | Superconducting and low temperature RF Coils for Ultra-Low-Field MRI: A Study on SNR Performance | Aditya A Bhosale et.al. | 2409.09608 | null |
2024-09-14 | Estimating Neural Orientation Distribution Fields on High Resolution Diffusion MRI Scans | Mohammed Munzer Dwedari et.al. | 2409.09387 | link |
2024-09-13 | Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions | Zahra Ashktorab et.al. | 2409.08937 | null |
2024-09-13 | Confocal Raman Microscopy with Adaptive Optics | J. D. Munoz-Bolanos et.al. | 2409.08725 | null |
2024-09-13 | Joint image reconstruction and segmentation of real-time cardiac MRI in free-breathing using a model based on disentangled representation learning | Tobias Wech et.al. | 2409.08619 | null |
2024-09-13 | DiffFAS: Face Anti-Spoofing via Generative Diffusion Models | Xinxu Ge et.al. | 2409.08572 | link |
2024-09-13 | CasDyF-Net: Image Dehazing via Cascaded Dynamic Filters | Wang Yinglong et.al. | 2409.08510 | link |
2024-09-12 | OpenACE: An Open Benchmark for Evaluating Audio Coding Performance | Jozef Coldenhoff et.al. | 2409.08374 | link |
2024-09-12 | Expansive Supervision for Neural Radiance Field | Weixiang Zhang et.al. | 2409.08056 | null |
2024-09-12 | OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation | Shun Zou et.al. | 2409.08000 | link |
2024-09-14 | Exploring Kolmogorov-Arnold networks for realistic image sharpness assessment | Shaode Yu et.al. | 2409.07762 | null |
2024-09-11 | Foundation Models Boost Low-Level Perceptual Similarity Metrics | Abhijay Ghildyal et.al. | 2409.07650 | link |
2024-09-11 | Machine Learning and Constraint Programming for Efficient Healthcare Scheduling | Aymen Ben Said et.al. | 2409.07547 | null |
2024-09-11 | FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process | Yang Luo et.al. | 2409.07451 | null |
2024-09-11 | EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion | Jian Zhang et.al. | 2409.07255 | null |
2024-09-12 | 3DGCQA: A Quality Assessment Database for 3D AI-Generated Contents | Yingjie Zhou et.al. | 2409.07236 | link |
2024-09-11 | Phantom-based gradient waveform measurements with compensated variable-prephasing: Description and application to EPI at 7T | Hannah Scholten et.al. | 2409.07203 | null |
2024-09-11 | Attention Down-Sampling Transformer, Relative Ranking and Self-Consistency for Blind Image Quality Assessment | Mohammed Alsaafin et.al. | 2409.07115 | link |
2024-09-11 | CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion | Joshua Kazdan et.al. | 2409.07025 | null |
2024-09-11 | AdvLogo: Adversarial Patch Attack against Object Detectors based on Diffusion Models | Boming Miao et.al. | 2409.07002 | null |
2024-09-10 | ExIQA: Explainable Image Quality Assessment Using Distortion Attributes | Sepehr Kazemi Ranjbar et.al. | 2409.06853 | null |
2024-09-10 | Universal End-to-End Neural Network for Lossy Image Compression | Bouzid Arezki et.al. | 2409.06586 | null |
2024-09-10 | Three-dimensional generative adversarial networks for turbulent flow estimation from wall measurements | Antonio Cuéllar et.al. | 2409.06548 | null |
2024-09-11 | AMNS: Attention-Weighted Selective Mask and Noise Label Suppression for Text-to-Image Person Retrieval | Runqing Zhang et.al. | 2409.06385 | null |
2024-09-10 | Multi-Weather Image Restoration via Histogram-Based Transformer Feature Enhancement | Yang Wen et.al. | 2409.06334 | null |
2024-09-10 | DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing | Kuang Yuan et.al. | 2409.06137 | null |
2024-09-09 | Enhancing Cross-Modality Synthesis: Subvolume Merging for MRI-to-CT Conversion | Fuxin Fan et.al. | 2409.05982 | null |
2024-09-09 | SynMorph: Generating Synthetic Face Morphing Dataset with Mated Samples | Haoyu Zhang et.al. | 2409.05595 | null |
2024-09-09 | Efficient Quality Estimation of True Random Bit-streams | Cesare Caratozzolo et.al. | 2409.05543 | null |
2024-09-09 | Exploring Rich Subjective Quality Information for Image Quality Assessment in the Wild | Xiongkuo Min et.al. | 2409.05540 | null |
2024-09-09 | A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression | Nora Hofer et.al. | 2409.05490 | null |
2024-09-09 | Boosting CLIP Adaptation for Image Quality Assessment via Meta-Prompt Learning and Gradient Regularization | Xudong Li et.al. | 2409.05381 | null |
2024-09-09 | PersonaTalk: Bring Attention to Your Persona in Visual Dubbing | Longhao Zhang et.al. | 2409.05379 | null |
2024-09-09 | BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec | Detai Xin et.al. | 2409.05377 | link |
2024-09-09 | Adaptive Offloading and Enhancement for Low-Light Video Analytics on Mobile Devices | Yuanyi He et.al. | 2409.05297 | null |
2024-09-08 | Ultron: Enabling Temporal Geometry Compression of 3D Mesh Sequences using Temporal Correspondence and Mesh Deformation | Haichao Zhu et.al. | 2409.05151 | null |
2024-09-07 | Plug-and-Hide: Provable and Adjustable Diffusion Generative Steganography | Jiahao Zhu et.al. | 2409.04878 | null |
2024-09-07 | Metadata augmented deep neural networks for wild animal classification | Aslak Tøn et.al. | 2409.04825 | link |
2024-09-11 | Fisheye-GS: Lightweight and Extensible Gaussian Splatting Module for Fisheye Cameras | Zimu Liao et.al. | 2409.04751 | link |
2024-09-06 | Whole Heart Perfusion with High-Multiband Simultaneous Multislice Imaging via Linear Phase Modulated Extended Field of View (SMILE) | Shen Zhao et.al. | 2409.04353 | link |
2024-09-06 | Design and Characterization of MRI-compatible Plastic Ultrasonic Motor | Zhanyue Zhao et.al. | 2409.04006 | null |
2024-09-06 | Bi-modality Images Transfer with a Discrete Process Matching Method | Zhe Xiong et.al. | 2409.03977 | null |
2024-09-03 | Applications and Advances of Artificial Intelligence in Music Generation:A Review | Yanxu Chen et.al. | 2409.03715 | null |
2024-09-05 | Enabling Practical and Privacy-Preserving Image Processing | Chao Wang et.al. | 2409.03568 | null |
2024-09-05 | Use of triplet loss for facial restoration in low-resolution images | Sebastian Pulgar et.al. | 2409.03530 | null |
2024-09-05 | Improving Uncertainty-Error Correspondence in Deep Bayesian Medical Image Segmentation | Prerak Mody et.al. | 2409.03470 | link |
2024-09-05 | Multiple weather images restoration using the task transformer and adaptive mixup strategy | Yang Wen et.al. | 2409.03249 | null |
2024-09-05 | Perceptual-Distortion Balanced Image Super-Resolution is a Multi-Objective Optimization Problem | Qiwen Zhu et.al. | 2409.03179 | link |
2024-09-05 | Large Étendue 3D Holographic Display with Content-adpative Dynamic Fourier Modulation | Brian Chao et.al. | 2409.03143 | null |
2024-09-04 | Incorporating dense metric depth into neural 3D representations for view synthesis and relighting | Arkadeep Narayan Chaudhury et.al. | 2409.03061 | null |
2024-09-04 | Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models | Pujing Yang et.al. | 2409.02597 | null |
2024-09-04 | Coral Model Generation from Single Images for Virtual Reality Applications | Jie Fu et.al. | 2409.02376 | null |
2024-09-04 | Image Registration with Averaging Network and Edge-Based Loss for Low-SNR Cardiac MRI | Xuan Lei et.al. | 2409.02348 | null |
2024-09-03 | Coaching a Robotic Sonographer: Learning Robotic Ultrasound with Sparse Expert's Feedback | Deepak Raina et.al. | 2409.02337 | null |
2024-09-03 | Unveiling Deep Shadows: A Survey on Image and Video Shadow Detection, Removal, and Generation in the Era of Deep Learning | Xiaowei Hu et.al. | 2409.02108 | link |
2024-09-03 | AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions | Chenghao Qian et.al. | 2409.02045 | link |
2024-09-03 | Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates | Yixuan Ye et.al. | 2409.01935 | link |
2024-09-03 | UWStereo: A Large Synthetic Dataset for Underwater Stereo Matching | Qingxuan Lv et.al. | 2409.01782 | null |
2024-09-03 | Boron Isotope Effects on Raman Scattering in Bulk BN, BP, and BAs: A Density-Functional Theory Study | Nima Ghafari Cherati et.al. | 2409.01671 | null |
2024-09-03 | GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Zixuan Guo et.al. | 2409.01581 | null |
2024-09-03 | Learning Task-Specific Sampling Strategy for Sparse-View CT Reconstruction | Liutao Yang et.al. | 2409.01544 | null |
2024-09-03 | Long-Range Biometric Identification in Real World Scenarios: A Comprehensive Evaluation Framework Based on Missions | Deniz Aykac et.al. | 2409.01540 | null |
2024-09-02 | Real-Time Multi-Scene Visibility Enhancement for Promoting Navigational Safety of Vessels Under Complex Weather Conditions | Ryan Wen Liu et.al. | 2409.01500 | link |
2024-09-02 | Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement | Tathagata Bandyopadhyay et.al. | 2409.01352 | link |
2024-09-02 | A Roadmap to Holographic Focused Ultrasound Approaches to Generate Thermal Patterns | Ceren Cengiz et.al. | 2409.01323 | null |
2024-09-02 | Investigation of the spatial resolution of PET imaging system measuring polarization-correlated Compton events | Ana Marija Kožuljević et.al. | 2409.01238 | null |
2024-09-02 | MobileIQA: Exploiting Mobile-level Diverse Opinion Network For No-Reference Image Quality Assessment Using Knowledge Distillation | Zewen Chen et.al. | 2409.01212 | link |
2024-09-02 | Generating Synthetic Satellite Imagery for Rare Objects: An Empirical Comparison of Models and Metrics | Tuong Vy Nguyen et.al. | 2409.01138 | null |
2024-09-02 | Rapid GPU-Based Pangenome Graph Layout | Jiajie Li et.al. | 2409.00876 | null |
2024-09-01 | An Optimized Binning and Probabilistic Slice Sharing Algorithm for Motion Correction in Abdominal DW-MRI | Michelle Su et.al. | 2409.00798 | null |
2024-08-30 | Subspace Diffusion Posterior Sampling for Travel-Time Tomography | Xiang Cao et.al. | 2408.17333 | null |
2024-08-30 | Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution | Yixin Wu et.al. | 2408.17285 | null |
2024-08-30 | LAR-IQA: A Lightweight, Accurate, and Robust No-Reference Image Quality Assessment Model | Nasim Jamshidi Avanaki et.al. | 2408.17057 | link |
2024-08-30 | Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Shuyang Zhang et.al. | 2408.17005 | link |
2024-08-29 | Legacy Learning Using Few-Shot Font Generation Models for Automatic Text Design in Metaverse Content: Cases Studies in Korean and Chinese | Younghwi Kim et.al. | 2408.16900 | null |
2024-08-29 | The Continuous Electron Beam Accelerator Facility at 12 GeV | P. A. Adderley et.al. | 2408.16880 | null |
2024-08-29 | MSLIQA: Enhancing Learning Representations for Image Quality Assessment through Multi-Scale Learning | Nasim Jamshidi Avanaki et.al. | 2408.16879 | null |
2024-09-04 | Auto-resolving atomic structure at van der Waal interfaces using a generative model | Wenqiang Huang et.al. | 2408.16802 | link |
2024-09-02 | RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model | Zhuan Shi et.al. | 2408.16634 | null |
2024-09-02 | A Deep-Learning-Based Label-free No-Reference Image Quality Assessment Metric: Application in Sodium MRI Denoising | Shuaiyu Yuan et.al. | 2408.16481 | null |
2024-08-29 | LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement | Ye Yu et.al. | 2408.16235 | link |
2024-08-28 | TEDRA: Text-based Editing of Dynamic and Photoreal Actors | Basavaraj Sunagad et.al. | 2408.15995 | null |
2024-08-28 | Segmentation-guided Layer-wise Image Vectorization with Gradient Fills | Hengyu Zhou et.al. | 2408.15741 | link |
2024-08-28 | Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas | Fabio Quattrini et.al. | 2408.15660 | link |
2024-08-28 | Avoiding Generative Model Writer's Block With Embedding Nudging | Ali Zand et.al. | 2408.15450 | null |
2024-09-02 | Pitfalls and Outlooks in Using COMET | Vilém Zouhar et.al. | 2408.15366 | link |
2024-08-27 | Histo-Diffusion: A Diffusion Super-Resolution Method for Digital Pathology with Comprehensive Quality Assessment | Xuan Xu et.al. | 2408.15218 | null |
2024-08-27 | CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP | Zhenchen Tang et.al. | 2408.15098 | null |
2024-08-27 | Towards Real-world Event-guided Low-light Video Enhancement and Deblurring | Taewoo Kim et.al. | 2408.14916 | link |
2024-08-27 | Alfie: Democratising RGBA Image Generation With No $$$ | Fabio Quattrini et.al. | 2408.14826 | link |
2024-08-27 | Sequential-Scanning Dual-Energy CT Imaging Using High Temporal Resolution Image Reconstruction and Error-Compensated Material Basis Image Generation | Qiaoxin Li et.al. | 2408.14754 | null |
2024-08-26 | Gallery-Aware Uncertainty Estimation For Open-Set Face Recognition | Leonid Erlygin et.al. | 2408.14229 | null |
2024-08-27 | SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher | Trung Dao et.al. | 2408.14176 | link |
2024-08-27 | Improving Water Quality Time-Series Prediction in Hong Kong using Sentinel-2 MSI Data and Google Earth Engine Cloud Computing | Rohin Sood et.al. | 2408.14010 | null |
2024-08-26 | LMM-VQA: Advancing Video Quality Assessment with Large Multimodal Models | Qihang Ge et.al. | 2408.14008 | null |
2024-08-25 | Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching | Minghao Liu et.al. | 2408.13858 | null |
2024-08-25 | Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In! | Stefano Perrella et.al. | 2408.13831 | link |
2024-08-24 | G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles | Adil Meric et.al. | 2408.13508 | null |
2024-08-23 | ReCon: Reconfiguring Analog Rydberg Atom Quantum Computers for Quantum Generative Adversarial Networks | Nicholas S. DiBrita et.al. | 2408.13389 | link |
2024-08-23 | Re-evaluation of Face Anti-spoofing Algorithm in Post COVID-19 Era Using Mask Based Occlusion Attack | Vaibhav Sundharam et.al. | 2408.13251 | null |
2024-08-23 | ResSR: A Residual Approach to Super-Resolving Multispectral Images | Haley Duba-Sullivan et.al. | 2408.13225 | null |
2024-08-23 | A density ratio framework for evaluating the utility of synthetic data | Thom Benjamin Volker et.al. | 2408.13167 | null |
2024-08-23 | When Diffusion MRI Meets Diffusion Model: A Novel Deep Generative Model for Diffusion MRI Generation | Xi Zhu et.al. | 2408.12897 | null |
2024-08-22 | Variable Stars in M31 Stellar Clusters from the Panchromatic Hubble Andromeda Treasury | Richard Smith et.al. | 2408.12765 | null |
2024-08-22 | Visual Verity in AI-Generated Imagery: Computational Metrics and Human-Centric Analysis | Memoona Aziz et.al. | 2408.12762 | null |
2024-08-22 | Unlocking Intrinsic Fairness in Stable Diffusion | Eunji Kim et.al. | 2408.12692 | null |
2024-08-22 | Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features | Shaoxiang Dang et.al. | 2408.12279 | null |
2024-08-21 | MBSS-T1: Model-Based Self-Supervised Motion Correction for Robust Cardiac T1 Mapping | Eyal Hanania et.al. | 2408.11992 | null |
2024-08-21 | AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results | Maksim Smirnov et.al. | 2408.11982 | link |
2024-08-21 | Estimating Contribution Quality in Online Deliberations Using a Large Language Model | Lodewijk Gelauff et.al. | 2408.11936 | null |
2024-08-21 | FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting | Liyao Jiang et.al. | 2408.11706 | null |
2024-08-21 | Interpretable Long-term Action Quality Assessment | Xu Dong et.al. | 2408.11687 | link |
2024-08-21 | E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment | Shangkun Sun et.al. | 2408.11481 | link |
2024-08-21 | Fairness measures for biometric quality assessment | André Dörsch et.al. | 2408.11392 | null |
2024-08-21 | Gender Bias Evaluation in Text-to-image Generation: A Survey | Yankun Wu et.al. | 2408.11358 | null |
2024-08-21 | Image Score: Learning and Evaluating Human Preferences for Mercari Search | Chingis Oinar et.al. | 2408.11349 | null |
2024-08-21 | High-quality imaging of large areas through path-difference ptychography | Jizhe Cui et.al. | 2408.11332 | null |
2024-08-21 | Optimizing Transmit Field Inhomogeneity of Parallel RF Transmit Design in 7T MRI using Deep Learning | Zhengyi Lu et.al. | 2408.11323 | null |
2024-08-21 | Transfer Learning and the Early Estimation of Single-Photon Source Quality using Machine Learning Methods | David Jacob Kedziora et.al. | 2408.11322 | link |
2024-08-20 | Compress Guidance in Conditional Diffusion Sampling | Anh-Dung Dinh et.al. | 2408.11194 | null |
2024-08-20 | Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement | Satoshi Kosugi et.al. | 2408.11055 | link |
2024-08-20 | Denoising Plane Wave Ultrasound Images Using Diffusion Probabilistic Models | Hojat Asgariandehkordi et.al. | 2408.10987 | null |
2024-08-20 | Influence of Medical Foreign Bodies on Dark-Field Chest Radiographs: First experiences | Lennard Kaster et.al. | 2408.10855 | null |
2024-08-19 | Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation | Liu He et.al. | 2408.10453 | null |
2024-08-19 | Perceptual Depth Quality Assessment of Stereoscopic Omnidirectional Images | Wei Zhou et.al. | 2408.10134 | null |
2024-08-19 | Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement | Kang Xiao et.al. | 2408.09920 | link |
2024-08-19 | Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation | Yunxin Li et.al. | 2408.09787 | link |
2024-08-21 | Reconstruct Spine CT from Biplanar X-Rays via Diffusion Learning | Zhi Qiao et.al. | 2408.09731 | null |
2024-08-18 | FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model | Ziyu Yao et.al. | 2408.09384 | null |
2024-08-17 | Optimal Strip Attitude Command of Earth Observation Satellite using Differential Dynamic Programming | Seungyeop Han et.al. | 2408.09244 | null |
2024-08-16 | Explore Cross-Codec Quality-Rate Convex Hulls Relation for Adaptive Streaming | Masoumeh Farhadi Nia et.al. | 2408.09044 | null |
2024-08-16 | Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions | Bhuvanashree Murugadoss et.al. | 2408.08781 | null |
2024-08-16 | Speckle Noise Analysis for Synthetic Aperture Radar (SAR) Space Data | Sanjjushri Varshini R et.al. | 2408.08774 | null |
2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | null |
2024-08-16 | Visual-Friendly Concept Protection via Selective Adversarial Perturbations | Xiaoyue Mi et.al. | 2408.08518 | link |
2024-08-16 | Achieving Complex Image Edits via Function Aggregation with Diffusion Models | Mohammadreza Samadi et.al. | 2408.08495 | null |
2024-08-15 | Level Up Your Tutorials: VLMs for Game Tutorials Quality Assessment | Daniele Rege Cambrin et.al. | 2408.08396 | link |
2024-08-15 | METR: Image Watermarking with Large Number of Unique Messages | Alexander Varlamov et.al. | 2408.08340 | link |
2024-08-15 | Accelerated Image-Aware Generative Diffusion Modeling | Tanmay Asthana et.al. | 2408.08306 | null |
2024-08-15 | Rethinking Medical Anomaly Detection in Brain MRI: An Image Quality Assessment Perspective | Zixuan Pan et.al. | 2408.08228 | link |
2024-08-15 | When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding | Pingping Zhang et.al. | 2408.08093 | null |
2024-08-15 | KGV: Integrating Large Language Models with Knowledge Graphs for Cyber Threat Intelligence Credibility Assessment | Zongzong Wu et.al. | 2408.08088 | null |
2024-08-15 | Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation | Seon-Hoon Kim et.al. | 2408.07947 | link |
2024-08-15 | MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion | Lucas Nedel Kirsten et.al. | 2408.07932 | link |
2024-08-14 | New Curriculum, New Chance -- Retrieval Augmented Generation for Lesson Planning in Ugandan Secondary Schools. Prototype Quality Evaluation | Simon Kloker et.al. | 2408.07542 | null |
2024-08-14 | Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models | Jean-Marie Lemercier et.al. | 2408.07472 | null |
2024-08-14 | DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement | Tao Sun et.al. | 2408.07388 | null |
2024-08-13 | Direction of Arrival Correction through Speech Quality Feedback | Caleb Rascon et.al. | 2408.07234 | link |
2024-08-13 | SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis | Yuchen Mao et.al. | 2408.07196 | null |
2024-08-13 | BVI-UGC: A Video Quality Database for User-Generated Content Transcoding | Zihao Qi et.al. | 2408.07171 | null |
2024-08-13 | Efficient Deep Model-Based Optoacoustic Image Reconstruction | Christoph Dehner et.al. | 2408.07109 | null |
2024-08-13 | Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality | Yu-Chih Chen et.al. | 2408.07041 | null |
2024-08-13 | Feature-Preserving Rate-Distortion Optimization in Image Coding for Machines | Samuel Fernández Menduiña et.al. | 2408.07028 | null |
2024-08-13 | Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models | Cheng Chen et.al. | 2408.06995 | null |
2024-08-13 | Evaluating Research Quality with Large Language Models: An Analysis of ChatGPT's Effectiveness with Different Settings and Inputs | Mike Thelwall et.al. | 2408.06752 | null |
2024-08-13 | Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models | Chenqian Yan et.al. | 2408.06646 | null |
2024-08-13 | Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture | Yu Feng et.al. | 2408.06608 | null |
2024-08-13 | HDRGS: High Dynamic Range Gaussian Splatting | Jiahao Wu et.al. | 2408.06543 | link |
2024-08-12 | FoVNet: Configurable Field-of-View Speech Enhancement with Low Computation and Distortion for Smart Glasses | Zhongweiyang Xu et.al. | 2408.06468 | null |
2024-08-12 | Palantir: Towards Efficient Super Resolution for Ultra-high-definition Live Streaming | Xinqi Jin et.al. | 2408.06152 | link |
2024-08-12 | A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting | Felix Assion et.al. | 2408.06071 | null |
2024-08-12 | DiagESC: Dialogue Synthesis for Integrating Depression Diagnosis into Emotional Support Conversation | Seungyeon Seo et.al. | 2408.06044 | link |
2024-08-12 | A Sharpness Based Loss Function for Removing Out-of-Focus Blur | Uditangshu Aurangabadkar et.al. | 2408.06014 | link |
2024-08-12 | A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models | Taehong Moon et.al. | 2408.05927 | link |
2024-08-12 | Creating Arabic LLM Prompts at Scale | Abdelrahman El-Sheikh et.al. | 2408.05882 | null |
2024-08-11 | LaWa: Using Latent Space for In-Generation Image Watermarking | Ahmad Rezaei et.al. | 2408.05868 | null |
2024-08-14 | Iterative Improvement of an Additively Regularized Topic Model | Alex Gorbulev et.al. | 2408.05840 | null |
2024-08-11 | SSL: A Self-similarity Loss for Improving Generative Image Super-resolution | Du Chen et.al. | 2408.05713 | link |
2024-08-11 | Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators | Yifan Pu et.al. | 2408.05710 | link |
2024-08-11 | Evaluating BM3D and NBNet: A Comprehensive Study of Image Denoising Across Multiple Datasets | Ghazal Kaviani et.al. | 2408.05697 | null |
2024-08-09 | CBCT scatter correction with dual-layer flat-panel detector | Xin Zhang et.al. | 2408.04943 | null |
2024-08-09 | Self-augmented Gaussian Splatting with Structure-aware Masks for Sparse-view 3D Reconstruction | Lingbei Meng et.al. | 2408.04831 | null |
2024-08-08 | DaedalusData: Exploration, Knowledge Externalization and Labeling of Particles in Medical Manufacturing -- A Design Study | Alexander Wyss et.al. | 2408.04749 | null |
2024-08-08 | Sampling for View Synthesis: From Local Light Field Fusion to Neural Radiance Fields and Beyond | Ravi Ramamoorthi et.al. | 2408.04586 | null |
2024-08-11 | Synchronous Multi-modal Semantic Communication System with Packet-level Coding | Yun Tian et.al. | 2408.04535 | null |
2024-08-08 | Robustness investigation of quality measures for the assessment of machine learning models | Thomas Most et.al. | 2408.04391 | null |
2024-08-08 | SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression | Linhan Cao et.al. | 2408.04273 | null |
2024-08-08 | LLDif: Diffusion Models for Low-light Emotion Recognition | Zhifeng Wang et.al. | 2408.04235 | null |
2024-08-07 | Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation | Yiqing Shen et.al. | 2408.04098 | null |
2024-08-07 | Machine Learning-Based Reward-Driven Tuning of Scanning Probe Microscopy: Towards Fully Automated Microscopy | Yu Liu et.al. | 2408.04055 | null |
2024-08-07 | Global-Local Progressive Integration Network for Blind Image Quality Assessment | Xiaoqi Wang et.al. | 2408.03885 | null |
2024-08-07 | Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Joo Chan Lee et.al. | 2408.03822 | null |
2024-08-07 | Soft-Hard Attention U-Net Model and Benchmark Dataset for Multiscale Image Shadow Removal | Eirini Cholopoulou et.al. | 2408.03734 | null |
2024-08-07 | Monitoring of Hermit Crabs Using drone-captured imagery and Deep Learning based Super-Resolution Reconstruction and Improved YOLOv8 | Fan Zhao et.al. | 2408.03559 | null |
2024-08-07 | D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods | Onkar Susladkar et.al. | 2408.03558 | link |
2024-08-07 | PRTGS: Precomputed Radiance Transfer of Gaussian Splats for Real-Time High-Quality Relighting | Yijia Guo et.al. | 2408.03538 | null |
2024-08-06 | Image Quality Transfer of Diffusion MRI Guided By High-Resolution Structural MRI | Alp G. Cicimen et.al. | 2408.03216 | null |
2024-08-06 | Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models | Sho Ozaki et.al. | 2408.03156 | null |
2024-08-05 | VidGen-1M: A Large-Scale Dataset for Text-to-video Generation | Zhiyu Tan et.al. | 2408.02629 | null |
2024-08-05 | Cascading Refinement Video Denoising with Uncertainty Adaptivity | Xinyuan Yu et.al. | 2408.02284 | null |
2024-08-04 | PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance | Aoming Liu et.al. | 2408.02157 | null |
2024-08-06 | RICA2: Rubric-Informed, Calibrated Assessment of Actions | Abrar Majeedi et.al. | 2408.02138 | link |
2024-08-04 | View-consistent Object Removal in Radiance Fields | Yiren Lu et.al. | 2408.02100 | null |
2024-08-04 | Constructing Per-Shot Bitrate Ladders using Visual Information Fidelity | Krishna Srikar Durbha et.al. | 2408.01932 | null |
2024-08-03 | Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation | Jintao Tan et.al. | 2408.01732 | null |
2024-08-03 | JambaTalk: Speech-Driven 3D Talking Head Generation Based on Hybrid Transformer-Mamba Model | Farzaneh Jafari et.al. | 2408.01627 | null |
2024-08-02 | Guardians of Image Quality: Benchmarking Defenses Against Adversarial Attacks on Image Quality Metrics | Alexander Gushchin et.al. | 2408.01541 | link |
2024-08-02 | Underwater Object Detection Enhancement via Channel Stabilization | Muhammad Ali et.al. | 2408.01293 | link |
2024-08-02 | Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement | Wenbin Zou et.al. | 2408.01276 | link |
2024-08-02 | Reality Fusion: Robust Real-time Immersive Mobile Robot Teleoperation with Volumetric Visual Data Fusion | Ke Li et.al. | 2408.01225 | link |
2024-08-02 | Validation of an Analysability Model in Hybrid Quantum Software | Díaz-Muñoz Ana et.al. | 2408.01105 | null |
2024-08-06 | FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation | Xiang Gao et.al. | 2408.00998 | link |
2024-08-01 | SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement | Mark Boss et.al. | 2408.00653 | null |
2024-08-01 | Regional quality estimation for echocardiography using deep learning | Gilles Van De Vyver et.al. | 2408.00591 | link |
2024-08-01 | Image Super-Resolution with Taylor Expansion Approximation and Large Field Reception | Jiancong Feng et.al. | 2408.00470 | null |
2024-08-01 | RDP: Ranked Differential Privacy for Facial Feature Protection in Multiscale Sparsified Subspace | Lu Ou et.al. | 2408.00294 | null |
2024-07-31 | Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification | Xingchen Shi et.al. | 2407.21683 | null |
2024-07-31 | Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model | Zhichao Zhang et.al. | 2407.21408 | null |
2024-07-31 | An all-sky catalogue of stellar reddening values | E. Paunzen et.al. | 2407.21373 | null |
2024-07-31 | ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images | Xilei Zhu et.al. | 2407.21363 | null |
2024-08-01 | Outlier Detection in Large Radiological Datasets using UMAP | Mohammad Tariqul Islam et.al. | 2407.21263 | link |
2024-07-30 | MP-You: A Web-based MPI Simulation Tool | The-Vinh Tran-Luu et.al. | 2407.21155 | null |
2024-07-30 | Simultaneous Multi-Slice Diffusion Imaging using Navigator-free Multishot Spiral Acquisition | Yuancheng Jiang et.al. | 2407.20904 | null |
2024-07-30 | Highly Efficient No-reference 4K Video Quality Assessment with Full-Pixel Covering Sampling and Training Strategy | Xiaoheng Tan et.al. | 2407.20766 | null |
2024-07-30 | Questionnaires for Everyone: Streamlining Cross-Cultural Questionnaire Adaptation with GPT-Based Translation Quality Evaluation | Otso Haavisto et.al. | 2407.20608 | link |
2024-07-29 | Mean Opinion Score as a New Metric for User-Evaluation of XAI Methods | Hyeon Yu et.al. | 2407.20427 | null |
2024-07-29 | Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception | Konstantinos Tzevelekakis et.al. | 2407.20336 | null |
2024-07-29 | DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models | Jing Yang et.al. | 2407.20141 | null |
2024-07-29 | HeadsetOff: Enabling Photorealistic Video Conferencing on Economical VR Headsets | Yili Jin et.al. | 2407.19988 | null |
2024-07-29 | Noise-Resilient Unsupervised Graph Representation Learning via Multi-Hop Feature Quality Estimation | Shiyuan Li et.al. | 2407.19944 | null |
2024-07-29 | FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention | Yu Lu et.al. | 2407.19918 | null |
2024-07-29 | ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement | Ezequiel Perez-Zarate et.al. | 2407.19708 | link |
2024-07-29 | UNQA: Unified No-Reference Quality Assessment for Audio, Image, Video, and Audio-Visual Content | Yuqin Cao et.al. | 2407.19704 | null |
2024-07-29 | Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment | Wulian Yun et.al. | 2407.19675 | null |
2024-07-28 | X-Fake: Juggling Utility Evaluation and Explanation of Simulated SAR Images | Zhongling Huang et.al. | 2407.19436 | null |
2024-07-27 | Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | Gang Pan et.al. | 2407.19271 | null |
2024-07-27 | Towards Clean-Label Backdoor Attacks in the Physical World | Thinh Dao et.al. | 2407.19203 | null |
2024-07-26 | Regularized Multi-Decoder Ensemble for an Error-Aware Scene Representation Network | Tianyu Xiong et.al. | 2407.19082 | null |
2024-07-26 | Correcting for objective sample refractive index mismatch in extended field of view selective plane illumination microscopy | Steven J. Sheppard et.al. | 2407.18862 | null |
2024-07-25 | Joint RGB-Spectral Decomposition Model Guided Image Enhancement in Mobile Photography | Kailai Zhou et.al. | 2407.17996 | link |
2024-07-29 | Invariance of deep image quality metrics to affine transformations | Nuria Alabau-Bosque et.al. | 2407.17927 | link |
2024-07-25 | Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion | Xiaodan Xing et.al. | 2407.17882 | null |
2024-07-24 | Final Alignment and Image Quality Test for the Acquisition and Guiding System of SOXS | J. A. Araiza-Duran et.al. | 2407.17382 | null |
2024-07-24 | SOXS NIR: Optomechanical integration and alignment, optical performance verification before full instrument assembly | M. Genoni et.al. | 2407.17244 | null |
2024-07-24 | Q-Ground: Image Quality Grounding with Large Multi-modality Models | Chaofeng Chen et.al. | 2407.17035 | link |
2024-07-24 | 3DAttGAN: A 3D Attention-based Generative Adversarial Network for Joint Space-Time Video Super-Resolution | Congrui Fu et.al. | 2407.16965 | link |
2024-07-24 | SAR to Optical Image Translation with Color Supervised Diffusion Model | Xinyu Bai et.al. | 2407.16921 | null |
2024-07-23 | QPT V2: Masked Image Modeling Advances Visual Scoring | Qizhi Xie et.al. | 2407.16541 | link |
2024-07-23 | ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation | Zhenhua Wu et.al. | 2407.16508 | null |
2024-07-23 | On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models | Deniz Daum et.al. | 2407.16405 | link |
2024-07-23 | Improving multidimensional projection quality with user-specific metrics and optimal scaling | Maniru Ibrahim et.al. | 2407.16328 | null |
2024-07-23 | A new visual quality metric for Evaluating the performance of multidimensional projections | Maniru Ibrahim et.al. | 2407.16309 | null |
2024-07-23 | Integrating Meshes and 3D Gaussians for Indoor Scene Reconstruction with SAM Mask Guidance | Jiyeop Kim et.al. | 2407.16173 | null |
2024-07-23 | Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos | Jiahe Liu et.al. | 2407.16124 | link |
2024-07-22 | Enhancing Cell Instance Segmentation in Scanning Electron Microscopy Images via a Deep Contour Closing Operator | Florian Robert et.al. | 2407.15817 | null |
2024-07-22 | SS-SFR: Synthetic Scenes Spatial Frequency Response on Virtual KITTI and Degraded Automotive Simulations for Object Detection | Daniel Jakab et.al. | 2407.15646 | null |
2024-07-22 | Experimenting with Adaptive Bitrate Algorithms for Virtual Reality Streaming over Wi-Fi | Ferran Maura et.al. | 2407.15614 | link |
2024-07-22 | SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time | Stanislav Frolov et.al. | 2407.15507 | link |
2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
2024-07-21 | Assessing Sample Quality via the Latent Space of Generative Models | Jingyi Xu et.al. | 2407.15171 | link |
2024-07-20 | Non-Reference Quality Assessment for Medical Imaging: Application to Synthetic Brain MRIs | Karl Van Eeden Risager et.al. | 2407.14994 | null |
2024-07-20 | Deep Learning CT Image Restoration using System Blur and Noise Models | Yijie Yuan et.al. | 2407.14983 | null |
2024-07-20 | GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation | Jingzhi Gong et.al. | 2407.14982 | link |
2024-07-20 | Dual High-Order Total Variation Model for Underwater Image Restoration | Yuemei Li et.al. | 2407.14868 | link |
2024-07-20 | CBCTLiTS: A Synthetic, Paired CBCT/CT Dataset For Segmentation And Style Transfer | Maximilian E. Tschuchnig et.al. | 2407.14853 | null |
2024-07-20 | Realistic Surgical Image Dataset Generation Based On 3D Gaussian Splatting | Tianle Zeng et.al. | 2407.14846 | null |
2024-07-20 | Difflare: Removing Image Lens Flare with Latent Diffusion Model | Tianwen Zhou et.al. | 2407.14746 | link |
2024-07-20 | Polarimetric compressed sensing with hollow, self-assembled diffractive films | Ji Feng et.al. | 2407.14722 | null |
2024-07-19 | A Minibatch Alternating Projections Algorithm for Robust and Efficient Magnitude Least-Squares RF Pulse Design in MRI | Jonathan B. Martin et.al. | 2407.14696 | link |
2024-07-19 | A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Qi Yang et.al. | 2407.14197 | link |
2024-07-19 | Shape and Style GAN-based Multispectral Data Augmentation for Crop/Weed Segmentation in Precision Farming | Mulham Fawakherji et.al. | 2407.14119 | null |
2024-07-19 | DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays | Zongyuan Yang et.al. | 2407.14053 | null |
2024-07-19 | Personalized Privacy Protection Mask Against Unauthorized Facial Recognition | Ka-Ho Chow et.al. | 2407.13975 | link |
2024-07-18 | Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion | Boyang Deng et.al. | 2407.13759 | null |
2024-07-18 | A Novel Freeform Slicer IFU for the Magellan InfraRed Multi-Object Spectrograph (MIRMOS) | Maren Cosens et.al. | 2407.13747 | null |
2024-07-18 | HazeCLIP: Towards Language Guided Real-World Image Dehazing | Ruiyi Wang et.al. | 2407.13719 | link |
2024-07-18 | Removing cloud shadows from ground-based solar imagery | Amal Chaoui et.al. | 2407.13379 | null |
2024-07-18 | Any Image Restoration with Efficient Automatic Degradation Adaptation | Bin Ren et.al. | 2407.13372 | link |
2024-07-18 | Heterogeneous Clinical Trial Outcomes via Multi-Output Gaussian Processes | Owen Thomas et.al. | 2407.13283 | null |
2024-07-18 | Research on Image Super-Resolution Reconstruction Mechanism based on Convolutional Neural Network | Hao Yan et.al. | 2407.13211 | null |
2024-07-18 | Learned HDR Image Compression for Perceptually Optimal Storage and Display | Peibei Cao et.al. | 2407.13179 | null |
2024-07-18 | Image Inpainting Models are Effective Tools for Instruction-guided Image Editing | Xuan Ju et.al. | 2407.13139 | null |
2024-07-18 | Enhanced Denoising of OCT Images Using Residual U-Net: A Cross-Modality Approach on PSOCT and ASOCT for Clinical Diagnostics | Akkidas Noel Prakasha et.al. | 2407.13090 | null |
2024-07-17 | Hallucination Index: An Image Quality Metric for Generative Reconstruction Models | Matthew Tivnan et.al. | 2407.12780 | null |
2024-07-17 | CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems | Jiankun Zhao et.al. | 2407.12676 | link |
2024-07-17 | High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion | Juan Song et.al. | 2407.12538 | link |
2024-07-17 | Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations | Tomáš Chobola et.al. | 2407.12511 | link |
2024-07-17 | Enhancing Film Grain Coding in VVC: Improving Encoding Quality and Efficiency | Vignesh V Menon et.al. | 2407.12465 | null |
2024-07-17 | Voltage-Controlled Magnetoelectric Devices for Neuromorphic Diffusion Process | Yang Cheng et.al. | 2407.12261 | null |
2024-07-16 | Semantic Communication for the Internet of Sounds: Architecture, Design Principles, and Challenges | Chengsi Liang et.al. | 2407.12203 | null |
2024-07-16 | Neural Passage Quality Estimation for Static Pruning | Xuejun Chang et.al. | 2407.12170 | link |
2024-07-16 | MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification | Zhuoxiao Li et.al. | 2407.11840 | null |
2024-07-16 | LoFTI: Localization and Factuality Transfer to Indian Locales | Sona Elza Simon et.al. | 2407.11833 | link |
2024-07-16 | XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach | Truong Thanh Hung Nguyen et.al. | 2407.11771 | link |
2024-07-16 | ITI-IQA: a Toolbox for Heterogeneous Univariate and Multivariate Missing Data Imputation Quality Assessment | Pedro Pons-Suñer et.al. | 2407.11767 | null |
2024-07-16 | Magnetogram-to-Magnetogram: Generative Forecasting of Solar Evolution | Francesco Pio Ramunno et.al. | 2407.11659 | link |
2024-07-16 | ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment | Xinyi Wang et.al. | 2407.11496 | link |
2024-07-16 | Cover-separable Fixed Neural Network Steganography via Deep Generative Models | Guobiao Li et.al. | 2407.11405 | link |
2024-07-16 | Ev-GS: Event-based Gaussian splatting for Efficient and Accurate Radiance Field Rendering | Jingqian Wu et.al. | 2407.11343 | null |
2024-07-15 | UFQA: Utility guided Fingerphoto Quality Assessment | Amol S. Joshi et.al. | 2407.11141 | null |
2024-07-15 | Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation | Tu Vu et.al. | 2407.10817 | null |
2024-07-15 | Melon Fruit Detection and Quality Assessment Using Generative AI-Based Image Data Augmentation | Seungri Yoon et.al. | 2407.10413 | null |
2024-07-15 | Exploring the Impact of Moire Pattern on Deepfake Detectors | Razaib Tariq et.al. | 2407.10399 | null |
2024-07-14 | Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models | Qinyu Yang et.al. | 2407.10285 | link |
2024-07-14 | Low Sensitivity Hopsets | Vikrant Ashvinkumar et.al. | 2407.10249 | null |
2024-07-14 | A Novel Approach to Ultrasound Beamforming using Synthetic Transmit Aperture with Low Complexity and High SNR for Medical Imaging | Thenmozhi Elango et.al. | 2407.10242 | null |
2024-07-13 | Asynchronous Feedback Network for Perceptual Point Cloud Quality Assessment | Yujie Zhang et.al. | 2407.09806 | link |
2024-07-12 | Quantum-dot-based Kitaev chains: Majorana quality measures and scaling with increasing chain length | Viktor Svensson et.al. | 2407.09211 | null |
2024-07-12 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-12 | Task-driven single-image super-resolution reconstruction of document scans | Maciej Zyrek et.al. | 2407.08993 | null |
2024-07-12 | LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models | Hai Jiang et.al. | 2407.08939 | link |
2024-07-12 | 15M Multimodal Facial Image-Text Dataset | Dawei Dai et.al. | 2407.08515 | null |
2024-07-11 | Imitation Learning for Robotic Assisted Ultrasound Examination of Deep Venous Thrombosis using Kernelized Movement Primitives | Diego Dall'Alba et.al. | 2407.08506 | null |
2024-07-11 | E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors | Jinxiu Liang et.al. | 2407.08231 | null |
2024-07-11 | Explicit_NeRF_QA: A Quality Assessment Database for Explicit NeRF Model Compression | Yuke Xing et.al. | 2407.08165 | null |
2024-07-10 | Coherent and Multi-modality Image Inpainting via Latent Space Optimization | Lingzhi Pan et.al. | 2407.08019 | link |
2024-07-10 | Intensity-sensitive quality assessment of extended sources in astronomical images | X. Li et.al. | 2407.07863 | link |
2024-07-12 | Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization | Feixiang Zhou et.al. | 2407.07673 | null |
2024-07-10 | Video In-context Learning | Wentao Zhang et.al. | 2407.07356 | null |
2024-07-10 | Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution | Yuehan Zhang et.al. | 2407.07302 | link |
2024-07-09 | HAMIL-QA: Hierarchical Approach to Multiple Instance Learning for Atrial LGE MRI Quality Assessment | K M Arefeen Sultan et.al. | 2407.07254 | null |
2024-07-09 | Scaling Up Personalized Aesthetic Assessment via Task Vector Customization | Jooyeol Yun et.al. | 2407.07176 | link |
2024-07-09 | Microsoft Cloud-based Digitization Workflow with Rich Metadata Acquisition for Cultural Heritage Objects | Krzysztof Kutt et.al. | 2407.06972 | null |
2024-07-09 | CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection | Shuang Hao et.al. | 2407.06780 | link |
2024-07-09 | Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition | Mingfang Zhang et.al. | 2407.06628 | null |
2024-07-09 | Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View | Dogyoon Lee et.al. | 2407.06613 | null |
2024-07-09 | Low-dose, high-resolution CT of infant-sized lungs via propagation-based phase contrast | James A. Pollock et.al. | 2407.06527 | null |
2024-07-08 | MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions | Xuan Ju et.al. | 2407.06358 | null |
2024-07-08 | Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision | Orr Zohar et.al. | 2407.06189 | link |
2024-07-08 | PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes | Mohammad Reza Karimi Dastjerdi et.al. | 2407.06150 | null |
2024-07-08 | Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation | Xinyu Bai et.al. | 2407.06095 | null |
2024-07-08 | Pan-denoising: Guided Hyperspectral Image Denoising via Weighted Represent Coefficient Total Variation | Shuang Xu et.al. | 2407.06064 | link |
2024-07-08 | MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices | Jianwen Jiang et.al. | 2407.05712 | null |
2024-07-09 | PCAC-GAN:ASparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression | Xiaolong Mao et.al. | 2407.05677 | null |
2024-07-08 | GSBIQA: Green Saliency-guided Blind Image Quality Assessment Method | Zhanxuan Mei et.al. | 2407.05590 | null |
2024-07-08 | Audio-driven High-resolution Seamless Talking Head Video Editing via StyleGAN | Jiacheng Su et.al. | 2407.05577 | null |
2024-07-06 | Panopticon: a telescope for our times | Will Saunders et.al. | 2407.05103 | null |
2024-07-06 | CLIPVQA:Video Quality Assessment via CLIP | Fengchuang Xing et.al. | 2407.04928 | link |
2024-07-06 | OmChat: A Recipe to Train Multimodal Language Models with Strong Long Context and Video Understanding | Tiancheng Zhao et.al. | 2407.04923 | null |
2024-07-05 | MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? | Zhaorun Chen et.al. | 2407.04842 | link |
2024-07-05 | Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps | Mattias Nilsson et.al. | 2407.04578 | link |
2024-07-05 | Rethinking Image Compression on the Web with Generative AI | Shayan Ali Hassan et.al. | 2407.04542 | null |
2024-07-05 | Optimizing the image correction pipeline for pedestrian detection in the thermal-infrared domain | Christophe Karam et.al. | 2407.04484 | null |
2024-07-05 | Unsupervised Video Summarization via Reinforcement Learning and a Trained Evaluator | Mehryar Abbasi et.al. | 2407.04258 | null |
2024-07-05 | HCS-TNAS: Hybrid Constraint-driven Semi-supervised Transformer-NAS for Ultrasound Image Segmentation | Renqi Chen et.al. | 2407.04203 | null |
2024-07-04 | Performance of Medical Image Fusion in High-level Analysis Tasks: A Mutual Enhancement Framework for Unaligned PAT and MRI Image Fusion | Yutian Zhong et.al. | 2407.03992 | link |
2024-07-04 | DSMix: Distortion-Induced Sensitivity Map Based Pre-training for No-Reference Image Quality Assessment | Jinsong Shi et.al. | 2407.03886 | link |
2024-07-04 | Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy | Yujie Zhang et.al. | 2407.03885 | link |
2024-07-04 | DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts | Zheng-Peng Duan et.al. | 2407.03757 | null |
2024-07-04 | Adaptive sampling strategy for tolerance analysis of freeform optical surfaces based on critical ray aiming | Rundong Fan et.al. | 2407.03688 | null |
2024-07-04 | Pathological Semantics-Preserving Learning for H&E-to-IHC Virtual Staining | Fuqiang Chen et.al. | 2407.03655 | link |
2024-07-04 | Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration | Yuhong Zhang et.al. | 2407.03636 | null |
2024-07-04 | Orthogonal Constrained Minimization with Tensor |
Xiaoxia Liu et.al. | 2407.03605 | null |
2024-07-03 | Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models | Chunmei Xu et.al. | 2407.03050 | null |
2024-07-03 | Single Image Rolling Shutter Removal with Diffusion Models | Zhanglei Yang et.al. | 2407.02906 | null |
2024-07-03 | FedPot: A Quality-Aware Collaborative and Incentivized Honeypot-Based Detector for Smart Grid Networks | Abdullatif Albaseer et.al. | 2407.02845 | null |
2024-07-03 | Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design | Gen Li et.al. | 2407.02813 | link |
2024-07-03 | SF-GNN: Self Filter for Message Lossless Propagation in Deep Graph Neural Network | Yushan Zhu et.al. | 2407.02762 | null |
2024-07-03 | MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control | Yeonji Lee et.al. | 2407.02736 | null |
2024-07-02 | Meta 3D Gen | Raphael Bensadoun et.al. | 2407.02599 | null |
2024-07-02 | Off-Grid Ultrasound Imaging by Stochastic Optimization | Vincent van de Schaft et.al. | 2407.02285 | link |
2024-07-02 | SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules | Suyi Li et.al. | 2407.02031 | null |
2024-07-01 | Free-text Rationale Generation under Readability Level Control | Yi-Sheng Hsu et.al. | 2407.01384 | null |
2024-07-01 | GaussianStego: A Generalizable Stenography Pipeline for Generative 3D Gaussians Splatting | Chenxin Li et.al. | 2407.01301 | null |
2024-07-01 | Optical turbulence vertical distribution at the Peak Terskol Observatory and Mt. Kurapdag | A. Y. Shikhovtsev et.al. | 2407.00960 | null |
2024-07-01 | Diffusion Transformer Model With Compact Prior for Low-dose PET Reconstruction | Bin Huang et.al. | 2407.00944 | link |
2024-06-30 | A Comparative Study of Quality Evaluation Methods for Text Summarization | Huyen Nguyen et.al. | 2407.00747 | null |
2024-06-30 | DCI: An Accurate Quality Assessment Criteria for Protein Complex Structure Models | Wenda Wang et.al. | 2407.00560 | null |
2024-06-29 | Dynamic Optimization of Video Streaming Quality Using Network Digital Twin Technology | Zurh Farus et.al. | 2407.00513 | null |
2024-07-02 | RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering | Weikai Lin et.al. | 2407.00435 | link |
2024-06-29 | Benchmark Evaluation of Image Fusion algorithms for Smartphone Camera Capture | Lucas N. Kirsten et.al. | 2407.00301 | null |
2024-06-28 | PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration | Yuxuan Sun et.al. | 2407.00203 | null |
2024-06-28 | Quantitative Methods in Research Evaluation Citation Indicators, Altmetrics, and Artificial Intelligence | Mike Thelwall et.al. | 2407.00135 | null |
2024-06-28 | MR-zero meets FLASH -- Controlling the transient signal decay in gradient- and rf-spoiled gradient echo sequences | Simon Weinmüller et.al. | 2406.19877 | null |
2024-06-28 | Deep Fusion Model for Brain Tumor Classification Using Fine-Grained Gradient Preservation | Niful Islam et.al. | 2406.19690 | null |
2024-06-28 | UltraGelBot: Autonomous Gel Dispenser for Robotic Ultrasound | Deepak Raina et.al. | 2406.19678 | null |
2024-06-28 | PopAlign: Population-Level Alignment for Fair Text-to-Image Generation | Shufan Li et.al. | 2406.19668 | link |
2024-06-27 | Robustness Testing of Black-Box Models Against CT Degradation Through Test-Time Augmentation | Jack Highton et.al. | 2406.19557 | null |
2024-06-27 | Lightweight Predictive 3D Gaussian Splats | Junli Cao et.al. | 2406.19434 | link |
2024-06-27 | Looking 3D: Anomaly Detection with 2D-3D Alignment | Ankan Bhunia et.al. | 2406.19393 | link |
2024-06-27 | AI Data Readiness Inspector (AIDRIN) for Quantitative Assessment of Data Readiness for AI | Kaveen Hiniduma et.al. | 2406.19256 | null |
2024-06-27 | Staggered Quantizers for Perfect Perceptual Quality: A Connection between Quantizers with Common Randomness and Without | Ruida Zhou et.al. | 2406.19248 | null |
2024-06-27 | Local Manifold Learning for No-Reference Image Quality Assessment | Timin Gao et.al. | 2406.19247 | null |
2024-06-27 | Complex-valued scatter compensation in nonlinear microscopy | Maximilian Sohmen et.al. | 2406.19031 | null |
2024-06-27 | Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model | Jiangtong Tan et.al. | 2406.19030 | link |
2024-06-26 | IDA-UIE: An Iterative Framework for Deep Network-based Degradation Aware Underwater Image Enhancement | Pranjali Singh et.al. | 2406.18628 | null |
2024-06-26 | On Scaling Up 3D Gaussian Splatting Training | Hexu Zhao et.al. | 2406.18533 | link |
2024-06-26 | MultiDiff: Consistent Novel View Synthesis from a Single Image | Norman Müller et.al. | 2406.18524 | null |
2024-06-26 | ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation | Shenghai Yuan et.al. | 2406.18522 | link |
2024-06-26 | MFDNet: Multi-Frequency Deflare Network for Efficient Nighttime Flare Removal | Yiguo Jiang et.al. | 2406.18079 | link |
2024-06-26 | Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation | Qilai Zhang et.al. | 2406.18054 | link |
2024-06-25 | Burst Image Super-Resolution with Base Frame Selection | Sanghyun Kim et.al. | 2406.17869 | null |
2024-06-25 | Sparse-view Signal-domain Photoacoustic Tomography Reconstruction Method Based on Neural Representation | Bowei Yao et.al. | 2406.17578 | null |
2024-06-25 | UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment | Vlad Hosu et.al. | 2406.17472 | null |
2024-06-25 | Leveraging LLMs for Dialogue Quality Measurement | Jinghan Jia et.al. | 2406.17304 | null |
2024-06-25 | HD snapshot diffractive spectral imaging and inferencing | Apratim Majumder et.al. | 2406.17302 | null |
2024-06-25 | Image-Guided Outdoor LiDAR Perception Quality Assessment for Autonomous Driving | Ce Zhang et.al. | 2406.17265 | null |
2024-06-25 | Disentangled Motion Modeling for Video Frame Interpolation | Jaihyun Lew et.al. | 2406.17256 | link |
2024-06-24 | Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models | Bei Yan et.al. | 2406.17115 | link |
2024-06-24 | Fine-tuning Diffusion Models for Enhancing Face Quality in Text-to-image Generation | Zhenyi Liao et.al. | 2406.17100 | link |
2024-06-24 | Reducing the Memory Footprint of 3D Gaussian Splatting | Panagiotis Papantonakis et.al. | 2406.17074 | null |
2024-06-24 | 3D distortion-free, reduced field of view diffusion-prepared GRE at 3T | Sarah McElroy et.al. | 2406.16809 | null |
2024-06-24 | Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation | Katherine M. Collins et.al. | 2406.16807 | null |
2024-06-24 | Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment | Jun Fu et.al. | 2406.16641 | link |
2024-06-24 | DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution | Aiwen Jiang et.al. | 2406.16477 | link |
2024-06-24 | Approximate DCT and Quantization Techniques for Energy-Constrained Image Sensors | Ming-Che Li et.al. | 2406.16358 | null |
2024-06-24 | Priorformer: A UGC-VQA Method with content and distortion priors | Yajing Pei et.al. | 2406.16297 | null |
2024-06-23 | Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation | Rafael Redondo et.al. | 2406.16155 | null |
2024-06-23 | LGS: A Light-weight 4D Gaussian Splatting for Efficient Surgical Scene Reconstruction | Hengyu Liu et.al. | 2406.16073 | link |
2024-06-22 | Quality-guided Skin Tone Enhancement for Portrait Photography | Shiqi Gao et.al. | 2406.15848 | null |
2024-06-21 | Adaptive Self-Supervised Consistency-Guided Diffusion Model for Accelerated MRI Reconstruction | Mojtaba Safari et.al. | 2406.15656 | null |
2024-06-21 | Contrastive Entity Coreference and Disambiguation for Historical Texts | Abhishek Arora et.al. | 2406.15576 | null |
2024-06-21 | Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild | Nadav Orzech et.al. | 2406.15331 | null |
2024-06-21 | Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection | Lynn Vonderhaar et.al. | 2406.15268 | null |
2024-06-24 | VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation | Xuan He et.al. | 2406.15252 | null |
2024-06-21 | Unsupervised Bayesian Generation of Synthetic CT from CBCT Using Patient-Specific Score-Based Prior | Junbo Peng et.al. | 2406.15219 | null |
2024-06-21 | Benchmarking Retinal Blood Vessel Segmentation Models for Cross-Dataset and Cross-Disease Generalization | Jeremiah Fadugba et.al. | 2406.14994 | link |
2024-06-21 | Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning | Xu Han et.al. | 2406.14847 | null |
2024-06-21 | Is this a bad table? A Closer Look at the Evaluation of Table Generation from Text | Pritika Ramu et.al. | 2406.14829 | null |
2024-06-20 | Holistic Evaluation for Interleaved Text-and-Image Generation | Minqian Liu et.al. | 2406.14643 | null |
2024-06-20 | A Fuzzy Logic-Based Quality Model For Identifying Microservices With Low Maintainability | Rahime Yilmaz et.al. | 2406.14489 | null |
2024-06-20 | Enhancing multivariate post-processed visibility predictions utilizing CAMS forecasts | Mária Lakatos et.al. | 2406.14159 | null |
2024-06-20 | EnTruth: Enhancing the Traceability of Unauthorized Dataset Usage in Text-to-image Diffusion Models with Minimal and Robust Alterations | Jie Ren et.al. | 2406.13933 | null |
2024-06-19 | IG-CFAT: An Improved GAN-Based Framework for Effectively Exploiting Transformers in Real-World Image Super-Resolution | Alireza Aghelan et.al. | 2406.13815 | link |
2024-06-19 | Convex-hull Estimation using XPSNR for Versatile Video Coding | Vignesh V Menon et.al. | 2406.13712 | null |
2024-06-19 | Assessing the 3D resolution of refocused correlation plenoptic images using a general-purpose image quality estimator | Gianlorenzo Massaro et.al. | 2406.13501 | null |
2024-06-19 | ALiiCE: Evaluating Positional Fine-grained Citation Generation | Yilong Xu et.al. | 2406.13375 | link |
2024-06-19 | AniFaceDiff: High-Fidelity Face Reenactment via Facial Parametric Conditioned Diffusion Models | Ken Chen et.al. | 2406.13272 | null |
2024-06-19 | New methods for ALMA angular-scale based observation scheduling, quality assessment, and beam shaping II: refinements | Dirk Petry et.al. | 2406.13199 | null |
2024-06-18 | NTIRE 2024 Challenge on Night Photography Rendering | Egor Ershov et.al. | 2406.13007 | null |
2024-06-18 | Pattern or Artifact? Interactively Exploring Embedding Quality with TRACE | Edith Heiter et.al. | 2406.12953 | link |
2024-06-18 | Automatic generation of insights from workers' actions in industrial workflows with explainable Machine Learning | Francisco de Arriba-Pérez et.al. | 2406.12732 | null |
2024-06-18 | Learned Image Compression for HE-stained Histopathological Images via Stain Deconvolution | Maximilian Fischer et.al. | 2406.12623 | null |
2024-06-18 | Training Diffusion Models with Federated Learning | Matthijs de Goede et.al. | 2406.12575 | null |
2024-06-18 | Automated MRI Quality Assessment of Brain T1-weighted MRI in Clinical Data Warehouses: A Transfer Learning Approach Relying on Artefact Simulation | Sophie Loizillon et.al. | 2406.12448 | link |
2024-06-18 | AI-Assisted Human Evaluation of Machine Translation | Vilém Zouhar et.al. | 2406.12419 | link |
2024-06-18 | SDNIA-YOLO: A Robust Object Detection Model for Extreme Weather Conditions | Yuexiong Ding et.al. | 2406.12395 | null |
2024-06-17 | A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets | Bernhard Kerbl et.al. | 2406.12080 | null |
2024-06-17 | FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure | Ziyue Xu et.al. | 2406.12009 | link |
2024-06-17 | RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians | Bingling Li et.al. | 2406.11836 | null |
2024-06-17 | Latent Denoising Diffusion GAN: Faster sampling, Higher image quality | Luan Thanh Trinh et.al. | 2406.11713 | link |
2024-06-17 | Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT | Maximilian E. Tschuchnig et.al. | 2406.11650 | null |
2024-06-17 | Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation | Boxuan Lyu et.al. | 2406.11632 | null |
2024-06-17 | Compressed Skinning for Facial Blendshapes | Ladislav Kavan et.al. | 2406.11597 | null |
2024-06-17 | Energy Reduction Opportunities in HDR Video Encoding | Christian Herglotz et.al. | 2406.11492 | null |
2024-06-17 | A Dictionary Based Approach for Removing Out-of-Focus Blur | Uditangshu Aurangabadkar et.al. | 2406.11330 | link |
2024-06-17 | NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation | Niu Guanchen et.al. | 2406.11259 | null |
2024-06-17 | Incentivizing Quality Text Generation via Statistical Contracts | Eden Saig et.al. | 2406.11118 | link |
2024-06-16 | Parameter Blending for Multi-Camera Harmonization for Automotive Surround View Systems | Yuzhuo Ren et.al. | 2406.11066 | null |
2024-06-16 | SingMOS: An extensive Open-Source Singing Voice Dataset for MOS Prediction | Yuxun Tang et.al. | 2406.10911 | null |
2024-06-15 | MDeRainNet: An Efficient Neural Network for Rain Streak Removal from Macro-pixel Images | Tao Yan et.al. | 2406.10652 | null |
2024-06-15 | Exploring the Impact of AI-generated Image Tools on Professional and Non-professional Users in the Art and Design Fields | Yuying Tang et.al. | 2406.10640 | null |
2024-06-15 | Full reference point cloud quality assessment using support vector regression | Ryosuke Watanabe et.al. | 2406.10520 | link |
2024-06-15 | CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation | Wei Chen et.al. | 2406.10462 | null |
2024-06-14 | Consistency-diversity-realism Pareto fronts of conditional image generative models | Pietro Astolfi et.al. | 2406.10429 | null |
2024-06-14 | PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting | Alex Hanson et.al. | 2406.10219 | link |
2024-06-14 | AlignNet: Learning dataset score alignment functions to enable better training of speech quality estimators | Jaden Pieper et.al. | 2406.10205 | null |
2024-06-14 | D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video | Moritz Kappel et.al. | 2406.10078 | null |
2024-06-14 | Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment | Fei Zhou et.al. | 2406.09858 | null |
2024-06-14 | Full-reference Point Cloud Quality Assessment Using Spectral Graph Wavelets | Ryosuke Watanabe et.al. | 2406.09762 | null |
2024-06-14 | Compressed Video Quality Enhancement with Temporal Group Alignment and Fusion | Qiang Zhu et.al. | 2406.09693 | null |
2024-06-13 | DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer | Wei-Ting Chen et.al. | 2406.09622 | null |
2024-06-13 | Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment | Fengbin Guan et.al. | 2406.09546 | null |
2024-06-13 | Modeling Ambient Scene Dynamics for Free-view Synthesis | Meng-Li Shih et.al. | 2406.09395 | null |
2024-06-14 | WonderWorld: Interactive 3D Scene Generation from a Single Image | Hong-Xing Yu et.al. | 2406.09394 | null |
2024-06-13 | LRM-Zero: Training Large Reconstruction Models with Synthesized Data | Desai Xie et.al. | 2406.09371 | link |
2024-06-13 | CMC-Bench: Towards a New Paradigm of Visual Signal Compression | Chunyi Li et.al. | 2406.09356 | link |
2024-06-13 | StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning | Giuseppe Vecchio et.al. | 2406.09293 | null |
2024-06-13 | SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution | Soufiane Belharbi et.al. | 2406.09168 | link |
2024-06-13 | Adaptive Cooperative Streaming of Holographic Video Over Wireless Networks: A Proximal Policy Optimization Solution | Wanli Wen et.al. | 2406.08806 | null |
2024-06-13 | Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation | Mingwang Xu et.al. | 2406.08801 | null |
2024-06-13 | FouRA: Fourier Low Rank Adaptation | Shubhankar Borse et.al. | 2406.08798 | null |
2024-06-12 | Exploring Fact Memorization and Style Imitation in LLMs Using QLoRA: An Experimental Study and Quality Assessment Methods | Eugene Vyborov et.al. | 2406.08582 | null |
2024-06-12 | IMFL-AIGC: Incentive Mechanism Design for Federated Learning Empowered by Artificial Intelligence Generated Content | Guangjing Huang et.al. | 2406.08526 | null |
2024-06-12 | DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor | Juncheng Wu et.al. | 2406.08377 | link |
2024-06-12 | WMAdapter: Adding WaterMark Control to Latent Diffusion Models | Hai Ci et.al. | 2406.08337 | null |
2024-06-12 | Guiding In-Context Learning of LLMs through Quality Estimation for Machine Translation | Javad Pourmostafa Roshan Sharami et.al. | 2406.07970 | link |
2024-06-12 | DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera | Senyan Xu et.al. | 2406.07951 | link |
2024-06-12 | Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation | Jiadong Liang et.al. | 2406.07895 | null |
2024-06-11 | A PRISMA Driven Systematic Review of Publicly Available Datasets for Benchmark and Model Developments for Industrial Defect Detection | Can Akbas et.al. | 2406.07694 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499 | null |
2024-06-11 | Textual Similarity as a Key Metric in Machine Translation Quality Estimation | Kun Sun et.al. | 2406.07440 | null |
2024-06-11 | Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance | Ruxin Zheng et.al. | 2406.07399 | null |
2024-06-11 | DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling | Sixian Wang et.al. | 2406.07390 | null |
2024-06-11 | Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment | Takuto Igarashi et.al. | 2406.07280 | null |
2024-06-11 | Accurate estimate of the ESPRESSO fiber-injection losses inferred from integrated field-stabilization images | Tobias M. Schmidt et.al. | 2406.07193 | null |
2024-06-11 | Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation | Yuanhao Zhai et.al. | 2406.06890 | link |
2024-06-11 | A Subjective Quality Evaluation of 3D Mesh with Dynamic Level of Detail in Virtual Reality | Duc Nguyen et.al. | 2406.06888 | null |
2024-06-09 | Latent Diffusion Model-Enabled Real-Time Semantic Communication Considering Semantic Ambiguities and Channel Noises | Jianhua Pei et.al. | 2406.06644 | link |
2024-06-10 | Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Peize Sun et.al. | 2406.06525 | link |
2024-06-10 | MVGamba: Unify 3D Content Generation as State Space Sequence Modeling | Xuanyu Yi et.al. | 2406.06367 | null |
2024-06-10 | Federated learning in food research | Zuzanna Fendor et.al. | 2406.06202 | null |
2024-06-10 | Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios | Raül Pérez-Gonzalo et.al. | 2406.06165 | null |
2024-06-10 | JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis | Hyunjae Cho et.al. | 2406.06111 | null |
2024-06-10 | GAIA: Rethinking Action Quality Assessment for AI-Generated Videos | Zijian Chen et.al. | 2406.06087 | link |
2024-06-10 | FRAG: Frequency Adapting Group for Diffusion Video Editing | Sunjae Yoon et.al. | 2406.06044 | link |
2024-06-12 | MLCM: Multistep Consistency Distillation of Latent Diffusion Model | Qingsong Xie et.al. | 2406.05768 | link |
2024-06-08 | Energy-Efficient Approximate Full Adders Applying Memristive Serial IMPLY Logic For Image Processing | Seyed Erfan Fatemieh et.al. | 2406.05525 | null |
2024-06-08 | Blurry-Consistency Segmentation Framework with Selective Stacking on Differential Interference Contrast 3D Breast Cancer Spheroid | Thanh-Huy Nguyen et.al. | 2406.05349 | null |
2024-06-08 | Deep convolutional demosaicking network for multispectral polarization filter array | Tomoharu Ishiuchi et.al. | 2406.05312 | null |
2024-06-08 | YouTube SFV+HDR Quality Dataset | Yilin Wang et.al. | 2406.05305 | null |
2024-06-07 | Spectral Codecs: Spectrogram-Based Audio Codecs for High Quality Speech Synthesis | Ryan Langman et.al. | 2406.05298 | null |
2024-06-07 | GANetic Loss for Generative Adversarial Networks with a Focus on Medical Applications | Shakhnaz Akhmedova et.al. | 2406.05023 | link |
2024-06-07 | Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior | Tanvir Mahmud et.al. | 2406.04873 | link |
2024-06-07 | SMC++: Masked Learning of Unsupervised Video Semantic Compression | Yuan Tian et.al. | 2406.04765 | link |
2024-06-07 | The Active Optics System on the Vera C. Rubin Observatory: Optimal Control of Degeneracy Among the Large Number of Degrees of Freedom | Guillem Megias Homar et.al. | 2406.04656 | null |
2024-06-07 | GenzIQA: Generalized Image Quality Assessment using Prompt-Guided Latent Diffusion Models | Diptanu De et.al. | 2406.04654 | null |
2024-06-07 | StreamOptix: A Cross-layer Adaptive Video Delivery Scheme | Mufan Liu et.al. | 2406.04632 | link |
2024-06-07 | Attention Fusion Reverse Distillation for Multi-Lighting Image Anomaly Detection | Yiheng Zhang et.al. | 2406.04573 | null |
2024-06-06 | Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance | Reyhane Askari Hemmat et.al. | 2406.04551 | null |
2024-06-06 | A Versatile Collage Visualization Technique | Zhenyu Wang et.al. | 2406.04008 | null |
2024-06-06 | JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits | Minzhou Pan et.al. | 2406.03720 | link |
2024-06-06 | Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction | Diwen Wan et.al. | 2406.03697 | link |
2024-06-05 | Anatomy-based quality metric of diffusion-weighted MRI data for accurate derivation of muscle fiber orientation | Nadya Shusharina et.al. | 2406.03560 | null |
2024-06-05 | Globally and Locally Optimized Pannini Projection for High FoV Rendering of 360-degree Images | Falah Jabar et.al. | 2406.03282 | null |
2024-06-05 | FAPNet: An Effective Frequency Adaptive Point-based Eye Tracker | Xiaopeng Lin et.al. | 2406.03177 | null |
2024-06-05 | Dynamic 3D Gaussian Fields for Urban Areas | Tobias Fischer et.al. | 2406.03175 | null |
2024-06-05 | The new Herschel/PACS Point Source Catalogue | Gábor Marton et.al. | 2406.03116 | null |
2024-06-05 | A-Bench: Are LMMs Masters at Evaluating AI-generated Images? | Zicheng Zhang et.al. | 2406.03070 | link |
2024-06-05 | DifAttack++: Query-Efficient Black-Box Adversarial Attack via Hierarchical Disentangled Feature Space in Cross Domain | Jun Liu et.al. | 2406.03017 | link |
2024-06-05 | Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms | Firas Trabelsi et.al. | 2406.02832 | null |
2024-06-04 | ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation | Tianchen Zhao et.al. | 2406.02540 | link |
2024-06-04 | Guiding a Diffusion Model with a Bad Version of Itself | Tero Karras et.al. | 2406.02507 | link |
2024-06-04 | Towards AI-Assisted Sustainable Adaptive Video Streaming Systems: Tutorial and Survey | Reza Farahani et.al. | 2406.02302 | null |
2024-06-04 | I4VGen: Image as Stepping Stone for Text-to-Video Generation | Xiefan Guo et.al. | 2406.02230 | null |
2024-06-04 | OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection | Chenyang Huang et.al. | 2406.01919 | null |
2024-06-04 | Rank-based No-reference Quality Assessment for Face Swapping | Xinghui Zhou et.al. | 2406.01884 | null |
2024-06-03 | Video Coding with Cross-Component Sample Offset | Han Gao et.al. | 2406.01795 | null |
2024-06-03 | DEFT: Efficient Finetuning of Conditional Diffusion Models by Learning the Generalised |
Alexander Denker et.al. | 2406.01781 | link |
2024-06-03 | Enhancing Dynamic CT Image Reconstruction with Neural Fields Through Explicit Motion Regularizers | Pablo Arratia et.al. | 2406.01299 | null |
2024-06-03 | Capsule Enhanced Variational AutoEncoder for Underwater Image Reconstruction | Rita Pucci et.al. | 2406.01294 | link |
2024-06-03 | Dimba: Transformer-Mamba Diffusion Models | Zhengcong Fei et.al. | 2406.01159 | null |
2024-06-03 | Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline | Jan Lippemeier et.al. | 2406.01071 | null |
2024-06-03 | UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment | Hantao Zhou et.al. | 2406.01069 | link |
2024-06-03 | CLIP-Guided Attribute Aware Pretraining for Generalizable Image Quality Assessment | Daekyu Kwon et.al. | 2406.01020 | null |
2024-06-02 | EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing | Hadrien Reynaud et.al. | 2406.00808 | link |
2024-06-04 | Unsupervised Contrastive Analysis for Salient Pattern Detection using Conditional Diffusion Models | Cristiano Patrício et.al. | 2406.00772 | link |
2024-06-02 | W-Net: A Facial Feature-Guided Face Super-Resolution Network | Hao Liu et.al. | 2406.00676 | null |
2024-06-01 | Bilateral Guided Radiance Field Processing | Yuehao Wang et.al. | 2406.00448 | null |
2024-06-01 | Localize, Understand, Collaborate: Semantic-Aware Dragging via Intention Reasoner | Xing Cui et.al. | 2406.00432 | link |
2024-06-01 | Hybrid attention structure preserving network for reconstruction of under-sampled OCT images | Zezhao Guo et.al. | 2406.00279 | null |
2024-05-31 | Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis | Chaoyou Fu et.al. | 2405.21075 | null |
2024-05-31 | Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling | Jiatao Gu et.al. | 2405.21048 | null |
2024-05-31 | Tsang's resolution enhancement method for imaging with focused illumination | Alexander Duplinskiy et.al. | 2405.20979 | null |
2024-05-31 | Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation | Shuzhou Yang et.al. | 2405.20669 | link |
2024-05-30 | An Automatic Question Usability Evaluation Toolkit | Steven Moore et.al. | 2405.20529 | link |
2024-05-30 | Can No-Reference Quality-Assessment Methods Serve as Perceptual Losses for Super-Resolution? | Egor Kashkarov et.al. | 2405.20392 | null |
2024-05-30 | CoSy: Evaluating Textual Explanations of Neurons | Laura Kopf et.al. | 2405.20331 | link |
2024-05-31 | NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation | Pedro Martin et.al. | 2405.20078 | null |
2024-05-30 | Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion | Jiangkai Wu et.al. | 2405.20032 | null |
2024-06-03 | DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild | Honghao Fu et.al. | 2405.19996 | link |
2024-05-29 | CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning | Yiping Wang et.al. | 2405.19547 | null |
2024-05-29 | A Full-duplex Speech Dialogue Scheme Based On Large Language Models | Peng Wang et.al. | 2405.19487 | null |
2024-05-29 | VisTA-SR: Improving the Accuracy and Resolution of Low-Cost Thermal Imaging Cameras for Agriculture | Heesup Yun et.al. | 2405.19413 | null |
2024-05-29 | Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare | Hanwei Zhu et.al. | 2405.19298 | link |
2024-05-29 | A study on the adequacy of common IQA measures for medical images | Anna Breger et.al. | 2405.19224 | link |
2024-05-29 | A study of why we need to reassess full reference image quality assessment with medical images | Anna Breger et.al. | 2405.19097 | null |
2024-05-31 | Benchmarking and Improving Detail Image Caption | Hongyuan Dong et.al. | 2405.19092 | link |
2024-05-29 | Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization | Zhiwei Tang et.al. | 2405.18881 | link |
2024-05-29 | Descriptive Image Quality Assessment in the Wild | Zhiyuan You et.al. | 2405.18842 | null |
2024-05-29 | Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics | Zhangkai Ni et.al. | 2405.18790 | link |
2024-05-28 | Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers? | Zebin You et.al. | 2405.18029 | null |
2024-05-30 | Proof of Quality: A Costless Paradigm for Trustless Generative AI Model Inference on Blockchains | Zhenjie Zhang et.al. | 2405.17934 | null |
2024-05-30 | MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization | Tianchen Zhao et.al. | 2405.17873 | null |
2024-05-28 | PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild | Kun Yuan et.al. | 2405.17765 | null |
2024-05-28 | AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval | Sihe Zhang et.al. | 2405.17718 | null |
2024-05-27 | Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba | Jiahao Huang et.al. | 2405.17659 | null |
2024-05-27 | Partitioned Hankel-based Diffusion Models for Few-shot Low-dose CT Reconstruction | Wenhao Zhang et.al. | 2405.17167 | null |
2024-05-28 | F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting | Xiangyu Sun et.al. | 2405.17083 | null |
2024-05-29 | The Multi-Range Theory of Translation Quality Measurement: MQM scoring models and Statistical Quality Control | Arle Lommel et.al. | 2405.16969 | null |
2024-05-27 | EM Distillation for One-step Diffusion Models | Sirui Xie et.al. | 2405.16852 | null |
2024-05-27 | Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model | Shoma Iwai et.al. | 2405.16817 | link |
2024-05-26 | Coil Reweighting to Suppress Motion Artifacts in Real-Time Exercise Cine Imaging | Chong Chen et.al. | 2405.16715 | null |
2024-05-26 | Deep learning improved autofocus for motion artifact reduction and its application in quantitative susceptibility mapping | Chao Li et.al. | 2405.16664 | null |
2024-05-26 | Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models | Regev Cohen et.al. | 2405.16475 | null |
2024-05-25 | Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based Discrimination | Shelly Golan et.al. | 2405.16260 | link |
2024-05-25 | Maintaining and Managing Road Quality:Using MLP and DNN | Makgotso Jacqueline Maotwana et.al. | 2405.16196 | null |
2024-05-25 | Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection | Yun Zhu et.al. | 2405.16178 | null |
2024-05-24 | Diff-DTI: Fast Diffusion Tensor Imaging Using A Feature-Enhanced Joint Diffusion Model | Lang Zhang et.al. | 2405.15830 | null |
2024-05-24 | Erase to Enhance: Data-Efficient Machine Unlearning in MRI Reconstruction | Yuyang Xue et.al. | 2405.15517 | link |
2024-05-24 | Benchmarking Pre-trained Large Language Models' Potential Across Urdu NLP tasks | Munief Hassan Tahir et.al. | 2405.15453 | null |
2024-05-24 | Fieldscale: Locality-Aware Field-based Adaptive Rescaling for Thermal Infrared Image | Hyeonjae Gil et.al. | 2405.15395 | link |
2024-05-24 | CPT-Interp: Continuous sPatial and Temporal Motion Modeling for 4D Medical Image Interpolation | Xia Li et.al. | 2405.15385 | null |
2024-05-24 | Seeing the World through an Antenna's Eye: Reception Quality Visualization Using Incomplete Technical Signal Information | Leif Bergerhoff et.al. | 2405.15253 | null |
2024-05-24 | Improved Distribution Matching Distillation for Fast Image Synthesis | Tianwei Yin et.al. | 2405.14867 | link |
2024-05-23 | Physics-informed Score-based Diffusion Model for Limited-angle Reconstruction of Cardiac Computed Tomography | Shuo Han et.al. | 2405.14770 | null |
2024-05-23 | Convolutional Neural Network Model Observers Discount Signal-like Anatomical Structures During Search in Virtual Digital Breast Tomosynthesis Phantoms | Aditya Jonnalagadda et.al. | 2405.14720 | null |
2024-05-23 | OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance | Shuheng Ge et.al. | 2405.14709 | null |
2024-05-24 | Autoregressive Image Diffusion: Generation of Image Sequence and Application in MRI | Guanxiong Luo et.al. | 2405.14327 | link |
2024-05-23 | Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization | Zhibo Chen et.al. | 2405.14221 | null |
2024-05-22 | Uncertainty-aware Evaluation of Auxiliary Anomalies with the Expected Anomaly Posterior | Lorenzo Perini et.al. | 2405.13699 | null |
2024-05-22 | Euclid: Early Release Observations -- Programme overview and pipeline for compact- and diffuse-emission photometry | J. -C. Cuillandre et.al. | 2405.13496 | null |
2024-05-25 | Class-Conditional self-reward mechanism for improved Text-to-Image models | Safouane El Ghazouali et.al. | 2405.13473 | link |
2024-05-22 | Comparative Analysis of Hyperspectral Image Reconstruction Using Deep Learning for Agricultural and Biological Applications | Md. Toukir Ahmed et.al. | 2405.13331 | null |
2024-05-21 | Geometric Transformation Uncertainty for Improving 3D Fetal Brain Pose Prediction from Freehand 2D Ultrasound Videos | Jayroop Ramesh et.al. | 2405.13235 | link |
2024-05-24 | Computational Tradeoffs in Image Synthesis: Diffusion, Masked-Token, and Next-Token Prediction | Maciej Kilian et.al. | 2405.13218 | null |
2024-05-21 | NieR: Normal-Based Lighting Scene Rendering | Hongsheng Wang et.al. | 2405.13097 | null |
2024-05-21 | MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video | Hongsheng Wang et.al. | 2405.12806 | null |
2024-05-21 | Is Dataset Quality Still a Concern in Diagnosis Using Large Foundation Model? | Ziqin Lin et.al. | 2405.12584 | null |
2024-05-20 | Paired Conditional Generative Adversarial Network for Highly Accelerated Liver 4D MRI | Di Xu et.al. | 2405.12357 | null |
2024-05-20 | Deep learning-based hyperspectral image reconstruction for quality assessment of agro-product | Md. Toukir Ahmed et.al. | 2405.12313 | null |
2024-05-20 | GGAvatar: Geometric Adjustment of Gaussian Head Avatar | Xinyang Li et.al. | 2405.11993 | null |
2024-05-20 | On Efficient and Statistical Quality Estimation for Data Annotation | Jan-Christoph Klie et.al. | 2405.11919 | null |
2024-05-20 | ViViD: Video Virtual Try-on using Diffusion Models | Zixun Fang et.al. | 2405.11794 | null |
2024-05-19 | Solar image quality assessment: a proof of concept using Variance of Laplacian method and its application to optical atmospheric condition monitoring | Chu Wing So et.al. | 2405.11490 | null |
2024-05-18 | Sampling Strategies for Mitigating Bias in Face Synthesis Methods | Emmanouil Maragkoudakis et.al. | 2405.11320 | null |
2024-05-18 | Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching | Xingyu Miao et.al. | 2405.11252 | link |
2024-05-18 | Testing the Performance of Face Recognition for People with Down Syndrome | Christian Rathgeb et.al. | 2405.11240 | null |
2024-05-21 | SPOR: A Comprehensive and Practical Evaluation Method for Compositional Generalization in Data-to-Text Generation | Ziyao Xu et.al. | 2405.10650 | link |
2024-05-17 | Simultaneous Deep Learning of Myocardium Segmentation and T2 Quantification for Acute Myocardial Infarction MRI | Yirong Zhou et.al. | 2405.10570 | null |
2024-05-17 | Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network | Junhui Li et.al. | 2405.10518 | null |
2024-05-16 | Diversity-Aware Sign Language Production through a Pose Encoding Variational Autoencoder | Mohamed Ilyes Lakhal et.al. | 2405.10423 | null |
2024-05-16 | GS-Planner: A Gaussian-Splatting-based Planning Framework for Active High-Fidelity Reconstruction | Rui Jin et.al. | 2405.10142 | null |
2024-05-16 | Semantic Communication via Rate Distortion Perception Bottleneck | Zihe Zhao et.al. | 2405.09995 | null |
2024-05-16 | VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing | Binghui Chen et.al. | 2405.09985 | null |
2024-05-16 | NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge | Jie Liang et.al. | 2405.09923 | null |
2024-05-16 | DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection | Yuhao Sun et.al. | 2405.09882 | link |
2024-05-15 | Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment | Xinying Lin et.al. | 2405.09472 | null |
2024-05-16 | Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images | Memoona Aziz et.al. | 2405.09426 | null |
2024-05-15 | Application of Gated Recurrent Units for CT Trajectory Optimization | Yuedong Yuan et.al. | 2405.09333 | null |
2024-05-21 | Deep Blur Multi-Model (DeepBlurMM) - a strategy to mitigate the impact of image blur on deep learning model performance in histopathology image analysis | Yujie Xiang et.al. | 2405.09298 | null |
2024-05-15 | Sensitivity Decouple Learning for Image Compression Artifacts Reduction | Li Ma et.al. | 2405.09291 | null |
2024-05-15 | Shacl4Bib: custom validation of library data | Péter Király et.al. | 2405.09177 | null |
2024-05-18 | Scalable Image Coding for Humans and Machines Using Feature Fusion Network | Takahiro Shindo et.al. | 2405.09152 | link |
2024-05-15 | RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing | Jiamei Xiong et.al. | 2405.09083 | link |
2024-05-14 | Chemically peculiar stars on the pre-main sequence | L. Kueß et.al. | 2405.08946 | null |
2024-05-14 | Enhancing Blind Video Quality Assessment with Rich Quality-aware Features | Wei Sun et.al. | 2405.08745 | link |
2024-05-13 | The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective | Andrew Shin et.al. | 2405.08720 | null |
2024-05-14 | Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs | P. Mas-Buitrago et.al. | 2405.08703 | link |
2024-05-15 | RMT-BVQA: Recurrent Memory Transformer-based Blind Video Quality Assessment for Enhanced Video Content | Tianhao Peng et.al. | 2405.08621 | null |
2024-05-14 | Dual-Branch Network for Portrait Image Quality Assessment | Wei Sun et.al. | 2405.08555 | link |
2024-05-14 | WaterMamba: Visual State Space Model for Underwater Image Enhancement | Meisheng Guan et.al. | 2405.08419 | null |
2024-05-14 | Perivascular space Identification Nnunet for Generalised Usage (PINGU) | Benjamin Sinclair et.al. | 2405.08337 | link |
2024-05-14 | Progressive enhancement and restoration for mural images under low-light and defected conditions based on multi-receptive field strategy | Xiameng Wei et.al. | 2405.08245 | link |
2024-05-13 | Quality of Experience Optimization for Real-time XR Video Transmission with Energy Constraints | Guangjin Pan et.al. | 2405.07689 | null |
2024-05-15 | PRANK: a singular value based noise filtering approach | Francesco Trainotti et.al. | 2405.07578 | null |
2024-05-13 | Dehazing Remote Sensing and UAV Imagery: A Review of Deep Learning, Prior-based, and Hybrid Approaches | Gao Yu Lee et.al. | 2405.07520 | null |
2024-05-12 | Understanding and Evaluating Human Preferences for AI Generated Images with Instruction Tuning | Jiarui Wang et.al. | 2405.07346 | link |
2024-05-12 | PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification | Mohammad Shafiul Alam et.al. | 2405.07332 | link |
2024-05-12 | Stable Signature is Unstable: Removing Image Watermark from Diffusion Models | Yuepeng Hu et.al. | 2405.07145 | null |
2024-05-11 | Large Language Model-aided Edge Learning in Distribution System State Estimation | Renyou Xie et.al. | 2405.06999 | null |
2024-05-15 | Generation of Granular-Balls for Clustering Based on the Principle of Justifiable Granularity | Zihang Jia et.al. | 2405.06904 | null |
2024-05-11 | FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment | Jinglin Xu et.al. | 2405.06887 | link |
2024-05-10 | Multi-Object Tracking in the Dark | Xinzhe Wang et.al. | 2405.06600 | link |
2024-05-10 | Compression-Realized Deep Structural Network for Video Quality Enhancement | Hanchi Sun et.al. | 2405.06342 | null |
2024-05-09 | Perceptual Crack Detection for Rendered 3D Textured Meshes | Armin Shafiee Sarvestani et.al. | 2405.06143 | link |
2024-05-09 | Distilling Diffusion Models into Conditional GANs | Minguk Kang et.al. | 2405.05967 | null |
2024-05-09 | How Quality Affects Deep Neural Networks in Fine-Grained Image Classification | Joseph Smith et.al. | 2405.05742 | null |
2024-05-09 | LatentColorization: Latent Diffusion-Based Speaker Video Colorization | Rory Ward et.al. | 2405.05707 | null |
2024-05-09 | SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Space | Zeren Zhang et.al. | 2405.05636 | null |
2024-05-09 | Array SAR 3D Sparse Imaging Based on Regularization by Denoising Under Few Observed Data | Yangyang Wang et.al. | 2405.05565 | null |
2024-05-08 | Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation | Jonas Kohler et.al. | 2405.05224 | null |
2024-05-08 | Bridging the Gap Between Saliency Prediction and Image Quality Assessment | Kirillov Alexey et.al. | 2405.04997 | link |
2024-05-07 | Remote Diffusion | Kunal Sunil Kasodekar et.al. | 2405.04717 | null |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-05-07 | Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation | Dogucan Yaman et.al. | 2405.04327 | null |
2024-05-07 | Cross-IQA: Unsupervised Learning for Image Quality Assessment | Zhen Zhang et.al. | 2405.04311 | null |
2024-05-07 | Sora Detector: A Unified Hallucination Detection for Large Text-to-Video Models | Zhixuan Chu et.al. | 2405.04180 | link |
2024-05-07 | Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment | Aobo Li et.al. | 2405.04167 | link |
2024-05-07 | Lossy Compression with Data, Perception, and Classification Constraints | Yuhan Wang et.al. | 2405.04144 | null |
2024-05-07 | Joint Estimation of Identity Verification and Relative Pose for Partial Fingerprints | Xiongjun Guan et.al. | 2405.03959 | link |
2024-05-06 | AI-Driven Frameworks for Enhancing Data Quality in Big Data Ecosystems: Error_Detection, Correction, and Metadata Integration | Widad Elouataoui et.al. | 2405.03870 | null |
2024-05-06 | Accelerated MR Cholangiopancreatography with Deep Learning-based Reconstruction | Jinho Kim et.al. | 2405.03732 | null |
2024-05-06 | All-in-One Deep Learning Framework for MR Image Reconstruction | Geunu Jeong et.al. | 2405.03684 | null |
2024-05-06 | An Image Quality Evaluation and Masking Algorithm Based On Pre-trained Deep Neural Networks | Peng Jia et.al. | 2405.03408 | null |
2024-05-06 | Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement | Jiesong Bai et.al. | 2405.03349 | link |
2024-05-06 | Light-VQA+: A Video Quality Assessment Model for Exposure Correction with Vision-Language Guidance | Xunchu Zhou et.al. | 2405.03333 | link |
2024-05-06 | Multi-Modality Spatio-Temporal Forecasting via Self-Supervised Learning | Jiewen Deng et.al. | 2405.03255 | link |
2024-05-05 | Matten: Video Generation with Mamba-Attention | Yu Gao et.al. | 2405.03025 | null |
2024-05-05 | Design, analysis, and manufacturing of a glass-plastic hybrid minimalist aspheric panoramic annular lens | Shaohua Gao et.al. | 2405.02942 | null |
2024-05-05 | Residual-Conditioned Optimal Transport: Towards Structure-preserving Unpaired and Paired Image Restoration | Xiaole Tang et.al. | 2405.02843 | link |
2024-05-04 | Deep Image Restoration For Image Anti-Forensics | Eren Tahir et.al. | 2405.02751 | link |
2024-05-04 | DiffuseTrace: A Transparent and Flexible Watermarking Scheme for Latent Diffusion Model | Liangqi Lei et.al. | 2405.02696 | null |
2024-05-03 | On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? | Maxime Zanella et.al. | 2405.02266 | link |
2024-05-01 | Reference-Free Image Quality Metric for Degradation and Reconstruction Artifacts | Han Cui et.al. | 2405.02208 | null |
2024-05-03 | HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 | Miriam Jäger et.al. | 2405.02005 | null |
2024-05-03 | Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics | Rucha Deshpande et.al. | 2405.01822 | null |
2024-05-07 | Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration | Praveen Kumar Chandaliya et.al. | 2405.01273 | null |
2024-05-02 | Singular Value and Frame Decomposition-based Reconstruction for Atmospheric Tomography | Lukas Weissinger et.al. | 2405.01079 | null |
2024-05-01 | Brighteye: Glaucoma Screening with Color Fundus Photographs based on Vision Transformer | Hui Lin et.al. | 2405.00857 | link |
2024-05-01 | Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models | Xiaoshi Wu et.al. | 2405.00760 | null |
2024-05-01 | Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays | Andrei Chubarau et.al. | 2405.00670 | link |
2024-05-01 | Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning | Yuxi Xie et.al. | 2405.00451 | link |
2024-04-30 | Fast MRI Reconstruction Using Deep Learning-based Compressed Sensing: A Systematic Review | Mojtaba Safari et.al. | 2405.00241 | link |
2024-04-30 | Charting the Path Forward: CT Image Quality Assessment -- An In-Depth Review | Siyi Xun et.al. | 2405.00075 | null |
2024-04-30 | Beyond MOS: Subjective Image Quality Score Preprocessing Method Based on Perceptual Similarity | Lei Wang et.al. | 2404.19666 | null |
2024-04-30 | Perceptual Constancy Constrained Single Opinion Score Calibration for Image Quality Assessment | Lei Wang et.al. | 2404.19595 | null |
2024-04-30 | Causal Perception Inspired Representation Learning for Trustworthy Image Quality Assessment | Lei Wang et.al. | 2404.19567 | null |
2024-05-04 | Towards Real-world Video Face Restoration: A New Benchmark | Ziyan Chen et.al. | 2404.19500 | null |
2024-04-30 | NeRF-Insert: 3D Local Editing with Multimodal Control Signals | Benet Oriol Sabat et.al. | 2404.19204 | null |
2024-04-30 | Global Search Optics: Automatically Exploring Optimal Solutions to Compact Computational Imaging Systems | Yao Gao et.al. | 2404.19201 | null |
2024-04-30 | Advancing low-field MRI with a universal denoising imaging transformer: Towards fast and high-quality imaging | Zheren Zhu et.al. | 2404.19167 | link |
2024-04-29 | A Comprehensive Rubric for Annotating Pathological Speech | Mario Corrales-Astorgano et.al. | 2404.18851 | null |
2024-04-29 | Autonomous Quality and Hallucination Assessment for Virtual Tissue Staining and Digital Pathology | Luzhe Huang et.al. | 2404.18458 | null |
2024-04-29 | PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated Images | Jiquan Yuan et.al. | 2404.18409 | link |
2024-04-29 | G-Refine: A General Quality Refiner for Text-to-Image Generation | Chunyi Li et.al. | 2404.18343 | link |
2024-04-28 | An automated pipeline for computation and analysis of functional ventilation and perfusion lung MRI with matrix pencil decomposition: TrueLung | Orso Pusterla et.al. | 2404.18275 | null |
2024-04-28 | LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM | Zicheng Zhang et.al. | 2404.18203 | link |
2024-04-28 | Assessing Image Quality Using a Simple Generative Representation | Simon Raviv et.al. | 2404.18178 | link |
2024-04-28 | fMRI Exploration of Visual Quality Assessment | Yiming Zhang et.al. | 2404.18162 | null |
2024-04-27 | Quality Estimation with |
Tu Anh Dinh et.al. | 2404.18031 | null |
2024-04-27 | LpQcM: Adaptable Lesion-Quantification-Consistent Modulation for Deep Learning Low-Count PET Image Denoising | Menghua Xia et.al. | 2404.17994 | null |
2024-04-27 | From Optimization to Generalization: Fair Federated Learning against Quality Shift via Inter-Client Sharpness Matching | Nannan Wu et.al. | 2404.17805 | link |
2024-04-27 | Large Multi-modality Model Assisted AI-Generated Image Quality Assessment | Puyi Wang et.al. | 2404.17762 | link |
2024-04-27 | Segmentation Quality and Volumetric Accuracy in Medical Imaging | Zheyuan Zhang et.al. | 2404.17742 | null |
2024-04-27 | Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission | Mingyu Yang et.al. | 2404.17736 | link |
2024-04-26 | Attention-aware non-rigid image registration for accelerated MR imaging | Aya Ghoul et.al. | 2404.17621 | link |
2024-04-26 | Low Cost Machine Vision for Insect Classification | Danja Brandt et.al. | 2404.17488 | null |
2024-04-26 | S-IQA Image Quality Assessment With Compressive Sampling | Ronghua Liao et.al. | 2404.17170 | null |
2024-04-25 | ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images | Weiqi Li et.al. | 2404.16825 | null |
2024-04-25 | NTIRE 2024 Quality Assessment of AI-Generated Content Challenge | Xiaohong Liu et.al. | 2404.16687 | null |
2024-04-25 | Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior | Han Wang et.al. | 2404.16678 | null |
2024-04-25 | Application of RESNET50 Convolution Neural Network for the Extraction of Optical Parameters in Scattering Media | Bowen Deng et.al. | 2404.16647 | null |
2024-04-25 | COBRA -- COnfidence score Based on shape Regression Analysis for method-independent quality assessment of object pose estimation from single images | Panagiotis Sapoutzoglou et.al. | 2404.16471 | link |
2024-04-25 | PAD: Patch-Agnostic Defense against Adversarial Patch Attacks | Lihua Jing et.al. | 2404.16452 | link |
2024-04-25 | Cross-sensor super-resolution of irregularly sampled Sentinel-2 time series | Aimi Okabayashi et.al. | 2404.16409 | link |
2024-04-24 | AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results | Marcos V. Conde et.al. | 2404.16205 | link |
2024-04-24 | Quantitative Characterization of Retinal Features in Translated OCTA | Rashadul Hasan Badhon et.al. | 2404.16133 | null |
2024-04-24 | Assessment of the quality of a prediction | Roger Sewell et.al. | 2404.15764 | null |
2024-04-24 | A stochastic approach to estimate distribution grid state with confidence regions | Rasmus L. Olsen et.al. | 2404.15722 | null |
2024-04-24 | Deep Learning for Accelerated and Robust MRI Reconstruction: a Review | Reinhard Heckel et.al. | 2404.15692 | null |
2024-04-24 | Neural network-based recognition of multiple nanobubbles in graphene | Subin Kim et.al. | 2404.15658 | null |
2024-04-24 | PriorNet: A Novel Lightweight Network with Multidimensional Interactive Attention for Efficient Image Dehazing | Yutong Chen et.al. | 2404.15638 | null |
2024-04-24 | Direct Zernike Coefficient Prediction from Point Spread Functions and Extended Images using Deep Learning | Yong En Kok et.al. | 2404.15231 | null |
2024-04-23 | Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment | Tianwei Zhou et.al. | 2404.15163 | null |
2024-04-23 | Multi-Modal Prompt Learning on Blind Image Quality Assessment | Wensheng Pan et.al. | 2404.14949 | link |
2024-04-23 | Novel Topological Machine Learning Methodology for Stream-of-Quality Modeling in Smart Manufacturing | Jay Lee et.al. | 2404.14728 | null |
2024-04-22 | Tile-Weighted Rate-Distortion Optimized Packet Scheduling for 360 |
Haopeng Wang et.al. | 2404.14573 | null |
2024-04-25 | Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach | Tahmim Hossain et.al. | 2404.14560 | null |
2024-04-22 | Narrative Action Evaluation with Prompt-Guided Multimodal Interaction | Shiyi Zhang et.al. | 2404.14471 | link |
2024-04-22 | CrossScore: Towards Multi-View Image Evaluation and Scoring | Zirui Wang et.al. | 2404.14409 | null |
2024-04-22 | Experimental Validation of Ultrasound Beamforming with End-to-End Deep Learning for Single Plane Wave Imaging | Ryan A. L. Schoop et.al. | 2404.14188 | link |
2024-04-22 | Text in the Dark: Extremely Low-Light Text Image Enhancement | Che-Tsung Lin et.al. | 2404.14135 | null |
2024-04-22 | CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement Task | Kangzhen Yang et.al. | 2404.14132 | link |
2024-04-22 | GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | Hongyun Yu et.al. | 2404.14037 | null |
2024-04-22 | CoFInAl: Enhancing Action Quality Assessment with Coarse-to-Fine Instruction Alignment | Kanglei Zhou et.al. | 2404.13999 | link |
2024-04-22 | SI-FID: Only One Objective Indicator for Evaluating Stitched Images | Xinrui Zhang et.al. | 2404.13905 | null |
2024-04-21 | ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis | Zichen Tang et.al. | 2404.13711 | link |
2024-04-21 | Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap | Bowen Qu et.al. | 2404.13573 | link |
2024-04-21 | Cell Phone Image-Based Persian Rice Detection and Classification Using Deep Learning Techniques | Mahmood Saeedi kelishami et.al. | 2404.13555 | null |
2024-04-20 | Joint Quality Assessment and Example-Guided Image Processing by Disentangling Picture Appearance from Content | Abhinau K. Venkataramanan et.al. | 2404.13484 | null |
2024-04-20 | Cut-FUNQUE: An Objective Quality Model for Compressed Tone-Mapped High Dynamic Range Videos | Abhinau K. Venkataramanan et.al. | 2404.13452 | null |
2024-04-20 | HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression | Lei Lu et.al. | 2404.13372 | null |
2024-04-20 | PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt Condition | Xi Fang et.al. | 2404.13299 | null |
2024-04-20 | Beyond Score Changes: Adversarial Attack on No-Reference Image Quality Assessment from Two Perspectives | Chenxi Yang et.al. | 2404.13277 | null |
2024-04-19 | A New Multi-Picture Architecture for Learned Video Deinterlacing and Demosaicing with Parallel Deformable Convolution and Self-Attention Blocks | Ronglei Ji et.al. | 2404.13018 | link |
2024-04-19 | RadRotator: 3D Rotation of Radiographs with Diffusion Models | Pouria Rouzrokh et.al. | 2404.13000 | null |
2024-04-19 | Nuclei Instance Segmentation of Cryosectioned H&E Stained Histological Images using Triple U-Net Architecture | Zarif Ahmed et.al. | 2404.12986 | null |
2024-04-19 | FlyNeRF: NeRF-Based Aerial Mapping for High-Quality 3D Scene Reconstruction | Maria Dronova et.al. | 2404.12970 | null |
2024-04-19 | 3D Multi-frame Fusion for Video Stabilization | Zhan Peng et.al. | 2404.12887 | null |
2024-04-19 | ELEV-VISION-SAM: Integrated Vision Language and Foundation Model for Automated Estimation of Building Lowest Floor Elevation | Yu-Hsuan Ho et.al. | 2404.12606 | null |
2024-04-18 | Plane-wave compounding with adaptive joint coherence factor weighting | Nikunj Khetan et.al. | 2404.12533 | link |
2024-04-18 | Advancing Applications of Satellite Photogrammetry: Novel Approaches for Built-up Area Modeling and Natural Environment Monitoring using Stereo/Multi-view Satellite Image-derived 3D Data | Shengxi Gui et.al. | 2404.12487 | null |
2024-04-18 | On the Content Bias in Fréchet Video Distance | Songwei Ge et.al. | 2404.12391 | null |
2024-04-18 | Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models | Trevor J. Chan et.al. | 2404.12361 | null |
2024-04-18 | GraFIQs: Face Image Quality Assessment Using Gradient Magnitudes | Jan Niklas Kolf et.al. | 2404.12203 | link |
2024-04-18 | Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models | Yuzhu Cai et.al. | 2404.12104 | null |
2024-04-18 | Seeing Motion at Nighttime with an Event Camera | Haoyue Liu et.al. | 2404.11884 | link |
2024-04-18 | Automated tomographic assessment of structural defects of freeze-dried pharmaceuticals | Patric Müller et.al. | 2404.11867 | null |
2024-04-18 | Multiphoton super-resolution imaging via virtual structured illumination | Sumin Lim et.al. | 2404.11849 | null |
2024-04-17 | Analysis of blurring due to short T2 decay at different resolutions in 23Na MRI | Olga Dergachyova et.al. | 2404.11774 | null |
2024-04-17 | CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect | Minh Tran et.al. | 2404.11429 | null |
2024-04-17 | Achromatic Full Stokes Polarimetry Metasurface for Full-color Polarization Imaging in the Visible | Yueqiang Hu et.al. | 2404.11415 | null |
2024-04-17 | Toward Understanding the Disagreement Problem in Neural Network Feature Attribution | Niklas Koenen et.al. | 2404.11330 | link |
2024-04-17 | NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results | Xin Li et.al. | 2404.11313 | link |
2024-04-18 | Study on the static detection of ICF target based on muonic X-ray sphere encoded imaging | Dikai Li et.al. | 2404.11278 | null |
2024-04-17 | Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case | João Gabriel Vinholi et.al. | 2404.11243 | null |
2024-04-17 | ONOT: a High-Quality ICAO-compliant Synthetic Mugshot Dataset | Nicolò Di Domenico et.al. | 2404.11236 | null |
2024-04-17 | Deep Portrait Quality Assessment. A NTIRE 2024 Challenge Survey | Nicolas Chahine et.al. | 2404.11159 | link |
2024-04-17 | MaeFuse: Transferring Omni Features with Pretrained Masked Autoencoders for Infrared and Visible Image Fusion via Guided Training | Jiayang Li et.al. | 2404.11016 | null |
2024-04-16 | Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution | Yutao Yuan et.al. | 2404.10688 | link |
2024-04-16 | VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time | Sicheng Xu et.al. | 2404.10667 | null |
2024-04-16 | A Computer Vision-Based Quality Assessment Technique for the automatic control of consumables for analytical laboratories | Meriam Zribi et.al. | 2404.10454 | null |
2024-04-16 | OneActor: Consistent Character Generation via Cluster-Conditioned Guidance | Jiahao Wang et.al. | 2404.10267 | null |
2024-04-16 | Diffusion assisted image reconstruction in optoacoustic tomography | M. G. González et.al. | 2404.10239 | null |
2024-04-16 | Novel Method to Estimate Kinetic Microparameters from Dynamic Whole-Body Imaging in Regular-Axial Field-of-View PET Scanners | Kyung-Nam Lee et.al. | 2404.10197 | null |
2024-04-15 | Quality Assessment of Prompts Used in Code Generation | Mohammed Latif Siddiq et.al. | 2404.10155 | null |
2024-04-15 | ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis | Aashish Anantha Ramakrishnan et.al. | 2404.10141 | link |
2024-04-15 | Ti-Patch: Tiled Physical Adversarial Patch for no-reference video quality metrics | Victoria Leonenkova et.al. | 2404.09961 | link |
2024-04-15 | The Problem Of Image Super-Resolution, Denoising And Some Image Restoration Methods In Deep Learning Models | Ngoc-Giau Pham et.al. | 2404.09817 | null |
2024-04-15 | Language-Agnostic Modeling of Wikipedia Articles for Content Quality Assessment across Languages | Paramita Das et.al. | 2404.09764 | null |
2024-04-15 | Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement | Wenyi Lian et.al. | 2404.09735 | link |
2024-04-15 | AI Competitions and Benchmarks: Dataset Development | Romain Egele et.al. | 2404.09703 | null |
2024-04-15 | Are Large Language Models Reliable Argument Quality Annotators? | Nailia Mirzakhmedova et.al. | 2404.09696 | link |
2024-04-15 | Real-world Instance-specific Image Goal Navigation for Service Robots: Bridging the Domain Gap with Contrastive Learning | Taichi Sakaguchi et.al. | 2404.09645 | null |
2024-04-15 | AI-KD: Towards Alignment Invariant Face Image Quality Assessment Using Knowledge Distillation | Žiga Babnik et.al. | 2404.09555 | link |
2024-04-15 | WiTUnet: A U-Shaped Architecture Integrating CNN and Transformer for Improved Feature Alignment and Local Information Fusion | Bin Wang et.al. | 2404.09533 | link |
2024-04-15 | MarsQE: Semantic-Informed Quality Enhancement for Compressed Martian Image | Chengfeng Liu et.al. | 2404.09433 | null |
2024-04-14 | Exploring Generative AI for Sim2Real in Driving Data Synthesis | Haonan Zhao et.al. | 2404.09111 | null |
2024-04-13 | A Parametric Rate-Distortion Model for Video Transcoding | Maedeh Jamali et.al. | 2404.09029 | null |
2024-04-13 | THQA: A Perceptual Quality Assessment Database for Talking Heads | Yingjie Zhou et.al. | 2404.09003 | link |
2024-04-13 | PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation for Videos | Qi Zhao et.al. | 2404.08921 | null |
2024-04-12 | Multi-Branch Generative Models for Multichannel Imaging with an Application to PET/CT Joint Reconstruction | Noel Jeffrey Pinton et.al. | 2404.08748 | null |
2024-04-12 | Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation | Yanhao Zheng et.al. | 2404.08603 | link |
2024-04-12 | Self-Supervised k-Space Regularization for Motion-Resolved Abdominal MRI Using Neural Implicit k-Space Representation | Veronika Spieker et.al. | 2404.08350 | link |
2024-04-11 | Model-based Cleaning of the QUILT-1M Pathology Dataset for Text-Conditional Image Synthesis | Marc Aubreville et.al. | 2404.07676 | link |
2024-04-10 | GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models | Zewei Zhang et.al. | 2404.07206 | null |
2024-04-10 | Adversarial purification for no-reference image-quality metrics: applicability study and new methods | Aleksandr Gushchin et.al. | 2404.06957 | null |
2024-04-10 | Perception-Oriented Video Frame Interpolation via Asymmetric Blending | Guangyang Wu et.al. | 2404.06692 | link |
2024-04-10 | CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge | Yu Ying Chiu et.al. | 2404.06664 | null |
2024-04-09 | Encoder-Quantization-Motion-based Video Quality Metrics | Yixu Chen et.al. | 2404.06620 | null |
2024-04-09 | Low-Cost Generation and Evaluation of Dictionary Example Sentences | Bill Cai et.al. | 2404.06224 | null |
2024-04-09 | Image and Video Compression using Generative Sparse Representation with Fidelity Controls | Wei Jiang et.al. | 2404.06076 | null |
2024-04-09 | Prompt-driven Universal Model for View-Agnostic Echocardiography Analysis | Sekeun Kim et.al. | 2404.05916 | null |
2024-04-06 | Study of the effect of Sharpness on Blind Video Quality Assessment | Anantha Prabhu et.al. | 2404.05764 | null |
2024-04-08 | A Training-Free Plug-and-Play Watermark Framework for Stable Diffusion | Guokai Zhang et.al. | 2404.05607 | null |
2024-04-08 | UniFL: Improve Stable Diffusion via Unified Feedback Learning | Jiacheng Zhang et.al. | 2404.05595 | null |
2024-04-08 | Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance | Dazhong Shen et.al. | 2404.05384 | link |
2024-04-08 | Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt | Zhiqi Huang et.al. | 2404.05331 | null |
2024-04-08 | Progressive Alignment with VLM-LLM Feature to Augment Defect Classification for the ASE Dataset | Chih-Chung Hsu et.al. | 2404.05183 | null |
2024-04-08 | QMix: Quality-aware Learning with Mixed Noise for Robust Retinal Disease Diagnosis | Junlin Hou et.al. | 2404.05169 | null |
2024-04-07 | Data Conditioning for Subsurface Models with Single-Image Generative Adversarial Network (SinGAN) | Lei Liu et.al. | 2404.05068 | null |
2024-04-07 | LOGO: A Long-Form Video Dataset for Group Action Quality Assessment | Shiyi Zhang et.al. | 2404.05029 | link |
2024-04-07 | Dual-Scale Transformer for Large-Scale Single-Pixel Imaging | Gang Qu et.al. | 2404.05001 | link |
2024-04-07 | Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder | Yiyang Ma et.al. | 2404.04916 | null |
2024-04-07 | CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis | Gyeongjin Kang et.al. | 2404.04913 | null |
2024-04-07 | CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data | Wei Fang et.al. | 2404.04878 | null |
2024-04-07 | Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving | Jinlong Li et.al. | 2404.04804 | null |
2024-04-06 | Convolutional Neural Network Transformer (CNNT) for Fluorescence Microscopy image Denoising with Improved Generalization and Fast Adaptation | Azaan Rehman et.al. | 2404.04726 | null |
2024-04-09 | Computation and Critical Transitions of Rate-Distortion-Perception Functions With Wasserstein Barycenter | Chunhui Chen et.al. | 2404.04681 | null |
2024-04-06 | FastHDRNet: A new efficient method for SDR-to-HDR Translation | Siyuan Tian et.al. | 2404.04483 | null |
2024-04-06 | RoNet: Rotation-oriented Continuous Image Translation | Yi Li et.al. | 2404.04474 | null |
2024-04-05 | Physics-Inspired Synthesized Underwater Image Dataset | Reina Kaneko et.al. | 2404.03998 | link |
2024-04-05 | Towards introspective loop closure in 4D radar SLAM | Maximilian Hilger et.al. | 2404.03940 | null |
2024-04-07 | RaFE: Generative Radiance Fields Restoration | Zhongkai Wu et.al. | 2404.03654 | null |
2024-04-04 | AIGIQA-20K: A Large Database for AI-Generated Image Quality Assessment | Chunyi Li et.al. | 2404.03407 | null |
2024-04-04 | DI-Retinex: Digital-Imaging Retinex Theory for Low-Light Image Enhancement | Shangquan Sun et.al. | 2404.03327 | null |
2024-04-04 | CSR-dMRI: Continuous Super-Resolution of Diffusion MRI with Anatomical Structure-assisted Implicit Neural Representation Learning | Ruoyou Wu et.al. | 2404.03209 | null |
2024-04-02 | Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models | Jiachen Ma et.al. | 2404.02928 | null |
2024-04-03 | Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Keyu Tian et.al. | 2404.02905 | link |
2024-04-03 | Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression | I. Dror et.al. | 2404.02481 | null |
2024-04-03 | Imaging transformer for MRI denoising with the SNR unit training: enabling generalization across field-strengths, imaging contrasts, and anatomy | Hui Xue et.al. | 2404.02382 | null |
2024-04-02 | DSGNN: A Dual-View Supergrid-Aware Graph Neural Network for Regional Air Quality Estimation | Xin Zhang et.al. | 2404.01975 | null |
2024-04-02 | Event-assisted Low-Light Video Object Segmentation | Hebei Li et.al. | 2404.01945 | link |
2024-04-02 | PATCH -- Psychometrics-AssisTed benCHmarking of Large Language Models: A Case Study of Mathematics Proficiency | Qixiang Fang et.al. | 2404.01799 | link |
2024-04-02 | Super-Resolution Analysis for Landfill Waste Classification | Matias Molina et.al. | 2404.01790 | null |
2024-04-02 | Upsample Guidance: Scale Up Diffusion Models without Training | Juno Hwang et.al. | 2404.01709 | null |
2024-04-02 | Boosting Visual Recognition for Autonomous Driving in Real-world Degradations with Deep Channel Prior | Zhanwen Liu et.al. | 2404.01703 | link |
2024-04-02 | A CT Image Denoising Method with Residual Encoder-Decoder Network | Helena Shawn et.al. | 2404.01553 | null |
2024-04-01 | MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space | Armand Comas-Massagué et.al. | 2404.01296 | null |
2024-04-01 | New infrared camera of the Caucasian Mountain Observatory of the SAI MSU: design, main parameters, and first light | S. G. Zheltoukhov et.al. | 2404.01246 | null |
2024-04-01 | The Rate-Distortion-Perception Trade-off: The Role of Private Randomness | Yassine Hamdi et.al. | 2404.01111 | null |
2024-04-01 | AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional Images | Liu Yang et.al. | 2404.01024 | link |
2024-04-01 | Digital Twins for Supporting AI Research with Autonomous Vehicle Networks | Anıl Gürses et.al. | 2404.00954 | null |
2024-04-01 | Towards Memorization-Free Diffusion Models | Chen Chen et.al. | 2404.00922 | null |
2024-04-01 | Model-Agnostic Human Preference Inversion in Diffusion Models | Jeeyung Kim et.al. | 2404.00879 | null |
2024-03-31 | GAMA-IR: Global Additive Multidimensional Averaging for Fast Image Restoration | Youssef Mansour et.al. | 2404.00807 | null |
2024-03-31 | Personalized Neural Speech Codec | Inseon Jang et.al. | 2404.00791 | null |
2024-04-02 | DRCT: Saving Image Super-resolution away from Information Bottleneck | Chih-Chung Hsu et.al. | 2404.00722 | link |
2024-03-30 | Classification of Short Segment Pediatric Heart Sounds Based on a Transformer-Based Convolutional Neural Network | Md Hassanuzzaman et.al. | 2404.00470 | null |
2024-03-30 | Harmonizing Light and Darkness: A Symphony of Prior-guided Data Synthesis and Adaptive Focus for Nighttime Flare Removal | Lishen Qu et.al. | 2404.00313 | null |
2024-03-30 | Learned Scanpaths Aid Blind Panoramic Video Quality Assessment | Kanglong Fan et.al. | 2404.00252 | link |
2024-03-29 | Evolving Semantic Communication with Generative Model | Shunpu Tang et.al. | 2403.20237 | link |
2024-03-29 | Exploring Pathological Speech Quality Assessment with ASR-Powered Wav2Vec2 in Data-Scarce Context | Tuan Nguyen et.al. | 2403.20184 | null |
2024-03-29 | Unsupervised Tumor-Aware Distillation for Multi-Modal Brain Image Translation | Chuan Huang et.al. | 2403.20168 | link |
2024-03-29 | DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal | Yunhao Li et.al. | 2403.20013 | link |
2024-03-28 | Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality | Kyotaro Tokoro et.al. | 2403.19428 | link |
2024-03-28 | Imperceptible Protection against Style Imitation from Diffusion Models | Namhyuk Ahn et.al. | 2403.19254 | null |
2024-03-28 | DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation | Haonan Lin et.al. | 2403.19235 | null |
2024-03-28 | AAPMT: AGI Assessment Through Prompt and Metric Transformer | Benhao Huang et.al. | 2403.19101 | link |
2024-03-27 | TextCraftor: Your Text Encoder Can be Image Quality Controller | Yanyu Li et.al. | 2403.18978 | null |
2024-03-27 | Breaking the Limitations with Sparse Inputs by Variational Frameworks (BLIss) in Terahertz Super-Resolution 3D Reconstruction | Yiyao Zhang et.al. | 2403.18776 | link |
2024-03-27 | Bringing Textual Prompt to AI-Generated Image Quality Assessment | Bowen Qu et.al. | 2403.18714 | link |
2024-03-27 | qIoV: A Quantum-Driven Internet-of-Vehicles-Based Approach for Environmental Monitoring and Rapid Response Systems | Ankur Nahar et.al. | 2403.18622 | null |
2024-03-27 | Artifact Reduction in 3D and 4D Cone-beam Computed Tomography Images with Deep Learning -- A Review | Mohammadreza Amirian et.al. | 2403.18565 | null |
2024-03-27 | Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting | Haiwei Chen et.al. | 2403.18186 | null |
2024-03-26 | Pseudo-MRI-Guided PET Image Reconstruction Method Based on a Diffusion Probabilistic Model | Weijie Gan et.al. | 2403.18139 | null |
2024-03-26 | TDIP: Tunable Deep Image Processing, a Real Time Melt Pool Monitoring Solution | Javid Akhavan et.al. | 2403.18117 | null |
2024-03-26 | Cross-system biological image quality enhancement based on the generative adversarial network as a foundation for establishing a multi-institute microscopy cooperative network | Dominik Panek et.al. | 2403.18026 | null |
2024-03-26 | Improving Text-to-Image Consistency via Automatic Prompt Optimization | Oscar Mañas et.al. | 2403.17804 | null |
2024-03-26 | Can patient-specific acquisition protocol improve performance on defect detection task in myocardial perfusion SPECT? | Nu Ri Choi et.al. | 2403.17764 | null |
2024-03-26 | Panonut360: A Head and Eye Tracking Dataset for Panoramic Video | Yutong Xu et.al. | 2403.17708 | null |
2024-03-26 | AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation | Huawei Wei et.al. | 2403.17694 | link |
2024-03-26 | ExpressEdit: Video Editing with Natural Language and Sketching | Bekzat Tilekbay et.al. | 2403.17693 | null |
2024-03-26 | Practical Applications of Advanced Cloud Services and Generative AI Systems in Medical Image Analysis | Jingyu Xu et.al. | 2403.17549 | null |
2024-03-26 | ChatGPT Rates Natural Language Explanation Quality Like Humans: But on Which Scales? | Fan Huang et.al. | 2403.17368 | link |
2024-03-26 | AutoMRISimQA: an automated system for daily quality control of a 3T MRI simulator | Aitang Xing et.al. | 2403.17365 | null |
2024-03-25 | Latency-Aware Generative Semantic Communications with Pre-Trained Diffusion Models | Li Qiao et.al. | 2403.17256 | null |
2024-03-25 | PROSPECT: Precision Robot Spectroscopy Exploration and Characterization Tool | Nathaniel Hanson et.al. | 2403.17232 | null |
2024-03-25 | Comp4D: LLM-Guided Compositional 4D Scene Generation | Dejia Xu et.al. | 2403.16993 | null |
2024-03-25 | Towards Low-Latency and Energy-Efficient Hybrid P2P-CDN Live Video Streaming | Reza Farahani et.al. | 2403.16985 | null |
2024-03-25 | INPC: Implicit Neural Point Clouds for Radiance Field Rendering | Florian Hahlbohm et.al. | 2403.16862 | null |
2024-03-25 | C-arm inverse geometry CT for 3D cardiac chamber mapping | Jordan M. Slagowski et.al. | 2403.16779 | null |
2024-03-25 | FOOL: Addressing the Downlink Bottleneck in Satellite Computing with Neural Feature Compression | Alireza Furutanpey et.al. | 2403.16677 | link |
2024-03-25 | Enhancing Cross-Dataset EEG Emotion Recognition: A Novel Approach with Emotional EEG Style Transfer Network | Yijin Zhou et.al. | 2403.16540 | null |
2024-03-25 | Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework | Ziyao Huang et.al. | 2403.16510 | link |
2024-03-25 | Plaintext-Free Deep Learning for Privacy-Preserving Medical Image Analysis via Frequency Information Embedding | Mengyu Sun et.al. | 2403.16473 | null |
2024-03-25 | Residual Dense Swin Transformer for Continuous Depth-Independent Ultrasound Imaging | Jintong Hu et.al. | 2403.16384 | link |
2024-03-24 | Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis | Atefeh Khoshkhahtinat et.al. | 2403.16258 | null |
2024-03-24 | Passive Screen-to-Camera Communication | Seyed Keyarash Ghiasi et.al. | 2403.16185 | null |
2024-03-24 | Argument Quality Assessment in the Age of Instruction-Following Large Language Models | Henning Wachsmuth et.al. | 2403.16084 | null |
2024-03-23 | An edge detection-based deep learning approach for tear meniscus height measurement | Kesheng Wang et.al. | 2403.15853 | null |
2024-03-22 | Medical Image Data Provenance for Medical Cyber-Physical System | Vijay Kumar et.al. | 2403.15522 | null |
2024-03-22 | Time-efficient, high-resolution 3T whole-brain relaxometry using Cartesian 3D MR-STAT with CSF suppression | Hongyan Liu et.al. | 2403.15379 | link |
2024-03-22 | Ultrasound Imaging based on the Variance of a Diffusion Restoration Model | Yuxin Zhang et.al. | 2403.15316 | link |
2024-03-22 | Subjective Quality Assessment of Compressed Tone-Mapped High Dynamic Range Videos | Abhinau K. Venkataramanan et.al. | 2403.15061 | null |
2024-03-21 | On the exploitation of DCT statistics for cropping detectors | Claudio Vittorio Ragaglia et.al. | 2403.14789 | null |
2024-03-21 | From Handcrafted Features to LLMs: A Brief Survey for Machine Translation Quality Estimation | Haofei Zhao et.al. | 2403.14118 | null |
2024-03-20 | Multi-criteria approach for selecting an explanation from the set of counterfactuals produced by an ensemble of explainers | Ignacy Stępka et.al. | 2403.13940 | link |
2024-03-20 | Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models | Richard Osuala et.al. | 2403.13890 | link |
2024-03-20 | Hierarchical NeuroSymbolic Approach for Action Quality Assessment | Lauren Okamoto et.al. | 2403.13798 | link |
2024-03-20 | Step-Calibrated Diffusion for Biomedical Optical Image Restoration | Yiwei Lyu et.al. | 2403.13680 | link |
2024-03-20 | Defining metric-aware size-shape measures to validate and optimize curved high-order meshes | Guillermo Aparicio-Estrems et.al. | 2403.13528 | null |
2024-03-20 | AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation | Jingkun An et.al. | 2403.13352 | null |
2024-03-20 | Learning Novel View Synthesis from Heterogeneous Low-light Captures | Quan Zheng et.al. | 2403.13337 | null |
2024-03-19 | Super-High-Fidelity Image Compression via Hierarchical-ROI and Adaptive Quantization | Jixiang Luo et.al. | 2403.13030 | null |
2024-03-18 | Invisible Backdoor Attack Through Singular Value Decomposition | Wenmin Chen et.al. | 2403.13018 | null |
2024-03-19 | Toward Sustainable GenAI using Generation Directives for Carbon-Friendly Large Language Model Inference | Baolin Li et.al. | 2403.12900 | null |
2024-03-19 | VisualCritic: Making LMMs Perceive Visual Quality Like Humans | Zhipeng Huang et.al. | 2403.12806 | null |
2024-03-19 | Multi-Dimensional Machine Translation Evaluation: Model Evaluation and Resource for Korean | Dojun Park et.al. | 2403.12666 | link |
2024-03-19 | GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation | Quankai Gao et.al. | 2403.12365 | null |
2024-03-19 | Deep Few-view High-resolution Photon-counting Extremity CT at Halved Dose for a Clinical Trial | Mengzhou Li et.al. | 2403.12331 | null |
2024-03-18 | Motion and temporal B0 shift corrections for quantitative susceptibility mapping (QSM) and R2 mapping using dual-echo spiral navigators and conjugate-phase reconstruction* | Yuguang Meng et.al. | 2403.12230 | null |
2024-03-19 | Generic 3D Diffusion Adapter Using Controlled Multi-View Editing | Hansheng Chen et.al. | 2403.12032 | link |
2024-03-18 | Enhancing Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems | Bo-Han Lu et.al. | 2403.12024 | link |
2024-03-18 | VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model | Qi Zuo et.al. | 2403.12010 | null |
2024-03-19 | Subjective-Aligned Dateset and Metric for Text-to-Video Quality Assessment | Tengchuan Kou et.al. | 2403.11956 | link |
2024-03-18 | HyperColorization: Propagating spatially sparse noisy spectral clues for reconstructing hyperspectral images | M. Kerem Aydin et.al. | 2403.11935 | link |
2024-03-18 | Evaluating Text to Image Synthesis: Survey and Taxonomy of Image Quality Metrics | Sebastian Hartwig et.al. | 2403.11821 | null |
2024-03-18 | Hallucination in Perceptual Metric-Driven Speech Enhancement Networks | George Close et.al. | 2403.11732 | null |
2024-03-18 | FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events | Xiangyuan Wang et.al. | 2403.11662 | link |
2024-03-18 | CRS-Diff: Controllable Generative Remote Sensing Foundation Model | Datao Tang et.al. | 2403.11614 | link |
2024-03-18 | Hierarchical Frequency-based Upsampling and Refining for Compressed Video Quality Enhancement | Qianyu Zhang et.al. | 2403.11556 | null |
2024-03-18 | Fed3DGS: Scalable 3D Gaussian Splatting with Federated Learning | Teppei Suzuki et.al. | 2403.11460 | link |
2024-03-18 | Earth+: on-board satellite imagery compression leveraging historical earth observations | Kuntai Du et.al. | 2403.11434 | null |
2024-03-18 | Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization | Yujia Liu et.al. | 2403.11397 | link |
2024-03-18 | Simulating Wearable Urban Augmented Reality Experiences in VR: Lessons Learnt from Designing Two Future Urban Interfaces | Tram Thi Minh Tran et.al. | 2403.11377 | null |
2024-03-17 | Enhancing Bandwidth Efficiency for Video Motion Transfer Applications using Deep Learning Based Keypoint Prediction | Xue Bai et.al. | 2403.11337 | null |
2024-03-17 | Fidelity-preserving Learning-Based Image Compression: Loss Function and Subjective Evaluation Methodology | Shima Mohammadi et.al. | 2403.11241 | link |
2024-03-17 | Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment | Lorenzo Agnolucci et.al. | 2403.11176 | link |
2024-03-17 | Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model | Dian Zheng et.al. | 2403.11157 | link |
2024-03-17 | Interactive |
Yixiang Mao et.al. | 2403.11155 | null |
2024-03-17 | Hierarchical Generative Network for Face Morphing Attacks | Zuyuan He et.al. | 2403.11101 | null |
2024-03-17 | Endora: Video Generation Models as Endoscopy Simulators | Chenxin Li et.al. | 2403.11050 | null |
2024-03-16 | A Spectrum-based Image Denoising Method with Edge Feature Enhancement | Peter Luvton et.al. | 2403.11036 | null |
2024-03-16 | Quality-Aware Dynamic Resolution Adaptation Framework for Adaptive Video Streaming | Amritha Premkumar et.al. | 2403.10976 | link |
2024-03-16 | A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment | Tianhe Wu et.al. | 2403.10854 | link |
2024-03-16 | MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections | Mude Hui et.al. | 2403.10815 | link |
2024-03-16 | ContourDiff: Unpaired Image Translation with Contour-Guided Diffusion Models | Yuwen Chen et.al. | 2403.10786 | null |
2024-03-15 | Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation | Anton Pelykh et.al. | 2403.10731 | link |
2024-03-15 | EAGLE: An Edge-Aware Gradient Localization Enhanced Loss for CT Image Reconstruction | Yipeng Sun et.al. | 2403.10695 | link |
2024-03-15 | A General Method to Incorporate Spatial Information into Loss Functions for GAN-based Super-resolution Models | Xijun Wang et.al. | 2403.10589 | null |
2024-03-21 | Deep Bi-directional Attention Network for Image Super-Resolution Quality Assessment | Yixiao Li et.al. | 2403.10406 | null |
2024-03-15 | PASTA: Towards Flexible and Efficient HDR Imaging Via Progressively Aggregated Spatio-Temporal Aligment | Xiaoning Liu et.al. | 2403.10376 | null |
2024-03-15 | CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement | Qiang Zhu et.al. | 2403.10362 | link |
2024-03-15 | Context-Semantic Quality Awareness Network for Fine-Grained Visual Categorization | Qin Xu et.al. | 2403.10298 | null |
2024-03-15 | Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder | Jinseok Kim et.al. | 2403.10255 | null |
2024-03-15 | Perceptual Quality-based Model Training under Annotator Label Uncertainty | Chen Zhou et.al. | 2403.10190 | null |
2024-03-15 | Animate Your Motion: Turning Still Images into Dynamic Videos | Mingxiao Li et.al. | 2403.10179 | null |
2024-03-15 | PQDynamicISP: Dynamically Controlled Image Signal Processor for Any Image Sensors Pursuing Perceptual Quality | Masakazu Yoshimura et.al. | 2403.10091 | null |
2024-03-15 | Learning Physical Dynamics for Object-centric Visual Prediction | Huilin Xu et.al. | 2403.10079 | null |
2024-03-15 | Contrastive Pre-Training with Multi-View Fusion for No-Reference Point Cloud Quality Assessment | Ziyu Shan et.al. | 2403.10066 | null |
2024-03-15 | PAME: Self-Supervised Masked Autoencoder for No-Reference Point Cloud Quality Assessment | Ziyu Shan et.al. | 2403.10061 | null |
2024-03-14 | ProMark: Proactive Diffusion Watermarking for Causal Attribution | Vishal Asnani et.al. | 2403.09914 | null |
2024-03-14 | MultiGripperGrasp: A Dataset for Robotic Grasping from Parallel Jaw Grippers to Dexterous Hands | Luis Felipe Casas Murrilo et.al. | 2403.09841 | null |
2024-03-13 | PICNIQ: Pairwise Comparisons for Natural Image Quality Assessment | Nicolas Chahine et.al. | 2403.09746 | link |
2024-03-14 | Renovating Names in Open-Vocabulary Segmentation Benchmarks | Haiwen Huang et.al. | 2403.09593 | null |
2024-03-14 | 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation | Frank Zhang et.al. | 2403.09439 | null |
2024-03-14 | StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images | Robert Jewsbury et.al. | 2403.09302 | link |
2024-03-20 | D-YOLO a robust framework for object detection in adverse weather conditions | Zihan Chu et.al. | 2403.09233 | null |
2024-03-14 | Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts | Byeongjun Park et.al. | 2403.09176 | link |
2024-03-14 | Dial-insight: Fine-tuning Large Language Models with High-Quality Domain-Specific Data Preventing Capability Collapse | Jianwei Sun et.al. | 2403.09167 | null |
2024-03-15 | NTIRE 2023 Image Shadow Removal Challenge Technical Report: Team IIM_TTI | Yuki Kondo et.al. | 2403.08995 | link |
2024-03-13 | Structural Positional Encoding for knowledge integration in transformer-based medical process monitoring | Christopher Irwin et.al. | 2403.08836 | link |
2024-03-13 | VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Enric Corona et.al. | 2403.08764 | null |
2024-03-13 | Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI | Shihan Qiu et.al. | 2403.08749 | null |
2024-03-14 | GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing | Jing Wu et.al. | 2403.08733 | link |
2024-03-13 | Diffusion-based Iterative Counterfactual Explanations for Fetal Ultrasound Image Quality Assessment | Paraskevas Pegios et.al. | 2403.08700 | null |
2024-03-13 | Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages | Rik van Noord et.al. | 2403.08693 | null |
2024-03-13 | Physics-Guided Inverse Regression for Crop Quality Assessment | David Shulman et.al. | 2403.08653 | null |
2024-03-14 | GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting | Xinjie Zhang et.al. | 2403.08551 | link |
2024-03-13 | Masked Generative Story Transformer with Character Guidance and Caption Augmentation | Christos Papadimitriou et.al. | 2403.08502 | link |
2024-03-13 | Gaussian Splatting in Style | Abhishek Saroha et.al. | 2403.08498 | null |
2024-03-13 | Protocol Optimization for Functional Cardiac CT Imaging Using Noise Emulation in the Raw Data Domain | Zhye Yin et.al. | 2403.08486 | null |
2024-03-13 | PFStorer: Personalized Face Restoration and Super-Resolution | Tuomas Varanka et.al. | 2403.08436 | null |
2024-03-13 | AADNet: Attention aware Demoiréing Network | M Rakesh Reddy et.al. | 2403.08384 | null |
2024-03-13 | StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields | Hongbin Xu et.al. | 2403.08310 | link |
2024-03-13 | IG-FIQA: Improving Face Image Quality Assessment through Intra-class Variance Guidance robust to Inaccurate Pseudo-Labels | Minsoo Kim et.al. | 2403.08256 | null |
2024-03-13 | PNeSM: Arbitrary 3D Scene Stylization via Prompt-Based Neural Style Mapping | Jiafu Chen et.al. | 2403.08252 | null |
2024-03-15 | A Dual-domain Regularization Method for Ring Artifact Removal of X-ray CT | Hongyang Zhu et.al. | 2403.08247 | null |
2024-03-12 | Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Shihao Zhao et.al. | 2403.07860 | link |
2024-03-18 | BraSyn 2023 challenge: Missing MRI synthesis and the effect of different learning objectives | Ivo M. Baltruschat et.al. | 2403.07800 | null |
2024-03-12 | Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation | Michael Ogezi et.al. | 2403.07605 | null |
2024-03-12 | Learning Correction Errors via Frequency-Self Attention for Blind Image Super-Resolution | Haochen Sun et.al. | 2403.07390 | null |
2024-03-12 | Time-Efficient Light-Field Acquisition Using Coded Aperture and Events | Shuji Habuchi et.al. | 2403.07244 | null |
2024-03-10 | Propensity-score matching analysis in COVID-19-related studies: a method and quality systematic review | Chunhui Gu et.al. | 2403.07023 | null |
2024-03-11 | BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion | Xuan Ju et.al. | 2403.06976 | link |
2024-03-11 | Applicability of oculomics for individual risk prediction: Repeatability and robustness of retinal Fractal Dimension using DART and AutoMorph | Justin Engelmann et.al. | 2403.06950 | null |
2024-03-11 | Monitoring the Venice Lagoon: an IoT Cloud-Based Sensor Nerwork Approach | Filippo Campagnaro et.al. | 2403.06915 | null |
2024-03-11 | COOD: Combined out-of-distribution detection using multiple measures for anomaly & novel class detection in large-scale hierarchical classification | L. E. Hogeweg et.al. | 2403.06874 | null |
2024-03-20 | QUASAR: QUality and Aesthetics Scoring with Advanced Representations | Sergey Kastryulin et.al. | 2403.06866 | null |
2024-03-11 | A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos | Weixia Zhang et.al. | 2403.06421 | link |
2024-03-11 | Comparison of No-Reference Image Quality Models via MAP Estimation in Diffusion Latents | Weixia Zhang et.al. | 2403.06406 | null |
2024-03-11 | Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models | Yang Zhang et.al. | 2403.06381 | link |
2024-03-15 | ACM MMSys 2024 Bandwidth Estimation in Real Time Communications Challenge | Sami Khairy et.al. | 2403.06324 | link |
2024-03-10 | Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising | Yuang Wang et.al. | 2403.06069 | null |
2024-03-09 | IOI: Invisible One-Iteration Adversarial Attack on No-Reference Image- and Video-Quality Metrics | Ekaterina Shumitskaya et.al. | 2403.05955 | link |
2024-03-09 | Wavelet-Like Transform-Based Technology in Response to the Call for Proposals on Neural Network-Based Image Coding | Cunhui Dong et.al. | 2403.05937 | null |
2024-03-08 | Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis | Muxi Chen et.al. | 2403.05125 | link |
2024-03-08 | CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model | Pengwei Yin et.al. | 2403.05124 | null |
2024-03-08 | Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile | Seokjun Lee et.al. | 2403.05093 | link |
2024-03-08 | Improving Diffusion-Based Generative Models via Approximated Optimal Transport | Daegyu Kim et.al. | 2403.05069 | link |
2024-03-08 | PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts | Zewen Chen et.al. | 2403.04993 | link |
2024-03-08 | StereoDiffusion: Training-Free Stereo Image Generation Using Latent Diffusion Models | Lezhong Wang et.al. | 2403.04965 | link |
2024-03-07 | BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling | Cheng Peng et.al. | 2403.04926 | link |
2024-03-17 | PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Junsong Chen et.al. | 2403.04692 | link |
2024-03-07 | A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images | Cristiana Tiago et.al. | 2403.04612 | null |
2024-03-08 | Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | Evangelos Skartados et.al. | 2403.04508 | null |
2024-03-07 | FriendNet: Detection-Friendly Dehazing Network | Yihua Fan et.al. | 2403.04443 | link |
2024-03-07 | MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment | Kanglei Zhou et.al. | 2403.04398 | link |
2024-03-07 | Self-Evaluation of Large Language Model based on Glass-box Features | Hui Huang et.al. | 2403.04222 | link |
2024-03-06 | Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer | Naifu Xue et.al. | 2403.03736 | null |
2024-03-06 | Development and evaluation of Artificial Intelligence techniques for IoT data quality assessment and curation | Laura Martín et.al. | 2403.03661 | null |
2024-03-06 | A Connector for Integrating NGSI-LD Data into Open Data Portals | Laura Martín et.al. | 2403.03648 | null |
2024-03-06 | Low-Dose CT Image Reconstruction by Fine-Tuning a UNet Pretrained for Gaussian Denoising for the Downstream Task of Image Enhancement | Tim Selig et.al. | 2403.03551 | null |
2024-03-06 | Combined optimization ghost imaging based on random speckle field | Zhiqing Yang et.al. | 2403.03426 | null |
2024-03-06 | DaISy: Diffuser-aided Sub-THz Imaging System | Shao-Hsuan Wu et.al. | 2403.03383 | null |
2024-03-05 | Imaging the event horizon of M87 from space on different timescales* | Anastasia Shlentsova et.al. | 2403.03327 | null |
2024-03-05 | MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets | Hossein Aboutalebi et.al. | 2403.03194 | link |
2024-03-05 | Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity | Hagyeong Lee et.al. | 2403.02944 | link |
2024-03-05 | DIFNet: SAR RFI suppression based on domain invariant features | Fuping Fang et.al. | 2403.02894 | null |
2024-03-05 | Rehabilitation Exercise Quality Assessment through Supervised Contrastive Learning with Hard and Soft Negatives | Mark Karlov et.al. | 2403.02772 | null |
2024-03-04 | Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted Sample Selection | Shitao Chen et.al. | 2403.01978 | link |
2024-03-04 | Revisiting the dust torus size-luminosity relation based on a uniform reverberation mapping analysis | Amit Kumar Mandal et.al. | 2403.01885 | null |
2024-03-04 | PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis | Zhengyao Lv et.al. | 2403.01852 | link |
2024-03-04 | ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models | Lukas Höllein et.al. | 2403.01807 | link |
2024-03-04 | Development of a near-infrared wide-field integral field unit by ultra-precision diamond cutting | Kosuke Kushibiki et.al. | 2403.01668 | null |
2024-03-04 | Neural Network Assisted Lifting Steps For Improved Fully Scalable Lossy Image Compression in JPEG 2000 | Xinyue Li et.al. | 2403.01647 | link |
2024-03-05 | 3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos | Jiakai Sun et.al. | 2403.01444 | link |
2024-03-02 | NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning | Linsheng Chen et.al. | 2403.01325 | link |
2024-03-02 | Beyond Night Visibility: Adaptive Multi-Scale Fusion of Infrared and Visible Images | Shufan Pei et.al. | 2403.01083 | null |
2024-03-02 | LLMCRIT: Teaching Large Language Models to Use Criteria | Weizhe Yuan et.al. | 2403.01069 | link |
2024-03-01 | Near-Real-Time Mueller Polarimetric Image Processing for Neurosurgical Intervention | Stefano Moriconi et.al. | 2403.00893 | null |
2024-03-01 | Gate-set evaluation metrics for closed-loop optimal control on nitrogen-vacancy center ensembles in diamond | Philipp J. Vetter et.al. | 2403.00616 | null |
2024-03-01 | Equilibrium Model with Anisotropy for Model-Based Reconstruction in Magnetic Particle Imaging | Marco Maass et.al. | 2403.00602 | link |
2024-03-01 | Data Quality Assessment: Challenges and Opportunities | Sedir Mohammed et.al. | 2403.00526 | null |
2024-03-01 | Phase retrieval beyond the homogeneous object assumption for X-ray in-line holographic imaging | Jens Lucht et.al. | 2403.00461 | null |
2024-03-01 | An Ordinal Diffusion Model for Generating Medical Images with Different Severity Levels | Shumpei Takezaki et.al. | 2403.00452 | null |
2024-03-01 | Assessing objective quality metrics for JPEG and MPEG point cloud coding | Davi Lazzarotto et.al. | 2403.00410 | null |
2024-03-01 | List-Mode PET Image Reconstruction Using Dykstra-Like Splitting | Kibo Ote et.al. | 2403.00394 | null |
2024-03-01 | Optimization of Array Encoding for Ultrasound Imaging | Jacob Spainhour et.al. | 2403.00289 | link |
2024-03-01 | Deep-learning-based Magnetic Resonance Simultaneous Multislice Imaging Using Holographic Image Decoding | Satoshi Ito et.al. | 2403.00220 | null |
2024-03-03 | RoadRunner - Learning Traversability Estimation for Autonomous Off-road Driving | Jonas Frey et.al. | 2402.19341 | null |
2024-02-29 | Integral field spectroscopy supports atmospheric optics to reveal the finite outer scale of the turbulence | Begoña García-Lorenzo et.al. | 2402.19337 | null |
2024-03-13 | Modular Blind Video Quality Assessment | Wen Wen et.al. | 2402.19276 | link |
2024-02-29 | Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts | Cansu Korkmaz et.al. | 2402.19215 | link |
2024-02-29 | Disentangling representations of retinal images with generative models | Sarah Müller et.al. | 2402.19186 | link |
2024-02-29 | Trajectory Consistency Distillation | Jianbin Zheng et.al. | 2402.19159 | link |
2024-02-29 | Atmospheric Turbulence Removal with Video Sequence Deep Visual Priors | P. Hill et.al. | 2402.19041 | null |
2024-02-28 | Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation | Yuan Ge et.al. | 2402.18191 | link |
2024-02-28 | NiteDR: Nighttime Image De-Raining with Cross-View Sensor Cooperative Learning for Dynamic Driving Scenes | Cidan Shi et.al. | 2402.18172 | link |
2024-03-02 | G4G:A Generic Framework for High Fidelity Talking Face Generation with Fine-grained Intra-modal Alignment | Juan Zhang et.al. | 2402.18122 | null |
2024-02-28 | Improvement Of Audiovisual Quality Estimation Using A Nonlinear Autoregressive Exogenous Neural Network And Bitstream Parameters | Koffi Kossi et.al. | 2402.18056 | null |
2024-02-28 | PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis | Jason J. Yu et.al. | 2402.17986 | null |
2024-02-28 | Rapid hyperspectral photothermal mid-infrared spectroscopic imaging from sparse data for gynecologic cancer tissue subtyping | Reza Reihanisaransari et.al. | 2402.17960 | null |
2024-02-29 | QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction | Ishak Ayad et.al. | 2402.17951 | null |
2024-02-27 | Accelerated Real-time Cine and Flow under In-magnet Staged Exercise | Preethi Chandrasekaran et.al. | 2402.17877 | null |
2024-02-27 | A Performance Evaluation of Filtered Delay Multiply and Sum Beamforming for Ultrasound Localization Microscopy: Preliminary Results | A. N. Madhavanunni et.al. | 2402.17643 | null |
2024-02-28 | Black-box Adversarial Attacks Against Image Quality Assessment Models | Yu Ran et.al. | 2402.17533 | null |
2024-02-27 | Bit Distribution Study and Implementation of Spatial Quality Map in the JPEG-AI Standardization | Panqi Jia et.al. | 2402.17470 | null |
2024-02-27 | VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction | Jiaqi Lin et.al. | 2402.17427 | null |
2024-02-27 | Sora Generates Videos with Stunning Geometrical Consistency | Xuanyi Li et.al. | 2402.17403 | null |
2024-03-10 | Learning Exposure Correction in Dynamic Scenes | Jin Liu et.al. | 2402.17296 | link |
2024-02-27 | DivAvatar: Diverse 3D Avatar Generation with a Single Prompt | Weijing Tao et.al. | 2402.17292 | null |
2024-03-01 | Advancing Generative Model Evaluation: A Novel Algorithm for Realistic Image Synthesis and Comparison in OCR System | Majid Memari et.al. | 2402.17204 | null |
2024-03-19 | Enhancing Quality of Compressed Images by Mitigating Enhancement Bias Towards Compression Domain | Qunliang Xing et.al. | 2402.17200 | null |
2024-02-27 | SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution | Chengcheng Wang et.al. | 2402.17133 | link |
2024-02-27 | T-HITL Effectively Addresses Problematic Associations in Image Generation and Maintains Overall Visual Quality | Susan Epstein et.al. | 2402.17101 | null |
2024-02-26 | Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids | Jasper Kirton-Wingate et.al. | 2402.16757 | null |
2024-02-29 | MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model | Chunyi Li et.al. | 2402.16749 | link |
2024-03-04 | Towards Open-ended Visual Quality Comparison | Haoning Wu et.al. | 2402.16641 | null |
2024-02-26 | Distortion-Controlled Dithering with Reduced Recompression Rate | Morriel Kasher et.al. | 2402.16447 | null |
2024-02-26 | Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues | Tassadaq Hussain et.al. | 2402.16394 | null |
2024-02-26 | Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech | Szu-Wei Fu et.al. | 2402.16321 | link |
2024-02-24 | Design, Implementation and Analysis of a Compressed Sensing Photoacoustic Projection Imaging System | Markus Haltmeier et.al. | 2402.15750 | null |
2024-02-23 | Benchmarking the Robustness of Panoptic Segmentation for Automated Driving | Yiting Wang et.al. | 2402.15469 | null |
2024-02-23 | Ten computational challenges in human virome studies | Yifan Wu et.al. | 2402.15186 | null |
2024-02-23 | The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling | Jiajun Ma et.al. | 2402.15170 | null |
2024-02-22 | Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis | Willi Menapace et.al. | 2402.14797 | null |
2024-02-25 | Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening | Zhenrong Shen et.al. | 2402.14707 | null |
2024-02-22 | Diffusion Model Based Visual Compensation Guidance and Visual Difference Analysis for No-Reference Image Quality Assessment | Zhaoyang Wang et.al. | 2402.14401 | null |
2024-02-21 | Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting | Joongho Jo et.al. | 2402.13827 | null |
2024-02-20 | Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control | Denis Lukovnikov et.al. | 2402.13404 | null |
2024-02-24 | Denoising OCT Images Using Steered Mixture of Experts with Multi-Model Inference | Aytaç Özkan et.al. | 2402.12735 | null |
2024-02-20 | Simpson's Paradox and the Accuracy-Fluency Tradeoff in Translation | Zheng Wei Lim et.al. | 2402.12690 | null |
2024-02-21 | Robust-Wide: Robust Watermarking against Instruction-driven Image Editing | Runyi Hu et.al. | 2402.12688 | link |
2024-02-20 | X-ray multibeam ptychography at up to 20 keV: nano-lithography enhances X-ray nano-imaging | Tang Li et.al. | 2402.12082 | null |
2024-02-19 | A Lightweight Parallel Framework for Blind Image Quality Assessment | Qunyue Huang et.al. | 2402.12043 | null |
2024-02-18 | Self-seeding and Multi-intent Self-instructing LLMs for Generating Intent-aware Information-Seeking dialogs | Arian Askari et.al. | 2402.11633 | link |
2024-02-16 | Path Loss Modeling for RIS-Assisted Wireless System with Direct Link and Elevation Factors | Vinay Kumar Chapala et.al. | 2402.10419 | null |
2024-02-15 | Deep Spectral Meshes: Multi-Frequency Facial Mesh Processing with Graph Neural Networks | Robert Kosk et.al. | 2402.10365 | null |
2024-02-15 | Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community | Arman Isajanyan et.al. | 2402.09872 | link |
2024-02-15 | How to Train Data-Efficient LLMs | Noveen Sachdeva et.al. | 2402.09668 | null |
2024-02-14 | TAI-GAN: A Temporally and Anatomically Informed Generative Adversarial Network for early-to-late frame conversion in dynamic cardiac PET inter-frame motion correction | Xueqi Guo et.al. | 2402.09567 | null |
2024-02-14 | Assessing test artifact quality -- A tertiary study | Huynh Khanh Vi Tran et.al. | 2402.09541 | null |
2024-02-14 | LL-GABR: Energy Efficient Live Video Streaming Using Reinforcement Learning | Adithya Raman et.al. | 2402.09392 | null |
2024-02-14 | Generalized Portrait Quality Assessment | Nicolas Chahine et.al. | 2402.09178 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-19 | Flowing from Words to Pixels: A Framework for Cross-Modality Evolution | Qihao Liu et.al. | 2412.15213 | null |
2024-12-18 | Super-Resolution Generative Adversarial Network for Data Compression of Direct Numerical Simulations | Ludovico Nista et.al. | 2412.14150 | null |
2024-12-17 | Learning of Patch-Based Smooth-Plus-Sparse Models for Image Reconstruction | Stanislas Ducotterd et.al. | 2412.13070 | link |
2024-12-17 | Super-Resolving Normalising Flows for Lattice Field Theories | Marc Bauer et.al. | 2412.12842 | null |
2024-12-16 | EGP3D: Edge-guided Geometric Preserving 3D Point Cloud Super-resolution for RGB-D camera | Zheng Fang et.al. | 2412.11680 | null |
2024-12-16 | CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution | Bingwen Hu et.al. | 2412.11609 | null |
2024-12-18 | Sequence Matters: Harnessing Video Models in 3D Super-Resolution | Hyun-kyu Ko et.al. | 2412.11525 | null |
2024-12-16 | Block-Based Multi-Scale Image Rescaling | Jian Li et.al. | 2412.11468 | null |
2024-12-16 | Quantization of Climate Change Impacts on Renewable Energy Generation Capacity: A Super-Resolution Recurrent Diffusion Model | Xiaochong Dong et.al. | 2412.11399 | null |
2024-12-18 | A Staged Deep Learning Approach to Spatial Refinement in 3D Temporal Atmospheric Transport | M. Giselle Fernández-Godino et.al. | 2412.10945 | null |
2024-12-13 | SuperMark: Robust and Training-free Image Watermarking via Diffusion-based Super-Resolution | Runyi Hu et.al. | 2412.10049 | null |
2024-12-13 | A Single-Frame and Multi-Frame Cascaded Image Super-Resolution Method | Jing Sun et.al. | 2412.09846 | null |
2024-12-13 | Super-Resolution for Remote Sensing Imagery via the Coupling of a Variational Model and Deep Learning | Jing Sun et.al. | 2412.09841 | null |
2024-12-11 | RealOSR: Latent Unfolding Boosting Diffusion-based Real-world Omnidirectional Image Super-Resolution | Xuhan Sheng et.al. | 2412.09646 | null |
2024-12-12 | OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs | Yuanzhi Zhu et.al. | 2412.09465 | link |
2024-12-12 | A Plug-and-Play Algorithm for 3D Video Super-Resolution of Single-Photon LiDAR data | Alice Ruget et.al. | 2412.09427 | null |
2024-12-12 | Distribution free uncertainty quantification in neuroscience-inspired deep operators | Shailesh Garg et.al. | 2412.09369 | null |
2024-12-12 | Arbitrary-steps Image Super-resolution via Diffusion Inversion | Zongsheng Yue et.al. | 2412.09013 | link |
2024-12-11 | Fair Primal Dual Splitting Method for Image Inverse Problems | Yunfei Qu et.al. | 2412.08613 | null |
2024-12-12 | Efficient estimation of error bounds for quantum multiparametric imaging with constraints | Alexander Mikhalychev et.al. | 2412.08199 | null |
2024-12-11 | Statistical Downscaling via High-Dimensional Distribution Matching with Generative Models | Zhong Yi Wan et.al. | 2412.08079 | null |
2024-12-10 | MPSI: Mamba enhancement model for pixel-wise sequential interaction Image Super-Resolution | Yuchun He et.al. | 2412.07222 | null |
2024-12-10 | A Progressive Image Restoration Network for High-order Degradation Imaging in Remote Sensing | Yujie Feng et.al. | 2412.07195 | null |
2024-12-10 | Hero-SR: One-Step Diffusion for Super-Resolution with Human Perception Priors | Jiangang Wang et.al. | 2412.07152 | null |
2024-12-10 | RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Resolution | Jiangang Wang et.al. | 2412.07149 | link |
2024-12-09 | Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning | Mehdi Noroozi et.al. | 2412.06978 | null |
2024-12-09 | Neural Garment Dynamic Super-Resolution | Meng Zhang et.al. | 2412.06285 | link |
2024-12-09 | MSCrackMamba: Leveraging Vision Mamba for Crack Detection in Fused Multispectral Imagery | Qinfeng Zhu et.al. | 2412.06211 | null |
2024-12-09 | You KAN Do It in a Single Shot: Plug-and-Play Methods with Single-Instance Priors | Yanqi Cheng et.al. | 2412.06204 | null |
2024-12-07 | Jointly RS Image Deblurring and Super-Resolution with Adjustable-Kernel and Multi-Domain Attention | Yan Zhang et.al. | 2412.05696 | link |
2024-12-07 | Test-time Cost-and-Quality Controllable Arbitrary-Scale Super-Resolution with Variable Fourier Components | Kazutoshi Akita et.al. | 2412.05517 | null |
2024-12-07 | Enhancing Sample Generation of Diffusion Models using Noise Level Correction | Abulikemu Abuduweili et.al. | 2412.05488 | null |
2024-12-06 | MSECG: Incorporating Mamba for Robust and Efficient ECG Super-Resolution | Jie Lin et.al. | 2412.04861 | null |
2024-12-05 | 2.5D Super-Resolution Approaches for X-ray Computed Tomography-based Inspection of Additively Manufactured Parts | Haley Duba-Sullivan et.al. | 2412.04525 | null |
2024-12-05 | LocalSR: Image Super-Resolution in Local Region | Bo Ji et.al. | 2412.04314 | null |
2024-12-05 | Hipandas: Hyperspectral Image Joint Denoising and Super-Resolution by Image Fusion with the Panchromatic Image | Shuang Xu et.al. | 2412.04201 | null |
2024-12-05 | Deep priors for satellite image restoration with accurate uncertainties | Biquard Maud et.al. | 2412.04130 | null |
2024-12-05 | LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents | Bingchen Li et.al. | 2412.04090 | null |
2024-12-04 | HIIF: Hierarchical Encoding based Implicit Image Function for Continuous Super-resolution | Yuxuan Jiang et.al. | 2412.03748 | null |
2024-12-09 | MTVNet: Mapping using Transformers for Volumes -- Network for Super-Resolution with Long-Range Interactions | August Leander Høeg et.al. | 2412.03379 | link |
2024-12-04 | TASR: Timestep-Aware Diffusion Model for Image Super-Resolution | Qinwei Lin et.al. | 2412.03355 | link |
2024-12-04 | RFSR: Improving ISR Diffusion Models via Reward Feedback Learning | Xiaopeng Sun et.al. | 2412.03268 | link |
2024-12-04 | Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach | Lingchen Sun et.al. | 2412.03017 | link |
2024-12-04 | Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution | Jiahua Xiao et.al. | 2412.02960 | null |
2024-12-03 | Efficient Algorithms for Low Tubal Rank Tensor Approximation with Applications to Image Compression, Super-Resolution and Deep Learning | Salman Ahmadi-Asl et.al. | 2412.02598 | null |
2024-12-03 | Randomized algorithms for Kroncecker tensor decomposition and applications | Salman Ahmadi-Asl et.al. | 2412.02597 | null |
2024-12-03 | CubeFormer: A Simple yet Effective Baseline for Lightweight Image Super-Resolution | Jikai Wang et.al. | 2412.02234 | null |
2024-12-02 | SUICA: Learning Super-high Dimensional Sparse Implicit Neural Representations for Spatial Transcriptomics | Qingtian Zhu et.al. | 2412.01124 | null |
2024-12-03 | VISION-XL: High Definition Video Inverse Problem Solver using Latent Image Diffusion Models | Taesung Kwon et.al. | 2412.00156 | null |
2024-11-28 | Auto-Encoded Supervision for Perceptual Image Super-Resolution | MinKyu Lee et.al. | 2412.00124 | link |
2024-11-28 | Stochastic Frequency Fluctuation Super-Resolution Imaging | Yifan Chen et.al. | 2411.19369 | null |
2024-11-27 | FaithDiff: Unleashing Diffusion Priors for Faithful Image Super-resolution | Junyang Chen et.al. | 2411.18824 | null |
2024-11-27 | HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior | Li-Yuan Tsao et.al. | 2411.18662 | link |
2024-11-27 | Uncertainty-driven Sampling for Efficient Pairwise Comparison Subjective Assessment | Shima Mohammadi et.al. | 2411.18372 | link |
2024-11-27 | TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution | Linwei Dong et.al. | 2411.18263 | null |
2024-12-01 | HAAT: Hybrid Attention Aggregation Transformer for Image Super-Resolution | Song-Jiang Lai et.al. | 2411.18003 | null |
2024-11-27 | Vision Mamba Distillation for Low-resolution Fine-grained Image Classification | Yao Chen et.al. | 2411.17980 | link |
2024-11-26 | Perceptually Optimized Super Resolution | Volodymyr Karpenko et.al. | 2411.17513 | null |
2024-11-26 | MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution | Chengxing Xie et.al. | 2411.17214 | null |
2024-12-03 | PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution | Libo Zhu et.al. | 2411.17106 | link |
2024-11-26 | ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction | Chang Li et.al. | 2411.17088 | null |
2024-11-25 | ZoomLDM: Latent Diffusion Model for multi-scale image generation | Srikar Yellapragada et.al. | 2411.16969 | null |
2024-11-25 | From Diffusion to Resolution: Leveraging 2D Diffusion Models for 3D Super-Resolution Task | Bohao Chen et.al. | 2411.16792 | null |
2024-11-25 | EPS: Efficient Patch Sampling for Video Overfitting in Deep Super-Resolution Model Training | Yiying Wei et.al. | 2411.16312 | null |
2024-11-25 | High-Resolution Be Aware! Improving the Self-Supervised Real-World Super-Resolution | Yuehan Zhang et.al. | 2411.16175 | null |
2024-11-23 | FFT-Enhanced Low-Complexity Near-Field Super-Resolution Sensing | Yuxiao Wu et.al. | 2411.15532 | null |
2024-11-21 | UPdec-Webb: A Dataset for Coaddition of JWST NIRCam Images | Lei Wang et.al. | 2411.13891 | null |
2024-11-20 | HF-Diff: High-Frequency Perceptual Loss and Distribution Matching for One-Step Diffusion-Based Image Super-Resolution | Shoaib Meraj Sami et.al. | 2411.13548 | null |
2024-11-20 | Adversarial Diffusion Compression for Real-World Image Super-Resolution | Bin Chen et.al. | 2411.13383 | null |
2024-11-20 | RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content | Yuxuan Jiang et.al. | 2411.13362 | null |
2024-11-19 | Efficient Medicinal Image Transmission and Resolution Enhancement via GAN | Rishabh Kumar Sharma et.al. | 2411.12833 | null |
2024-11-19 | ISAC Super-Resolution Receivers: The Effect of Different Dictionary Matrices | Iman Valiulahi et.al. | 2411.12672 | null |
2024-11-19 | Contourlet Refinement Gate Framework for Thermal Spectrum Distribution Regularized Infrared Image Super-Resolution | Yang Zou et.al. | 2411.12530 | link |
2024-11-18 | Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution | Brian B. Moser et.al. | 2411.12072 | link |
2024-11-16 | Peizhe Xia et.al. | 2411.11906 | null | |
2024-11-17 | Low-Complexity Algorithms for Multichannel Spectral Super-Resolution | Xunmeng Wu et.al. | 2411.10938 | null |
2024-11-21 | Unveiling Hidden Details: A RAW Data-Enhanced Paradigm for Real-World Super-Resolution | Long Peng et.al. | 2411.10798 | null |
2024-11-15 | Experimental demonstration of Tessellation Structured Illumination Microscopy | Doron Shterman et.al. | 2411.10405 | null |
2024-11-15 | A Low-Resolution Image is Worth 1x1 Words: Enabling Fine Image Super-Resolution with Transformers and TaylorShift | Sanath Budakegowdanadoddi Nagaraju et.al. | 2411.10231 | null |
2024-11-15 | DiffFNO: Diffusion Fourier Neural Operator | Xiaoyi Liu et.al. | 2411.09911 | null |
2024-11-15 | Enhancing Diffusion Posterior Sampling for Inverse Problems by Integrating Crafted Measurements | Shijie Zhou et.al. | 2411.09850 | null |
2024-11-14 | OneNet: A Channel-Wise 1D Convolutional U-Net | Sanghyun Byun et.al. | 2411.09838 | link |
2024-11-14 | GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising | Yunuo Wang et.al. | 2411.09512 | null |
2024-11-14 | ISAC Super-Resolution Receiver via Lifted Atomic Norm Minimization | Iman Valiulahi et.al. | 2411.09495 | null |
2024-11-14 | Evaluation of RIS-Enabled B5G/6G Indoor Positioning and Mapping using Ray Tracing Models | Dimitris Kompostiotis et.al. | 2411.09440 | null |
2024-11-14 | LLV-FSR: Exploiting Large Language-Vision Prior for Face Super-resolution | Chenyang Wang et.al. | 2411.09293 | null |
2024-11-14 | Performance Boundaries and Tradeoffs in Super-Resolution Imaging Technologies for Space Targets | XiaoLe He et.al. | 2411.09155 | null |
2024-11-12 | On Adapting Randomized Nyström Preconditioners to Accelerate Variational Image Reconstruction | Tao Hong et.al. | 2411.08178 | null |
2024-11-12 | ALANINE: A Novel Decentralized Personalized Federated Learning For Heterogeneous LEO Satellite Constellation | Liang Zhao et.al. | 2411.07752 | null |
2024-11-12 | LapGSR: Laplacian Reconstructive Network for Guided Thermal Super-Resolution | Aditya Kasliwal et.al. | 2411.07750 | null |
2024-11-12 | Numerical Homogenization by Continuous Super-Resolution | Zhi-Song Liu et.al. | 2411.07576 | null |
2024-11-11 | Evaluating Detection Thresholds: The Impact of False Positives and Negatives on Super-Resolution Ultrasound Localization Microscopy | Sepideh K. Gharamaleki et.al. | 2411.07426 | null |
2024-11-11 | Ensemble Learning for Microbubble Localization in Super-Resolution Ultrasound | Sepideh K. Gharamaleki et.al. | 2411.07376 | null |
2024-11-11 | AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models | Wallace Abreu et.al. | 2411.07364 | null |
2024-11-13 | General Geospatial Inference with a Population Dynamics Foundation Model | Mohit Agarwal et.al. | 2411.07207 | null |
2024-11-11 | 360-Degree Video Super Resolution and Quality Enhancement Challenge: Methods and Results | Ahmed Telili et.al. | 2411.06738 | null |
2024-11-11 | Expansion microscopy reveals neural circuit organization in genetic animal models | Shakila Behzadi et.al. | 2411.06676 | null |
2024-11-10 | Local Implicit Wavelet Transformer for Arbitrary-Scale Super-Resolution | Minghong Duan et.al. | 2411.06442 | link |
2024-11-10 | SuperResolution Radar Gesture Recognitio | Netanel Blumenfeld et.al. | 2411.06410 | null |
2024-11-09 | Quasi-Newton OMP Approach for Super-Resolution Channel Estimation and Extrapolation | Yi Zeng et.al. | 2411.06082 | null |
2024-11-09 | Predicting band structures for 2D Photonic Crystals via Deep Learning | Yueqi Wang et.al. | 2411.06063 | null |
2024-11-08 | A Modular Conditional Diffusion Framework for Image Reconstruction | Magauiya Zhussip et.al. | 2411.05993 | null |
2024-11-08 | WeatherGFM: Learning A Weather Generalist Foundation Model via In-context Learning | Xiangyu Zhao et.al. | 2411.05420 | null |
2024-11-08 | Electro-diffusive modeling and the role of spine geometry on action potential propagation in neurons | Rahul Gulati et.al. | 2411.05329 | null |
2024-11-07 | Reducing data resolution for better super-resolution: Reconstructing turbulent flows from noisy observation | Kyongmin Yeo et.al. | 2411.05240 | null |
2024-11-07 | ESC-MISR: Enhancing Spatial Correlations for Multi-Image Super-Resolution in Remote Sensing | Zhihui Zhang et.al. | 2411.04706 | null |
2024-11-06 | "Super-resolution" holographic optical tweezers array | Keisuke Nishimura et.al. | 2411.03564 | null |
2024-11-05 | SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture | Andrew Heschl et.al. | 2411.03505 | link |
2024-11-05 | Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution | Huan Zheng et.al. | 2411.03239 | null |
2024-11-05 | Applications of Automatic Differentiation in Image Registration | Warin Watson et.al. | 2411.02806 | **[link](https://github |