A Collection of Papers and Codes in ICCV2023 related to Low-Level Vision
[In Construction] If you find some missing papers or typos, feel free to pull issues or requests.
- Awesome-ICCV2021-Low-Level-Vision
- Awesome-CVPR2023/2022-Low-Level-Vision
- Awesome-NeurIPS2022/2021-Low-Level-Vision
- Awesome-ECCV2022-Low-Level-Vision
- Awesome-AAAI2022-Low-Level-Vision
- Awesome-CVPR2021/2020-Low-Level-Vision
- Awesome-ECCV2020-Low-Level-Vision
DiffIR: Efficient Diffusion Model for Image Restoration
Under-Display Camera Image Restoration with Scattering Effect
- Paper: https://arxiv.org/abs/2308.04163
- Code: https://github.com/NamecantbeNULL/SRUDC
- Tags: Under-Display Camera
Multi-weather Image Restoration via Domain Translation
- Paper:
- Code: https://github.com/pwp1208/Domain_Translation_Multi-weather_Restoration
- Tags: Multi-weather
Towards Authentic Face Restoration with Iterative Diffusion Models and Beyond
- Paper: https://arxiv.org/abs/2307.08996
- Tags: Authentic Face Restoration, Diffusion
Improving Lens Flare Removal with General Purpose Pipeline and Multiple Light Sources Recovery
- Paper: https://arxiv.org/abs/2308.16460
- Code: https://github.com/YuyanZhou1/Improving-Lens-Flare-Removal
- Tags: Flare Removal
High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net
- Paper: https://arxiv.org/abs/2308.14221
- Code: https://github.com/CXH-Research/DocShadow-SD7K
- Tags: Document Shadow Removal
Physics-Driven Turbulence Image Restoration with Stochastic Refinement
- Paper: https://arxiv.org/abs/2307.10603
- Code: https://github.com/VITA-Group/PiRN
- Tags: Turbulence Image
DDS2M: Self-Supervised Denoising Diffusion Spatio-Spectral Model for Hyperspectral Image Restoration
Pixel Adaptive Deep Unfolding Transformer for Hyperspectral Image Reconstruction
Snow Removal in Video: A New Dataset and A Novel Method
- Paper:
- Code: https://github.com/haoyuc/VideoDesnowing
- Tags: Desnowing
Video Adverse-Weather-Component Suppression Network via Weather Messenger and Adversarial Backpropagation
- Paper:
- Code: https://github.com/scott-yjyang/ViWS-Net
Fast Full-frame Video Stabilization with Iterative Optimization
- Paper: https://arxiv.org/abs/2307.12774
- Code: https://github.com/zwyking/Fast-Stab
- Tags: Video Stabilization
Minimum Latency Deep Online Video Stabilization
- Paper: https://arxiv.org/abs/2212.02073
- Code: https://github.com/liuzhen03/NNDVS
- Tags: Video Stabilization
On the Effectiveness of Spectral Discriminators for Perceptual Quality Improvement
SRFormer: Permuted Self-Attention for Single Image Super-Resolution
Spatially-Adaptive Feature Modulation for Efficient Image Super-Resolution
DLGSANet: Lightweight Dynamic Local and Global Self-Attention Network for Image Super-Resolution
Boosting Single Image Super-Resolution via Partial Channel Shifting
- Paper:
- Code: https://github.com/OwXiaoM/_PCS
Dual Aggregation Transformer for Image Super-Resolution
Feature Modulation Transformer: Cross-Refinement of Global Representation via High-Frequency Prior for Image Super-Resolution
MetaF2N: Blind Image Super-Resolution by Learning Efficient Model Adaptation from Faces
- Paper:
- Code: https://github.com/yinzhicun/MetaF2N
Lightweight Image Super-Resolution with Superpixel Token Interaction
- Paper:
- Code: https://github.com/ArcticHare105/SPIN
Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution
- Paper: https://arxiv.org/abs/2211.13654
- Code: https://github.com/Jiamian-Wang/Iterative-Soft-Shrinkage-SR
Spherical Space Feature Decomposition for Guided Depth Map Super-Resolution
Real-CE: A Benchmark for Chinese-English Scene Text Image Super-resolution
- Paper: https://arxiv.org/abs/2308.03262
- Code: https://github.com/mjq11302010044/Real-CE
- Tag: Text SR
Towards Real-World Burst Image Super-Resolution: Benchmark and Method
- Paper:
- Code: https://github.com/yjsunnn/FBANet
MoTIF: Learning Motion Trajectories with Local Implicit Neural Functions for Continuous Space-Time Video Super-Resolution
Downscaled Representation Matters: Improving Image Rescaling with Collaborative Downscaled Images
Random Sub-Samples Generation for Self-Supervised Real Image Denoising
Score Priors Guided Deep Variational Inference for Unsupervised Real-World Single Image Denoising
The Devil is in the Upsampling: Architectural Decisions Made Simpler for Denoising with Deep Image Prior
- Paper: https://arxiv.org/abs/2304.11409
- Code: https://github.com/YilinLiu97/FasterDIP-devil-in-upsampling
Lighting Every Darkness in Two Pairs: A Calibration-Free Pipeline for RAW Denoising
Unsupervised Image Denoising in Real-World Scenarios via Self-Collaboration Parallel Generative Adversarial Branches
ExposureDiffusion: Learning to Expose for Low-light Image Enhancement
Towards General Low-Light Raw Noise Synthesis and Modeling
- Paper: https://arxiv.org/abs/2307.16508
- Code: https://github.com/fengzhang427/LRD
- Tags: Noise Modeling
Hybrid Spectral Denoising Transformer with Guided Attention
- Paper: https://arxiv.org/abs/2303.09040
- Code: https://github.com/Zeqiang-Lai/HSDT
- Tags: hyperspectral image denoising
From Sky to the Ground: A Large-scale Benchmark and Simple Baseline Towards Real Rain Removal
- Paper:
- Code: https://github.com/yunguo224/LHP-Rain
Learning Rain Location Prior for Nighttime Deraining
- Paper:
- Code: https://github.com/zkawfanx/RLP
Sparse Sampling Transformer with Uncertainty-Driven Ranking for Unified Removal of Raindrops and Rain Streaks
MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing
Generalizing Event-Based Motion Deblurring in Real-World Scenarios
- Paper: https://arxiv.org/abs/2308.05932
- Tags: Event-Based
MEFLUT: Unsupervised 1D Lookup Tables for Multi-exposure Image Fusion
- Paper:
- Code: https://github.com/Hedlen/MEFLUT
RawHDR: High Dynamic Range Image Reconstruction from a Single Raw Image
- Paper:
- Code: https://github.com/jackzou233/RawHDR
LAN-HDR: Luminance-based Alignment Network for High Dynamic Range Video Reconstruction
Video Object Segmentation-aware Video Frame Interpolation
- Paper:
- Code: https://github.com/junsang7777/VOS-VFI
Iterative Prompt Learning for Unsupervised Backlit Image Enhancement
ExposureDiffusion: Learning to Expose for Low-light Image Enhancement
Implicit Neural Representation for Cooperative Low-light Image Enhancement
Low-Light Image Enhancement with Illumination-Aware Gamma Correction and Complete Image Modelling Network
- Paper:
- Code: https://arxiv.org/abs/2308.08220
Diff-Retinex: Rethinking Low-light Image Enhancement with A Generative Diffusion Model
Deep Image Harmonization with Learnable Augmentation
- Paper: https://arxiv.org/abs/2308.00376
- Code: https://github.com/bcmi/SycoNet-Adaptive-Image-Harmonization
Deep Image Harmonization with Globally Guided Feature Transformation and Relation Distillation
- Paper: https://arxiv.org/abs/2308.00356
- Code: https://github.com/bcmi/Image-Harmonization-Dataset-ccHarmony
TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition
Diverse Inpainting and Editing with GAN Inversion
Parallax-Tolerant Unsupervised Deep Image Stitching
RFD-ECNet: Extreme Underwater Image Compression with Reference to Feature Dictionary
- Paper:
- Code: https://github.com/lilala0/RFD-ECNet
Delegate Transformer for Image Color Aesthetics Assessment
Test Time Adaptation for Blind Image Quality Assessment
Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives
AesPA-Net: Aesthetic Pattern-Aware Style Transfer Networks
Two Birds, One Stone: A Unified Framework for Joint Learning of Image and Video Style Transfers
- Paper:
- Code: https://github.com/NevSNev/UniST
All-to-key Attention for Arbitrary Style Transfer
- Paper:
- Code: https://github.com/LearningHx/StyA2K
StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models
Adaptive Nonlinear Latent Transformation for Conditional Face Editing
Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing
- Paper: https://arxiv.org/abs/2304.02051
- Code: https://github.com/aimagelab/multimodal-garment-designer
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation
- Paper: https://arxiv.org/abs/2307.08448
- Code: https://github.com/AndysonYs/Selective-Diffusion-Distillation
HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending
- Paper:
- Code: https://github.com/wty-ustc/HairCLIPv2
StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces
Diverse Inpainting and Editing with GAN Inversion
MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
Better Aligning Text-to-Image Models with Human Preference
Unleashing Text-to-Image Diffusion Models for Visual Perception
Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models
- Paper: https://arxiv.org/abs/2306.05357
- Code: https://github.com/nanlliu/Unsupervised-Compositional-Concepts-Discovery
BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
Ablating Concepts in Text-to-Image Diffusion Models
Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis
HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation
Story Visualization by Online Text Augmentation with Context Memory
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment
Dense Text-to-Image Generation with Attention Modulation
Reinforced Disentanglement for Face Swapping without Skip Connection
BlendFace: Re-designing Identity Encoders for Face-Swapping
General Image-to-Image Translation with One-Shot Image Guidance
- Paper: https://arxiv.org/abs/2307.14352
- Code: https://github.com/CrystalNeuro/visual-concept-translator
GaFET: Learning Geometry-aware Facial Expression Translation from In-The-Wild Images
Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance
Conditional 360-degree Image Synthesis for Immersive Indoor Scene Decoration
Masked Diffusion Transformer is a Strong Image Synthesizer
Q-Diffusion: Quantizing Diffusion Models
The Euclidean Space is Evil: Hyperbolic Attribute Editing for Few-shot Image Generation
- Paper:
- Code: https://github.com/lingxiao-li/HAE
LFS-GAN: Lifelong Few-Shot Image Generation
FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model
Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations
Smoothness Similarity Regularization for Few-Shot GAN Adaptation
Bidirectionally Deformable Motion Modulation For Video-based Human Pose Transfer
MODA: Mapping-Once Audio-driven Portrait Animation with Dual Attentions
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
- Paper: https://arxiv.org/abs/2303.13439
- Code: https://github.com/Picsart-AI-Research/Text2Video-Zero
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
RIGID: Recurrent GAN Inversion and Editing of Real Face Videos
StableVideo: Text-driven Consistency-aware Diffusion Video Editing
StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation
Others [back]
DDColor: Towards Photo-Realistic and Semantic-Aware Image Colorization via Dual Decoders
- Paper: https://arxiv.org/abs/2212.11613
- Code: https://github.com/piddnad/DDColor
- Tags: Colorization
DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion
- Paper: https://arxiv.org/abs/2303.06840
- Code: https://github.com/Zhaozixiang1228/MMIF-DDFM
- Tags: Image Fusion
Name Your Colour For the Task: Artificially Discover Colour Naming via Colour Quantisation Transformer
Unfolding Framework with Prior of Convolution-Transformer Mixture and Uncertainty Estimation for Video Snapshot Compressive Imaging
- Paper: https://arxiv.org/abs/2306.11316
- Code: https://github.com/zsm1211/CTM-SCI
- Tags: Snapshot Compressive Imaging
Deep Optics for Video Snapshot Compressive Imaging
- Paper:
- Code: https://github.com/pwangcs/DeepOpticsSCI
- Tags: Snapshot Compressive Imaging
SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning
- Paper: https://arxiv.org/abs/2308.09040
- Code: https://github.com/fh2019ustc/SimFIR
- Tags: Fisheye Image Rectification
Single Image Reflection Separation via Component Synergy
- Paper: https://arxiv.org/abs/2308.10027
- Code: https://github.com/mingcv/DSRNet
- Tag: Image Reflection Separation
Learned Image Reasoning Prior Penetrates Deep Unfolding Network for Panchromatic and Multi-Spectral Image Fusion
- Paper: https://arxiv.org/abs/2308.16083
- Tags: pan-sharpening
Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation