[카테고리:] Paper Review

[논문리뷰] One-Step Effective Diffusion Network for Real-World Image Super-Resolution (2024 NuerIPS)

2025년 04월 01일

Introduction & Related Work 2024년 NeurIPS에 발표된 “One-Step Effective Diffusion Network for Real-World Image Super-Resolution” (OSEDiff)를 리뷰해보려 한다. 이 논문은 실세계 이미지 초해상도(Real-ISR) task를 one step diffusion모델로 해결하려는 논문이다. 기본적으로 최근 super resolution task는 diffusion 모델이 접목되기 시작하면서 perceptual한 퀄리티가 눈에띄게 향상했다. 관련 논문으로는 SUPIR(Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration…
[논문리뷰] PromptFix: You Prompt and We Fix the Photo (2024 NeurIPS)

2024년 10월 16일

Introduction 2024 뉴립스에 발표된 논문이고 instrucrtion을 기반으로 low-level vision task를 수행하는 논문이다. 이전에 포스트한 instructIR과 비슷한 느낌인데 instructIR에서는 단순 unet으로 image를 바로 뽑아냈다면 이 논문에서는 diffusion을 활용한것이 첫번째 차이점이고, instrcutIR에서는 instruction이 거의 classification용으로만 거의 쓰였는데 이 논문에서는 task중에 object removal같은 editing task가 있어서 이를 위해 instruction에서 단순 task정보 이외에도 문맥의 정보를 디테일하게 얻어야한다는 점이 두번째…
[논문 리뷰] InstructIR: High-Quality Image Restoration Following Human Instructions (2024 ECCV)

2024년 07월 02일

SONY 회사에서 나온 2024 ECCV 논문이다. instructpix2pix 논문에서 착안해서 all-in-one image restoration에 적용한 논문이다. instructpix2pix 논문도 리뷰를 포스팅 했었으니 필요하면 참고 (https://doinghun.com/instructpix2pix-learning-to-follow-image-editing-instructions-2023-cvpr/). 기본적으로 all-in-one setting으로 여러 image restoration task들을 하나의 모델만 가지고 학습시키고 human instruction을 통해 각각의 task를 구분하여 실행시키는 방법을 제안한다. Image Restoration Following Instructions instruction-based image restoration을 위해 먼저 instruction을 생성한다. GPT4를 통해…
[논문 리뷰] InstructPix2Pix: Learning to Follow Image Editing Instructions (2023 CVPR)

2024년 06월 27일

2023년 CVPR에서 소개된 image editing 관련 논문이다. 전체적인 flow는 아래 그림과 같다. 먼저 GPT3, Stable Diffusion, Prompt2Prompt 등을 이용해 instruction과 image pair를 생성하여 학습용 데이터셋을 생성한다. 이후 Stable Diffusion 모델을 fine-tuning 하여 inference에서는 image와 instruction을 주면 그에 맞게 이미지가 바뀌는 식이다. 1. Introduction 기본적으로 human-written instruction 과 그에 해당하는 image pair 데이터는 구하기 쉽지가 않다.…
[논문 리뷰] Controlling Vision-Language Models For Multi-Task Image Restoration (2024 ICLR)

2024년 04월 29일

Introduction 이전의 CVPR 2022 AirNet (All-in-one Image Restoration) 이후에 universal한 image restoration 네트워크를 만들고자 하는 노력이 이어지고 있다. 이번 논문은 CLIP을 이용하여 Clean 이미지, degradation type 등의 정보를 더 잘 추출할 수 있도록 학습시키는 방법을 제안한다. CLIP을 단순하게 그대로 사용하는 것이 아니라 Image Restoration에 맞게 약간 변형하여 사용하여 Degradation Aware CLIP (DA-CLIP) 이라고 부르고 아래와…

[카테고리:] Paper Review

[논문리뷰] One-Step Effective Diffusion Network for Real-World Image Super-Resolution (2024 NuerIPS)

[논문리뷰] PromptFix: You Prompt and We Fix the Photo (2024 NeurIPS)

[논문 리뷰] InstructIR: High-Quality Image Restoration Following Human Instructions (2024 ECCV)

[논문 리뷰] InstructPix2Pix: Learning to Follow Image Editing Instructions (2023 CVPR)

[논문 리뷰] Controlling Vision-Language Models For Multi-Task Image Restoration (2024 ICLR)