NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image

Jeong, Yoonwoo; Lee, Jinwoo; Kim, Chiheon; Cho, Minsu; Lee, Doyup

arXiv:2312.07315 (cs)

[Submitted on 12 Dec 2023 (v1), last revised 10 Aug 2024 (this version, v2)]

Abstract: Transfer learning of large-scale Text-to-Image (T2I) models has recently shown impressive potential for Novel View Synthesis (NVS) of diverse objects from a single image. While previous methods typically train large models on multi-view datasets for NVS, fine-tuning all the parameters of a T2I model not only incurs a high cost but also reduces the model's capacity to generalize to diverse images in a new domain. In this study, we propose NVS-Adapter, a plug-and-play module for a T2I model that synthesizes novel multi-views of visual objects while fully exploiting the generalization capacity of T2I models. NVS-Adapter consists of two main components: view-consistency cross-attention, which learns visual correspondences to align the local details of view features, and global semantic conditioning, which aligns the semantic structure of generated views with the reference view. Experimental results demonstrate that NVS-Adapter can effectively synthesize geometrically consistent multi-views and achieve high performance on benchmarks without full fine-tuning of T2I models. The code and data are publicly available at this https URL.
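
As a rough illustration of the cross-attention idea named in the abstract, the sketch below lets target-view tokens attend to reference-view tokens and adds the result back through a residual path. It is not the authors' implementation: it omits learned query/key/value projections, camera conditioning, and the global semantic conditioning branch. The names `cross_attention`, `adapter_block`, and the scalar `gate` are hypothetical; the zero-initialized gate is an assumption borrowed from common adapter designs, chosen so the frozen base model's features pass through unchanged when the gate is 0.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product attention: each query attends over all keys.

    queries: list of d-dim vectors (here: target-view tokens)
    keys, values: lists of vectors (here: reference-view tokens)
    """
    d = len(keys[0])
    out = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        out.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return out


def adapter_block(target_tokens, reference_tokens, gate=0.0):
    """Hypothetical adapter: residual add of gated cross-attention output.

    With gate=0.0 (assumed initial value) the block is the identity, so the
    frozen base model's features are left untouched.
    """
    attended = cross_attention(target_tokens, reference_tokens, reference_tokens)
    return [
        [t + gate * a for t, a in zip(tok, att)]
        for tok, att in zip(target_tokens, attended)
    ]


if __name__ == "__main__":
    target = [[1.0, 0.0], [0.0, 1.0]]      # toy target-view tokens
    reference = [[0.5, 0.5], [1.0, -1.0]]  # toy reference-view tokens
    print(adapter_block(target, reference, gate=0.0))  # identical to target
    print(adapter_block(target, reference, gate=1.0))
```

Keeping the adapter on a gated residual path is one way to make a module "plug-and-play": the base T2I weights stay frozen and only the adapter's parameters would be trained.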

Comments:	[ECCV2024] Project Page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.07315 [cs.CV]
	(or arXiv:2312.07315v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.07315

Submission history

From: Yoonwoo Jeong
[v1] Tue, 12 Dec 2023 14:29:57 UTC (27,683 KB)
[v2] Sat, 10 Aug 2024 07:07:35 UTC (8,439 KB)
