
Received 27 May 2023, accepted 17 July 2023, date of publication 21 July 2023, date of current version 27 July 2023.

Digital Object Identifier 10.1109/ACCESS.2023.3297873

Intelligent Metaverse Scene Content Construction


JUNXIANG WANG 1, SIRU CHEN 1, YUXUAN LIU 1, AND RICHEN LAU 1,2
1 School of Computer and Electronic Information/School of Artificial Intelligence, Nanjing Normal University, Nanjing 210023, China
2 State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, Beijing 100191, China

Corresponding authors: Siru Chen (222202026@njnu.edu.cn) and Richen Lau (329789995@qq.com)


This work was supported in part by the National Natural Science Foundation of China under Grant 61702271; and in part by the Open
Project Program of the State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, under Grant
VRLAB2023B05.

ABSTRACT The integration of artificial intelligence (AI) and virtual reality (VR) has revolutionized
research across various scientific fields, with AI-driven VR simulations finding applications in education,
healthcare, and entertainment. However, existing literature lacks a comprehensive investigation that
systematically summarizes the fundamental characteristics and development trajectory of AI-generated
visual content in the metaverse. This survey focuses on intelligent metaverse scene content construction,
aiming to address this gap by exploring the application of AI in content generation. It investigates
scene content generation, simulation biology, personalized content, and intelligent agents. Analyzing the
current state and identifying common features, this survey provides a detailed description of methods for
constructing intelligent metaverse scenes. The primary contribution is a comprehensive analysis of the
current landscape of intelligent visual content production in the metaverse, highlighting emerging trends.
The discussion of methods for constructing intelligent scene content in the metaverse suggests that, in the era of intelligence, this approach has the potential to become the dominant mode of content creation in metaverse scenes.

INDEX TERMS Content generation, metaverse, immersive visualization, deep learning.

I. INTRODUCTION
The rapid evolution of metaverse technologies, coupled with exponential growth in computing power, has brought about a paradigm shift in the production of visual content. The traditional manual creation of static scene content has been revolutionized by the emergence of intelligent metaverse technology, which enables the generation of scene content with unprecedented capabilities. This intelligent approach leverages existing data conditions as specialized tools for programmatically constructing and enhancing metaverse scenarios. The application of intelligent metaverse scene content generation extends to various scientific domains including education, biology, medicine, and art. Specifically, it involves the dynamic construction of multi-variable scenes through programming.

In contrast to conventional artificial intelligence-generated content (AIGC), which predominantly focuses on generating text, images, audio, video, and models, our research focuses on the utilization of AI methods to generate content within metaverse scenes. Distinct from AIGC's video content generation [1], our emphasis is on the creation of specific scenes that empower users to engage in free interaction within the generated metaverse environment. This essential feature sets our content generation approach apart from simple videos, as it fosters a highly immersive and interactive user experience integrated with the interaction concept of the metaverse [2]. Based on the aforementioned considerations, we have compiled and analyzed recent research on the construction of intelligent metaverse scene content. We have classified, summarized, and discussed these works in order to inspire future endeavors in this field.

The rapid advancement of AI technologies has led to the development of various methodological models that integrate AI intelligence into automated metaverse scene generation. Three primary strategies have emerged: the development of intelligent framework systems for coordinating metaverse content, simulation of intelligent agents that replicate biological logic, and utilization of machine learning or deep learning methods to generate or enhance scene content.

FIGURE 1. We choose a few exemplary papers from each category. Different publications are shown on the horizontal axis, while various kinds of
intelligent visual content production strategies are represented on the vertical axis, including scene content generation (V3D: virtual 3D construction,
CDG: classification-based directed generation, DG: data-driven generation, content construction and improvement (AC: adaptive change, EI: effect
improvement)), simulated biology (ILTS: individual life trait simulation, GBS: group behavior simulation, SFL: stress feedback learning), personalized
contents (ECP: emotional content personalization, NCP: non-emotional content personalization), intelligent agents (IATD: intelligent agents trained based
on data, IACK: intelligent agents constructed from external knowledge systems, intelligent agents trained for interaction data adaptation (SRI: simulating
realistic images, IAAC: interaction actions to adapt agent changes, IPTC: interaction processes to adapt task collaboration)). Many of the papers share the
basic characteristics of multiple intelligent visual content generation techniques at the same time. All the papers are categorized (in different colors) and
disaggregated according to the main techniques they use.

Understanding the comprehensive integration of intelligent metaverse scene content generation requires a detailed study of these strategies and an analysis of the employed methods to reveal valuable insights into the future trends of metaverse scene content generation in the era of AI.

A. RELATED SURVEYS
Previous surveys in this field have predominantly focused on AI-based computer vision imaging, with particular emphasis on evolutionary computation and biological vision [3], or on generating interactive and immersive content [4]. Some surveys have delved into the impact of content generation in AI on the art industry [5] and explored deep learning-based image style transfer [6]. Additionally, the Unreal 3D authoring engine has shown interest in leveraging agents and decision-tree-based procedural content generation techniques for immersive game development [7].

In comparison to studies that delve into the application of AI technology in machine learning for character programming and calculation in games [8], or that compare search-based methods and machine learning methods with traditional approaches in surveying game program content [9], our survey takes a more comprehensive approach by focusing on the application of AI technology to the generation of intelligent metaverse scene content. While there have been surveys that explore the inclusion of human emotions in virtual agents and the advancements of AI in education [10], these surveys either have limited coverage of content generation or primarily focus on detailed descriptions of a few representative studies [11], [12], [13]. In contrast, our survey offers a comprehensive analysis of recent advancements from a cutting-edge perspective, incorporating relevant papers from the IEEE Xplore and ACM Digital Library based on their relevance and research direction.

B. TAXONOMY OF THE SURVEY
The exploration of AI for generating content in metaverse scenes has led to the emergence of several common areas of interest, including scene content generation, simulated biology, personalized content, and intelligent agents, either individually or in combination (Figure 1). It is important to highlight that many methods employ a combination of these intelligent metaverse scene content generation techniques rather than relying solely on a single approach. While this survey does not encompass all papers related to metaverse scene content generation, its primary focus is on the utilization of deep learning or other machine learning methods and intelligent frameworks to achieve the generation of metaverse scene content.

II. SCENE CONTENT GENERATION
Integrating AI into the design framework of VR presents a logical solution for overcoming the challenges associated with automated metaverse scene content generation, thereby enhancing the realism of virtual environments. Deep-learning modules have emerged as effective tools for achieving this objective. Notably, Unreal Engine explored AI-based approaches by utilizing agents and decision trees to generate content in immersive game programs [7].

This section introduces several common intelligent methods used in the generation and improvement of metaverse scene content. These methods can be broadly divided into four main categories: virtual 3D construction, directed generation based on classification, data-driven generation, and content construction and improvement. It is worth mentioning that certain approaches may exhibit characteristics that align with multiple categories simultaneously. For this study, we classified them based on their primary features.

A. VIRTUAL 3D CONSTRUCTION
Two-dimensional photos are crucial for reconstructing three-dimensional scenes because of their accessibility and the availability of comprehensive data. Wang et al. proposed a method that utilizes the ant colony algorithm to rapidly segment video scenes and combines the segmentation process with the similarity of the same modal data and the correlation of different modal data to construct multiple panoramic-image arrangements, thus creating a VR scene space [14]. Similarly, VIRTOOAIR significantly improves the system's posture recovery capability by incorporating VR tracking data and low-semantic input from RGB photos into an end-to-end reconstruction process using deep learning algorithms, as shown in Figure 2(a) [15]. In contrast to purely two-dimensional image data, Zhang et al. applied effect processing to two-dimensional images, as shown in Figure 2(c). The 2D input images were transformed using a visual art language with a traditional Chinese ink and wash style, followed by a 3D virtual technique to generate a 3D virtual scene in ink and wash style [16]. Freville et al., on the other hand, indirectly employed YOLO for object detection to identify realistic objects and generate corresponding 3D world assets at their respective locations, as depicted in Figure 2(b) [17].

FIGURE 2. Virtual 3D construction. (a) The processing pipeline for VIRTOOAIR. The data-collecting module for VR tracking is the initial system component. To learn the upper-body joint configuration, a deep neural network is given the VR tracking data (i.e., HMD and hand motions). Inverse kinematic regression of the deep neural network is performed by the second module. Processing of RGB images is handled by the third module, which also has a pre-processing procedure (i.e., extraction of bounding boxes). To rebuild the lower-body joint rotation, the bounding box and camera pictures are both continually input into the end-to-end recovery framework (i.e., the fourth module). The final module's visualization tool is in charge of properly portraying the virtual body in VR [15]. (b) Tests conducted on a practice floor plan: the original floor layout can be seen in the top image, while the 3D environment can be seen in the bottom image [17]. (c) Engaging AI [16].
These approaches demonstrate the potential of integrating deep learning techniques and innovative algorithms for constructing metaverse scenes. However, challenges remain in terms of handling complex scenes, ensuring the accuracy of the reconstruction process, and achieving a balance between realism and efficiency. Further research is required to address these challenges and advance the field of virtual 3D construction in metaverse environments.
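The detection-driven flavor of this pipeline, as used by Freville et al. [17], can be made concrete in a few lines. The sketch below is only an illustration: it assumes the third-party ultralytics YOLO package, and the class-to-asset table and the engine-side spawn_asset call are hypothetical stand-ins, not details from the paper.

```python
# Sketch of a detection-to-asset pipeline in the spirit of [17]: detect objects
# in a 2D photo and emit placeholder 3D assets at their image locations.
# Assumes the `ultralytics` package; CLASS_TO_ASSET and spawn_asset are
# hypothetical stand-ins for an engine-side API.
from ultralytics import YOLO

CLASS_TO_ASSET = {"chair": "SM_Chair", "tv": "SM_Screen", "person": "SM_Avatar"}

def build_scene(image_path: str):
    model = YOLO("yolov8n.pt")           # pretrained COCO detector
    result = model(image_path)[0]        # one image -> one Results object
    placements = []
    for box in result.boxes:
        name = result.names[int(box.cls)]         # detected class label
        if name not in CLASS_TO_ASSET:
            continue
        x1, y1, x2, y2 = box.xyxy[0].tolist()     # pixel-space bounding box
        anchor = ((x1 + x2) / 2.0, y2)            # bottom-centre as floor contact
        placements.append((CLASS_TO_ASSET[name], anchor))
    return placements   # hand these to the engine to instantiate assets

# for asset, (u, v) in build_scene("room.jpg"): spawn_asset(asset, u, v)
```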
B. CLASSIFICATION-BASED DIRECTED GENERATION
Generating different categories of objects often requires specialized treatment to achieve the desired results. Naoki Matsuo et al. employed deep learning networks to classify objects into ''spatial boundaries'' and ''common obstacles.'' By overlaying virtual objects onto real objects based on their classifications, they created a more recognizable VR space that remained enjoyable even in cluttered environments filled with obstacles [18], as depicted in Figure 3(a).

Similarly, the generation of food textures can be expedited through the pre-categorization of food items. The FoodChangeLens approach [19] uses a cyclic generative adversarial network (GAN) trained on a large-scale collection of food images from Twitter streams. This enabled the transformation of food categories while preserving the shape of the original food item, as illustrated in Figure 3(b). This method leverages conditional CycleGAN (cCycleGAN) [20] to perform image transformations, along with food segmentation results obtained from U-Net [21], a convolutional network designed for biomedical image segmentation. The transformed images were rendered as textures.

These classification-based directed generation methods offer a more targeted approach to scene generation and demonstrate the potential of deep learning techniques to improve the realism and efficiency of metaverse content creation. However, challenges remain, such as the need for comprehensive and diverse training datasets and the potential for bias in classification systems. Further research is needed to address these issues and to develop more sophisticated and reliable classification-based generation methods.
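The shape-preserving translation at the heart of the FoodChangeLens pipeline [19] reduces to masked compositing. The sketch below is a minimal illustration under the assumption that segment_food and generator are stand-ins for the trained U-Net and cCycleGAN models; it is not the authors' implementation.

```python
# Minimal sketch of a FoodChangeLens-style step [19]: segment the food region,
# translate only that region with a generator, and paste it back so the
# original silhouette is preserved. `segment_food` and `generator` are
# hypothetical callables (HxWx3 float image -> HxW mask / HxWx3 image).
import numpy as np

def transform_food(image: np.ndarray, segment_food, generator) -> np.ndarray:
    mask = segment_food(image)         # HxW in [0, 1], 1 = food pixels
    translated = generator(image)      # e.g. curry -> fried rice, full frame
    mask3 = mask[..., None]            # broadcast over RGB channels
    # Translated texture inside the food region, original pixels elsewhere,
    # so the shape of the dish is left untouched.
    return mask3 * translated + (1.0 - mask3) * image
```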
C. DATA-DRIVEN GENERATION
Data-driven generation relies on diverse collections of realistic data to achieve accurate and immersive simulations of various scenarios. For example, in the field of football, a trained PFNN model [22] can be used to make predictions and create a fully immersive 3D environment, allowing analysts, coaches, and players to view an entire game. By capturing and learning human flow patterns through agent trajectories powered by a neural-network-based animation system, player motions on the field and their dynamics from one event to the next can be accurately represented [23], as depicted in Figure 3(c). In the domain of music instruction, hand motions can be automatically generated for 3D animations using MIDI data as the input. This is accomplished by leveraging a pre-trained hidden Markov model (HMM) for fingering detection [24], as shown in Figure 3(d).

Data-driven generation techniques present exciting opportunities for creating realistic simulations and instructional metaverse animations. The integration of diverse data sources, such as football trajectories and MIDI inputs, enables immersive experiences and skill development. However, addressing the challenges of data collection and dataset quality, and fostering creativity in content generation, are important areas for future research. By addressing these limitations, researchers can unlock the full potential of data-driven generation in metaverse environments and further enhance immersive experiences and interactive capabilities.
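As a rough illustration of the HMM fingering step described for [24], the sketch below runs standard Viterbi decoding over finger states given an observed note sequence. The probability tables are hypothetical placeholders, not values from the paper.

```python
# Toy version of HMM-based fingering detection: given a MIDI note sequence,
# recover the most likely finger assignment with Viterbi decoding.
import numpy as np

FINGERS = 5  # thumb..pinky for one hand

def viterbi_fingering(notes, trans, emit, prior):
    """notes: observed pitch indices; trans: [f, f'] transition probs;
    emit: [f, pitch] emission probs; prior: initial finger distribution."""
    T = len(notes)
    logp = np.full((T, FINGERS), -np.inf)
    back = np.zeros((T, FINGERS), dtype=int)
    logp[0] = np.log(prior) + np.log(emit[:, notes[0]])
    for t in range(1, T):
        for f in range(FINGERS):
            scores = logp[t - 1] + np.log(trans[:, f])
            back[t, f] = int(np.argmax(scores))
            logp[t, f] = scores[back[t, f]] + np.log(emit[f, notes[t]])
    path = [int(np.argmax(logp[-1]))]
    for t in range(T - 1, 0, -1):
        path.append(back[t, path[-1]])
    return path[::-1]   # one finger index per note, driving the 3D hand animation
```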
D. CONTENT CONSTRUCTION AND IMPROVEMENT
The construction and adaptation of content are essential for creating engaging scenarios that provide users with unique experiences. To achieve this, content must be generated in a manner that fits realistic scenarios and provides natural variability in the development of different scenarios. Researchers have explored two approaches to achieving this: adaptive changes to content and effect improvements.

1) ADAPTIVE CHANGE
For adaptive change, Lugrin et al. used an AI module to generate event chains from a triggering perspective [25], as shown in Figure 4(a). They simulated the spontaneous movements of objects, causing collisions between them, and selected modifications closest to a predefined cost threshold based on heuristic values. This approach triggers conceptual events related to the user experience and results in content that adapts to the user's actions and choices.

Another example of adaptive change is DeepDive [26], which incorporates AI tools that simulate spontaneous changes in the environment to recognize the patterns of systems in the resource environment of ancient civilizations, as shown in Figure 4(b). This regional growth approach uses pattern recognition in hydrology, AI collection modules on vegetation to predict the potential vegetation of cells, and A* and cultural algorithms to generate reindeer herds, providing opportunities to generate and test anthropological and archaeological theories.

These adaptive change approaches offer users a unique and dynamic experience that adapts to their choices and actions, thereby creating more engaging scenarios.
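The cost-threshold selection described for [25] can be sketched in a few lines. The candidate modifications and cost values below are illustrative assumptions; only the "pick the modification closest to a predefined cost threshold" logic reflects the text.

```python
# Sketch of heuristic event-chain selection in the spirit of [25]: candidate
# object modifications are scored, and the one whose heuristic cost is closest
# to a predefined threshold is applied, spawning conceptual events.
COST_THRESHOLD = 5.0

def pick_modification(candidates):
    """candidates: list of (description, heuristic_cost) pairs."""
    return min(candidates, key=lambda c: abs(c[1] - COST_THRESHOLD))

candidates = [("topple vase", 2.5), ("slide chair into table", 4.8),
              ("swing door open", 6.1), ("drop bookshelf", 9.0)]
event, cost = pick_modification(candidates)
print(f"trigger '{event}' (cost {cost}) and propagate collision events")
```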
2) EFFECT IMPROVEMENT
To improve the effectiveness of generated content, researchers have focused on enhancing the visual and delivery aspects of content. Iglesias et al. proposed feature extraction using AI algorithms [27], as shown in Figure 4(c). They employed visualization methods and new UE4 classes to register and superimpose the collected characteristics on a 3D model, enabling the identification of structural flaws using an AI-based approach. This technique improves the photorealism and immersive qualities of content.

QM4VR [28], on the other hand, focuses on delivery improvements rather than 3D model upgrades. By monitoring the operation and performance of the sub-streams and modifying the load balancing/scheduling method to enhance quality-of-service (QoS) parameters, the VR Content Quality Multipath Delivery (QM4VR) approach manages MPTCP sub-streams to distribute priority packets based on content awareness, as shown in Figure 4(d). This enhances the delivery of VR content and ensures a smoother and more reliable user experience.

Furthermore, the rover positioning system [29] offers a novel approach for improving the overall effect of the VR experience, as depicted in Figure 5(a). By utilizing a bird's-eye view of the VR world and employing the MCL algorithm, monocular visual-inertial odometry, and Siamese neural networks [30], the system predicts the rover's position by adding GAN-enhanced images to the VR image dataset, and the rover's performance in the VR environment is enhanced.

To enhance the details of scene contents, Feng Gao et al. applied convolutional neural networks to extract and reconstruct a virtual field point cloud [31], as shown in Figure 5(b). They established a coordinate-matching relationship between a computer-generated virtual scene and actual reality and improved the scene's details using techniques such as the Gouraud shading algorithm and texture mapping. This approach reduces distortion and enhances user satisfaction by providing a more visually appealing and realistic virtual environment.

To address the challenge of sustainable scene style transformation in real time, Lei Yang et al. proposed methods to increase the frame rate of real-time rendering applications and applied them to the Barracuda style transformation in VR [32], as shown in Figure 5(c). By optimizing the rendering process, they enabled smooth and consistent scene style transformations, improving the overall visual quality and coherence of the VR experience.

These approaches focus on enhancing the visual quality, delivery efficiency, and overall immersion of metaverse content, leading to more engaging and satisfying user experiences.

III. SIMULATED BIOLOGY
Simulating real-life situations through the generation of virtual content has emerged as a valuable approach in the biological sciences. It offers researchers the opportunity to study and analyze complex biological systems, providing insights that may be challenging to obtain through direct observation alone. In recent years, there has been a growing focus on utilizing AI-generated content to simulate the social attributes and habits of various organisms, including fish, flocks, and birds, within biological ecological environments [33], [34].


FIGURE 3. Classification-based directed generation. (a) Real-time method processing pipeline for creating VR environments based on reality [18].
(b) Overview of the FoodChangeLens holographic system [19]. (c) Overview of the system: Motion-captured animations of individual football players are
utilized to create virtual agents (football players), and data taken from recorded football matches is used to create replay events in the VR
simulation [23]. (d) The input MIDI file comprises time sequences for the various keys, as seen in the top left image. Bottom left image: Our technology
creates a 3D performance animation for both hands based on the MIDI file. The figure on the right shows how the performance animation is displayed in
the HMD and is in tune with a real piano, giving the user real-time visual cues and feedback [24].

This section discusses the application of AI content generation in simulated biology, covering three crucial aspects: individual life trait simulation, group behavior simulation, and stress feedback learning. By exploring these areas, researchers aim to gain a deeper understanding of the behavior and dynamics of biological systems in virtual environments.

A. INDIVIDUAL LIFE TRAIT SIMULATION
The simulation of individual organisms is crucial for simulating living organisms and for enabling the exploration of ecological dynamics. Simulating the characteristics of different organisms requires the application of fundamental principles. For instance, the simulation of an artificial fish captures the essence of simulating individual organisms. To achieve a more realistic simulation of individual fish lives, researchers have proposed various methods.

Xian-Yu et al. developed a multi-sensory system model that incorporates vision, touch, taste, and smell to mimic the routines and behavior of fish in the wild, thereby enhancing the simulation of the physiological characteristics and individual responses of artificial fish [35], as shown in Figure 6(b). To simulate diverse variations in the process of fish egg division, Xu and Zhou suggested a fish egg model based on the artificial life approach, which incorporates life dynamics, artificial life methodology, AI, and genetic algorithms [36], as shown in Figure 6(a).

However, purely artificial life approaches have inherent time limitations. To address this, Liu proposed the integration of an evolutionary model into the artificial fish reproduction process, enabling the gradual enrichment of the vital signs of fish over time through an overall evolutionary performance [37], as shown in Figure 6(c). This approach has the potential to generate more realistic simulations of individual organisms by modeling gradual changes in their life traits over time.

The limitations of these approaches are not explicitly mentioned, but the integration of an evolutionary model to overcome time limitations suggests that the time-consuming nature of simulating individual organisms is a challenge. Additionally, although these approaches may improve the simulation of individual organisms, there may still be limitations in accurately simulating complex ecological dynamics. Further research should focus on addressing these limitations to enhance the accuracy of ecological simulations and promote a better understanding of ecological systems.


FIGURE 4. Content construction and improvement. (a) AI-based behavior has been included in the game engine's event system, as seen in the top figure; the bottom figure shows a view of the system architecture. The system is built on a gaming engine that has been converted to a CAVE-like immersive display [25]. (b) Screenshot of the in-depth exploration interface for DeepDive. An A* ambush herd's course for a specific herd size is shown on a map in the upper left corner. Based on the existing rule set, the red circles on the map represent potential site locations. The wheels below show the algorithm's herd priority as determined by the user (N = nutrition; R = risk; E = movement force; T = time to target) [26]. (c) The dashboard (a-e) of the EPI system is immersive, while the sceneries (f-m) depict real-world surroundings [27]. (d) The fundamental design of QM4VR [28].

B. GROUP BEHAVIOR SIMULATION
Simulating group behavior is a critical aspect of creating a comprehensive representation of ecological dynamics, and the integration of AI technologies has revolutionized the vividness of these simulations in VR environments. Researchers have proposed various innovative approaches to tackle the challenges associated with simulating group behavior.

One notable approach is the use of individual-based cellular automata (CA) models [38] to refine the understanding of schooling behavior and guide the construction of natural collective behavior by specifying individual-level rules. This method not only provides a realistic representation of group dynamics but also allows for the exploration of emergent properties that arise from interactions between individuals.

For instance, Mozaffari et al. introduced a machine learning tracking technique that employs a Bézier curve method to smooth the movement of fruit flies, resulting in more natural and lifelike behavior. Additionally, logistic regression classification was utilized to classify the mating state of the fruit flies based on video frames captured before training [39], as shown in Figure 7(a). This innovative approach enhances the realism of group behavior and enables a more accurate representation of ecological systems.

Although holistic simulations are valuable, they often lack ecological fluidity. To address this limitation, researchers have proposed integrating external stimulus demands and adaptive mechanisms into simulations of artificial life. Lints developed a cluster scheduling algorithm that considers the characteristics of both individuals and groups, allowing for integrated judgment of artificial life and better adaptation to environmental changes [40], as shown in Figure 7(b).
Moreover, the incorporation of artificial evolution in group simulation, as exemplified by the adjustment of boid parameters during evolution [41], facilitates innovation in the field of group intelligence, enabling the exploration of novel group behaviors and the emergence of collective intelligence in simulated ecological systems.


To address movement challenges and enhance the realism of VR environments, researchers have proposed simulating crowd behavior while considering obstacle avoidance. The walking areas of crowds can be determined using an A* search algorithm, resulting in more natural movement patterns [42], as shown in Figure 7(c). Moreover, the coordination of group movements was achieved through the application of a multi-intelligent distributed system model, as proposed by Reynolds et al. [41]. This innovative approach ensures that group behavior remains cohesive and avoids chaotic movements, thereby enhancing the overall realism of the simulation.
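A minimal boids-style update of the kind associated with Reynolds [41] is sketched below; the separation, alignment, and cohesion weights are exactly the sort of parameters that evolutionary tuning or manual design would adjust. This is a generic sketch, not code from any of the surveyed systems.

```python
# Compact boids step: separation, alignment, and cohesion per agent.
import numpy as np

def boids_step(pos, vel, radius=2.0, w_sep=0.05, w_ali=0.05, w_coh=0.01,
               dt=0.1, max_speed=1.0):
    """pos, vel: (N, 2) arrays; returns updated copies."""
    new_vel = vel.copy()
    for i in range(len(pos)):
        offsets = pos - pos[i]
        dist = np.linalg.norm(offsets, axis=1)
        near = (dist > 0) & (dist < radius)        # visible neighbours
        if not near.any():
            continue
        sep = -offsets[near].sum(axis=0)           # steer away from crowding
        ali = vel[near].mean(axis=0) - vel[i]      # match neighbour heading
        coh = pos[near].mean(axis=0) - pos[i]      # move toward local centre
        new_vel[i] += w_sep * sep + w_ali * ali + w_coh * coh
        speed = np.linalg.norm(new_vel[i])
        if speed > max_speed:                      # clamp for stable motion
            new_vel[i] *= max_speed / speed
    return pos + dt * new_vel, new_vel
```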
Furthermore, the influence of emotions on group behavior
was explored to add a layer of realism. Noronha et al.
introduced the impact of panic emotions, which modifies
the behavioral parameters of agents and influences their
decision-making processes, particularly regarding the search
for an ideal path within a crowd [43], as shown in Figure 7(d).
This innovative integration of emotional dynamics enhances
the believability of group behavior and contributes to a more
immersive simulation experience.
FIGURE 5. Effect improvement. (a) The picture produced by a GAN and a VR image [29]. (b) Diagram of the Gouraud shading algorithm's processing of light intensity [31]. (c) Overview of the algorithm: pixels I_{t+α}[p_{t+α}] in frame B are recreated from consecutive frames I using scene-aided and image-based reprojection. Whereas image-based reprojection estimates streams by repeatedly exploring nearby frames for depth and 3D flow fields, scene-aided projection generates scene streams during conventional triangle rasterization [32].
The application of AI technologies in group behavior simulation within ecological contexts has resulted in notable advancements, including enhanced realism, improved adaptability, and the incorporation of complex dynamics. These innovations have expanded our understanding of collective behavior in ecological systems and contributed to the emergence of new insights and the development of intelligent simulations. However, it is important to acknowledge that there may still be limitations and challenges in accurately representing the intricacies of real-world ecosystems. Scalability, computational demands, and the ability to capture the full complexity of ecological dynamics are potential areas for further research and improvement. Addressing these limitations can help refine the application of AI in group behavior simulations, ultimately advancing our understanding of ecological phenomena and enhancing the effectiveness of intelligent simulations.

C. STRESS FEEDBACK LEARNING


Behavior in natural ecosystems is not static or confined but responds dynamically to various stressors. To accurately simulate ecological systems, it is crucial to incorporate stress feedback learning, which enables organisms to adapt and evolve in response to unknown situations, cooperative interactions, competition, and other biological relationships. This section explores innovative approaches to stress feedback learning in the context of simulated biology.

FIGURE 6. Individual life trait simulation. (a) The process by which fish eggs divide their cells [36]. (b) The connection between perception, choice, and behavior in synthetic fish [35]. (c) An artificial fish is depicted in a schematic [37].
In the realm of fish populations, the incorporation of a simplified and evolving model of fishing gear can stimulate sportive evolution through predation dynamics [44], as depicted in Figure 8(a). This approach introduces selective pressures that drive the adaptation and survival of fish populations, mimicking the real-world dynamics of predator-prey relationships and environmental changes. By incorporating such stress factors, the simulation becomes more realistic and allows the observation of evolutionary responses within a virtual ecosystem.


FIGURE 7. Group behavior simulation. (a) An image-processing-based one-frame system. Fruit fly identification is shown by the green frame, and mating is indicated by the red frame [39]. (b) A straightforward flock-based defense mechanism to keep the 'enemy' at bay. Our controlled creature (facing up) confronts two hostile entities in the image's upper right corner (facing downwards). The large circle on the left is a virtual space that serves as our creature's ''brain''; in other words, it may be seen as being housed inside the creature's head rather than next to it, as shown for visualization. The room is filled with a collection of little triangles. Two 'lights' (circles) on the room's walls stand in for the adversary. There are no little bodies around the lights (large sketch arrows pointing down). The controlled creature will begin to travel backward when the flock reaches the base of the circle (the four smaller grey arrows indicate the mapping between the position of the flock and the control signal sent to the creature) [40]. (c) Activity theory being used to direct the VR/AI game 'For Ewe' [42]. (d) An illustration of a mock bird [41]. (e) Zuangentes test screenshot [43].

Another approach to stress feedback learning involves the incorporation of predator-prey relationships into self-organizing adaptive schemes. Using reinforcement learning techniques [45], a population of agents can learn and adapt through trial-and-error exploration by adopting different behavioral traits when encountering similar agents and predators to maximize rewards from the environment [46], as shown in Figure 8(b). This innovative approach allows for the emergence of complex behaviors and strategies within the simulated population, providing insights into the dynamics of predator-prey interactions and the evolution of adaptive responses.

Incorporating additional factors such as predators and food sources further enhances the realism of the simulation. By applying reinforcement learning techniques to a school of fish, a simulation model can train ML agents using deep reinforcement learning algorithms [47], as depicted in Figure 8(c). This approach enables the fish to learn and adapt their behaviors in response to changing environmental conditions, predators, and food availability. Through stress feedback learning, the simulated fish population exhibits sophisticated and realistic responses, closely mirroring the dynamics observed in natural ecosystems.
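A minimal tabular Q-learning loop captures the trial-and-error scheme described for [46]: an agent is rewarded for reaching food and penalised for meeting the predator. The one-dimensional environment below is a deliberately simple stand-in for the richer settings used in the surveyed work.

```python
# Tabular Q-learning on a 1-D strip: food at one end, predator at the other.
import numpy as np

N, FOOD, PREDATOR, START = 10, 9, 0, 5
rng = np.random.default_rng(1)
Q = np.zeros((N, 2))                   # states x actions (0: left, 1: right)

for episode in range(500):
    s = START
    while s not in (FOOD, PREDATOR):
        # epsilon-greedy trial-and-error exploration
        a = rng.integers(2) if rng.random() < 0.1 else int(np.argmax(Q[s]))
        s2 = max(0, min(N - 1, s + (1 if a == 1 else -1)))
        r = 1.0 if s2 == FOOD else (-1.0 if s2 == PREDATOR else 0.0)
        Q[s, a] += 0.1 * (r + 0.9 * Q[s2].max() - Q[s, a])  # TD update
        s = s2

# Non-terminal states: the learned policy heads toward food, away from danger.
print(np.argmax(Q[1:-1], axis=1))
```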


These innovative applications of stress feedback learning in simulated biology offer valuable insights into the adaptive capabilities of organisms and their response to changing environments. These simulations capture the dynamics of biological systems by simulating stressors and employing learning algorithms, thereby enabling the study of emergent properties and evolutionary processes. This approach provides researchers with a deeper understanding of the complex interactions and behaviors exhibited by organisms when exposed to different stressors. Nevertheless, it is crucial to acknowledge the limitations of these methods. Challenges may include accurately representing the full complexity of real biological systems, scaling simulations to larger and more intricate scenarios, and ensuring the realism and accuracy of stress feedback mechanisms. Overcoming these limitations will pave the way for further advancements in simulated biology and will contribute to a more comprehensive understanding of the natural world.

IV. PERSONALIZED CONTENTS
The quality of artificially intelligent content generation in a VR environment is determined by both the generation capabilities of the system and the input it receives. When generating intelligent content, it is crucial to ensure that the generated content meets the needs of real users, as this directly affects the VR experience and user satisfaction. Therefore, it is essential to improve the outcome of generated content based on the specific needs of individual users.

To address this challenge, several solutions for personalized content generation have been developed. These solutions can be broadly categorized into two groups: those that capture sentiment data for analytical feedback to make the content more personalized, and those that generate content adjusted by non-emotional data feedback from within VR to cater to the user's preferences.

The first category focuses on capturing sentiment data, such as user emotions and preferences, to provide analytical feedback. By analyzing this feedback, the system can gain insight into the user's emotional state and tailor the generated content accordingly. This approach ensures that the content resonates with the user's emotions, thereby creating a more personalized and engaging VR experience.

The second category involves generating content that is adjusted based on non-emotional data feedback obtained from within the VR environment. This feedback summarizes the user's tendencies and preferences, allowing the system to generate content that aligns with the user's specific needs. By adapting the content to the user's preferences, this approach enhances user satisfaction and the overall enjoyment of the VR experience.

By incorporating personalized content-generation solutions, VR environments can provide users with tailored experiences that cater to their individual needs and preferences. This not only enhances user satisfaction but also maximizes the potential of artificially intelligent content generation in delivering impactful and engaging VR experiences.
mental health issues.
IV. PERSONALIZED CONTENTS The ISAM model [49] is another approach that enhances
The quality of artificially intelligent content generation in emotional content personalization. This model utilizes ten
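The MAB decision loop described for VRelax [48] can be sketched as follows. Epsilon-greedy stands in for whatever bandit policy the system actually uses, and the environment names and reward signal are illustrative assumptions.

```python
# Sketch of a multi-armed-bandit loop for environment selection: each arm is
# a candidate relaxation environment; the observed relaxation score is reward.
import random

class EnvironmentBandit:
    def __init__(self, environments, epsilon=0.1):
        self.envs = environments
        self.eps = epsilon
        self.counts = {e: 0 for e in environments}
        self.value = {e: 0.0 for e in environments}

    def choose(self):
        if random.random() < self.eps:
            return random.choice(self.envs)          # explore
        return max(self.envs, key=self.value.get)    # exploit best-so-far

    def update(self, env, reward):
        self.counts[env] += 1
        # incremental mean of observed rewards per environment
        self.value[env] += (reward - self.value[env]) / self.counts[env]

bandit = EnvironmentBandit(["beach", "forest", "aurora", "underwater"])
env = bandit.choose()
bandit.update(env, reward=0.7)   # e.g. a normalised drop in measured stress
```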
The ISAM model [49] is another approach that enhances emotional content personalization. This model utilizes ten practice images from the International Affective Picture System (IAPS) and a collection of images from Google Images to improve mood prediction based on the pleasure-arousal-dominance (PAD) model [50], as shown in Figure 9(b). By intelligently selecting pictures based on the PAD reaction of the user, the ISAM model tailors the content to the user's emotional state, enhancing the emotional impact of the VR experience.

Dingli and Bondin adopted a different approach by collecting data through wearable devices and using generative adversarial networks (GANs) to predict a user's emotional journey [51], as shown in Figure 9(c). This allows them to link emotional states to specific events in the VR experience, enabling the modeling of changes and game adaptations based on the user's emotional responses to factors such as color and sound. This personalized approach enhances emotional resonance and immersion in a VR environment.

By contrast, the PerAffectlyVR system proposed by Gupta focuses on training with response information and leveraging situational and physiological cues from the user's interaction to detect emotional states [52], as shown in Figure 9(d). By adapting VR environments and difficulty levels to the user's emotional state, PerAffectlyVR provides a highly personalized and emotionally engaging experience.

By incorporating emotional content personalization techniques, VR systems can offer tailored experiences that align with user emotions and preferences. However, it is important to address the ethical concerns associated with the use of personal emotional data in virtual environments. Despite these considerations, these approaches contribute to enhanced emotional engagement, immersion, and overall satisfaction, resulting in more impactful and meaningful metaverse experiences for users. Advancements in intelligent methods for recognizing human emotions [53] further fuel the potential of combining personalized content technology and emotions, unlocking new possibilities in the field.

FIGURE 8. Stress feedback learning. (a) The simulation model's flowchart. DSr represents the separation between the reference and the specified individuals, and DRr represents the direction in which they are separated; DHr, name of the reference; AR, the angular range of interactions; D1, the avoidance range; D2, the closeness; SD1, the standard deviation of movement in the avoidance and approach directions; SD2, motion in the parallel direction [44]. (b) Agent trajectory learning in the 0-500 step range without predators, N = 10, (R1, R2, R3) = (4, 20, 50) [46]. (c) A dense school of trained models displaying all behaviors: the perceived radius of the predator (dark blue circle), the lines linking the fish agents to one another (light blue), and the distance to the food (solid green circle) [47].

B. NON-EMOTIONAL CONTENT PERSONALIZATION
In addition to emotions, objective data generated from VR content interactions can provide valuable insights into personalized content generation. These more deterministic data allow for effective profiling of the user experience, enabling the generation of personalized content based on specific preferences and needs.

Kiourt et al. designed a personal virtual museum (PVM) that incorporates dynamic systems and pan-institutional modular learning objects to facilitate collaboration, knowledge modeling, and management [54], as shown in Figure 10(a). Their approach utilizes tree search, reinforcement learning, supervised learning, and a wide range of computational, audio, video, and image-based media. By leveraging structured external connections, the PVM enhances the contextual relevance of the virtual world, delivering dynamic, enjoyable, and engaging experiences tailored to the user's needs.

Personalized game experiences can be achieved by adjusting the difficulty level and utilizing personalized profiles generated from each player's Hyperseed [59]. Additionally, customized categorization of feedback and real-time analysis of game-level performance dynamics contribute to personalized gameplay experiences [55], as shown in Figure 10(b).

VCoach [56] takes a similar approach by analyzing objective user response states such as punching speed, reaction time, and punching posture during exercise. These data are compared with pre-collected data from professional boxing coaches, enabling the system to generate adaptive and personalized boxing training routines in real time, as shown in Figure 10(c).


FIGURE 9. Emotional content personalization. (a) Block representation of a custom VR system for relaxing [48]. (b) Logic flow in iSAM [49]. (c) The
elements that make up the game module are described [51]. (d) Design, setup, and tasks for experimental research focusing on the psychophysical
link between cognitive load and VR’s virtual assistants [52].

When using DTBVis [60], experts can better understand the similarities and differences between the DTB and the human brain through iterative exploration at various levels and granularities, along with automatic similarity recommendation and high-dimensional exploration. This allows them to customize the model and improve its functionality.

User interaction data in VR supermarkets can be analyzed using techniques such as neural collaborative filtering (NCF) to provide personalized recommendations [57], as shown in Figure 10(d). By leveraging collaborative filtering neural networks, the system can generate personalized product recommendations based on implicit ratings derived from user data.
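A minimal NCF model of the kind used for the VR supermarket [57] pairs user and item embeddings with an MLP trained on implicit interactions. The layer sizes and training snippet below are illustrative assumptions, written in PyTorch.

```python
# Minimal neural collaborative filtering: embeddings fused by an MLP,
# trained on implicit ("interacted vs. not") feedback.
import torch
import torch.nn as nn

class NCF(nn.Module):
    def __init__(self, n_users, n_items, dim=16):
        super().__init__()
        self.user = nn.Embedding(n_users, dim)
        self.item = nn.Embedding(n_items, dim)
        self.mlp = nn.Sequential(
            nn.Linear(2 * dim, 32), nn.ReLU(),
            nn.Linear(32, 1),
        )

    def forward(self, u, i):
        x = torch.cat([self.user(u), self.item(i)], dim=-1)
        return torch.sigmoid(self.mlp(x)).squeeze(-1)   # interaction probability

model = NCF(n_users=1000, n_items=500)
loss_fn = nn.BCELoss()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

u = torch.tensor([3, 3, 42]); i = torch.tensor([7, 9, 7])
y = torch.tensor([1.0, 0.0, 1.0])      # implicit labels from interaction logs
opt.zero_grad()
loss = loss_fn(model(u, i), y)
loss.backward(); opt.step()
```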
Privacy considerations are essential in personalized content generation, and techniques such as convolutional neural networks (CNNs) can be employed to ensure long-term effectiveness while minimizing the amount of user data required for training [58], as shown in Figure 10(e).

By harnessing objective data from user interactions, VR systems can deliver highly personalized content that caters to individual preferences and needs. However, factors such as the availability and quality of data, privacy concerns, and the challenge of accurately capturing and interpreting user interactions can affect the effectiveness of personalized content delivery. Despite these limitations, leveraging objective data in VR systems can significantly enhance user engagement, immersion, and satisfaction by providing tailored experiences that align with specific requirements and objectives.

V. INTELLIGENT AGENTS
Scientists have developed intelligent agents that focus on content generation to enhance its central and immersive nature. Intelligent agents enable virtual animation to take on the character of a main storyline, whereas AI integration enhances the simulation, and the adaptive nature of the intelligent agent avoids the frustration of mechanical animation. Intelligent agents can be applied in three main areas: those trained on data, those constructed from external knowledge systems, and those trained to adapt to interaction data.


FIGURE 10. Non-emotional content personalization. (a) Ideally suited for dynamic personal virtual worlds based on several intelligences [54]. (b) Use of
AI and VR to structurally visualize a suggested framework for individualized physical therapy rehabilitation [55]. (c) Overview of the virtual customized
boxing training system VCoach, comprising student performance evaluation, boxing exercise production, and interactive boxing exercise. For iterative skill
improvement, trainees can engage in virtual boxing instruction using a consumer-grade wearable VR device, such as the HTC Vive [56]. (d) The VR
supermarket’s suggested layout [57]. (e) The visualization shows the position and rotation of the genuine hand and the computer-generated hand with
vivid skin tones (solid black). The motion being practiced is elbow bending, and the separation between the two hands is determined by varying the
rotational angle of the elbow itself [58].

A. INTELLIGENT AGENTS TRAINED BASED ON DATA
In the realm of content generation, the data generated during the development and evolution of VR content serve as a valuable resource for training intelligent agents. Researchers have leveraged the empirical data obtained from VR content to enhance the intelligence of these agents. By allowing the data to evolve in real time, the performance of intelligent agents can progressively improve, resulting in more realistic and sophisticated content generation.

One notable approach is the application of artificial neural networks (ANNs) for training intelligent agents. Paladin [61] employed an ANN to control the agents' behavior. The predicted positions obtained from the neural networks are evaluated for their reasonableness, allowing intelligent agents to adjust their actions accordingly. This adaptive behavior, combined with pre-programmed behaviors, enhances the overall intelligence of the agents, as illustrated in Figure 11(a).

Another instance involves the use of multilayer perceptron (MLP) neural networks for mastering fundamental tennis-playing abilities [62]. By collecting essential information from a simulated environment, intelligent agents are trained using MLP-based neural networks, enabling them to acquire the necessary skills to play tennis proficiently, as shown in Figure 11(b).

Furthermore, an RBF_LA learning algorithm was proposed for tennis matches [63]. This algorithm incorporates different learning strategies under various conditions, including no learning process, learning after a match, and learning during a match. By adapting their strategies based on real-time feedback, intelligent agents can improve their performance in tennis matches, as depicted in Figure 11(c).

In the context of virtual-scene navigation, agents are trained to learn and navigate diverse paths [64], as shown in Figure 11(d). Through a reward-based training method, real-time navigation systems enable agents to comprehend and adapt to obstacles, path safety, traffic conditions, and other relevant data, resulting in efficient and effective pathfinding.

Augmented learning techniques have also been employed to train intelligent agents. For instance, Linqin Cai et al. successfully trained agents to navigate through a maze using reinforcement learning (RL) and a Pixy camera sensor to detect and interact with different objects in the environment [65]. This approach grants the agents the freedom to choose optimal routes, fostering adaptability and intelligence in their decision-making processes.
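The data-driven training idea behind the tennis agent [62] amounts to supervised learning on states logged from the simulator. The sketch below assumes a six-dimensional state (ball position and velocity) and a two-dimensional racket target; the feature layout and sizes are assumptions for illustration, not the paper's.

```python
# Supervised MLP policy trained on states collected from a simulated court.
import torch
import torch.nn as nn

policy = nn.Sequential(
    nn.Linear(6, 64), nn.ReLU(),   # in: ball x/y/z and velocity x/y/z
    nn.Linear(64, 64), nn.ReLU(),
    nn.Linear(64, 2),              # out: target racket (x, y)
)
opt = torch.optim.Adam(policy.parameters(), lr=1e-3)

def train_step(states, targets):
    """states: (B, 6) tensors logged from the simulator; targets: (B, 2)."""
    pred = policy(states)
    loss = nn.functional.mse_loss(pred, targets)
    opt.zero_grad(); loss.backward(); opt.step()
    return float(loss)

# Each frame of recorded play becomes one supervised example:
# train_step(batch_states, batch_expert_racket_positions)
```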


FIGURE 11. Intelligent agents trained based on data. (a) The agent's program [61]. (b) Building data sets on a virtual tennis court [62]. (c) Structure of the suggested RBF_LA model [63]. (d) Incorporating agents into virtual worlds [64].

Content generation in VR environments can be significantly enhanced by utilizing data-driven training methodologies and intelligent agents. However, the integration of real-time data and adaptive mechanisms may introduce complexities in terms of computational resources and system responsiveness. Despite these limitations, the utilization of data-driven training methodologies and intelligent agents holds great potential for improving content generation and creating more immersive experiences in metaverse environments.

B. INTELLIGENT AGENTS CONSTRUCTED FROM EXTERNAL KNOWLEDGE SYSTEMS
To ensure compatibility between intelligent agents and the logic of reality, external knowledge systems can be leveraged to enhance an agent's understanding of logic. By integrating a real-world logic system, VR simulations can be more closely aligned with real-world principles. This approach can be realized through the construction of intelligent agents using external knowledge systems.
One such approach is the fuzzy state machine (FuSM) [66]. By simulating emotions and controlling the behavior of characters affected by these emotions, the FuSM approach creates high-performance artificial emotion and AI systems, as illustrated in Figure 12(a). This technology allows for the simultaneous simulation of multiple emotions, adding an element of unpredictability that enhances the credibility of virtual characters and robots. FuSM effectively addresses the combinatorial explosion of deterministic finite state machine (DFSM) states while offering significant expressiveness.
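A toy fuzzy state machine in the spirit of [66] is sketched below: each emotion has a continuous activation, several can be active at once, and behavior blends accordingly, avoiding the state explosion of a purely discrete machine. The stimulus weights and decay factor are invented for illustration.

```python
# Toy fuzzy emotion machine: continuous, co-active emotional states.
class FuzzyEmotionMachine:
    def __init__(self, emotions=("fear", "anger", "joy"), decay=0.9):
        self.level = {e: 0.0 for e in emotions}   # activation in [0, 1]
        self.decay = decay

    def stimulate(self, weights):
        """Decay all activations, then add increments from the current event."""
        for e in self.level:
            self.level[e] = min(1.0, self.level[e] * self.decay + weights.get(e, 0.0))

    def behaviour_mix(self):
        """Normalised blend ratios used to mix behaviors/animations."""
        total = sum(self.level.values()) or 1.0
        return {e: v / total for e, v in self.level.items()}

npc = FuzzyEmotionMachine()
npc.stimulate({"fear": 0.6, "anger": 0.2})   # e.g. the player draws a weapon
print(npc.behaviour_mix())                   # drive animation blending from these ratios
```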
In the case of the Intelligent Virtual Human Animation System (IVHAS) [67], which is based on the Semantic Web, virtual scenes are defined with rich semantic information. This enables computers to comprehend the significance of virtual scenes and perform automated analytical processing, as depicted in Figure 12(b). Semantic virtual environments incorporate various aspects of human behavior, perception, and behavioral planning in the real world, providing rich semantic information to objects within virtual scenarios.

Similarly, the intelligent teaching module of the Smart Physics Lab presents its domain knowledge externally to the Intelligent VR Teaching System (IVRTS) [12], as shown in Figure 12(c). Utilizing external domain knowledge, the teaching module enhances its capabilities and provides a more comprehensive and intelligent virtual teaching experience.

By integrating external knowledge systems, intelligent agents in VR environments can align their logic and behavior with real-world principles, resulting in more realistic and contextually relevant interactions. Nevertheless, challenges may arise in effectively integrating and updating external knowledge sources, ensuring the accuracy and relevance of the information utilized by agents, and addressing potential biases or limitations in the knowledge systems themselves. Nonetheless, the integration of external knowledge systems is expected to improve the intelligence and authenticity of intelligent agents in a metaverse environment.


FIGURE 12. Intelligent agents constructed from external knowledge systems. (a) The design of artificial emotion and AI systems [66]. (b) An illustration
of the Semantic Virtual Scene Ontology (SVSO) in schematic form [67]. (c) IVRTS interface and a ballistics lab [12].

C. INTELLIGENT AGENTS TRAINED FOR INTERACTION DATA ADAPTATION
In addition to the data generated during the development of VR scene content, real-world data generated by users through their interactions also play a crucial role. Researchers can collect and leverage relevant data to make intelligent agents more contextually relevant to users' situations, thereby enhancing the overall immersive experience. There are three primary situations in which interaction data can be collected to train intelligent agents: simulating realistic images, adapting agent behavior based on interaction actions, and adapting agent collaboration in task processes.

1) SIMULATING REALISTIC IMAGES
To facilitate a more immersive experience, operators can simulate an image of a real person by leveraging the image features of an actual individual. In the case of a virtual companion dining system designed for the elderly [68], researchers implemented a technique that involves segmenting the RGB image of a real person's area using depth information. By utilizing this depth information, they can restore the point cloud of the person's area and rebuild the mesh of the virtual image. Furthermore, they mapped the segmented color information onto the virtual body based on texture-mapping rules, as depicted in Figure 13(a). This approach satisfies the emotional need for enhanced realism in the virtual companion, resulting in a more authentic and engaging user experience.

In the context of avatar creation, researchers such as Miao-Chi Liu Chang et al. explored an alternative approach to obtaining pose data without relying on skeletal movement tracking. Instead, they trained a posture classifier using Google's experimental AI API, 'Teachable Machine' [69]. By leveraging user-recognized poses, they enable the interactive creation of avatars in a virtual space, as illustrated in Figure 13(b). This methodology provides improved outcomes and a more efficient user experience.

Simulating real images using real-world data is an effective technique that can enhance immersion in a metaverse environment and meet users' emotional need for enhanced realism. However, there are challenges in obtaining and processing large-scale and diverse real-world data to ensure its accuracy and representativeness, and the fidelity of a simulated image may be constrained by the quality and resolution of the input data. Nevertheless, incorporating real data into image simulation methods has a positive effect.
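The depth-based segmentation step described for [68] can be sketched as a threshold mask plus a pinhole back-projection. The depth range and camera intrinsics below are illustrative assumptions, not parameters from the paper.

```python
# Rough sketch: mask the person by depth range, then back-project the masked
# pixels to a camera-space point cloud for meshing and texture mapping.
import numpy as np

def person_mask(depth: np.ndarray, near=0.5, far=1.5) -> np.ndarray:
    """depth: HxW metres; returns a boolean mask of the person's area."""
    return (depth > near) & (depth < far)

def person_point_cloud(depth, mask, fx, fy, cx, cy):
    """Back-project masked pixels using a pinhole camera model."""
    v, u = np.nonzero(mask)
    z = depth[v, u]
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return np.stack([x, y, z], axis=1)   # feed into meshing/texture mapping

# rgb[mask] carries the colour samples that the texture-mapping rules paste
# onto the rebuilt virtual body.
```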
2) INTERACTION ACTIONS TO ADAPT AGENT CHANGES
To cater to different operator categories, it is important to adapt the agent's approach based on the user's interaction actions. Recognizing this, researchers have devised methods to evaluate users during their interactions and accordingly select the most appropriate agent approach.

In the context of home rehabilitation games, Elor and Kurniawan developed an upper-limb exercise component that assists users in learning and guiding their exercise movements [70]. To personalize the rehabilitation strategy and adapt the exercise difficulty and assistance, they employed generative adversarial imitation learning (GAIL) and proximal policy optimization (PPO) with the use of virtual butterflies, as depicted in Figure 13(c). By leveraging these approaches, exercise difficulty and assistance can be dynamically adjusted based on user performance, ensuring a personalized rehabilitation experience.


FIGURE 13. Simulating realistic images. (a) The system architecture of a senior dining system using virtual avatars [68]. (b) Body skeleton joints found in the recognized, generated avatar (right side) and in frames taken from the built-in camera (left and center portions) [69]. (c) IB game competition with skilled players: the user's job is to safeguard the butterfly while competing with the project IB agent to acquire the most crystals; the agent is positioned to the user's right, and the scene and game views display crystal routes as well as the avatars of people and agents [70]. (d) The method of the 'Ant-Man Vision' experience [71].

By leveraging these approaches, exercise difficulty and assistance can be dynamically adjusted based on user performance, ensuring a personalized rehabilitation experience.
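The full GAIL and PPO training loop of [70] is too large to reproduce here, but the outer adaptation logic can be sketched as a simple performance-driven controller. This is a simplified stand-in rather than the authors' method; the target success rate, tolerance band, and step size are illustrative assumptions.

def adapt_difficulty(difficulty, success_rate, target=0.7, band=0.1, step=0.05):
    # Nudge exercise difficulty toward a target success rate: if the user
    # succeeds far more often than the target, the task is too easy and
    # difficulty rises; far less often, it falls; inside the tolerance
    # band the setting is left alone.
    if success_rate > target + band:
        difficulty += step
    elif success_rate < target - band:
        difficulty -= step
    return min(max(difficulty, 0.0), 1.0)   # clamp to [0, 1]

# Example: a struggling user (40% success) gets an easier next session.
print(adapt_difficulty(0.5, success_rate=0.4))   # -> 0.45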
In the case of museum anthropomorphic vision [71], researchers took a different approach to processing user interactions, as shown in Figure 13(d). They trained a behavioral model using Long Short-Term Memory (LSTM) networks and used cameras with 3D skeleton position acquisition to detect user behavior in real space. Based on this information, they modified the state changes of the intelligent agent, Ant-Man, in the museum. This adaptive approach enables the agent to respond to user actions and creates a more immersive and interactive museum experience.

By incorporating interaction actions to adapt agent changes, researchers can tailor the agent's behavior and responses to individual users, creating a more personalized and engaging user-agent interaction.

3) INTERACTION PROCESSES TO ADAPT TO TASK COLLABORATION
The interaction processes not only fine-tune the real-time behavior of the agent but also address the issue of task collaboration, adding unpredictability and interest to the VR experience. Various approaches have been employed to address this challenge and to enhance task collaboration in virtual environments.

For instance, in the case of Paladin, the challenge of distributing work for cooperation was addressed by utilizing ESP neural networks [72], [76], as depicted in Figure 14(a). This approach allows efficient and balanced task allocation among cooperative agents, ensuring smooth collaboration.

In the context of the IVTS [73], a declarative form based on Petri Nets (PNs) [77] was utilized to describe training task planning. Researchers developed an algorithm to create Task Planning Petri Nets (TP-PNets) and established a Hierarchical Coloured Petri Net (HCPN) model to evaluate agent task-planning behavior, as shown in Figure 14(b). These measures provide dependability and flexibility in task planning, allowing agents to adapt their collaboration strategies to the specific requirements of the tasks.
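To illustrate the declarative flavor of Petri-net task plans [73], [77], the toy sketch below encodes places, tokens, and the standard firing rule: a transition may fire only when every one of its input places holds a token. The place and transition names are hypothetical, not taken from the TP-PNet algorithm itself.

# Marking: number of tokens in each place; transition: (inputs, outputs).
marking = {"tool_ready": 1, "trainee_ready": 1, "task_done": 0}
transitions = {"perform_task": (["tool_ready", "trainee_ready"], ["task_done"])}

def enabled(name):
    inputs, _ = transitions[name]
    return all(marking[p] >= 1 for p in inputs)   # every input place marked

def fire(name):
    # Consume one token from each input place, produce one in each output.
    if not enabled(name):
        raise RuntimeError(f"transition {name!r} is not enabled")
    inputs, outputs = transitions[name]
    for p in inputs:
        marking[p] -= 1
    for p in outputs:
        marking[p] += 1

fire("perform_task")
print(marking)   # -> {'tool_ready': 0, 'trainee_ready': 0, 'task_done': 1}

Because the plan is data rather than code, an agent can check which transitions are enabled at run time and reorder its collaboration strategy accordingly.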
With the increase in the complexity of VR content, researchers have focused on creating sophisticated internal models for virtual agents to facilitate intelligent and engaging VR systems.

FIGURE 14. Interaction processes to adapt task collaboration. (a) Network collaboration: the artificial neural network maps the values the agents receive to the tasks the agents perform via the network's outputs; the weight of the link from input neuron i to hidden neuron j is denoted W(1)ij, and the weight of the link from hidden neuron j to output neuron k is denoted W(2)jk [72]. (b) The suggested IVTS architecture [73]. (c) The framework of an intelligent virtual training system [74]. (d) The target location generator architecture using Q-learning [75].

A highly competent avatar model was developed in the field of mine safety instruction [74], as illustrated in Figure 14(c). This model incorporates perceptual, behavioral, mental-state, and cognitive modules, enabling the avatar to execute training orders, exhibit the desired physical abilities, and mediate interactions between real-world and virtual contexts.

In the Flying with Friends motor-disorder rehabilitation system [75], the Deep Q-Network (DQN) approach was employed to tackle complex tasks, as shown in Figure 14(d). This approach has proven to be an effective choice for improving task collaboration and achieving better rehabilitation outcomes.
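The caption of Figure 14(d) describes the target location generator as Q-learning based. The sketch below shows the standard tabular Q-learning update on which such a generator can be built; the grid of candidate target positions, the reward, and the hyperparameters are illustrative assumptions rather than values from [75].

import numpy as np

n_states, n_actions = 9, 9           # e.g. a 3 x 3 grid of candidate targets
Q = np.zeros((n_states, n_actions))
alpha, gamma, eps = 0.1, 0.9, 0.2    # learning rate, discount, exploration
rng = np.random.default_rng()

def choose_action(state):
    # Epsilon-greedy: mostly exploit the Q-table, occasionally explore.
    if rng.random() < eps:
        return int(rng.integers(n_actions))
    return int(np.argmax(Q[state]))

def update(state, action, reward, next_state):
    # Standard Q-learning backup toward the greedy successor value.
    target = reward + gamma * np.max(Q[next_state])
    Q[state, action] += alpha * (target - Q[state, action])

# Example step: placing the target at cell 4 from state 0 earned reward 1.
update(state=0, action=4, reward=1.0, next_state=4)
print(Q[0, 4])                       # -> 0.1

A DQN such as the one used in [75] replaces the table Q with a neural network trained on the same backup rule.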
Accurately modeling and predicting user intentions and preferences, and ensuring seamless coordination and synchronization among multiple agents, are complex problems, especially in dynamic and unpredictable environments. By incorporating interaction processes, researchers nevertheless aim to enhance the collaborative behavior of intelligent agents in virtual environments and to create a dynamic, engaging, and realistic user experience.

Despite these challenges, there is optimism that the combination of interaction processes and task collaboration can lead to significant improvements in the collaborative behavior of intelligent agents, offering users a more immersive and satisfying metaverse environment.

VI. DISCUSSION AND CONCLUSION
This paper introduced innovative aspects of intelligent visual content generation, providing a comprehensive overview of AI methods in scene content generation, simulated biology, personalized content, and intelligent agents. The integration of AI algorithms with real-world data and external knowledge systems is highlighted, enabling the creation of authentic and contextually rich virtual scenarios. This study emphasizes the importance of interdisciplinary collaboration and considers human cognitive and emotional factors. Key limitations and challenges were identified, inspiring future research on creative content generation, ethics, and computational efficiency. Overall, this study enriches the knowledge base and drives the development of immersive visual content in virtual environments.

Intelligent visual content generation has significantly impacted content creation and user experience by enabling the creation of visually stunning scenes, personalized content, and adaptive intelligent agents in virtual environments. In the following subsections, we summarize and discuss three potential research trends in the construction of intelligent metaverse scene content: the improvement of personalized methods and the integration of brain-computer interfaces, the integration of AI with real-world data and external knowledge systems, and the importance of interdisciplinary collaboration.

A. POTENTIAL TREND DISCUSSION: IMPROVEMENT OF PERSONALIZED METHODS AND INTEGRATION OF BRAIN-COMPUTER INTERFACES
The demand for personalized content tailored to individual user preferences and emotions is increasing. With the continuous improvement of AI algorithms for recognizing human emotions, methods such as speech emotion recognition [78], physiological-signal emotion recognition [79], facial-image emotion recognition [80], text mixing analysis [81], and cultural subdivision and overall analysis [82] have significantly enhanced the accuracy of emotion recognition. The advancement of formulated feedback theory [83], [84] has also contributed to more refined adaptive emotion generation in metaverse scenes. Additionally, the rapid development of brain-computer interface technology [85] makes real-time emotional feedback possible, ensuring that adaptive changes in scenes are both timely and accurate. Integrating intelligent analysis of electroencephalogram (EEG) data [86] can further enhance the efficiency and diversity of content generation, resulting in more realistic and personalized metaverse content.

B. POTENTIAL TREND DISCUSSION: INTEGRATION OF AI WITH REAL-WORLD DATA AND EXTERNAL KNOWLEDGE SYSTEMS
In the future, deeper analysis of data relationships [87], more refined data collection and simulation [88], the extraction and growth of anthropomorphic knowledge [89], and the cross-fusion analysis of knowledge data obtained from diverse real-world sources [90] will provide new possibilities for the intelligent generation of three-dimensional simulated metaverse scenes. The integration of AI algorithms with real-world data and external information systems will continue to improve, enabling the creation of more realistic and contextually rich virtual scenes. This integration will lead to the development of smarter and more creative agents, thereby enhancing the overall immersive experience for users.

C. POTENTIAL TREND DISCUSSION: INTERDISCIPLINARY COLLABORATION
Collaboration between different disciplines, such as psychology, neuroscience, and computer science, will play a crucial role in enhancing our understanding of the cognitive and emotional impact of intelligent metaverse scene content. The demand for intelligent visual content generation in scientific fields such as education [91], medicine [92], chemistry [93], physics [94], and geology [95] is expected to increase. Intelligent content generation can help beginners in various disciplines gain a fast, effective, and interesting understanding, while providing experts with a more intuitive, rich, and immersive experience. In humanities fields such as education and art, content sensitivity outweighs accuracy; for example, cross-modal metaverse scene generation, including direct text-driven generation [96], will find popularity in artistic fields like ancient poetry and artistic styles. In scientific and engineering fields, accuracy takes precedence over receptivity; for instance, the intelligent construction of biologically variable metaverse models based on medical knowledge services [97] may become a direction of interest.

In conclusion, the generation of intelligent metaverse scene content has transformed content creation and the user experience. Continued advancements in AI algorithms and interdisciplinary collaborations hold promise for further improvement. This survey aims to stimulate more ideas about scene content construction through the use of intelligent metaverse immersive technology.

REFERENCES
[1] X. Li, H. Duan, B. Liu, X. Wang, and F.-Y. Wang, "A novel framework to generate synthetic video for foreground detection in highway surveillance scenarios," IEEE Trans. Intell. Transp. Syst., vol. 24, no. 6, pp. 5958–5970, Jun. 2023.
[2] Y. Zhao, J. Jiang, Y. Chen, R. Liu, Y. Yang, X. Xue, and S. Chen, "Metaverse: Perspectives from graphics, interactions and visualization," Vis. Informat., vol. 6, no. 1, pp. 56–67, Mar. 2022.
[3] X. Li and Y. Shi, "Computer vision imaging based on artificial intelligence," in Proc. Int. Conf. Virtual Reality Intell. Syst. (ICVRIS), Aug. 2018, pp. 22–25.
[4] R. Liu, M. Gao, L. Wang, X. Wang, Y. Xiang, A. Zhang, J. Xia, Y. Chen, and S. Chen, "Interactive extended reality techniques in information visualization," IEEE Trans. Hum.-Mach. Syst., vol. 52, no. 6, pp. 1338–1351, Dec. 2022.

[5] L. Li, "The impact of artificial intelligence painting on contemporary art from disco diffusion's painting creation experiment," in Proc. Int. Conf. Frontiers Artif. Intell. Mach. Learn. (FAIML), Jun. 2022, pp. 52–56.
[6] X. Shi and W. Hu, "A survey of image style transfer research," in Proc. 2nd Int. Conf. Comput. Sci., Electron. Inf. Eng. Intell. Control Technol. (CEI), Sep. 2022, pp. 133–137.
[7] J. P. A. Campos and R. Rieder, "Procedural content generation using artificial intelligence for unique virtual reality game experiences," in Proc. 21st Symp. Virtual Augmented Reality (SVR), Oct. 2019, pp. 147–151.
[8] C. Tang, Z. Wang, X. Sima, and L. Zhang, "Research on artificial intelligence algorithm and its application in games," in Proc. 2nd Int. Conf. Artif. Intell. Adv. Manuf. (AIAM), Oct. 2020, pp. 386–389.
[9] Y. Zhang, G. Zhang, and X. Huang, "A survey of procedural content generation for games," in Proc. Int. Conf. Culture-Oriented Sci. Technol. (CoST), Aug. 2022, pp. 186–190.
[10] F. Lo, F. Su, S. Chen, J. Qiu, and J. Du, "Artificial intelligence aided innovation education based on multiple intelligence," in Proc. IEEE Int. Conf. Artif. Intell., Robot., Commun. (ICAIRC), Jun. 2021, pp. 12–15.
[11] K. Kroes, I. Saccardi, and J. Masthoff, "Empathizing with virtual agents: The effect of personification and general empathic tendencies," in Proc. IEEE Int. Conf. Artif. Intell. Virtual Reality (AIVR), Dec. 2022, pp. 73–81.
[12] F. Dudyrev, O. Maksimenkova, and D. Mikhailenko, "Intelligent virtual reality tutoring systems as a new generation of simulators: Requirements and opportunities," in Proc. IEEE Global Eng. Educ. Conf. (EDUCON), Apr. 2021, pp. 706–718.
[13] R. Kaviyaraj and M. Uma, "A survey on future of augmented reality with AI in education," in Proc. Int. Conf. Artif. Intell. Smart Syst. (ICAIS), Mar. 2021, pp. 47–52.
[14] R. Wang, L. Jiang, J. Yuan, X. Xu, and W. Wang, "Virtual reality scene construction based on multimodal video scene segmentation algorithm," in Proc. IEEE 8th Joint Int. Inf. Technol. Artif. Intell. Conf. (ITAIC), May 2019, pp. 1817–1820.
[15] A. Becher, C. Axenie, and T. Grauschopf, "VIRTOOAIR: Virtual reality TOOlbox for avatar intelligent reconstruction," in Proc. IEEE Int. Symp. Mixed Augmented Reality Adjunct (ISMAR-Adjunct), Oct. 2018, pp. 275–279.
[16] Y. Zhang, "Application of interactive artificial intelligence in data evaluation of ink animation production," in Proc. Int. Conf. Appl. Artif. Intell. Comput. (ICAAIC), May 2022, pp. 34–37.
[17] T. Fréville, C. Hamesse, B. Pairet, R. Lahouli, and R. Haelterman, "From floor plans to virtual reality," in Proc. IEEE Int. Conf. Artif. Intell. Virtual Reality (AIVR), Nov. 2021, pp. 129–133.
[18] N. Matsuo and M. Imura, "Turning a messy room into a fully immersive VR playground," in Proc. IEEE Conf. Virtual Reality 3D User Interfaces Abstr. Workshops (VRW), Mar. 2021, pp. 759–760.
[19] S. Naritomi, R. Tanno, T. Ege, and K. Yanai, "FoodChangeLens: CNN-based food transformation on HoloLens," in Proc. IEEE Int. Conf. Artif. Intell. Virtual Reality (AIVR), Dec. 2018, pp. 197–199.
[20] J.-Y. Zhu, T. Park, P. Isola, and A. A. Efros, "Unpaired image-to-image translation using cycle-consistent adversarial networks," in Proc. IEEE Int. Conf. Comput. Vis. (ICCV), Oct. 2017, pp. 2242–2251.
[21] O. Ronneberger, P. Fischer, and T. Brox, "U-Net: Convolutional networks for biomedical image segmentation," in Medical Image Computing and Computer-Assisted Intervention—MICCAI, N. Navab, J. Hornegger, W. M. Wells, and A. F. Frangi, Eds. Cham, Switzerland: Springer, 2015, pp. 234–241.
[22] D. Holden, T. Komura, and J. Saito, "Phase-functioned neural networks for character control," ACM Trans. Graph., vol. 36, no. 4, pp. 1–13, Aug. 2017.
[23] W. Xu, I. Alarab, C. Lloyd-Buckingham, S. Bowden, B. Noer, F. Charles, S. Prakoonwit, A. Callaway, S. Ellis, and C. Jones, "Re-enacting football matches in VR using virtual agents' realistic behaviours," in Proc. IEEE Int. Conf. Artif. Intell. Virtual Reality (AIVR), Dec. 2022, pp. 119–123.
[24] R. Guo, J. Cui, W. Zhao, S. Li, and A. Hao, "Hand-by-hand mentor: An AR based training system for piano performance," in Proc. IEEE Conf. Virtual Reality 3D User Interfaces Abstr. Workshops (VRW), Mar. 2021, pp. 436–437.
[25] J. Lugrin, M. Cavazza, M. Palmer, and S. Crooks, "Artificial intelligence-mediated interaction in virtual reality art," IEEE Intell. Syst., vol. 21, no. 5, pp. 54–62, Sep. 2006.
[26] T. Palazzolo, A. Lemke, S. Saad, C. Zhang, S. P. Jayanti, J. O'Shea, and R. G. Reynolds, "Deepdive: Using AI, machine learning, and virtual reality to explore ancient submerged civilizations," in Proc. 3rd Int. Conf. Artif. Intell. Industries, Sep. 2020, pp. 79–82.
[27] M. I. Iglesias, M. Jenkins, and G. Morison, "An enhanced photorealistic immersive system using augmented situated visualization within virtual reality," in Proc. IEEE Conf. Virtual Reality 3D User Interfaces Abstr. Workshops (VRW), Mar. 2021, pp. 514–515.
[28] F. Silva, M. A. Togou, and G.-M. Muntean, "An innovative algorithm for improved quality multipath delivery of virtual reality content," in Proc. IEEE Int. Symp. Broadband Multimedia Syst. Broadcast. (BMSB), Oct. 2020, pp. 1–6.
[29] V. Franchi and E. Ntagiou, "Augmentation of a virtual reality environment using generative adversarial networks," in Proc. IEEE Int. Conf. Artif. Intell. Virtual Reality (AIVR), Nov. 2021, pp. 219–223.
[30] G. Koch, R. Zemel, and R. Salakhutdinov, "Siamese neural networks for one-shot image recognition," in Proc. ICML, 2015, vol. 2, no. 1, pp. 1–30.
[31] F. Gao and G. Xu, "Research on multi-mode interactive virtual reality scene enhancement based on artificial intelligence," in Proc. 12th Int. Conf. Measuring Technol. Mechatronics Autom. (ICMTMA), Feb. 2020, pp. 391–396.
[32] L. Yang, Y.-C. Tse, P. V. Sander, J. Lawrence, D. Nehab, H. Hoppe, and C. L. Wilkins, "Image-based bidirectional scene reprojection," ACM Trans. Graph., vol. 30, no. 6, pp. 1–10, Dec. 2011, doi: 10.1145/2070781.2024184.
[33] A. Pilco, X. Luo, A. A. N. Newball, C. Zuniga, and C. Lozano-Garzón, "Procedural animation generation technology of virtual fish flock," in Proc. Int. Conf. Virtual Reality Visualizat. (ICVRV), Nov. 2019, pp. 233–237.
[34] D. Cracknell, M. P. White, S. Pahl, W. J. Nichols, and M. H. Depledge, "Marine biota and psychological well-being: A preliminary examination of dose–response effects in an aquarium setting," Environ. Behav., vol. 48, no. 10, pp. 1242–1269, Dec. 2016, doi: 10.1177/0013916515597512.
[35] M. Xian-Yu, B. Xiao-Juan, Z. Shu-Jun, and T. Xu-Yan, "Design of multi-perception system model for artificial fish in virtual environment," in Proc. IEEE Int. Conf. Ind. Informat., Aug. 2006, pp. 1108–1112.
[36] Q. Xu and M. Zhou, "The modeling of fish roe based on artificial life method," in Proc. Int. Conf. Mech. Sci., Electric Eng. Comput. (MEC), Aug. 2011, pp. 880–882.
[37] Y. Liu, "Research on the cognitive model artificial fish based on virtual marine environment," in Proc. 8th Int. Conf. Inf. Technol. Med. Educ. (ITME), Dec. 2016, pp. 719–723.
[38] J. K. Parrish, S. V. Viscido, and D. Grunbaum, "Self-organized fish schools: An examination of emergent properties," Biol. Bull., vol. 202, no. 3, pp. 296–305, Jun. 2002.
[39] M. H. Mozaffari, S. Wen, and W.-S. Lee, "Virtual reality and tracking the mating behavior of fruit flies: A machine learning approach," in Proc. IEEE Int. Conf. Comput. Intell. Virtual Environments Meas. Syst. Appl. (CIVEMSA), Jun. 2018, pp. 1–5.
[40] T. Lints, "FlockHeadz: Virtual flock in a room used as a controller," in Proc. IEEE Swarm Intell. Symp., Sep. 2008, pp. 1–5.
[41] C. W. Reynolds, "Flocks, herds and schools: A distributed behavioral model," ACM SIGGRAPH Comput. Graph., vol. 21, no. 4, pp. 25–34, Aug. 1987.
[42] H. X. Trejo, "Creating an immersed sheep and wool VR/AI experience," in Proc. IEEE Int. Conf. Artif. Intell. Virtual Reality (AIVR), Dec. 2019, pp. 237–2371.
[43] G. Noronha, G. Batista, and L. P. Soares, "Crowd simulation with augmented reality," in Proc. XV Symp. Virtual Augmented Reality, May 2013, pp. 228–231.
[44] I. Aoki, "A simulation study on the schooling mechanism in fish," Nippon Suisan Gakkaishi, vol. 48, no. 8, pp. 1081–1088, 1982.
[45] L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," J. Artif. Intell. Res., vol. 4, no. 1, pp. 237–285, Jan. 1996.
[46] K. Morihiro, T. Isokawa, H. Nishimura, and N. Matsui, "Characteristics of flocking behavior model by reinforcement learning scheme," in Proc. SICE-ICASE Int. Joint Conf., 2006, pp. 4551–4556.
[47] R. Wangananont, N. Buppodom, S. Chanthanuraks, and V. Kotrajaras, "Simulation of homogenous fish schools in the presence of food and predators using reinforcement learning," in Proc. 17th Int. Joint Symp. Artif. Intell. Natural Lang. Process. (iSAI-NLP), Nov. 2022, pp. 1–6.


[48] J. Heyse, T. D. Jonge, M. T. Vega, F. D. Backere, and F. De Turck, "A personalized virtual reality experience for relaxation therapy," in Proc. 11th Int. Conf. Quality Multimedia Exper. (QoMEX), Jun. 2019, pp. 1–3.
[49] A. Elor and A. Song, "ISAM: Personalizing an artificial intelligence model for emotion with pleasure-arousal-dominance in immersive virtual reality," in Proc. 15th IEEE Int. Conf. Autom. Face Gesture Recognit. (FG), Nov. 2020, pp. 572–576.
[50] A. Mehrabian, "Pleasure-arousal-dominance: A general framework for describing and measuring individual differences in temperament," Current Psychol., vol. 14, no. 4, pp. 261–292, Dec. 1996.
[51] A. Dingli and L. Bondin, "Realtime adaptive virtual reality for pain reduction," in Proc. IEEE Conf. Games (CoG), Aug. 2019, pp. 1–4.
[52] K. Gupta, "[DC] exploration of context and physiological cues for personalized emotion-adaptive virtual reality," in Proc. IEEE Conf. Virtual Reality 3D User Interfaces Abstr. Workshops (VRW), Mar. 2022, pp. 960–961.
[53] P. Dhope and M. B. Neelagar, "Real-time emotion recognition from facial expressions using artificial intelligence," in Proc. 2nd Int. Conf. Artif. Intell. Signal Process. (AISP), Feb. 2022, pp. 1–6.
[54] C. Kiourt, G. Pavlidis, A. Koutsoudis, and D. Kalles, "Multi-agents based virtual environments for cultural heritage," in Proc. 26th Int. Conf. Inf., Commun. Autom. Technol. (ICAT), Oct. 2017, pp. 1–6.
[55] T. Kempitiya, D. De Silva, E. Rio, R. Skarbez, and D. Alahakoon, "Personalised physiotherapy rehabilitation using artificial intelligence and virtual reality gaming," in Proc. 15th Int. Conf. Human Syst. Interact. (HSI), Jul. 2022, pp. 1–6.
[56] H. Chen, Y. Wang, and W. Liang, "VCoach: Enabling personalized boxing training in virtual reality," in Proc. IEEE Conf. Virtual Reality 3D User Interfaces Abstr. Workshops (VRW), Mar. 2022, pp. 546–547.
[57] D. Shravani, Y. R. Prajwal, P. V. Atreyas, and G. Shobha, "VR supermarket: A virtual reality online shopping platform with a dynamic recommendation system," in Proc. IEEE Int. Conf. Artif. Intell. Virtual Reality (AIVR), Nov. 2021, pp. 119–123.
[58] A. M. Soccini and F. Cena, "The ethics of rehabilitation in virtual reality: The role of self-avatars and deep learning," in Proc. IEEE Int. Conf. Artif. Intell. Virtual Reality (AIVR), Nov. 2021, pp. 324–328.
[59] E. Osipov, S. Kahawala, D. Haputhanthri, T. Kempitiya, D. D. Silva, D. Alahakoon, and D. Kleyko, "Hyperseed: Unsupervised learning with vector symbolic architectures," IEEE Trans. Neural Netw. Learn. Syst., early access, Nov. 16, 2022, doi: 10.1109/TNNLS.2022.3211274.
[60] Y. Li, X. Li, S. Shen, L. Zeng, R. Liu, Q. Zheng, J. Feng, and S. Chen, "DTBVis: An interactive visual comparison system for digital twin brain and human brain," Vis. Informat., vol. 7, no. 2, pp. 41–53, Jun. 2023.
[61] J. Li and Z. Miao, "Agent based intelligent virtual environment and its entertainment application," in Proc. 8th Int. Conf. Signal Process., vol. 2, 2006.
[62] S. M. A. Moazzen and A. Ahmadi, "3D simulation of tennis game using learning algorithms by application of virtual reality," in Proc. Iranian Conf. Electr. Eng. (ICEE), May 2017, pp. 1527–1531.
[63] E. Rahimi and A. Ahmadi, "An AI-based tennis game by application of virtual reality components," in Proc. Iranian Conf. Electr. Eng. (ICEE), May 2017, pp. 2165–2170.
[64] S. Dawn, U. Saraogi, and U. S. Thakur, "Agent-based learning for auto-navigation within the virtual city," in Proc. Int. Conf. Comput. Perform. Eval. (ComPE), Jul. 2020, pp. 007–012.
[65] V. V. R. M. K. Rao, N. Adhikari, and A. D. Ghimire, "Towards training an agent in augmented reality world with reinforcement learning," in Proc. 17th Int. Conf. Control, Autom. Syst. (ICCAS), Oct. 2017, pp. 1884–1888.
[66] J. Li, Z. Wang, and Y. Zhang, "An implementation of artificial emotion based on fuzzy state machine," in Proc. 3rd Int. Conf. Intell. Human-Mach. Syst. Cybern., 2011, pp. 83–86.
[67] C.-T. Song and H. Wang, "Research and implementation of virtual human animation based on distributed semantic environment," in Proc. IEEE Int. Conf. Electron. Inf. Commun. Technol. (ICEICT), Aug. 2016, pp. 78–81.
[68] R. Dai and Z. Pan, "A virtual companion empty-nest elderly dining system based on virtual avatars," in Proc. IEEE 7th Int. Conf. Virtual Reality (ICVR), May 2021, pp. 446–451.
[69] M. L. Chang, Y.-H. Huang, W.-C. Lin, and S.-W. Sun, "Digital fabrication: Machine learning-based immersive experiencing for the virtual space in a future museum," in Proc. IEEE Int. Conf. Artif. Intell. Virtual Reality (AIVR), Nov. 2021, pp. 102–105.
[70] A. Elor and S. Kurniawan, "Deep reinforcement learning in immersive virtual reality exergame for agent movement guidance," in Proc. IEEE 8th Int. Conf. Serious Games Appl. Health (SeGAH), Aug. 2020, pp. 1–7.
[71] Y.-L. Chen, Y.-X. Cheng, I.-Y. Hung, and S.-W. Sun, "Ant-man vision in the museum with interactive and immersive surreal experience based on machine learning," in Proc. IEEE Int. Conf. Artif. Intell. Virtual Reality (AIVR), Dec. 2022, pp. 242–244.
[72] L. Jia and M. Zhenjiang, "Entertainment oriented intelligent virtual environment with agent and neural networks," in Proc. IEEE Int. Workshop Haptic, Audio Vis. Environments Games, Oct. 2007, pp. 90–95.
[73] L. Cai, T. Mei, Y. Sun, L. Sun, and Z. Ma, "Modeling and analyzing multi-agent task plans for intelligent virtual training system using Petri Nets," in Proc. 6th World Congr. Intell. Control Autom., 2006, pp. 4766–4770.
[74] L. Cai, T. Mei, and Y. Sun, "Multi-agent based intelligent virtual training system," in Proc. 7th World Congr. Intell. Control Autom., 2008, pp. 4226–4230.
[75] D. Hornak, M. Jascur, N. Ferencik, and M. Bundzel, "Proof of concept: Using reinforcement learning agent as an adversary in serious games," in Proc. IEEE Int. Work Conf. Bioinspired Intell. (IWOBI), Jul. 2019, pp. 000111–000116.
[76] B. D. Bryant and R. Miikkulainen, "Neuroevolution for adaptive teams," in Proc. Congr. Evol. Comput., 2003, pp. 2194–2201.
[77] T. Murata, "Petri Nets: Properties, analysis and applications," Proc. IEEE, vol. 77, no. 4, pp. 541–580, Apr. 1989.
[78] S.-L. Yeh, Y.-S. Lin, and C.-C. Lee, "A dialogical emotion decoder for speech emotion recognition in spoken dialog," in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), May 2020, pp. 6479–6483.
[79] C. Joesph, A. Rajeswari, B. Premalatha, and C. Balapriya, "Implementation of physiological signal based emotion recognition algorithm," in Proc. IEEE 36th Int. Conf. Data Eng. (ICDE), Apr. 2020, pp. 2075–2079.
[80] X. Li and R. Lian, "Correlation modeling of moral emotion based on facial image emotion recognition algorithm," in Proc. Int. Conf. Augmented Intell. Sustain. Syst. (ICAISS), Nov. 2022, pp. 560–562.
[81] M. A. Mahima, N. C. Patel, S. Ravichandran, N. Aishwarya, and S. Maradithaya, "A text-based hybrid approach for multiple emotion detection using contextual and semantic analysis," in Proc. Int. Conf. Innov. Comput., Intell. Commun. Smart Electr. Syst. (ICSES), Sep. 2021, pp. 1–6.
[82] Z. Zhang, L. Xie, and Z. Zhao, "Temporal integration of emotion perception for cross-cultural and multi-emotion music," in Proc. Int. Conf. Culture-Oriented Sci. Technol. (ICCST), Nov. 2021, pp. 198–203.
[83] A. Dixit, M. Vashishtha, and K. Guleri, "An AI based formulated feedback-system for interpreting conclusive emotions for a group of people," in Proc. IEEE Global Conf. Comput., Power Commun. Technol. (GlobConPT), Sep. 2022, pp. 1–5.
[84] T. Akahori, K. Dohsaka, M. Ishii, and H. Ito, "Efficient creation of Japanese tweet emotion dataset using sentence-final expressions," in Proc. IEEE 3rd Global Conf. Life Sci. Technol. (LifeTech), Mar. 2021, pp. 501–505.
[85] A. S. M. S. Reaj, M. Maniruzzaman, and A. A. J. Jim, "Emotion recognition using EEG-based brain computer interface," in Proc. Int. Conf. Electron., Commun. Inf. Technol. (ICECIT), Sep. 2021, pp. 1–4.
[86] J. Li, S. Qiu, Y.-Y. Shen, C.-L. Liu, and H. He, "Multisource transfer learning for cross-subject EEG emotion recognition," IEEE Trans. Cybern., vol. 50, no. 7, pp. 3281–3293, Jul. 2020.
[87] K. Liu, M. Yang, X. Li, K. Zhang, X. Xia, and H. Yan, "M-data-fabric: A data fabric system based on metadata," in Proc. IEEE 5th Int. Conf. Big Data Artif. Intell. (BDAI), Jul. 2022, pp. 57–62.
[88] X. Zhou, "Tennis motion precision video monitoring simulation based on artificial intelligence algorithm," in Proc. Int. Conf. Artif. Intell. Auto. Robot Syst. (AIARS), Jul. 2022, pp. 211–215.
[89] A. S. Ahmad, "Brain inspired cognitive artificial intelligence for knowledge extraction and intelligent instrumentation system," in Proc. Int. Symp. Electron. Smart Devices (ISESD), Oct. 2017, pp. 352–356.
[90] L. M. Alkwai, "An explainable artificial-intelligence-based CNN model for knowledge extraction from the social Internet of Things: Proposing a new model," IEEE Syst., Man, Cybern. Mag., vol. 8, no. 4, pp. 48–51, Oct. 2022.
[91] Z. Dong, "Research progress and prospects of 'artificial intelligence + education' area based on CiteSpace knowledge spectrum," in Proc. 3rd Int. Conf. Educ., Knowl. Inf. Manag., 2022, pp. 331–334.
[92] X. Su, Z. You, D. Huang, L. Wang, L. Wong, B. Ji, and B. Zhao, "Biomedical knowledge graph embedding with capsule network for multi-label drug-drug interaction prediction," IEEE Trans. Knowl. Data Eng., vol. 35, no. 6, pp. 5640–5651, Jun. 2023.


[93] X. Zhao, J. Greenberg, S. McClellan, Y.-J. Hu, S. Lopez, S. K. Saikin, X. Hu, and Y. An, "Knowledge graph-empowered materials discovery," in Proc. IEEE Int. Conf. Big Data, Dec. 2021, pp. 4628–4632.
[94] T. Ma, L. Miao, Y. Du, and H. Wang, "Calibration data prediction of wind tunnel balance based on regression tree," in Proc. 16th Int. Conf. Intell. Syst. Knowl. Eng. (ISKE), Nov. 2021, pp. 362–367.
[95] Q. Wang, A. P. Doulgeris, and T. Eltoft, "Physics-aware training data to improve machine learning for sea ice classification from Sentinel-1 SAR scenes," in Proc. IEEE Int. Geosci. Remote Sens. Symp., Jul. 2022, pp. 4992–4995.
[96] S. Zhang, J. Yuan, M. Liao, and L. Zhang, "text2video: Text-driven talking-head video synthesis with personalized phoneme-pose dictionary," in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), May 2022, pp. 2659–2663.
[97] M. Zhang, Z. Yang, C. Liu, and L. Fang, "Traditional Chinese medicine knowledge service based on semi-supervised BERT-BiLSTM-CRF model," in Proc. Int. Conf. Service Sci. (ICSS), Aug. 2020, pp. 64–69.

JUNXIANG WANG is currently pursuing the bachelor's degree with Nanjing Normal University. He has been the author of several software copyrights. His current research interests include artificial intelligence, visualization, virtual reality, deep learning, and content generation.

SIRU CHEN is currently pursuing the degree with Nanjing Normal University. Her current research interests include scientific data visualization and immersive visualization. She is the co-fourth author of the 2021 IEEE PacificVis Best Poster paper and the author of the paper Multi-User Collaborative Volume Data Illustration and Visualization, presented at IEEE Visualization 2020 (IEEE VIS).

YUXUAN LIU is currently pursuing the bachelor's degree with Nanjing Normal University. Her current research interests include visualization and virtual reality.

RICHEN LAU received the Ph.D. degree from Peking University, in 2017. He is an Associate Professor with Nanjing Normal University. He has published more than 30 peer-reviewed journal and conference papers, in venues including ACM CHI, IEEE VIS, IEEE ISMAR, IEEE PacificVis, IEEE THMS, and Bioinformatics. He was a PC/TPC Member of IEEE VIS (short papers), ACHI, ChinaVis, and HPC China. He has supervised students to receive the Best Survey Paper Award from ChinaVis 2019 and the Best Poster Award from IEEE PacificVis.
