Treffer: Human Motion Video Generation: A Survey.

Title:
Human Motion Video Generation: A Survey.
Source:
IEEE transactions on pattern analysis and machine intelligence [IEEE Trans Pattern Anal Mach Intell] 2025 Nov; Vol. 47 (11), pp. 10709-10730.
Publication Type:
Journal Article; Review
Language:
English
Journal Info:
Publisher: IEEE Computer Society Country of Publication: United States NLM ID: 9885960 Publication Model: Print Cited Medium: Internet ISSN: 1939-3539 (Electronic) Linking ISSN: 00985589 NLM ISO Abbreviation: IEEE Trans Pattern Anal Mach Intell Subsets: MEDLINE
Imprint Name(s):
Original Publication: [New York] IEEE Computer Society.
Entry Date(s):
Date Created: 20250731 Date Completed: 20251003 Latest Revision: 20251118
Update Code:
20251118
DOI:
10.1109/TPAMI.2025.3594034
PMID:
40742844
Database:
MEDLINE

Weitere Informationen

Human motion video generation has garnered significant research interest due to its broad applications, enabling innovations such as photorealistic singing heads or dynamic avatars that seamlessly dance to music. However, existing surveys in this field focus on individual methods, lacking a comprehensive overview of the entire generative process. This paper addresses this gap by providing an in-depth survey of human motion video generation, encompassing over ten sub-tasks, and detailing the five key phases of the generation process: input, motion planning, motion video generation, refinement, and output. Notably, this is the first survey that discusses the potential of large language models in enhancing human motion video generation. Our survey reviews the latest developments and technological trends in human motion video generation across three primary modalities: vision, text, and audio. By covering over two hundred papers, we offer a thorough overview of the field and highlight milestone works that have driven significant technological breakthroughs. Our goal for this survey is to unveil the prospects of human motion video generation and serve as a valuable resource for advancing the comprehensive applications of digital humans.