Abstract
Manipulating 3D faces using text is an important technology in the entertainment industry. However, text-based manipulation of 3D faces remains a challenging area due to the scarcity of data pairs consisting of 3D faces and corresponding text. Additionally, inference for manipulating 3D faces using text prompts often requires several minutes due to the large model sizes or the optimization process to fit the text prompt. In this paper, we propose the ITFaceEdit model, a text and image-based 3D face manipulation model. ITFaceEdit constructs a framework trainable only with image and text data pairs, allowing it to learn a direct relationship between the text latent space and the 3D face latent space. By utilizing vectors from the learned text embeddings, we can manipulate 3D faces, employing face parsing for disentangled manipulation. Through this approach, we not only extend the reconstructed 3D face space using images with text-based manipulation but also configure an inference process without relatively heavy model structures and optimization steps, enabling 3D face manipulation in a few seconds. We demonstrate the superiority of our proposed method through comparisons with existing methods in various ways.
| Original language | English |
|---|---|
| Title of host publication | Pattern Recognition - 27th International Conference, ICPR 2024, Proceedings |
| Editors | Apostolos Antonacopoulos, Subhasis Chaudhuri, Rama Chellappa, Cheng-Lin Liu, Saumik Bhattacharya, Umapada Pal |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 129-146 |
| Number of pages | 18 |
| ISBN (Print) | 9783031781216 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 27th International Conference on Pattern Recognition, ICPR 2024 - Kolkata, India Duration: 2024 Dec 1 → 2024 Dec 5 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 15303 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 27th International Conference on Pattern Recognition, ICPR 2024 |
|---|---|
| Country/Territory | India |
| City | Kolkata |
| Period | 24/12/1 → 24/12/5 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
Keywords
- 3D face manipulation
- Text-driven 3D face animation
- Text-driven 3D face manipulation
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science
Fingerprint
Dive into the research topics of 'Interactive-Time Text-Guided Editing of 3D Face'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS