跳到主要导航 跳到搜索 跳到主要内容

Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models

  • East China Normal University
  • Alibaba Group Holding Ltd.

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Toon shading is a type of non-photorealistic rendering task in animation. Its primary purpose is to render objects with a flat and stylized appearance. As diffusion models have ascended to the forefront of image synthesis, this paper delves into an innovative form of toon shading based on diffusion models, aiming to directly render photorealistic videos into anime styles. In video stylization, existing methods encounter persistent challenges, notably in maintaining consistency and achieving high visual quality. In this paper, we model the toon shading problem as four subproblems, i.e., stylization, consistency enhancement, structure guidance, and colorization. To address the challenges in video stylization, we propose an effective toon shading approach called Diffutoon. Diffutoon is capable of rendering remarkably detailed, high-resolution, and extended-duration videos in anime style. It can also edit the video content according to input prompts via an additional branch. The efficacy of Diffutoon is evaluated through quantitive metrics and human evaluation. Notably, Diffutoon surpasses both open-source and closed-source baseline approaches in our experiments. Our work is accompanied by the release of both the source code and example videos on Github.

源语言英语
主期刊名Proceedings of the 33rd International Joint Conference on Artificial Intelligence, IJCAI 2024
编辑Kate Larson
出版商International Joint Conferences on Artificial Intelligence
7645-7653
页数9
ISBN(电子版)9781956792041
出版状态已出版 - 2024
活动33rd International Joint Conference on Artificial Intelligence, IJCAI 2024 - Jeju, 韩国
期限: 3 8月 20249 8月 2024

出版系列

姓名IJCAI International Joint Conference on Artificial Intelligence
ISSN(印刷版)1045-0823

会议

会议33rd International Joint Conference on Artificial Intelligence, IJCAI 2024
国家/地区韩国
Jeju
时期3/08/249/08/24

指纹

探究 'Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models' 的科研主题。它们共同构成独一无二的指纹。

引用此