DiffLane: Diffusion Model-Based Lane Mask Generation for Accurate Video Lane Detection

Wenxiang Liu, Yongkang Liu, Weiliang Meng, Gaoqi He*, Jianhua Li*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Mask-based video lane detection methods currently have achieved promising performance. However, they generate irregular lane masks in complex scenes, resulting in inaccurate lane positioning. Diffusion models have achieved notable success in the field of image segmentation because of their ability to restore pixel-level details. In this paper, we propose a novel framework DiffLane, termed Diffusion Model-Based Lane Mask Generation for Accurate Video Lane Detection. The main idea of our work is to exploit the detail-restoring capability of diffusion models to generate high-quality lane masks. DiffLane includes the MultiFrame Fusion Enhancer (MFFE), the MultiScale De-noising Network (MSDN) and the Dynamic Lane Perception Unit (DLPU). In MFFE, the current frame is enhanced with visual information from the past two frames through global matching-based optical flow estimation. This enhanced frame serves as a condition for each denoising step. MSDN predicts noise through a multi-scale fusion strategy, enabling the diffusion model to remove noise and generate regular lane masks precisely. DLPU regresses the coefficient vectors from the generated lane masks with DSConv applied in two directions, completing the accurate video lane detection task. Extensive experiments on the VIL-100 and OpenLane-V datasets demonstrate that our method outperforms other state-of-the-art approaches.

Original languageEnglish
Title of host publication2025 IEEE International Conference on Multimedia and Expo
Subtitle of host publicationJourney to the Center of Machine Imagination, ICME 2025 - Conference Proceedings
PublisherIEEE Computer Society
ISBN (Electronic)9798331594954
DOIs
StatePublished - 2025
Event2025 IEEE International Conference on Multimedia and Expo, ICME 2025 - Nantes, France
Duration: 30 Jun 20254 Jul 2025

Publication series

NameProceedings - IEEE International Conference on Multimedia and Expo
ISSN (Print)1945-7871
ISSN (Electronic)1945-788X

Conference

Conference2025 IEEE International Conference on Multimedia and Expo, ICME 2025
Country/TerritoryFrance
CityNantes
Period30/06/254/07/25

Keywords

  • diffusion model
  • lane mask generation
  • video lane detection

Fingerprint

Dive into the research topics of 'DiffLane: Diffusion Model-Based Lane Mask Generation for Accurate Video Lane Detection'. Together they form a unique fingerprint.

Cite this