flymin's picture
upload ckpt
f26cd84
metadata
license: apache-2.0
language:
  - en
tags:
  - magicdrive
  - image-generation
  - controllable-generation
  - autonomous-driving

MagicDrive-t

ArXiv | Web | License | GitHub (video-branch)

This repository contains 60-frame driving-view video generation model checkpoint.

  • We train this model by loading from the 16-frame 224x400 checkpoint (we interpolate the temporal positional embeddings from 16 to 60), and train for 80k steps.
  • Please check hydra/overrides.yaml for command overrides in the training config.

MagicDrive: Street View Generation with Diverse 3D Geometry Control
Ruiyuan Gao1*, Kai Chen2*, Enze Xie3^, Lanqing Hong3, Zhenguo Li3, Dit-Yan Yeung2, Qiang Xu1^
1CUHK 2HKUST 3Huawei Noah's Ark Lab
*Equal Contribution ^Corresponding Authors

Generated video A.

More cars in A on the right and front.


Generated video B

Change B to night.

Remove the car on the left in B.


Generated video C.

Remove the car on the right in C.

Change C to a rainy day.

Change C to a rainy day and remove the car on the right.

For more information, please refer to our GitHub: https://github.com/cure-lab/MagicDrive/tree/video (i.e., the video branch of MagicDrive repo)