Generate interactive game-style videos from a single image using keyboard actions (W/A/S/D).
Using the distilled model for faster generation (8 inference steps).
0.53
420
Tips:
Each action generates 33 frames (1.3 seconds at 25 FPS)
The distilled model is optimized for speed with 8 inference steps