Pixelle Video is an open-source, fully automated short-video generation engine from AIDC-AI, the AI team at Alibaba International Digital Commerce. It takes a text prompt or topic and produces a complete short video (script, voiceover, stock footage selection, subtitle overlay, and background music) without manual editing steps. At the time of writing the repository had 6,289 GitHub stars, 1,011 of them gained in a single day, which made it the fastest-growing open-source video generation project that week. Built in Python, it runs locally with GPU acceleration or on Alibaba Cloud. The pipeline covers the full short-form production loop that would otherwise require a human editor: topic → script → voiceover → footage → assembly → export.
Pixelle Video compresses short-form video production into a single automated workflow. Given a topic or prompt, the system handles the steps that usually bottleneck on a person: writing the script, recording voiceover, finding footage, adding subtitles, selecting music, and assembling the final cut.
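To make the stage ordering concrete, here is a minimal sketch of a pipeline that passes one job object through each step in turn. It is an illustration only, not Pixelle Video's actual code: `VideoJob`, the stage functions, and `run_pipeline` are hypothetical names, and each stage just records a placeholder where a real system would call an LLM, a TTS engine, a footage search, or FFmpeg.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class VideoJob:
    """Hypothetical container for everything the pipeline accumulates."""
    topic: str
    script: str = ""
    voiceover: str = ""
    footage: List[str] = field(default_factory=list)
    output: str = ""
    log: List[str] = field(default_factory=list)


def write_script(job: VideoJob) -> VideoJob:
    job.script = f"Narration about: {job.topic}"  # placeholder for an LLM call
    job.log.append("script")
    return job


def record_voiceover(job: VideoJob) -> VideoJob:
    job.voiceover = "voiceover.wav"  # placeholder for a TTS call
    job.log.append("voiceover")
    return job


def select_footage(job: VideoJob) -> VideoJob:
    job.footage = ["clip_01.mp4", "clip_02.mp4"]  # placeholder for footage search
    job.log.append("footage")
    return job


def assemble(job: VideoJob) -> VideoJob:
    # A real implementation would overlay subtitles and mix music here.
    job.log.append("assembly")
    return job


def export(job: VideoJob) -> VideoJob:
    job.output = "final.mp4"  # placeholder for the rendered file path
    job.log.append("export")
    return job


# Stage order mirrors: topic -> script -> voiceover -> footage -> assembly -> export
STAGES: List[Callable[[VideoJob], VideoJob]] = [
    write_script, record_voiceover, select_footage, assemble, export,
]


def run_pipeline(topic: str) -> VideoJob:
    job = VideoJob(topic=topic)
    for stage in STAGES:
        job = stage(job)
    return job


if __name__ == "__main__":
    print(run_pipeline("5 AI tools for developers").log)
```

Keeping the stages in a plain list makes it easy to insert a manual review step (for example, editing the script) between any two of them.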
This is Alibaba AIDC’s entry into the open-source video generation space, released as a direct alternative to closed commercial platforms. The codebase is Python-based and designed for deployment on consumer GPUs (NVIDIA) as well as Alibaba Cloud instances.
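The output is a complete short-form video from a text prompt: script, voiceover, stock footage, subtitles, and background music, assembled and exported as a finished cut.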
Content operations automation: Generate draft videos for editorial review faster than any manual pipeline. Edit the script before generation to control the output narrative.
Multilingual content: The TTS pipeline supports multiple languages, enabling the same topic to produce parallel videos in different markets.
Prototype storytelling: Quickly visualize a video concept before committing production resources.
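For the content-operations case, a batch of draft videos could be queued by building one command per topic. The sketch below only constructs the argument lists and does not run anything. It assumes the `generate.py` entry point and the `--topic`, `--duration`, and `--aspect` flags exactly as they appear in the install command in this article; these have not been checked against the repository, and `build_command` and `build_batch` are hypothetical helper names.

```python
import sys
from typing import List


def build_command(topic: str, duration: int = 60, aspect: str = "9:16") -> List[str]:
    """Return an argv list for one draft video.

    Assumption: generate.py and its flags are as shown in this article's
    quick-start command; verify against the repository before use.
    """
    return [
        sys.executable, "generate.py",
        "--topic", topic,
        "--duration", str(duration),
        "--aspect", aspect,
    ]


def build_batch(topics: List[str], duration: int = 60) -> List[List[str]]:
    """One command per topic, skipping blank entries."""
    return [build_command(t.strip(), duration) for t in topics if t.strip()]


if __name__ == "__main__":
    for cmd in build_batch(["Intro to vector databases", "", "What is RAG?"]):
        # Pass each list to subprocess.run(cmd, check=True) to execute it.
        print(cmd)
```

Passing the topic as its own list element, rather than formatting a shell string, avoids quoting problems when a topic contains spaces or quotes.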
git clone https://github.com/AIDC-AI/Pixelle-Video
cd Pixelle-Video
pip install -r requirements.txt
# Generate a video
python generate.py --topic "5 AI tools that changed how developers work in 2026" --duration 60 --aspect 9:16
Requires Python 3.10+, FFmpeg, and, for reasonable generation speed, a CUDA-compatible GPU (recommended rather than mandatory).
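Before installing, the prerequisites listed above can be checked with a few lines of standard-library Python. This is a rough sketch: `check_prerequisites` is a hypothetical helper, and finding `nvidia-smi` on the PATH only suggests that an NVIDIA driver is installed, not that CUDA works end to end.

```python
import shutil
import sys
from typing import Dict, Tuple


def check_prerequisites(min_python: Tuple[int, int] = (3, 10)) -> Dict[str, bool]:
    """Report which of the stated requirements appear to be satisfied."""
    return {
        # Interpreter version meets the stated minimum.
        "python": sys.version_info >= min_python,
        # FFmpeg binary is discoverable on PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
        # Proxy for an NVIDIA driver; the GPU is recommended, not required.
        "nvidia_smi": shutil.which("nvidia-smi") is not None,
    }


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'ok' if ok else 'missing'}")
```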
Content teams that produce high-volume short-form video and need to automate the production pipeline. Developers exploring open-source video generation. Teams evaluating alternatives to closed-source video AI platforms before committing to a vendor.