AI Agent 驱动的开源视频生成工作台 — 小说→角色/场景/道具设计→剧本→分镜图→视频,跨镜头角色与场景一致 | Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI
-
Updated
Apr 11, 2026 - Python
AI Agent 驱动的开源视频生成工作台 — 小说→角色/场景/道具设计→剧本→分镜图→视频,跨镜头角色与场景一致 | Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI
GenMedia Creative Studio is a Vertex AI generative media user experience highlighting the use of Imagen, Veo, Gemini 🍌, Gemini TTS, Chirp 3, Lyria and other generative media APIs on Google Cloud.
AI-assisted storyboard and video generation tool. Uses Gemini for generating storyboard text and frames, Vertex AI Veo for generating transition clips, and ffmpeg for stitching the final video. Built-in logs and gallery management.
基于lobe-chat,增加了无限画布功能,支持google、openai、kling、midjourney等画图和视频模型,还有额外的独立的绘图、音乐、视频等创作面板,支持用户注册登录,充值消费,模型及价格管理,聊天、绘图、音乐、视频创作记录管理,通知公告等
A Step-by-Step Implementation of Google Veo 3 Architecture from Scratch
ImgStudio is a NextJS web app designed for easy deployment and user-friendly experience, streamlining access to the power of Google's GenAI model Imagen & Veo to generate powerful images & videos 🔥
Open-source AI pipeline that turns any topic into a publish-ready YouTube/Instagram/TikTok Short — research, script, voiceover, visuals, music, captions, and assembly in one command.
comfyui中使用sora2、veo3.1、NanoBanana Pro等热门ai(api)
Neural Reels - AI-Powered Short-Form Video Generator
Google Veo 3 & Veo 3.1 prompt generator. Create professional video prompts with style presets, native audio support, cinematography controls, and templates for cinematic, commercial, social media, and educational content. Free tool with JSON/Markdown export. Perfect for creators, marketers, and educators.
AI Image & Video Generator powered by Google Gemini & Veo 3.1. Create images, videos, stickers, slides, and convert PDFs to editable PPTX. 100% client-side, no backend. | Gemini + Veo 3.1 驅動的 AI 圖像與影片生成器,支援簡報轉 PPTX。
Neural Reels is an AI-powered application that automates the creation of short-form video content from simple text prompts.
Claude AI skill for cinematic Higgsfield AI prompts — 18 sub-skills, MCSLA formula, Soul ID, Cinema Studio 2.5, 10 genre templates
Open-source AI video pipeline. Text prompt → scenario → images → video clips → editor → MP4. Self-hosted, multi-provider, MCP-ready.
AI storyteller that turns screen time into active adventure. Puck — a live Gemini agent — narrates interactive fairytales, generates watercolor scenes, and only continues the story when the child completes a real physical challenge. Built with Gemini Live 2.5, Google ADK, Veo 3.1, React 19.
Generate AI videos from text or images using 13 top models — an Agent Skills plugin powered by Pollo AI
Capy Video Gen Skill: Multi-shot AI video generation with face identity consistency. 300 experiments, 70% improvement. Script-to-video and idea-to-video pipelines for HappyCapy AI Gateway.
Model-Agnostic AI Video Framework
Python SDK and MCP server for generating and extending videos with Google Veo
Add a description, image, and links to the veo topic page so that developers can more easily learn about it.
To associate your repository with the veo topic, visit your repo's landing page and select "manage topics."