Open Generative AI: Free, Self-Hosted Image & Video Studio with 200+ Models

Open-source AI media studio with 200+ image and video models, no content filters, offline generation via sd.cpp, and a pipeline builder for automated workflows.

Open Generative AI is a free, open-source AI media studio for creating AI images, videos, lip sync clips, cinematic shots, audio assets, and multi-step media workflows.

It runs in the browser, as a desktop app, or as a self-hosted Next.js project. The model catalog covers image generation, image editing, text-to-video, image-to-video, lip sync, audio, and workflow automation through Muapi plus local inference options in the desktop app.

AI video work rarely stays within a single model. You may start with a reference image, test several video generators, send a frame back through an image editor, add lip sync, then save the result as part of a larger content workflow.

Open Generative AI keeps those stages in one workspace, with separate studios for images, video, lip sync, cinema controls, audio, and visual workflows.

The web version is enough for a first look, but the desktop and self-hosted paths are the reason this project deserves attention.

A single-purpose AI video generator will be faster for one-off clips. Open Generative AI earns its place when one project needs model comparison, reference-image reuse, lip sync, local image generation, and repeatable media workflows.

Features

  • Generates text-to-image outputs with 50+ models, including Flux, Nano Banana, Seedream, Ideogram, GPT Image, Midjourney, and SDXL.
  • Transforms reference images through 55+ image-to-image models, including edit, upscaling, background removal, and style transfer models.
  • Accepts up to 14 reference images in compatible image-editing models, with selection order badges and batch upload.
  • Creates text-to-video outputs with 40+ models, including Kling, Veo, Wan, Seedance, Hailuo, and Runway.
  • Animates start-frame images through 60+ image-to-video models, including Kling I2V, Veo3 I2V, Runway I2V, Seedance I2V, Midjourney I2V, Hunyuan I2V, and Wan I2V.
  • Creates talking videos in Lip Sync Studio from portrait images plus audio or existing videos plus audio.
  • Provides 9 lip sync models, including Infinite Talk, Wan Speech to Video, LTX Lipsync, Sync, LatentSync, Creatify, and Veed.
  • Cinema Studio controls for camera type, lens style, focal length, and aperture.
  • Builds multi-step media pipelines through Workflow Studio, templates, custom workflows, community workflows, a node builder, and a playground.
  • Stores upload history and generation history in browser storage for easier reuse across sessions.
  • Runs local image models through bundled sd.cpp in the desktop app.
  • Connects to a user-run Wan2GP server for local or remote GPU video generation.
  • Desktop installers for Windows, macOS Apple Silicon, macOS Intel, Linux AppImage, and Linux DEB.

Use Cases

  • Create AI images from text prompts for concept art, thumbnails, mockups, product visuals, and social posts.
  • Edit reference images with multiple inputs for character consistency, product variations, style transfer, and visual remixing.
  • Generate short AI videos from prompts or start-frame images for experiments, storyboarding, social clips, and visual drafts.
  • Sync voices to portrait images or existing videos for talking-head demos, character clips, and prototype narration.
  • Build repeatable image, video, and audio pipelines through visual workflows or API execution.

How to Use Open Generative AI

Use the hosted version

  1. Open https://muapi.ai/open-generative-ai.
  2. Create a free account before the first generation.
  3. Choose Image Studio, Video Studio, Lip Sync Studio, Cinema Studio, or Workflow Studio.
  4. Enter a prompt before choosing a text-to-image or text-to-video model.
  5. Upload a reference image before choosing an image-to-image or image-to-video model.
  6. Select the model first when you need model-specific controls such as aspect ratio, resolution, quality, duration, or reference-image count.
  7. Upload all reference images in the order the model should read them when a multi-image picker appears.
  8. Download completed outputs from the generation history panel.

Use the desktop app

  1. Download the current desktop build from the releases page.
  2. Choose the installer that matches your operating system and processor.
  3. Open the app after installation.
  4. Add a Muapi access key only when you plan to use Muapi cloud models.
  5. Skip the Muapi key when you plan to use only supported local models.
  6. Open Settings > Local Models before local generation.
  7. Install the sd.cpp engine inside the app for local image models.
  8. Choose a smaller SD 1.5 model on an 8 GB Mac.
  9. Use Z-Image models only on machines with enough RAM.
  10. Set a custom local AI storage directory before downloading multi-GB models when your system drive is small.

Use local image generation through sd.cpp

  1. Open Settings > Local Models.
  2. Install the sd.cpp inference engine.
  3. Download Dreamshaper 8, Realistic Vision v5.1, Anything v5, SDXL Base 1.0, Z-Image Turbo, or Z-Image Base.
  4. Download the shared Qwen3-4B text encoder and FLUX VAE files when you choose a Z-Image model.
  5. Open Image Studio.
  6. Turn on the Local toggle beside the model selector.
  7. Select the local model.
  8. Generate the image without a Muapi API key.
Dreamshaper 8 is the safest first local test on lower-memory Macs because the Z-Image path has much higher memory pressure.

Use Wan2GP for local video generation

  1. Run Wan2GP on a machine with a CUDA or ROCm GPU.
  2. Start the Wan2GP server on the GPU machine.
  3. Open Settings > Local Models in Open Generative AI.
  4. Paste the Wan2GP server URL.
  5. Click Test.
  6. Save the connection.
  7. Choose a Wan2GP model from the desktop app.
Wan2GP makes the most sense when you already own a gaming PC, Linux GPU box, workstation, or rented GPU instance. Mac-only users can still run the desktop app on macOS and send video jobs to a separate GPU server.

Technical Details

Access and platform options

OptionRequirementBest use
Hosted web appFree accountFast browser access and cloud models.
Desktop appWindows, macOS, or Linux installerLocal engines, desktop workflow, and fewer browser limits.
Source buildNode.js v18+ and repository setupDevelopment, customization, and self-hosting.
sd.cpp local engineDesktop app and local model weightsLocal image generation.
Wan2GP serverSeparate CUDA or ROCm GPU machineLocal or remote video generation.
Muapi APIMuapi access keyCloud model requests and API workflows.

macOS Gatekeeper command

Run this command after dragging the app into /Applications.

xattr -cr "/Applications/Open Generative AI.app"

Linux build and install commands

Build Linux installers.

npm run electron:build:linux

Run the AppImage.

chmod +x "release/Open Generative AI-*.AppImage"
./release/Open\ Generative\ AI-*.AppImage

Install the DEB package.

sudo apt install ./release/open-generative-ai_*_amd64.deb

Install libfuse2 when older systems cannot start the AppImage.

sudo apt install libfuse2

Apply the temporary Ubuntu 24.04+ AppArmor workaround for AppImage builds.

sudo sysctl -w kernel.apparmor_restrict_unprivileged_userns=0

Make the Ubuntu 24.04+ AppArmor workaround persistent.

echo 'kernel.apparmor_restrict_unprivileged_userns=0' | sudo tee /etc/sysctl.d/99-userns.conf

Wan2GP server setup

Clone and install Wan2GP on the GPU machine.

git clone https://github.com/deepbeepmeep/Wan2GP
cd Wan2GP
./install.sh
python wgp.py --listen --server-name 0.0.0.0

Windows users can use install.bat during the Wan2GP install step.

install.bat

Source setup

Clone the repository with submodules.

git clone --recurse-submodules https://github.com/Anil-matcha/Open-Generative-AI.git
cd Open-Generative-AI

Initialize submodules after a normal clone.

git submodule update --init --recursive

Install dependencies and build workspace packages.

npm run setup

Start the desktop app in development mode.

npm run electron:dev

Start the hosted web version in development mode.

npm run dev

Build and start the production web app.

npm run build
npm run start

Build desktop apps.

npm run electron:build
npm run electron:build:win
npm run electron:build:linux
npm run electron:build:all

Local model storage

PlatformDefault local AI path
macOS~/Library/Application Support/open-generative-ai/local-ai
Windows%APPDATA%\open-generative-ai\local-ai
Linux~/.config/open-generative-ai/local-ai

Set OPEN_GENERATIVE_AI_LOCAL_AI_DIR before launching the app when model weights belong on another drive.

export OPEN_GENERATIVE_AI_LOCAL_AI_DIR="/Volumes/AIModels/open-generative-ai"

SD 1.5 local verification on Mac

Define the app data path.

APP_DATA="${OPEN_GENERATIVE_AI_LOCAL_AI_DIR:-$HOME/Library/Application Support/open-generative-ai/local-ai}"
ls "$APP_DATA/bin"
ls "$APP_DATA/models"

Download Dreamshaper 8.

curl -L --fail --progress-bar \
  -o "$APP_DATA/models/DreamShaper_8_pruned.safetensors" \
  "https://huggingface.co/Lykon/DreamShaper/resolve/main/DreamShaper_8_pruned.safetensors"

Run a 512×512 test generation.

DYLD_LIBRARY_PATH="$APP_DATA/bin" "$APP_DATA/bin/sd-cli" \
  -m "$APP_DATA/models/DreamShaper_8_pruned.safetensors" \
  -p "a serene mountain lake at sunrise, oil painting" \
  -o /tmp/sd15-test.png \
  --steps 12 -H 512 -W 512 --cfg-scale 7.5 --seed 42 \
  --sampling-method euler_a

Check Metal linkage when the run falls back to CPU.

otool -L "$APP_DATA/bin/libstable-diffusion.dylib" | grep -i metal

Local engines

EngineRuns insideHardwareModel types
sd.cppDesktop appCPU, Apple Silicon Metal, CUDA, Vulkan, ROCmSD 1.5, SDXL, Z-Image image models.
Wan2GPSeparate Gradio serverCUDA or ROCm GPUFlux.1 Dev, Qwen Image, Wan 2.2, Hunyuan Video, LTX Video.

Local image model notes

ModelTypeSize notesPractical note
Z-Image TurboDiffusion Transformer2.5 GB plus 2.7 GB auxiliary filesUse 16 GB RAM or higher.
Z-Image BaseDiffusion Transformer3.5 GB plus 2.7 GB auxiliary filesHigher quality path with more steps.
Dreamshaper 8SD 1.52.1 GBGood first test model.
Realistic Vision v5.1SD 1.52.1 GBPhotorealistic image path.
Anything v5SD 1.52.1 GBAnime and illustration path.
SDXL Base 1.0SDXL6.9 GBHigher-resolution image path.

Model categories

CategoryCountExamples
Text-to-image50+Flux Dev, Nano Banana 2, Seedream 5.0, Ideogram v3, Midjourney v7, GPT-4o, SDXL.
Image-to-image55+Nano Banana 2 Edit, Flux Kontext Pro, GPT-4o Edit, Seededit v3, Upscaler, Background Remover.
Text-to-video40+Kling v3, Sora 2, Veo 3, Wan 2.6, Seedance 2.0, Hailuo 2.3, Runway Gen-3.
Image-to-video60+Kling I2V, Veo3 I2V, Runway I2V, Seedance I2V, Midjourney I2V, Hunyuan I2V, Wan I2V.
Lip sync9Infinite Talk, Wan 2.2 Speech to Video, LTX Lipsync, Sync, LatentSync, Creatify, Veed.

Lip sync endpoints

ModelEndpointInput modeResolution
Infinite Talkinfinitetalk-image-to-videoPortrait image plus audio480p, 720p
Wan 2.2 Speech to Videowan2.2-speech-to-videoPortrait image plus audio480p, 720p
LTX 2.3 Lipsyncltx-2.3-lipsyncPortrait image plus audio480p, 720p, 1080p
LTX 2 19B Lipsyncltx-2-19b-lipsyncPortrait image plus audio480p, 720p, 1080p
Sync Lipsyncsync-lipsyncVideo plus audioNot listed
LatentSynclatentsync-videoVideo plus audioNot listed
Creatify Lipsynccreatify-lipsyncVideo plus audioNot listed
Veed Lipsyncveed-lipsyncVideo plus audioNot listed
Infinite Talk V2Vinfinitetalk-video-to-videoVideo plus audio480p, 720p

API pattern

StepMethodEndpointPurpose
SubmitPOST/api/v1/{model-endpoint}Send prompt, model parameters, and input URLs.
PollGET/api/v1/predictions/{request_id}/resultCheck job status until completion.
UploadPOST/api/v1/upload_fileUpload media and receive a hosted URL.
Authentication uses the x-api-key header. Multi-image models send the full images_list array in one request. Lip sync jobs send image_url or video_url with audio_url and poll until the output video URL becomes available.

Alternatives and Related Tools

Pros

  • MIT licensed.
  • Hosted web access.
  • Desktop installers available.
  • 200+ model catalog.
  • Local image generation.
  • Visual workflow builder.
  • Lip sync included.
  • Multi-image editing.

Cons

  • Free account needed online.
  • Muapi key needed cloud.
  • Local video needs GPU.
  • No guardrails require judgment.

FAQs

Q: Is Open Generative AI free?
A: Open Generative AI is free and open source. The hosted browser version starts with a free account, while cloud model usage depends on Muapi access and the selected model route.

Q: Does Open Generative AI include content filters?
A: Open Generative AI advertises no built-in content filters or prompt blocking. You should apply your own content policy before using it in public, client, or brand-sensitive workflows.

Q: Is Open Generative AI good for developers?
A: Developers can self-host the app, build from the Next.js monorepo, call Muapi endpoints, run workflows through API execution, and connect related agent or workflow projects.

Q: Can Open Generative AI replace paid AI video platforms?
A: Open Generative AI can replace parts of a paid AI video workflow when model access, self-hosting, and customization matter. A closed paid platform may still be easier for teams that need managed billing, stricter review controls, and predictable support.

Q: Can I generate video locally without a GPU?
A: The sd.cpp engine generates images locally on CPU, Metal (Apple Silicon), CUDA, Vulkan, or ROCm. Video generation locally requires a Wan2GP server running on a CUDA or ROCm GPU machine. No local video path exists for CPU or Apple Silicon alone.

Q: What is the Workflow Studio?
A: Workflow Studio is a node-based visual pipeline builder inside the app. You can connect image, video, and audio model steps into automated sequences, run them in an interactive playground, browse and clone community workflow templates, and call any saved workflow from external code through the Muapi API.

Leave a Reply

Your email address will not be published. Required fields are marked *

Get the latest & top AI tools sent directly to your email.

Subscribe now to explore the latest & top AI tools and resources, all in one convenient newsletter. No spam, we promise!