MCP servers that generate images
Turn prompts into images via Flux, MiniMax, Recraft, and more.
4 servers · Last updated June 17, 2026
TL;DR: These servers wrap text-to-image (and sometimes video/audio) models so your agent can create assets on demand. They differ by underlying model, speed/cost, and whether they also do edits, upscaling, or background removal.
Bottom line: if you only try one, MiniMax MCP Server (Official) is the most popular, verified option for this (1,500★). 3 more compared below.
Compare 4 servers
| Server | Transport | Auth | Verified | Stars | Tools for this |
|---|---|---|---|---|---|
| MiniMax MCP Server (Official) | Local (stdio) | API key | 1,500 | text_to_image, generate_video | |
| Replicate Flux MCP | Local (stdio) | API key | 103 | generate_image, generate_multiple_images, generate_image_variants +1 | |
| Recraft MCP Server (Official) | Local (stdio) | API key | 60 | generate_image, image_to_image, vectorize_image | |
| Fal.ai MCP Server | Local (stdio) | API key | 49 | generate_image, generate_image_structured, generate_image_from_image +9 |
The servers
Official MiniMax server for TTS, voice cloning, music, image, and video generation.
Community server generating images and SVG assets via Replicate's Flux models.
Official Recraft server for raster and vector image generation, vectorization, and editing.
Community server for image, video, music, and audio generation across fal.ai models.
Use these in a stack
FAQ
Which image-generation MCP is fastest?
Hosted inference providers (fal, Replicate) optimize for latency; the right pick depends on the model you want and your budget per image.
Can these edit existing images, not just generate?
Some expose edit/upscale/variation tools in addition to generation — check each server's tool list on its page.