We fine-tuned Qwen-Image-Edit on the brand-new Pico-Banana-400K dataset using our EigenTrain platform, introduced a Lightning option that produces edits in as few as 4 steps, and rounded out the stack with EigenInference (production-safe inference optimizations) and EigenDeploy (secure, autoscaling deployments for regulated enterprises). We’re also open-sourcing a LoRA branch that runs in Diffusers and DiffSynth-Studio, with a hosted demo for a much faster experience.
Pico-Banana-400K is a large-scale, instruction-based image editing dataset built from real photos. It contains ~400K text–image–edit triplets spanning 35 edit operations across 8 semantic categories, with both single-turn and multi-turn supervision. The authors generate human-like edit instructions (e.g., via Gemini-2.5-Flash), produce edits (via Nano-Banana), and auto-screen quality (via Gemini-2.5-Pro) to ensure instruction faithfulness and content preservation. Source images come from Open Images.
EigenTrain unifies SFT, offline RL, and online RL for training text LLMs and VLMs, and includes first-class workflows for multimodal image/video generation. It’s designed for teams that need to turn new datasets into high-quality models fast:
Production-safe inference optimizations—quantization, sparsity, distillation, routing, and more—to crush strict latency and cost targets across LLM, VLM, and multi-modal workloads.
Resilient deployments across cloud or your metal—choose serverless, on-demand, or dedicated serving.
We started from Qwen-Image-Edit—a strong, Apache-2.0 image-editing foundation that supports precise text editing and both appearance-preserving and semantic edits. Using EigenTrain, we fine-tuned it on Pico-Banana-400K to create Eigen-Banana-Qwen-Image-Edit.
We also optimized a Lightning variant that generates high-quality edits in 4 steps—great for interactive tools and low-latency workloads. Our Lightning build follows the community’s Qwen-Image-Lightning approach (FlowMatch-style scheduler + LoRA distillation) and supports 4-step presets.
What this means for you
Eigen-Banana-Qwen-Image-Edit is a LoRA (Low-Rank Adaptation) checkpoint for the Qwen-Image-Edit model, optimized for fast, high-quality image editing with text prompts. This model enables efficient text-guided image transformations with reduced inference steps while maintaining excellent quality. Access model weights from here.
Use it in DiffSynth-Studio or try it in Diffusers (QwenImage / Qwen-Image-Edit pipelines).
For a much faster hosted version without any quality loss, visit our demo here.

Prompt: Integrate a minimalist, dark-toned, rectangular gallery bench into the mid-ground, positioned slightly to the right of the central pillar and facing the right wall, ensuring its texture, lighting, and subtle shadows are consistent with the existing black and white aesthetic and diffused ambient light of the art gallery.

Qwen-Image-Edit

Eigen-Banana (⚡Lightning)

Prompt: Apply a vintage film aesthetic to the image, featuring a subtle desaturation of colors with a warm, golden-hour tone, introduce a fine and natural-looking film grain across the entire scene, gently reduce overall contrast for a softer appearance, and add a very faint, dark vignette to the edges to mimic an aged photographic print.

Qwen-Image-Edit

Eigen-Banana (⚡Lightning)

Prompt: Add the text "CHAMPION" in a bold, sans-serif font, horizontally aligned below the existing "GLASGOW" text on the race bib of the runner wearing number 454 (yellow singlet), ensuring the text color, lighting, and subtle fabric distortion match the existing elements on the bib.

Qwen-Image-Edit

Eigen-Banana (⚡Lightning)

Prompt: Replace the current plain wall background with a sophisticated, softly lit indoor event space, featuring warm golden ambient lighting, elegant architectural details such as decorative panels or subtle artwork, and a slightly blurred depth of field to keep the focus on the subjects while ensuring the new background's rich, muted tones complement their attire.

Qwen-Image-Edit

Eigen-Banana (⚡Lightning)

Prompt: Adjust the subject's facial expression to a subtle, closed-mouth smile, ensuring natural skin folds and realistic lighting on the face, while maintaining the existing head posture and integrating seamlessly with the overall image context.

Qwen-Image-Edit

Eigen-Banana (⚡Lightning)
Huge thanks to the teams and communities that made this possible:
About EigenTrain, EigenInference, EigenDeploy — Platform & Availability.
Bring instruction-guided image editing to your product—or adapt to a new domain—fast. EigenTrain turns your data into high-quality editors with efficient fine-tuning. EigenInference delivers production-safe inference optimizations—quantization, sparsity, distillation, routing, and more—to hit strict latency and cost targets across LLM, VLM, and multi-modal workloads. EigenDeploy provides secure, autoscaling deployments across cloud or on-prem with high availability, observability, and cost control for regulated enterprises.
We’ll work with your team to scope fit and deployment options aligned to your dataset licensing and compliance needs. If you are interested in EigenTrain, EigenInference, EigenDeploy and dedicated serving across LLM, VLM, and multi-modal (see here) with SLOs, observability, and on-prem/air-gapped options, contact Eigen AI.
The open-sourced LoRA branch is plug-and-play in Diffusers (QwenImage pipelines) and DiffSynth-Studio, so you can try it locally right away—or visit here to use our hosted demo for maximum speed.