Playground V2.5

Playground V2.5 is a diffusion-based text-to-image generative model, designed to create highly aesthetic images based on textual prompts.


Pricing

Serverless Pricing

Buy credits that can be used anywhere on Segmind

$ 0.001 /per gpu second

Dedicated Cloud Pricing

For enterprise costs and dedicated endpoints

$ 0.0007 - $ 0.0031 /per gpu second

Playground V2.5

Playground V2.5 is a diffusion-based text-to-image generative model, designed to create highly aesthetic images based on textual prompts. As the successor to Playground V2, it represents the state-of-the-art in open-source aesthetic quality. Playground v2.5 excels at producing visually attractive images. It achieves this through advancements in color, contrast and human details.

Technical Details

  • Model Type: Playground V2.5 operates as a Latent Diffusion Model.

  • Text Encoders: It utilizes two fixed, pre-trained text encoders: OpenCLIP-ViT/G and CLIP-ViT/L.

  • Architecture: The model follows the same architecture as Stable Diffusion XL.

  • Resolution: Playground V2.5 generates images at a resolution of 1024x1024 pixels, catering to both portrait and landscape aspect ratios.

  • Scheduler Options: The default scheduler is EDMDPMSolver Multistep Scheduler, which enhances fine details. A guidance scale of 3.0 works well with this scheduler.

Playground V2.5 outperforms SDXL, PixArt-α, DALL-E 3, Midjourney 5.2, and even its predecessor, Playground V2.

Cookie settings

We use cookies to enhance your browsing experience, analyze site traffic, and personalize content. By clicking "Accept all", you consent to our use of cookies.