← Help Center

Limitations & Capabilities

What Arli AI can and cannot do

What Arli AI Can Do

Text-to-Image Generation

Generate images from text descriptions using 74+ Stable Diffusion models including FLUX, SDXL, SD 1.5, anime, and artistic variants.

Image-to-Image Transformation

Transform existing images based on text prompts. Control the intensity with denoising strength.

Image Upscaling

Upscale images by 2x-4x using ESRGAN models with enhanced detail.

Text Generation (117 models)

OpenAI-compatible text/chat completions with 117 models. Includes guided generation (JSON schema, regex, choice constraints).

Vision Analysis (base64)

Analyze images using vision-capable text models (Qwen3.5-VL, Qwen3.5-27B). Base64 input only — URLs not supported.

What Arli AI Cannot Do

Video Generation

Arli AI cannot generate or edit video content. Consider Runway, Kling AI, or AnimateDiff for video generation.

Audio/Speech/TTS

No text-to-speech, speech recognition, or audio generation capabilities.

Vision via URL

Arli AI text endpoints with vision only accept base64-encoded images, not URLs. Use Featherless for URL-based vision input.

Streaming Responses

Arli AI image endpoints return full base64 responses. Text endpoints may support streaming depending on the model.

Fine-tuning / Training

Cannot fine-tune, train, or customize models. All 74 image models and 117 text models are used as-is.

Inpainting with Masks

While the img2img endpoint accepts a mask parameter, inpainting results may vary. Dedicated inpainting models may give better results.

Size Limits

ParameterLimitNotes
Image width64 – 2048 pxMust be a multiple of 64
Image height64 – 2048 pxMust be a multiple of 64
Batch size1 – 4Images per request
Steps1 – 150Denoising steps
CFG scale1 – 20Classifier-free guidance
Prompt lengthNo hard limitVery long prompts may be truncated by the model
Input image (img2img)Base64 encodedLarge images may timeout
Input image (upscale)Base64 encodedLarge images may timeout
Max tokens (text)Model-dependentVaries by model context window

Rate Limits

TypeLimitDetails
Parallel requests (image)6 concurrentExceeding returns HTTP 429. Check GET /v1/parallel-requests
Request timeout (image)300 seconds5-minute timeout for generation requests
Request timeout (list)30 secondsFor model listing and metadata endpoints
Text rate limitsPer-keyVaries by API key plan tier

Checking Rate Limits

curl https://api.arliai.com/v1/parallel-requests \
  -H "Authorization: Bearer YOUR_KEY"

# Response:
# {"parallel_requests": 6, "remaining": 4, "message": "..."}

Timeout Handling

Image generation can take 30-120 seconds depending on model, resolution, and steps.

Quality Limitations

Best Practices