modellix

📁 modellix/modellix-skill 📅 8 days ago

总安装量

周安装量

#53229

全站排名

安装命令

npx skills add https://github.com/modellix/modellix-skill --skill modellix

Agent 安装分布

replit 1

amp 1

openclaw 1

opencode 1

kimi-cli 1

Skill 文档

Capabilities

Modellix is a unified API platform that enables agents to access 100+ AI models for generative tasks. Agents can generate images from text, create videos from text or images, edit existing images, and perform specialized tasks like virtual try-on, image outpainting, and text-based image translation. The platform supports asynchronous task processing with full request/response logging and transparent pricing.

Skills

Text-to-Image Generation

Generate images from text prompts using models like Qwen Image Plus, Wanx, Seedream, and Hailuo
Supports multiple artistic styles and high-quality image generation
Endpoint: POST /api/v1/text-to-image/{provider}/{model_id}/async
Example: curl -X POST https://api.modellix.ai/api/v1/text-to-image/alibaba/qwen-image-plus/async -H "Authorization: Bearer YOUR_API_KEY" -d '{"prompt": "A cute cat playing in a garden"}'

Text-to-Video Generation

Create videos from text descriptions with cinematic quality
Models include Wan 2.6 T2V, Seedance, and Hailuo 02 T2V
Supports automatic dubbing and custom audio file uploads
Endpoint: POST /api/v1/text-to-video/{provider}/{model_id}/async
Charged by video duration (USD/sec)

Image-to-Image Editing

Edit existing images through text instructions
Capabilities include style transfer, watermark removal, image expansion, detail enhancement, and object addition/removal
Models: Wanx 2.1 Image Edit, Qwen Image Edit Plus, Seedream 4.0+ I2I, Seededit 3.0
Endpoint: POST /api/v1/image-to-image/{provider}/{model_id}/async

Image-to-Video Generation

Generate videos from image references combined with text prompts
Supports first-and-last-frame video generation (KF2V models)
Models: Wan 2.6 I2V, Seedance I2V, Hailuo 02 I2V
Endpoint: POST /api/v1/image-to-video/{provider}/{model_id}/async
Charged by video duration (USD/sec)

Specialized Image Tasks

Virtual Try-On: Generate try-on images from portrait and clothing photos (AI Try-On, AI Try-On Plus)
Image Outpainting: Extend images with free expansion and rotation support
Image Translation: Translate text in images across 11 languages while preserving layout
WordArt: Create artistic text with semantic deformation or texture effects
Image Parsing: Segment model and clothing images for preprocessing

Async Task Management

Submit tasks and receive immediate task_id response
Query task status and results using: GET /api/v1/tasks/{task_id}
Task results include resource URLs, metadata, and timing information
Results retained for 24 hours after completion
Response includes status (pending/success/failed), duration, and generated resources

API Authentication & Key Management

Create, view, and delete API keys through Modellix console
Authentication via Bearer token: Authorization: Bearer YOUR_API_KEY
API keys displayed only once after creation
Support for Email OTP, Google, and GitHub authentication

Rate Limiting & Quotas

Global limit: 1000 requests/minute
Per API key limit: 100 requests/minute
Per model limits subject to provider constraints
Response headers include: X-RateLimit-Limit, X-RateLimit-Remaining, X-RateLimit-Reset
Concurrent task limits enforced per team

Error Handling

Unified error response format with code and message fields
HTTP status codes: 200 (success), 400 (bad request), 401 (unauthorized), 404 (not found), 429 (rate limit), 500 (server error), 503 (service unavailable)
Error categories: Invalid parameters, Missing required parameter, Invalid format, Value out of range, Authentication failed, Resource not found, Rate limit exceeded, Concurrent limit exceeded
Retryable errors: 429, 500, 503 with exponential backoff strategy

Request/Response Logging

Complete logging of all API calls including input parameters, output parameters, timestamps, response time, and cost consumption
Financial-grade transaction ledger system for every user
Real-time visibility into recharge records and consumption details
Supports business optimization and audit compliance

Workflows

Basic Image Generation Workflow

Register and log in to Modellix console
Create API key at https://modellix.ai/console/api-key
Submit text-to-image request: POST /api/v1/text-to-image/alibaba/qwen-image-plus/async
Receive task_id in response
Poll task status: GET /api/v1/tasks/{task_id}
Retrieve generated image URL from result when status is “success”
Download image before 24-hour expiration

Video Generation with Custom Audio

Prepare text prompt describing desired video
Submit text-to-video request with optional audio parameters
Receive task_id
Monitor task progress via polling or streaming interface
Retrieve video URL and metadata when complete
Extract video duration for cost calculation (charged per second)

Image Editing Pipeline

Prepare source image and editing instructions
Submit image-to-image request with image URL and text prompt
Receive task_id
Query task status until completion
Retrieve edited image from result resources
Optionally chain multiple edits by using output as input

Error Recovery with Retry Logic

Attempt API request
Check response code
If 429: Extract X-RateLimit-Reset header, wait until reset time
If 500/503: Apply exponential backoff (1s, 2s, 4s) with max 3 retries
If 400/401/404: Fix request parameters and retry
Log error details including timestamp, parameters, and error message

Integration

MCP (Model Context Protocol) Integration

Remote MCP server available at: https://docs.modellix.ai/mcp
Compatible with Cursor, Claude Desktop, and other MCP clients
Search tool for Modellix documentation with filter parameters (version, language, apiReferenceOnly, codeOnly)
Enables AI applications to search documentation during response generation

OpenAI Integration

Use Modellix as remote MCP server with OpenAI models
Configure MCP tool in OpenAI API calls with server URL and label
Allows GPT models to access Modellix documentation and capabilities

Multi-Provider Support

Alibaba (Qwen, Wanx, Wan models)
ByteDance (Seedream, Seedance, Seededit)
MiniMax (Hailuo, Image-01)
Additional providers continuously added

Context

Async Processing Model

All Modellix API calls are asynchronous. Agents must submit a request, receive a task_id, then poll the task status endpoint to retrieve results. This enables handling of long-running generation tasks without blocking connections.

Pricing Structure

Text-to-image and image-to-image: Charged per image (USD/img)
Text-to-video and image-to-video: Charged per second of video (USD/sec)
Transparent pricing with granular unit costs for different models and parameters
More competitive rates for select core models compared to official pricing

Result Retention

Generated results are stored for 24 hours after task completion. Agents must download or save results within this window or they will be permanently deleted.

Rate Limiting Strategy

Team-level rate limiting applies to all API keys under the same account. Agents should implement client-side concurrency control and monitor X-RateLimit-Remaining header to avoid frequent limit triggers. Exponential backoff is recommended for retryable errors.

Enterprise Reliability

AWS Singapore infrastructure with redundant deployment
15-year enterprise IT service experience
Rolling upgrades and strict SLA operational standards
Support in Mandarin, Cantonese, and English via email and Discord community

For additional documentation and navigation, see: https://docs.modellix.ai/llms.txt

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台