Documentation Index
Fetch the complete documentation index at: https://factory-docs-auto-sync-jp-docs.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Access cost-effective inference for a wide variety of open-source models with DeepInfra’s optimized infrastructure.
Configuration
Add to ~/.factory/settings.json:
{
"customModels": [
{
"model": "zai-org/GLM-4.7",
"displayName": "GLM-4.7 [DeepInfra]",
"baseUrl": "https://api.deepinfra.com/v1/openai",
"apiKey": "YOUR_DEEPINFRA_TOKEN",
"provider": "generic-chat-completion-api",
"maxOutputTokens": 16384
},
{
"model": "deepseek-ai/DeepSeek-V3.1-Terminus",
"displayName": "DeepSeek V3.1 Terminus [DeepInfra]",
"baseUrl": "https://api.deepinfra.com/v1/openai",
"apiKey": "YOUR_DEEPINFRA_TOKEN",
"provider": "generic-chat-completion-api",
"maxOutputTokens": 16384
},
{
"model": "moonshotai/Kimi-K2-Instruct-0905",
"displayName": "Kimi K2 Instruct [DeepInfra]",
"baseUrl": "https://api.deepinfra.com/v1/openai",
"apiKey": "YOUR_DEEPINFRA_TOKEN",
"provider": "generic-chat-completion-api",
"maxOutputTokens": 32768
},
{
"model": "Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo",
"displayName": "Qwen3 Coder 480B [DeepInfra]",
"baseUrl": "https://api.deepinfra.com/v1/openai",
"apiKey": "YOUR_DEEPINFRA_TOKEN",
"provider": "generic-chat-completion-api",
"maxOutputTokens": 32768
}
]
}
Getting Started
- Sign up at deepinfra.com
- Get your API token from the dashboard
- View available models at their model list
- Add desired models to your configuration
Notes
- Base URL format:
https://api.deepinfra.com/v1/openai
- Model names match Hugging Face repository format
- Supports OpenAI-compatible API
- Automatic model updates when new versions are released