Baseten - Factory Documentation

Deploy and serve custom models with Baseten’s enterprise-grade infrastructure for ML model serving.

Configuration

Add to ~/.factory/settings.json:

{
  "customModels": [
    {
      "model": "Qwen/Qwen3-Coder-480B-A35B-Instruct",
      "displayName": "Qwen3-Coder-480B [Baseten]",
      "baseUrl": "https://inference.baseten.co/v1",
      "apiKey": "YOUR_BASETEN_API_KEY",
      "provider": "generic-chat-completion-api",
      "maxOutputTokens": 8192
    }
  ]
}

Getting Started

Sign up at baseten.co
Deploy a model from their model library or upload your own
Get your API key from the settings page
Find your model ID in the deployment dashboard
Add the configuration to your Factory config

Notes

Base URL format: https://inference.baseten.co/v1
Replace YOUR_MODEL_ID with your deployed model’s ID from Baseten dashboard
Supports OpenAI-compatible API format
Contact Baseten for enterprise features and custom deployments

Bring Your Own Key (BYOK)