Providers

NVIDIA NIM

NVIDIA NIM provides optimized inference endpoints for open-source models running on NVIDIA GPUs. Get production-quality performance from models like Llama, Mixtral, and Gemma without managing your own GPU infrastructure.

Prerequisites

  • An NVIDIA account
  • An API key from the NVIDIA build platform

Setup

  1. Go to build.nvidia.com
  2. Sign in with your NVIDIA account
  3. Click Get API Key and copy it
  4. In Sythoria, open Settings from the sidebar
  5. Select NVIDIA NIM from the Provider Preset dropdown — the API base URL and default model fill in automatically
  6. Paste your API key into the API Key field
  7. The connection indicator should turn green within a few seconds

Your API key is stored securely in your OS keychain. It is never sent to Sythoria's servers. See Privacy & Security.

Default configuration

SettingValue
API Basehttps://integrate.api.nvidia.com/v1/chat/completions
Default Modelmeta/llama-3.3-70b-instruct
API FormatOpenAI Chat Completions
StreamingSupported

Available models

NVIDIA NIM hosts optimized versions of popular open-source models. Model IDs use the provider/model-name format:

Model IDBase modelBest for
meta/llama-3.3-70b-instructLlama 3.3 70BGeneral-purpose, strong reasoning
mistralai/mixtral-8x22b-instruct-v0.1Mixtral 8x22BEfficient mixture-of-experts, good throughput
google/gemma-2-27b-itGemma 2 27BConversation and instruction following
nvidia/llama-3.1-nemotron-70b-instructNemotron 70BNVIDIA-tuned Llama, high accuracy

Browse all available models at build.nvidia.com/explore/discover.

Model ID format

NVIDIA NIM uses the provider/model-name naming scheme. When entering a model in Sythoria's Model ID field, use the exact ID from build.nvidia.com/explore/discover. Examples:

  • meta/llama-3.3-70b-instruct
  • mistralai/mixtral-8x22b-instruct-v0.1
  • google/gemma-2-27b-it

Pricing

NVIDIA offers free credits for new accounts, with pay-per-token pricing afterward. Check build.nvidia.com for current rates and credit availability.

Troubleshooting

ErrorCauseFix
401 UnauthorizedAPI key is invalid or expiredGenerate a new key at build.nvidia.com
404 Model not foundIncorrect model IDUse the exact model ID from build.nvidia.com/explore/discover
429 Rate LimitExceeded free tier or usage limitsWait and retry, or add credits to your account
503 Service unavailableModel endpoint temporarily downWait a moment and retry — NVIDIA scales endpoints dynamically
Connection indicator redNetwork issue or invalid configurationVerify the API base URL is https://integrate.api.nvidia.com/v1/chat/completions and click Refresh
SythoriaDocs Navigation