API keys (Anthropic, OpenAI etc.) are stored in the inference_endpoints
table and passed through the request from gnommoweb. Removed the
API_KEY_ANTHROPIC / AGENT_INFERENCE_MODEL env var fallbacks entirely.
Missing endpoint config now returns a clear error instead of silently
falling back to a hardcoded model.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Moved from gnommoweb/agent-inference. Generic LLM inference bridge
supporting litellm (anthropic/openai/ollama/lm_studio), Agent Zero MCP,
and Hermes JSON-RPC WebSocket agent types.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>