Commit Graph

4 Commits

Author SHA1 Message Date
gitprov e3e4f9d0ec Fix model config context/token limits and inject gutask orientation into agent identity
- Chat model max_tokens: 32000 → 4096 (was inflating context budget calculations)
- Utility model ctx_length: 8192 → 14000 (matches GLM flash actual loaded context)
- Utility model max_tokens: explicit 2048 cap (utility tasks need short responses)
- Utility model name: full zai-org/glm-4.7-flash to target correct LM Studio instance
- pull-agent-identity.py: include agent_id and gutask orientation block in system prompt
- gutask.md: expand lore command docs, note done auto-creates review record

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-10 23:32:58 +02:00
gitprov 44633ac6d6 Modifications to many skills 2026-05-09 19:36:03 +02:00
gitprov 8e97cbc97a Configure all agents for local inference via festinger
- All agents now use lm_studio provider → http://festinger:11434
- ctx_length set to 32768 for Omega13 (128GB RAM); reduce for smaller machines
- Model: qwen2.5-7b-instruct (update to larger model on Omega13)
- Each agent has a unique A0_PERSISTENT_RUNTIME_ID for stable mcp_server_token
- agent_profile=agent0 and mcp_server_enabled=true set in all settings.json
- agents/agent0/prompts/ placeholder created for pull-on-start persona override
- pull-agent-identity.py now writes to usr/agents/agent0/prompts/ (correct override path)
- festinger: agent_frameworks table auto-seeded on startup with all 5 agents
- festinger: num_ctx injection, agent_frameworks CRUD + admin UI, /chat endpoint
- festinger: removed debug system_prompt logging

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-03 13:00:07 +02:00
gitprov e8301fb2bf Adding content to git 2026-05-03 08:45:58 +02:00