e3e4f9d0ec
- Chat model max_tokens: 32000 → 4096 (was inflating context budget calculations) - Utility model ctx_length: 8192 → 14000 (matches GLM flash actual loaded context) - Utility model max_tokens: explicit 2048 cap (utility tasks need short responses) - Utility model name: full zai-org/glm-4.7-flash to target correct LM Studio instance - pull-agent-identity.py: include agent_id and gutask orientation block in system prompt - gutask.md: expand lore command docs, note done auto-creates review record Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>