发现和使用优秀的技能扩展
针对不报告上下文限制的本地模型(MLX、llama.cpp、Ollama)的基于令牌的上下文压缩
Token-based context compaction for local models (MLX, llama.cpp, Ollama) that don't report context limits.