Agent & LLM · 2026年6月27日

每日论文速递 · Agent & LLM

💡 一句话：提出 MemStrata，用确定性的 bi-temporal ledger 管理事实 supersession，专门解决 RAG/agent memory 里“旧事实和新事实一起被召回”的结构性问题。

2026-06-27 09:01:468 篇论文条目

📄 每日论文速递 · Agent & LLM

日期：2026-06-27

1. Temporal Validity in Retrieval Memory: Eliminating Stale-Fact Errors for AI Agents over Evolving Knowledge

中文：检索记忆中的时间有效性：消除 AI Agent 在演化知识上的过期事实错误

🔗 https://arxiv.org/abs/2606.26511

💡 一句话：提出 MemStrata，用确定性的 bi-temporal ledger 管理事实 supersession，专门解决 RAG/agent memory 里“旧事实和新事实一起被召回”的结构性问题。

🎯 关联：很高。Anna 做 InternOS 和 Agent 平台一定会碰到 memory 版本、状态过期、承诺变更的问题；这篇比普通 RAG memory 更接近真实系统设计。

2. A Process Harness for Uplifting Legacy Workflows to Agentic BPM: Design and Realization in CUGA FLO

中文：面向 Agentic BPM 的流程 Harness：把传统工作流升级为 Agent 工作流

💡 一句话：它不是让 LLM 接管整个流程，而是在确定性 workflow engine 外包一层 policy-governed agentic harness，在指定 control point 上让 agent 参与任务、决策和流程调整。

🎯 关联：非常高。这和 InternOS 的组织协调系统方向高度贴合：确定性流程保底，Agent 在 hook 点增强，而不是无边界自治。

3. Semantic Early-Stopping for Iterative LLM Agent Loops

中文：迭代式 LLM Agent Loop 的语义早停机制

🔗 https://arxiv.org/abs/2606.27009

💡 一句话：用连续 draft 的 embedding 语义变化和质量变化来决定 multi-agent loop 何时停止，避免固定 max_iterations 造成 token 浪费或提前截断。

🎯 关联：很高。对 Agent 平台的 scheduler/orchestrator 很实用：loop 不该只靠轮数上限，而应该有状态感知的 stop policy。

4. Joint Learning of Experiential Rules and Policies for Large Language Model Agents

中文：面向 LLM Agent 的经验规则与策略联合学习

🔗 https://arxiv.org/abs/2606.27136

💡 一句话：JERP 同时维护可检索的长期经验规则池和可训练 policy，让 agent 从交互轨迹里沉淀经验，又避免规则和当前策略脱节。

🎯 关联：很高。Anna 如果要做长期运行的 Agent，不只是“存 memory”，还要回答：哪些经验变成规则、何时检索、如何和当前策略同步。

5. A Deterministic Control Plane for LLM Coding Agents

中文：LLM 编程 Agent 的确定性控制平面

🔗 https://arxiv.org/abs/2606.26924

💡 一句话：把 coding agent 的规则文件、权限、审计、phase state machine、IDE 配置编译都纳入 deterministic control plane，而不是继续交给 LLM 自己管。

🎯 关联：很高。这个判断是对的：Agent 平台需要 deterministic governance layer，尤其是文件权限、任务阶段、需求到测试的 traceability，不能靠 prompt 自觉。

6. How Much Static Structure Do Code Agents Need? A Study of Deterministic Anchoring

中文：代码 Agent 到底需要多少静态结构？关于确定性锚点的研究

🔗 https://arxiv.org/abs/2606.26979

💡 一句话：给 code agent 注入轻量 call graph / inheritance / dependency 注释，主要不是让它“更聪明”，而是让 repo navigation 更稳定、更可复现。

🎯 关联：高。对 Anna 做 AI dev tooling 或 InternOS codebase agent 都有启发：结构化上下文不是越多越好，关键是提供稳定锚点，降低 stochastic wandering。

7. Autoformalization of Agent Instructions into Policy-as-Code

中文：将 Agent 指令自动形式化为 Policy-as-Code

🔗 https://arxiv.org/abs/2606.26649

💡 一句话：用 LLM generator-critic loop 把 agent prompts、MCP tool descriptions 和自然语言 policy 转成 Cedar Policy Language，实现更可验证的权限与行为约束。

🎯 关联：很高。Agent 平台如果要接工具和 MCP，迟早要把“自然语言规则”降级为可执行 policy；这篇直接打在安全边界和治理层。

8. Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It

中文：多步工具使用 RL 为什么崩溃，以及监督信号如何修复它

🔗 https://arxiv.org/abs/2606.26027

💡 一句话：分析 tool-use RL 中结构化 tool call 崩掉的原因，指出控制 token 概率尖峰会破坏执行格式，并验证 SFT/RL 交错训练能提高稳定性。

🎯 关联：中高。Anna 如果关心 Agent 训练和 tool-use reliability，这篇有价值；但它偏模型训练侧，不如前几篇直接服务平台架构。