boocode/openspec/changes/archived/2026-06-07-eval-sandbox-agent-runtime/tasks.md at 9e2b0a7dc0a340effa8d06febb435f8e5d58f9ea

indifferentketchup c935687725 chore(openspec): drop 9 superseded proposals + 11 stub archive files

Drop 9 batch proposals that are superseded by the boocode-lift-analysis
(boocontext-audit, conductor upgrades, self-healing/verify-gate skills):
add-3tier-memory, import-llm-evaluator, import-pregel-engine, plugin-platform,
conductor-evolution, code-intelligence-upgrade, dev-workflow, ui-overhaul,
agent-reliability.

Delete 11 stub archive files (49-66B each, 'Status: Shipped. Archived.' only)
that provide zero documentation value over the existing CHANGELOG.md + git tags.

3.2 KiB

Raw Blame History

1. Foundation: Core Types & Monorepo Setup ✅

2. Eval: LLM-as-Judge Core

3. Eval: Trajectory Evaluators

4. Eval: Code Correctness Evaluators

5. Eval: Prompt Library

6. Documentation & Release

3.2 KiB Raw Blame History

1. Foundation: Core Types & Monorepo Setup ✅

2. Eval: LLM-as-Judge Core

3. Eval: Trajectory Evaluators

4. Eval: Code Correctness Evaluators

5. Eval: Prompt Library

6. Documentation & Release

3.2 KiB

Raw Blame History