Agentic builds: dsynth evidence capture hooks #1517

tuxillo · 2026-01-10T00:59:52Z

Goal

We are designing a system to automatically (agent-assisted) fix ports while keeping the existing, build-driven workflow intact:

dsynth stays the authoritative build executor.
On failure, we capture a bounded evidence bundle (distilled errors + small port context) so automated triage/patch generation can be driven by real build output without dumping huge logs or entire work directories into an AI context.
Evidence is intended to flow into an asynchronous agent pipeline (triage → patch → review) via a central queue (documented), so builds never block on AI availability.

What this PR adds (foundation)

dsynth hook scripts under scripts/dsynth-hooks/:
- hook_run_start / hook_run_end group failures per build run and snapshot dsynth summary lists.
- hook_pkg_failure creates a per-failure evidence bundle with:
  - logs/errors.txt (high-signal extract, capped at 200KB)
  - logs/full.log.gz (full log preserved for humans)
  - port/* snapshot (Makefile/distinfo/pkg-plist/patches, etc.)
  - meta.txt and basic dsynth profile/config snapshots
Design/usage documentation in docs/AGENTIC_BUILDS.md describing:
- the overall automated-fixing workflow (bounded evidence → triage → snippet escalation → patch → rebuild)
- an opencode integration plan, including a central queue model for asynchronous triage
A small README pointer to the hook location.

What this PR does not do (yet)

No network calls from hooks.
No queue writer/runner implementation.
No automated patch application.

Those are intentionally deferred so this PR can land the core evidence-capture mechanism safely and independently.

How to try it

Install hooks by copying/symlinking scripts/dsynth-hooks/hook_* and scripts/dsynth-hooks/hook_common.sh into dsynth’s config base (/etc/dsynth/ or /usr/local/etc/dsynth/) and making them executable.
Run dsynth normally.
On a port failure, inspect ${Directory_logs}/evidence/runs/.../ports/.../ for the evidence bundle.

Why this matters for automated fixing

Reliable, size-capped evidence capture is the prerequisite for an automated port-fixing system:

the triage agent needs consistent inputs (errors.txt + port metadata)
the patch agent can generate DeltaPorts-style diffs based on evidence, not guesses
the rebuild loop stays dsynth-driven, and automation can be layered on without destabilizing build infrastructure

Add dsynth hook scripts that snapshot distilled build errors and relevant port metadata on failures, grouped by run, so debugging can stay build-driven without keeping full workdirs. Document the bounded evidence contract and the planned opencode integration/central queue model for asynchronous triage.

Add observe-only state server for remote UI integration: - REST API for runs, jobs, bundles, ports, artifacts - SSE event stream with replay support - SQLite persistence for full history - Filesystem reconciler for live updates Validated on DragonFlyBSD VM - all endpoints tested.

- Add vanilla JS Bootstrap 5 UI served by state-server - Live SSE event stream with replay/reconnect - Views: Overview, Events, Jobs, Runs, Ports, Bundles - Artifact viewer for markdown, diffs, logs - SSE improvements: after_id, tail query params, ts in payloads

- Add /bundles API endpoint listing recent bundles - Add #/bundles route with renderBundles() view - Add Bundles nav item to navbar - Update Phase 9 docs with completion status and new route

- agent-queue-runner: add apply job type and iteration tracking - apply-patch: add DragonFly local mode, --no-push flag, BSD-compatible patch - hook_common.sh: detect rebuild iterations, track previous bundles - Add KEDB entry for DragonFly source patch conventions

Makefiles use tabs, not spaces. The agent was generating patches with spaces which caused patch application failures. Added rule #8 to emphasize preserving exact whitespace from the bundle context.

When retrying a patch application, the branch may already exist from a previous failed attempt. Delete it first to allow the retry.

Stop extraction when hitting common section markers like 'Rationale', 'Files Modified', etc. Also detect when prose text starts after hunks. This prevents non-diff content from being included in patch.diff.

The agent was generating patches with incorrect hunk line counts. Added detailed instructions on unified diff format with example.

- Change dports-patch prompt to request complete file contents - Add extract_files_from_response() to parse FILE content blocks - Add generate_unified_diff() to create diffs programmatically - Add generate_combined_diff() for multi-file patches - Update write_patch_outputs() to try new format first, fallback to legacy This fixes the malformed diff issue - LLMs are good at generating file content but struggle with unified diff syntax and line counts.

The agent was outputting diff syntax inside FILE blocks for Makefile.DragonFly. Make it explicit that Makefile.DragonFly should be raw makefile content, while dragonfly/patch-* files are actual diffs. Also add specific hint for the IFM_IEEE80211_VHT5G error.

…er UI - Add activity_log and runner_status tables to state-server schema - Add /activity and /runner-status API endpoints with SSE events - Update agent-queue-runner to log activities at all job stages - Add heartbeat thread for runner liveness detection (5s interval) - UI: Add Activity Log panel showing last 10 runner activities - UI: Add Runner Status indicator with staleness detection (>15s) - UI: Add back button for artifact navigation in bundle view - UI: Hide session_id.txt files from artifact lists

…b error display - state-server: Only emit runner_status SSE events when status/job_id/stage changes, not on every heartbeat update_at change - app.js: Don't trigger full re-render for runner_status/activity events (fixes bundle tab reset issue), only re-render on overview page - app.js: Add renderJobDetail() with prominent error display and related activity log entries for failed jobs - agent-queue-runner: Write .job.error files before moving failed jobs, move error files along with job files

tuxillo added 30 commits January 10, 2026 01:55

docs: add UI option for the agentic proof of concept

4dc0c72

docs: add implementation plan

ba8d4f8

scripts: complete phase 1

6816931

scripts: complete phase 2

23e4b2f

scripts: complete phase 3

267fd5c

scripts: complete phase 4

8a48f11

scripts: complete phase 5

b6dde31

scripts: add Bundles to UI navbar

d100256

- Add /bundles API endpoint listing recent bundles - Add #/bundles route with renderBundles() view - Add Bundles nav item to navbar - Update Phase 9 docs with completion status and new route

scripts: complete phase 7

c15973a

config: add whitespace preservation rule to dports-patch agent

56aa788

Makefiles use tabs, not spaces. The agent was generating patches with spaces which caused patch application failures. Added rule #8 to emphasize preserving exact whitespace from the bundle context.

scripts/apply-patch: delete existing branch before creating

d0c721e

When retrying a patch application, the branch may already exist from a previous failed attempt. Delete it first to allow the retry.

scripts/agent-queue-runner: improve diff extraction from agent response

e95c557

Stop extraction when hitting common section markers like 'Rationale', 'Files Modified', etc. Also detect when prose text starts after hunks. This prevents non-diff content from being included in patch.diff.

config: add explicit diff format requirements to dports-patch agent

280a207

The agent was generating patches with incorrect hunk line counts. Added detailed instructions on unified diff format with example.

fix(runner): make apply-patch flags configurable

e3e973d

fix(runner): write apply context at runs root

7218e48

fix(hooks): honor apply context without ai-fix branch

8857750

fix(hooks): use apply context even without DeltaPorts dir

0576726

fix(runner): propagate iteration to patch jobs

e63a460

feat(observability): add user context gating and retriage

b5e8aca

fix(patch): enforce unified diff validation before apply

ffd66d7

feat(blobstore): add artifact-store daemon and db-only bundles

1f53318

fix(hooks): snapshot DeltaPorts port files for patch base

06be393

fix(agents): require unified diff output for dports-patch

d926fdd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Agentic builds: dsynth evidence capture hooks #1517

Agentic builds: dsynth evidence capture hooks #1517

Uh oh!

tuxillo commented Jan 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Agentic builds: dsynth evidence capture hooks #1517

Are you sure you want to change the base?

Agentic builds: dsynth evidence capture hooks #1517

Uh oh!

Conversation

tuxillo commented Jan 10, 2026

Goal

What this PR adds (foundation)

What this PR does not do (yet)

How to try it

Why this matters for automated fixing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants