9.2 KiB
9.2 KiB
CHANGELOG
Workspace lab notebook for long-running or resumable research work.
Use this file to track chronology, not release notes. Keep entries short, factual, and operational.
Entry template
YYYY-MM-DD HH:MM TZ — [slug or objective]
- Objective: ...
- Changed: ...
- Verified: ...
- Failed / learned: ...
- Blockers: ...
- Next: ...
2026-03-25 00:00 local — scaling-laws
- Objective: Set up a deep research workflow for scaling laws.
- Changed: Created plan artifact at
outputs/.plans/scaling-laws.md; defined 4 disjoint researcher dimensions and acceptance criteria. - Verified: Read
CHANGELOG.mdand checked prior memory for related planscaling-laws-implications. - Failed / learned: No prior run-specific changelog entries existed beyond the template.
- Blockers: Waiting for user confirmation before launching researcher round 1.
- Next: On confirmation, spawn 4 parallel researcher subagents and begin evidence collection.
2026-03-25 00:30 local — scaling-laws (T4 inference/time-scale pass)
- Objective: Complete T4 on inference/test-time scaling and reasoning-time compute, scoped to 2023–2026.
- Changed: Wrote
notes/scaling-laws-research-inference.md; updatedoutputs/.plans/scaling-laws.mdto mark T4 done and log the inference-scaling verification pass. - Verified: Cross-read 13 primary/official sources covering Tree-of-Thoughts, PRMs, repeated sampling, compute-optimal test-time scaling, provable laws, o1, DeepSeek-R1, s1, verifier failures, Anthropic extended thinking, and OpenAI reasoning API docs.
- Failed / learned: OpenAI blog fetch for
learning-to-reason-with-llmsreturned malformed content, so the note leans on the o1 system card and API docs instead of that blog post. - Blockers: T2 and T5 remain open before final synthesis; no single unified law for inference-time scaling emerged from public sources.
- Next: Complete T5 implications synthesis, then reconcile T3/T4 with foundational T2 before drafting the cited brief.
2026-03-25 11:20 local — scaling-laws (T6 draft synthesis)
- Objective: Synthesize the four research notes into a single user-facing draft brief for the scaling-laws workflow.
- Changed: Wrote
outputs/.drafts/scaling-laws-draft.mdwith an executive summary, curated reading list, qualitative meta-analysis, core-paper comparison table, explicit training-vs-inference distinction, and numbered inline citations with direct-URL sources. - Verified: Cross-checked the draft against
notes/scaling-laws-research-foundations.md,notes/scaling-laws-research-revisions.md,notes/scaling-laws-research-inference.md, andnotes/scaling-laws-research-implications.mdto ensure the brief explicitly states the literature is too heterogeneous for a pooled effect-size estimate. - Failed / learned: The requested temp-run
context.mdandplan.mdwere absent, so the synthesis usedoutputs/.plans/scaling-laws.mdplus the four note files as the working context. - Blockers: Citation/claim verification pass still pending; this draft should be treated as pre-verification.
- Next: Run verifier/reviewer passes, then promote the draft into the final cited brief and provenance sidecar.
2026-03-25 11:28 local — scaling-laws (final brief + pdf)
- Objective: Deliver a paper guide and qualitative meta-analysis on AI scaling laws.
- Changed: Finalized
outputs/scaling-laws.mdand sidecaroutputs/scaling-laws.provenance.md; rendered preview PDF atoutputs/scaling-laws.pdf; updated plan ledger and verification log inoutputs/.plans/scaling-laws.md. - Verified: Ran a reviewer pass recorded in
notes/scaling-laws-verification.md; spot-checked key primary papers via alpha-backed reads for Kaplan 2020, Chinchilla 2022, and Snell 2024; confirmed PDF render output exists. - Failed / learned: A pooled statistical meta-analysis would be misleading because the literature mixes heterogeneous outcomes, scaling axes, and evaluation regimes; final deliverable uses a qualitative meta-analysis instead.
- Blockers: None for this brief.
- Next: If needed, extend into a narrower sub-survey (e.g. only pretraining laws, only inference-time scaling, or only post-Chinchilla data-quality revisions).
2026-03-25 14:52 local — skills-only-install
- Objective: Let users download the Feynman research skills without installing the full terminal runtime.
- Changed: Added standalone skills-only installers at
scripts/install/install-skills.shandscripts/install/install-skills.ps1; synced website-public copies; documented user-level and repo-local install flows inREADME.md,website/src/content/docs/getting-started/installation.md, andwebsite/src/pages/index.astro. - Verified: Ran
sh -n scripts/install/install-skills.sh; rannode scripts/sync-website-installers.mjs; rancd website && npm run build; executedsh scripts/install/install-skills.sh --dir <tmp>and confirmed extractedSKILL.mdfiles land in the target directory. - Failed / learned: PowerShell installer behavior was not executed locally because PowerShell is not installed in this environment.
- Blockers: None for the Unix installer flow; Windows remains syntax-only by inspection.
- Next: If users want this exposed more prominently, add a dedicated docs/reference page and a homepage-specific skills-only CTA instead of a text link.
2026-03-26 18:08 PDT — installer-release-unification
- Objective: Remove the moving
edgeinstaller channel and unify installs on tagged releases only. - Changed: Updated
scripts/install/install.sh,scripts/install/install.ps1,scripts/install/install-skills.sh, andscripts/install/install-skills.ps1so the default target is the latest tagged release, latest-version resolution uses public GitHub release pages instead ofapi.github.com, and explicitedgerequests now fail with a removal message; removed therelease-edgejob from.github/workflows/publish.yml; updatedREADME.mdandwebsite/src/content/docs/getting-started/installation.md; re-syncedwebsite/public/install*. - Verified: Ran
sh -non the Unix installer copies; confirmedsh scripts/install/install.sh edgeandsh scripts/install/install-skills.sh edge --dir <tmp>fail with the intended removal message; executedsh scripts/install/install.shinto temp dirs and confirmed the installed binary reports0.2.14; executedsh scripts/install/install-skills.sh --dir <tmp>and confirmed extractedSKILL.mdfiles; rancd website && npm run build. - Failed / learned: The install failure was caused by unauthenticated GitHub API rate limiting on the
edgepath, so renaming channels without removing the API dependency would not have fixed the root cause. - Blockers:
npm run buildstill emits a pre-existing duplicate-content warning forgetting-started/installation; the build succeeds. - Next: If desired, remove the now-unused
stablealias too and clean up the duplicate docs-content warning separately.
2026-03-27 11:58 PDT — release-0.2.15
- Objective: Make the non-Anthropic subagent/auth fixes and contributor-guide updates releasable to tagged-install users instead of leaving them only on
main. - Changed: Bumped the package version from
0.2.14to0.2.15inpackage.jsonandpackage-lock.json; updated pinned installer examples inREADME.mdandwebsite/src/content/docs/getting-started/installation.md; aligned the local-development docs example to the npm-based root workflow; addedCONTRIBUTING.mdplus the bundledskills/contributing/SKILL.md. - Verified: Confirmed the publish workflow keys off
package.jsonversus the currently published npm version; confirmed localnpm test,npm run typecheck, andnpm run buildpass before the release bump. - Failed / learned: The open subagent issue is fixed on
mainbut still user-visible on tagged installs until a fresh release is cut. - Blockers: Need the GitHub publish workflow to finish successfully before the issue can be honestly closed as released.
- Next: Push
0.2.15, monitor the publish workflow, then update and close the relevant GitHub issue/PR once the release is live.
2026-03-28 15:15 PDT — pi-subagents-agent-dir-compat
- Objective: Debug why tagged installs can still fail subagent/auth flows after
0.2.15when users are not on Anthropic. - Changed: Added
scripts/lib/pi-subagents-patch.mjsplus type declarations and wiredscripts/patch-embedded-pi.mjsto rewrite vendoredpi-subagentsruntime files so they resolve user-scoped paths fromPI_CODING_AGENT_DIRinstead of hardcoded~/.pi/agent; addedtests/pi-subagents-patch.test.ts. - Verified: Materialized
.feynman/npm, inspected the shippedpi-subagents@0.11.11sources, confirmed the hardcoded~/.pi/agentpaths inindex.ts,agents.ts,artifacts.ts,run-history.ts,skills.ts, andchain-clarify.ts; rannode scripts/patch-embedded-pi.mjs; rannpm test,npm run typecheck, andnpm run build. - Failed / learned: The earlier
0.2.15fix only proved that Feynman exportedPI_CODING_AGENT_DIRto the top-level Pi child; it did not cover vendored extension code that still hardcoded.pipaths internally. - Blockers: Users still need a release containing this patch before tagged installs benefit from it.
- Next: Cut the next release and verify a tagged install exercises subagents without reading from
~/.pi/agent.