Add LM Studio setup and blocked research artifacts

Add first-class LM Studio setup
Fix extension repair and add Opus 4.7 overlay
2026-04-16 15:39:01 -07:00 · 2026-04-16 15:34:32 -07:00 · 2026-04-16 14:05:17 -07:00 · 2026-04-15 23:15:27 -07:00 · 2026-04-15 22:53:38 -07:00 · 2026-04-15 22:50:04 -07:00
23 changed files with 381 additions and 63 deletions
--- a/.feynman/SYSTEM.md
+++ b/.feynman/SYSTEM.md
@@ -15,6 +15,8 @@ Operating rules:
 - Never answer a latest/current question from arXiv or alpha-backed paper search alone.
 - For AI model or product claims, prefer official docs/vendor pages plus recent web sources over old papers.
 - Use the installed Pi research packages for broader web/PDF access, document parsing, citation workflows, background processes, memory, session recall, and delegated subtasks when they reduce friction.
+- You are running inside the Feynman/Pi runtime with filesystem tools, package tools, and configured extensions. Do not claim you are only a static model, that you cannot write files, or that you cannot use tools unless you attempted the relevant tool and it failed.
+- If a tool, package, source, or network route is unavailable, record the specific failed capability and still write the requested durable artifact with a clear `Blocked / Unverified` status instead of stopping with chat-only prose.
 - Feynman ships project subagents for research work. Prefer the `researcher`, `writer`, `verifier`, and `reviewer` subagents for larger research tasks when decomposition clearly helps.
 - Use subagents when decomposition meaningfully reduces context pressure or lets you parallelize evidence gathering. For detached long-running work, prefer background subagent execution with `clarify: false, async: true`.
 - For deep research, act like a lead researcher by default: plan first, use hidden worker batches only when breadth justifies them, synthesize batch results, and finish with a verification pass.
@@ -24,6 +26,8 @@ Operating rules:
 - Do not force chain-shaped orchestration onto the user. Multi-agent decomposition is an internal tactic, not the primary UX.
 - For AI research artifacts, default to pressure-testing the work before polishing it. Use review-style workflows to check novelty positioning, evaluation design, baseline fairness, ablations, reproducibility, and likely reviewer objections.
 - Do not say `verified`, `confirmed`, `checked`, or `reproduced` unless you actually performed the check and can point to the supporting source, artifact, or command output.
+- Never invent or fabricate experimental results, scores, datasets, sample sizes, ablations, benchmark tables, figures, images, charts, or quantitative comparisons. If the user asks for a paper, report, draft, figure, or result and the underlying data is missing, write a clearly labeled placeholder such as `No experimental results are available yet` or `TODO: run experiment`.
+- Every quantitative result, figure, table, chart, image, or benchmark claim must trace to at least one explicit source URL, research note, raw artifact path, or script/command output. If provenance is missing, omit the claim or mark it as a planned measurement instead of presenting it as fact.
 - When a task involves calculations, code, or quantitative outputs, define the minimal test or oracle set before implementation and record the results of those checks before delivery.
 - If a plot, number, or conclusion looks cleaner than expected, assume it may be wrong until it survives explicit checks. Never smooth curves, drop inconvenient variations, or tune presentation-only outputs without stating that choice.
 - When a verification pass finds one issue, continue searching for others. Do not stop after the first error unless the whole branch is blocked.
@@ -42,6 +46,7 @@ Operating rules:
 - When citing papers from alpha-backed tools, prefer direct arXiv or alphaXiv links and include the arXiv ID.
 - Default toward delivering a concrete artifact when the task naturally calls for one: reading list, memo, audit, experiment log, or draft.
 - For user-facing workflows, produce exactly one canonical durable Markdown artifact unless the user explicitly asks for multiple deliverables.
+- If a workflow requests a durable artifact, verify the file exists on disk before the final response. If complete evidence is unavailable, save a partial artifact that explicitly marks missing checks as `blocked`, `unverified`, or `not run`.
 - Do not create extra user-facing intermediate markdown files just because the workflow has multiple reasoning stages.
 - Treat HTML/PDF preview outputs as temporary render artifacts, not as the canonical saved result.
 - Intermediate task files, raw logs, and verification notes are allowed when they materially reduce context pressure or improve auditability.
--- a/.feynman/agents/verifier.md
+++ b/.feynman/agents/verifier.md
@@ -17,6 +17,7 @@ You receive a draft document and the research files it was built from. Your job
 4. **Remove unsourced claims** — if a factual claim in the draft cannot be traced to any source in the research files, either find a source for it or remove it. Do not leave unsourced factual claims.
 5. **Verify meaning, not just topic overlap.** A citation is valid only if the source actually supports the specific number, quote, or conclusion attached to it.
 6. **Refuse fake certainty.** Do not use words like `verified`, `confirmed`, or `reproduced` unless the draft already contains or the research files provide the underlying evidence.
+7. **Enforce the system prompt's provenance rule.** Unsupported results, figures, charts, tables, benchmarks, and quantitative claims must be removed or converted to TODOs.

 ## Citation rules

@@ -37,8 +38,21 @@ For each source URL:
 For code-backed or quantitative claims:
 - Keep the claim only if the supporting artifact is present in the research files or clearly documented in the draft.
 - If a figure, table, benchmark, or computed result lacks a traceable source or artifact path, weaken or remove the claim rather than guessing.
+- Treat captions such as “illustrative,” “simulated,” “representative,” or “example” as insufficient unless the user explicitly requested synthetic/example data. Otherwise remove the visual and mark the missing experiment.
 - Do not preserve polished summaries that outrun the raw evidence.

+## Result provenance audit
+
+Before saving the final document, scan for:
+- numeric scores or percentages,
+- benchmark names and tables,
+- figure/image references,
+- claims of improvement or superiority,
+- dataset sizes or experimental setup details,
+- charts or visualizations.
+
+For each item, verify that it maps to a source URL, research note, raw artifact path, or script path. If not, remove it or replace it with a TODO. Add a short `Removed Unsupported Claims` section only when you remove material.
+
 ## Output contract
 - Save to the output path specified by the parent (default: `cited.md`).
 - The output is the complete final document — same structure as the input draft, but with inline citations added throughout and a verified Sources section.
--- a/.feynman/agents/writer.md
+++ b/.feynman/agents/writer.md
@@ -15,6 +15,7 @@ You are Feynman's writing subagent.
 3. **Be explicit about gaps.** If the research files have unresolved questions or conflicting evidence, surface them — do not paper over them.
 4. **Do not promote draft text into fact.** If a result is tentative, inferred, or awaiting verification, label it that way in the prose.
 5. **No aesthetic laundering.** Do not make plots, tables, or summaries look cleaner than the underlying evidence justifies.
+6. **Follow the system prompt's provenance rule.** Missing results become gaps or TODOs, never plausible-looking data.

 ## Output structure

@@ -36,9 +37,10 @@ Unresolved issues, disagreements between sources, gaps in evidence.

 ## Visuals
 - When the research contains quantitative data (benchmarks, comparisons, trends over time), generate charts using the `pi-charts` package to embed them in the draft.
- When explaining architectures, pipelines, or multi-step processes, use Mermaid diagrams.
- When a comparison across multiple dimensions would benefit from an interactive view, use `pi-generative-ui`.
- Every visual must have a descriptive caption and reference the data it's based on.
+- Do not create charts from invented or example data. If values are missing, describe the planned measurement instead.
+- When explaining architectures, pipelines, or multi-step processes, use Mermaid diagrams only when the structure is supported by the supplied evidence.
+- When a comparison across multiple dimensions would benefit from an interactive view, use `pi-generative-ui` only for source-backed data.
+- Every visual must have a descriptive caption and reference the data, source URL, research file, raw artifact, or script it is based on.
 - Do not add visuals for decoration — only when they materially improve understanding of the evidence.

 ## Operating rules
@@ -48,6 +50,7 @@ Unresolved issues, disagreements between sources, gaps in evidence.
 - Do NOT add inline citations — the verifier agent handles that as a separate post-processing step.
 - Do NOT add a Sources section — the verifier agent builds that.
 - Before finishing, do a claim sweep: every strong factual statement in the draft should have an obvious source home in the research files.
+- Before finishing, do a result-provenance sweep for numeric results, figures, charts, benchmarks, tables, and images.

 ## Output contract
 - Save the main artifact to the specified output path (default: `draft.md`).
--- a/.github/workflows/publish.yml
+++ b/.github/workflows/publish.yml
@@ -5,62 +5,64 @@ env:

 on:
  push:
-    tags:
-      - "v*"
+    branches: [main]
  workflow_dispatch:
-    inputs:
-      tag:
-        description: Existing git tag to publish and release (for example: v0.2.18)
-        required: true
-        type: string

 jobs:
-  verify:
+  version-check:
    runs-on: ubuntu-latest
    permissions:
      contents: read
    outputs:
-      tag: ${{ steps.meta.outputs.tag }}
-      version: ${{ steps.meta.outputs.version }}
+      version: ${{ steps.version.outputs.version }}
+      should_release: ${{ steps.version.outputs.should_release }}
    steps:
-      - name: Resolve release metadata
-        id: meta
+      - uses: actions/checkout@v6
+      - uses: actions/setup-node@v6
+        with:
+          node-version: 24
+          registry-url: "https://registry.npmjs.org"
+      - id: version
        shell: bash
        env:
-          INPUT_TAG: ${{ inputs.tag }}
-          REF_NAME: ${{ github.ref_name }}
+          GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
        run: |
-          TAG="${INPUT_TAG:-$REF_NAME}"
-          VERSION="${TAG#v}"
-          echo "tag=$TAG" >> "$GITHUB_OUTPUT"
-          echo "version=$VERSION" >> "$GITHUB_OUTPUT"
+          LOCAL=$(node -p "require('./package.json').version")
+          echo "version=$LOCAL" >> "$GITHUB_OUTPUT"
+          PUBLISHED=$(npm view @companion-ai/feynman version 2>/dev/null || true)
+          if [ "$PUBLISHED" = "$LOCAL" ] || gh release view "v$LOCAL" >/dev/null 2>&1; then
+            echo "should_release=false" >> "$GITHUB_OUTPUT"
+          else
+            echo "should_release=true" >> "$GITHUB_OUTPUT"
+          fi
+
+  verify:
+    needs: version-check
+    if: needs.version-check.outputs.should_release == 'true'
+    runs-on: ubuntu-latest
+    permissions:
+      contents: read
+    steps:
      - uses: actions/checkout@v6
-        with:
-          ref: refs/tags/${{ steps.meta.outputs.tag }}
      - uses: actions/setup-node@v6
        with:
          node-version: 24
          registry-url: "https://registry.npmjs.org"
      - run: npm ci
-      - name: Verify package version matches tag
-        shell: bash
-        run: |
-          ACTUAL="$(node -p "require('./package.json').version")"
-          EXPECTED="${{ steps.meta.outputs.version }}"
-          test "$ACTUAL" = "$EXPECTED"
      - run: npm test
      - run: npm pack

  publish-npm:
-    needs: verify
+    needs:
+      - version-check
+      - verify
+    if: needs.version-check.outputs.should_release == 'true' && needs.verify.result == 'success'
    runs-on: ubuntu-latest
    permissions:
      contents: read
      id-token: write
    steps:
      - uses: actions/checkout@v6
-        with:
-          ref: refs/tags/${{ needs.verify.outputs.tag }}
      - uses: actions/setup-node@v6
        with:
          node-version: 24
@@ -69,7 +71,8 @@ jobs:
      - run: npm publish --provenance --access public

  build-native-bundles:
-    needs: verify
+    needs: version-check
+    if: needs.version-check.outputs.should_release == 'true'
    strategy:
      fail-fast: false
      matrix:
@@ -87,8 +90,6 @@ jobs:
      contents: read
    steps:
      - uses: actions/checkout@v6
-        with:
-          ref: refs/tags/${{ needs.verify.outputs.tag }}
      - uses: actions/setup-node@v6
        with:
          node-version: 24
@@ -121,8 +122,10 @@ jobs:

  release-github:
    needs:
+      - version-check
      - publish-npm
      - build-native-bundles
+    if: needs.version-check.outputs.should_release == 'true' && needs.publish-npm.result == 'success' && needs.build-native-bundles.result == 'success'
    runs-on: ubuntu-latest
    permissions:
      contents: write
@@ -136,17 +139,18 @@ jobs:
        env:
          GH_REPO: ${{ github.repository }}
          GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
-          TAG: ${{ needs.verify.outputs.tag }}
+          VERSION: ${{ needs.version-check.outputs.version }}
        run: |
-          if gh release view "$TAG" >/dev/null 2>&1; then
-            gh release upload "$TAG" release-assets/* --clobber
-            gh release edit "$TAG" \
-              --title "$TAG" \
+          if gh release view "v$VERSION" >/dev/null 2>&1; then
+            gh release upload "v$VERSION" release-assets/* --clobber
+            gh release edit "v$VERSION" \
+              --title "v$VERSION" \
              --notes "Standalone Feynman bundles for native installation." \
              --draft=false \
              --latest
          else
-            gh release create "$TAG" release-assets/* \
-              --title "$TAG" \
-              --notes "Standalone Feynman bundles for native installation."
+            gh release create "v$VERSION" release-assets/* \
+              --title "v$VERSION" \
+              --notes "Standalone Feynman bundles for native installation." \
+              --target "$GITHUB_SHA"
          fi
--- a/README.md
+++ b/README.md
@@ -25,7 +25,7 @@ curl -fsSL https://feynman.is/install | bash
 irm https://feynman.is/install.ps1 | iex
 ```

-The one-line installer fetches the latest tagged release. To pin a version, pass it explicitly, for example `curl -fsSL https://feynman.is/install | bash -s -- 0.2.18`.
+The one-line installer fetches the latest tagged release. To pin a version, pass it explicitly, for example `curl -fsSL https://feynman.is/install | bash -s -- 0.2.23`.

 The installer downloads a standalone native bundle with its own Node.js runtime.

@@ -33,7 +33,7 @@ To upgrade the standalone app later, rerun the installer. `feynman update` only

 To uninstall the standalone app, remove the launcher and runtime bundle, then optionally remove `~/.feynman` if you also want to delete settings, sessions, and installed package state. If you also want to delete alphaXiv login state, remove `~/.ahub`. See the installation guide for platform-specific paths.

-Local models are supported through the custom-provider flow. For Ollama, run `feynman setup`, choose `Custom provider (baseUrl + API key)`, use `openai-completions`, and point it at `http://localhost:11434/v1`.
+Local models are supported through the setup flow. For LM Studio, run `feynman setup`, choose `LM Studio`, and keep the default `http://localhost:1234/v1` unless you changed the server port. For Ollama or vLLM, choose `Custom provider (baseUrl + API key)`, use `openai-completions`, and point it at the local `/v1` endpoint.

 ### Skills Only

@@ -142,6 +142,18 @@ Built on [Pi](https://github.com/badlogic/pi-mono) for the agent runtime, [alpha

 ---

+### Star History
+
+<a href="https://www.star-history.com/?repos=getcompanion-ai%2Ffeynman&type=date&legend=top-left">
+  <picture>
+    <source media="(prefers-color-scheme: dark)" srcset="https://api.star-history.com/chart?repos=getcompanion-ai/feynman&type=date&theme=dark&legend=top-left" />
+    <source media="(prefers-color-scheme: light)" srcset="https://api.star-history.com/chart?repos=getcompanion-ai/feynman&type=date&legend=top-left" />
+    <img alt="Star History Chart" src="https://api.star-history.com/chart?repos=getcompanion-ai/feynman&type=date&legend=top-left" />
+  </picture>
+</a>
+
+---
+
 ### Contributing

 See [CONTRIBUTING.md](CONTRIBUTING.md) for the full contributor guide.
--- a/package-lock.json
+++ b/package-lock.json
@@ -1,12 +1,12 @@
 {
  "name": "@companion-ai/feynman",
-  "version": "0.2.18",
+  "version": "0.2.23",
  "lockfileVersion": 3,
  "requires": true,
  "packages": {
    "": {
      "name": "@companion-ai/feynman",
-      "version": "0.2.18",
+      "version": "0.2.23",
      "hasInstallScript": true,
      "license": "MIT",
      "dependencies": {
--- a/package.json
+++ b/package.json
@@ -1,6 +1,6 @@
 {
  "name": "@companion-ai/feynman",
-  "version": "0.2.18",
+  "version": "0.2.23",
  "description": "Research-first CLI agent built on Pi and alphaXiv",
  "license": "MIT",
  "type": "module",
--- a/prompts/deepresearch.md
+++ b/prompts/deepresearch.md
@@ -53,6 +53,8 @@ Also save the plan with `memory_remember` (type: `fact`, key: `deepresearch.<slu

 Present the plan to the user. If this is an unattended or one-shot run, continue automatically. If the user is actively interacting in the terminal, give them a brief chance to request plan changes before proceeding.

+Do not stop after planning. If live search, subagents, web access, alphaXiv, or any other capability is unavailable, continue in degraded mode and write a durable blocked/partial report that records exactly which capabilities failed.
+
 ## 2. Scale decision

 | Query type | Execution |
@@ -105,6 +107,13 @@ When the work spans multiple rounds, also append a concise chronological entry t

 Most topics need 1-2 rounds. Stop when additional rounds would not materially change conclusions.

+If no researcher files can be produced because tools, subagents, or network access failed, create `outputs/.drafts/<slug>-draft.md` yourself as a blocked report with:
+- what was requested,
+- which capabilities failed,
+- what evidence was and was not gathered,
+- a proposed source-gathering plan,
+- no invented sources or results.
+
 ## 5. Write the report

 Once evidence is sufficient, YOU write the full research brief directly. Do not delegate writing to another agent. Read the research files, synthesize the findings, and produce a complete document:
@@ -190,6 +199,7 @@ Before you stop, verify on disk that all of these exist:
 - `outputs/<slug>.provenance.md` or `papers/<slug>.provenance.md` provenance sidecar

 Do not stop at `<slug>-brief.md` alone. If the cited brief exists but the promoted final output or provenance sidecar does not, create them before responding.
+If full verification could not be completed, still create the final deliverable and provenance sidecar with `Verification: BLOCKED` or `PASS WITH NOTES` and list the missing checks. Never end with only an explanation in chat.

 ## Background execution

--- a/prompts/draft.md
+++ b/prompts/draft.md
@@ -13,7 +13,8 @@ Requirements:
 - Use the `writer` subagent when the draft should be produced from already-collected notes, then use the `verifier` subagent to add inline citations and verify sources.
 - Include at minimum: title, abstract, problem statement, related work, method or synthesis, evidence or experiments, limitations, conclusion.
 - Use clean Markdown with LaTeX where equations materially help.
- Generate charts with `pi-charts` for quantitative data, benchmarks, and comparisons. Use Mermaid for architectures and pipelines. Every figure needs a caption.
+- Follow the system prompt's provenance rules for all results, figures, charts, images, tables, benchmarks, and quantitative comparisons. If evidence is missing, leave a placeholder or proposed experimental plan instead of claiming an outcome.
+- Generate charts with `pi-charts` only for source-backed quantitative data, benchmarks, and comparisons. Use Mermaid for architectures and pipelines only when the structure is supported by sources. Every figure needs a provenance-bearing caption.
 - Before delivery, sweep the draft for any claim that sounds stronger than its support. Mark tentative results as tentative and remove unsupported numerics instead of letting the verifier discover them later.
 - Save exactly one draft to `papers/<slug>.md`.
 - End with a `Sources` appendix with direct URLs for all primary references.
--- a/scripts/install/install.ps1
+++ b/scripts/install/install.ps1
@@ -110,7 +110,7 @@ This usually means the release exists, but not all platform bundles were uploade
 Workarounds:
  - try again after the release finishes publishing
  - pass the latest published version explicitly, e.g.:
-    & ([scriptblock]::Create((irm https://feynman.is/install.ps1))) -Version 0.2.18
+    & ([scriptblock]::Create((irm https://feynman.is/install.ps1))) -Version 0.2.23
 "@
  }

--- a/scripts/install/install.sh
+++ b/scripts/install/install.sh
@@ -261,7 +261,7 @@ This usually means the release exists, but not all platform bundles were uploade
 Workarounds:
  - try again after the release finishes publishing
  - pass the latest published version explicitly, e.g.:
-    curl -fsSL https://feynman.is/install | bash -s -- 0.2.18
+    curl -fsSL https://feynman.is/install | bash -s -- 0.2.23
 EOF
  exit 1
 fi
--- a/scripts/patch-embedded-pi.mjs
+++ b/scripts/patch-embedded-pi.mjs
@@ -260,6 +260,23 @@ function ensureParentDir(path) {
 	mkdirSync(dirname(path), { recursive: true });
 }

+function packageDependencyExists(packagePath, globalNodeModulesRoot, dependency) {
+	return existsSync(resolve(packagePath, "node_modules", dependency)) ||
+		existsSync(resolve(globalNodeModulesRoot, dependency));
+}
+
+function installedPackageLooksUsable(packagePath, globalNodeModulesRoot) {
+	if (!existsSync(resolve(packagePath, "package.json"))) return false;
+	try {
+		const pkg = JSON.parse(readFileSync(resolve(packagePath, "package.json"), "utf8"));
+		return Object.keys(pkg.dependencies ?? {}).every((dependency) =>
+			packageDependencyExists(packagePath, globalNodeModulesRoot, dependency)
+		);
+	} catch {
+		return false;
+	}
+}
+
 function linkPointsTo(linkPath, targetPath) {
 	try {
 		if (!lstatSync(linkPath).isSymbolicLink()) return false;
@@ -281,6 +298,8 @@ function ensureBundledPackageLinks(packageSpecs) {
 		try {
 			if (lstatSync(targetPath).isSymbolicLink()) {
 				rmSync(targetPath, { force: true });
+			} else if (!installedPackageLooksUsable(targetPath, globalNodeModulesRoot)) {
+				rmSync(targetPath, { recursive: true, force: true });
 			}
 		} catch {}
 		if (existsSync(targetPath)) continue;
--- a/src/model/commands.ts
+++ b/src/model/commands.ts
@@ -83,6 +83,7 @@ const API_KEY_PROVIDERS: ApiKeyProviderInfo[] = [
 	{ id: "openai", label: "OpenAI Platform API", envVar: "OPENAI_API_KEY" },
 	{ id: "anthropic", label: "Anthropic API", envVar: "ANTHROPIC_API_KEY" },
 	{ id: "google", label: "Google Gemini API", envVar: "GEMINI_API_KEY" },
+	{ id: "lm-studio", label: "LM Studio (local OpenAI-compatible server)" },
 	{ id: "__custom__", label: "Custom provider (local/self-hosted/proxy)" },
 	{ id: "amazon-bedrock", label: "Amazon Bedrock (AWS credential chain)" },
 	{ id: "openrouter", label: "OpenRouter", envVar: "OPENROUTER_API_KEY" },
@@ -132,6 +133,8 @@ async function selectApiKeyProvider(): Promise<ApiKeyProviderInfo | undefined> {
 		label: provider.label,
 		hint: provider.id === "__custom__"
 			? "Ollama, vLLM, LM Studio, proxies"
+			: provider.id === "lm-studio"
+				? "http://localhost:1234/v1"
 			: provider.envVar ?? provider.id,
 	}));
 	options.push({ value: "cancel", label: "Cancel" });
@@ -362,6 +365,44 @@ async function promptCustomProviderSetup(): Promise<CustomProviderSetup | undefi
 	return { providerId, modelIds, baseUrl, api, apiKeyConfig, authHeader };
 }

+async function promptLmStudioProviderSetup(): Promise<CustomProviderSetup | undefined> {
+	printSection("LM Studio");
+	printInfo("Start the LM Studio local server first, then load a model.");
+
+	const baseUrlRaw = await promptText("Base URL", "http://localhost:1234/v1");
+	const { baseUrl } = normalizeCustomProviderBaseUrl("openai-completions", baseUrlRaw);
+	if (!baseUrl) {
+		printWarning("Base URL is required.");
+		return undefined;
+	}
+
+	const detectedModelIds = await bestEffortFetchOpenAiModelIds(baseUrl, "lm-studio", false);
+	let modelIdsDefault = "local-model";
+	if (detectedModelIds && detectedModelIds.length > 0) {
+		const sample = detectedModelIds.slice(0, 10).join(", ");
+		printInfo(`Detected LM Studio models: ${sample}${detectedModelIds.length > 10 ? ", ..." : ""}`);
+		modelIdsDefault = detectedModelIds[0]!;
+	} else {
+		printInfo("No models detected from /models. Enter the exact model id shown in LM Studio.");
+	}
+
+	const modelIdsRaw = await promptText("Model id(s) (comma-separated)", modelIdsDefault);
+	const modelIds = normalizeModelIds(modelIdsRaw);
+	if (modelIds.length === 0) {
+		printWarning("At least one model id is required.");
+		return undefined;
+	}
+
+	return {
+		providerId: "lm-studio",
+		modelIds,
+		baseUrl,
+		api: "openai-completions",
+		apiKeyConfig: "lm-studio",
+		authHeader: false,
+	};
+}
+
 async function verifyCustomProvider(setup: CustomProviderSetup, authPath: string): Promise<void> {
 	const registry = createModelRegistry(authPath);
 	const modelsError = registry.getError();
@@ -548,6 +589,31 @@ async function configureApiKeyProvider(authPath: string, providerId?: string): P
 		return configureBedrockProvider(authPath);
 	}

+	if (provider.id === "lm-studio") {
+		const setup = await promptLmStudioProviderSetup();
+		if (!setup) {
+			printInfo("LM Studio setup cancelled.");
+			return false;
+		}
+
+		const modelsJsonPath = getModelsJsonPath(authPath);
+		const result = upsertProviderConfig(modelsJsonPath, setup.providerId, {
+			baseUrl: setup.baseUrl,
+			apiKey: setup.apiKeyConfig,
+			api: setup.api,
+			authHeader: setup.authHeader,
+			models: setup.modelIds.map((id) => ({ id })),
+		});
+		if (!result.ok) {
+			printWarning(result.error);
+			return false;
+		}
+
+		printSuccess("Saved LM Studio provider.");
+		await verifyCustomProvider(setup, authPath);
+		return true;
+	}
+
 	if (provider.id === "__custom__") {
 		const setup = await promptCustomProviderSetup();
 		if (!setup) {
--- a/src/model/registry.ts
+++ b/src/model/registry.ts
@@ -1,11 +1,41 @@
 import { dirname, resolve } from "node:path";

 import { AuthStorage, ModelRegistry } from "@mariozechner/pi-coding-agent";
+import { getModels } from "@mariozechner/pi-ai";
+import { anthropicOAuthProvider } from "@mariozechner/pi-ai/oauth";

 export function getModelsJsonPath(authPath: string): string {
 	return resolve(dirname(authPath), "models.json");
 }

-export function createModelRegistry(authPath: string): ModelRegistry {
-	return ModelRegistry.create(AuthStorage.create(authPath), getModelsJsonPath(authPath));
+function registerFeynmanModelOverlays(modelRegistry: ModelRegistry): void {
+	const anthropicModels = getModels("anthropic");
+	if (anthropicModels.some((model) => model.id === "claude-opus-4-7")) {
+		return;
+	}
+
+	const opus46 = anthropicModels.find((model) => model.id === "claude-opus-4-6");
+	if (!opus46) {
+		return;
+	}
+
+	modelRegistry.registerProvider("anthropic", {
+		baseUrl: "https://api.anthropic.com",
+		api: "anthropic-messages",
+		oauth: anthropicOAuthProvider,
+		models: [
+			...anthropicModels,
+			{
+				...opus46,
+				id: "claude-opus-4-7",
+				name: "Claude Opus 4.7",
+			},
+		],
+	});
+}
+
+export function createModelRegistry(authPath: string): ModelRegistry {
+	const registry = ModelRegistry.create(AuthStorage.create(authPath), getModelsJsonPath(authPath));
+	registerFeynmanModelOverlays(registry);
+	return registry;
 }
--- a/src/pi/package-ops.ts
+++ b/src/pi/package-ops.ts
@@ -1,5 +1,5 @@
 import { spawn } from "node:child_process";
-import { cpSync, existsSync, lstatSync, mkdirSync, readlinkSync, rmSync, symlinkSync, writeFileSync } from "node:fs";
+import { cpSync, existsSync, lstatSync, mkdirSync, readFileSync, readlinkSync, rmSync, symlinkSync, writeFileSync } from "node:fs";
 import { fileURLToPath } from "node:url";
 import { dirname, join, resolve } from "node:path";

@@ -423,6 +423,47 @@ function linkDirectory(linkPath: string, targetPath: string): void {
 	}
 }

+function packageNameToPath(root: string, packageName: string): string {
+	return resolve(root, packageName);
+}
+
+function packageDependencyExists(packagePath: string, globalNodeModulesRoot: string, dependency: string): boolean {
+	return existsSync(packageNameToPath(resolve(packagePath, "node_modules"), dependency)) ||
+		existsSync(packageNameToPath(globalNodeModulesRoot, dependency));
+}
+
+function installedPackageLooksUsable(packagePath: string, globalNodeModulesRoot: string): boolean {
+	if (!existsSync(resolve(packagePath, "package.json"))) {
+		return false;
+	}
+
+	try {
+		const pkg = JSON.parse(readFileSync(resolve(packagePath, "package.json"), "utf8")) as {
+			dependencies?: Record<string, string>;
+		};
+		const dependencies = Object.keys(pkg.dependencies ?? {});
+		return dependencies.every((dependency) => packageDependencyExists(packagePath, globalNodeModulesRoot, dependency));
+	} catch {
+		return false;
+	}
+}
+
+function replaceBrokenPackageWithBundledCopy(targetPath: string, bundledPackagePath: string, globalNodeModulesRoot: string): boolean {
+	if (!existsSync(targetPath)) {
+		return false;
+	}
+	if (pathsMatchSymlinkTarget(targetPath, bundledPackagePath)) {
+		return false;
+	}
+	if (installedPackageLooksUsable(targetPath, globalNodeModulesRoot)) {
+		return false;
+	}
+
+	rmSync(targetPath, { recursive: true, force: true });
+	linkDirectory(targetPath, bundledPackagePath);
+	return true;
+}
+
 export function seedBundledWorkspacePackages(
 	agentDir: string,
 	appRoot: string,
@@ -446,6 +487,10 @@ export function seedBundledWorkspacePackages(
 		if (!existsSync(bundledPackagePath)) continue;

 		const targetPath = resolve(globalNodeModulesRoot, parsed.name);
+		if (replaceBrokenPackageWithBundledCopy(targetPath, bundledPackagePath, globalNodeModulesRoot)) {
+			seeded.push(source);
+			continue;
+		}
 		if (!existsSync(targetPath)) {
 			linkDirectory(targetPath, bundledPackagePath);
 			seeded.push(source);
--- a/tests/content-policy.test.ts
+++ b/tests/content-policy.test.ts
@@ -30,3 +30,42 @@ test("bundled prompts and skills do not contain blocked promotional product cont
 		}
 	}
 });
+
+test("research writing prompts forbid fabricated results and unproven figures", () => {
+	const draftPrompt = readFileSync(join(repoRoot, "prompts", "draft.md"), "utf8");
+	const systemPrompt = readFileSync(join(repoRoot, ".feynman", "SYSTEM.md"), "utf8");
+	const writerPrompt = readFileSync(join(repoRoot, ".feynman", "agents", "writer.md"), "utf8");
+	const verifierPrompt = readFileSync(join(repoRoot, ".feynman", "agents", "verifier.md"), "utf8");
+
+	for (const [label, content] of [
+		["system prompt", systemPrompt],
+	] as const) {
+		assert.match(content, /Never (invent|fabricate)/i, `${label} must explicitly forbid invented or fabricated results`);
+		assert.match(content, /(figure|chart|image|table)/i, `${label} must cover visual/table provenance`);
+		assert.match(content, /(provenance|source|artifact|script|raw)/i, `${label} must require traceable support`);
+	}
+
+	for (const [label, content] of [
+		["writer prompt", writerPrompt],
+		["verifier prompt", verifierPrompt],
+		["draft prompt", draftPrompt],
+	] as const) {
+		assert.match(content, /system prompt.*provenance rule/i, `${label} must point back to the system provenance rule`);
+	}
+
+	assert.match(draftPrompt, /system prompt's provenance rules/i);
+	assert.match(draftPrompt, /placeholder or proposed experimental plan/i);
+	assert.match(draftPrompt, /source-backed quantitative data/i);
+});
+
+test("deepresearch workflow requires durable artifacts even when blocked", () => {
+	const systemPrompt = readFileSync(join(repoRoot, ".feynman", "SYSTEM.md"), "utf8");
+	const deepResearchPrompt = readFileSync(join(repoRoot, "prompts", "deepresearch.md"), "utf8");
+
+	assert.match(systemPrompt, /Do not claim you are only a static model/i);
+	assert.match(systemPrompt, /write the requested durable artifact/i);
+	assert.match(deepResearchPrompt, /Do not stop after planning/i);
+	assert.match(deepResearchPrompt, /degraded mode/i);
+	assert.match(deepResearchPrompt, /Verification: BLOCKED/i);
+	assert.match(deepResearchPrompt, /Never end with only an explanation in chat/i);
+});
--- a/tests/model-harness.test.ts
+++ b/tests/model-harness.test.ts
@@ -7,6 +7,7 @@ import { join } from "node:path";
 import { resolveInitialPrompt, shouldRunInteractiveSetup } from "../src/cli.js";
 import { buildModelStatusSnapshotFromRecords, chooseRecommendedModel } from "../src/model/catalog.js";
 import { resolveModelProviderForCommand, setDefaultModelSpec } from "../src/model/commands.js";
+import { createModelRegistry } from "../src/model/registry.js";

 function createAuthPath(contents: Record<string, unknown>): string {
 	const root = mkdtempSync(join(tmpdir(), "feynman-auth-"));
@@ -26,6 +27,17 @@ test("chooseRecommendedModel prefers the strongest authenticated research model"
 	assert.equal(recommendation?.spec, "anthropic/claude-opus-4-6");
 });

+test("createModelRegistry overlays new Anthropic Opus model before upstream Pi updates", () => {
+	const authPath = createAuthPath({
+		anthropic: { type: "api_key", key: "anthropic-test-key" },
+	});
+
+	const registry = createModelRegistry(authPath);
+
+	assert.ok(registry.find("anthropic", "claude-opus-4-7"));
+	assert.equal(registry.getAvailable().some((model) => model.provider === "anthropic" && model.id === "claude-opus-4-7"), true);
+});
+
 test("setDefaultModelSpec accepts a unique bare model id from authenticated models", () => {
 	const authPath = createAuthPath({
 		openai: { type: "api_key", key: "openai-test-key" },
@@ -67,6 +79,15 @@ test("resolveModelProviderForCommand falls back to API-key providers when OAuth
 	assert.equal(resolved?.id, "google");
 });

+test("resolveModelProviderForCommand supports LM Studio as a first-class local provider", () => {
+	const authPath = createAuthPath({});
+
+	const resolved = resolveModelProviderForCommand(authPath, "lm-studio");
+
+	assert.equal(resolved?.kind, "api-key");
+	assert.equal(resolved?.id, "lm-studio");
+});
+
 test("resolveModelProviderForCommand prefers OAuth when a provider supports both auth modes", () => {
 	const authPath = createAuthPath({});

--- a/tests/package-ops.test.ts
+++ b/tests/package-ops.test.ts
@@ -6,13 +6,17 @@ import { join, resolve } from "node:path";

 import { installPackageSources, seedBundledWorkspacePackages, updateConfiguredPackages } from "../src/pi/package-ops.js";

-function createBundledWorkspace(appRoot: string, packageNames: string[]): void {
+function createBundledWorkspace(
+	appRoot: string,
+	packageNames: string[],
+	dependenciesByPackage: Record<string, Record<string, string>> = {},
+): void {
 	for (const packageName of packageNames) {
 		const packageDir = resolve(appRoot, ".feynman", "npm", "node_modules", packageName);
 		mkdirSync(packageDir, { recursive: true });
 		writeFileSync(
 			join(packageDir, "package.json"),
-			JSON.stringify({ name: packageName, version: "1.0.0" }, null, 2) + "\n",
+			JSON.stringify({ name: packageName, version: "1.0.0", dependencies: dependenciesByPackage[packageName] }, null, 2) + "\n",
 			"utf8",
 		);
 	}
@@ -76,6 +80,33 @@ test("seedBundledWorkspacePackages preserves existing installed packages", () =>
 	assert.equal(lstatSync(existingPackageDir).isSymbolicLink(), false);
 });

+test("seedBundledWorkspacePackages repairs broken existing bundled packages", () => {
+	const appRoot = mkdtempSync(join(tmpdir(), "feynman-bundle-"));
+	const homeRoot = mkdtempSync(join(tmpdir(), "feynman-home-"));
+	const agentDir = resolve(homeRoot, "agent");
+	const existingPackageDir = resolve(homeRoot, "npm-global", "lib", "node_modules", "pi-markdown-preview");
+
+	mkdirSync(agentDir, { recursive: true });
+	createBundledWorkspace(appRoot, ["pi-markdown-preview", "puppeteer-core"], {
+		"pi-markdown-preview": { "puppeteer-core": "^24.0.0" },
+	});
+	mkdirSync(existingPackageDir, { recursive: true });
+	writeFileSync(
+		resolve(existingPackageDir, "package.json"),
+		JSON.stringify({ name: "pi-markdown-preview", version: "broken", dependencies: { "puppeteer-core": "^24.0.0" } }) + "\n",
+		"utf8",
+	);
+
+	const seeded = seedBundledWorkspacePackages(agentDir, appRoot, ["npm:pi-markdown-preview"]);
+
+	assert.deepEqual(seeded, ["npm:pi-markdown-preview"]);
+	assert.equal(lstatSync(existingPackageDir).isSymbolicLink(), true);
+	assert.equal(
+		readFileSync(resolve(existingPackageDir, "package.json"), "utf8").includes('"version": "1.0.0"'),
+		true,
+	);
+});
+
 test("installPackageSources filters noisy npm chatter but preserves meaningful output", async () => {
 	const root = mkdtempSync(join(tmpdir(), "feynman-package-ops-"));
 	const workingDir = resolve(root, "project");
--- a/website/public/install
+++ b/website/public/install
@@ -261,7 +261,7 @@ This usually means the release exists, but not all platform bundles were uploade
 Workarounds:
  - try again after the release finishes publishing
  - pass the latest published version explicitly, e.g.:
-    curl -fsSL https://feynman.is/install | bash -s -- 0.2.18
+    curl -fsSL https://feynman.is/install | bash -s -- 0.2.23
 EOF
  exit 1
 fi
--- a/website/public/install.ps1
+++ b/website/public/install.ps1
@@ -110,7 +110,7 @@ This usually means the release exists, but not all platform bundles were uploade
 Workarounds:
  - try again after the release finishes publishing
  - pass the latest published version explicitly, e.g.:
-    & ([scriptblock]::Create((irm https://feynman.is/install.ps1))) -Version 0.2.18
+    & ([scriptblock]::Create((irm https://feynman.is/install.ps1))) -Version 0.2.23
 "@
  }

--- a/website/src/content/docs/getting-started/installation.md
+++ b/website/src/content/docs/getting-started/installation.md
@@ -117,13 +117,13 @@ These installers download the bundled `skills/` and `prompts/` trees plus the re
 The one-line installer already targets the latest tagged release. To pin an exact version, pass it explicitly:

 ```bash
-curl -fsSL https://feynman.is/install | bash -s -- 0.2.18
+curl -fsSL https://feynman.is/install | bash -s -- 0.2.23
 ```

 On Windows:

 ```powershell
-& ([scriptblock]::Create((irm https://feynman.is/install.ps1))) -Version 0.2.18
+& ([scriptblock]::Create((irm https://feynman.is/install.ps1))) -Version 0.2.23
 ```

 ## Post-install setup
--- a/website/src/content/docs/getting-started/setup.md
+++ b/website/src/content/docs/getting-started/setup.md
@@ -52,9 +52,25 @@ Amazon Bedrock (AWS credential chain)

 Feynman verifies the same AWS credential chain Pi uses at runtime, including `AWS_PROFILE`, `~/.aws` credentials/config, SSO, ECS/IRSA, and EC2 instance roles. Once that check passes, Bedrock models become available in `feynman model list` without needing a traditional API key.

-### Local models: Ollama, LM Studio, vLLM
+### Local models: LM Studio, Ollama, vLLM

-If you want to use a model running locally, choose the API-key flow and then select:
+If you want to use LM Studio, start the LM Studio local server, load a model, choose the API-key flow, and then select:
+
+```text
+LM Studio (local OpenAI-compatible server)
+```
+
+The default settings are:
+
+```text
+Base URL: http://localhost:1234/v1
+Authorization header: No
+API key: lm-studio
+```
+
+Feynman attempts to read LM Studio's `/models` endpoint and prefill the loaded model id.
+
+For Ollama, vLLM, or another OpenAI-compatible local server, choose:

 ```text
 Custom provider (baseUrl + API key)
@@ -70,7 +86,7 @@ Model ids: llama3.1:8b
 API key: local
 ```

-That same custom-provider flow also works for other OpenAI-compatible local servers such as LM Studio or vLLM. After saving the provider, run:
+After saving the provider, run:

 ```bash
 feynman model list
--- a/website/src/content/docs/workflows/draft.md
+++ b/website/src/content/docs/workflows/draft.md
@@ -35,6 +35,8 @@ When working from existing session context (after a deep research or literature

 The writer pays attention to academic conventions: claims are attributed to their sources with inline citations, methodology sections describe procedures precisely, and limitations are discussed honestly. The draft includes placeholder sections for any content the writer cannot generate from available sources, clearly marking what needs human input.

+Drafts follow Feynman's system-wide provenance rules: unsupported results, figures, images, tables, or benchmark data should become clearly labeled gaps or TODOs, not plausible-looking claims.
+
 ## Output format

 The draft follows standard academic structure:
Author	SHA1	Message	Date
Advait Paliwal	c3f7f6ec08	Add LM Studio setup and blocked research artifacts	2026-04-16 15:39:01 -07:00
Advait Paliwal	d2570188f9	Add first-class LM Studio setup	2026-04-16 15:34:32 -07:00
Advait Paliwal	ca559dfd91	Fix extension repair and add Opus 4.7 overlay	2026-04-16 14:05:17 -07:00
Advait Paliwal	46b2aa93d0	Skip release when npm version already exists	2026-04-15 23:15:27 -07:00
Advait Paliwal	043e241464	Deduplicate fabricated-results guardrails	2026-04-15 22:53:38 -07:00
Advait Paliwal	501364da45	Deduplicate draft guardrails under system prompt	2026-04-15 22:50:04 -07:00
Advait Paliwal	fe24224965	Add system-wide guardrails against fabricated results	2026-04-15 22:45:04 -07:00
Advait Paliwal	9bc59dad53	Forbid fabricated draft results	2026-04-15 22:38:51 -07:00
Advait Paliwal	7fd94c028e	Add star history chart to README	2026-04-15 18:40:54 -07:00
Advait Paliwal	080bf8ad2c	Simplify publish workflow and restore auto release	2026-04-15 18:17:28 -07:00
Advait Paliwal	82cafd10cc	Fix publish workflow dispatch context	2026-04-15 18:15:20 -07:00