autoresearch-quantum

mirror of https://github.com/saymrwulf/autoresearch-quantum.git synced 2026-06-29 03:00:36 +00:00

Author	SHA1	Message	Date
saymrwulf	d4ca5f99dc	Clarify background/foreground behavior in app.sh start output Users now see explicit messages about whether Jupyter runs in background or foreground, what happens when the terminal closes, and how to stop the server.	2026-04-16 09:28:10 +02:00
saymrwulf	a16f99e2ed	Harden Jupyter lifecycle and enhance E2E browser UX tests app.sh: full Jupyter env isolation (CONFIG_DIR, DATA_DIR, RUNTIME_DIR, IPYTHONDIR), auto port allocation, foreground mode, restart command, graceful stop (SIGTERM → wait → SIGKILL), orphan detection, stale runtime JSON cleanup, reset-state command. test_browser_ux.py: add TestNavigationClickThrough (click Plan A link, verify target opens; click forward-link in Plan D), TestWidgetInteraction (run all cells, verify output areas, click quiz Submit and verify feedback), TestProgressPersistence (kernel API session, progress file schema validation). .gitignore: add Jupyter isolation directories. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-15 21:20:34 +02:00
saymrwulf	29caba3a1a	Add professional toolchain: mypy strict, CI pipeline, Playwright UX tests, pedagogy validation Infrastructure: - Configure mypy strict mode in pyproject.toml; fix all 53 type errors across 8 source files - Add .pre-commit-config.yaml (ruff, mypy, nbstripout, trailing whitespace) - Add .github/workflows/ci.yml: lint + type check, unit tests (Python 3.11/3.12), notebook execution - Add scripts/app.sh consumer lifecycle manager (bootstrap, start, stop, status, validate, logs, reset) Testing: - Add tests/test_browser_ux.py: Playwright end-to-end UX tests covering JupyterLab launch, notebook rendering, navigation links, widget rendering, and full consumer walkthrough - Add tests/test_pedagogy.py: 130 pedagogical structure tests validating prose quality (word counts, markdown ratio), section structure, assessment density and variety, Bloom's taxonomy coverage, checkpoint presence, tracker integration, key insight callouts, and cross-plan concept consistency Quality: - Fix ruff E741 (ambiguous variable name) across all builder scripts - Add Key Insight callouts to plan_a/01_encoded_magic_state.ipynb - Add pytest 'browser' marker for selective UX test runs - Expand .gitignore with .logs/ and build artifacts 319 tests pass, 85% coverage, mypy strict clean, ruff clean.	2026-04-15 20:00:19 +02:00
saymrwulf	5d0c28f939	Harden teaching layer, add notebook execution tests, fix repo hygiene Quality fixes: - Add deprecation warnings to 5 silent no-op legacy wrappers in assess.py - Remove dead code in tracker.py score_by_section (unused first loop) - Remove unused variable in assess.py _check_order - Fix .gitignore: add progress JSONs, checkpoints, .coverage, .DS_Store, LaTeX aux - Fix "all three plans" → "all four plans" in learning_objectives.md - Add teaching/ package to README project tree - Add compendium to README paper tree Testing: - Add 43 unit tests for teaching/assess.py and tracker.py (quiz, predict_choice, reflect, order, checkpoint_summary, legacy wrapper deprecation warnings, tracker scoring, persistence, mastery calculation) - Add notebook execution test suite (nbclient): all 11 notebooks execute without errors in a fresh kernel, structural validation (valid JSON, has code cells, has assessments, section parameters, learning objectives document) - Overall test count: 185 passing (was 107), coverage: 85% (was ~25% in tests) Toolchain: - Add pytest-cov, ruff, nbclient, nbformat to dev dependencies - Add ruff config (E, F, W, I, UP, B, SIM rules) - Add coverage config with term-missing output - Fix all ruff lint issues across src/ and tests/ (import sorting, unused imports) - Fix Plan D notebook paths (configs/rungs → ../../configs/rungs) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-15 15:34:37 +02:00
saymrwulf	5f8b584210	Add Plan D: three claim-driven experiment notebooks Plan D structures the material as a scientific argument chain: - Experiment 1: Can the [[4,2,2]] code protect a magic state? (W=1.0, 12/12 errors) - Experiment 2: How much magic survives noise? (scoring, parameter sweeps) - Experiment 3: Can a ratchet learn to optimise? (monotonic improvement, transfer) Each notebook follows Hypothesis → Claim → Experiment → Proof → Next Hypothesis. Includes builder script, updated learning objectives, README, and compendium cross-ref. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-07 19:13:43 +02:00
saymrwulf	e13a3268c2	Add teaching notebooks, widget-based quizzes, bug fixes, and expanded tests - 8 Jupyter notebooks across 3 learning plans (A: bottom-up, B: spiral, C: parallel tracks) - Teaching toolkit (src/autoresearch_quantum/teaching/) with ipywidgets-based quiz, predict_choice, reflect, and order widgets — visually distinct from code cells - Fix spectator_z operator: was {1:'Z',2:'Z'} (IZZI, expectation=0), now {1:'Z',3:'Z'} (ZIZI, expectation=+1 for ideal T-state, commutes with logical operators) - Fix u_magic seed: swap phase arguments to match h_p and ry_rz preparations - Fix double-display bug: widgets rendered twice when function returned the box - Fix CLI override parser for negative integers and missing '=' validation - Fix stabilizer detection quiz: ZZZZ detects X errors, not Z errors - Add ties parameter to order() for questions with interchangeable items - Expand test suite from 21 to 107 tests - Update README with notebook instructions and project tree	2026-04-07 17:14:37 +02:00

6 commits