autoresearch-quantum

mirror of https://github.com/saymrwulf/autoresearch-quantum.git synced 2026-05-23 21:50:18 +00:00

Author	SHA1	Message	Date
saymrwulf	a16f99e2ed	Harden Jupyter lifecycle and enhance E2E browser UX tests app.sh: full Jupyter env isolation (CONFIG_DIR, DATA_DIR, RUNTIME_DIR, IPYTHONDIR), auto port allocation, foreground mode, restart command, graceful stop (SIGTERM → wait → SIGKILL), orphan detection, stale runtime JSON cleanup, reset-state command. test_browser_ux.py: add TestNavigationClickThrough (click Plan A link, verify target opens; click forward-link in Plan D), TestWidgetInteraction (run all cells, verify output areas, click quiz Submit and verify feedback), TestProgressPersistence (kernel API session, progress file schema validation). .gitignore: add Jupyter isolation directories. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-15 21:20:34 +02:00
saymrwulf	29caba3a1a	Add professional toolchain: mypy strict, CI pipeline, Playwright UX tests, pedagogy validation Infrastructure: - Configure mypy strict mode in pyproject.toml; fix all 53 type errors across 8 source files - Add .pre-commit-config.yaml (ruff, mypy, nbstripout, trailing whitespace) - Add .github/workflows/ci.yml: lint + type check, unit tests (Python 3.11/3.12), notebook execution - Add scripts/app.sh consumer lifecycle manager (bootstrap, start, stop, status, validate, logs, reset) Testing: - Add tests/test_browser_ux.py: Playwright end-to-end UX tests covering JupyterLab launch, notebook rendering, navigation links, widget rendering, and full consumer walkthrough - Add tests/test_pedagogy.py: 130 pedagogical structure tests validating prose quality (word counts, markdown ratio), section structure, assessment density and variety, Bloom's taxonomy coverage, checkpoint presence, tracker integration, key insight callouts, and cross-plan concept consistency Quality: - Fix ruff E741 (ambiguous variable name) across all builder scripts - Add Key Insight callouts to plan_a/01_encoded_magic_state.ipynb - Add pytest 'browser' marker for selective UX test runs - Expand .gitignore with .logs/ and build artifacts 319 tests pass, 85% coverage, mypy strict clean, ruff clean.	2026-04-15 20:00:19 +02:00
saymrwulf	5d0c28f939	Harden teaching layer, add notebook execution tests, fix repo hygiene Quality fixes: - Add deprecation warnings to 5 silent no-op legacy wrappers in assess.py - Remove dead code in tracker.py score_by_section (unused first loop) - Remove unused variable in assess.py _check_order - Fix .gitignore: add progress JSONs, checkpoints, .coverage, .DS_Store, LaTeX aux - Fix "all three plans" → "all four plans" in learning_objectives.md - Add teaching/ package to README project tree - Add compendium to README paper tree Testing: - Add 43 unit tests for teaching/assess.py and tracker.py (quiz, predict_choice, reflect, order, checkpoint_summary, legacy wrapper deprecation warnings, tracker scoring, persistence, mastery calculation) - Add notebook execution test suite (nbclient): all 11 notebooks execute without errors in a fresh kernel, structural validation (valid JSON, has code cells, has assessments, section parameters, learning objectives document) - Overall test count: 185 passing (was 107), coverage: 85% (was ~25% in tests) Toolchain: - Add pytest-cov, ruff, nbclient, nbformat to dev dependencies - Add ruff config (E, F, W, I, UP, B, SIM rules) - Add coverage config with term-missing output - Fix all ruff lint issues across src/ and tests/ (import sorting, unused imports) - Fix Plan D notebook paths (configs/rungs → ../../configs/rungs) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-15 15:34:37 +02:00
saymrwulf	5f8b584210	Add Plan D: three claim-driven experiment notebooks Plan D structures the material as a scientific argument chain: - Experiment 1: Can the [[4,2,2]] code protect a magic state? (W=1.0, 12/12 errors) - Experiment 2: How much magic survives noise? (scoring, parameter sweeps) - Experiment 3: Can a ratchet learn to optimise? (monotonic improvement, transfer) Each notebook follows Hypothesis → Claim → Experiment → Proof → Next Hypothesis. Includes builder script, updated learning objectives, README, and compendium cross-ref. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-07 19:13:43 +02:00
saymrwulf	e13a3268c2	Add teaching notebooks, widget-based quizzes, bug fixes, and expanded tests - 8 Jupyter notebooks across 3 learning plans (A: bottom-up, B: spiral, C: parallel tracks) - Teaching toolkit (src/autoresearch_quantum/teaching/) with ipywidgets-based quiz, predict_choice, reflect, and order widgets — visually distinct from code cells - Fix spectator_z operator: was {1:'Z',2:'Z'} (IZZI, expectation=0), now {1:'Z',3:'Z'} (ZIZI, expectation=+1 for ideal T-state, commutes with logical operators) - Fix u_magic seed: swap phase arguments to match h_p and ry_rz preparations - Fix double-display bug: widgets rendered twice when function returned the box - Fix CLI override parser for negative integers and missing '=' validation - Fix stabilizer detection quiz: ZZZZ detects X errors, not Z errors - Add ties parameter to order() for questions with interchangeable items - Expand test suite from 21 to 107 tests - Update README with notebook instructions and project tree	2026-04-07 17:14:37 +02:00

5 commits