Test Report

Automated test results from nightly CI runs. Updated daily at 2am UTC.

Loading...

Fetching latest test results...

Error

Failed to load test results.

ALL PASS
0
Passed
0
Failed
0
Total
0s
Duration

v7 Fast Regression

`make regression-fast` status is not available in this report yet.

-
Overall
0
Families Passed
0
Families Failed
0
Families Total
Family Status Build Smoke Coherence Contract 1stTok Note
No regression-fast data published in this report.

v7 Kernel Map Contracts

`make v7-kernel-map-contracts` status is not available in this report yet.

-
Overall
0
Maps Passed
0
Maps Failed
0
Warnings
Validator Notes
No kernel-map summary published in this report.

v7 Training Family Regression

`make regression-training-full` status is not available in this report yet.

-
Overall
0
Families Passed
0
Families Failed
0
Families Total
Family Status A B C D E F Failed Stages
No training family summary published in this report.

30-Day History

All passed
Some failed
No data

Test Details

0 kernel tests passed, 0 failed (click rows to expand)

Status Test Category Kernels Duration

About Nightly Tests

The nightly test suite runs automatically at 2am UTC every day. It includes:

๐Ÿ’ก
Run locally: make test (kernel-focused), make visualizer (fast path), make visualizer-full (includes tiny train-runtime ASan checks), make nightly (full matrix).
Nightly coverage now includes v7 kernel-map contracts, v7 backprop kernel parity, published training family regression, visualizer runbook regression checks, and published regression-fast family summaries.

View the GitHub Actions workflow for detailed logs.

See also: Version History for project roadmap and milestone tracking.


Visualizer Health Matrix

Contract-driven test coverage for the IR Visualizer and Dataset Viewer. Each component declares its tab/function/DOM contracts in JSON; tests validate against those contracts at every push. See the full test runbook.

Run locally: make v7-visualizer-health

IR Visualizer โ€” 11 tabs, 19 core functions

Tab Render Function L1 Static L2 Unit Tests
MemoryrenderMemorytab + panel + fnโ€”
Kernel FlowrenderKernelFlowtab + panel + fnโ€”
StatsrenderStatstab + panel + fnโ€”
TrainingrenderTrainingtab + panel + fnโ€”
QuantizationrenderQuantizationAudittab + panel + fnโ€”
DataflowrenderDataflowtab + panel + fnโ€”
InterpretabilityrenderOperatorMathIntuitiontab + panel + fnโ€”
ProfilerenderProfiletab + panel + fnโ€”
Data PipelinerenderDataPipelinetab + panel + fnโ€”
ParityrenderParityCockpittab + panel + fnโ€”

L2-tested utilities: formatBytes (4) ยท normalizeShapeInput (7) ยท formatShapeDisplay (3) ยท normalizeMode (3) ยท escapeHtml (3) ยท quoteShell (3) ยท normalizePathString (2) ยท pathDirname (3) ยท extractGgufStem (3) ยท relativePathFromTo (4) โ€” 35 tests

Dataset Viewer โ€” 12 tabs, 30 core functions

Tab Render Function L1 Static L2 Unit Tests
OverviewrenderOverviewtab + panel + fnโ€”
PreflightrenderPreflighttab + panel + fnโ€”
GalleryrenderGallerytab + panel + fnโ€”
TextrenderTextSamplestab + panel + fnโ€”
TokenizerrenderTokenizertab + panel + fnโ€”
VocabularyrenderVocabularytab + panel + fnโ€”
ClassificationrenderClassificationtab + panel + fnโ€”
BrowserenderBrowsetab + panel + fnโ€”
CandidatesrenderCandidatestab + panel + fnโ€”
QualityrenderQualitytab + panel + fnโ€”
EmbeddingsrenderEmbeddingstab + panel + fnembColor ยท embNormalise ยท cosineSim
AttentionrenderAttentiontab + panel + fnattnColor ยท attnEntropy ยท avgMatrices

L2-tested functions: attnColor (9) ยท embColor (3) ยท cosineSim (4) ยท attnEntropy (4) ยท avgMatrices (3) ยท embNormalise (4) โ€” 27 tests

Coverage Summary

LevelComponentChecksRun In
L1IR Visualizer~68pre-push
L1Dataset Viewer~78pre-push
L1IR Hub~5pre-push
L2IR Pure Functions~50pre-push
L2DS Pure Functions~50pre-push
L3Generated E2E (all 3 visualizers)~24-44nightly
Total ~260 < 3 seconds
๐Ÿ“‹
Contract-driven: All tab, function, and test vector definitions live in version/v7/tests/contracts/ir_visualizer_contract.json and dataset_viewer_contract.json. Add a tab or function? Update the contract โ€” tests auto-expand.

Level 3 โ€” Generated-File E2E (Nightly)

Validates the full generation โ†’ validation chain: generate all three visualizer HTML files from the latest training run, then run L1 health checks + embedded JSON structure + cross-artifact consistency on the output.

Run locally: make v7-visualizer-generated-e2e ยท Specific run: make v7-visualizer-generated-e2e RUN=/path/to/run

StageWhat it validatesArtifact
Generateir_report.html via open_ir_visualizer.pyir_report.html
Generatedataset_viewer.html via prepare_run_viewer.pydataset_viewer.html
Generateir_hub.html via open_ir_hub.pyir_hub.html
L1 HealthTabs, functions, DOM targets in generated outputAll three files
JSON StructureEmbedded JSON blobs (run_config, ir1_decode, layout_decode)ir_report.html
Panel StructurePanel IDs, attnColor presence, file sizedataset_viewer.html
Hub StructureRun cards, ir_report links, navigationir_hub.html
Cross-artifactRun name in hub, vocab consistency, config.jsonAll three files
Image
100% | |
Scroll to zoom | Drag to pan | W/H to fit | 0 to reset | ESC to close