Auto-aggregated figure index across every research domain. New figures appear here as soon as their manifest.json is added to public/. Click any chart to view at full resolution.

v0.2 audit — full 11×11 confusion matrix (row-normalised)

v0.2 audit — per-class precision / recall / F1 (with macro and weighted F1)

v0.2 audit — overall accuracy and per-class recall with 95 % confidence intervals

v0.2 audit — coverage–accuracy abstention curve

v0.2 audit — ResNet18 embedding nearest-neighbour leakage check (test → train)

v0.2 audit — perceptual-hash leakage check (test → train)

v0.2 audit — full 11×11 confusion matrix (row-normalised)
🔍
v0.2 audit — per-class precision / recall / F1 (with macro and weighted F1)
🔍
v0.2 audit — overall accuracy and per-class recall with 95 % confidence intervals
🔍
v0.2 audit — coverage–accuracy abstention curve
🔍
v0.2 audit — ResNet18 embedding nearest-neighbour leakage check (test → train)
🔍
v0.2 audit — perceptual-hash leakage check (test → train)
🔍
v0.2 robustness — safety-first scatter (Δaccuracy vs Δconfidence)
🔍
v0.2 robustness — accuracy and ECE under each perturbation / severity
🔍
v0.2 robustness — abstention rate at τ = 0.90 under each perturbation
🔍
Original headline figure (v0.1 framing) — 97.4 % on 11-class Chula
🔍
Detection gallery — real micrographs
🔍
Grad-CAM attention — how the model sees
🔍
Confusion matrix — v0.1 visualisation
🔍
Per-class recall — two channels
🔍
Calibration — per-class temperature scaling
🔍
Training dynamics
🔍
Conv-1 filter bank
🔍
Training dynamics — Deep MLP head on the 5-axis perception
🔍
Per-axis ablation — which modality carries the discriminative signal
🔍
Accuracy stratified by sequence-length quintile
🔍
Principal-component projection of the 64-D perception
🔍
Cognitive-module signals across training
🔍
Metacognition L1 — Ψ-vector across training
🔍
Ananke Protocol — safety verdict timeline
🔍
Reliability diagram and per-class confidence distribution
🔍
Mean Kyte–Doolittle hydrophobicity profile per class
🔍
Spectral fingerprint of the hydrophobicity track
🔍
Persistent-homology surrogate — Betti curves per class
🔍
Synaesthetic coherence — per-class cross-axis Pearson matrices
🔍
Hardest misclassifications — high-confidence errors in PCA space
🔍
Cognitive-module signal synergy across training checkpoints
🔍
Sequence-length distributions per split
🔍
Per-amino-acid composition shifts (toxic − non-toxic)
🔍
Mollweide of −log10 p
🔍
p-value distribution + KS test
🔍
Significance vs Galactic latitude
🔍
Q–Q against Uniform(0,1)
🔍
Tier 3 headline — V preserved, ECE −33%, F-rescue ×8
🔍
Coverage vs accuracy (selective prediction)
🔍
Free-energy AUROC per arrhythmia class
🔍
F-rescue ROC — moat finding
🔍
L2 class-aware drift events on DS2 stream
🔍
Tier 2 headline — 73.6% / F1=0.358 on full DS2
🔍
Tier 2 per-class recall on full DS2
🔍
Tier 1 → Tier 2 ablation
🔍
Inter-patient AAMI vs literature
🔍
Tier 2 training dynamics
🔍
Cardio Tier 1 headline (legacy)
🔍
Cardio Tier 1 per-class (legacy)
🔍
Cardio Tier 1 training (legacy)
🔍
Cardio Ψ-trajectory (cognitive state)
🔍
Cardio cognitive module signals
🔍
Cross-domain comparison
🔍
Headline accuracy — random 9.1% · cosine 17.8% · trained head 26.7%
🔍
Self-consistency trajectory
🔍
Cognitive module signals — three concur
🔍
Training dynamics
🔍
Per-class recall
🔍
Plasma — UCogNet vs HW baselines
🔍
Plasma — energy trajectories
🔍
Plasma — budget vs performance Pareto
🔍
Plasma — out-of-distribution gap
🔍
Plasma — ablation study
🔍
Module capability radar
🔍
Campaign heatmap
🔍
Regime dynamics
🔍
Hypothesis distribution
🔍
Module symbiosis matrix
🔍
Confidence phase plot
🔍
Dashboard summary
🔍
Test coverage
🔍
Regime radar
🔍
Confidence signals
🔍
Routing heatmap
🔍
Reward composition
🔍
Cost breakdown
🔍
Evidence asymmetry
🔍
Mutation intensity
🔍
Rollout pipeline
🔍
Statistical arsenal
🔍
Architecture pyramid
🔍
Shaping guardrails
🔍
Campaign results
🔍
Integration flow
🔍
Capability boundaries
🔍
Mode ablation
🔍
Parameter efficiency
🔍
Cost vs performance
🔍
Combined leaderboard
🔍
Leaderboard — ARC-AGI-2
🔍
Leaderboard — HLE rolling
🔍
Leaderboard — LiveBench
🔍
Leaderboard — SWE-Bench Lite
🔍