QUALITY SCORES

Current state of the corpus — scored on 15 dimensions across 3,632 documents. Updated as the knowledge base improves.

← How We Work
01 — Headline Metrics

Two Independent Scores, One Corpus

Quality and Factuality are measured separately — quality captures structural rigor, factuality captures verifiability. A document can be well-structured and poorly sourced, or vice versa. Both matter.

88.45
A- · Quality Scorecard
Structure, citations, evidence depth,
counter-args, cross-references
75.29
B · Factuality Score
Verifiability, bibliography breadth,
specificity, tier weighting

Last measured: April 27, 2026 (after 18-document counter-argument improvement pass and full canonical bibliography migration). Scores update when the pipeline reruns on the full corpus.

02 — Quality Dimensions

Eight Dimensions — 100 Points Total

Each dimension is weighted by importance. Structural compliance and counter-argument rigor carry the most weight — these are the hardest to get right and the most important for reader trust.

Structural Compliance
100%
Content Completeness
99%
Bibliography Quality
94.5%
Cross-Referencing
87%
Source Attribution
85%
Evidence Depth
85.5%
Factual Traceability
81.8%
Counter-Arg Rigor
70%

Counter-Argument Rigor is the hardest dimension to improve — it requires finding real, published scholarly challenges for every document, not inventing objections. It has improved from 51% (early 2026) to 70% through a systematic improvement pass. It remains the primary quality target.

03 — Source Confidence Distribution

How the 3,632 documents Are Distributed by Source Quality

The [N/5] source confidence rating is assigned to each document based on its weighted bibliography score. Distribution reflects a corpus built for breadth — many topics have limited peer-reviewed literature available.

[5/5]
243
6.7%
[4/5]
1,074
29.6%
[3/5]
1,302
35.9%
[2/5]
344
9.5%
[1/5]
664
18.3%

Why so many [1/5] documents? The [1/5] rating (under 14 weighted points) captures two types of document: those covering topics where peer-reviewed literature is genuinely sparse (oral history, ancient mythology, some alternative archaeology), and some older documents still pending bibliography enrichment. It does not necessarily mean the content is wrong — it means the sourcing is thin and should be treated with extra caution.

04 — Quality Milestones

Where We Are, Where We're Going

Quality improvement is ongoing and tracked. The targets below are the current active benchmarks — not arbitrary, but tied to specific improvements in research reliability.

Target Metric Status
Structural compliance at 100% Quality: Structural ✓ Complete
Factuality score above 75.0/100 (B) Factuality average ✓ 75.29 — Complete Apr 2026
Counter-argument rigor at 70% Quality: Counter-Arg ✓ Complete Apr 2026
Source attribution at 85% Quality: Source Attr ✓ Complete
Canonical bibliography migration (3,049 docs) Bibliography schema ✓ Complete Apr 2026
CrossRef coverage: 4,000+ docs Crossref verified ✓ 4,318 docs
100% falsifier coverage on InterDocs Epistemic Integrity ✓ 65/65 InterDocs
Quality score above 90/100 (A) Quality average → In progress · 88.45 now
Factuality score above 77/100 Factuality average → In progress · 75.29 now
Counter-argument rigor at 75% Quality: Counter-Arg ○ Planned