Document ID: T_1_10
Section: T_Psychology_Social
Keywords: psychometrics, intelligence, IQ, g factor, Spearman, fluid intelligence, crystallized intelligence, Cattell, multiple intelligences, Gardner, emotional intelligence, Sternberg, Wechsler, Stanford-Binet, Flynn effect, stereotype threat, measurement, reliability, validity, factor analysis, item response theory, bias, cultural fairness
Category Tags: psychology, social, art-culture
Cross-References: T_1_08 · ZC_1_12 · T_3_06 · T_1_09 · T_3_05
Reliability Tier: Tier 1 (g factor among the most replicated findings in differential psychology)
Last Updated: Mar 07, 2026 | Source Count: 20 | Weighted Score: 45 | Source Confidence: [5/5] | Confidence: Very High
QUICK SUMMARY
Intelligence testing is among the oldest and most psychometrically robust enterprises in psychology. Spearman's g factor (1904) — a general mental ability extracted through factor analysis — remains one of the strongest predictors of academic achievement, job performance (r ≈ .51; Schmidt & Hunter, 1998), health outcomes, and longevity. Despite controversy over what intelligence "is," the predictive validity of cognitive ability tests is among the most replicated findings in differential psychology.
Modern intelligence theory recognizes a hierarchical structure: the Cattell-Horn-Carroll (CHC) model posits a general factor (g) at the apex, broad abilities (fluid reasoning [Gf], crystallized ability [Gc], processing speed [Gs], working memory [Gwm], etc.) at the second stratum, and narrow abilities at the first. The Flynn effect — rising IQ scores across generations (≈3 points per decade) — demonstrates environmental contributions but complicates comparisons across cohorts. Stereotype threat (Steele & Aronson, 1995) showed situational factors can depress test performance, though recent meta-analyses have found smaller and more situationally specific effects than originally reported.
1. VERIFIED CLAIMS (Tier 1 — Peer-Reviewed / Archaeological Record)
1.1 The g factor and hierarchical structure
- Spearman (1904): Factor analysis of cognitive tests reveals a dominant general factor (g) — performance on diverse cognitive tasks is positively correlated (the "positive manifold"); g accounts for 40–50% of variance in large test batteries.
- Cattell-Horn-Carroll (CHC) model: Three-stratum hierarchy — (1) narrow abilities (~70), (2) broad abilities (~10: Gf, Gc, Gs, Gwm, Gv, Ga, Glr, Gsm), (3) g at the apex; most comprehensive empirically-derived model; basis of modern test construction (WJ-IV, KABC-II).
- Fluid (Gf) vs. Crystallized (Gc) intelligence (Cattell, 1963): Gf — reasoning with novel problems, pattern detection, declines from ~25 years; Gc — accumulated knowledge and vocabulary, stable or increasing into 60s–70s; double dissociation supports the distinction.
- Neural correlates: g is correlated with total brain volume (r ≈ .25–.30; McDaniel, 2005 meta-analysis), white matter integrity, cortical thickness, and neural efficiency (more intelligent individuals show less prefrontal activation during moderate-difficulty tasks — neural efficiency hypothesis; Neubauer & Fink, 2009).
1.2 Predictive validity of intelligence tests
- Academic achievement: IQ correlates r ≈ .50–.60 with grades and educational attainment (Strenze, 2007 meta-analysis; N = 70,000+).
- Job performance: General mental ability is the strongest single predictor of job performance across all occupations (r ≈ .51 for medium-complexity jobs; Schmidt & Hunter, 1998 meta-analysis); validity increases with job complexity; incremental validity above experience, education, and personality.
- Health and longevity: Higher childhood IQ predicts lower mortality risk (Deary et al., 2004 — Scottish Mental Survey 1932; HR ≈ 0.79 per 1 SD increase); mechanisms include health literacy, socioeconomic pathways, and system integrity.
- G is not the whole story: Specific abilities predict outcomes beyond g; conscientiousness adds incremental validity for job performance (ρ ≈ .22–.27; Barrick & Mount, 1991); domain knowledge, creativity, and practical skills matter for real-world success.
1.3 Major intelligence tests
- Wechsler Adult Intelligence Scale (WAIS-IV): Full-Scale IQ (FSIQ) with four index scores: Verbal Comprehension (VCI), Perceptual Reasoning (PRI), Working Memory (WMI), Processing Speed (PSI); mean = 100, SD = 15; most widely used adult IQ test.
- Stanford-Binet Intelligence Scales, 5th Edition (SB5): Five factors: Fluid Reasoning, Knowledge, Quantitative Reasoning, Visual-Spatial Processing, Working Memory; strong CHC alignment; extends to age 2.
- Wechsler Intelligence Scale for Children (WISC-V): Dominant child intelligence test; five factor indexes aligned with CHC.
- Raven's Progressive Matrices: Non-verbal, culture-reduced measure of fluid intelligence (Gf); widely used cross-culturally; requires pattern completion with increasing difficulty.
- Psychometric properties: Internal consistency reliabilities ≥ .90 for FSIQ; test-retest stability ≥ .85. These are among the most reliable psychological measures.
1.4 The Flynn effect
- Flynn (1984, 1987): IQ scores have risen ~3 points per decade throughout the 20th century in industrialized nations — a massive effect (≈15 points over 50 years); gains are largest on fluid intelligence measures (Raven's), smaller on crystallized measures; discovered by James Flynn and named by Herrnstein & Murray.
- Proposed mechanisms: Improved nutrition, reduced infectious disease, greater schooling, increased environmental complexity, cognitive stimulation from technology; the effect is almost certainly environmental since genetic change cannot occur this rapidly.
- Reversal: Some Scandinavian countries show declining IQ scores since the 1990s (Dutton & Lynn, 2013; Bratsberg & Rogeberg, 2018 — within-family design rules out immigration/compositional effects); cause debated (environmental factors, ceiling effects, changing item content).
- Implication: IQ norms must be regularly restandardized; using outdated norms inflates scores (critical for clinical/legal decisions, e.g., intellectual disability diagnosis for death penalty eligibility).
2. CREDIBLE BUT DEBATED CLAIMS (Tier 2 — Academic / Debated)
2.1 Heritability of intelligence
- Twin studies: Heritability of IQ is ~50% in childhood, rising to ~70–80% in adulthood (Bouchard & McGue, 1981; Plomin & Deary, 2015 — "Wilson effect"); shared environment matters substantially in childhood but diminishes by adulthood; non-shared environment accounts for ~20%.
- GWAS findings: Individual SNPs have tiny effects; polygenic scores (GPS/PGS) explain ~10–15% of IQ variance as of 2023 (Savage et al., 2018 — N > 269,000); thousands of genetic variants each contribute minimally.
- Debate: Heritability is a population statistic, not an individual one; it does not mean traits are immutable; heritability estimates vary by socioeconomic context (lower in deprived environments, higher in enriched ones — the Scarr-Rowe interaction, replicated in some but not all studies).
2.2 Stereotype threat
- Steele & Aronson (1995): African American students performed worse on a verbal test when it was presented as diagnostic of ability vs. a lab exercise; interpreted as showing that awareness of negative stereotypes impairs performance through anxiety, reduced working memory, and self-monitoring.
- Meta-analyses (Nguyen & Ryan, 2008; Flore & Wicherts, 2015): Effects are smaller than original studies suggested (d ≈ 0.26 for minorities, smaller for gender-math); significant publication bias detected; effects depend on the degree of threat activation and whether the test is truly high-stakes.
- Status: Stereotype threat is a real but more modest and context-dependent phenomenon than the original literature suggested; it does not fully account for group differences in test scores.
2.3 Test bias and cultural fairness
- Predictive bias: Major IQ tests generally predict academic and job outcomes equally well across racial/ethnic groups — they do not systematically overpredict or underpredict for minority groups (Jensen, 1980; Hartigan & Wigdor, 1989 NRC report); in some cases, they slightly overpredict minority performance.
- Measurement invariance: Factor structure of intelligence tests is largely invariant across groups (same g, same broad factors; Wicherts, 2007); however, some specific items may show differential item functioning (DIF).
- Debate: Absence of predictive bias does not mean tests are "fair" in a broader social sense — test scores reflect accumulated environmental (dis)advantage; construct bias vs. predictive bias are different questions.
2.4 Multiple intelligences and emotional intelligence
- Gardner's Multiple Intelligences (1983): Proposed 8 intelligences (linguistic, logical-mathematical, spatial, musical, bodily-kinesthetic, interpersonal, intrapersonal, naturalist); influential in education but lacking psychometric support — no reliable independent measures; the proposed intelligences tend to correlate (suggesting g); not empirically distinguishable from talents/abilities.
- Emotional Intelligence (Mayer & Salovey, 1997 vs. Goleman, 1995): Ability model (perceiving, using, understanding, managing emotions) — measured by MSCEIT, correlates modestly with g (r ≈ .20–.30); adds small incremental prediction of life outcomes beyond g and personality (ΔR² ≈ .01–.04; Joseph & Newman, 2010). Trait EI (self-report) — substantially overlaps with personality traits (especially agreeableness and emotional stability).
- Sternberg's Triarchic Theory: Analytical, creative, practical intelligence; weak psychometric support; factor analyses suggest the three components are not clearly separable from g.
3. SPECULATIVE CLAIMS (Tier 3 — Possible but Unverified)
3.1 Intelligence augmentation through neurostimulation
- tDCS (transcranial direct current stimulation) over the DLPFC has been proposed to enhance fluid intelligence and working memory; Jaeggi et al. (2008) dual n-back training showed initial transfer to fluid intelligence, but meta-analyses (Melby-Lervåg & Hulme, 2013) found no convincing evidence for far transfer from working memory training to fluid intelligence; tDCS effects are small and inconsistent.
3.2 Group differences in IQ: environmental vs. genetic contributions
- Mean IQ score differences between racial/ethnic groups are documented (e.g., ~15 points between Black and White Americans, narrowing over time); experts overwhelmingly agree that environmental factors (socioeconomic disadvantage, educational access, health, stereotype threat, test exposure) play a major role; the heritability of IQ within groups does not logically entail genetic contributions to between-group differences (Lewontin, 1972); rigorous causal studies (adoption, admixture) remain inconclusive.
4. DUBIOUS OR FRINGE CLAIMS (Tier 4 — No Credible Source / Contradicted by Evidence)
4.1 IQ tests measure innate, fixed intelligence
IQ tests measure current cognitive performance — not a fixed quantity. IQ scores can change with education, intervention, nutrition, and environmental enrichment; the Flynn effect alone demonstrates massive malleability within genetically stable populations.
4.2 Brain Training games substantially raise intelligence
Commercial brain training programs (Lumosity, BrainAge) claim to boost IQ or general cognitive ability; the FTC fined Lumosity $2 million (2016) for deceptive advertising; meta-analyses show training improves performance on trained tasks but does not transfer to general intelligence or real-world cognitive functioning (Simons et al., 2016 — comprehensive review).
4.3 Single IQ score captures all of human intelligence
Intelligence is multidimensional (CHC model identifies ~10 broad abilities); a single FSIQ score is a useful summary statistic but obscures meaningful profile variability; practical, creative, social, and emotional competencies are not fully captured.
COUNTER-ARGUMENTS & CRITICISMS
| Claim | Counter-Argument | Source |
|---|
| g factor is the dominant predictor | Specific abilities, personality, motivation also matter | Sternberg, 1985 |
| IQ tests are culturally biased | Predictive validity is roughly equal across groups | Jensen, 1980; NRC, 1989 |
| Multiple intelligences are distinct | They correlate highly, suggesting g underlies them | Visser et al., 2006 |
| Brain training raises intelligence | No far transfer; trained task improvement only | Simons et al., 2016 |
| Stereotype threat explains group gaps | Effects smaller than reported; publication bias | Flore & Wicherts, 2015 |
IMAGES
| Description | Source | Type |
|---|
| CHC three-stratum intelligence model | McGrew, 2009 | Hierarchical model |
| Flynn effect rising IQ scores across nations | Flynn, 1987 | Trend data |
| Wechsler WAIS-IV index score structure | Wechsler, 2008 | Test architecture |
| Raven's Progressive Matrices item example | Raven et al., 1998 | Measurement tool |
| Heritability of IQ across the lifespan (Wilson effect) | Plomin & Deary, 2015 | Developmental trajectory |
BIBLIOGRAPHY
- Spearman, Charles | 1904 | "'General Intelligence,' Objectively Determined and Measured" | American Journal of Psychology | ∅ | 15::201–293 | ∅ | ∅ | doi:10.2307/1412107 | ∅ | ∅ | ∅
- McGrew, Kevin S | 2009 | "CHC Theory and the Human Cognitive Abilities Project" | Journal of Psychoeducational Assessment | ∅ | 27::1–21 | ∅ | ∅ | doi:10.1016/j.intell.2008.08.004 | ∅ | ∅ | ∅
- Cattell, Raymond B | 1963 | "Theory of Fluid and Crystallized Intelligence" | Journal of Educational Psychology | ∅ | 54::1–22 | ∅ | ∅ | doi:10.1037/h0046743 | ∅ | ∅ | ∅
- Schmidt, Frank L.; John E | 1998 | "The Validity and Utility of Selection Methods in Personnel Psychology" | Psychological Bulletin | ∅ | 124::262–274 | Hunter | ∅ | doi:10.1037//0033-2909.124.2.262 | ∅ | ∅ | ∅
- Flynn, James R | 1984 | "The Mean IQ of Americans: Massive Gains 1932 to 1978" | Psychological Bulletin | ∅ | 95::29–51 | ∅ | ∅ | doi:10.1037/0033-2909.95.1.29 | ∅ | ∅ | ∅
- Plomin, Robert; Ian J | 2015 | "Genetics and Intelligence Differences: Five Special Findings" | Molecular Psychiatry | ∅ | 20::98–108 | Deary | ∅ | ∅ | ∅ | ∅ | ∅
- Neubauer, Aljoscha C.; Andreas Fink | 2009 | "Intelligence and Neural Efficiency" | Neuroscience & Biobehavioral Reviews | ∅ | 33::1004–1023 | ∅ | ∅ | ∅ | ∅ | ∅ | ∅
- Steele, Claude M.; Joshua Aronson | 1995 | "Stereotype Threat and the Intellectual Test Performance of African Americans" | Journal of Personality and Social Psychology | ∅ | 69::797–811 | ∅ | ∅ | ∅ | ∅ | ∅ | ∅
- Flore, Paulette C.; Jelte M | 2015 | "Does Stereotype Threat Influence Performance of Girls in Stereotyped Domains?" | Journal of School Psychology | ∅ | 53::25–44 | Wicherts | ∅ | ∅ | ∅ | ∅ | ∅
- Mayer, John D.; Peter Salovey | 1997 | "What Is Emotional Intelligence?" | Emotional Development and Emotional Intelligence | ∅ | ∅ | In , edited by Peter Salovey and David J | ∅ | ∅ | ∅ | ∅ | Slusyer, 3 31; New York: Basic Books
- Gardner, Howard | 1983 | ∅ | Frames of Mind: The Theory of Multiple Intelligences | ∅ | ∅ | New York: Basic Books | ∅ | ∅ | ∅ | ∅ | ∅
- Strenze, Tarmo | 2007 | "Intelligence and Socioeconomic Success: A Meta-Analytic Review of Longitudinal Research" | Intelligence | ∅ | 35::401–426 | ∅ | ∅ | ∅ | ∅ | ∅ | ∅
- Deary, Ian J., et al | 2004 | "The Impact of Childhood Intelligence on Later Life" | Journal of Personality and Social Psychology | ∅ | 86::130–147 | ∅ | ∅ | ∅ | ∅ | ∅ | ∅
- Savage, Jeanne E., et al | 2018 | "Genome-wide Association Meta-Analysis in 269,867 Individuals Identifies New Genetic and Functional Links to Intelligence" | Nature Genetics | ∅ | 50::912–919 | ∅ | ∅ | ∅ | ∅ | ∅ | ∅
- Simons, Daniel J., et al | 2016 | "Do 'Brain-Training' Programs Work?" | Psychological Science in the Public Interest | ∅ | 17::103–186 | ∅ | ∅ | ∅ | ∅ | ∅ | ∅
- Jensen, Arthur R. | 1980 | ∅ | Bias in Mental Testing | ∅ | ∅ | New York: Free Press | ∅ | ∅ | ∅ | ∅ | ∅
- Wechsler, David | 2008 | ∅ | WAIS-IV: Administration and Scoring Manual | ∅ | ∅ | San Antonio, TX: Pearson | ∅ | ∅ | ∅ | ∅ | ∅
- Bouchard, Thomas J.; Matthew McGue | 1981 | "Familial Studies of Intelligence: A Review" | Science | ∅ | 212::1055–1059 | ∅ | ∅ | ∅ | ∅ | ∅ | ∅
- Bratsberg, Bernt; Ole Rogeberg | 2018 | "Flynn Effect and Its Reversal Are Both Environmentally Caused" | Proceedings of the National Academy of Sciences | ∅ | 115::6674–6678 | ∅ | ∅ | ∅ | ∅ | ∅ | ∅
- Melby-Lervåg, Monica; Charles Hulme | 2013 | "Is Working Memory Training Effective? A Meta-Analytic Review" | Developmental Psychology | ∅ | 49::270–291 | ∅ | ∅ | ∅ | ∅ | ∅ | ∅
CROSS-REFERENCE INDEX
Document T_1_10 · Created Mar 07, 2026 · TheoriesOfAnything Knowledge Base
<table border="1" cellpadding="12" cellspacing="0" style="border-collapse: collapse; border: 2px solid #888; margin-top: 2em; background: #fafafa;">
<tr><td>
⚠️ AI-Assisted Research Disclaimer
This document was generated and structured with the assistance of AI tools.
While every effort is made to ensure accuracy, AI-assisted content may
contain errors, misattributions, or unintended inaccuracies. **Always
verify claims, dates, and sources independently** before citing or relying
on any information presented here.
- Sources may contain errors. Bibliography entries and cross-references
are checked by automated systems, but mistakes can occur. If something
looks wrong, it may be.
- Speculative and unverified claims are clearly labeled. This project
uses a four-tier evidence system:
- Tier 1 — Verified: Peer-reviewed, established scientific consensus.
- Tier 2 — Credible: Academically supported, debated but grounded.
- Tier 3 — Speculative: Plausible but unverified by mainstream science.
- Tier 4 — Dubious: No credible support or contradicted by evidence.
- This project maps multiple perspectives — not a single truth. Mainstream,
alternative, and skeptical viewpoints are presented side by side for
critical comparison, not endorsement. Inclusion does not imply agreement.
- We are actively improving. Source verification, factuality scoring,
and bibliography enrichment are ongoing. Each revision adds stronger
citations, corrects identified errors, and expands coverage.
📖 For full details on our verification methodology, scoring systems, and
quality metrics, see: Fact-Checking & Verification Systems
Think Openly. Check the sources. Draw your own conclusions.
</td></tr>
</table>