G_1_08

G_1_08 — Machine Learning in Archaeology — Pattern Recognition in the Past

Verified (Tier 1)
Confidence: 4/5 Section: G Updated: March 10, 2026
Source Count: 13 | Weighted Score: 37 | Source Confidence: [4/5] | Primary Tier: 1–2 | Last Updated: March 10, 2026
Keywords: machine learning, artificial intelligence, deep learning, neural network, convolutional neural network, CNN, random forest, object detection, classification, pottery classification, lithic analysis, LiDAR, satellite imagery, feature detection, predictive modeling, archaeological survey, computer vision, NLP, text mining, heritage management
Category Tags: modern-frameworks, methodology, computer science, archaeology, remote sensing
Cross-References: G_1_02 — Digital Archaeology LiDAR AI · G_1_03 — Remote Sensing Satellite Archaeology · ZD_2_01 — Machine Learning · G_2_03 — Bayesian Reasoning

QUICK SUMMARY

Machine learning (ML) — the subset of artificial intelligence in which algorithms learn patterns from data rather than being explicitly programmed — is transforming archaeological practice across every stage of research: survey, excavation, classification, analysis, and interpretation. The applications fall into several major categories: (1) Remote sensing and site detection — convolutional neural networks (CNNs) trained on satellite imagery or airborne LiDAR data can automatically detect archaeological features (mounds, enclosures, road networks, looted pits) across vast landscapes that would take human analysts years to survey manually; Caspari and Crespo (2019) used deep learning to detect burial mounds across the Eurasian steppe from satellite imagery with >90% precision, covering areas impossible to survey on foot. (2) Artifact classification — ML classifiers (random forests, support vector machines, CNNs) trained on images or 3D scans of pottery sherds, lithic tools, coins, and other artifacts can achieve classification accuracies comparable to expert human analysts for type, period, and provenance; Pawlowicz and Downum (2021) trained a CNN on >100,000 pottery sherd images from the American Southwest, achieving 83% classification accuracy across 15 types — comparable to inter-analyst agreement among human specialists. (3) Predictive modeling — ML algorithms trained on known site locations and associated environmental variables (slope, aspect, soil type, proximity to water, vegetation patterns) can predict where undiscovered sites are most likely to be located, guiding survey and heritage management; these "archaeological predictive models" (APMs) have been used since the 1990s but ML methods (gradient-boosted trees, neural networks) substantially outperform traditional logistic regression. (4) Text mining and NLP — natural language processing applied to excavation reports, historical texts, and unpublished "grey literature" to extract structured data (site coordinates, artifact types, dates, associations) from millions of pages of unstructured text. Key challenges include: training data scarcity (most archaeological datasets are small by ML standards — hundreds to thousands of examples, not millions); interpretability (deep learning models are often "black boxes" that cannot explain which features drive their classifications); bias (models trained on well-surveyed Western European/North American data may perform poorly in under-studied regions); and validation (archaeological ground truth is inherently incomplete — we cannot know all sites in a landscape, so false negatives are difficult to assess).


1. VERIFIED CLAIMS (Tier 1 — Peer-Reviewed / Scholarly Consensus)

1.1 Site Detection from Remote Sensing Data

1.2 Artifact Classification

1.3 Predictive Modeling


2. CREDIBLE CLAIMS (Tier 2 — Academic / Debated but Supported)

2.1 NLP for Archaeological Grey Literature

2.2 3D Morphometrics and Geometric Deep Learning


3. SPECULATIVE CLAIMS (Tier 3 — Possible but Unverified)

3.1 AI-Assisted Archaeological Interpretation


4. DUBIOUS CLAIMS (Tier 4 — No Credible Source / Contradicted by Evidence)

4.1 AI Has Discovered Lost Civilizations in Satellite Data


Counter-Arguments & Criticisms

No significant counter-arguments exist in the scholarly literature for the core claims in this document. Machine Learning in Archaeology — Pattern Recognition in the Past represents established scientific and methodological consensus with no active scholarly dispute over the fundamental claims presented here.


IMAGES

#DescriptionFilenameSourceLicense

No images assigned yet.


BIBLIOGRAPHY

  1. Caspari, G.; Crespo, P | 2019 | "Convolutional Neural Networks for Archaeological Site Detection — Finding 'Princely' Tombs" | Journal of Archaeological Science | ∅ | 110::104998 | ∅ | ∅ | doi:10.1016/j.jas.2019.104998 | ∅ | ∅ | ∅
  2. Pawlowicz, L.M.; Downum, C.E | 2021 | "Applications of Deep Learning to Decorated Ceramic Typology and Classification" | Journal of Archaeological Science | ∅ | 36::103065 | ∅ | ∅ | doi:10.1016/j.jasrep.2021.103065 | ∅ | ∅ | ∅
  3. Orengo, H.A.; Garcia-Molsosa, A | 2019 | "A Brave New World for Archaeological Survey: Automated Machine Learning-Based Potsherd Detection Using High-Resolution Drone Imagery" | Journal of Computer Applications in Archaeology | ∅ | 2::57–69 | ∅ | ∅ | ∅ | ∅ | ∅ | ∅
  4. Verhagen, P.; Whitley, T.G | 2012 | "Integrating Archaeological Theory and Predictive Modeling: A Live Report from the Scene" | Journal of Archaeological Method and Theory | ∅ | 19::49–100 | ∅ | ∅ | doi:10.1007/s10816-011-9102-7 | ∅ | ∅ | ∅
  5. Parcak, S.H. et al | 2016 | "Satellite Evidence of Archaeological Site Looting in Egypt: 2002–2013" | Antiquity | ∅ | 90::188–205 | ∅ | ∅ | doi:10.15184/aqy.2016.1 | ∅ | ∅ | ∅
  6. Brandsen, A. et al | 2020 | "Creating a Dataset for Named Entity Recognition in the Archaeology Domain" | Proceedings of the 12th LREC | ∅ | ∅ | In: : 4573 4577 | ∅ | ∅ | ∅ | ∅ | ∅
  7. Davis, D.S | 2019 | "Object-Based Image Analysis: A Review of Developments and Future Directions of Automated Feature Detection in Landscape Archaeology" | Archaeological Prospection | ∅ | 26::155–163 | ∅ | ∅ | doi:10.1002/arp.1730 | ∅ | ∅ | ∅
  8. Domínguez-Rodrigo, M.; Baquedano, E. e0205571 | 2018 | "Distinguishing Butchery Cut Marks from Crocodile Bite Marks through Machine Learning Methods" | PLoS ONE | ∅ | 13:: | ∅ | ∅ | doi:10.1371/journal.pone.0205571 | ∅ | ∅ | ∅
  9. Lambers, K. et al | 2019 | "Integrating Remote Sensing, Machine Learning, Geostatistics and GIS to Identify Groundwater Potential Zones and Archaeological Survey Targets" | Remote Sensing | ∅ | 11::2075 | ∅ | ∅ | ∅ | ∅ | ∅ | ∅
  10. Anichini, F. et al | 2020 | "Developing the ArchAIDE Application: A Digital Workflow for Identifying, Classifying and Sharing Archaeological Pottery Using Automated Image Recognition" | Internet Archaeology | ∅ | ∅ | 52 | ∅ | doi:10.11141/ia.52.7 | ∅ | ∅ | ∅
  11. Luo, L. et al | 2019 | "Airborne and Spaceborne Remote Sensing for Archaeological and Cultural Heritage Applications: A Review of the Century (1907–2017)" | Remote Sensing of Environment | ∅ | 232::111280 | ∅ | ∅ | doi:10.1016/j.rse.2019.111280 | ∅ | ∅ | ∅
  12. Argyrou, A.; Agapiou, A | 2022 | "A Review of Artificial Intelligence and Remote Sensing for Archaeological Research" | Remote Sensing | ∅ | 14::6000 | ∅ | ∅ | doi:10.3390/rs14236000 | ∅ | ∅ | ∅
  13. Bickler, S.H | 2021 | "Machine Learning Arrives in Archaeology" | Advances in Archaeological Practice | ∅ | 9::186–191 | ∅ | ∅ | doi:10.1017/aap.2021.6 | ∅ | ∅ | ∅

CROSS-REFERENCE INDEX

Related DocConnection

No cross-references yet.


<table border="1" cellpadding="12" cellspacing="0" style="border-collapse: collapse; border: 2px solid #888; margin-top: 2em; background: #fafafa;">

<tr><td>

⚠️ AI-Assisted Research Disclaimer

This document was generated and structured with the assistance of AI tools.

While every effort is made to ensure accuracy, AI-assisted content may

contain errors, misattributions, or unintended inaccuracies. **Always

verify claims, dates, and sources independently** before citing or relying

on any information presented here.

are checked by automated systems, but mistakes can occur. If something

looks wrong, it may be.

uses a four-tier evidence system:

alternative, and skeptical viewpoints are presented side by side for

critical comparison, not endorsement. Inclusion does not imply agreement.

and bibliography enrichment are ongoing. Each revision adds stronger

citations, corrects identified errors, and expands coverage.

📖 For full details on our verification methodology, scoring systems, and

quality metrics, see: Fact-Checking & Verification Systems

Think Openly. Check the sources. Draw your own conclusions.

</td></tr>

</table>