RESEARCH BASE
Search 3,717 documents across 34 fields — every claim tier-rated by evidence
12 results for "transformer"
ZD_2_15 — Transformer Architecture: Self-Attention and the Foundation of Modern AI
The transformer is a neural network architecture introduced in 2017 that replaced recurrent and convolutional models as the dominant paradigm in artificial intelligence. Its core innovation — the self-attention mechanism
ZD_2_12 — Generative AI: Large Language Models, Diffusion, and the Transformer Revolution
Generative AI refers to artificial intelligence systems capable of creating new content — text, images, audio, video, code, 3D models — that is novel, coherent, and often indistinguishable from human-created work. The fi
ZG_5_01 — Computational Linguistics and NLP
Computational linguistics (CL) and natural language processing (NLP) are the interdisciplinary fields concerned with enabling computers to process, analyze, understand, and generate human language. CL originated in the 1
ZG_5_16 — Machine Translation and Semantic Loss: What Gets Lost Between Languages
Machine translation (MT) — the use of computational systems to translate text or speech from one language to another — has undergone revolutionary transformation since the 2010s through the advent of neural machine trans
ZG_5_09 — Machine Translation: Rule-Based, Statistical, and Neural Approaches
Machine Translation (MT) — the use of computers to translate text or speech from one natural language to another — has been a central problem of computational linguistics and artificial intelligence since the earliest da
INTERDOC_32 — AI, Consciousness, and the Ethical Frontier
[KEY FINDING] The alignment problem — ensuring that artificial intelligence systems pursue goals aligned with human values — has moved from science fiction to mainstream AI safety research. Stuart Russell (Human Compatib
O_1_10 — Carrington Event and Space Weather Threats to Earth
The Carrington Event of September 1–2, 1859 was the most intense geomagnetic storm in recorded history — caused by a massive coronal mass ejection (CME) from the Sun that struck Earth's magnetosphere approximately 17.6 h
ZD_2_03 — Natural Language Processing
Natural language processing (NLP) — the computational analysis, understanding, and generation of human language — spans rule-based, statistical, and neural approaches across tasks including machine translation, text clas
ZD_2_07 — Artificial General Intelligence — Architectures and Challenges
Artificial General Intelligence (AGI) — a hypothetical AI system capable of performing any intellectual task that a human can, with the same flexibility, generality, and ability to learn and transfer knowledge across dom
S_1_16 — Large Language Models: Architecture, Capabilities, and Societal Impact
Large Language Models (LLMs) are neural networks with billions to trillions of parameters, trained on massive text corpora to predict the next token in a sequence. Built on the transformer architecture introduced by Vasw
S_1_11 — Machine Learning and Deep Learning
Machine learning (ML) is the subfield of AI in which systems learn patterns from data rather than being explicitly programmed. Deep learning uses artificial neural networks with many layers (hence "deep") to learn hierar
V_4_19 — Machine Learning Mathematics: Neural Networks, Optimization, and Learning Theory
Machine learning mathematics — the theoretical foundations underlying the training, generalization, and behavior of learning algorithms — spans statistical learning theory, optimization, approximation theory, information
BROWSE BY SECTION — 3717 documents across 34 fields