The Training Costs of AI Models Over Time
What We’re Showing
This piece charts the rising cost of training AI models from 2017 to 2023, based on data from the 2024 Artificial Intelligence Index Report released by Stanford University.
How Training Cost is Determined
Stanford University collaborated with research firm Epoch AI to estimate AI model training costs, using cloud compute rental prices as the basis for each estimate.
Key factors that were analyzed include the model's training duration, the hardware’s utilization rate, and the value of the training hardware.
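
To make the idea of a rental-price-based estimate concrete, here is a minimal sketch in Python. It is not the method used by Stanford or Epoch AI, whose exact formula is not given here. It assumes a simplified model in which the cost equals the chip-hours a training run needs multiplied by an hourly rental price. The function name, its parameters, and the sample figures are all hypothetical.

```python
def estimate_training_cost(total_flop, peak_flop_per_sec, utilization, price_per_chip_hour):
    """Rough rental-based cost estimate in USD (illustrative assumption only).

    total_flop          -- total floating-point operations the training run needs
    peak_flop_per_sec   -- advertised peak throughput of one accelerator
    utilization         -- fraction of peak actually achieved, between 0 and 1
    price_per_chip_hour -- cloud rental price for one accelerator for one hour
    """
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in the range (0, 1]")
    if peak_flop_per_sec <= 0:
        raise ValueError("peak_flop_per_sec must be positive")

    # Lower utilization means less useful work per second, so more rented hours.
    effective_flop_per_sec = peak_flop_per_sec * utilization
    chip_hours = total_flop / effective_flop_per_sec / 3600
    return chip_hours * price_per_chip_hour


if __name__ == "__main__":
    # Made-up inputs, chosen only to show the arithmetic.
    cost = estimate_training_cost(
        total_flop=1e21,
        peak_flop_per_sec=1e14,
        utilization=0.4,
        price_per_chip_hour=2.0,
    )
    print(f"Estimated cost: ${cost:,.2f}")
```

Under this simplification, halving the utilization rate doubles the estimated cost, which is one reason utilization appears among the key factors.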
Ballooning Training Costs
The cost of training AI models has increased dramatically in recent years. For instance, it cost just $930 to train Transformer, a groundbreaking neural network architecture introduced in 2017, yet it cost over $78 million to train GPT-4 in 2023.
Google’s AI model, Gemini Ultra, cost even more, at a staggering $191 million. As of early 2024, the model outperformed GPT-4 on several metrics, most notably on the Massive Multitask Language Understanding (MMLU) benchmark.
This benchmark serves as a crucial yardstick for gauging the capabilities of large language models, evaluating their knowledge and problem-solving proficiency across 57 subject areas.
Dataset
Model name | Training Cost (USD) | Model Creators/Contributors | Release year |
---|---|---|---|
Transformer | $930 | Google | 2017 |
BERT-Large | $3,288 | Google | 2018 |
RoBERTa Large | $160,018 | Meta | 2019 |
GPT-3 175B (davinci) | $4,324,883 | OpenAI | 2020 |
Megatron-Turing NLG 530B | $6,405,653 | Microsoft/NVIDIA | 2021 |
LaMDA | $1,319,586 | Google | 2022 |
PaLM (540B) | $12,389,056 | Google | 2022 |
GPT-4 | $78,352,034 | OpenAI | 2023 |
Llama 2 70B | $3,931,897 | Meta | 2023 |
Gemini Ultra | $191,400,000 | Google | 2023 |
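
The scale of the increase can be checked with a few lines of arithmetic. The Python sketch below uses three cost figures and release years copied from the dataset. The helper names are hypothetical. The annualized growth factor is only an illustrative summary of the gap between two individual models, not a statistic from the report.

```python
# Estimated training cost (USD) and release year, copied from the dataset.
COSTS = {
    "Transformer": (930, 2017),
    "GPT-4": (78_352_034, 2023),
    "Gemini Ultra": (191_400_000, 2023),
}


def cost_multiple(later, earlier):
    """How many times more the later model cost to train than the earlier one."""
    return COSTS[later][0] / COSTS[earlier][0]


def annual_growth_factor(later, earlier):
    """Constant yearly multiplier that would bridge the two costs."""
    years = COSTS[later][1] - COSTS[earlier][1]
    if years <= 0:
        raise ValueError("the later model must have a later release year")
    return cost_multiple(later, earlier) ** (1 / years)


if __name__ == "__main__":
    for model in ("GPT-4", "Gemini Ultra"):
        multiple = cost_multiple(model, "Transformer")
        growth = annual_growth_factor(model, "Transformer")
        print(f"{model}: {multiple:,.0f}x Transformer's cost, "
              f"roughly {growth:.1f}x per year")
```

Because this compares only the cheapest early model with the most expensive recent ones, it overstates the trend for models such as Llama 2 70B, which cost less than GPT-3 according to the same dataset.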