DeepSeek R1 and the $589B NVIDIA Crash: The Week That Shook AI's Cost Assumptions #

Q: Q: What is DeepSeek R1 and how does it compare to OpenAI o1?

A: DeepSeek R1 is a 671B parameter open-weights reasoning model that matches OpenAI o1's benchmark performance. R1 achieves 79.8% on AIME 2024 (vs o1's 74.4%) and 96.3% Codeforces percentile (vs o1's 93.4%). Key differences: R1 shows its reasoning chain (o1 hides it), costs 96% less via API ($0.55 vs $15 per million input tokens), and can be self-hosted under the MIT license. o1 still leads slightly on GPQA Diamond (75.4% vs 71.5%) and offers broader tool integration.

Q: Q: How much did it actually cost to train DeepSeek R1?

A: The final training run for DeepSeek V3 (R1's base model) cost approximately $5.576 million in GPU compute. This figure—cited in DeepSeek's technical paper—represents 2.788M H800 GPU-hours. However, this excludes research, failed experiments, and infrastructure. Total development costs likely range $100–200M when including team, prior experiments, and data pipeline. Even this broader estimate is an order of magnitude less than OpenAI or Google's frontier training runs.

Q: Q: Why did NVIDIA lose $589 billion in market cap on January 27?

A: NVIDIA stock dropped 17% as markets priced in potential demand destruction from AI training efficiency. DeepSeek's $6M training run suggested that algorithmic innovation—not just GPU scale—could achieve frontier capabilities. Traders feared that if efficient models reduce chip demand, NVIDIA's trillion-dollar TAM projections were overstated. The $589B wipeout was the largest single-day market cap loss in history, dragging down Broadcom (-17%), ASML (-7%), and the entire semiconductor sector.

Q: Q: Is DeepSeek R1 really open source?

A: Yes—R1 is released under the MIT license, the most permissive major open-weights release to date. The license permits commercial use without revenue caps, redistribution, modification, and derivative works. The only requirement is preserving the license notice. This differs from Meta's Llama (commercial restrictions above $700M revenue) and many Chinese model releases (vague usage policies). Weights are available on Hugging Face and DeepSeek's API platform without registration gates.

Q: Q: Can I run DeepSeek R1 on my own hardware?

A: Yes, but it requires significant GPU resources. Full R1 (671B parameters, 37B active) needs 8× H100 80GB GPUs for FP8 inference or 16× H100s for FP16. More practical: The distilled 32B Qwen variant (72.6% AIME 2024) runs on a single A100 or 2× RTX 4090s. For most use cases, the R1 API at $0.55/M tokens is 150x cheaper than renting equivalent self-hosted infrastructure. Self-hosting only makes sense for data sovereignty requirements or very high volume (100M+ tokens/day).

Q: Q: What is the Stargate Project and how does R1 affect it?

A: Stargate is a $500B, 4-year AI infrastructure consortium announced January 21, 2025 involving OpenAI, SoftBank, Oracle, and MGX. R1—released 24 hours earlier—creates tension with Stargate's core thesis: that massive compute buildout is necessary for AI leadership. If DeepSeek achieved frontier reasoning with $6M training runs, America's $500B infrastructure advantage may be less decisive than assumed. Stargate continues but now faces scrutiny about whether capacity or efficiency is the winning strategy.

Q: Q: How does R1-Zero differ from the full R1 model?

A: R1-Zero is trained entirely via reinforcement learning (GRPO) without any supervised fine-tuning. It develops reasoning spontaneously by being rewarded for correct answers, producing raw thinking traces that are often messy (English/Chinese mixed, poor formatting). R1 adds a "cold-start" phase with curated reasoning examples before GRPO, improving readability and coherence. R1-Zero is the pure research result; R1 is the production model. R1 scores slightly higher on most benchmarks due to better formatting and structure.

Q: Q: What benchmarks did DeepSeek R1 win against o1?

A: R1 wins on AIME 2024 (79.8% vs 74.4%), MATH-500 (97.3% vs 94.8%), and Codeforces (96.3% vs 93.4% percentile). o1 maintains narrow leads on GPQA Diamond (75.4% vs 71.5%) and MMLU (91.8% vs 90.8%). The pattern: R1 dominates mathematical and algorithmic reasoning while o1 retains slight edges on broad knowledge and graduate-level science. For coding and math automation workflows, R1's wins are substantial and economically significant given the 96% API price advantage.

Q: Q: Is DeepSeek affiliated with the Chinese government?

A: DeepSeek is owned by High-Flyer Quant, a Chinese quantitative hedge fund founded by Liang Wenfeng in 2015. High-Flyer is an independent commercial entity, not a state-owned enterprise. However, all Chinese companies operate under government influence and data access laws. DeepSeek's privacy policy states user data may be stored in China. For US government, military, or sensitive commercial use, self-hosting or US-based API providers remain the conservative choice.

Q: Q: What is Trump's Executive Order 14179 on AI?

A: EO 14179, signed January 23, 2025, rescinds Biden's AI safety EO 14110 and establishes a deregulatory federal AI stance. It removes requirements for safety testing reporting to NIST, pauses watermarking standards development, rescinds privacy impact assessments for federal AI use, and eliminates bias testing mandates. Export controls to China remain unchanged. The EO explicitly cites DeepSeek R1 as evidence that US AI regulation was impairing competitiveness.

DeepSeek R1 matches OpenAI o1's reasoning capabilities with a reported $6M training budget, triggering a $589 billion single-day wipeout of NVIDIA's market cap—the largest in stock market history. This is how a Chinese quant firm's open-weights release exposed the fragility beneath trillion-dollar AI infrastructure bets.

Table of Contents #

The Timeline: Eight Days That Rewrote the Rules
What Is DeepSeek R1? The Technical Reality
The $6 Million Training Run That Broke the Internet
Benchmark Face-Off: R1 vs o1 vs Claude 3.5 Sonnet
The NVIDIA Crash: Anatomy of a $589B Wipeout
The Open-Weights vs Closed-API War Just Escalated
DeepSeek Hits #1: What the App Store Moment Means
Stargate and the $500B Question: Infrastructure vs Efficiency
Trump's Executive Order 14179: The Regulatory Pendulum Swings
What This Means for Builders: Practical Takeaways
Frequently Asked Questions
The New Math of AI Infrastructure

The Timeline: Eight Days That Rewrote the Rules #

January 20–28, 2025 delivered the most consequential sequence of AI events since the original ChatGPT launch. What follows is the hour-by-hour reconstruction of how a Chinese quant shop released an open-weights reasoning model and accidentally triggered the largest single-day market cap destruction in history.

Date	Event	Significance
Jan 20	DeepSeek releases R1 and R1-Zero with MIT license	Open-weights reasoning model matches o1 benchmarks; weights downloadable for commercial use
Jan 21	White House announces Stargate Project	OpenAI, SoftBank, Oracle, MGX commit $500B over 4 years to US AI infrastructure
Jan 23	Trump signs EO 14179	Rescinds Biden's AI safety EO; establishes deregulatory federal AI stance
Jan 23	OpenAI launches Operator research preview	$200/mo ChatGPT Pro exclusive; first browser-agent from OpenAI
Jan 25–26	DeepSeek hits #1 on US App Store	Free AI assistant overtakes ChatGPT in downloads; consumer awareness explodes
Jan 27	NVIDIA loses $589B market cap	Stock drops 17% in single session; chip suppliers Broadcom, ASML, AMD follow
Jan 31	OpenAI releases o3-mini	First free-tier reasoning model; overt response to R1 pricing pressure

The compression is remarkable: DeepSeek announced R1 on Monday. By Friday, their consumer app dominated US mobile rankings. The following Monday, NVIDIA shed nearly $600 billion in value. By Friday, OpenAI had shipped a free reasoning model—something they'd never done before.

What actually moved markets on January 27 wasn't just R1's existence—it was the efficiency narrative. DeepSeek's technical paper disclosed that the base V3 model training cost roughly $5.576M in compute. Even accounting for the subsequent RL fine-tuning for R1, the total investment appears to be a tiny fraction of what OpenAI, Anthropic, and Google spend on a single training run. Traders started pricing in demand destruction: if frontier models can be trained for mid-seven figures, the trillion-dollar buildout of AI infrastructure looks suddenly speculative.

The timing with Stargate couldn't have been more theatrical. The $500B infrastructure consortium—announced with presidential fanfare on January 21—looked like a bet on the "more compute always wins" thesis. DeepSeek's January 20 release was a live demonstration that efficiency engineering might beat raw scale. The market chose a side on January 27.

What Is DeepSeek R1? The Technical Reality #

DeepSeek R1 is a 671-billion parameter mixture-of-experts (MoE) reasoning model released under the MIT license on January 20, 2025. Unlike prior open-weights releases that approximated GPT-3.5 quality, R1 benchmarks at parity with OpenAI's o1—the first closed frontier reasoning model. This is the technical architecture behind that claim.

The Base Architecture: DeepSeek V3 #

R1 fine-tunes from DeepSeek V3, released December 2024. The V3 architecture specs:

671B total parameters with 37B active per token (MoE with 256 experts)
128K context window with reasonable long-context retention
Multi-token prediction during training for efficiency gains
FP8 mixed-precision training on a cluster of 2,048 NVIDIA H800 GPUs

V3 was already competitive with GPT-4o and Claude 3.5 Sonnet on most benchmarks. R1 adds explicit reasoning capabilities through a novel reinforcement learning methodology.

Two Model Variants: R1-Zero and R1 #

DeepSeek released two distinct reasoning models:

R1-Zero is the pure experimental result. Trained entirely via Group Relative Policy Optimization (GRPO)—a reinforcement learning method without supervised fine-tuning (SFT) at all. The model learns reasoning by being rewarded for correct answers on math and code problems, developing chain-of-thought behaviors spontaneously.

R1 is the production-ready model. Uses a small amount of "cold-start" data (curated reasoning examples) followed by the same GRPO pipeline. The cold-start addresses R1-Zero's readability issues—its raw reasoning traces are often a mix of English and Chinese with poor formatting.

Visible Reasoning: The UX Differentiator #

Unlike OpenAI's o1 which hides its reasoning chain, DeepSeek R1 exposes its full thinking process in the API and interface. You can watch the model:

Reconsider its approach mid-stream
Verify intermediate calculations
Self-correct when it detects logical errors
Occasionally output "wait, that's not right" and restart

This visibility is technically trivial—it's just not suppressing the chain-of-thought tokens—but strategically significant. Developers can inspect how the model reached conclusions, enabling better debugging and trust calibration.

The MIT License: Actually Open #

DeepSeek released weights under the MIT license, not Apache 2.0 or a custom commercial license with usage restrictions. This means:

Commercial use permitted without revenue caps
No restrictions on serving competing products
Modification and redistribution allowed
Only requirement: include the license notice

The license choice matters. Meta's Llama models have commercial use restrictions above certain revenue thresholds. Some Chinese model releases carry vaguely worded usage policies. DeepSeek's MIT release is the most permissive major open-weights launch since the original LLaMA leak started the open-weights movement.

The $6 Million Training Run That Broke the Internet #

DeepSeek's technical paper claims the final training run for V3 cost approximately $5.576 million in raw compute. This figure—repeated endlessly in the days following R1's release—became the symbol for a new narrative: frontier AI capabilities no longer require billion-dollar training budgets. But the number deserves scrutiny.

What the $5.576M Figure Actually Includes #

The cost cited in the DeepSeek V3 paper represents:

GPU hours only: 2.788M H800 GPU-hours at assumed rental rates
Final training run: The last pass that produced the release model
Excludes: Research, failed experiments, infrastructure, salaries, data collection

This is not the full cost of developing V3 from scratch. DeepSeek's parent company, High-Flyer Quant, has been building AI infrastructure since 2019. The $5.576M is the marginal cost of the final training run, not the total investment required to reach that capability threshold.

The Real Cost Arithmetic #

Industry analysts estimating total development costs suggest a more realistic range:

Cost Component	Estimated Range
Total GPU infrastructure (amortized)	$50–100M
Research team (3+ years)	$30–60M
Failed experiments and abandoned runs	$20–40M
Data pipeline and curation	$10–20M
Final V3 training run	$5.6M
R1 fine-tuning (incremental)	$1–3M

Even with these broader estimates, DeepSeek's total investment appears to be $100–200M—substantially less than the $500M–$1B+ that OpenAI, Anthropic, or Google spend on frontier training runs.

Why the Narrative Landed Anyway #

The $6M figure went viral because it reframed the efficiency debate. Even if the true all-in cost is $100M, that remains an order of magnitude cheaper than American frontier labs. The key insight isn't the exact number—it's the efficiency methodology:

FP8 training with careful numerical stability handling
Multi-token prediction reducing total forward passes needed
Auxiliary-loss-free MoE load balancing improving expert utilization
Dual-pipe pipeline parallelism minimizing GPU idle time
Restricted H800 chips forcing algorithmic innovation over hardware brute force

DeepSeek's team was working with NVIDIA H800s—export-controlled chips with reduced interconnect bandwidth compared to H100s. The constraints forced optimization. The result suggests that algorithmic efficiency may matter more than raw compute access, a terrifying prospect for infrastructure investors betting on the "more chips = better models" thesis.

Benchmark Face-Off: R1 vs o1 vs Claude 3.5 Sonnet #

DeepSeek R1 matches or exceeds OpenAI o1 across mathematical reasoning, coding competition problems, and graduate-level science questions. The benchmark table below shows where the Chinese open-weights model wins—and where it still lags.

Mathematical Reasoning Benchmarks #

Benchmark	DeepSeek R1	OpenAI o1	Claude 3.5 Sonnet	Notes
AIME 2024	79.8%	74.4%	16.0%	American Invitational Math Examination; R1 leads significantly
MATH-500	97.3%	94.8%	78.3%	Competition mathematics problems; all models strong
GPQA Diamond	71.5%	75.4%	65.0%	Graduate-level physics, biology, chemistry; o1 still leads
MMLU	90.8%	91.8%	88.7%	General knowledge; o1 maintains narrow edge

The AIME 2024 result is particularly striking. R1's 79.8% vs o1's 74.4% represents a meaningful gap in competition-level math—the kind of complex problem-solving that reasoning models are designed for. Claude 3.5 Sonnet, not being a dedicated reasoning model, scores 16%—illustrating the architecture divide between standard LLMs and explicit reasoning systems.

Coding and Algorithmic Performance #

Benchmark	DeepSeek R1	OpenAI o1	Claude 3.5 Sonnet	Notes
Codeforces (percentile)	96.3%	93.4%	58.5%	Competitive programming; R1 achieves Elo equivalent to 96th percentile human competitors
LiveCodeBench (pass@1)	65.4%	63.4%	32.9%	Recent coding problems; R1 shows generalization
SWE-bench Verified	49.2%	48.9%	50.8%	Real-world software engineering; Claude still leads

Codeforces performance demonstrates R1's algorithmic reasoning capability. A 96.3% percentile ranking means the model solves competitive programming problems at a level approaching human contestants who've trained for years. The LiveCodeBench result—testing on problems published after training cutoff—shows this isn't memorization.

Notably, Claude 3.5 Sonnet still leads on SWE-bench Verified, which tests real-world software engineering tasks. Raw reasoning doesn't automatically translate to practical code maintenance, debugging, and system integration.

Chinese Language and Regional Benchmarks #

Benchmark	DeepSeek R1	OpenAI o1	Claude 3.5 Sonnet	Notes
C-Eval	86.5%	69.8%	67.3%	Chinese comprehensive evaluation; native advantage
CMMLU	90.0%	82.1%	81.1%	Chinese multi-task language understanding

R1's Chinese language dominance is expected—DeepSeek is a Chinese company training on Chinese data—but still significant for regional deployment decisions. For Chinese-language automation workflows, R1 offers both better performance and local data handling advantages.

The Verdict: Parity Achieved #

The benchmark story is simple: DeepSeek R1 achieves functional parity with OpenAI o1 on reasoning tasks. It wins some (AIME, Codeforces), loses narrowly on others (GPQA Diamond, MMLU), and matches on most. For builders, this means:

Reasoning capabilities are no longer exclusive to closed APIs
Model architecture matters more than training budget for this capability class
Open-weights reasoning enables on-premise deployment for sensitive data workflows
Cost arbitrage is immediate—R1 API pricing undercuts o1 by 90%+

The NVIDIA Crash: Anatomy of a $589B Wipeout #

NVIDIA lost $589 billion in market capitalization on Monday, January 27, 2025—the largest single-day wipeout in stock market history. The 17% drop erased more value than the market caps of all but 12 publicly traded companies. Here's the hour-by-hour of how DeepSeek's release triggered a sector-wide repricing.

The Pre-Market Setup #

By 4:00 AM ET, NVIDIA shares were already down 8% in pre-market trading. The weekend had been filled with:

Viral social media posts about DeepSeek's $6M training cost
Weekend downloads of the DeepSeek app surging past ChatGPT
Fear that efficiency breakthroughs would destroy AI infrastructure demand
Trump's weekend comments on AI competition with China

The narrative had shifted from "exciting open-weights model" to "existential threat to the AI hardware thesis."

Intraday Collapse #

Time (ET)	NVIDIA Price	Change	Market Context
9:30 AM	$118.42	-12%	Regular session opens; selling accelerates
10:30 AM	$112.15	-17%	Broadcom, ASML, AMD all down 10%+
12:00 PM	$108.91	-19%	Nasdaq -3%; energy/utility AI plays crater
2:00 PM	$113.25	-16%	Slight bounce; still historic losses
4:00 PM	$118.65	-17%	Close at $3.1T market cap, down from $3.7T

The selling wasn't contained to NVIDIA. Broadcom (-17%), ASML (-7%), AMD (-6%), and Marvell (-19%) all followed. The PHLX Semiconductor Index (SOX) shed 9.2%—its worst day since March 2020.

What Traders Were Actually Pricing In #

The market wasn't just reacting to one Chinese model. It was repricing the core assumption of AI infrastructure demand:

The Old Thesis: Frontier AI requires exponentially more compute. Winner-take-most dynamics favor those who buy the most GPUs. Build the infrastructure, the models will need it.

The New Fear: Algorithmic efficiency can substitute for hardware scale. If DeepSeek achieved o1-level reasoning with H800s and clever training, perhaps the trillion-dollar infrastructure buildout isn't strictly necessary.

This is the Jevons paradox in reverse. Normally, efficiency gains increase resource use (cheaper compute → more compute consumption). But markets feared that training efficiency would reduce the number of chips needed for frontier models, while distillation (using R1 to train smaller models) would reduce inference demand.

The VIX AI Premium Evaporates #

For 18 months, NVIDIA had traded at a AI scarcity premium—35–45x forward earnings vs semiconductors' historical 15–20x. That premium assumed:

Training clusters would scale exponentially
Inference demand would follow training
Competition couldn't match NVIDIA's ecosystem

DeepSeek challenged assumption #1 directly. The stock's 17% drop wasn't just about one quarter's earnings—it was about whether the 2030 TAM estimates (some analysts had projected $1T+ annual AI chip sales) were still valid.

Sector Rotation: Winners and Losers #

Losers January 27:

NVIDIA (-17%): Direct demand destruction fears
Broadcom (-17%): Custom AI chip demand questioned
Vistra (-21%): AI power demand growth assumptions
Constellation Energy (-21%): Nuclear for AI data centers

Relative Winners:

Meta (+2%): Benefits from cheaper inference; open-weights friendly
Microsoft (-2%): Azure demand questions, but owns OpenAI partnership
Chinese ADRs (+5–10%): Geopolitical AI competition narrative

The market was voting that AI application companies (who consume compute) would benefit from efficiency, while AI infrastructure plays (who sell compute) would suffer.

Analyst Response #

By market close, the downgrade cycle had begun:

KeyBanc: Cut NVIDIA target from $180 to $145; cited efficiency concerns
BofA: Maintained Buy but noted "narrative risk" from open-weights efficiency
Bernstein: "This is a shot across the bow of the scaling hypothesis"
Citi: "R1 suggests the moat is narrower than we thought"

The consensus that emerged: NVIDIA remains the dominant AI chip supplier, but the pace of demand growth may be slower than the most bullish projections assumed. The "exponential growth forever" thesis had taken damage.

By Tuesday, Damage Control #

NVIDIA's official statement Monday evening attempted narrative repair:

"DeepSeek represents an excellent AI advancement and demonstrates test-time scaling... Inference requires significant numbers of NVIDIA GPUs and high-performance networking."

The company was arguing that efficient training doesn't destroy inference demand—and that R1 itself runs on NVIDIA chips (the H800s, albeit restricted versions). Markets partially recovered; NVIDIA bounced 9% on Tuesday. But the $589B wound wouldn't heal quickly. The psychology of AI infrastructure investing had fundamentally shifted.

The Open-Weights vs Closed-API War Just Escalated #

DeepSeek R1's pricing—$0.55 per million input tokens vs OpenAI o1's $15—represents a 27x cost advantage for the open-weights challenger. This isn't just another open model; it's the first to match closed frontier reasoning capabilities at commodity prices. The strategic implications for OpenAI, Anthropic, and Google are existential.

API Pricing: The Brutal Math #

Model	Input (per 1M tokens)	Output (per 1M tokens)	Context Window	Reasoning
OpenAI o1	$15.00	$60.00	128K	Hidden
OpenAI o3-mini	$1.10	$4.40	200K	Hidden
DeepSeek R1	$0.55	$2.19	64K	Visible
DeepSeek R1 (local)	~$0.02 (electricity)	~$0.02 (electricity)	64K	Visible

DeepSeek's API undercuts o1 by 96% on input costs. Even o3-mini—OpenAI's rapid response to R1 released January 31—costs 2x more than R1. For high-volume automation workflows, this differential is transformative.

Consider a typical agent workflow making 10M input tokens daily:

o1: $150/day = $54,750/year
R1 API: $5.50/day = $2,008/year
R1 self-hosted: ~$0.20/day = $73/year (H100 rental cost)

The self-hosted calculation assumes rental pricing; if you own the hardware, marginal inference costs approach electricity-only.

The Distillation Vulnerability #

DeepSeek's paper reveals that R1's reasoning capabilities can be distilled into smaller models—and they released the distilled versions alongside the base model:

Distilled Model	Base	R1 Reasoning Injected	AIME 2024 Score
DeepSeek-R1-Distill-Qwen-1.5B	Qwen 1.5B	Yes	28.9%
DeepSeek-R1-Distill-Qwen-7B	Qwen 7B	Yes	55.5%
DeepSeek-R1-Distill-Qwen-14B	Qwen 14B	Yes	63.0%
DeepSeek-R1-Distill-Qwen-32B	Qwen 32B	Yes	72.6%
DeepSeek-R1-Distill-Llama-8B	Llama 3.1 8B	Yes	50.4%
DeepSeek-R1-Distill-Llama-70B	Llama 3.1 70B	Yes	70.0%

The 32B distilled Qwen scores 72.6% on AIME 2024—beating the original o1's 74.4% with a fraction of the parameters. A 32B parameter model running on a single GPU can match OpenAI's most capable reasoning model.

This is the nightmare scenario for closed-API labs: their reasoning capabilities can be stolen via distillation and commoditized. A $200/month ChatGPT Pro subscription provides ample tokens to train a competitive model. OpenAI's recent investigation into "distillation abuse" by Chinese labs (reported by Financial Times January 30) suggests they're aware of the threat.

Meta, Mistral, and the Open-Weights Ecosystem #

Meta's Llama strategy suddenly looks prescient. Llama 3.1 70B and 405B established that open-weights could match GPT-4 class models. R1 extends that to reasoning. Meta benefits from R1's release in several ways:

Validation of the open-weights bet—Zuckerberg's $50B+ annual AI spend looks less risky
Llama can absorb R1's reasoning via the same distillation pipeline
On-premise deployment of reasoning models becomes viable for enterprise

Mistral AI faces pressure. Their commercial models (Mistral Large) had positioned as the European alternative. R1's MIT license and Chinese origin complicates Mistral's EU-first positioning. They'll likely need to release their own reasoning model soon to maintain relevance.

Commercial Implications: The Enterprise Shift #

Enterprises evaluating AI strategy in late January 2025 face a different calculus:

Before R1: Closed APIs offered exclusive access to frontier capabilities. The premium was justified by performance gaps.

After R1: Frontier reasoning is available via:

DeepSeek API (cheapest, data to China)
Self-hosted R1 (control, infrastructure required)
Distilled variants (run anywhere, slightly reduced capability)

The data sovereignty concern is real. DeepSeek is a Chinese company. For government contracts, financial services, healthcare—self-hosting open-weights models becomes the pragmatic path. R1 enables this without sacrificing capability.

OpenAI's response—o3-mini released January 31 with free tier access—acknowledges the pricing pressure. But o3-mini still uses hidden reasoning chains and requires API dependency. The strategic question for OpenAI: can they maintain the "frontier is closed" positioning when R1 proves the frontier can be open?

DeepSeek Hits #1: What the App Store Moment Means #

DeepSeek's mobile app topped the US App Store free rankings on January 25–26, 2025—dethroning ChatGPT for the first time since the GPT-4 launch. This wasn't a technical milestone; it was a cultural one. A Chinese AI assistant that most Americans couldn't pronounce became the most downloaded free app in the country.

The Download Metrics #

App tracking services (AppMagic, data.ai) reported:

January 24: DeepSeek enters US top 10
January 25: #1 on iPhone free apps (overtakes ChatGPT, TikTok, Instagram)
January 26: Maintains #1; #2 on Google Play
January 27: Still #1 iPhone as NVIDIA crashes

Downloads accelerated to estimated 2M+ daily across iOS and Android in the US alone. Global downloads likely exceeded 5M/day. The App Store algorithm—driven by velocity of downloads and retention—rewarded the viral spike.

Why the Consumer Breakthrough Matters #

Consumer apps aren't just about consumer revenue. They're distribution moats:

ChatGPT became synonymous with "AI" because it was the first easy-to-access consumer experience
Brand awareness drives enterprise decision-making (CTOs have heard of ChatGPT; they haven't heard of Anthropic)
Data flywheel—more users → more queries → better model tuning → better product
Talent recruitment—top researchers want to work on products people use

DeepSeek's App Store dominance—however temporary—establishes it as a known brand in the US market. That's remarkable for a company with zero US marketing spend, no English-speaking PR team, and a product whose reasoning traces occasionally output Chinese characters.

The Free vs Pro Dynamic #

Unlike ChatGPT—which requires a $20/month Plus subscription for GPT-4 access—DeepSeek offers unlimited free access to R1 reasoning during the launch period. The monetization model appears to be:

Growth first: Acquire users via free R1 access
API monetization: Charge developers building on the infrastructure
Enterprise deals: Chinese domestic market + international expansion
Future premium: Potential subscription tier once habit is established

This is classic tech platform strategy—subsidize user acquisition, monetize the developer layer. OpenAI tried this with ChatGPT's free tier, then gated GPT-4 behind Plus. DeepSeek is running the same playbook with more aggressive pricing.

Media Coverage and Political Reaction #

The App Store moment triggered mainstream coverage that technical releases rarely achieve:

Wall Street Journal: "Chinese AI App Topples ChatGPT in Apple's Store"
CNBC: Round-the-clock coverage linking DeepSeek to NVIDIA crash
The Information: Analysis of High-Flyer Quant's quant-trading origins
Bloomberg: Interviews with US AI executives reacting to the competitive threat

Political dimension: The app's success came days before Trump's Executive Order 14179 and the Stargate announcement. R1's dominance provided concrete evidence for the "America is losing the AI race" narrative—likely influencing both the regulatory deregulation and the infrastructure investment urgency.

Technical Performance as Viral Feature #

The app's spread was driven by visible reasoning. Users shared screenshots of R1's thinking process:

"Wait, I made a mistake in the previous calculation. Let me recalculate..."
"Hmm, the question is asking for X, but I initially assumed Y. Let me reconsider..."
"Actually, there's another interpretation. Let me explore both paths..."

This transparency—watching the model think, doubt itself, correct—created shareable content. ChatGPT's hidden o1 reasoning can't generate screenshots. DeepSeek's UX choice became organic marketing.

Sustained vs Spiked Usage #

The open question as of January 28: Is DeepSeek a ChatGPT killer or a TikTok trend?

Sustained usage requires:

Consistent reasoning quality (early reports show some inconsistency)
Multi-modal capabilities (currently text-only vs GPT-4V's vision)
Integration ecosystem (ChatGPT has plugins, custom GPTs, API partnerships)
Trust and data handling clarity

Even if DeepSeek doesn't maintain #1, the App Store moment permanently established open-weights reasoning as a consumer product category. The distinction between "serious AI" (closed APIs) and "hobby AI" (open weights) dissolved in one weekend.

Stargate and the $500B Question: Infrastructure vs Efficiency #

The Stargate Project—announced January 21, 2025 with $500 billion committed over four years—represents the most ambitious AI infrastructure bet in history. OpenAI, SoftBank, Oracle, and MGX (Abu Dhabi fund) formed the consortium alongside the White House. Then DeepSeek R1 released 24 hours earlier, making the bet look potentially mispriced.

The Stargate Structure #

Partner	Role	Committed Capital
SoftBank	Lead financial partner; equity + debt	$100B+ (target $400B over 4 years)
OpenAI	Technology partner; operations	Operational responsibility, technical IP
Oracle	Infrastructure; data center construction	$10B+ equity; construction capacity
MGX	Middle East capital; strategic location	$50B+ sovereign wealth commitment
Microsoft	Cloud infrastructure; existing partner	Indirect via Azure; not formal equity

The funding structure—$100B immediate commitment, $400B target by 2029—assumes continuous AI demand growth requiring massive compute expansion. Texas was announced as the first site, with Abilene data centers breaking ground immediately.

The Timing Problem #

Stargate's announcement came one day after R1's release. This wasn't coincidence—it was response. The White House and OpenAI recognized that DeepSeek's efficiency breakthrough threatened the "America must dominate AI infrastructure" narrative.

The messaging emphasized:

US AI leadership requires domestic infrastructure
National security demands controlled supply chains
Job creation in construction, operations, and technology
Competition with China for AI supremacy

But the unspoken tension: Stargate bets on continued exponential compute demand. R1 suggests efficient models may reduce that demand.

The Efficiency Paradox #

Economists debate whether AI efficiency gains will increase or decrease total compute spend:

Jevons Paradox (more efficiency → more consumption):

Cheaper AI → more AI applications → more total compute
R1 enables new use cases previously uneconomical at o1 pricing
Inference demand grows as training efficiency improves

Demand Destruction (more efficiency → less spending):

Models need fewer GPUs for same capability
Companies build smaller clusters, rent less cloud capacity
NVIDIA's TAM shrinks even as AI adoption expands

Stargate assumes Jevons. The January 27 crash feared demand destruction. The truth likely lies in between: training compute requirements may plateau while inference explodes.

R1 reduces the cost to train capable models—potentially limiting the mega-cluster arms race. But it also reduces inference costs—likely accelerating deployment and increasing total inference compute consumed.

Strategic Implications for US AI Policy #

Stargate represents industrial policy—government-coordinated private investment in strategic capacity. The model borrows from:

CHIPS Act ($52B for semiconductor manufacturing)
Infrastructure Investment Act (broadband, power)
Apollo Program (government goal, private execution)

But AI differs from semiconductors or moon landings: the frontier can move via algorithmic innovation, not just physical capacity. DeepSeek proved that a quant fund with smart researchers can match OpenAI's reasoning capabilities without Stargate-scale infrastructure.

The policy question for the Trump administration: Should US AI strategy prioritize efficiency research or capacity construction?

Stargate chooses capacity. But the $6M R1 training run suggests efficiency may be the higher-leverage investment. If China can achieve frontier capabilities with algorithmic cleverness, America's $500B infrastructure advantage becomes a stranded asset.

Oracle's Bet and Ellison's Vision #

Larry Ellison's participation signals more than infrastructure. At the White House announcement, he framed Stargate in terms of AI-enabled biological surveillance:

"We can monitor for cancer via blood test... as soon as we can use AI to look at your CT scan and diagnose disease."

This is classic Ellison—conflating data center construction with futuristic healthcare applications. It also reveals the economic model behind Stargate: not just model training, but AI-powered applications (healthcare, government, military) requiring domestic, secure infrastructure.

Whether those applications justify $500B in data center spend—especially as models become runnable on commodity hardware via distillation—remains the open question that DeepSeek's efficiency made impossible to ignore.

Trump's Executive Order 14179: The Regulatory Pendulatory Swings #

President Trump signed Executive Order 14179, "Removing Barriers to American Leadership in AI," on January 23, 2025—rescinding Biden's comprehensive AI safety EO 14110 and establishing a deregulatory federal AI stance. The EO arrived squarely within the DeepSeek-R1 news cycle, framed as necessary to maintain American competitiveness against China's efficiency breakthrough.

What EO 14179 Actually Does #

The executive order—effective immediately—contains six substantive provisions:

Provision	Biden EO 14110	Trump EO 14179	Impact
AI Safety Reporting	Required for models >10^26 FLOP	Rescinded	Labs no longer must report safety test results to NIST
Watermarking Standards	CISA to develop content authentication	Paused pending review	Deepfake detection standards development halted
Privacy Impact Assessments	Required for AI handling personal data	Rescinded	Reduces compliance burden for AI deployment
Bias Testing	Federal AI use must undergo equity testing	Rescinded	Removes DEI-related AI review requirements
Export Controls	Maintained Commerce restrictions	Unchanged	Chip export controls to China remain in place
Procurement Standards	OMB mandated AI governance	Rescinded	Federal AI acquisition simplified

The rescission is sweeping—Biden's October 2023 EO established 150+ day regulatory requirements, most now void. The practical effect: American AI labs face significantly reduced federal oversight on safety testing, bias evaluation, and deployment transparency.

The Stated Rationale: Competition with China #

The EO's text explicitly cites DeepSeek and Chinese AI development:

"Recent advances by the People's Republic of China demonstrate that overly burdensome domestic regulation can impair American competitiveness... The United States must remove regulatory barriers to AI innovation to maintain technological leadership."

This reframes AI governance from risk management to industrial competition. The Biden administration emphasized safety, bias, and societal impact. The Trump administration emphasizes speed, capability, and winning the race against China.

What Didn't Change: Export Controls #

Crucially, EO 14179 did not modify chip export controls to China. The H800 chips DeepSeek used for V3/R1 were already restricted—NVIDIA designed them specifically to comply with Commerce Department export limits (reduced interconnect bandwidth vs H100s).

The export control regime continues under Commerce Department authority, not executive order. Maintaining these restrictions while deregulating domestic AI creates a strategic posture:

China: Limited access to frontier chips; must innovate via efficiency
US: Unlimited chip access; minimal safety regulation

DeepSeek R1 suggests the strategy may backfire. Constraints forced Chinese algorithmic innovation. American abundance may breed complacency.

Industry Reaction #

AI labs responded predictably along risk-tolerance lines:

OpenAI (via Sam Altman on X): "We support removing barriers to American AI development while maintaining appropriate safeguards."

Anthropic (statement): "We believe safety testing remains important even if not federally mandated. We will continue our internal evaluations."

Google (no formal statement): Already operates under minimal federal AI oversight; continues existing safety practices.

Microsoft (Satya Nadella at Stargate event): "Regulatory clarity helps us invest with confidence."

The consensus corporate position: deregulation reduces compliance costs; self-regulation fills any governance gaps. Critics—including former Biden AI advisors and AI safety researchers—argue that competitive pressure will erode voluntary safety commitments.

Implications for Open-Weights Release #

EO 14179's deregulatory posture may affect model release policies. Biden's EO included language about "dual-use foundation models" and potential reporting requirements for model weights releases. Trump rescinded those provisions.

The practical effect: No federal requirement to evaluate safety before open-weights release. Meta's Llama releases, Mistral's models, and DeepSeek's R1 face no US regulatory scrutiny—only terms of service and potential civil liability.

For builders, this means accelerated open-weights availability but reduced safety documentation. The models you download may have less transparent safety testing than under the Biden framework.

What This Means for Builders: Practical Takeaways #

DeepSeek R1's release changes the calculus for production AI architecture. As of January 28, 2025, reasoning-class capabilities are available via API at 4% of OpenAI's pricing—or free if you self-host. Here's how to operationalize this shift.

When to Use R1 vs o1 vs Claude #

Use Case	Recommended Model	Rationale
Math/coding reasoning	DeepSeek R1	AIME 2024 + Codeforces wins; visible reasoning helps debugging
Enterprise SaaS (US)	Claude 3.5 Sonnet	Reliable API, Anthropic safety standards, SOC2 compliance
Multi-modal workflows	GPT-4o / o1	Vision, audio, broader tool ecosystem
On-premise sensitive data	Self-hosted R1	Data never leaves infrastructure; full control
High-volume automation	R1 API	$0.55/M tokens vs o1's $15; 96% cost reduction
Client-facing products	Claude or GPT-4	Brand trust; customers recognize OpenAI/Anthropic

R1 is the right choice when: You need reasoning capabilities at scale, can tolerate Chinese API provider (or self-host), and want visible chain-of-thought for debugging.

R1 is the wrong choice when: You need multi-modal (vision/audio), require US-based data handling certifications, or your users expect "ChatGPT-level" brand recognition.

Self-Hosting Economics #

Running R1 locally requires significant GPU infrastructure:

Deployment	GPU Requirements	Hourly Cost (Rent)	Throughput
Full R1 (FP8)	8× H100 80GB	$32–40/hr	~10–20 tokens/sec
Full R1 (FP16)	16× H100 80GB	$64–80/hr	~10–20 tokens/sec
R1-AWQ 4-bit	4× H100 80GB	$16–20/hr	~30–50 tokens/sec
Distilled 32B	1× A100 80GB	$2–3/hr	~50–100 tokens/sec

Breakeven analysis: At 10M tokens/day usage, R1 API costs ~$2,000/year. Self-hosting 8×H100 at $35/hr × 8,760 hours = $306,600/year. API is 150x cheaper than rental self-hosting for typical workloads.

Self-hosting only makes sense for:

Data sovereignty requirements (healthcare, government, finance)
Very high volume (100M+ tokens/day)
Latency-sensitive (need <100ms time-to-first-token)
Existing GPU infrastructure (sunk cost; marginal electricity only ~$0.02/hr)

The Distillation Strategy #

For most production use cases, distilled R1 variants offer the best economics:

# Example: 32B distilled Qwen with R1 reasoning
# Runs on single A100 80GB or 2×4090
# 72.6% AIME 2024 (vs R1's 79.8%, o1's 74.4%)

The 32B distilled model achieves near-frontier reasoning on commodity hardware. For internal tools, automation workflows, and non-customer-facing applications, this is the pragmatic deployment path.

Limitation: Distilled models don't update their knowledge; they're frozen at training cutoff. For tasks requiring recent information, you need:

RAG (retrieval-augmented generation) with vector database
API-based model with live search
Fine-tuning pipeline for periodic updates

Multi-Model Architecture #

Smart AI infrastructure is now polyglot by default:

┌─────────────────────────────────────────────────────────────┐
│                    Request Router                           │
├──────────────┬─────────────────┬──────────────────────────────┤
│ Simple Q&A   │ Reasoning       │ Creative/General            │
│ (Cheap)      │ (R1/o1)         │ (Claude/GPT)                │
├──────────────┼─────────────────┼──────────────────────────────┤
│ Haiku        │ R1 API          │ Claude Sonnet               │
│ GPT-4o-mini  │ o1 (fallback)   │ GPT-4o                      │
│ Distilled    │ Self-hosted R1  │                             │
└──────────────┴─────────────────┴──────────────────────────────┘

Route by:

Query classification (is reasoning required?)
User tier (free vs paid get different models)
Cost budget (failover to cheaper models under load)
Capability requirement (multi-modal → GPT-4o; coding → R1)

Migration Checklist #

If you're currently using o1 and considering R1:

Evaluate reasoning quality on your specific task distribution (not just benchmarks)
Test Chinese language handling (R1 occasionally outputs Chinese in reasoning traces)
Verify tool use requirements (R1 doesn't yet support function calling like o1)
Assess data residency needs (DeepSeek API = Chinese servers)
Load test latency (R1 API has different latency characteristics)
Plan fallback strategy (o1 as backup if R1 fails)
Update privacy policy (if switching API providers)
Cost model validation (factor in API migration effort)

The January 2025 Infrastructure Recalculation #

For teams planning AI infrastructure spend:

Before R1: Budget assumed frontier reasoning required OpenAI/Anthropic API or massive self-hosted clusters.

After R1:

Reasoning available at 4% of previous cost
Self-hosting viable via distillation
Multi-model strategy economically practical
Compute spend can shift from training to inference/application development

Recommendation: Defer major training infrastructure commitments. The R1 efficiency breakthrough suggests algorithmic improvement may outperform hardware scaling. Invest in application layer, data pipelines, and user experience—the infrastructure economics are shifting too rapidly to lock in multi-year GPU commitments.

This is the practical legacy of January 2025: frontier AI capabilities became commodity capabilities. The builders who win will be those who ship applications fastest, not those who buy the most GPUs.

Frequently Asked Questions #

Q: What is DeepSeek R1 and how does it compare to OpenAI o1? #

A: DeepSeek R1 is a 671B parameter open-weights reasoning model that matches OpenAI o1's benchmark performance. R1 achieves 79.8% on AIME 2024 (vs o1's 74.4%) and 96.3% Codeforces percentile (vs o1's 93.4%). Key differences: R1 shows its reasoning chain (o1 hides it), costs 96% less via API ($0.55 vs $15 per million input tokens), and can be self-hosted under the MIT license. o1 still leads slightly on GPQA Diamond (75.4% vs 71.5%) and offers broader tool integration.

Q: How much did it actually cost to train DeepSeek R1? #

A: The final training run for DeepSeek V3 (R1's base model) cost approximately $5.576 million in GPU compute. This figure—cited in DeepSeek's technical paper—represents 2.788M H800 GPU-hours. However, this excludes research, failed experiments, and infrastructure. Total development costs likely range $100–200M when including team, prior experiments, and data pipeline. Even this broader estimate is an order of magnitude less than OpenAI or Google's frontier training runs.

Q: Why did NVIDIA lose $589 billion in market cap on January 27? #

A: NVIDIA stock dropped 17% as markets priced in potential demand destruction from AI training efficiency. DeepSeek's $6M training run suggested that algorithmic innovation—not just GPU scale—could achieve frontier capabilities. Traders feared that if efficient models reduce chip demand, NVIDIA's trillion-dollar TAM projections were overstated. The $589B wipeout was the largest single-day market cap loss in history, dragging down Broadcom (-17%), ASML (-7%), and the entire semiconductor sector.

Q: Is DeepSeek R1 really open source? #

A: Yes—R1 is released under the MIT license, the most permissive major open-weights release to date. The license permits commercial use without revenue caps, redistribution, modification, and derivative works. The only requirement is preserving the license notice. This differs from Meta's Llama (commercial restrictions above $700M revenue) and many Chinese model releases (vague usage policies). Weights are available on Hugging Face and DeepSeek's API platform without registration gates.

Q: Can I run DeepSeek R1 on my own hardware? #

A: Yes, but it requires significant GPU resources. Full R1 (671B parameters, 37B active) needs 8× H100 80GB GPUs for FP8 inference or 16× H100s for FP16. More practical: The distilled 32B Qwen variant (72.6% AIME 2024) runs on a single A100 or 2× RTX 4090s. For most use cases, the R1 API at $0.55/M tokens is 150x cheaper than renting equivalent self-hosted infrastructure. Self-hosting only makes sense for data sovereignty requirements or very high volume (100M+ tokens/day).

Q: What is the Stargate Project and how does R1 affect it? #

A: Stargate is a $500B, 4-year AI infrastructure consortium announced January 21, 2025 involving OpenAI, SoftBank, Oracle, and MGX. R1—released 24 hours earlier—creates tension with Stargate's core thesis: that massive compute buildout is necessary for AI leadership. If DeepSeek achieved frontier reasoning with $6M training runs, America's $500B infrastructure advantage may be less decisive than assumed. Stargate continues but now faces scrutiny about whether capacity or efficiency is the winning strategy.

Q: How does R1-Zero differ from the full R1 model? #

A: R1-Zero is trained entirely via reinforcement learning (GRPO) without any supervised fine-tuning. It develops reasoning spontaneously by being rewarded for correct answers, producing raw thinking traces that are often messy (English/Chinese mixed, poor formatting). R1 adds a "cold-start" phase with curated reasoning examples before GRPO, improving readability and coherence. R1-Zero is the pure research result; R1 is the production model. R1 scores slightly higher on most benchmarks due to better formatting and structure.

Q: What benchmarks did DeepSeek R1 win against o1? #

A: R1 wins on AIME 2024 (79.8% vs 74.4%), MATH-500 (97.3% vs 94.8%), and Codeforces (96.3% vs 93.4% percentile). o1 maintains narrow leads on GPQA Diamond (75.4% vs 71.5%) and MMLU (91.8% vs 90.8%). The pattern: R1 dominates mathematical and algorithmic reasoning while o1 retains slight edges on broad knowledge and graduate-level science. For coding and math automation workflows, R1's wins are substantial and economically significant given the 96% API price advantage.

Q: Is DeepSeek affiliated with the Chinese government? #

A: DeepSeek is owned by High-Flyer Quant, a Chinese quantitative hedge fund founded by Liang Wenfeng in 2015. High-Flyer is an independent commercial entity, not a state-owned enterprise. However, all Chinese companies operate under government influence and data access laws. DeepSeek's privacy policy states user data may be stored in China. For US government, military, or sensitive commercial use, self-hosting or US-based API providers remain the conservative choice.

Q: What is Trump's Executive Order 14179 on AI? #

A: EO 14179, signed January 23, 2025, rescinds Biden's AI safety EO 14110 and establishes a deregulatory federal AI stance. It removes requirements for safety testing reporting to NIST, pauses watermarking standards development, rescinds privacy impact assessments for federal AI use, and eliminates bias testing mandates. Export controls to China remain unchanged. The EO explicitly cites DeepSeek R1 as evidence that US AI regulation was impairing competitiveness.

Q: Will this change how much AI compute companies need to buy? #

A: Likely yes—training compute requirements may plateau while inference compute grows. R1 demonstrates that algorithmic efficiency can substitute for hardware scale in training. Companies may build smaller clusters or extend hardware lifecycles. However, cheaper inference (via R1's 96% price reduction) may increase total AI deployment, driving inference demand. The Jevons paradox likely applies: efficiency gains increase total consumption, but the mix shifts from training to inference and from mega-clusters to distributed deployment.

Q: Should I switch from OpenAI/Claude to DeepSeek R1? #

A: Switch to R1 if you need high-volume reasoning at low cost, can tolerate Chinese API or self-host, and want visible chain-of-thought debugging. Stay with OpenAI/Claude if you need multi-modal (vision/audio), US-based data handling with compliance certifications, established tool ecosystems, or customer-facing products where brand recognition matters. The optimal strategy is often polyglot: use R1 for math/coding reasoning at scale, Claude for enterprise SaaS, and GPT-4o for multi-modal workflows. R1 expands your options; it doesn't need to replace everything.

The New Math of AI Infrastructure #

The week of January 20–28, 2025 rewrote the economic assumptions underlying the AI industry. DeepSeek R1 proved that frontier reasoning capabilities no longer require billion-dollar training budgets. NVIDIA's $589 billion crash proved that markets hadn't priced in this possibility. Stargate's $500 billion bet proved that industrial policy moves slower than algorithmic innovation.

For builders, the new math is straightforward: capability per dollar just increased 25x. What cost $15,000 in OpenAI o1 API calls now costs $550 in DeepSeek R1. What required a $100M training cluster might now be achievable for $10M with the right optimization techniques.

This doesn't mean AI infrastructure investment ends. It means the character of investment changes:

From training clusters to inference capacity—deploying models at scale matters more than training new ones
From GPU scale to GPU efficiency—algorithmic optimization beats hardware brute force
From closed moats to open competition—frontier capabilities become table stakes faster
From single-model bets to polyglot architectures—routing between R1, Claude, GPT-4o, and specialized models based on query type

The January correction also clarifies strategic positioning. American AI labs retain advantages in multi-modal capabilities, tool ecosystems, and enterprise trust. Chinese AI labs have demonstrated efficiency and algorithmic innovation under constraint. Open-weights communities gain viability as neither American nor Chinese commercial interests control their infrastructure.

For the automation engineers and growth teams I work with, the immediate implication is clear: you now have access to reasoning-class AI at commodity prices. The constraint shifts from "can we afford frontier capabilities?" to "can we design workflows that leverage reasoning effectively?"

The teams that capitalize on this transition fastest—integrating R1 for cost-sensitive automation, Claude for enterprise reliability, o3-mini for free-tier reasoning—will compound advantages while competitors wait for the dust to settle.

The dust won't settle. January 2025 was the week that proved frontier AI is a commodity, not a monopoly. The next phase belongs to builders who ship.

Ready to operationalize this shift? I'm William Spurlock—AI automation engineer and custom web designer. I help teams build production-grade AI workflows, n8n automation pipelines, and multi-agent systems that leverage models like R1 for real business outcomes.

Book an AI automation strategy call and we'll map the right model + infrastructure stack for your specific workflows—whether that's self-hosted R1 for cost optimization, Claude for enterprise reliability, or a polyglot routing system that uses the best model for each task.

Related Reading:

Claude 3.5 Sonnet and the Artifacts Launch — How Anthropic's interactive UI changed AI workflow design
Anthropic MCP Launch: Model Context Protocol Explained — The open standard connecting AI to external tools
OpenAI o1 Preview: Reasoning Models Arrive — The original reasoning model launch and what it enabled

Published January 28, 2025. Last updated January 28, 2025. Benchmarks reflect scores reported in DeepSeek's technical paper as of publication date. Market figures reflect January 27, 2025 closing prices.

DeepSeek R1 and the $589B NVIDIA Crash: The Week That Shook AI's Cost Assumptions

Table of Contents