主题综述

数字台账 · Numbers That Matter

活文档 · 最近更新 2026-07-02 · 覆盖 283 篇访谈

教 AI 经济学与规模感的硬数字。同一主题族内按时间排序以显趋势线;相互冲突或彼此更新的数字相邻放置并标注对照;最亮眼的 11 条附逐字引用。

更新日志

推理成本与算力经济

成本曲线:同等能力的推理价格以每年 10x—100x 的速度崩塌,而总消耗按 Jevons 悖论指数上升;算力与收入的换算关系首次有了公开口径。

"We exited at 23 at 2 billion in ARR, so 200 megawatts, 2 billion. We exited at 24 at 6 billion, so 6 billion, 600 megawatts. And we exited last year a little over 20 billion, 20 billion, 2 gigawatts. Actually, it's been accelerating."
「我们 23 年收官时 ARR 是 20 亿美元——200 兆瓦对 20 亿;24 年收官 60 亿——600 兆瓦对 60 亿;去年收官略超 200 亿——2 吉瓦对 200 亿。而且这个比例还在加速。」
Sarah Friar (OpenAI CFO) · Episode 12 - State of the AI Industry
"The cost of a million tokens on an O1 equivalent model in December of 24 was about 26 bucks. And then in November of 25, it was 30 cents. So we saw another 88X drop, not 88%, you know, 88 times cheaper in 11 months for that next generation of models."
「o1 等级模型一百万 token 的价格,24 年 12 月约 26 美元,25 年 11 月只要 30 美分——又一次 88 倍的下降。不是 88%,是便宜 88 倍,只用了 11 个月。」

训练与 CapEx

两条主线:数据质量 > 数据数量(专家数据溢价成为行业定价);capex 从美元计价换成吉瓦计价,回本算术仍没有答案。

"10 gigawatts is like $400 billion, something like that. And that $400 billion will have to be largely funded by their offtake, right, their revenues, which is growing exponentially. It has to be funded by their capital, the money they've raised through equity, and whatever debt they can raise."
「10 吉瓦大约就是 4000 亿美元。这 4000 亿主要要靠他们的承购——也就是指数增长中的收入——再加上股权融资和能募到的债务来买单。」
"If you invest $150 billion in NVIDIA chips, that's about $300 billion of data center investments. And to pay that back, the person using the compute needs to earn a 50% gross margin. So there's about $600 billion of revenue that needs to get generated. ... The question behind the question was, is the customer's customer healthy?"
「买 1500 亿美元的英伟达芯片,约等于 3000 亿美元的数据中心投资;要回本,用这些算力的人得按 50% 毛利挣出约 6000 亿美元收入。……问题背后的问题是:客户的客户健康吗?」
"You could have gotten something that was ChatGPT 3.5 level maybe back in 2018 or 2019 with a couple people."
「你本可以在 2018 或 2019 年用几个人得到一些达到 ChatGPT 3.5 水平的东西。」
"Training neural nets and LLMs specifically is a huge amount of code. But all of that code is actually complexity from efficiency. It's just because you need it to go fast. If you don't need it to go fast and you just care about the algorithm, then that algorithm actually is 200 lines of Python."
「训练神经网络、尤其是 LLM,代码量巨大。但那些代码其实都是效率带来的复杂度——只是因为你需要它跑得快。如果不需要快、只关心算法本身,那这个算法其实就是 200 行 Python。」
"We built in the last 15 months more Azure capacity than we built in the first 15 years."
「过去 15 个月我们建成的 Azure 容量,超过了最初 15 年建成的总和。」

定价·毛利·商业模型

定价锚正从软件预算(美国 ~$1T)切换到人力预算($20T—$80T);客服是第一个 ROI 完全可量化的用例。

增长与规模

收入爬坡的新基准线:$1B→$10B 从二十年压缩到一年;agent 采用曲线(任务时长每 4-7 个月翻倍)是能力侧最重要的单一指标。

人效与组织

AI 写代码的占比、单人杠杆与组织形态:人均产出取代人头数成为核心指标,token 支出开始超过工资单。

"These numbers are just totally crazy, right? Like 4% of all commits in the world is just way more than I imagined. And like you said, it still feels like the starting point. These are also just public commits. So we actually think if you look at private repositories, it's quite a bit higher than that."
「这些数字简直疯狂——全世界 4% 的提交,远超我的想象。而且这仍然只像是起点。这还只是公开提交;看私有仓库的话,比例还要高不少。」
"We used to be 6,000 or over 7,000 people and we're now less than 3,000. And I didn't ask for a single dime to do all this."
「我们过去有 6000 或超过 7000 人,现在不到 3000 人。我做这一切没要一分钱。」
"Like right now, we're spending more on tokens for our internal agents than we are on employee headcount. And I think most businesses are going to look like that."
「就在现在,我们花在内部 agent token 上的钱,已经超过了员工人头开支。我认为大多数企业都会变成这样。」

估值与融资

AI 溢价(A 轮 +30% 持续一年未消)、幂律集中与泡沫算术。

其他关键数字

评估工程、世界模型、能源与其他不肯归类但值得记住的数字。

"AI alone, 88%. ... But then they gave the AI to the doctors. The doctors improved from 73% to 76%. The AI got degraded from 88 to 76%."
「仅靠 AI,准确率 88%。……但后来他们让医生使用 AI。医生从 73% 提高到 76%。AI 从 88% 降到 76%。」