Concise Summary简洁概述
Yudkowsky distinguishes two intellectual styles. Toolbox thinkers hold that wisdom is a large bag of context-sensitive techniques and distrust anyone who claims one method beats all others. Lawful thinkers hold that there are objective truths — shortest paths, Carnot limits, Bayesian updates — that govern outcomes whether or not anyone invokes them. The essay argues these camps routinely talk past each other because Toolboxers hear every Law as a prescriptive command ("always do X"), while Lawists hear every disclaimer ("X is just one tool") as a sneaky motte-and-bailey. The resolution: Laws are descriptive facts, not prescriptions; knowing the ideal doesn't force you to follow it, but it does give you a reference point to improve from.
Yudkowsky 区分了两种思维风格。工具箱思维者认为,智慧是一大包情境敏感的技巧,并对任何声称某法一劳永逸的人保持警惕。规律思维者认为,存在客观真理——最短路径、卡诺极限、贝叶斯更新——无论有没有人援引,它们都在支配结果。文章论证,这两派经常鸡同鸭讲:工具箱派把每条规律都听成处方式命令(「总是做X」),而规律派则把每一句免责声明(「X只是工具之一」)都当作狡猾的蒙混。化解之道在于:规律是描述性事实,不是处方;知道理想并不强迫你去遵循,但它确实为你提供了一个可以努力改进的参照点。
Infographic信息图
Toolbox style
工具箱风格
Wisdom is a large bag of context-sensitive tricks; anyone pushing a single method is probably ignorant of the rest of the bag.
智慧是一大包情境敏感的技巧;推崇单一方法的人,很可能对其他工具一无所知。
Lawful style
规律风格
Some truths — shortest paths, thermodynamic limits, Bayesian updates — hold universally, independent of whether they are computationally reachable.
某些真理——最短路径、热力学极限、贝叶斯更新——普遍成立,与它们是否可被计算实现无关。
The durable misunderstanding
持久的误解
Toolboxers hear Laws as prescriptions; Lawists hear toolbox disclaimers as motte-and-bailey retreats. Both misread the other's actual claim.
工具箱派把规律听成处方;规律派把工具箱的免责声明听成「稻草人-堡垒」式退缩。双方都误读了对方的真实主张。
Left-hand rule vs. shortest path
靠左手法则 vs. 最短路径
Using the left-hand rule in a maze is wise given limited memory, even if it doesn't yield the optimal path — knowing the ideal still helps you see room for improvement.
在迷宫里使用靠左手法则,在记忆有限的情况下是明智的,即便它不能给出最优路径——知道理想状态仍有助于看出改进空间。
Laws as descriptive facts
规律作为描述性事实
The triangle inequality isn't a moral command; it's simply true. Viewing Bayesian updates as descriptive truths rather than normative ideals is the deeper level of understanding.
三角不等式不是道德命令;它就是真的。将贝叶斯更新视为描述性事实而非规范性理想,才是更深层的理解。
Detailed Summary详细概述
The dichotomy introduced
Yudkowsky begins with a tl;dr that itself illustrates the tension: Toolbox thinking values a diverse bag of context-sensitive techniques and is suspicious of anyone claiming a single best method. Lawful thinking distinguishes between descriptive truths (how things are), normative ideals (what is optimal in principle), and prescriptive ideals (what you ought to do given your constraints). Laws may assert that some path is optimal even if no practical algorithm reaches it.
The Twitter debate as case study
A near-perfect real example follows: a Twitter exchange between David Chapman, Graham Rowe, and Julia Galef about whether rationalism posits "one weird trick." Chapman defines rationalism as claiming a single ultimate criterion; Julia counters that the normative model is unreachable in practice, so the real program is assembling approximating tricks. Chapman's rejoinder is that rationality is itself just "a bag of tricks that are more-or-less applicable in different situations."
Yudkowsky argues this exchange crystallises a durable mutual misunderstanding:
- Msr. Toolbox hears every Law claim as "throw away all your other tools; here is the one recipe."
- Msr. Lawful hears every toolbox disclaimer as a covert motte-and-bailey: asserting normative status when convenient, then retreating to "it's just one tool" under pressure.
The maze analogy
Every traversable maze has a spatially shortest path. This is a fact — not a command. That fact does not mean you can stand at an intersection and "just pick whichever branch is on the shortest path." The left-hand rule (keep your left hand on the wall) is a wise heuristic for agents with limited memory, even though it may not yield the shortest route. Following the left-hand rule consistently is actually better in expectation than making a single locally smart deviation, because one wall-crossing can trap you on a disconnected island forever.
The key move: shortness is a property of paths; a tendency to produce shorter paths is a property of recipes. These are different levels of description. Knowing there is a shortest path lets you evaluate recipes against a stable standard and motivates improvements (e.g., building a phone app that maps the maze).
The Toolbox reply and its limits
The Toolbox objection: letting yourself think about "shortest paths" will, in practice, cause you to conflate your current recipe with the ideal — and then insist that someone in a wheelchair take the "optimal" stair-laden path, or that mortgage-backed securities are Gaussian because the math is neater.
Yudkowsky's concession: yes, this does happen, and lecturing Msr. Law on diverse toolboxes is a genuinely useful context-sensitive tool. But prescribing that lecture to everyone is itself a context-insensitive move — exactly the error you're warning against.
Both as metatools, then the deeper resolution
Not invoking a golden mean: both thinking styles are metatools in the metatoolbox, and you should deploy whichever fits the context. Yudkowsky's prescription for the Toolbox-inclined: try to see Laws as descriptive statements, not normative ones.
The triangle inequality is not a normative ideal — it is simply true of spatial distances in Euclidean geometry. The Carnot limit is not a cult object; it is a thermodynamic fact. Bayesian updating is not best understood as "the right way to reason"; it is best understood as a descriptive law: you cannot extract more evidence from an observation than is given by its likelihood ratio — regardless of your recipe, regardless of your preferences, even if you own a bicycle.
Understanding coherence theorems at the deepest level means reading them not as "you'll be incoherent if you don't use probabilities" but as: "if your behavior is incoherent in way X, it will correspond to a dominated strategy in way Y — universally, over all tools in all toolboxes."
Closing note
For things like Fun Theory there may be no deeper fact behind the normative ideal. But in math and science, normative ideals like the Carnot cycle or Bayesian decision theory are almost always manifestations of a simpler descriptive fact we are tempted to relabel as "normative." Shedding the normative framing and understanding the underlying fact is the deeper move — though understanding the normative framing first is a perfectly reasonable path to get there.
二元对立的引入
Yudkowsky 以一段「长话短说」开篇,而这段话本身就体现了所要讨论的张力:工具箱式思维重视多样的情境敏感技巧,对任何声称存在单一最优方法的人保持怀疑。规律式思维则区分描述性真理(事物本来如何)、规范性理想(原则上什么是最优的)与处方性理想(在你的约束条件下你应该做什么)。规律可以主张某条路径是最优的,即便没有任何可实际执行的算法能达到它。
推特辩论作为案例
随后是一个近乎完美的真实案例:David Chapman、Graham Rowe 和 Julia Galef 在推特上关于「理性主义是否声称存在一个奇技」的交流。Chapman 将理性主义定义为声称存在单一终极标准;Julia 反驳说,规范性模型在实践中不可达,所以真正的纲领是拼凑出一些近似技巧。Chapman 的反驳是:理性本身只是「一袋在不同情境下或多或少适用的技巧」。
Yudkowsky 认为,这场对话结晶出了一种持久的相互误解:
- 工具箱先生听到的每一条规律主张都是:「扔掉你所有其他工具;这是那个唯一的方法。」
- 规律先生听到的每一个工具箱式免责声明都是一场隐蔽的「稻草人-堡垒」:在方便时主张规范性地位,一旦受到挑战就退守「这只是工具之一」。
迷宫类比
每一个可以穿越的迷宫都有一条空间上最短的路径。这是一个事实——不是命令。这一事实并不意味着你可以站在一个路口,「随手挑走在最短路径上的那条岔路」。靠左手法则(把左手贴着墙走)是一个对记忆有限的行动者而言明智的启发式方法,即便它不能给出最短路线。一贯地遵循靠左手法则,其实比偶尔做出一个局部聪明的偏离期望值更优——因为一次翻墙可能让你永远困在一座孤立小岛里。
关键一步:短暂性是路径的属性;倾向于生成更短路径是方法的属性。 这是两个不同层次的描述。知道存在最短路径,让你能把方法对照稳定的标准加以评估,并激励改进(例如,开发一个能绘制迷宫地图的手机 App)。
工具箱派的反驳及其局限
工具箱派的反对意见:让自己去思考「最短路径」,在实践中会导致你把当前的方法与理想混同——于是你坚持让坐轮椅的人走那条「最优」的布满楼梯的路,或者因为数学更简洁就假设房贷支持证券服从高斯分布。
Yudkowsky 的让步:是的,这确实会发生,就「拥有多样工具箱的重要性」对规律先生进行说教,是一个真正有用的、情境敏感的工具。但把这堂课普遍开给所有人,本身就是一个情境不敏感的举动——恰恰是你所警告的那种错误。
两者皆为元工具,再到更深的化解
这里并非诉诸黄金中道谬误:两种思维风格都是元工具箱中的元工具,你应该视情境部署哪一种。Yudkowsky 给工具箱倾向者开出的处方:尝试把规律视为描述性陈述,而非规范性的。
三角不等式不是规范性理想——它就是欧几里得几何中关于空间距离的真理。卡诺极限不是膜拜对象;它是热力学事实。贝叶斯更新最好不要理解为「正确的推理方式」;而要理解为一条描述性规律:你从一次观察中提取的证据不可能多于其似然比所给出的——无论你用什么方法,无论你的偏好如何,即便你拥有一辆自行车。
在最深层次上理解一致性定理,意味着把它们读作:不是「如果你不用概率论你就会不一致」,而是:「如果你在 X 意义上不一致,它将对应于 Y 意义上的受支配策略——这对所有工具箱中所有工具普遍成立。」
结语
对于「有趣理论」之类的东西,规范性理想背后也许不存在更深的自然事实。但在数学和科学中,像卡诺循环或贝叶斯决策论这样的规范性理想,几乎总是某个更简单的描述性事实的体现——我们只是被诱惑着把它重新贴上「规范性」的标签。放下规范性框架、去理解底层事实,才是更深的一步——尽管先理解规范性框架,是通往那一步的完全合理的路径。
FAQ常见问答
What exactly is the difference between 'descriptive', 'normative', and 'prescriptive' as Yudkowsky uses them?Yudkowsky 所用的「描述性」「规范性」「处方性」三个层次究竟有何区别?
A descriptive truth simply holds — the triangle inequality is true of Euclidean distances whether or not anyone cares. A normative ideal says what is optimal in principle (the Carnot cycle is the best possible heat engine; Bayesian updating is the best possible belief revision). A prescriptive rule says what you ought to do given your actual resources and context — which may differ from the normative ideal if you can't compute an exact Bayesian update or if you need to avoid stairs. The essay argues most confusion arises from collapsing these three levels into one.
描述性真理就是成立的——三角不等式对欧几里得距离成立,不管有没有人在乎。规范性理想说的是原则上什么是最优的(卡诺循环是理论上最优的热机;贝叶斯更新是原则上最优的信念修正)。处方性规则说的是,在你实际的资源与情境下你应该做什么——如果你无法计算精确的贝叶斯更新,或者你需要避开楼梯,那它可能与规范性理想不同。文章认为,大多数混乱来源于把这三个层次压缩成一个。
Why does the left-hand rule example matter so much?靠左手法则这个例子为何如此重要?
It shows that the wise prescriptive choice (use the left-hand rule given limited memory) can differ from the normative ideal (walk the shortest path), and that this is fine — but only because we can see the difference. If you couldn't conceptualise 'shortest path' at all, you'd have no benchmark to motivate building a phone-app improvement. The example also illustrates that a single locally smart deviation from a rule can be catastrophically worse in expectation than the rule itself.
它表明,明智的处方性选择(在记忆有限的情况下使用靠左手法则)可以与规范性理想(走最短路径)不同,而这完全没问题——但前提是我们能看见这个差距。如果你根本无法概念化「最短路径」,你就没有任何基准来激励自己去开发手机 App 这样的改进。这个例子还说明,对规则的单次局部聪明偏离,其期望结果可能比规则本身糟糕得多。
Is Yudkowsky saying everyone should just become a 'Lawful' thinker?Yudkowsky 是在说每个人都应该成为「规律式」思维者吗?
No. He explicitly invokes the fallacy of the golden mean caveat to avoid being read that way, then argues both styles are metatools and context-sensitive deployment of both is better than either dogma. His main message is narrower: extreme Toolbox thinkers sometimes harm themselves by refusing to reason about unreachable ideals at all, and the corrective is to understand Laws as descriptive facts — not to abandon the toolbox.
不是。他明确点出黄金中道谬误的警告,以避免被这样解读,然后论证两种风格都是元工具,情境敏感地部署两者优于任何一种教条。他的核心信息更窄:极端的工具箱思维者有时因拒绝对不可达理想进行推理而伤害自己,纠正之道是把规律理解为描述性事实——而不是放弃工具箱。
What is the 'motte-and-bailey' move Yudkowsky attributes to Toolboxers?Yudkowsky 归咎于工具箱派的「稻草人-堡垒」动作是什么?
A motte-and-bailey is an argument pattern where someone defends a bold claim (the "bailey") but when challenged, retreats to a trivially true safe claim (the "motte"). In the toolbox context, the Lawist suspects the Toolboxer of asserting "rationality/Bayesian inference is the right approach" (bailey) and retreating to "it's just one useful tool" (motte) under pressure. Yudkowsky grants this suspicion has some real basis — the world does contain people who play this game — while arguing it is wrong to assume everyone behaves this way.
「稻草人-堡垒」是一种论证模式:某人捍卫一个大胆主张(「领地」),但一旦被挑战,就退守到一个显然为真的安全主张(「堡垒」)。在工具箱语境下,规律派怀疑工具箱派先主张「理性/贝叶斯推断是正确方法」(领地),被追问后又退守「这只是一个有用的工具」(堡垒)。Yudkowsky 承认这种怀疑有一定的真实基础——世界上确实存在玩这种把戏的人——同时论证把所有人都假定成这样是错的。
How does the Carnot engine example strengthen the argument?卡诺热机的例子如何强化了论证?
Carnot efficiency is a thermodynamic fact, not a normative ideal you join a cult to worship. Knowing it lets engineers identify specific gaps — heat loss from the combustion chamber, for instance — as targets for improvement. If engineers refused to conceptualise the Carnot limit because "no real engine can reach it," they'd lose an invaluable diagnostic tool. The same logic applies to Bayesian updating: refusing to reason about the ideal update, on grounds that it's unreachable, eliminates a reference that helps you identify where your current method loses information.
卡诺效率是热力学事实,不是你加入某个教派去膜拜的规范性理想。知道这一点,工程师就能找出具体的改进目标——例如燃烧室的热损失。如果工程师以「真实热机达不到卡诺极限」为由拒绝概念化这一极限,就会失去一个无价的诊断工具。同样的逻辑适用于贝叶斯更新:以不可达为由拒绝对理想更新进行推理,会消除一个帮助你判断当前方法在哪里丢失信息的参照。
What does 'even if you own a bicycle' mean in context?文中反复出现的「即便你拥有一辆自行车」是什么意思?
It is Yudkowsky's recurring punchline for a key point: the triangle inequality is true of spatial distances in Euclidean geometry regardless of your mode of transportation, your knowledge, your preferences, or your practical options. Adding the qualifier "even if you own a bicycle" dramatises that these are not conditional prescriptions but unconditional facts about the structure of the situation. Similarly, the fact that you can't exceed the Carnot efficiency holds whether or not you're even thinking about thermodynamics at the time.
这是 Yudkowsky 反复使用的一个妙语,用以强调一个关键点:三角不等式对欧几里得几何中的空间距离成立,无论你用什么交通方式、你掌握什么知识、你有什么偏好或实际选项。加上「即便你拥有一辆自行车」这个限定语,是为了戏剧性地表明:这些不是有条件的处方,而是关于情境结构的无条件事实。类似地,无法超越卡诺效率这一事实,不管你当时是否在思考热力学,都成立。
In-depth Analysis · Pros & Cons深入解读 · 优缺点
This essay attempts something philosophically ambitious: diagnosing a structural miscommunication between two intelligent camps and offering a conceptual key (Laws as descriptive rather than normative) that dissolves the confusion. It sits at the intersection of philosophy of science, decision theory, and epistemology, and is one of Yudkowsky's more technically precise pieces.
这篇文章尝试完成一件哲学上颇具野心的事:诊断两个聪明阵营之间的结构性误沟通,并给出一把概念性钥匙(规律是描述性的而非规范性的)来化解这一混乱。它处于科学哲学、决策论与认识论的交叉点,是 Yudkowsky 技术上较为精确的文章之一。
- Precise three-way distinction精确的三层区分Separating descriptive / normative / prescriptive is genuinely clarifying and not standard in popular epistemology writing. The maze/left-hand-rule example makes the distinction concrete and memorable.将描述性/规范性/处方性分开,是真正有澄清意义的,在通俗认识论写作中并不常见。迷宫/靠左手法则的例子使这一区分具体而令人难忘。
- Real case study instead of hypothetical真实案例而非假设情境Grounding the dichotomy in an actual Twitter exchange (Chapman-Galef) gives empirical weight and helps readers recognize the pattern in real-world disagreements they've encountered.将二元对立扎根于一场真实的推特交流(Chapman-Galef),赋予了文章实证分量,帮助读者在自身遇到的真实分歧中识别这一模式。
- Symmetric charity对称的善意诠释Yudkowsky steelmans both sides — the Toolbox objection about idealism's corrupting effects (wheelchair path, Gaussian MBS) is presented seriously, not dismissed. This makes the eventual critique more credible.Yudkowsky 对两方都给出了最强版本的论证——关于理想主义的腐化效果(轮椅路径、高斯房贷证券)的工具箱派反对意见被认真对待,而非轻描淡写。这让最终的批判更具说服力。
- The 'bicycle' heuristic「自行车」启发式The recurring punchline 'even if you own a bicycle' elegantly reminds the reader that Laws are not conditional on practical reachability — a rhetorical technique that reinforces the descriptive-not-normative point without repeating the whole argument.反复出现的妙语「即便你拥有一辆自行车」,优雅地提醒读者:规律不以实践可达性为条件——这一修辞技巧无需重复全部论证,就强化了「描述性而非规范性」这一核心点。
- The descriptive/normative reframe is not always available描述性/规范性重新框架并非总是可行Yudkowsky's prescription — 'just see Laws as descriptive facts' — works cleanly for mathematical and physical examples (Euclidean geometry, Carnot cycle). But in decision theory and ethics, the normative framing often isn't merely a surface gloss on a deeper descriptive fact; it's load-bearing. The essay gestures at this with Fun Theory but doesn't fully reckon with how far the strategy extends.Yudkowsky 的处方——「只需把规律视为描述性事实」——在数学和物理例子中(欧几里得几何、卡诺循环)运作流畅。但在决策论和伦理学中,规范性框架往往不只是更深描述性事实的表面粉饰;它本身就是承重的。文章对「有趣理论」有所暗示,但没有充分考量这一策略能延伸多远。
- Overstates coherence theorems' force高估一致性定理的力量The essay implies that coherence theorems 'universally' show that any non-Bayesian tool is executing a dominated strategy. In practice, coherence theorems have significant scope conditions (complete preference orderings, idealized agent models), and the dominated-strategy interpretation requires additional premises. The claim is not false but is stated with more confidence than the technical literature warrants.文章暗示,一致性定理「普遍地」表明任何非贝叶斯工具都在执行受支配策略。但实际上,一致性定理有重要的适用范围限制(完整的偏好排序、理想化的智能体模型),而「受支配策略」的解释需要额外的前提。这一主张并非虚假,但被陈述得比技术文献所能支撑的更有把握。
- The motte-and-bailey attribution may itself be uncharitable「稻草人-堡垒」的归因本身可能缺乏善意Yudkowsky accuses Toolboxers of potentially playing a motte-and-bailey game, while also granting they might genuinely mean the weaker claim. But the structural analysis reads the Toolboxer's position somewhat uncharitably: a sophisticated Toolboxer like Chapman is not merely hedging; they have a positive epistemological thesis (metarationality as its own skill). The essay partly acknowledges this but doesn't engage with it at full depth.Yudkowsky 指控工具箱派可能在玩「稻草人-堡垒」把戏,同时也承认他们可能真诚地只持有较弱的主张。但这一结构分析对工具箱派立场的解读有些缺乏善意:像 Chapman 这样的成熟工具箱派并非只在打含糊球;他们有一个正向的认识论论题(元理性本身是一种技能)。文章部分承认了这一点,但没有深入与之交锋。
- Practical guidance remains thin实践指导仍然薄弱After establishing that both styles are metatools and that seeing Laws as descriptive is the deeper move, the essay doesn't give the reader much traction on when to switch modes. The closing hint about 'Laws you don't know yet' is genuinely interesting but deliberately left undeveloped, which may frustrate readers hoping for concrete heuristics.在确立两种风格都是元工具、把规律视为描述性的是更深进路之后,文章并未给读者提供多少关于何时切换模式的操作性把握。关于「你尚不知晓的规律」的结尾暗示颇为有趣,但被刻意留作未展开,这可能会让希望获得具体启发式方法的读者感到沮丧。
A lucid and philosophically careful essay that successfully names a real and common form of talking-past-each-other in rationality discourse. Its core contribution — the three-way descriptive/normative/prescriptive distinction and the maze analogy — is genuinely clarifying and worth the read. The main caveat is that the reframing strategy ('Laws as descriptive facts') works more cleanly for math and physics than for ethics or social epistemology, and the essay slightly undersells the genuine insight in the Toolbox tradition it is critiquing.
一篇思路清晰、哲学上审慎的文章,成功地为理性话语中一种真实而常见的「鸡同鸭讲」现象命名。其核心贡献——描述性/规范性/处方性三层区分以及迷宫类比——确实具有澄清意义,值得一读。主要的保留意见是:「把规律视为描述性事实」的重新框架策略,在数学与物理领域比在伦理学或社会认识论中运作得更顺畅;文章也稍微低估了它所批判的工具箱传统中真正有见地的成分。
Original Text原文
Tl;dr:
I've noticed a dichotomy between "thinking in toolboxes" and "thinking in laws".
The toolbox style of thinking says it's important to have a big bag of tools that you can adapt to context and circumstance; people who think very toolboxly tend to suspect that anyone who goes talking of a single optimal way is just ignorant of the uses of the other tools.
The lawful style of thinking, done correctly, distinguishes between descriptive truths, normative ideals, and prescriptive ideals. It may talk about certain paths being optimal, even if there's no executable-in-practice algorithm that yields the optimal path. It considers truths that are not tools.
Within nearly-Euclidean mazes, the triangle inequality - that the path AC is never spatially longer than the path ABC - is always true but only sometimes useful. The triangle inequality has the prescriptive implication that if you know that one path choice will travel ABC and one path will travel AC, and if the only pragmatic path-merit you care about is going the minimum spatial distance (rather than say avoiding stairs because somebody in the party is in a wheelchair), then you should pick the route AC. But the triangle inequality goes on governing Euclidean mazes whether or not you know which path is which, and whether or not you need to avoid stairs.
Toolbox thinkers may be extremely suspicious of this claim of universal lawfulness if it is explained less than perfectly, because it sounds to them like "Throw away all the other tools in your toolbox! All you need to know is Euclidean geometry, and you can always find the shortest path through any maze, which in turn is always the best path."
If you think that's an unrealistic depiction of a misunderstanding that would never happen in reality, keep reading.
Here's a recent conversation from Twitter which I'd consider a nearly perfect illustration of the toolbox-vs.-laws dichotomy:
David Chapman: "By rationalism, I mean any claim that there is an ultimate criterion according to which thinking and acting could be judged to be correct or optimal... Under this definition, 'rationalism' must go beyond 'systematic methods are often useful, hooray!'... A rationalism claims there is one weird trick to correct thinking, which guarantees an optimal result. (Some rationalisms specify the trick; others insist there must be one, but that it is not currently knowable.) A rationalism makes strongly normative judgments: everyone ought to think that way."
Graham Rowe: "Is it fair to say that rationalists see the world entirely through rationality while meta-rationalists look at rationality as one of many tools (that they can use fluently and appropriately) to be used in service of a broader purpose?"
David Chapman: "More-or-less, I think! Although I don’t think rationalists do see the world entirely through rationality, they just say they think they ought to."
Julia Galef: "I don't think the 'one weird trick' description is accurate. It's more like: there's one correct normative model in theory, which cannot possibly be approximated by a single rule in practice, but we can look for collections of 'tricks' that seem like they bring us closer to the normative model. e.g., 'On the margin, taking more small risks is likely to increase your EV' is one example."
David Chapman: "The element that I’d call clearly meta-rational is understanding that rationality is not one well-defined thing but a bag of tricks that are more-or-less applicable in different situations."
Julia then quoted a paper mentioning "The best prescription for human reasoning is not necessarily to always use the normative model to govern one's thinking." To which Chapman replied:
"Baron’s distinction between 'normative' and 'prescriptive' is one I haven’t seen before. That seems useful and maybe key. OTOH, if we’re looking for a disagreement crux, it might be whether a normative theory that can’t be achieved, even in principle, is a good thing."
I'm now going to badly stereotype this conversation in the form I feel like I've seen it many times previously, including e.g. in the discussion of p-values and frequentist statistics. On this stereotypical depiction, there is a dichotomy between the thinking of Msr. Toolbox and Msr. Lawful that goes like this:
Msr. Toolbox: "It's important to know how to use a broad variety of statistical tools and adapt them to context. The many ways of calculating p-values form one broad family of tools; any particular tool in the set has good uses and bad uses, depending on context and what exactly you do. Using likelihood ratios is an interesting statistical technique, and I'm sure it has its good uses in the right contexts. But it would be very surprising if that one weird trick was the best calculation to do in every paper and every circumstance. If you claim it is the universal best way, then I suspect you of blind idealism, insensitivity to context and nuance, ignorance of all the other tools in the toolbox, the sheer folly of callow youth. You only have a hammer and no real-world experience using screwdrivers, so you claim everything is a nail."
Msr. Lawful: "On complex problems we may not be able to compute exact Bayesian updates, but the math still describes the optimal update, in the same way that a Carnot cycle describes a thermodynamically ideal engine even if you can't build one. You are unlikely to find a superior viewpoint that makes some other update even more optimal than the Bayesian update, not without doing a great deal of fundamental math research and maybe not at all. We didn't choose that formalism arbitrarily! We have a very broad variety of coherence theorems all spotlighting the same central structure of probability theory, saying variations of 'If your behavior cannot be viewed as coherent with probability theory in sense X, you must be executing a dominated strategy and shooting off your foot in sense Y'."
I currently suspect that when Msr. Law talks like this, Msr. Toolbox hears "I prescribe to you the following recipe for your behavior, the Bayesian Update, which you ought to execute in every kind of circumstance."
This also appears to me to frequently turn into one of those awful durable forms of misunderstanding: Msr. Toolbox doesn't see what you could possibly be telling somebody to do with a "good" or "ideal" algorithm besides executing that algorithm.
It would not surprise me if there's a symmetrical form of durable misunderstanding where a Lawist has trouble processing a Toolboxer's disclaimer: "No, you don't understand, I am not trying to describe the one true perfect optimal algorithm here, I'm trying to describe a context-sensitive tool that is sometimes useful in real life." Msr. Law may not see what you could possibly be doing with a supposedly "prudent" or "actionable" recipe besides saying that it's the correct answer, and may feel very suspicious of somebody trying to say everyone should use an answer while disclaiming that they don't really think it's true. Surely this is just the setup for some absurd motte-and-bailey where we claim something is the normative answer, and then as soon as we're challenged we walk back and claim it was 'just one tool in the toolbox'.
And it's not like those callow youths the Toolboxer is trying to lecture don't actually exist. The world is full of people who think they have the One True Recipe (without having a normative ideal by which to prove that this is indeed the optimal recipe given their preferences, knowledge, and available computing power).
The only way I see to resolve this confusion is by grasping a certain particular abstraction and distinction - as a more Lawfully inclined person might put it. Or by being able to deploy both kinds of thinking, depending on context - as a more Toolbox-inclined person might put it.
It may be that none of my readers need the lecture at this point, but I've learned to be cautious about that sort of thing, so I'll walk through the difference anyways.
Every traversable maze has a spatially shortest path; or if we are to be precise in our claims but not our measurements, a set of spatially shortest-ish paths that are all nearly the same distance.
We may perhaps call this spatially shortest path the "best" or "ideal" or "optimal" path through the maze, if we think our preference for walking shorter distances is the only pragmatically important merit of a path.
That there exists some shortest path, which may even be optimal according to our preferences, doesn't mean that you can come to an intersection at the maze and "just choose whichever branch is on the shortest path".
And the fact that you cannot, at an intersection, just choose the shorter path, doesn't mean that the concepts of distance and greater or lesser distance aren't useful.
It might even be that the maze-owner could truthfully tell you, "By the way, this right-hand turn here keeps you on the shortest path," and yet you'd still be wiser to take the left-hand turn... because you're following the left-hand rule. Where the left-hand rule is to keep your left hand on the wall and go on walking, which works for not getting lost inside a maze whose exit is connected to the start by walls. It's a good rule for agents with sharply bounded memories who can't always remember their paths exactly.
And if you're using the left-hand rule it is a terrible, terrible idea to jump walls and make a different turn just once, even if that looks like a great idea at the time, because that is an excellent way to get stuck traversing a disconnected island of connected walls inside the labyrinth.
So making the left-hand turn leads you to walk the shortest expected distance, relative to the other rules you're using. Making the right-hand turn instead, even if it seemed locally smart, might have you traversing an infinite distance instead.
But then you may not be on the shortest path, even though you are following the recommendations of the wisest and most prudent rule given your current resources. By contemplating the difference, you know that there is in principle room for improvement. Maybe that inspires you to write a maze-mapping, step-counting cellphone app that lets you get to the exit faster than the left-hand rule.
And the reason that there's a better recipe isn't that "no recipe is perfect", it isn't that there exists an infinite sequence of ever-better roads. If the maze-owner gave you a map with the shortest path drawn in a line, you could walk the true shortest path and there wouldn't be any shorter path than that.
Shortness is a property of paths; a tendency to produce shorter paths is a property of recipes. What makes a phone app an improvement is not that the app is adhering more neatly to some ideal sequence of left and right turns, it's that the path is shorter in a way that can be defined independently of the app's algorithms.
Once you can admit a path can be "shorter" in a way that abstracts away from the walker - not better, which does depend on the walker, but shorter \- it's hard not to admit the notion of there being a shortest path.
I mean, I suppose you could try very hard to never talk about a shortest path and only talk about alternative recipes that yield shorter paths. You could diligently make sure to never imagine this shorterness as a kind of decreased distance-in-performance-space from any 'shortest path'. You could make very sure that in your consideration of new recipes, you maintain your ideological purity as a toolboxer by only ever asking about laws that govern which of two paths are shorter, and never getting any inspiration from any kind of law that governs which path is shortest.
In which case you would have diligently eliminated a valuable conceptual tool from your toolbox. You would have carefully made sure that you always had to take longer roads to those mental destinations that can be reached the fastest by contemplating properties of ideal solutions, or distance from ideal solutions.
But why? Why would you?
I think at this point the Toolbox reply - though I'm not sure I could pass its Ideological Turing Test - might be that idealistic thinking has a great trap and rottenness at its heart.
It might say:
Somebody who doesn't wisely shut down all this thinking about "shortest paths" instead of the left-hand rule as a good tool for some mazes - someone who begins to imagine some unreachable ideal of perfection, instead of a series of apps that find shorter paths most of the time - will surely, in practice, begin to confuse the notion of the left-hand rule, or their other current recipe, with the shortest path.
After all, nobody can see this "shortest path", and it's supposedly a virtuous thing. So isn't it an inevitable consequence of human nature that people will start to use that idea as praise for their current recipes?
And also in the real world, surely Msr. Law will inevitably forget the extra premise involved with the step from "spatially shortest path" to "best path"- the contextual requirement that our only important preference was shorter spatial distances so defined. Msr. Law will insist that somebody in a wheelchair go down the "best path" of the maze, even though that path involves going up and down a flight of stairs.
And Msr. Law will be unable to mentally deal with a helicopter overflying the maze that violates their ontology relative to which "the shortest path" was defined.
And it will also never occur to Msr. Law to pedal around the maze in a bicycle, which is a much easier trip even if it's not the shortest spatial distance.
And Msr. Law will assume that the behavior of mortgage-backed securities is independently Gaussian-random because the math is neater that way, and then derive a definite theorem showing a top-level tranche of MBSs will almost never default, thus bringing down their trading firm -
To all of which I can only reply: "Well, yes, that happens some of the time, and there are contextual occasions where it is a useful tool to lecture Msr. Law on the importance of having a diverse toolbox. But it is not a universal truth that everyone works like that and needs to be prescribed the same lecture! You need to be sensitive to context here!"
There are definitely versions of Msr. Law who think the universal generalization they've been told about is a One Weird Trick That Is All You Need To Know; people who could in fact benefit from a lecture on the importance of diverse toolboxes.
There are also extreme toolbox thinkers could benefit from a lecture on the importance of thinking that considers unreachable ideals, and how to get closer to them, and the obstacles that are moving us away from them.
Not to commit the fallacy of the golden mean or anything, but the two viewpoints are both metatools in the metatoolbox, as it were. You're better off if you can use both in ways that depend on context and circumstance, rather than insisting that only toolbox reasoning is the universally best context-insensitive metaway to think.
If that's not putting the point too sharply.
Thinking in terms of Law is often useful. You just have to be careful to understand the context and the caveats: when is the right time to think in Law, how to think in Law, and what type of problems call for Lawful thinking.
Which is not the same as saying that every Law has exceptions. Thermodynamics still holds even at times, like playing tennis, when it's not a good time to be thinking about thermodynamics. If you thought that every Law had exceptions because it wasn't always useful to think about that Law, you'd be rejecting the metatool of Law entirely, and thinking in toolbox terms at a time when it wasn't useful to do so.
Are there Laws of optimal thought governing the optimal way to contextualize and caveat, which might be helpful for finding good executable recipes? The naturally Lawful thinker will immediately suspect so, even if they don't know what those Laws are. Not knowing these Laws won't panic a healthy Lawful thinker. Instead they'll proceed to look around for useful yet chaotic-seeming prescriptions to use now instead of later - without mistaking those chaotic prescriptions for Laws, or treating the chaos of their current recipes as proof that there's no good normative ideals to be had.
Indeed, it can sometimes be useful to contemplate, in detail, that there are probably Laws you don't know. But that's a more advanced metatool in the metatoolbox, useful in narrower ways and in fewer contexts having to do with the invention of new Laws as well as new recipes, and I'd rather not strain Msr. Toolbox's credulity any further.
To close out, one recipe I'd prescribe to reduce confusion in the toolbox-inclined is to try to see the Laws as descriptive statements, rather than being any kind of normative ideal at all.
The idea that there's a shortest path through the maze isn't a "normative ideal" instead of a "prescriptive ideal", it's just true. Once you define distance there is in fact a shortest path through the maze.
The triangle inequality might sound very close to a prescriptive rule that you ought to walk along AC instead of ABC. But actually the prescriptive rule is only if you want to walk shorter distances ceteris paribus, only if you know which turn is which, only if you're not trying to avoid stairs, and only if you're not taking an even faster route by getting on a bicycle and riding outside the whole maze to the exit. The prescriptive rule "try walking along AC" isn't the same as the triangle inequality itself, which goes on being true of spatial distances in Euclidean or nearly-Euclidean geometries - whether or not you know, whether or not you care, whether or not it's useful to think about at any given moment, even if you own a bicycle.
The statement that you can't have a heat-pressure engine more efficient than a Carnot cycle isn't about gathering in a cultish circle to sing praises of the Carnot cycle as being the ideally best possible kind of engine. It's just a true fact of thermodynamics. This true fact might helpfully suggest that you think about obstacles to Carnot-ness as possible places to improve your engine - say, that you should try to prevent heat loss from the combustion chamber, since heat loss prevents an adiabatic cycle. But even at times when it's not in fact useful to think about Carnot cycles, it doesn't mean your heat engine is allowed on those occasions to perform better than a Carnot engine.
You can't extract any more evidence from an observation than is given by its likelihood ratio. You could see this as being true because Bayesian updating is an often-unreachable normative ideal of reasoning, so therefore nobody can do better than it. But I'd call it a deeper level of understanding to see it as a law saying that you can't get a higher expected score by making any different update. This is a generalization that holds over both Bayes-inspired recipes and non-Bayes-inspired recipes. If you want to assign higher probability to the correct hypothesis, it's a short step from that preference to regarding Bayesian updates as a normative ideal; but the idea begins life as a descriptive assertion, not as a normative assertion.
It's a relatively shallow understanding of the coherence theorems to say "Well, they show that if you don't use probabilities and expected utilities you'll be incoherent, which is bad, so you shouldn't do that." It's a deeper understanding to state, "If you do something that is incoherent in way X, it will correspond to a dominated strategy in fashion Y. This is a universal generalization that is true about every tool in the statistical toolbox, whether or not they are in fact coherent, whether or not you personally prefer to avoid dominated strategies, whether or not you have the computing power to do any better, even if you own a bicycle."
I suppose that when it comes to the likes of Fun Theory, there isn't any deeper fact of nature underlying the "normative ideal" of a eudaimonic universe. But in simpler matters of math and science, a "normative ideal" like the Carnot cycle or Bayesian decision theory is almost always the manifestation of some simpler fact that is so closely related to something we want that we are tempted to take one step to the right and view it as a "normative ideal". If you're allergic to normative ideals, maybe a helpful course would be to discard the view of whatever-it-is as a normative ideal and try to understand it as a fact.
But that is a more advanced state of understanding than trying to understand what is better or best. If you're not allergic to ideals, then it's okay to try to understand why Bayesian updates are often-unreachable normative ideals, before you try to understand how they're just there.
长话短说:
我注意到一种二元对立——「工具箱式思维」与「规律式思维」。
工具箱式思维认为,拥有一大包可根据情境灵活运用的工具至关重要;极度工具箱化的人往往会怀疑:凡是大谈存在某种单一最优方法的人,不过是对其他工具的用途一无所知。
规律式思维,若做得正确,能够区分描述性真理、规范性理想与处方性理想。它可以谈论某些路径是最优的,即便不存在任何可实际执行并产生最优路径的算法。它考量的是那些并非工具的真理。
在近似欧几里得的迷宫中,三角不等式——AC 路径的空间距离永不长于 ABC 路径——始终成立,但并非总是有用。三角不等式有着处方性的蕴含:如果你知道某个路径选择将经过 ABC,另一个将经过 AC,并且你所在意的唯一实用路径优点是走最短空间距离(而非比如说因为同行者坐轮椅而需要避开楼梯),那么你应该选择 AC 路线。但三角不等式对欧几里得迷宫的支配,并不取决于你是否知道哪条路是哪条,也不取决于你是否需要避开楼梯。
如果向工具箱思维者解释得不够透彻,他们可能会对这种普遍规律性的主张极度怀疑,因为在他们听来,这像是在说:「把工具箱里所有其他工具都扔掉吧!你只需要知道欧几里得几何,就能永远找到穿越任何迷宫的最短路径,而那条路径又总是最优路径。」
如果你认为这是对一种永远不会在现实中发生的误解的不切实际描绘,请继续读下去。
这里有一段最近的推特对话,我认为它几乎是工具箱 vs. 规律这一二元对立的完美示例:
David Chapman:「所谓理性主义,我指的是任何主张存在某个终极标准,使得思维与行动可以被判断为正确或最优的论断……在这个定义下,'理性主义'必须超越'系统方法有时很有用,万岁!'……理性主义声称存在一个奇技来达到正确思维,从而保证最优结果。(某些理性主义明确指出了这个技巧;另一些则坚称它一定存在,只是目前尚不可知。)理性主义提出强有力的规范性判断:每个人都应该那样思考。」
Graham Rowe:「是否可以说,理性主义者完全通过理性来看世界,而元理性主义者则把理性视为许多工具(他们能流畅而恰当地使用)之一,用于服务更广泛的目的?」
David Chapman:「大致如此,我想!虽然我认为理性主义者实际上并不完全通过理性来看世界,他们只是说他们认为自己应该那样做。」
Julia Galef:「我不认为'一个奇技'的描述是准确的。更接近于:理论上存在一个正确的规范性模型,但它在实践中根本不可能被单一规则所近似,不过我们可以寻找一些似乎让我们更接近规范性模型的技巧集合。例如,'在边际上承担更多小风险,可能会提高你的期望价值'就是一个例子。」
David Chapman:「我会称之为明显元理性的那个要素,是理解理性并非一个定义明确的单一事物,而是一包在不同情境下或多或少适用的技巧。」
Julia 随后引用了一篇论文中提到的话:「对人类推理的最优处方,不一定总是用规范性模型来支配自己的思维。」对此 Chapman 回应:
「Baron 对'规范性'与'处方性'的区分,是我以前没见过的。这似乎有用,也许是关键所在。另一方面,如果我们在寻找分歧的核心,也许在于:一个即便在原则上也无法实现的规范性理论,是否是一件好事。」
现在,我要对这场对话进行一番拙劣的刻板化处理,以我觉得自己多次见过的形式呈现它,包括例如在 p 值与频率派统计的讨论中。在这幅刻板图景中,工具箱先生与规律先生的思维之间存在如下二元对立:
工具箱先生:「学会灵活运用种类繁多的统计工具、并根据情境加以调整,这一点非常重要。各种计算 p 值的方法构成了一大类工具;这组工具中的任何特定工具都有好的用法和坏的用法,取决于情境和你具体怎么操作。使用似然比是一种有趣的统计技术,我确信在合适的情境下它有其优点。但如果声称那是每篇论文、每种情况下都应做的唯一最优计算,那就太令人意外了。如果你声称它是普遍最优方法,我怀疑你不过是盲目的理想主义、对情境和细微差别不敏感、对工具箱里所有其他工具一无所知,以及冒冒失失的年轻人的愚蠢。你只有一把锤子,没有使用螺丝刀的实际经验,所以你认为所有东西都是钉子。」
规律先生:「在复杂问题上,我们也许无法计算精确的贝叶斯更新,但这套数学仍然描述了最优更新——就像卡诺循环描述热力学理想热机,即便你无法构造一台。你不太可能找到一个优越的视角,使得某种其他更新比贝叶斯更新更加最优,这需要大量基础数学研究,也许根本就不存在。我们不是随意选择那套形式主义的!我们有非常广泛的一致性定理,它们都聚焦于概率论的同一核心结构,以各种变体的方式说:'如果你的行为无法被视为在 X 意义上与概率论相一致,你必定在执行受支配策略,并以 Y 的方式搬起石头砸自己的脚。'」
我目前猜测,当规律先生这样说时,工具箱先生听到的是:「我向你开出以下行为处方——贝叶斯更新,你应该在每一种情境下执行它。」
在我看来,这似乎也经常演变成那种可怕的持久性误解之一:工具箱先生不明白,把一个算法称为「好的」或「理想的」,除了意味着「去执行那个算法」之外,还能告诉人们什么。
我不会对称的情况感到意外——规律主义者有时也难以消化工具箱派的免责声明:「不,你不理解,我在这里并非试图描述唯一真正完美的最优算法,我试图描述的是一个在现实生活中有时有用的、情境敏感的工具。」规律先生也许不明白,一个据称「审慎」或「可操作」的方法,除了声称它就是正确答案之外,还能有什么用处;他也许会对那种声称每个人都应该使用某个答案、同时又声称不认为它真的正确的人,产生强烈的怀疑。这肯定只是某种荒谬的「稻草人-堡垒」把戏的铺垫——我们先声称某样东西是规范性答案,一旦受到挑战,就立刻退守说那「不过是工具箱中的一个工具」。
而且,工具箱派试图说教的那些冒冒失失的年轻人,并非真的不存在。世界上充满了认为自己拥有那个唯一真正方法的人(而没有一个规范性理想,可以证明这确实是给定其偏好、知识和可用计算能力后的最优方法)。
我所看到的化解这种混乱的唯一方式,是掌握某个特定的抽象概念和区分——正如一个更偏向规律的人可能会这样说。或者是能够根据情境分别运用两种思维——正如一个更偏向工具箱的人可能会说的那样。
也许我的读者此时都不需要这堂课,但我已经学会对此类事情保持谨慎,所以我还是会把其中的区别讲一遍。
每一个可以穿越的迷宫都有一条空间上最短的路径;或者,如果我们要在主张上精确而在测量上不那么精确,则是一组空间上「差不多最短」的路径,它们的距离几乎相同。
我们也许可以称这条空间上最短的路径为穿越迷宫的「最优」或「理想」路径——如果我们认为对走更短距离的偏好,是路径唯一实用意义上重要的优点。
存在某条最短路径(甚至可能按照我们的偏好是最优的),并不意味着你可以站在迷宫的一个路口,「随手挑走在最短路径上的那条岔路」。
而你无法在路口随手挑出更短的那条路,也不意味着距离的概念以及距离更长或更短的概念毫无用处。
甚至可能是这样:迷宫的主人可以告诉你:「顺便说一下,这里向右拐可以让你保持在最短路径上。」——但你还是应该向左拐……因为你正在遵循靠左手法则。靠左手法则就是让左手一直贴着墙走,它适用于那些出口通过墙壁与入口相连的迷宫,能保证你不会迷路。对于那些记忆力严格有限、无法精确记住自己走过路径的行动者来说,这是一条好规则。
而且,如果你正在使用靠左手法则,则翻过墙壁、偶尔改变一次转向,是一个糟糕得要命的主意——即便那在当时看起来是个绝佳的主意——因为那是让你陷入在迷宫内部永无止境地穿越一座孤立的相连墙壁小岛的绝佳方式。
所以,向左转能让你走出期望最短的距离——相对于你所使用的其他规则而言。相反,向右转,即便局部上看起来更聪明,可能反而让你走无限远的路。
但你也许根本就没走在最短路径上,即便你遵循的是给定你当前资源的最明智、最审慎的规则的建议。通过思考这种差别,你知道原则上存在改进的余地。也许这激励你去开发一个能绘制迷宫地图、计算步数的手机 App,让你比靠左手法则更快地找到出口。
而存在更好的方法的原因,不是「没有任何方法是完美的」,不是存在一个无穷的越来越好的路的序列。如果迷宫主人给了你一张标有最短路径的地图,你就能走真正的最短路径,不会再有比这更短的路径。
短暂性是路径的属性;倾向于生成更短路径是方法的属性。让手机 App 成为一种改进的,不是因为 App 更整洁地遵循了某个理想的左右转序列,而是因为路径更短——这种短暂性可以被独立于 App 算法地定义。
一旦你能承认路径可以是「更短的」——以一种从行走者那里抽象出来的方式——不是更好(这确实取决于行走者),而是更短——你就很难不承认存在一条最短路径的概念。
我是说,我想你可以非常努力地从不谈论最短路径,而只谈论能产生更短路径的替代方法。你可以勤勤恳恳地确保自己从不把这种「更短性」想象成从任何「最短路径」出发的某种性能空间中的减小距离。在考虑新方法时,你可以非常确保自己作为一个工具箱派保持意识形态纯洁性——只问哪些规律支配两条路径哪条更短,永远不从任何支配哪条路径最短的规律中获得启发。
在这种情况下,你就已经勤勤恳恳地从你的工具箱中删除了一个有价值的概念工具。你已经小心翼翼地确保自己总是需要走更长的路才能到达那些可以通过思考理想解的属性、或距离理想解的距离来最快抵达的心智目的地。
但为什么呢?你为什么要那样做?
我认为,工具箱派在这一点上的回答——虽然我不确定我能否通过其意识形态图灵测试——可能是:理想主义思维有一个巨大的陷阱和腐烂在其核心。
它可能会说:
那些没有明智地停止所有这些关于「最短路径」(而不是把靠左手法则当作某些迷宫的好工具)的思考的人——那些开始想象某种遥不可及的完美理想、而不是一系列大多数时候能找到更短路径的 App 的人——在实践中,肯定会开始把靠左手法则或他们目前的其他方法,与最短路径混同起来。
毕竟,没人能看见这条「最短路径」,它又被认为是一种美德。那么,根据人性,人们难道不会必然地开始用那个概念来为自己当前的方法唱赞歌吗?
并且,在现实世界中,规律先生肯定最终会忘记从「空间上最短路径」到「最优路径」这一步中涉及的额外前提——即:我们唯一重要的偏好是如此定义的更短空间距离这一情境要求。规律先生会坚持让坐轮椅的人走迷宫的「最优路径」,即便那条路径要上下楼梯。
而规律先生将无法在心智上处理一架飞越迷宫的直升机——那架直升机违反了他们的本体论,相对于那一本体论,「最短路径」才得以被定义。
而且规律先生也永远不会想到骑着自行车绕迷宫走——即便那是一种轻松得多的旅程,哪怕它不是最短的空间距离。
而规律先生会假设房贷支持证券的行为是独立高斯随机的,因为那样数学更简洁——然后推导出一个确定性定理,证明房贷支持证券的顶层分档几乎永远不会违约,从而把自己所在的交易公司拖垮——
对此我只能回答:「好吧,是的,这种事有时确实会发生,在某些情境下,就'拥有多样工具箱的重要性'对规律先生进行说教,确实是一个有用的工具。但并非每个人都是这样运作的、都需要被开出同一堂说教课,这并非一个普遍真理!你需要在这里对情境保持敏感!」
确实存在这样的规律先生:他们认为自己被告知的普遍概括就是那个「你只需了解的唯一奇技」;这些人确实能从一堂关于多样工具箱重要性的说教课中获益。
也存在极端的工具箱思维者,他们能从一堂关于思考遥不可及的理想、如何更接近它们、以及是什么障碍让我们偏离它们的重要性的说教课中获益。
并不是要犯黄金中道谬误或类似的错误,但两种观点都是元工具箱中的元工具,可以这么说。如果你能根据情境和环境灵活运用两者,你会比坚持认为只有工具箱式推理才是放之四海而皆准的最优元思维方式,要好得多。
如果这样说没有把重点说得太尖锐的话。
规律式思维通常是有用的。你只需要注意理解情境和注意事项:什么时候适合进行规律式思维,如何进行规律式思维,以及什么类型的问题需要规律式思维。
这不等于说每条规律都有例外。热力学在打网球这类时候仍然成立,即便那不是思考热力学的好时机。如果你认为每条规律都有例外,因为它并不总是有用于思考那条规律,你就是在把规律的元工具完全拒绝掉,并在一个不适合进行工具箱式思维的时候进行了工具箱式思维。
是否存在最优思维的规律,支配着将情境化和注意事项加以运用的最优方式——这可能有助于找到好的可执行方法?天然偏向规律式思维的人会立刻怀疑是的,即便他们不知道那些规律是什么。不知道这些规律,并不会让一个健康的规律式思维者感到恐慌。相反,他们会继续寻找现在可以使用(而非留待以后)的有用但看似杂乱的处方——同时不会把那些杂乱的处方误认为规律,也不会把当前方法的杂乱性视为不存在好的规范性理想的证明。
确实,有时候详细地思考存在自己尚不知晓的规律,是有用的。但那是元工具箱中一个更高级的元工具,在与发明新规律以及新方法有关的更窄范围和更少情境中才有用,我宁愿不再过度挑战工具箱先生的怀疑心。
作为结语,我会给倾向于工具箱的人开出一个减少混乱的处方:尝试把规律视为描述性陈述,而非任何形式的规范性理想。
迷宫中存在最短路径这一想法,不是「规范性理想」而非「处方性理想」——它就是真的。一旦你定义了距离,迷宫中确实存在一条最短路径。
三角不等式听起来可能非常接近于一条处方性规则,说你应该沿 AC 而非 ABC 走。但实际上,处方性规则只成立于:只有在你想走更短距离的前提下,只有在你知道哪条路是哪条的前提下,只有在你不需要避开楼梯的前提下,也只有在你没有通过骑上自行车绕迷宫外面走到出口来走一条更快路线的前提下。处方性规则「试着沿 AC 走」与三角不等式本身并不相同——三角不等式对欧几里得或近似欧几里得几何中的空间距离继续成立——不管你是否知道,不管你是否在乎,不管在任何特定时刻思考它是否有用,即便你拥有一辆自行车。
说你无法拥有比卡诺循环更高效的热-压力引擎,并不是关于聚在宗教圆圈里唱赞歌,把卡诺循环颂扬为理想中最好的可能引擎类型。这不过是热力学的一个真实事实。这一真实事实可能会有益地提示你,把妨碍卡诺性的因素当作可能改进引擎的地方来思考——比如说,你应该尽量防止燃烧室的热损失,因为热损失妨碍了绝热循环。但即便在实际上不适合思考卡诺循环的时候,这也不意味着你的热机在那些场合被允许比卡诺引擎表现得更好。
你无法从一次观察中提取比其似然比所给出的更多的证据。你可以认为这是真的,因为贝叶斯更新是一个经常不可达的规范性推理理想,因此没有人能做得比它更好。但我会把这理解为一个更深层的理解:这是一条规律,说你无法通过做出任何不同的更新来获得更高的期望分数。这是一个既适用于受贝叶斯启发的方法,也适用于不受贝叶斯启发的方法的概括。如果你想要给正确的假设分配更高的概率,那么从那个偏好到把贝叶斯更新视为规范性理想,只是一小步;但这个想法起初是作为描述性断言诞生的,而非规范性断言。
对一致性定理有一种相对肤浅的理解,说的是:「好吧,它们表明如果你不使用概率和期望效用,你就会不一致,而那是不好的,所以你不应该那样做。」有一种更深的理解,是这样陈述的:「如果你做了在 X 方式上不一致的事,它将以 Y 方式对应于一个受支配策略。这是一个关于统计工具箱中每个工具的普遍概括,不论它们实际上是否一致,不论你是否个人更偏好避免受支配策略,不论你是否有计算能力来做得更好,即便你拥有一辆自行车。」
我想,就有趣理论之类的东西而言,在「幸福宇宙」这一「规范性理想」背后,并不存在任何更深的自然事实。但在更简单的数学和科学事务中,像卡诺循环或贝叶斯决策论这样的「规范性理想」,几乎总是某个更简单事实的体现,该事实与我们想要的东西如此紧密相关,以至于我们忍不住向右迈一步,把它视为「规范性理想」。如果你对规范性理想过敏,也许一个有益的做法是,丢弃把某样东西视为规范性理想的观点,转而尝试把它理解为一个事实。
但那是比理解什么是更好或最好更高级的理解状态。如果你对理想不过敏,那么先尝试理解为什么贝叶斯更新是经常不可达的规范性理想,然后再尝试理解它们如何只是如此存在,这也是完全可以的。