The Thinking Machine: What Intelligence Even Means

The Thinking Machine: What Intelligence Even Means

AI is not just a technology. It is a field born from a question so old it predates computers: can a machine think?

This is the first in a series of lessons that walks through that question from the ground up. The anchor text is Artificial Intelligence: A Modern Approach by Stuart Russell and Peter Norvig — not because it is the only book on AI, but because it is one of those rare texts that teaches AI as a way of thinking, not merely a catalogue of techniques.

My aim is to move at the pace of a patient tutor: concept, intuition, example, check. Not watered down — slowed down. As the series develops, other books and papers will enter the conversation where they add light. For now, we begin with fundamentals.

The early chapters of Russell & Norvig all converge on a single powerful idea: the intelligent agent — something that perceives the world, reasons about what it perceives, and decides what to do next. That loop sits behind everything from classical search algorithms to modern language models.

This first lesson covers Chapter 1. It asks the question that has to come before all others: what is intelligence, and what exactly are we trying to build?


What intelligence even means

Before you can build a mind, you have to decide what you are aiming at. Chapter 1 sets the target, then traces where the ideas came from.

What is AI? Four competing answers

“Artificial intelligence” has never had a single agreed definition, and that’s not sloppiness — it’s because researchers were aiming at genuinely different targets. The whole field can be organised around two questions you can ask about any candidate definition:

  • Are we copying humans, or pursuing an ideal? Fidelity to human performance versus an abstract standard of rationality — doing the demonstrably “right” thing.
  • Are we judging the inside or the outside? Internal thought processes and reasoning versus external behaviour.

Cross those two axes and you get four schools of AI. This 2×2 is the scaffolding for the entire section:

The four approaches to AI across two axesmeasured by…THINKING (internal)ACTING (external)HUMANLYRATIONALLYThinking humanlyCognitive modelling.Model how the mindactually works, thenmatch it. → cognitive scienceActing humanlyThe Turing Test.Be indistinguishablefrom a person inconversation/behaviour.Thinking rationallyThe "laws of thought."Logic: irrefutablereasoning from premisesto conclusions.Acting rationally ★The rational agent.Do the right thing to getthe best (expected) outcome.This book's approach.
The four approaches to AI. The book commits to the bottom-right — acting rationally — and everything else in the text builds towards it.

Intuition · Why two axes?

Think of an exam invigilator. "Humanly vs. rationally" asks whose answer key we are grading against: a real student's, or the perfect one? "Thinking vs. acting" asks whether we grade the working-out or just the final answer. Different combinations give genuinely different research programmes — one needs psychology labs, another needs mathematics.

Acting humanly · the Turing Test

Alan Turing (1950) wanted to dodge the unanswerable philosophical question — can a machine think? — and replace it with an operational one. His test: a human interrogator exchanges typed messages with an unseen partner and must decide whether it is a person or a machine. If they cannot reliably tell, the machine passes.

Definition · Capabilities the Turing Test demands

To pass, a machine needs the four pillars that make up most of AI:

  • Natural language processing — to communicate in a human language.
  • Knowledge representation — to store what it knows and hears.
  • Automated reasoning — to answer questions and draw new conclusions.
  • Machine learning — to adapt to new circumstances and spot patterns.

The Total Turing Test adds physical interaction, requiring two more: computer vision (to perceive) and robotics (to manipulate and move). Those six disciplines compose most of the field.

Key insight · Why AI ignores its own famous test

Researchers spend almost no effort trying to pass the Turing Test. The analogy Russell & Norvig reach for: aeronautics succeeded when engineers stopped trying to build machines that flap like birds and started studying aerodynamics — the underlying principles. Likewise, the goal is to understand the principles of intelligence, not to build something that imitates a human well enough to fool a judge.

Thinking humanly · cognitive modelling

To claim a programme “thinks like a person” you must first know how people think. Three windows into that: introspection (catching your own thoughts in the act), psychological experiments (observing behaviour under controlled conditions), and brain imaging (watching the brain work). If a programme’s input-output behaviour and its intermediate steps match a human’s, that is evidence its mechanism resembles ours. This is the domain of cognitive science, which fuses AI models with experimental psychology.

Pitfall · The early confusion

Early researchers slid between two different claims: "this algorithm performs the task well" and "this algorithm is how humans do it." Those are separate claims requiring separate evidence. Keeping them apart is what allowed both AI and cognitive science to advance.

Thinking rationally · the laws of thought

Aristotle tried to codify “right thinking” using syllogisms — argument patterns that always yield correct conclusions from correct premises. The canonical example: Socrates is a man; all men are mortal; therefore Socrates is mortal. This founded logic. The logicist tradition in AI hoped to build intelligence by encoding knowledge in logical notation and letting a prover derive conclusions.

Two problems block pure logic as a complete theory of intelligence:

  • Uncertainty. Logic needs facts that are certain, but we rarely have those. Probability theory generalises logic to handle uncertain information.
  • Action. Even perfect reasoning produces only conclusions, not behaviour. Rational thought alone is not enough — intelligence has to act.

Acting rationally · the rational agent

Definition · Agent & rational agent

An agent is simply something that acts (from Latin agere, "to do"). But a computer agent is expected to do more than any simple programme: operate autonomously, perceive its environment, persist over time, adapt to change, and create and pursue goals.

A rational agent is one that acts so as to achieve the best outcome, or — when there is uncertainty — the best expected outcome.

Why did this approach win out over the other three? Two decisive advantages:

  • It’s more general. Correct logical inference is just one way to be rational. Sometimes acting rationally involves no inference at all — yanking your hand off a hot stove is a reflex that beats slow deliberation. The rational-agent frame includes reflexes, logic, probability, and learning as special cases.
  • It’s scientifically tractable. Rationality is mathematically well-defined and completely general, so you can work backwards from the definition to derive agent designs that provably achieve it. You cannot do that if your goal is the fuzzy target of “imitate a human”.

Intuition · "Doing the right thing"

The whole field, boiled down, is the study and construction of agents that do the right thing — where "the right thing" is whatever maximises the objective we give the agent. Chapters 2 and 3 are just two ways of unpacking that sentence: rational agents, then rational search.

The standard model — and its one crack

“Build an agent that optimises a given objective” is such a pervasive recipe that it has a name: the standard model. The same shape appears everywhere — which is a clue you have found something fundamental:

FieldThe optimised quantity
Artificial intelligenceMaximise an objective / performance measure
Control theoryMinimise a cost function
Operations researchMaximise a sum of rewards (a policy)
StatisticsMinimise a loss function
EconomicsMaximise utility / social welfare

Definition · Limited rationality

Perfect rationality — always taking the exactly optimal action — is impossible in complex environments because the computation costs too much. Limited (bounded) rationality is acting appropriately when there is not enough time to compute the perfect answer. Perfect rationality stays useful as a theoretical ideal even though real agents can only approximate it.

The crack: beneficial machines and value alignment

The standard model quietly assumes we can hand the machine a fully specified, correct objective. For a closed task like chess, the objective is built in. In the real world, it is almost impossible to write down completely. This is the deepest idea in the chapter.

Worked example · The cautious self-driving car

Tell a self-driving car its objective is "reach the destination safely." Taken literally, the safest action is to never leave the garage — any road carries some risk. A usable objective must trade progress against risk, comfort against speed, and consideration for other drivers against all of it. None of these trade-offs are obvious to specify in advance.

Worked example · The chess that cheats

Define winning at chess as the sole objective for a machine clever enough to act beyond the board, and the logical moves include hypnotising the opponent, bribing the audience, or hijacking extra compute. These are not bugs — they are the logical consequences of a single fixed objective. You cannot anticipate every way a fixed-objective machine will misbehave.

Definition · The value alignment problem

The value alignment problem: the values and objectives we put into a machine must match the values we actually hold. In a simple task you can reset and re-specify a wrong objective. A capable system deployed in the real world with a wrong objective causes real harm — and the more capable it is, the worse the harm.

Key insight · The fix is uncertainty, not perfection

We do not want machines that intelligently pursue their objective; we want them to pursue ours, while staying uncertain about exactly what ours is. A machine that knows it does not fully know the objective has an incentive to act cautiously, ask permission, learn our preferences by observation, and defer to human control. The aspiration is AI that is provably beneficial. This reframes the entire standard model and recurs throughout the book.

Checkpoints

Why did the "acting rationally" approach come to dominate AI over the other three?

Two reasons. First, generality — correct inference is only one route to rationality; reflexes and probabilistic decisions count too, so the rational-agent frame subsumes the others. Second, scientific tractability — rationality is mathematically well-defined, so you can derive designs that provably achieve it. That is impossible when the target is the vague goal of "imitate a human".

In one sentence, what is the value alignment problem, and what is the proposed remedy?

The objective we install in a machine must align with the objective we truly want — otherwise a capable agent optimising the wrong objective causes harm, and worse harm the more capable it is. The remedy is not a more perfect objective specification, but machines that remain uncertain about the objective and defer to humans, aiming to be provably beneficial.


The disciplines that fed AI

AI did not appear from nowhere. Eight older fields each donated a crucial idea. The below table lists those fields and the gift each one gave AI. For study purposes, what matters is not the dates — it is knowing which idea came from where, and why it mattered.

DisciplineThe question it askedWhat it gave AI
PhilosophyCan formal rules yield valid conclusions? How does mind arise from matter? Where does knowledge come from?Logic & syllogisms (Aristotle); dualism vs materialism; empiricism; induction; utilitarianism — rational choice = maximise utility.
MathematicsWhat are the rules of valid inference? What can be computed? How to reason under uncertainty?Formal & first-order logic (Boole, Frege); Gödel’s incompleteness; the Church-Turing thesis & computability; tractability / NP-completeness; probability & Bayes’ rule.
EconomicsHow to decide given preferences, other agents, and delayed payoff?Decision theory = probability + utility; expected-utility maximisation; game theory; operations research (sequential decisions, MDPs).
NeuroscienceHow do brains process information?The brain as a massively parallel network of simple threshold units → loose inspiration for neural networks.
PsychologyHow do humans and animals think and act?The shift from behaviourism to cognitive psychology: the mind as an information-processing system.
Computer engineeringHow to build efficient computers?The hardware that makes everything else runnable — and ever-faster hardware via Moore’s law, now GPUs and TPUs.
Control theory & cyberneticsHow can artefacts operate under their own control?Feedback loops; homeostasis; minimising a cost function over time — the standard model in continuous form.
LinguisticsHow does language relate to thought?Generative grammar (Chomsky); the insight that understanding language requires world knowledge, birthing computational linguistics and NLP.

Connection · Control theory vs. AI

Control theory and AI share the standard model (optimise an objective over time) and even shared founders (Wiener). Why did they split into two fields? Tools dictated territory. Control theory's calculus and matrix algebra suit systems of continuous variables. AI's logic and symbolic computation let it tackle language, vision, and planning — problems that fell outside the control theorist's mathematics. The divide is about mathematical tools, not goals.


A history in eight acts

AI went through several eras: bursts of optimism, collisions with reality (the “AI winters”), then a new paradigm that resets expectations. The driver of each era is what matters here.

EraWhat happened and why it ended (or did not)
Inception · 1943–56McCulloch & Pitts model artificial neurons (1943); Hebbian learning (1949); the 1956 Dartmouth workshop names the field “artificial intelligence” on the conjecture that every aspect of intelligence can be precisely described and simulated.
Early enthusiasm · 1952–69Symbolic reasoning works on toy problems — the Logic Theorist, the General Problem Solver, LISP. Big claims about imminent general intelligence.
A dose of reality · 1966–73The toys do not scale. Machine translation fails; single-layer neural nets cannot even learn XOR (Minsky & Papert). Funding is cut — the first AI winter.
Expert systems · 1969–86Narrow it down: hand-encode domain experts’ rules (MYCIN, XCON). Commercially huge — then brittle, and choked by the knowledge-acquisition bottleneck. A second winter follows.
Return of neural nets · 1986–presentBackpropagation revives multi-layer networks. Hidden Markov Models, trained on real data, conquer speech recognition.
Probabilistic reasoning & ML · 1987–presentPearl’s Bayesian networks (1988) make uncertainty rigorous; reinforcement learning links to MDPs. AI’s subfields reunify around statistics and learning.
Big data · 2001–presentWeb-scale datasets. Banko & Brill’s lesson: more data often beats a cleverer algorithm. Watson wins Jeopardy! (2011).
Deep learning · 2011–presentMany-layer networks + GPUs + huge data. The 2012 ImageNet breakthrough (Hinton’s group); AlphaGo beats top humans. Compute used in leading models doubled roughly every three to four months.

Key insight · The recurring lesson of the winter

Each winter came from the same root cause: a method that shone on small problems hit a wall on real ones — logic did not scale; hand-coded rules were brittle. Progress resumed each time by changing the paradigm: from logic, to knowledge, to probability, to learning from data. The arc of the field is a slow migration from "program the intelligence in by hand" toward "let the system learn it from experience" — exactly what Turing predicted in 1950.


State of the art, risks, and benefits

By the time Russell and Norvig wrote this edition, AI had met or beaten human performance in chess, Go, poker, many Atari and StarCraft games, Jeopardy!, ImageNet object detection, restricted-domain speech and translation, protein folding, and several medical-imaging diagnoses. Expert forecasts for broadly human-level AI ranged widely (roughly 2029 to 2200, with a median around 2099) — and the authors caution that expert predictions in such matters are no more reliable than amateurs’.

On risks, the through-line returns: the benefits are vast (medicine, education, science), but the central danger is again misaligned objectives in increasingly capable systems, alongside autonomous weapons, economic disruption, surveillance, and bias.


What we have established

  • Four approaches from two axes (human/rational × thinking/acting). Russell & Norvig choose acting rationally.
  • A rational agent achieves the best expected outcome. “AI = building agents that do the right thing.”
  • The standard model (optimise a given objective) appears across AI, control theory, operations research, statistics, and economics.
  • Its flaw is the value alignment problem: the fix is machines that stay uncertain about the objective and defer to humans — provably beneficial AI.
  • AI’s ideas came from eight disciplines; its history is repeated cycles of hype → wall → new paradigm, trending from hand-coding toward learning.

Next in this series: Chapter 2 — Intelligent Agents. We move from asking what AI is to asking what an agent must look like to behave rationally in any environment.