Introduction to Neural Networks/Ch. 1 — Foundations of Neural Networks/2 of 7

The Artificial Neuron

Before we build a skyscraper, we need to understand a single brick. In neural networks, that brick is the artificial neuron.

The biological neuron has three main components that matter to us: dendrites, which receive incoming signals; an axon, which carries the processed signal forward; and terminals, which pass the signal to the next neuron. These signals are electrochemical — a mix of electrical and chemical processes that scientists have been trying to replicate mathematically for decades.

McCulloch and Pitts' thought: what if we could model this with math?

Their answer was elegant and, in hindsight, obvious. An artificial neuron computes a weighted sum of its inputs:

f(x, w) = x₁w₁ + x₂w₂ + … + xₙwₙ

Each input x_i has a corresponding weight w_i. The neuron multiplies each input by its weight and adds everything up. In the original 1943 model, those weights were set entirely by hand — there was no learning yet, just configuration.

Linear regression as a single neuron: inputs multiplied by weights, summed to produce an output — The price of a house is equal to distance from beach, number of bedrooms, and whether there is a pool. Does this look familiar?

✦

This Is Just Linear Regression

If this equation looks familiar, trust that instinct. It is familiar — this is linear regression. And this is not a coincidence.

Linear regression is the foundational operation of virtually every algorithm in machine learning. The neuron is a linear regression. Stacked neurons become neural networks. The sophistication comes not from abandoning linearity, but from combining many linear operations in clever ways — and adding just enough non-linearity to unlock remarkable capabilities.

Checkpoint•Reflective Question

In what sense is a single artificial neuron equivalent to a linear regression model? What does each weight represent?

◈

Real-World Application

In practice, every prediction your neural network makes is ultimately a series of weighted sums passed through transformations. When you are debugging a model that is producing nonsense outputs, one of the first questions to ask is: "Are my inputs scaled properly?" Because a weighted sum is sensitive to the magnitude of its inputs, preprocessing your data — normalizing or standardizing it — is not optional housekeeping. It is foundational to getting the model to work at all.

←

A Story 90 Years in the Making

The Perceptron and Its Limits

→