What a transistor can do
A transistor is really simple—and really complex. Let's start with
the simple part. A transistor is a miniature electronic component that
can
do two different jobs. It can work either as an amplifier or a switch.
When it works as an amplifier, it takes
in a tiny
electric current at one end and (an
input current) produces a much bigger electric current (an output
current) at the
other. In other words, it's a kind of current booster. That comes in
really useful in things like hearing aids, one of the first things
people used transistors for. A hearing aid has a tiny microphone in it
that picks up sounds from the world around you and turns them into
fluctuating electric currents. These are fed into a transistor that
boosts them and powers a tiny loudspeaker,
so you hear a much louder
version of the sounds around you.
William Shockley once explained amplifiers to a student in a more
humorous way: "If you take a bale of hay and tie it to the
tail of a mule and then strike a match and set the bale of hay on fire,
and if you then compare the energy expended shortly thereafter by the
mule with the energy expended by yourself in the striking of the match,
you will understand the concept of amplification."
Transistors can also work as switches. A
tiny electric
current flowing through one part of a transistor can make a much bigger
current flow through another part of it. In other words, the small
current switches on
the larger one. This is essentially how all computer chips work. For
example, in a memory chip, the small current turns the transistor on or
off. Since the transistor can be in two distinct states, a computer can
use it to store two different numbers, zero and one. With lots of
transistors, a computer can lots of billions of zeros and ones and use
them to represent ordinary numbers and letters. More about this in a
moment.
Two kinds of silicon
The great thing about old-style machines was that you could take
them apart to figure out how they worked. It was never too hard, with a
bit of pushing and poking, to discover which bit did what and how one
thing led to another. But electronics is entirely different. It's all
about using electrons to control electricity. An electron is a
minute
particle inside an atom. It's so small, it weighs just under
0.000000000000000000000000000001 kg! The most advanced transistors work
by controlling the movements of individual electrons, so you can
imagine just how small they are. In a modern computer chip, the size of
a fingernail, you can find around 500 million separate transistors.
There's no chance of taking a transistor apart to find out how it
works, so we have to understand it with theory and imagination instead.
Transistors are made from silicon, which does not normally conduct
electricity (it doesn't allow electrons to flow through it easily).
Silicon is a semiconductor, which means it's
neither really a
conductor (something like a metal that lets electricity flow) nor an
insulator (something like plastic that stops electricity flowing). If
we treat silicon with impurities (a process known as doping),
we can make it behave in a different
way. If we dope silicon with the chemical elements arsenic, phosphorus,
or
antimony, the silicon gains some extra electrons—so electrons flow out
of it more naturally. Because electrons have a negative charge, silicon
treated this way is called n-type (negative
type). We can also dope silicon with other impurities such as boron,
gallium, and
aluminum. Silicon treated this way will lose some electrons, so
electrons in nearby
materials will tend to flow into it. A lack of electrons is the
same
thing as a positive charge, so we call this sort of silicon p-type
(positive type
Silicon sandwiches
We now have two different types of silicon. If we put them together
in layers, making sandwiches of p-type and n-type material, we can make
different kinds of electronic components that work in all kinds of
ways.
Suppose we join a piece of n-type silicon to a piece of p-type
silicon and put electrical contacts on either side. Exciting and useful
things start to happen at the junction between the two
materials. If we turn
on the current, we can make electrons flow through the junction from
the n-type side to the p-type side and out through the circuit. This
happens because the lack of electrons on the p-type side of the
junction pulls electrons over from the n-type side and vice-versa. But
if
we reverse the current, the electrons won't flow at all. What we've
made here is called a diode (or rectifier).
It's an electronic
component that lets current flow through it in only one direction. It's
useful if you want to turn alternating (two-way) electric current into
direct (one-way) current. Diodes can also be made so they give off
light when electricity flows through them. You might have seen these
light-emitting diodes (LEDs) on pocket calculators and electronic
displays on hi-fi stereo equipment.
How a junction transistor works
Now suppose we use three layers of silicon in our sandwich instead
of two. We can either make a p-n-p sandwich (with a slice of n-type
silicon as the filling between two slices of p-type) or an n-p-n
sandwich (with the p-type in between the two slabs of n-type). If we
join electrical contacts to all three layers of the sandwich, we can
make a component that will either amplify a current or switch it on or
off—in other words, a transistor. Let's how it works in the case of an
n-p-n transistor.
So we know what we're talking about, let's give names to the three
electrical contacts. We'll call the two contacts joined to the two
pieces of n-type silicon the emitter and the collector,
and the contact
joined to the p-type silicon we'll call the base. When no
current is
flowing in the transistor, we know the p-type silicon is short of
electrons (shown here by the little plus signs, representing positive
charges) and the two pieces of n-type silicon have extra electrons
(shown by the little minus signs, representing negative charges).

Another way of looking at this is to say that while the n-type has a
surplus of electrons, the p-type has holes where electrons
should be.
Normally, the holes in the base act like a barrier, preventing any
significant current flow flowing from the emitter to the collector and
the transistor is in its "off" state.
A transistor works when the electrons and the holes start moving
across the two junctions between the n-type and p-type silicon.
Let's
connect the transistor up to some power. Suppose we attach a small
positive voltage to the base, make the emitter negatively charged, and
make the collector positively charged. Electrons are pulled from the
emitter into the base—and then from the base into the collector. And
the transistor switches to its "on" state:

The small current that we turn on at the base makes a big current
flow between the emitter and the collector. By turning a small input
current
into a large output current, the transistor acts like an amplifier. But
it
also acts like a switch at the same time. When there is no current to
the base, little or no current flows between the collector and the
emitter. Turn on the base current and a big current flows. So the base
current switches the whole transistor on and off. Technically, this
type of transistor is called bipolar because
both two different
kinds ("polarity") of electrical charge (negative electrons and
positive holes) are involved in making the current flow.
How a field-effect transistor (FET) works
All transistors work by controlling the movement of electrons, but
not all of them do it the same way. Like a junction transistor, a FET
(field effect transistor) has three different terminals—but they
have the names source (analagous to the emitter), drain
(analagous to the
collector), and gate (analagous to the base). In a FET, the
layers of
n-type and p-type silicon are arranged in a slightly different way and
coated with layers of metal and oxide. That gives us a device called a
MOSFET (Metal Oxide Semiconductor Field
Effect Transistor).

Although there are extra electrons in the n-type source and drain,
they cannot flow from one to the other because of the holes in
the p-type gate in between them. However, if we attach a positive
voltage to the gate, an electric field is created there that allows
electrons to flow in a thin channel from the source to the drain. This
"field effect" allows a current to flow and switches the transistor on:

For the sake of completeness, we could note that a MOSFET is a unipolar
transistor because only one kind ("polarity")
of electric charge is involved in making it work.
Building blocks
In practice, you don't need to know any of this stuff about
electrons and holes unless you're going
to design computer chips for a living! All you need to know is that a
transistor works like an amplifier or a switch, using a small current
to switch on a larger one. But there's one other thing work knowing:
how does all this help computers store
information and make decisions?
We can put a few transistor switches together to make something
called a logic gate, which compares several
input currents and
gives a different output as a result. Logic gates let computers make
very simple decisions using a mathematical technique called Boolean
algebra. Your brain makes decisions the same way. For example,
using "inputs" (things you know) about the weather and what you have in
your hallway, you can make a decision like this: "If it's raining AND I
have an umbrella, I will go to the
shops". That's an example of Boolean algebra using what's called an AND
"operator" (the word operator is just a bit of mathematical jargon to
make things seem more complicated than they really are). You can make
similar decisions with other operators. "If it's windy OR it's snowing,
then I will put on a coat" is
an example of using an OR operator. Or how about "If it's raining AND I
have an umbrella OR I have a coat then it's OK to go out". Using AND,
OR, and other operators called
NOR, NOT, and NAND, computers can add up or compare binary numbers.
That idea is
the foundation stone of computer programs: the logical
series of instructions that make computers do things.
Normally, a junction transistor is "off" when there is no base
current and switches to "on" when the base current flows. That means it
takes an electric current to switch the transistor on or off. But
transistors like this can be hooked up with logic gates so their output
connections feed back into their inputs. The transistor
then stays on even when the base current is removed. Each time a new
base
current flows, the transistor "flips" on or off. It remains in one of
those stable states (either on or off) until another current
comes along and flips it the other way. This kind of arrangement
is known as a flip-flop and it turns a
transistor into a simple
memory device that stores a zero (when it's off) or a one (when it's
on). Flip-flops are the basic technology behind computer memory chips.
A brief bit of history
Transistors were invented at Bell Laboratories in New Jersey in 1947
by three brilliant US physicists: John Bardeen (1908-1991), Walter
Brattain (1902-1987), and William
Shockley (1910-1989). The team, led by Shockley, had been trying to
develop a new kind of amplifier for the US telephone system—but what
they actually invented turned out to have much more widespread
applications. Bardeen and Brattain made the first practical transistor
(known as a point-contact transistor) on Tuesday, December 16, 1947.
Although Shockley had played a large part in the project, he was
furious and agitated at being left out. Shortly afterward, during a
stay in a hotel at a physics conference, he single-handedly figured out
the theory of the junction transistor—a much better device than the
point-contact transistor.
While Bardeen quit Bell Labs to become an academic (he went on to
great success studying superconductors at the University of Illionois),
Brattain stayed for a while before retiring to become a teacher.
Shockley set up his own transistor-making company and helped to inspire
the modern-day phenomenon that is "Silicon Valley" (the prosperous area
around
Palto Alto, California where electronics corporations have
congregated). Two of his employees, Robert Noyce and Gordon Moore, went
on to found Intel, the
world's biggest micro-chip manufacturer.
Bardeen, Brattain, and Shockley were briefly reunited in 1956 when
they shared the world's top science
award, the Nobel Prize in Physics, for their discovery. Their story is
a riveting tale of
intellectual brilliance battling with petty jealousy and it's well
worth reading
more about. You can find some great accounts of it in the books and
websites listed below.
Further Reading
Books you can read
Bridgman, Roger. Eyewitness Electronics.
New York: Dorling
Kindersley, 2000. A great introduction from someone who really
understands his subject.
Riordan, Michael and Lillian Hoddesdon. Crystal
Fire: The Invention
of the Transistor and the Birth of the Information Age. New
York/London: W.W.Norton, 1997. A superb account of the brilliant trio
who developed the transistor, and how bitter personal rivalry quickly
blew the team apart.
Favorite websites
- The Journey Inside: Intel's educational website, all
about transistors and integrated
circuits.
- Transistorized!:
A PBS website about Bardeen, Brattain, Shockley, and the history of the
transistor.
- The Transistor: Learn about transistors in a fun
way, with these games on the Nobel Prize website.