How the World Wide Web (WWW) works
by Chris Woodford. Last updated: November 24, 2016.
Twenty or thirty years ago, there was
something you could take for
granted: you could walk into a public library, open up a reference
book, and find information on almost any subject you wanted. What we
take for granted nowadays is that we can sit down at practically any
computer, almost anywhere on the planet, and access an online
information library far more powerful than any public library on Earth:
the World Wide Web. Twenty-first century
life is so dependent
on the Web that it seems remarkable we ever lived without it. Yet the
Web was invented less than 20 years ago and has been a huge popular
success for only about half that time. One of the greatest inventions
of all time... is also one of the newest!
Photo: Our gateway to the world: the World Wide Web (WWW). Websites like this arrive at your computer when remote computer send data to you over the Internet. The Web is just one of many things that uses the Internet.
What's the difference between the Web and the Net?
Photo: Imagine this little computer network scaled up several hundred million times... and you have a picture of the Internet. Photo courtesy of NASA Glenn Research Center (NASA-GRC).
Let's get one thing straight before we go any further: the Web and
the Internet are two totally different
- The Internet is a
worldwide network of computers,
linked mostly by telephone lines; the
Web is just one of many things (called applications)
that can run on the Internet. When you send an email, you're using the Internet:
the Net sends the words you write over telephone lines to your friends.
When you chat to someone online, you're most likely using the Internet
too—because it's the Net that swaps your messages back and forth.
But when you update a blog or Google for information to help you
write a report, you're using the Web over the Net.
You can read more in our article about how the Internet works.
- The Web is the worldwide
collection of text pages, digital
photographs, music files,
animations you can access over the Internet. What makes the Web so
special (and, indeed, gives it its name) is the way all this
information is connected together. The basic building blocks of the Web
are pages of text, like this one—Web pages as
we call them. A
collection of Web pages on the same computer is called a website.
Every web page (including this one) has highlighted phrases called links
(or hypertext links) all over it. Clicking
one of these takes
you to another page on this website or another website entirely. So
far, so simple.
How computers can talk the same language
The really clever thing about the Internet is that it allows practically
every computer on the planet to exchange information. That's a much
bigger deal than it sounds. Back in the
earlier days of computers,
in the 1960s, 1970s, and 1980s, it was
rare for computers to be able to exchange information at all. The
machines made by one manufacturer were often totally incompatible with
those made by everyone else. In the 1970s, early personal computers
(which were called microcomputers) could not even run the same
programs. Instead, each type of computer had to have programs written
specially for it. Hooking computers up together was possible, but
So most computers were used as standalone machines, like gigantic
pocket calculators. Things like email and chat were all but impossible, except
for a handful of scientists who knew what they were doing.
Photos: Microcomputers like this Apple ][ sold in the hundreds of thousands in the late 1970s
and early 1980s, but machines made by one company couldn't share information with those made by other manufacturers.
All this began to change in the 1980s. The first thing that happened
was that IBM—the world's biggest computer company, famous for its "big
blue" mainframes—introduced a personal computer for small businesses.
Other people started to "clone" (copy) it and, pretty soon, all personal
computers started to look and work the same way. Microsoft came up with a
piece of software called Windows that allowed all these
"IBM-compatible" computers to run the same programs. But there was a
still a problem getting machines like home computers talking to giant
machines in science laboratories or big mainframes in large companies.
How could computers be made to talk the same language?
The person who solved that problem was English computer scientist
Tim Berners-Lee (1955–). In the 1980s, he was working at CERN, the
European particle physics laboratory, which is staffed mostly by people from universities around the
world who come and go all the time, and where people were using all kinds of
different, incompatible computers.
Berners-Lee realized CERN had no "memory": every time
people left, they took useful information with them. A related problem
was that people who used different computers had no easy way of
exchanging their research.
Berners-Lee started to wonder how he could get all of CERN's
computers—and people—talking together.
What's the difference between HTTP and HTML?
Although early computers were pretty incompatible, almost all of
could store or process information using ASCII
(American Standard Code for Information Interchange), sometimes known
as "plain text." In ASCII, the numbers 0–255 are used to represent
letters, numbers, and keyboard characters like A, B, C, 1, 2, 3,
%, &, and @. Berners-Lee used ASCII to come up with two basic
systems of rules (known in computer terminology as protocols).
If all the computers at CERN followed those two rules, he realized they
exchange any information very simply.
He called the first rule HTTP (HyperText
It is essentially a way for two computers to exchange information
through a simple "conversation," whether they're sitting next to one
another in the same room or on opposite sides of the world.
One computer (which is called a client and
runs a program called a web
browser) asks the other computer (which is called a server
or web server) for the information it needs
with a series of
simple messages. The web browser and the web server then chat away for
a few seconds, with the browser sending requests for the things it
wants and the server sending them if it can find them.
The HTTP conversation between a web browser and and a web server is a
bit like being at a dinner table when someone says: "Pass the salt,
please", someone else says "Here it is", and the first person says
HTTP is a sort of simple, polite language that all computers have
learned to speak so they can
swap files back and forth over the Internet.
A computer also needs to be able to understand any files it receives
that have been sent by HTTP.
So Berners-Lee introduced another stroke of genius.
His second rule was to make all the CERN computers exchange files
written in a common language called HTML
Language). It was based on ASCII, so any computer could understand it.
Unlike ASCII, HTML has special codes called tags
structure the text. A Web browser can read these tags and use them to
display things like bold font, italics, headings, tables, or images.
Incidentally, for the curious among you: you can see what the "secret"
HTML behind any web page looks like by right clicking your mouse on a
web page and then selecting the View source
or View page source option.
Try it now!
Photo: An example of HTML code. This is the HTML that produces this web page. The little codes you can see inserted between angled brackets (< and >) are bits of HTML, which identify parts of the text as headings, paragraphs, graphics and so on. For example, new paragraphs are marked out with <p> and medium-sized headings with <h2> (header level 2).
HTTP and HTML are "how the Web works": HTTP is the simple way in
which one computer asks another one for Web pages; HTML is the way
those pages are written so any computer can understand them and display
them correctly. If you find that confusing, try thinking about
libraries. HTTP is like the way we arrange and access books in
libraries according to more or less
the same set of rules: the fact that they have books arranged on
you can ask for help, catalogs where you can look up book titles, and
so on. Since all libraries work roughly the same way, if you've been to
one library, you
know roughly what all the others are like and how to use them.
HTML is like the way a book is made: with a contents at the front, an
index at the back, text on
pages running left to right, and so on. HTML is how we structure
information so anyone can read it. Once you've seen one book, you know
how they all work.
When Browser met Server
Web browsers (clients) and servers converse not in English, French, or German—but HTTP: the
language of "send me a Web page", "Okay, here it is." This is a brief example of how your browser
could ask to see our A-Z index page and what our server would say in response. The actual
page and its information is sent separately.
What the browser asks for
GET /azindex.html HTTP/1.1
User-Agent: Mozilla/5.0 (X11; Ubuntu; Linux i686; rv:44.0) Gecko/20100101 Firefox/44.0
What the server replies
HTTP/1.1 200 OK
Date: Mon, 08 Feb 2016 09:03:23 GMT
Expires: Wed, 09 Mar 2016 09:03:23 GMT
Content-Type: text/html; charset=UTF-8
What does it all mean? Briefly, the browser is explaining what software it is (Firefox), what operating system I'm running
(Linux Ubuntu), which character-sets (foreign fonts and so on) it can accept, which forms of compressed file it can understand
(gzip, deflate), and which file it wants (azindex.html). The server (running software called Apache) is sending a compressed file (gzip), along with data about how long it is (19702 bytes) and what format it's in (text/html, using the UTF-8 character set).
Http status codes
Right at the start of the server's reply, you can see it says HTTP/1.1 200 OK: the 200 "status code" (sometimes called a response code) means the server has correctly located the page and is sending it to the browser. A server can send a variety of other numeric codes too: if it can't find the page, it sends a 404 "Not Found" code; if the page has moved elsewhere, the server sends a 301 "Permanently moved" code and the address of the page's new location; and if the server is down for maintenance, it can send a 503 "Service Unavailable" code, which tells browsers they should try again later.
What is a URL?
There was one more clever thing Berners-Lee thought of—and that was a
way for any computer to locate information stored on any other
He suggested each web page should have something like a zip code, which
a URL (a Universal or Uniform Resource
The URL is the page address you see in the long bar at the top of your
The address or URL of this page is:
What does all that gobbledygook mean? Let's take it one chunk at a
- The http:// bit means your computer can pull this page off my computer using the standard process called HTTP.
- www.explainthatstuff.com is the address or domain name of my computer.
- howthewebworks.html is the name of the file you're currently reading off my computer.
- The .html part of the filename tells your computer it's an HTML file
Taken all together, that stuff tells your computer where to find this
page on my computer, how to access it, and what to do with it to
display it correctly.
And that's how the Web works!
How to set up your own website
The famous American inventor and publisher Benjamin Franklin once
said that two things in life are certain: death and taxes. These days,
he might add something else to that list: websites—because just about
everybody seems to have one! Businesses promote themselves with
websites, television soaps have spinoff sites devoted to their
characters, newlyweds set up sites for their wedding photographs, and
most kids have profiles (statements about themselves and what they
like) on "social-networking sites" such as Facebook. If you
feel like you're getting left behind, maybe it's time to set up a site
yourself? How do you go about it?
What is a website?
The basic idea of the Web is that you can read
information that anyone else has stored on a publicly accessible space
called their website.
If you're familiar with using computers for wordprocessing, you'll
know that when you create a document (such as a letter or a
CV/resumé), it exists on your computer as a file, which
you store in a place called a folder (or directory). A website is simply a
collection of interlinked documents, usually stored in the same
directory on a publicly accessible computer known as a server.
Apart from the main documents (text pages), a website generally also
contains images or graphic files (photographs, typically stored as JPG
files, and artworks, usually stored as GIF or PNG files). So the basic
idea of creating a website involves writing all these text pages and
assembling the various graphic files you need, then putting them all
together in a folder where other people can access them.
What do you need to host a website?
Theoretically, you could turn your own computer into a server and
allow anyone else on the planet to access it to browse your website.
All you have to do is configure your computer in a certain way so that
it accepts incoming traffic from the Internet and also register your
computer with all the other servers on the Internet so they know where
to find it. There are three main reasons why this is not generally a
good idea. First, you won't be able to use your computer for anything
else because it will be spending all its time serving requests for
information from other people. (But if you have more than one computer,
that's not such a problem.) Second, you'd have to make sure that your
computer was switched on and available 24 hours a day—and you might not
want to do that. Third, making your computer available to the Internet
in this way is something of a security risk. A determined hacker might
be able to access all the other folders on your machine and either
steal your information or do other kinds of malicious damage.
So, in practice, people rent web space on a large computer operated
by an Internet service provider (ISP). This is known as getting someone
to host your website for you. Generally, if you want to set up
a website, you will need a hosting package (a basic contract
with an ISP to give you so much disk space and bandwidth (the
maximum amount of information that your website can transfer out to
other people each month). The web space you get is simply a folder
(directory) on the ISPs server and it will have a fairly obscure and
unmemorable name such as: www.myownpersonalISP.com/ABC54321/ That's not exactly the sort of thing you want to paint on the side of
your truck, if you're in business. So you'll need a more memorable name
for your website—also called a domain name. The domain name is
simply a friendly address that you give to your website so that other
people can find it more easily. The domain address is set up to point
to the real address of your site at your ISPs server
(www.myownpersonalISP.com/ABC54321/ ), so when people type your
domain name into their Web browser, they are automatically redirected
to the correct address without actually having to worry about what it
Some ISPs offer a user-friendly system where you simply purchase a
domain name and hosting package for a single annual payment (generally,
it will be less than about $60 or £30 per annum). With other
ISPs, you have to buy the domain name and the hosting package
separately and that works out better if you are hosting several
different domains with the same ISP. Buying a domain name makes you its
legal owner and you'll find that you are immediately registered on a
central database known as WHOIS, so that other people can't use
the same name as well.
How do you create web pages?
Setting up a domain name and Web hosting package takes all of five
minutes; creating a website can take an awful lot longer because it
means writing all the information you need, coming up with a nice page
layout, finding your photographs, and all the rest of it. Generally,
there are three ways to create web pages.
The most basic way of creating web pages is to use a text
editor such as notepad or WordPad on Windows and build up your pages
from raw HTML web page coding as you go. Generally, this gives you a
much better understanding of how web pages work, but it's a bit harder
for novices to get the hang of it—and unless you're a geek you may not
want to bother. Instead of creating pages from scratch, you can use ready-made ones called
templates. They're bare-bones, pre-designed HTML files into
which you simply insert your own content. Just change the bits you need
and you have an instant website! The main drawback of templates is that
you can end up with a me-too site that looks the same as everyone
Another approach is to use an editing program that does all the hidden
Web-page coding (known as HTML) for you. This is called a WYSIWYG
(what you see is what you get) editor because you lay out your pages on
the screen broadly as you want them to appear to everyone who browses
your site. Popular programs such as Dreamweaver work in this way. Most word
processors, including Microsoft Word and OpenOffice, let you convert
existing documents into web pages ("export HTML files") with a couple
of mouse clicks.
Content management systems
The final method is to use what's called a content-management system (CMS),
which handles all the technical side of creating a website automatically. You simply set up a basic page template,
style its visual appearance with what's called a "theme," create your various interlinked pages
based on the template, and then upload them. CMS systems like Wordpress, Drupal, and Joomla
work this way. You can add various extra functions to them using what are called plugins.
How do you upload web pages?
Once you've created your web pages and you have your domain name and
web space, you simply need to upload the pages onto your web
space using a method called FTP (file transfer protocol). It's
very easy: just like copying files from one folder of your computer to
another. When you've uploaded your files, your website should be
publicly accessible within seconds (assuming that your domain name has
already been registered for at least a couple of days first). Updating your web pages
is then simply a matter of updating them on your local computer, as
often as you wish, and copying the changes onto your web space as
necessary. Generally it's best to do all your updating on copies of
your pages on your own computer rather than editing live pages on the
server itself. You avoid embarrassing mistakes that way, but you also
have a useful backup copy of the entire site on your computer in case
the server crashes and loses all your files.
How can you promote a website?
You want lots of other people to find your website, so you'll need
to encourage other websites to make links to yours. You'll also need to
register your site with search engines such as Google, Bing,
and all the dozens of others. Sooner or later, search engines like
Google will pick up your site if it's linked by other sites that
they're already indexing, because they're constantly "crawling" the web
looking for new content.
And that's pretty much all there is to it. The best way to learn
about websites is to build one for yourself. So, off you go and do it!
You can learn all about building basic web pages by playing with HTML
files on your computer. Once you're confident about what you're doing,
it's easy to take the next step and make a world-wide website for the whole wide world!
Find out more
On this site
- Google SEO Starter Guide: You've figured out how to build a site and you've got it working okay, but how do you successfully promote it through a search engine? Google's excellent introduction explains the best practices for getting search engines to work for you.
- Tim Berners-Lee's home page: Read all about the man who invented the Web.
- Creating a Web Site: The Missing Manual by Matthew MacDonald. O'Reilly, 2015. A very comprehensive guide that goes from absolute basics right up to CSS and server-side includes. One of the best single-volume introductions I've found.
- Building a Web Site for Dummies by David A. Crowder. Dummies, 2010. A basic introduction, though assumes you have some knowledge of the Web already.
History of the Internet and Web
- 100 Ideas that Changed the Web by Jim Boulton. Laurence King, 2014. From browsers and emoticons to HTML and CSS, an accessible introduction to the key concepts that underpin the Web. (For some reason, this isn't currently listed on Google Books.)
- The Innovators: How a Group of Hackers, Geniuses, and Geeks Created the Digital Revolution by Walter Isaacson. Simon and Schuster, 2014. A whistle-stop biography of the people who pioneered modern computer and Internet technology.
- Weaving the Web: The
Original Design and Ultimate Destiny of the World Wide Web by its Inventor by Tim Berners-Lee with Mark Fischetti. HarperCollins, 1999. Tim Berners-Lee tells his own story of what he hopes to achieve with the World Wide Web.
- The Internet: A Historical Encyclopedia.
by Christos Moschovitis, Hilary Poole, Laura Lambert, and Chris Woodford. ABC-Clio: 2005. A three-volume history of the Internet, including a chronology, a book of biographies, and a look at the key social and technical issues challenging the Internet's development. (I wrote the second volume about issues.)
More to explore on our website...