
Augmented reality (AR)
by Chris Woodford. Last updated: September 4, 2011.
Not so long ago, future-minded people enjoyed making a sharp distinction between "reality" (the often tiresome, problematic world around us) and "cyberspace" (the promising new "virtual" world inside computers). Back in 1996, when the Web was still a novelty and relatively few people were online, John Perry Barlow famously declared that cyberspace is "a civilization of the Mind... both everywhere and nowhere, but it is not where bodies live."
Some years down the line, it's clear that black-and-white distinctions between real and virtual were incredibly wide of the mark. What most people actually want from computers is much more pragmatic: intuitively easy-to-use technology that will enhance their busy, everyday, real lives—to help them find a decent coffee shop, to make shopping less of a chore, to meet new friends who share their interests, or whatever it might be. Now computers are smaller and more portable than ever, and you can go online almost anywhere on the planet, using online information to enhance ("augment") real life is where the smart money is heading. Augmented reality, as it's known, is something we'll be hearing an awful lot more about in the next few years. Let's take a closer look at how it works!
Photo: Augmented reality: What's that plant? Just point your tablet computer at the pesky thing so the digital camera can take a photo, press a button, and the plant-finder app automatically identifies what you're looking at. In seconds, it's called up the Wikipedia entry (left) and a selection of photos of the species on Flickr (right) so you can double-check. This is only one of the possible, future applications of augmented reality, which means blending computer-generated information with the world you can see in front of you. (Simulated image.)
What is augmented reality?
Augmented reality sounds horribly theoretical and abstract, but it's actually very simple: real life plus topical, relevant, background information equals something massively more useful. A few quick examples will make the idea clearer:
- You're watching a tennis grand-slam on TV and there's a controversial call from one of the line judges. Was that serve in or out? The TV station runs an instant replay with a computer animation showing the exact trajectory of the ball and where it landed—just outside the line. Then a little table comes up on the screen showing how many serves have been in or out for each player and how the figures have changed over the course of the match.
- You're walking the streets of London, England and you suddenly come across an amazing bit of architecture. What is this fantastic building? Who was the architect? Is that really titanium? You're dying to find out more, but the building is closed and there's no information about it at all. So you hold your cellphone up and take a quick photo. The phone uses its built-in GPS (satellite navigation) system to figure out roughly where you are, then quickly searches Google Images to find similar photos taken in the same neighborhood. In a couple of seconds, it's identified the building and brought up a Wikipedia page telling you all about it.
- You're a fighter pilot flying over a warzone with anti-aircraft fire shooting up at you. You really have to concentrate and looking down at all the gauges on your instrument panel is a distracton you can do without. Fortunately, you're wearing what's called a heads-up display (HUD), a set of goggles with built-in, miniaturized computers that automatically project instrument readings so they "float" in front of your eyes. You can find out everything you need to know without taking your eyes off the sky.
- You're driving down the freeway sometime in 2020 and you start to feel hungry. Wink your right eye twice and a computer display overlays your windshield with a list of eating places in nearby towns. Wink your eye to select the one that looks most promising and your sat-nav system reads out directions for how to get there. You can email an order in advance so it'll be ready when you arrive.
Photo: Left: A relatively cumbersome, prototype helmet-mounted display developed by NASA in the early 1990s: computer generated images are blended with what the pilot sees through his eyes. Photo courtesy of NASA Langley Research Center (NASA-LaRC).
Right: A more recent prototype developed for the US Air Force in 2008. Here the augmented reality display is incorporated into an ordinary pair of wraparound eyeglasses. Photo by Jonathan Snyder courtesy of US Air Force and Defense Imagery.
You can see that augmented reality is actually a mixture of real life and virtual reality, somewhere in between the two, so it's often referred to as mixed reality. The key point is that the extra information it gives you is highly topical and relevant to what you want to do or know in a certain place and time.
How does AR tracking work?
If you're out and about in the real world with your laptop, netbook, or cellphone, it's easy enough to get information: just bring up Google and type in some words. In the brave new world of augmented reality, it's even easier: you get the extra information automatically. That means your portable computing device needs some automatic way of finding out where you are or what you're looking at—a problem known as tracking.
The simplest form of tracking is for the device to use GPS (or some other satellite navigation system) to figure out your position automatically, which is fine if you want broad, background information about a place your visiting (a local street map, a list of nearby coffee bars, directions to the nearest hotel, or whatever). It's relatively easy to use tracking information from Wi-Fi hotspots as well. But what if you're somewhere like an art gallery or museum and what you actually want is information about each picture or exhibit automatically coming up on your cellphone as you walk through the building? GPS isn't (yet) precise enough for an application like that, so what could we do instead? Broadly speaking, there are two different solutions known as marker-based and markerless tracking.
- Markerless tracking: You could point your phone at each picture or exhibit and have some kind of pattern recognition or feature-detection system try to identify it. That's how our own perceptual systems work, after all: our eyes see things and our brains figure out what we're looking at, then "call up" background information. Our brains are amazingly good at this and make it seem very easy, but it's a much harder problem for a computer to tackle, not least because the best computer vision systems are only a fraction as good as our own.
- Marker-based tracking: A simpler option would be for the gallery or museum to print small, two-dimensional barcodes (also called data-matrix codes) next to each item on display. Then you'd simply point your phone's camera at one of them, your phone would turn the barcode into a web address, and its browser would call up an appropriate web page with further information. AR systems can be designed to read all kinds of other markers (or fiduciary markers) as these "added reference points" are called.
Marker-based tracking will probably be popular initially, but in the longer term fully automatic, markerless tracking seems certain to win out because that's how our own visual systems work—and it's what most users prefer. After all, we recognize our friends automatically without them having to walk around with barcodes printed on their foreheads!
Photo: Marker-based tracking. This advertisement features a QR code® (a popular type of data-matrix code). You're supposed to see the ad and then point your phone camera at it for more information. Your phone will decode the black-and-white pattern into a URL (website address) that its browser will then load automatically. Photo by courtesy of Gaku, published on Flickr in 2007 under a Creative Commons Licence.
Displaying augmented reality
Virtual reality has taken off in a much more limited way than many people supposed. One reason for this is the difficulty of "immersing" people in a computer-generated world: ideally, you need to wear a sophisticated headset that completely blocks out the real world and replaces it with a computer-drawn equivalent. With augmented reality, the problem is different: what we need is a way of overlaying computer-generated information on what we can see already. When people first started talking about augmented reality, there was a feeling we'd all end up walking around in heads-up displays, like fighter pilots. Now Net-connected cellphones with cameras are commonplace, they've become the focus for AR—although that may change in future. Already, researchers are working on prototype contact lenses with built-in computer displays, which would superimpose things like Web pages onto our field of vision. Another possibility is wearing eyeglasses with miniature, forward-facing cameras built into the frames and transparent screens added to the lenses.

What kind of AR applications already exist?
There's plenty of online information and no shortage of reality—the trick is bringing the two things together. Some exciting strides are already being made in this area by cellphone applications such as Layar, Wikitude, and Yelp. With Layer, you simply look through your cellphone camera at the world in front of you and see layers of extra information, like transparent webpages, added on top. Wikitude offers an application called World Browser, which overlays useful information about landmarks and other points of interest seen through a camera phone. Yelp gives you instant access to reviews of shops, restaurants, hotels and so on in American cities such as San Francisco. Expect to see lots more AR applications along these lines in the next few years!
Photo: Wikitude: Point your phone's camera at the world around you and it automatically loads information about places nearby. You can also browse interesting things nearby based on the phone's location sensor, as I'm doing here.
Further Reading
On this website
Books
- Augmented Reality: Placing Artificial Objects in Real Scenes by Reinhold Behringer et al (eds). A K Peters, 1999. Proceedings of the First International Workshop on Augmented Reality.
- Professional Augmented Reality Browsers for Smartphones: Programming for Junaio, Layar and Wikitude by Lester Madden. John Wiley and Sons, 2011.
- Emerging Technologies of Augmented Reality: Interfaces and Design by Michael Haller, Mark Billinghurst, and Bruce H. Thomas. Idea Group Inc (IGI), 2007.
Articles
- How AR could change our view of the world by Spencer Kelly, BBC Click, 24 June 2011. A short video demonstration of cutting-edge augmented reality on tablet computers.
- The promise of augmented reality: Gaga in a living room by Alex Hudson, BBC Click, 25 June 2011. Genuinely useful or just a gimmick?
- Aurasma: Augmented reality future or forgettable fun by Rory Cellan-Jones, BBC News, 26 May 2011. Is AR really useful in real life?
- Augmented reality: it's like real life, but better by Charles Arthur, The Guardian/Observer, 21 March 2010. Argues that AR could revolutionize sport, advertising, and much more.
- Wired: Augmented reality Read recent posts about AR from the Wired Gadget Lab.


