You are on page 1of 14

Journal of Sci. Engg. & Tech. Mgt.

Vol 2 (2), July 2010

“Multi Touch Technology”


The Burgeoning Mean of Human Computer Interface
1
H.B.Kekre, 2Sudeep D. Thepade, 3Digvijay Singh, 4Tushant Mitaal, 5Aman Pratap Singh
1
Senior Professor, 2Associate Proffesor, 3,4,5B.Tech. Student
Computer Engineering Department,
MPSTME, SVKM’s NMIMS University, Mumbai

Abstract
Multi-touch (or multi-touch) denotes a set of interaction techniques which allow computer or
mobile users to control graphical user interface with more than one finger at either application or
system level. The paper takes quick review of Touch screen technology for multi touch displays.
Further the hardware constitution details are discussed. Software support along with applications
are also discussed with some future directions.

opposed to the single touch screen (e.g.


I. INTRODUCTION computer touchpad, ATM), which recognizes
only one touch point. This effect is achieved
Multi-touch (or multitouch) denotes a through a variety of means, including: heat,
set of interaction techniques which allow finger pressure, high capture rate cameras,
computer or mobile users to control graphical infrared light, optic capture, tuned
user interface with more than one finger at electromagnetic induction, ultrasonic
either application or system level [1]. An receivers, transducer microphones, laser
interaction technique, user interface technique rangefinders, and shadow capture [2].
or input technique is a combination of Many products using multi-touch
hardware and software elements that provides a interfaces exist and are being developed.
way for computer users to accomplish a single Multi-touch is used on portable devices
task. For example, one can go back to the including the Apple iPhone, HTC Hero,
previously visited page on a Web browser by Microsoft's Zune HD, and the Palm Pre, as
either clicking a button, pressing a key, well as desktop products such as the Microsoft
performing a mouse gesture or uttering a Surface and the DELL Latitude XT2.
speech command. It is a key concept in human- Hobbyists have developed extremely low cost
computer interaction. touch screen technologies using webcams,
projection of image, and software that senses
touch patterns via the webcam. Multi touch
screens larger than 50" are possible at costs
under $100 to $400 depending on the projector
& equipment used. DIY multi-touch screens of
extremely large size (wall sized or larger) are
also quite possible and very cheaply
Fig.1.1. Multi-touch Screen [1] constructed and posted in numerous places on
the web.
While touch sensing is commonplace
for single points of contact, multi-touch
sensing enables a user to interact with a system
with more than one finger at a time, as in
chording and bi-manual operations. Such
sensing devices are inherently also able to
accommodate multiple users simultaneously,
which is especially useful for larger interaction Fig.1.2. View of Multi-touch Screens [28]
scenarios such as interactive walls and
tabletops. Multi-touch consists of a touch II. HISTORY
screen (screen, overlay, table, wall, etc.) or
touchpad, as well as software that recognizes
multiple simultaneous touch points, as

25
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

Multi-touch technology dates back to H. Dispersive signal technology [11] [3].


1982, when Nimish Mehta at the University of I. Acoustic pulse recognition [12] [3].
Toronto developed the first finger pressure
multi-touch display [2]. In 1983, Bell Labs at IV. Multi-touch displays
Murray Hill published a comprehensive
discussion of touch-screen based interfaces. A. Perceptive Pixel[13][1]
[27] In 1984 Bell Labs engineered a touch
screen that could change images with more
than one hand. The group at the University of
Toronto stopped working on hardware and
moved on to software and interfaces,
expecting that they would have access to the
Bell Labs work. Mainstream exposure to
multi-touch technology occurred in the year
2007, when Apple unveiled the iPhone and Fig.4.1. Jefferson Y. Han debuting his initial
Microsoft debuted surface computing. The multi-touch system at TED [1]
iPhone in particular has spawned a wave of
interest in multi-touch computing Perceptive Pixel is a company founded
by New York University consulting research
III. Conventional Touch Screens scientist Jefferson Y. Han that creates wall
displays and tables. The displays use infrared
Touchscreen is a display that can light emitting diodes along with an infrared
detect the presence and location of a touch camera to determine the point of contact. Han
within the display area. The term generally envisions large collaborative spaces that will
refers to touch or contact to the display of the allow multiple users to work and interact.
device by a finger or hand. Touchscreens can Perceptive Pixel’s technology is currently
also sense other passive objects, such as a being utilized, in the form of the Multi-Touch
stylus. However, if the object sensed is active, Collaboration Wall, by CNN and an
as with a light pen, the term touchscreen is unspecified government contractor. But its 6-
generally not applicable. The ability to interact figure price even scared most interested
directly with a display typically indicates the medium business away.
presence of a touchscreen. The touchscreen B. Microsoft Surface[14][15]
has two main attributes. First, it enables one to In 2001 Steve Bathiche and Andy
interact with what is displayed directly on the Wilson of Microsoft began work on an idea for
screen, where it is displayed, rather than an interactive table that mixes both physical
indirectly with a mouse or touchpad. Secondly, and virtual worlds.
it lets one do so without requiring any
intermediate device, again, such as a stylus
that needs to be held in the hand. Such
displays can be attached to computers or, as
terminals, to networks. They also play a
prominent role in the design of digital
appliances such as the personal digital Fig.4.2. Microsoft Surface [1]
assistant (PDA), satellite navigation devices,
mobile phones, and video games. Research and Development expanded
There are a number of types of rapidly in 2004, once the idea caught the
touchscreen technologoies, some of them are attention of Microsoft Chairman Bill Gates. In
as listed below.. 2007 Microsoft introduced Microsoft Surface,
A. Resistive Touch Screen [4] [3] a functional multi-touch table-top computer
B. Surface acoustic wave [5] [3] based on a standard PC platform including an
C. Capacitive Touch Screen [3] Intel Core 2 Duo processor, Windows Vista,
D. Surface capacitance [6] [3] and 2 GB of RAM. Essentially, Microsoft
E. Infrared Touch Screen [3] Surface is a computer embedded in a table
F. Strain gauge [8], [9], [10]. with a large, flat, touch-responsive display on
G. Optical imaging[3] the top. The table uses small cameras (as

26
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

opposed to finger pressure or heat) that enable standard touch-screen (e.g. computer
it to react to the touch of any object. The unit touchpad, ATM), which recognizes only one
has eight different modes that allow users to touch point.
perform an array of activities, ranging from The Natural User Interface and its
organizing pictures and videos to ordering influence on multi-touch gestural interface
food at a restaurant. Multiple users have the design has brought key changes in computing
ability to work on the table at one time. The hardware design, especially the creation of
preliminary launch was on April 17, 2008, “true” multi-touch hardware systems (i.e.
when Surface became available for customer support for more than two inputs). The aim of
use in AT&T stores. Although the product is NUI Group is to provide an open platform
not available for widespread purchase, it can where hardware and software knowledge can
possibly be bought directly from Microsoft for be exchanged freely; through this free
$13,500. exchange of knowledge and information, there
C. 3M Multi-touch[1] has been an increase in development in regards
3M is a company known for many to hardware. On the hardware frontier, NUI
innovations in the touch field, including Group aims to be an informational resource
overlays incorporated in other vendor's hub for others interested in prototyping and/or
products. 3M provides a Multi-touch constructing, a low cost, high resolution open-
developer kit which includes a projected source multi-input hardware system. Through
capacitive glass surfaced 19" monitor with the community research efforts, there have
stand and a software development kit which been improvements to existing multi-touch
includes support for the upcoming Windows 7 systems as well as the creation of new
operating system. The unique features at this techniques that allow for the development of
price point are 10 finger multi-touch and 15ms not only multi-touch hardware systems, but
touch response with full edge support and also multi-modal devices. At the moment there
finger identification when both are touching. are five major techniques being refined by the
D. Tyco Electronics: Elo Touch Systems community that allow for the creation of a
Tyco Electronics is a electronics stable multi-touch hardware systems; these
component manufacturer which also sells include: Jeff Han’s pioneering Frustrated Total
touch technology through their Elo Touch Internal Reflection (FTIR), Rear Diffused
Systems division. Elo Touch Systems provides Illumination (Rear DI) such as Microsoft’s
integrator components, custom solutions, kits, Surface Table, Laser Light Plan (LLP)
and fully integrated touch monitors through pioneered in the community by Alex Popovich
resellers. Elo Touch Systems provides multiple and also seen in Microsoft’s LaserTouch
technologies for single touch screens: Acoustic prototype, LED-Light Plane (LED-LP)
Pulse Recognition (APR); IntelliTouch developed within the community by Nima
Surface Acoustic Wave (SAW); and Motamedi, and finally Diffused Surface
CarrollTouch InfraRed (IR). For Multi-touch Illumination (DSI) developed within the com-
they have been an early integrator working on munity by Tim Roth. These five techniques
Microsoft Windows 7, initially announcing being utilized by the community work on the
support for 2 finger multi-touch support with principal of Computer Vision and optics
IntelliTouch Surface Acoustic Wave and (cameras). While optical sensing makes up the
technology built on CarrollTouch IR touch vast majority of techniques in the NUI Group
technologies, in both 19" and 22" form factors. community, there are several other sensing
techniques that can be utilized in making
V. HARDWARE CONSTITUTION natural user interface and multitouch devices.
Some of these sensing devices include
Multi-touch (or multi-touch) denotes a proximity, acoustic, capacitive, resistive,
set of interaction techniques that allow motion, orientation, and pressure. Often,
computer users to control graphical various sensors are combined to form a
applications with several fingers. Multi-touch particular multitouch sensing technique. In this
devices consist of a touch screen (e.g., chapter, we explore some of the mentioned
computer display, table, wall) or touchpad, as techniques. Optical or light sensing (camera)
well as software that recognizes multiple based solutions make up a large percentage of
simultaneous touch points, as opposed to the multi-touch devices. The scalability, low cost

27
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

and ease of setup are suggestive reasoning for per second (FPS) is recommended. Higher
the popularity of optical solutions. Stereo frame rates provide a smoother and more
Vision, Overhead cameras, Frustrated Total responsive experience.
Interal Reflection, Front and Rear Diffused Interface: Basically there are two types of
Illumination, Laser Light Plane, and Diffused interfaces that can be used to connect a camera
Surface Illumination are all examples of device to a computer. An IEEE 1394 device is
camera based multi-touch systems. Each of recommend because it usually has less
these techniques consist of an optical sensor overhead and lower latency in transferring the
(typically a camera), infrared light source, and camera image to the computer. Again, lower
visual feedback in the form of projection or latency results in a more responsive system.
LCD. Prior to learning about each particular Lens type: Most consumer webcams contain
technique, it is important to understand these an infrared (IR) filter that prevents IR light
three parts that all optical techniques share. from reaching the camera sensor. This is done
A. Introduction to Optical Multi-Touch to prevent image distortion. However for our
Technologies [16] purpose, we want to capture and use IR light.
1. Infrared Light Sources Choosing the right lens can be a difficult task,
Infrared light is mainly used to fortunately many manufactures provide an
distinguish between a visual image on the online lens calculator. The calculator
touch surface and the object(s)/finger(s) being calculates the required focal length based on
tracked. Since most systems have a visual two input parameters which are the distance
feedback system where an image from a between the lens and the object (touch surface)
projector, LCD or other display is on the touch and the width or height of the touch surface.
surface, it is important that the camera does Be sure to check if the calculator chooses a
not see this image when attempting to track proper lens. Lenses with a low focal length
objects overlyaed on the display. In order to often suffer from severe image distortion
separate the objects being tracked from the (Barrel distortion / fish eye), which can
visual display, a camera, as explained above, is complicate the calibration of the touch
modified to only see the infrared spectrum of tracking software.
light; this cuts out the visual image (visible For Multi-touch Firewire cameras have
light spectrum) from being seen by the camera some benefits over normal USB webcams:
and therefore, the camera is able to see only • Higher framerate
the infrared light that illuminates the • Capture size
object(s)/finger(s) on the touch surface. • Higher bandwidth
2. Infrared Cameras • Less overhead for driver (due to less
For multitouch setups they need to be compression)
modified first. Regular webcams and cameras
block out infrared light, letting only visible B. Frustrated Total Internal Reflection
light in. We need just the opposite. Typically, (FTIR)[16]
by opening the camera up, you can simply pop When this happens, no refraction
the filter off, but on expensive cameras this occurs in the material, and the light beam is
filter is usually applied directly to the lens and totally reflected. Han’s method uses this to
cannot be modified. great effect, flooding the inside of a piece of
Resolution: The resolution of the camera is acrylic with infrared light by trapping the light
very important. The higher the resolution the rays within the acrylic using the principle of
more pixels are available to detect finger or Total Internal Reflection. When the user
objects in the camera image. This is very comes into contact with the surface, the light
important for the precision of the touch device. rays are said to be frustrated, since they can
For small multi-touch surfaces a low now pass through into the contact material
resolution webcam (320 x 240 pixels) can be (usually skin), and the reflection is no longer
sufficient. Larger surfaces require cameras total at that point. [Fig.5.1.] This frustrated
with a resolution of 640x480 or higher in order light is scattered downwards towards an
to maintain the precision. infrared webcam, capable of picking these
Frame rate: In order to cope with fast ‘blobs’ up, and relaying them to tracking
movements and responsiveness of the system a software.
camera with at least a frame rate of 30 frames

28
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

Diffused Illumination (DI) comes in two main


forms: Front Diffused Illumination and Rear
Diffused Illumination. Both techniques rely on
the same basic principles - the contrast
between the silent image and the finger that
touches the surface.
Front Diffused Illumination
Visible light (often from the ambient
surroundings) is shined at the screen from
above the touch surface. A diffuser is placed
Fig.5.1. Schematic diagram depicting the on top or on bottom of the touch surface.
bare minimum of parts needed for a FTIR When an object touches the surface, a shadow
setup [16] is created in the position of the object. The
camera senses this shadow.
1. FTIR Layers Rear Diffused Illumination
Acrylic Infrared light is shined at the screen from
According to the paper of Han, it is necessary below the touch surface. A diffuser is placed
to use acrylic for the screen. The minimum on top or on bottom of the touch surface.
thickness is 6 mm however large screens When an object touches the surface it reflects
should use 1 cm to prevent the screen from more light than the diffuser or objects in the
bending. background; the extra light is sensed by a
Before a sheet of acrylic can be used for a camera. [Fig.5.2.]Depending on the diffuser,
multi-touch screen it needs to be prepared. this method can also detect hover and objects
Because acrylic often gets cut up roughly, it is placed on the surface. Rear DI, as
required to polish the sides of the sheet. This is demonstrated in the figure below, requires
done to improve the illumination. infrared illuminators to function.
Baffle
The baffle is required to hide the light that is
leaking from the sides of the LEDs. This can
be a border of any material (wood/metal).
Diffuser
Without a diffuser the camera will not only see
the touches, but also all objects behind the
surface. By using a diffuser, only bright
objects (touches) will be visible to the camera.
All other ‘noise data’ will be left out.
Compliant layer
With a basic FTIR setup, the performance Fig5.2. Rear DI schematic [16]
mainly depends on how greasy the fingertips
of the user are. Wet fingers are able to make C. Laser light plane (LLP)
better contact with the surface. Dry fingers and Infrared lasers are an easy and usually
objects won’t be able to frustrate the TIR. To inexpensive way to create a MT setup using
overcome this problem it is recommended to the LLP method. Most setups go with 2-4
add a ‘compliant layer’ on top of the surface. lasers, postioned on the corners of the touch
Instead of frustrating the total internal surface. The laser wattage power rating
reflection by touch, a compliant layer will act (mW,W) is related to the brightness of the
as a proxy. The complaint layer can be made laser, so the more power the brighter the IR
out of a silicon material such as ELASTOSIL plane will be. The common light wavelengths
M 4641. To protect and improve the touch used are 780nm and 940nm as those are the
surface, rear projection material such as Rosco wavelengths available on the Aixiz.com
Gray #02105 can be used. With this setup it is website where most people buy their laser
no longer required to have a diffuser on the modules. Laser modules need to have line
rear side. lenses on them to create a light plane. The 120
2. Diffused Illumination (DI) degree line lens is most commonly used, so as

29
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

to reduce the number of lasers necessary to and the light travels over the touch surface.
cover the entire touch surface. Safety when This picture [Fig.5.5.] shows the layers that are
using lasers of any power is important, so common in an LED-LP setup.
exercise common sense and be mindful of
where the laser beams are travelling.

Fig5.3. LLP schematic [16]


Fig5.5. LED-LP 3D Schematic created in
D. Diffused Surface Illumination (DSI)
SecondLife [16]
DSI uses a special acrylic to distribute
the IR evenly across the surface. Basically use
The infrared LEDs are placed around
your standard FTIR setup with an LED Frame
the touch surface; with all sides being sur-
(no compliant silicone surface needed), and
rounding preferred to get a more even
just switch to a special acrylic. [Fig.5.4.]
distribution of light. Similar to LLP, LED-LP
creates a plane of IR light that lays over the
touch surface. Since the light coming from the
LEDs is conical instead of a flat laser plane,
the light will light up objects placed above the
touch surface instead of touching it. This can
be adjusted for by adjusting filter settings in
the software (touchlib/Community Core
Vision) such as the threshold levels to only
pick up objects that are lit up when they are
very close to the touch surface.
A touch surface is a strong, durable
Fig5.4. DSI Schematic [16] surface that can take the pressure of user
interaction that is optically clear, should be
This acrylic uses small particles that used. This is usually acrylic or glass. If using
are inside the material, acting like thousands of in a projector setup, the image is stopped and
small mirrors. When you shine IR light into displayed on the projection layer. If using in
the edges of this material, the light gets an LCD setup, the diffuser is placed below the
redirected and spread to the surface of the LCD screen to evenly distribute the light from
acrylic. The effect is similar to DI, but with the LCD backlight. The source of infrared
even illumination, no hotspots, and same setup light for an LED-LP setup comes from
process as FTIR. Evonic manufactures some infrared LEDs that are placed around at least 2
different types of Endlighten. These vary in sides of the acrylic right above the touch
their thickness and also in the amount of surface. Typically the more sides surrounded,
particles inside the material. Available thick- the better the setup will be in IR prevalent
ness ranges between 6-10mm, follwing “L”, lighting conditions. Refer to the LED section
“XL” and “XXL” for particle amount. The 6 for more information on IR LEDs. A computer
mm (L) is too flexible for a table setup, but the webcam is placed on the opposite site of the
10 mm (XXL) works nicely. touch surface so that is can see the blobs. See
E. LED Light Plane (LED-LP) the camera section for more information on
LED-LP is setup the same way as an cameras that are commonly used.
FTIR setup except that the thick acrylic that
the infrared light travels through is removed VI. SOFTWARE SUPPORT[16]

30
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

clever tricks are employed. However, with the


A. Introduction to Software Programming current infrared (IR) approaches in multi-touch
Programming for multi-touch input is hardware (e.g., FTIR or DI), an adaptive
much like any other form of coding; however background model turns out to be overkill.
there are certain protocols, methods, and Due to the fact that the captured images filter
standards in the multi-touch world of out (non-infrared) light, much of the
programming. Through the work of NUI background is removed by the hardware.
Group and other organizations, frameworks Given these IR images, it is often sufficient to
have been developed for several languages, simply capture a single static background
such as Action Script 3, Python, C, C++, C#, image to remove nearly all of the ambient
and Java. Multi-touch programming is two- light. This background image is then
fold: reading and translating the “blob” input subtracted from all subsequent frames, the
from the camera or other input device, and resultant frames have a threshold applied to
relaying this information through pre-defined them, and we are left with images containing
protocols to frameworks which allow this raw ‘blobs’ of foreground objects (fingers or
blob data to be assembled into gestures that surface widgets we wish to track). Unlike
high-level language can then use to interact general tracking solutions, we know that from
with an application. TUIO (Tangible User one frame of video to another (~33ms for
Interface Protocol) has become the industry standard refresh rate of 30Hz), a human finger
standard for tracking blob data, and the will travel only a limited distance.
following chapters discuss both aspects of C. Gesture Recognition
multi-touch software: touch tracking, as well The future of Human-Computer-
as the applications operating off of the Interaction is the Natural User Interface which
tracking frameworks. is now blurring its boundary with the present.
B. Tracking With the advancement in the development of
Object tracking has been a cheap yet reliable multi-touch hardware, it
fundamental research element in the field of wouldn’t be long when we can see multi-touch
Computer Vision. The task of tracking consists screens not only in highly erudite labs but also
of reliably being able to re-identify a given in study rooms to drawing rooms and maybe
object for a series of video frames containing kitchens too. The mouse and the GUI interface
that object (estimating the states of physical has been one of the main reasons for the huge
objects in time from a series of unreliable penetration of computers in the society.
observations). In general this is a very difficult However the interaction technique is indirect
problem since the object needs to first be and recognition based. The Natural User
detected (quite often in clutter, with occlusion, Interface with multi-touch screens is intuitive,
or under varying lighting conditions) in all the contextual and evocative. The shift from GUI
frames and then the data must be associated to Gesture based interface will further make
somehow between frames in order to identify a computers an integral but unobtrusive part of
recognized object. our lifestyle. In its broadest sense, “the notion
1. Tracking for Multi-Touch of gesture is to embrace all kinds of instances
Tracking is very important to multi- where an individual engages in movements
touch technology. It is what enables multiple whose communicative intent is paramount,
fingers to perform various actions without manifest, and openly acknowledged” [17]
interrupting each other. We are also able to Communication through gestures has been one
identify gestures because the trajectory of each of the oldest form of interaction in human
finger can be traced over time, impossible civilization owing to various psychological
without tracking. Thankfully, today’s multi- reasons which, however, is beyond the scope
touch hardware greatly simplifies the task of of present discussion. The GUI systems
tracking an object, so even the simplest filter leverages previous experience and familiarity
in actuality becomes unnecessary. In fact with the application, whereas the NUI
much of the performance bottleneck of interface leverages the human assumptions and
tracking systems tends to come from gen- its logical conclusions to present an intuitive
erating and maintaining a model of the and contextual interface based on gestures.
background. Computational costs of these Thus a gesture based interface is a perfect
systems heavily constrict CPU usage unless candidate for social and collaborative tasks as

31
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

well for applications involving an artistic


touch. The interface is physical, more visible
and with direct manipulation. However, only
preliminary gestures are being used today in
stand-alone applications on the multi-touch
hardware which gives a lot of scope for its
critics. Multi-touch interface requires a new
approach rather than re-implementing the GUI
and WIMP methods with it. The form of the
gestures determines whether the type of
interaction is actually multi-touch or single-
touch multi-user. We will discuss the kind of
new gesture widgets required, development of
gesture recognition modules and the Fig.6.2. Gesture Widgets Example [16]
supporting framework to fully leverage the
utility of multi-touch hardware and develop Gesture widgets should be designed
customizable, easy to use complex multi-touch with creative thinking, proper user feedback
applications. keeping in mind the context of the application
1. Gesture Widgets and the underlying environment. These gesture
Multi-touch based NUI setups provide widgets can then be extended by the
a strong motivation and platform for a gestural application developer to design complex
interface as it is object based ( as opposed to applications. They should also support
WIMP) and hence remove the abstractness customizable user defined gestures.
between the real world and the application.
The goal of the interface should be to realize a
direct manipulation, higher immersion
interface but with tolerance to the lower
accuracy implied with such an interface. The
popular gestures for scaling, rotating and
translating images with two fingers, commonly
referred as manipulation gestures, are good Fig.6.3. New Generation Gesture Widgets
examples of natural gestures [Fig.6.1.] [16]
[Fig.6.2.].
New types of gesture-widgets
[Fig.6.3.][Fig.6.4.] are required to be build to
fully implement the concept of direct
manipulation with the objects (everything is an
object in NUI with which the user can interact)
and a evocative and contextual environment
and not just trying to emulate mouse-clicks
with a gesture.

Fig.6.4. A View of New Gesture Widgets


[16]

2. Gesture Recognition Process


The primary goal of gesture
recognition research is to create a system
Fig.6.1. Gestures [16] which can identify specific human gestures
and use them to convey information or for
device control. In order to ensure accurate
gesture recognition and an intuitive interface a
number of constraints are applied to the model.

32
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

The multi-touch setup should provide the device input management, in the form of touch
capability for more than one user to interact events being sent through TUIO protocol
with it working on independent (but maybe irrespective of the underlying hardware. The
collaborative) applications and multiple such gesture recognition task which will realize the
setups working in tandem. true potential of multi-touch surfaces, is still
the job of the client. Often, however, some
basic gestures are already included, in
particular those for natural manipulation (e.g.
of photos), but in general these frameworks
aren’t focused on gestural interfaces. They
rather tend to port the GUI and WIMP canons
to a multitouch environment. Gesture and
gesture recognition modules have currently
gained a lot of momentum with the coming up
of the NUI interface. Some of the important
frameworks are:
1.Sparsh-UI [18]
Sparsh-UI, published under LGPL
Fig6.5. Some Gesture Recognized by Touch
license, seems to be the first actual multitouch
screen [16] gesture recognition framework. It can be
Gestures are defined by the starting point
connected to a variety of hardware devices and
within the boundary of one context, end point supports different operating systems,
and the dynamic motion between the start and
programming languages and UI frameworks.
end points. With multi-touch input it should Touch messages are retrieved from the
also be able to recognize meaning of
connected devices and then processed for
combination of gestures separated in space or gesture recognition. Every visual component
time. The gesture recognition procedure can be
of the client interface can be associated to a
categorized in three sequential processes: specific set of gestures that will be attempted
• Detection of Intention: Gestures should
to be recognized. New gestures and
only be interpreted when they are made recognition algorithms can be added to the
within the application window.
default set included in the package.
• Gesture Segmentation: The same set of 2. Grafiti
gestures in the same application can map
Grafiti is a C# framework built on top
to several meanings depending on the of the Tuio client that manages multi-touch
context of the touch events. Thus the touch
interactions in table-top interfaces. The
events should be again patterned into parts possible use of tangible objects is particularly
depending on the object of intention.
contemplated. It is designed to support the use
These patterned data will be sent to the of third party modules for (specialized) gesture
gesture recognition module.
recognition algorithms. However a set of
• Gesture Classification: The gesture modules for the recognition of some basic
recognition module will work upon the
gestures is included in this project.
patterned data to map it to the correct 3. NUIFrame
command.
NUIFrame is a C++ framework based
There are various techniques for on the above discussed model (currently under
gesture recognition which can be used alone or
development). It provides a separate gesture
in combination like Hidden Markov Models, recognition module to which besides the
Artificial Neural Networks, and Finite State
touch-event sequence, contextual information
Machine etc. regarding the view of the interface is also
D. Development Frameworks
provided by the client application. This
A number of frameworks have been ensures that the same gesture on different
released and are being developed to help in the
objects can result in different operations
development of multi-touch applications depending on the context. It will also support
providing an interface for the management of
custom gestures based on user specification for
touch events in an object-oriented fashion. a particular command. The set of gesture
However the level of abstraction is still till the
widgets will also support automatic debugging

33
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

by using pattern generation, according to the capabilities” [20]. Dell has partnered with
gestures supported by it. multi-touch technology startup N-trig to
4. AME Patterns Library enable multi-touch capabilities for its tablet.
The AME Patterns library is a new C+ There have been numerous reports of
+ pattern recognition library, currently focused serious problems with the XT multi-touch
on real-time gesture recognition. It uses drivers or hardware interface, which, as of
concept-based programming to express gesture October 2008, remain unresolved by Dell. The
recognition algorithms in a generic fashion. major issue is cantered on the multi-touch
The library has recently been released under drivers failing when the computer is booted
the GNU General Public License as a part of when USB devices are connected at boot, or
AMELiA (the Arts, Media and Engineering when certain internal media drives are present.
Library Assortment), an open source library Often, but not always, the issue is resolved if
collection. It implements both a traditional the computer is booted without the USB
hidden Markov model for gesture recognition, devices and they are connected after boot. A
as well as some reduced-parameter models that similar, but not identical, issue may also be
provide reduced training requirements (only 1- caused by interfering software, such as the
2 examples) and improved run-time iTunes helped. Issue resolution is complicated
performance while maintaining good by the possible failure of the driver install
recognition results. program to function in the presence of
5. Python[19] damaged drivers, requiring manual removal of
Python is a dynamic object-oriented related files and registry keys. See External
programming language that can be used for Links for links to user discussions of this
many kinds of software development. It offers issue. With the introduction of Windows 7
strong support for integration with other which offers native multi-touch support,
languages and tools, comes with extensive preliminary tests have shown that many of
standard libraries, and can be learned in a few these issues with the XT have been solved,
days. Many Python programmers report together with a much improved N-Trig Win 7
substantial productivity gains and feel the driver.
language encourages the development of 3. Sharp Mobius
higher quality, more maintainable code. In April 2009, Sharp unveiled the first
In many ways both Python and multi-touch laptop with an optical-sensor LCD pad. The
have emerged for similar reasons of making optical-sensor LCD technology combines LCD
computers easier to use. Although as a and CCD elements within its pixels and can
programming language Python certainly detect fingers and objects such as a stylus, and
requires advanced technical knowledge of is claimed to be able to scan a business card
computers, a major goal is to let developers placed on top of the screen. Further
write code faster and with less hassle. Python’s improvements to this function are expected to
dynamic nature, rapid development enable fingerprint authentication in the future.
capabilities, make it an ideal language for 4. Lenovo Tablets (x200)
prototyping and developing multi-touch Currently, Lenovo's definition of
interactions or applications quickly. "Multitouch" implies that the screen responds
to either a finger or a digitizer pen. However,
VII. Multi-touch Applictions it is not capable of detecting simultaneous
touch points on the tablet screen nor any forms
A. Notebooks and tablets[1] of multi-finger gestures, while the Windows 7
1. HP Touchsmart TX2 already supports simple dual-touch ones.
Released in December 2008, the
Touchsmart tx2 was touted as the first B. Software[1]
consumer notebook and tablet PC with on- 1. Mac OS X Tiger
screen multi-touch control. Apple Mac OS X 10.4, as well as
2. Dell Latitude XT derivatives such as iPhone OS, support multi-
In July 2008, Dell released multi- touch. This extends from the windowing
touch touch-screen drivers for the Latitude XT environment, through to Safari, iLife and other
Tablet, claiming the "industry’s first software.
convertible tablet with multi-touch

34
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

2. Linux C. Phones and Music players:


Various versions of Linux, and 1. Apple iPhone
especially those versions used on small The iPhone is an Internet-connected,
devices such as the Asus Eee PC and Palm multimedia GSM smartphone designed and
Pre, have multi-touch support. marketed by Apple Inc. Because its minimal
3. Windows 7 hardware interface lacks a physical keyboard,
While previous versions of Windows the multi-touch screen renders a virtual
do not support multi-touch screens, Windows keyboard when necessary. The iPhone
7 will support multi-touch when released [21]. functions as a camera phone (also including
The operating system is known to have a text messaging and visual voicemail), a
multi-touch mapping application, photo portable media player (equivalent to a video
viewing program, and incorporation in Internet iPod), and an Internet client (with email, web
Explorer 8. In January 2009, Microsoft joined browsing, and Wi-Fi connectivity). The first-
with other investors who invested twenty-four generation phone hardware was quad-band
million dollars in N-Trig Ltd., which plans to GSM with EDGE; the second generation
make computer hardware that takes advantage added UMTS with 3.6 Mbps HSDPA; the third
of Windows 7's multi-touch support. generation adds support for 7.2 Mbps HSDPA
downloading but remains limited to 384 Kbps
uploading as Apple had not implemented the
HSPA protocol.
2. Apple iPod Touch[22]
The iPod Touch (trademarked and
marketed as iPod touch) is a portable media
Fig.7.1. A multi-touch photos application is player, personal digital assistant, and Wi-Fi
demonstrated in a pre-release version of mobile platform designed and marketed by
Windows 7 [1] Apple Inc. The product was launched on
September 5, 2007 at an event called The Beat
4. 22MILES Goes On. The iPod Touch adds the multi-
One of the advantages of 22MILES touch graphical user interface to the iPod line.
multi-touch framework is OS and multi-touch It is the first iPod with wireless access to the
hardware independent. Their multi-touch iTunes Store, and also has access to Apple's
software can run Windows 7, XP, Vista, App Store, enabling content to be purchased
Ubuntu Linux and Mac OS X, while it and downloaded directly on the device.
supports various multi-touch hardware 3. Palm Pre[23]
technology, such as Infrared Matrix/Camera, The Palm Pre is a multimedia
Projected Capacitive, SAW (dual-touch), smartphone designed and marketed by Palm
resistive, FTIR/ DI/ DSI, LED-LP. But unlike with a multi-touch screen and a sliding
open-source, their proprietary software and keyboard. The phone was launched on June 6,
multitouch gestures only target to business, 2009, and is the first to use Palm's new Linux-
such as digital signage, Medical Industry, GIS, based[1] operating system, webOS. The Pre
Real Estate, and so on. functions as a camera phone, a portable media
5. Multitouch for Java (MT4j) player, a GPS navigator, and an Internet client
Another multi-touch framework is (with text messaging, email, web browsing,
MT4j. It is cross-plattform (XP, Vista, and and local Wi-Fi connectivity).
Linux) and supports various multi-touch 4. Palm Pixi[24]
hardware technology based on the TUIO The Palm Pixi is a multimedia
protocol. MT4j is open source. MT4j supports smartphone under development by smartphone
application development in 2D and 3D and maker Palm. The device is viewed as a
many common input formats such as SVG and successor to the Palm Centro smartphone and
3ds. Using OpenGL for most rendering work, will be Palm's second webOS device, after the
MT4j based applications perform very well in Palm Pre. The device was officially announced
combination with a hardware-accelerated on September 8, 2009, and is expected to be
graphics card. released by the end of 2009.

35
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

5. Zune HD[25] touch-based mechanics alone. Multi-touch on


The Zune HD is a Portable Media the iPhone and iPhone 3G is goo for
Player in the Zune product family by mechanical manipulation, but without
Microsoft released on September 15, 2009 in complementary input modalities like voice or
16GB and 32GB capacities.[4] It utilizes a finger-recognition, the current multi-touch
touchscreen interface for navigation and setup falls short of offering truly immersive
includes Wi-Fi for synchronization, access to user interaction. Imagine trying to resize and
the Zune Marketplace and Web browsing. The rotate a picture while changing its color and
Zune HD utilizes the Nvidia Tegra APX 2600 inserting a bit of text. It wouldn’t make sense
chip, allowing it to play 720p video through to resize and rotate the picture using your
the optional HDMI Zune dock on a high- voice. There are size and rotation-degree
definition television. Otherwise, content will parameters that are just more well-suited for
be scaled down to 480x272 pixels on the manipulation through touch-based multi-touch
player's OLED screen. technology – it’s just easier to pinch/stretch
and rotate the picture with your finger than
VIII. FUTURE SCOPE OF MULTI- with voice commands. Likewise, it’s easier to
TOUCH use voice commands to change the colour of
A. Classrooms of the future to have multi- the background picture or add a text string to
touch desks the picture. Apple proposes, in their latest
patent application, that future multi-touch
technologies will integrate almost all
foreseeable input modalities to complement
touch-based multi-touch. Mechanical
manipulations could be assigned to touch-
based commands, while voice commands are
relegated to functions that require choosing an
option from a list.
Fig.8.1. Future Classrooms
Fig
. 8.2
Researchers at Durham University in
.
the UK are working hard to ensure that the
academic sector isn't left out of the multi-touch
craze. The still-unnamed "interactive multi-
touch desk" is a major part of the SynergyNet
framework, an interactive classroom
environment they're building based on a
gaming engine called jMonkey. There's no
word on how soon SynergyNet will change the Image Resizing [26]
world, but when it does the software will be
open source -- allowing anyone to get into the
code and tweak it to their liking. In the
meantime, schools looking for a "high tech
initiative" of their own will have to be content
with throwing laptops at their students.
B. Apple’s vision of future multi-touch[26]
Apple’s vision of future multi-touch
includes voice, sight, and finger-recognition.
Leave it to the pioneers of multi-touch in the
mobile space to apply for a patent on all future
versions of multi-touch. Apple (NSDQ:
AAPL)’s latest patent filing suggests that
Cupertino is working to integrate more than
just touch-based input modalities in to future Fig.8.3. Using Voice command and Multi-
versions of multi-touch. The problem with the Touch Together [26]
current multi-touch setup is that it’s tailored to

36
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

Furthermore, Apple envisions future using a force and position sensitive


multi-touch setups incorporating finger- screen". Computer Graphics. (accessed
recognition that would allow the user to assign on 12 October 2009)
different functions to each finger – which [9] Keuling, Christopher (2008-11-03).
would allow for more complex commands to "Touchscreens Press Deep Into Consumer
be issued. Gaze-vector tracking could allow Electronics". ECN Magazine. (accessed
your future laptop or mobile phone to on 12 October 2009)
recognize which menu you’d like to select and [10] Sensors help make ticket machines vandal
automatically drill down to the appropriate proof". Engineeringtalk (accessed on 12
category. Then there’s the proposition of using October 2009)
finger-tracking and force-sensing technologies [11] Beyers, Tim (2008-02-13). "Innovation
that would allow the user to affect different Series: Touchscreen Technology". The
degrees of control depending on finger- Motley Fool. (accessed on 12 October
pressure and position. Combine all that with 2009)
accelerometer support and some pixie dust and [12] Acoustic Pulse Recognition Touchscreens,
you’ve got the makings of a seriously Elo Touch Systems, 1888-07-31
immersed control environment. We can’t wait (accessed on 12 October 2009)
for a future filled with multi multi-touch and [13] http://www.perceptivepixel.com/
pixie dust! (accessed on 12 October 2009)
[14] http://en.wikipedia.org/wiki/Microsoft_Su
IX. REFERENCES rface (accessed on 12 October 2009)
[15] http://www.microsoft.com/surface/
[1] http://en.wikipedia.org/wiki/Multi- (accessed on 12 October 2009)
touch#cite_note-11 (accessed on 12 [16] http://nuicode.com/projects/wiki-
October 2009)
book/files (accessed on 12 October 2009)
[2] Buxton, Bill. 2008. Multi-Touch Systems
[17] flosc: Flash Open Sound Control
that I Have Known and Loved.
http://www.benchun.net/flosc/ (accessed
http://www.billbuxton.com/multitouchOve
on 12 October 2009)
rview.html (accessed on 29 September
[18] http://code.google.com/p/sparsh-ui/
2009).
(accessed on 12 October 2009)
[3] http://en.wikipedia.org/wiki/Touchscreen
[19] http://www.python.org/ (accessed on 12
(accessed on 12 October 2009)
October 2009)
[4] "Multi-touch comes to resistive
[20] Dell Propels Laptop Innovation With First
touchscreens",
Multi-Touch Convertible Tablet, Larger
http://www.umpcportal.com/2009/02/mult
Solid State Drive (accessed on 12 October
i-touch-comes-to-resistive-touchscreens
2009)
(accessed on 29 September 2009)
[21] Windows Vista Team Blog. "Microsoft
[5] Patschon, Mark (1988-03-15), Acoustic
demonstrates Multi-touch". MSDN Blogs.
touch technology adds a new input
Retrieved 2008-05-28. (accessed on 12
dimension, Computer Design(accessed on
October 2009)
12 October 2009)
[22] http://en.wikipedia.org/wiki/IPod_Touch
[6] Please Touch! Explore The Evolving
(accessed on 12 October 2009)
World Of Touchscreen Technology".
[23] http://en.wikipedia.org/wiki/Palm_Pre
electronicdesign.com.
(accessed on 12 October 2009)
http://electronicdesign.com/Articles/Index.
cfm?AD=1&ArticleID=18592. (accessed [24] http://en.wikipedia.org/wiki/Palm_Pixi
on 12 October 2009) (accessed on 12 October 2009)
[7] Kable, Robert G. (1986-07-15), [25] http://en.wikipedia.org/wiki/Zune_HD
Electrographic Apparatus, United States (accessed on 12 October 2009)
Patent 4,600,807, http://rwservices.no- [26] http://www.intomobile.com/2008/09/05/ap
ip.info:81/pens/biblio86.html#Kable86 ples-vision-of-future-multi-touch-
(accessed on 12 October 2009) includes-voice-sight-finger-
[8] Minsky,M.R. (1984-07-01). "Manipulating recognition.html (accessed on 12 October
simulated objects with real-world gestures 2009)

37
Journal of Sci. Engg. & Tech. Mgt. Vol 2 (2), July 2010

[27] Nakatani, L. H., John A Rohrlich (1983). has more than 06 years of experience in
"Soft Machines: A Philosophy of User- teaching and industry. He was Lecturer in
Computer Interface Design". Proceedings Dept. of Information Technology at
of the ACM Conference on Human Thadomal Shahani Engineering College,
Factors in Computing Systems (CHI’83):
Bandra(w), Mumbai for nearly 04 years.
12-15.
[28] Jeff Han, Multi Touch interaction Currently working as Assistant Professor in
Research http://cs.nyu.edu/~jhan/ftirtouch/ Computer Engineering at Mukesh Patel
(accessed on 13 october 2009) School of Technology Management and
Engineering, SVKM’s NMIMS University,
Vile Parle(w), Mumbai, INDIA. He is
X. Author Biographies member of International Association of
Engineers (IAENG) and International
Dr. H. B. Kekre has received
Association of Computer Science and
B.E. (Hons.) in Telecomm.
Information Technology (IACSIT),
Engineering. from Jabalpur
Singapore. His areas of interest are Image
University in 1958, M.Tech
Processing and Computer Networks. He has
(Industrial Electronics) from IIT Bombay in
about 45 papers in National/International
1960, M.S.Engg. (Electrical Engg.) from
Conferences/Journals to his credit with a Best
University of Ottawa in 1965 and Ph.D.
Paper Award at International Conference
(System Identification) from IIT Bombay
SSPCCIN-2008 and Second Best Paper
in 1970 He has worked as Faculty of
Award at ThinkQuest-2009 National Level
Electrical Engg. and then HOD Computer
paper presentation competition for faculty.
Science and Engg. at IIT Bombay. For 13
years he was working as a professor and head Digvijay Singh is pursuing B.Tech.
in the Department of Computer Engg. at (Computer Science) degree from Mukesh
Thadomal Shahani Engineering. College, Patel School of Technology Management and
Mumbai. Now he is Senior Professor at Engineering, SVKM’s NMIMS University,
MPSTME, SVKM’s NMIMS University. He Mumbai. His interest in Multi Touch
has guided 17 Ph.Ds, more than 100 Technology developed during the seminar
M.E./M.Tech and several B.E./ B.Tech report generation as part of third year B.Tech
projects. His areas of interest are Digital curricula.
Signal processing, Image Processing and
Computer Networking. He has more than 250 Tushant Mitaal is pursuing B.Tech.
papers in National / International Conferences (Computer Science) degree from Mukesh
and Journals to his credit. Presently He is Patel School of Technology Management and
Fellow of IETE and Life Member of ISTE Engineering, SVKM’s NMIMS University,
Recently six students working under his Mumbai. His interest in Multi Touch
guidance have received best paper awards. Technology developed during the seminar
Currently ten research scholars are pursuing report generation as part of third year B.Tech
Ph.D. program under his guidance. curricula.

Sudeep D. Thepade has Aman Pratap Singh is pursuing B.Tech.


Received B.E.(Computer) (Computer Science) degree from Mukesh
degree from North Maharashtra Patel School of Technology Management and
University with Distinction in Engineering, SVKM’s NMIMS University,
2003. M.E. in Computer Engineering from Mumbai. His interest in Multi Touch
University of Mumbai in 2008 with Technology developed during the seminar
Distinction, currently Perusing Ph.D. from report generation as part of third year B.Tech
SVKM’s NMIMS University, Mumbai. He curricula.

38

You might also like