Professional Documents
Culture Documents
MAY 2015
Business
Information
INSIGHT ON MANAGING AND USING DATA
SPECIAL ISSUE
Different
Doors for Data
The database management systems
market is a large one already, and
bigger, more powerful options are
added all the time. But don’t settle
for less—or more.
EDITOR’S NOTE | JASON SPARAPANI
HOME
Underneath It All
EDITOR’S NOTE
DATABASE LANDSCAPE
IS LUSH, VARIED
THINK FOR A moment about the term database man- Not that relational is going anywhere. In fact, as Mul-
IS RELATIONAL
THE DATABASE
agement system. It strings together Latin and Greek lins advises, “It’s wise to evaluate relational database op-
RELATIONSHIP derivatives. It has eight syllables. Now think about the tions first.” They can be used for so many purposes: data
YOU NEED?
technology behind the words. It powers operational and management and analytics applications, online transac-
KNOW YOUR analytics applications; it’s the very source of business tion processing and batch processing, to name a few.
OPTIONS BEFORE
SAYING YES TO
livelihood. For humans, a comparable resource comes to But evaluating databases, relational or otherwise, isn’t
NoSQL mind: water. One Anglo-Saxon word, two syllables. easy. Mullins recommends matching any shopping list
The colorless initialism DBMS also fails to get across against criteria such as the underlying architecture or
the fundamental purpose database systems serve. Their administration requirements. Then there’s the question
importance should never be understated—and neither of whether the job market can provide the right program-
should evaluating the different categories before making ming skills. That becomes a critical one to answer when
a purchase. The wrong choice can leave your data-driven considering newer, NoSQL databases.
business operations feeling parched. But even if you decide to do things the NoSQL way,
In this special edition of Business Information, data which way is that? There are four product categories,
management consultant and author Craig S. Mullins each suited to particular uses. And, Mullins warns, there
gives advice on today’s database market. It’s not the one are no standards, as there are with relational systems.
of 30, 20, 10 or even five years ago, when the relational Here’s another, very human analogy. Just as the body
database was essentially the only database in town. To- relies on a skeletal frame to allow us our every move,
day, large-scale Web applications, streaming video and your data needs the right underlying database system for
audio information and social media data are too much support. Only then will it be able to deliver the informa-
for conventional relational systems to handle. New data tion and insight that enables you to make better business
types call for radically new approaches, like storing data decisions. Bones. One word, one syllable. n
in memory instead of on disk, or stepping aside from
systems based on the SQL programming language and What database systems are you looking into? Write to me at
turning to NoSQL technologies. jsparapani@techtarget.com.
DATABASE
LANDSCAPE IS The database management system
LUSH, VARIED
The variety of database products on the market
is the heart of today’s operational and
analytical business systems. Data is
is multifarious—and each system type has its own
the lifeblood of any organization, and
advantages and uses. the database is the conduit by which
data is stored, managed, secured
and served to applications and users.
Nowadays, though, there is an in-
creasing variety of database products
to consider.
Relational databases have been the norm in IT for
nearly 30 years. First developed in the 1970s, they be-
came more popular as they—and the server hardware
that they primarily ran on—became both more powerful
and more affordable, making them a practical choice for
more and more organizations. But shortcomings became
more apparent with the full computerization of business.
Today, IT departments trying to process unstructured
data or data sets with highly variable structures may
HOME want to consider NoSQL technologies. Applications
HOME that require speed or perform complex tasks can benefit NoSQL. Web-scale data processing and big data require-
from in-memory databases. Some IT departments will ments challenge the capabilities of relational databases.
EDITOR’S NOTE want to look at combining multiple database technolo- Products with more flexible schemas, less rigid consis-
DATABASE LANDSCAPE
gies for certain processing needs. tency models and reduced processing overhead can be
IS LUSH, VARIED advantageous in a rapidly changing and dynamic envi-
The Expanding Database Galaxy
IS RELATIONAL
Until relatively recently, the relational database, or
BIG DATA AND AN ONSLAUGHT
THE DATABASE
RELATIONSHIP RDBMS, was the only database category worth con-
OF NEW TECHNOLOGIES ARE
YOU NEED?
sidering. But big data has brought new types of worthy
KNOW YOUR
OPTIONS BEFORE
products. Also, an onslaught of new technologies and COMPLICATING THE DATABASE
capabilities are being added, further complicating the
SAYING YES TO
NoSQL mix. Here are the three main categories.
LANDSCAPE.
RDBMS. The undisputed leader continues to be the re- ronment. Enter NoSQL. While the RDBMS requires
lational database. It provides data storage, access and a rigidly defined schema, a NoSQL database permits a
protection with reasonable performance for most appli- flexible one. For loosely defined data structures that may
cations, whether operational or analytical. For the better evolve over time, a NoSQL database can be more practi-
part of three decades, the primary operational database cal. NoSQL addresses some of the problems encountered
has been relational, led by industry titans such as Oracle, by relational technologies, making it simpler to work
Microsoft and IBM. The RDBMS is reliable and adapt- with large amounts of unstructured data.
able to most uses; it also has been bolstered by years of
use in industry applications at organizations of all sizes. In-memory. Sometimes referred to as a main-memory
Of course, such stability comes at a cost: Relational data- database system, an in-memory DBMS relies mostly on
base systems aren’t cheap. memory to store data. The primary use for the in-mem-
Support for ensuring transactional atomicity, consis- ory database is to improve performance. Because data
tency, isolation and durability—collectively known as is maintained in memory, as opposed to on a disk, I/O
the ACID properties—is a compelling feature of rela- latency is greatly reduced. Mechanical disk movement,
tional databases. ACID compliance guarantees that all seek time and buffer transfer can be eliminated because
transactions are completed correctly or that a database is the data is immediately accessible.
returned to its previous state if a transaction fails. An in-memory database can also be optimized to
HOME access data in memory; a traditional DBMS is optimized columns of many rows at once. They are not optimized
to access data from a disk. In-memory products can re- for writing data.
EDITOR’S NOTE duce overhead because the internal algorithms usually
DATABASE LANDSCAPE
are simpler, with fewer CPU instructions. n opular in the 1990s and still available from some
P
IS LUSH, VARIED vendors, object-oriented databases are designed to
More Places for Data work with object-oriented programming languages.
IS RELATIONAL
THE DATABASE
A growing category is the multi-model database system,
RELATIONSHIP which supports more than one type of storage engine. n re-relational databases include hierarchical systems,
P
YOU NEED?
Many NoSQL products support more than one data such as IBM’s Information Management System, and
KNOW YOUR model—for example, document and key-value. And rela- network systems, like CA IDMS, running on large
OPTIONS BEFORE
SAYING YES TO
tional database products are evolving to support NoSQL mainframes.
NoSQL capabilities.
There are other DBMS categories, but they’re not What’s Next
as prevalent as relational, NoSQL and in-memory: Organizations considering a database management
system should determine what their specific needs are
n ML databases are designed to support XML data.
X as well as examine the leading DBMS products in each
But most relational databases today provide XML category. Doing so will require more details on each of
support. the different types and a better understanding of the uses
each technology is optimized for. Indeed, there are many
n olumnar databases are based on the SQL program-
C variables that need to be evaluated to ensure you make a
ming language and are optimized for reading a few wise decision when shopping for database software. n
IS RELATIONAL
THE DATABASE When shopping for a database man-
HOME Perhaps the strongest capability of the relational databases can comfortably support.
database is its robust implementation of transactional Of course, there are additional areas of concern when
EDITOR’S NOTE atomicity, consistency, isolation and durability. What are considering whether to buy a relational database. One
DATABASE LANDSCAPE
known as the ACID properties guarantee that database of the biggest hurdles is the high cost of acquisition. The
IS LUSH, VARIED transactions are processed reliably. When a transaction is purchase price ranges from several thousand dollars to
executed on a relational database, it will either complete, more than $1 million, depending on the amount of data
IS RELATIONAL
THE DATABASE
producing correct and up-to-date results, or terminate. that needs handling or the size of the computer the data-
RELATIONSHIP In either case, the resulting condition of the database will base will run on.
YOU NEED?
be a consistent state. Feature bloat is another issue. Although they have ro-
KNOW YOUR Delivering ACID support requires processing overhead bust, time-tested functionality, relational databases also
OPTIONS BEFORE
SAYING YES TO
and can reduce data availability, because subsequent come with built-in features that may not be needed. Such
NoSQL transactions must wait until changed data is committed features can be difficult to learn how to use and sup-
to the database. This is a fair tradeoff when dealing with port—and they can bog down performance.
business-critical data such as financial transactions. And relational databases require a rigid schema. All
Banking data and other critical information must be up- of the columns must be predefined with a specific data
to-date and accurate at all times. type and length before a table can be used. Newer,
NoSQL database systems don’t have such a requirement,
Relational Uses, Shortcomings allowing developers to adapt schemas based on changing
Most traditional data management and analysis applica- data needs.
tions, including online transaction processing, batch pro-
cessing, mixed workloads and business intelligence, are What to Look For
valid uses for relational databases. They’re also the prime When evaluating any type of database management sys-
choice for consistent data structures and definitions and tem, IT directors should keep several standard criteria
for processing traditional data types such as numbers, top of mind.
dates and alphanumeric values.
But relational systems can struggle to keep up with n Architecture. Relational database architecture is
applications that call for many different types and suitable for most data management needs but can be
amounts of data. Social media data, streaming audio problematic for projects requiring flexible schemas or
and video, and information from the Internet of Things complex inter-relationships between data elements.
all require more flexibility than traditional relational (Continued on page 9)
HOME
KNOW YOUR
OPTIONS NoSQL database technology rep-
HOME but the value is a blob of meaningless data that must be The Weighing Game
interpreted after it has been read. So why consider a NoSQL database instead of a relational
EDITOR’S NOTE one? Perhaps the greatest strength of a NoSQL system
DATABASE LANDSCAPE
n A document database manages and stores data at the is its decentralized, scalable, fault-tolerant nature. Most
IS LUSH, VARIED document level. It’s similar to a key-value database, but NoSQL database technology is implemented to scale and
a structure is imposed on the data. Instead of the value survive outages.
IS RELATIONAL
THE DATABASE
component being a blob of data, the data is described in While relational databases are engineered to be prac-
RELATIONSHIP a document structure, typically JSON or XML. A docu- tical for various uses, NoSQL databases are designed for
YOU NEED?
ment database can be queried by any component of its specific uses. By embracing polyglot persistence—the
KNOW YOUR defined schema, while a key-value database can be que- concept of using different database systems for different
OPTIONS BEFORE
SAYING YES TO
ried only by its key. applications and uses—an organization can choose the
NoSQL database technology that best suits a particular use.
n A column database, also known as a column store or Also, most NoSQL products are lightweight and there-
wide column store, reorients the focus of the data from fore require less overhead than relational systems do.
the row to the column, storing data as sections of col- And there’s less functionality than in most relational
umns rather than as rows. With traditional relational da- databases, which are designed for a broader set of uses.
tabases, data is modeled as rows of columns with access So a NoSQL database will require less code, which poten-
always by row. The NoSQL wide column store manages tially confers a performance benefit over a more complex
records in “families” capable of holding a large number system.
of dynamic columns. There is no fixed schema, meaning Of course, NoSQL has its disadvantages. First, the
column names and keys can vary. A column database is many differences in the range of options make evaluating
suitable when writes are uncommon, ACID properties— NoSQL systems difficult. There are different architec-
atomicity, consistency, isolation and durability—aren’t a tures for the different NoSQL database types, and even
hard requirement and the schema is variable. for the products in each category. Also, there are no
standards, including no standard way of accessing data.
n A graph database focuses on relationships between This means specific tools and APIs must be adopted and
values and stores data using a graph structure with learned to use a NoSQL database.
nodes, edges and properties to represent and store data. Another issue is ACID support, which is standard for
Every element contains a direct pointer to its adjacent relational database management systems but nonexistent
element, and index lookups are unnecessary. in many NoSQL systems today. One of the early selling
HOME points for NoSQL was that it supported transactions of support could keep prospective buyers away.
that didn’t require full ACID support, thereby slimming And there’s also the lack of support for the SQL pro-
EDITOR’S NOTE down the database and improving performance. But gramming language. In its 40-year-plus lifespan, SQL
DATABASE LANDSCAPE
many applications rely on ACID compliance, so a lack has become the lingua franca of data access. A database
IS LUSH, VARIED
IS RELATIONAL
THE DATABASE
RELATIONSHIP
YOU NEED?
There’s a NoSQL Database for That
KNOW YOUR THE RAISON D’ÊTRE for NoSQL is to apply data per- transaction processing or for applications that re-
OPTIONS BEFORE
sistence techniques that are well-suited for specific quire data aggregation.
SAYING YES TO
NoSQL uses. Let’s examine each of the four database de-
signs and what they’re suitable for. n Column store. Useful when writes are rare and
there’s a need to read a few columns of many rows
n Key-value. Designed for the high-availability, at once. Column stores work well for event logging,
low-latency requirements of applications in fields like content management and both counting and catego-
gaming, retail and mobile. A flexible schema makes rizing data for analytics. They’re also good with ex-
key-value databases good for serving ad content and piring data; a column can be set up to automatically
managing user or product profiles. They aren’t good expire. Column stores aren’t well-suited for ACID
for managing complex relationships between differ- transactions.
ent sets of data.
n Graph. Designed for interconnected data elements.
n Document. Stores different types of data in doc- The most common use is for a social media network,
ument-like structures. Document databases have a such as LinkedIn or Facebook. Another is to support
flexible schema and allow for searching across all a recommendation engine, such as those used by
of the data. They work well for event logging, online online retail sites. Graph databases aren’t well-suited
Check out this shopping, content management and in-depth analyt- for frequently changing data and real-time updates
online guide for ical processing. They aren’t well-suited for complex across large amounts of data. —C.M.
more on NoSQL
databases.