You are on page 1of 8

The #1 Platform for

Connected Data H E L P I N G T H E W O R L D M A K E S E N S E O F D ATA

“ Otas eni deribus dus aut volupti umquossi apel


ium nonserum, adi officiur? Quisi odissim sunt
volesti usaecat rat fugitio. Optur?

Aliquam, coremporempe num iducit et estio descilicium fuga. sequid min evel moluptiureic totaece perspe nosandamus
Unt anda se la dollabo ratquidunt parumen imagnam eum fu- delestio cum esse maionse quatur sunt molupta doloritium
gitio mostiatur, con ne sedis invereiciae volenes eum quam, am soluptae natur?

FROM GRAPH TO
optatio. Ratem rerunt imolorum qui beatem ea eum quia Os et vel iustis ium re se cum ut quias aut vel inctor aut
quis int et ab inciti aut idestoreius iligenim labor si commos odissunto il imporrum estenesti delis arcimin pelisqui
necturernam, quide net rehent exceaqui antempo repelic omnisim inihitatio mod modit fugia seque con repra simus
ationet urerum exceptatiae. Officimagnat qui blatur autest debisimpos adi voluptur rerum ium il ipsam nonsed quia
imo doluptio et mincidipsam consequi te pre latur? consequam ut plab ius dit hillabo rrovid maxima quas volecto

KNOWLEDGE GRAPH
Illum quam, quatquae sus il ium cor re maximus comniates volorit ommolut reiur?
sequodici as intion none as dit ditias aut omnimi, coris illesti- Ditet eius ex expligent erspienissin cus volorrum reperibus
is ant quia pa ea ventibusdae denisi acestiur aut quam, ut si nis pore el imilige niscium consequunti reperibus dia
fugit lanto verumquias eat offictem est, volorrum renectur? voluptaspel iunt.

A short journey to
Ehenet que dia dolupit aut exercie netusda simus volupta Ed minctot aspit, ant omnia eruptas eliquas expla nos ra
cum int. quodit officius cust, simin porro coribusandam iliquas nobis
Gitatis maiosan deriae autecti dustior uptaect atibearume inctem doles ut essitios dus.
voluptiamus aut modit expellorent aut arciliq uossereius, aut Bit ex ea eum rem quod moditint.
iusdam voluptatius nos volor ressum, sin rem inciis saperio. Dignimin nimodi sam rem ni volest velique volorum quos-

unlimited insights
Daepti consequibus inciatur, omnitaeri unt et ut exped mos sin ctiissit excerio nsequae estiaectorem rerit qui que pra
nonsed quiditio quis explatiustis rest occuptassit quuntion dolut fuga. Nem sequiamusae rerspie nitatusaped quam
netum eate versper atectatur sum imi, con rest, es rereris di nimodit quatectiatem autest, corem qui ut et, et laceritatiis
consequatate volor alibus sinvenim ra aspid quam dolorepro quam faccus doloreped que cuptatis sim ut eume labor sin
berepro te con coreper spiduciatem eicimende non re cum- esciusandent aut lanis ex estibusae consenis porest ut mint
quisit, quassint pratia pore, verupta quiate con none abo. voluptur, sus dese non con none dest que veles quuntius
Nam nullam escipsum, opturit harit volum es esto exceped
ut etureic aernates adiae aut od mod maxim aut que int. Headline can be two line if possible would make it
Maya Natarajan
Picatquid eatem facerunt eaque veligen tiatemp orporiam
ressim ut veniam, volorectae rerovid elluptate quunditatia
look better I think
rae aut quae nobiti quam harcia quibus doluptas mollorio
nonet ventiorro vitatempos si ommodi offic tem volore con- vellaborepra vella doles dem doluptaspel eossita tendiae
Ssent,
E NqueI Oexplant
R DfugitI Rmaiosae
ECTO R , omnia
et arum, P R OquaeDU CT
non- Mestios
A Rmagnatet
K E T Iliquibus
N G autatiis exerrum dolupta voluptasi

2
Helping the world
make sense of data

Introduction
Unleashing the power of knowledge is imperative It’s time to connect data so it’s manageable and
for enterprises looking for a competitive edge. useful. Anyone in your organization who needs data
to do their job should know where to look for it, be
Everyone wants to capture knowledge, to connect confident in its accuracy, and be able to solve their
everything that they know. However, turning data own problems with ease.
into knowledge is still very much an ongoing effort
– though progress has been made, most data You get there by starting small, gaining immediate
landscapes are far from mature. value, and then expanding outward – from graph to
knowledge graph – the start of a short journey to
We need a way to connect the data we have, unlimited insights.
across all systems, clouds, backups, and data lakes.
Some of our data pipelines may be strong – serving What contributes to the data challenges we see
real-time, clean data to dashboards and informing today?
business processes – but that’s not enough. That
success must be replicated everywhere.

2
Helping the world
make sense of data

Data Trends and Challenges

Silos If GDPR compliance says you need to forget a


person’s data, do you know where all their data is?
First, there’s siloed data. Data stores and applications What about cookies that you may not even realize
often serve individual groups or departments. The belong to that person? What about data in your
HR team has one platform, and the sales team may logistics system and in your partner’s systems?
use another such as Salesforce. Systems of record
are important because they maintain and control Historical Data Goes Right Out the Window
data and establish governance policies, but data
silos also reduce speed to analytics, accuracy of Historical data fuels machine learning predictions.
reporting, and data quality. But the COVID-19 pandemic made historical data
obsolete as disruption rippled across the economy.
Data Sprawl and Data Lakes For example, historical data would normally be used
to predict purchasing behavior.
Most organizations have data lakes, data
warehouses, relational databases that underlie The pandemic caused purchasing behavior to
systems of record, customer data, transactional change almost overnight as online shopping came
data, product data, and order data, and the list to dominate the market during the lockdowns. Due
goes on. This scattering of data contributes to data to these dramatic shifts in consumer behavior,
sprawl. historical data could no longer make accurate
predictions about purchasing behavior.
Data lakes are popular as a low-cost option for storing
large amounts of structured, semi-structured, and With limited data, connections in the data increase
unstructured data. Object storage such as Amazon in importance and value. Putting data in a graph
S3 is often used to create data lakes. database captures connections and relationships.

From a cost standpoint, data lakes are attractive; The data you have is valuable, but storing the
they give you a place to store any type of data, relationships in and across that data – relationships
including log files generated by applications and that already exist – increases your ability to predict
services. It’s easy and convenient to put data into even in the absence of relevant historical data.
a data lake. However, governing that data and even That’s because connections and relationships are
knowing what is there becomes a challenge. the most predictive elements in the data.

Cloud Storage All of these factors are driving enterprises like yours
to move toward connecting data in a graph database
Cloud computing is revolutionary and yet remains to gain knowledge. The next section examines
a governance challenge, with more data stored in exactly how a graph becomes a knowledge graph.
more systems. Consider your own personal cloud
data, which may live in iCloud, Google Drive, Dropbox,
and more, as well as in Evernote, Gmail, and Notes.
“ Murky, messy data
This leaves us with a massive amount of data that
is not only in different data formats and partially is a compliance
redundant but also largely disconnected.
nightmare.

3
Helping the world
make sense of data

What Is a Knowledge Graph?


Here is a definition: A satellite manufacturer wanted to integrate all of
A knowledge graph is an interconnected dataset its processes and data across the entire life cycle
enriched with semantics so we can reason about of its highly complex products – Product 360.
the underlying data and use it confidently for
complex decision-making. “In the past we’ve had someone manually identify
the root cause of a failure,” said the chief data
Now let’s unpack that definition a little at a time. officer. “They’d look at everything that could
have influenced a part’s failure. Is it engineering?
The Value of Connected Data Procurement? Supplier? Is it a vendor issue or a
manufacturing defect? The idea is to let the graph
A knowledge graph is an interconnected dataset
do those traversals and find variance and report it
enriched with semantics so we can reason about
back instantaneously versus a human taking weeks
the underlying data and use it confidently for
to do it manually.”
complex decision-making.
Connecting data adds context and improves out-
Enriched with Semantics
comes. The best doctors do their homework. If
a doctor takes the time to review the reason for A knowledge graph is an interconnected dataset
the appointment – your medical conditions, your enriched with semantics so we can reason about
medical history, any lab tests you’ve had, and your the underlying data and use it confidently for
vital signs – that visit will be far more productive. complex decision-making.
The doctor has context: the context of you. Connecting data makes that data inherently more
valuable and provides dynamic context. Adding even
A more obvious example is LinkedIn, where the more information – semantics – takes that value to
number of connections we have is considered a a new level.
measure of success (500+ connections). Like all
prominent social networks, LinkedIn is built on a
graph data structure.

RDF: The Semantic Elephant in the Room


In 1989, Dr. Tim Berners-Lee created what we How can those interested in a short trip to a
now know as the web. He invented it because the knowledge graph make use of that kind of data?
scientists associated with CERN, a particle physics
laboratory in Switzerland, had a hodge-podge of Neo4j knowledge graphs incorporate RDF as well
computers and if they could simply share their as any other type of data you may have. Neo4j is
research,
A knowledge they
graph could
is an make progress
interconnected datasetmuch with semantics asoproperty
faster.
enriched graph.
we can reason That
about themeans it’s data
underlying simple,
and intuitive,
use it
confidently for complex decision-making. and scalable. Simple because it can capture any in-
Not quite 10 years later, in 1998, Dr. Berners-Lee formation expressed in RDF or any other data mod-
declared
“When you that the
start looking Semantic
at what Web wouldyou
kind of documents behave
the and
nexthow you’reel.
ableIntuitive because
to transform it works
those into on knowledge
actionable straightforward
for your end
bigimprove
users, you thing. your
Anddecision-making,”
work began on standards
said David Meza,like XML
senior building
data scientist at NASA.blocks of nodes and relationships, which
and the Resource Description Framework, or RDF. makes it easy to understand and explain. Scalable
Dooloo, a startup based in France, has developed a knowledge graph-based platform because it canpeople
to assist accommodate billions
who experience of nodes
chronic and
pain. By
Since
combining that time,
a person’s unique much
history diligent
with the work has been
latest research andin-
treatments,trillions
patientsof andrelationships.
their caregivers quickly discover next steps to
improve vested in ontologies
their quality such manage
of life and better as FIBOtheir
andsymptoms.
SnoMed us-
ing RDF-based vocabularies. Large public datasets The bottom line: Use the richness of RDF with the
like WikiData are also available as RDF. simplicity, intuitiveness, and scalability of Neo4j
knowledge graphs.
4
Helping the world
make sense of data

In practical terms, if you create a graph, even a graphs become increasingly valuable.
graph on a whiteboard, you will find yourself want-
ing to add more detailed information. That additional For example, with NASA’s Lessons Learned Knowl-
information is semantics: an overlay of meaning on edge Graph, the engineers could look at any partic-
top of the graph. ular part and gain insight into how it is used. Engi-
neers could ask for a particular part and find out
At NASA, David Meza sought to incorporate hun- which subsystem it relates to. They might explore
dreds of millions of documents, reports, project data how all of the subsystems managed by different
summaries, lessons learned, scientific research, groups come together and try to get a feel for the
medical analysis, and more into a knowledge graph. system as a whole.
Meza ran the text of all these documents through a
topic modeling algorithm that uncovered tags that Fraud investigators find themselves drowning in
could be applied to the content, enriching the graph. thousands of alerts of potential fraudulent activi-
ty. Sifting through these alerts requires looking in
According to Meza, NASA’s Lessons Learned several systems, which is time-consuming. A knowl-
knowledge graph accelerated the mission to Mars, edge graph brings together many data sources, in
saving at least a year and over $2M in research and this case customer attributes, credit scores, loca-
development. tion, payment history, and more, so that investiga-
tors quickly hone in on fraudulent activities and see
So given his experience, what is Meza doing now? patterns they would not have noticed otherwise.
He’s moved onto the next frontier: finding the right
people to move NASA forward at a time when the
Use It Confidently
private sector is diving in deep.
A knowledge graph is an interconnected dataset
Talent is naturally a graph. NASA’s new knowledge enriched with semantics so we can reason about
graph is used to find people with particular the underlying data and use it confidently for
knowledge, skills, and abilities. But roles and job complex decision-making.
titles are not standardized, so Meza mapped them Data brawls. You may not have seen a bar room
to existing ontologies from government databases, brawl, but most of us have seen a data brawl (or,
including the US Department of Labor’s O*NET, OPM more politely, a data disagreement). They arise from
(Office of Personnel Management) classifications, discrepancies where the numbers just don’t match.
and the EU’s ESCO. “Where did you get your numbers?” someone asks.
“That’s not what my data says.”
While NASA has particular needs when it comes
to hiring talent, this is a broad challenge most How can we use data confidently? By knowing
organizations face – one where a knowledge graph where it came from, how much transformation it’s
could be of benefit no matter the mission. been through, whether it’s been cleansed, when it
was last updated, and who updated it.
Reason About the Underlying Data
With a knowledge graph, it is easy to add this type
A knowledge graph is an interconnected dataset of metadata to the graph. Knowledge graphs excel
enriched with semantics so we can reason about at capturing details about data, and this metadata
the underlying data and use it confidently for becomes part of the knowledge graph. In the
complex decision-making. event of a data brawl (or data discussion), these
Knowledge graphs provide deep, dynamic context. types of details are invaluable in investigating any
They enable people to find all related information discrepancies.
in one place, with all of the relationships across
that data. As you add more information, knowledge

5
Helping the world
make sense of data

For Complex Decision-Making Dooloo, a startup based in France, has developed


a knowledge graph-based platform to assist people
A knowledge graph is an interconnected dataset who experience chronic pain. By combining a
enriched with semantics so we can reason about person’s unique history with the latest research and
the underlying data and use it confidently for treatments, patients and their caregivers quickly
complex decision-making. discover next steps to improve their quality of life
“When you start looking at what kind of documents and better manage their symptoms.
you have and how you’re able to transform those
into actionable knowledge for your end users, you
improve your decision-making,” said David Meza, se-
nior data scientist at NASA.

Categories of Knowledge Graphs

The world of knowledge graphs, like knowledge it- and recommendations. Actioning knowledge graphs
self, is multifaceted and broad, almost endless. In bring together all data about customers, products,
general, knowledge graphs fall into two categories: and more into a 360-degree view that drives a wide
actioning knowledge graphs and decisioning knowl- variety of actions such as identification of custom-
edge graphs. ers in danger of churning, as well as recommenda-
tions about the kind of offer that might persuade
Actioning Knowledge Graphs for them to stay.
Data Management
Decisioning Knowledge Graphs for
Data management is an important use case for Data Analytics
knowledge graphs. Born-digital companies like Lyft
and Airbnb thrive on data, and enabling their data Knowledge graphs form the foundation of modern
scientists to find the latest, freshest data is key to data and analytics. With data captured in a knowl-
their success. edge graph, you no longer need to guess at correla-
tions: all the relationships inherent in your data are
These companies and others have used knowledge captured and stored. In this way, knowledge graphs
graphs to create metadata hubs to capture data represent a more faithful representation of data
lineage: where their data came from, how it was and enable you to unlock its predictive power.
transformed, and how it was cleansed. Knowledge
graphs model complex data pipelines so that With a decisioning knowledge graph, the ultimate
consumers and producers of data can be easily goal is to make a better decision, whether that’s
identified and new data sources can be integrated. a human decision or an algorithmic decision. Those
decisions can be supported in several ways.
With a strong foundation regarding the provenance
of data, you can take action on that data, confident Graph queries enable you to answer any question
of where it came from and who its producers and of your knowledge graph, at scale. Boston Scientific
consumers are. uses advanced queries to do root cause analysis
and identify combinations of at-fault components
In addition to data management use cases, action- that result in defects (an anti-recommendation of
ing knowledge graphs are used for personalization sorts).

6
Helping the world
make sense of data

Graph algorithms identify patterns in your data, AstraZeneca uses graph algorithms and machine
such as the shortest path between two points or learning on its knowledge graph to identify patient
the most influential customers. journey archetypes and patterns. This research
enables the company to identify influential
OrbitMI uses a decisioning knowledge graph to touchpoints for early interventions to improve
perform complex route planning for container ships. patient outcomes for illnesses like kidney disease.
Using pathfinding algorithms, they plan maritime
routes in less than a second. Furthermore, their
knowledge graph backs their SaaS analytics offering.
The knowledge graph’s impact is not only economic:
greater efficiency in complex route planning reduced
carbon emissions by 60,000 tons.

Graph queries and algorithms can also uncover


predictive features for machine learning.

7
Helping the world
make sense of data

A Short Journey to Unlimited Insights


A graph becomes a knowledge graph when we over- If you decide to tweak the structure of your knowl-
lay an organizing principle. In this way, knowledge edge graph or add data sources, Neo4j makes it sim-
graphs make data smarter. ple to modify the structure of your graph and reflow
the data into that structure. That agility is part of
The transition from graph to knowledge graph is the underlying graph database.
subtle and organic. Create a graph of your supply
chain, for example. Add in information about how This deceptively simple structure supports ad-
long you have worked with partners in that supply vanced AI and machine learning. It is no wonder
chain, their locations, modes of transportation, that every graph data science project starts with a
discounts, and any delays in shipping that must be knowledge graph.
factored in – and you’ve got yourself a knowledge
graph. Neo4j offers the most comprehensive knowledge
graph on the market for data management, data
This is the beauty of knowledge graphs in Neo4j. You analytics, and all the way to machine learning.
can start small, solving a practical challenge of any
kind, and enrich your knowledge graph incrementally Learn more about the fastest path to knowledge
to solve additional use cases and serve additional graphs at neo4j.com/knowledge-graph.
stakeholders.

Neo4j is the leader in graph database technology. As the world’s most


widely deployed graph database, we help global brands – including
Comcast, NASA, UBS, and Volvo Cars – to reveal and predict how people, Questions about Neo4j?
processes and systems are interrelated. Using this relationships-
first approach, applications built with Neo4j tackle connected data Contact us around the globe:
challenges such as analytics and artificial intelligence, fraud detection, info@neo4j.com
neo4j.com/contact-us
real-time recommendations, and knowledge graphs. Find out more at
neo4j.com.

You might also like