Professional Documents
Culture Documents
Bourke - Product Data Finding It - Using It by Deploying A Search Engine Solution
Bourke - Product Data Finding It - Using It by Deploying A Search Engine Solution
0068
An Application Brief
July 2013
Contents
Introduction 1
The Power of Enhanced Search Engines 3
Answers and Comments – Before Moving Ahead 5
Justifying an Investment 7
Searching Methods 9
The Searching Process 11
A Year End Status Report 13
Product Data: Finding It and Using It – by Deploying a Search Engine Solution
Introduction
Background
To start, I’ll give a brief description of some basic terms and capabilities.
A key point: SESs use semantic technology. Semantics is the
relationship of words in order to understand their meaning. Suffice to
say, elements of semantics are a fundamental building block for SESs.
© Bourke Consulting Associates, 2013 Reproduction permissible provided attribution is retained 1|P a g e
Product Data: Finding It and Using It – by Deploying a Search Engine Solution
Additional issues and questions will be tackled later, for instance, to cite
a few:
How should an SES coexist with PLM?
How do SESs relate to other product data-oriented systems, such as
Master Data Management?
How do they negate costly and time-consuming data repurposing
and migrating efforts to obtain search results?
As this series unfolds, I’ll describe in more detail the “how” of purpose-
designed search solutions and their importance to the entire knowledge
worker community, comprising more than just the product development
community. To accept the value (benefit) propositions for an SES
compels more understanding of how these systems work and what they
do – and, for the sake of clarity and understanding – what they can’t do.
In other words, let’s have clear expectations.
Acknowledgement
A big measure of thanks to the staff of The vdR Group, Orange, CA, for their assistance in the
preparation of this series. They are a software development company focused on integration
technologies and solutions for engineering and manufacturing environments. Since the mid-
2000s, the firm has been involved in developing Search-Based Applications.
1
July 2013. Readers should be aware that this list is only representative of a wide range of available options.
Other vendors might be considered if they meet an enterprise’s selection criteria.
© Bourke Consulting Associates, 2013 Reproduction permissible provided attribution is retained 2|P a g e
Product Data: Finding It and Using It – by Deploying a Search Engine Solution
Preface
When mentioning search, many readers think first of Google or Bing.
Most general-purpose search engines, however, are not suitable – as is
– for engineering and manufacturing search applications. Nonetheless,
some can be enhanced to find product data scattered throughout an
enterprise in various applications, repositories and files.
In this column, I’ll describe the power of a search engine solution (SES)
as it may be applied to meet the needs of engineering and
manufacturing enterprises. Be aware, though, that software choices
listed in Part 1, make use of a wide range of semantic-based
technologies. The following, therefore, is a generalized description of
SES capabilities.
Indexing
Search engines operate with an index – an optimized file format that
supports rapid data accessing and displaying of search results. It is
defined as:
“A systematically arranged list; in computerized systems it is a
representation of content to speed retrieval by the governing
algorithms.” (Outsell/Gilbane)
© Bourke Consulting Associates, 2013 Reproduction permissible provided attribution is retained 3|P a g e
Product Data: Finding It and Using It – by Deploying a Search Engine Solution
In other words, full text does not rely on the enterprise’s metadata being
consistent. (A real “gotcha” with a relational database query is that it
does rely on consistent metadata – a questionable assumption.) Users,
therefore, need full text capabilities to drive through various file formats
to access all relevant product data crucial for decision-making purposes.
© Bourke Consulting Associates, 2013 Reproduction permissible provided attribution is retained 4|P a g e
Product Data: Finding It and Using It – by Deploying a Search Engine Solution
Preface
Search Engine Solutions (SESs) have been leveraged to help
engineering and manufacturing companies in their constant struggle to
find relevant product data – for timely decision-making. Finding it –
Using it supports strategic initiatives, such as Design Re-use, for
minimizing the risk, time and cost of product development.
© Bourke Consulting Associates, 2013 Reproduction permissible provided attribution is retained 5|P a g e
Product Data: Finding It and Using It – by Deploying a Search Engine Solution
Awareness is Growing
As engineering and manufacturing enterprises become overwhelmed
with unstructured product data, there is growing awareness of the
persistent pressure for search solutions.
One of the most definitive reports yet, however, is Best Practices for
Managing Design Data, published recently by Tech-Clarity. Of the best
practices identified in this comprehensive survey, sponsored by
Siemens, fast product data searching and retrieval is paramount.
© Bourke Consulting Associates, 2013 Reproduction permissible provided attribution is retained 6|P a g e
Product Data: Finding It and Using It – by Deploying a Search Engine Solution
Justifying an Investment
Introduction
With any emerging information technology (IT), a key question arises: What’s
in it for us? Fortunately, there are compelling reasons for companies to invest
in a search engine solution (SES). Now, let’s explore this subject – before
returning to the “How to” of SESs.
Numerous surveys over the years have noted the excessive time lost by
knowledge workers searching for needed product data. Consulting firms and
manufacturing companies have identified figures as high as 25% in a day
because of wasteful and often futile efforts to find data. Moreover, CIMdata
cites that engineers spend half or more of their time searching for data and
then validating it.
These time outlays and costs are avoidable and should be estimated as an
essential element for justifying an SES investment.
Another category of “Costs to Avoid:” Those that have not yet incurred, even if
under consideration, or already budgeted. One prime example, from above, is
the potential costs for repurposing/migrating existing classification/coding
conventions and data. This approach is supposedly considered necessary for
effective information retrieval. Avoiding such expenditures, nevertheless, is a
valid and significant benefit for deploying an SES.
© Bourke Consulting Associates, 2013 Reproduction permissible provided attribution is retained 7|P a g e
Product Data: Finding It and Using It – by Deploying a Search Engine Solution
Claiming just time-savings, though, may not resonate sufficiently with those
who approve new IT projects.
Furthermore, this discussion has focused on the “gross” numbers for savings
and values. For a complete justification case, however, a “netted” number
would account for costs of software, training and implementation.
© Bourke Consulting Associates, 2013 Reproduction permissible provided attribution is retained 8|P a g e
Product Data: Finding It and Using It – by Deploying a Search Engine Solution
Searching Methods
Preface
This part expands on the earlier discussion of the two dimensions of
product data searching methods: Deep and Wide. Together they
comprise the power to discover all structured and unstructured data in
an enterprise.
First, some perspective about them, carried forward from Parts 1 and 2:
Structured data is usually already accessible in current systems;
however, unstructured data is the most prevalent form
Most unstructured data is not reachable by PLM systems that do not
have SES capabilities
Searching implies lost data; though it is not really lost – just
undiscovered
Data that can’t be discovered has no value
Deep Search
The goal is to drive past metadata into file and document contents by
taking advantage of “full text” capability.
“The search engine examines all words/objects in every stored
document as it tries to match the search criteria entered by the
user. This distinguishes it from searches based on metadata or
parts of the original texts represented in databases, such as titles or
selected sections.” (Source: Wikipedia).
In other words, full text does not rely on the enterprise’s metadata being
consistent. A real “gotcha” with a relational database query is that it
does rely on consistent metadata – a questionable assumption. Users,
therefore, need the full text capabilities of an SES to access all relevant
product data crucial for decision-making purposes. And, the depth of the
search may extend even into notes on CAD drawings.
Wide Search
Its purpose is to unify access to all forms of product data stored
throughout the enterprise in multiple repositories by employing an array
of appropriate “connectors.” These individual software modules extract
metadata, data relationships and file contents. They allow users to
discover product data previously hidden in isolated silos.
© Bourke Consulting Associates, 2013 Reproduction permissible provided attribution is retained 9|P a g e
Product Data: Finding It and Using It – by Deploying a Search Engine Solution
The ability of an SES to search widely across the enterprises many files
and systems is dependent on the availability of connectors. Some SES
vendors offer many connectors; however, an SES suitable for
engineering and manufacturing must support collaborative product
development (CPD).
Thus, in addition to the “usual suspects,” e.g., MS Word files, and other
unstructured documents, the ability to open CAD files is mandatory.
Cracking these files will expose textual data and relationships among
files as they reference one another. In the future, this capability probably
will be extended to include cracking rich media files, such as video.
Sounds simple enough to add more data sources. But, determining the
inclusions and exclusions is not a matter to be taken lightly. There will
be trade-offs. Some exclusion decisions should be relatively simple. For
instance, security guidelines must be maintained for personnel records
and sensitive Intellectual Property information.
Preface
In Part 5, brief descriptions of the two dimensions of searching methods
were presented: Deep (full text) and Wide (unified access). The purpose
of this Part 6 is to describe the general searching process in two
phases: Initial and subsequent. For readers who have done on-line
shopping, these processes will look familiar. With SESs, however, the
navigational aids are more extensive.
When a user enters a search criteria – a word or phrase – in the UI, the
SES presents an initial results list to the user for further analysis and
action. To facilitate this process, an SES may use powerful techniques
for normalizing and classifying data by use of thesauri – think synonyms.
The means to relate similar terms include 1) Taxonomy (hierarchical
relationships) and/or 2) Ontology (a network of relationships)
methodologies.
Relevant
Relevant – the desired search end is an “a ha” moment. Two factors
come into play: 1) SES algorithms and 2) user actions. In the world of
semantic technology, the end is still the same; the means are based on
various algorithms.
Thus, the second factor is the ability of a user to determine the correct
choices for further discovery and ultimately, decision-making.
From a user’s standpoint, the SES will have done its job when relevant
data is presented. To keep the power of an SES in perspective, though,
bear in mind that algorithms can’t judge quality and any duplications
aren’t removed, just identified. Well-informed users are crucial to getting
results with an SES.
Preface
The stated intent of this Search Engine Solutions series was to
introduce the basic concepts of Search-Based Applications for
engineering and manufacturing enterprises.
Big Data
I purposely avoided this subject, as there is an overwhelming amount of
information now available. Besides, most current applications of Big Data
seem focused on customer-facing activities, such as sentiment analysis.
Growing Awareness
Interest in applying to product development is on the rise according to a
recent report: “Product Lifecycle Management and the Data Deluge:
Transforming Data to Enhance Performance,” CIMdata, November 2012.
**********************************************************************************