Professional Documents
Culture Documents
ii.
iii.
Low
Source of information
structural information
Middle Management
Operational
Management
Internal
High
External
Low
Top
Source of
Information
Structured
Information
Middle
Management
Internal
Operational
Management
High
Phases of
decision making
process
Intelligence
Design
Choice
Explanation
implement.
Flow of activity from intelligence to design to choice, but at any
phase there may be a return to the previous phase.
3.
4.
EIS Components
The components of an EIS can typically be classified into the following
categories:
Hardware
When talking about hardware for an EIS environment, we should focus on
the hardware that meet executives needs. The executive must be put the first
and the executives needs must be defined before the hardware can be
selected. The basic computer hardware needed for a typical EIS includes
four components: (1) Input data-entry devices. These devices allow the
executive to enter, verify, and update data immediately; (2) The central
processing unit (CPU), which is the kernel because it controls the other
computer system components; (3) Data storage files. The executive can use
this part to save useful business information, and this part also help the
executive to search historical business information easily; (4) Output
devices, which provide a visual or permanent record for the executive to
save or read. This device refers to the visual output device or printer. In
addition, with the advent of local area networks (LAN), several EIS products
for networked workstations became available. These systems require less
support and less expensive computer hardware. They also increase access of
the EIS information to many more users within a company.
Software
Choosing the appropriate software is vital to design an effective EIS.
Therefore, the software components and how they integrate the data into one
system are very important. The basic software needed for a typical EIS
includes four components: (1) Text base software. The most common form
of text is probably the word processing document; (2) Database.
Heterogeneous databases residing on a range of vendor-specific and open
computer platforms helps executives access to both company internal and
external data; (3) Graphic base. Graphics can turn volumes of text and
statistics into visual information for executives. Typical graphic types are:
time series charts, scatter diagrams, maps, motion graphics, sequence charts,
and comparison-oriented graphs (i.e., bar charts); (4) Model base. The EIS
models contain routine and special statistical, financial, and other
quantitative analysis. Now perhaps the more difficult problem to those
executives is how to choose EIS software rather than how to use them,
because the latest EIS software packages are more intelligible to
manage a sufficient flow to maintain current operations but also can figure
out how to expand operations that are contemplated over the coming years.
Also, the combination of EIS and EDI environment can help cash managers
to review the companys financial structure so that the best method of
financing for an accepted capital project can be concluded. In addition, the
EIS is a good tool to help the executive to review financial ratios, highlight
financial trends and analyze a companys performance and its competitors.
Following are some real-world EIS applications related to finance.
Advantages and Disadvantages of EIS
Advantages
Easy for upper-level executives to use, extensive computer experience is not
required in operations
Provides timely delivery of company summary information
Information that is provided is better understanding
Filters data for management
Improves to tracking information
Disadvantages
Functions are limited, can not perform complex calculations
Hard to quantify benefits and to justify implementation of an EIS
Executives may encounter overloaded information
System may become slow, large, and hard to manage
Difficult to keep current data
May lead to less reliable and secure data
Small companies may encounter excessive costs for implementation
Future Trends in EIS
The future of executive info systems will not be bound by mainframe
computer systems. This trend allows executives escaping from learning
different computer operating systems and substantially decreases the
implementation costs for companies. Because utilizing existing software
applications lies in this trend, executives will also eliminate the need to learn
a new or special language for the EIS package. Future executive information
systems will not only provide a system that supports senior executives, but
also contain the information needs for middle managers. The future
executive information systems will become diverse because of integrating
potential new applications and technology into the systems, such as
The concept of correctness is that the decision maker tends to use only
information that is readily available and only in the form in which it is
displayed. There will be a tendency not search for a data stored in a memory
or to transfer or manipulate the data that is present. Explicit, available
information thus has an advantage over data that must be obtained or
manipulated before use. In particular, individuals relies on concrete
observable characteristics of evidence and neglect other information related
to the process or context of evidence, leading to possible errors in judgment.
The idea of anchoring and adjustment is that humans tends to make
adjustments establishing an anchor point and making adjustments from this
point. The anchoring and adjustment behavior reduces information
processing requirements. It is a common phenomenon in budgeting,
planning and pricing. The anchoring and the adjustment process can have
negative effect on judgments in two ways. First, individual may use
inappropriate criteria for choosing an anchor. One common criteria is past
experience and due to phenomenon of give response interaction already
described, they may tend to use a scale or a unit of measure which is the
same as the value to be judge. Second, when a value is compared to the
anchor, the adjustment process tends to undervalue the importance of the
new evidence. Since it is only considered relative to some what arbitrary
anchor point.
Que: What is decision making? Explain the Simons model for decision
making. (Pravin Yadav)
Ans: The systems approach is ideal to study the decision making. It is
important to study how decision are made, what inputs are necessary for
decision making and what impedes the decision making process. Too much
information can be worse than too little information. Swamping the manager
with too much information may force the manager to rush into the decision
making process or may force him to ignore certain vital bits of information
while making the decision . Therefore, it is very important to have a
Choice
The model consists of three major phases, Intelligence, Design and Choice.
Let us understand these terms in detail:
1 . Intelligence: Intelligence in this context does not refer to native
intelligence(or the brains one is born with), but with the process of gathering
information. It involves an awareness of the environment, an active attempt
to gather information from the environment . This phase may be a
continuous, ongoing phases or an intermittent effort, depending upon the
requirements of the decision to be made.
For example, a marketing executive may make periodic visits to key
customers to review possible problems and identify new customer needs. Or,
a driver on the road continuously scans the traffic in front, at the sides and
behind him to see if there is any need for corrective action while driving.
Intelligence involves Problem Finding and Problem Formulation. Problem
finding means finding a difference between the existing state and a desired
state. For example, f the desired state is customer satisfaction and the
existing state is mild customer dissatisfaction, then there exists a problem
Problem Formulation. On the other hand, is to identify and clarify the exact
problem. In our previous example, the fact that a problem exists means
little. There is need to identify why there is gap between th e current state
and customer satisfaction. In other words, some complexity has to be
reduced and a manageable problem formulated. As Bring. N.B Grant,
Management Expert, often commented, Gentlemen, define the problem
well and solution will suggest itself.
(A) LAW OF REQUISITE VARIETY :The law of requisite variety means that for a system to be controlled,
every controller(human and machine) must be provided with:(i)
Enough control responses (what to do in each case) to cover all
possible conditions the system may face.
(ii)
The decision rules for generating all possible control responses
OR
(iii)
The authority to become a self organizing system in order to
generate control responses.
Enumerating all responses is possible in simple cases. In complex
systems providing control responses is very difficult.
(i) INTELLIGENCE
(ii)DESIGN
(iii)CHOICE
INTELLIGENCE
DESIGN
CHOICE
3.
Choice: Once the manager has enough ideas or alternatives to work
with, he can apply a rational process to choose the most viable alternative to
work with. He thus makes a choice form the alternatives available to select
the path that would most likely solve the problem at hand.
(D) System approach to MIS:MIS is, by definition, a system. How, it helps if we study mis in the
light of sub system theory and understand exactly how mis behaves as a
system. This will also help in the design of mis system.
(i) Information system as a system the information system it self can be
understood as a system with in the environment of the org. , data forms
the input which is processed by the information system into information
which Forms the output.
data
process
informati
on
The above diagram, however, makes the assumption that all data flows in at
the same time. This is rarely true. Data flows into the system in bits and
pieces. Often, the system has to wait for related items of data to flow in
before processing can begin. Therefore, the basic model has to be modified
to include a Data Storage element. The addition of this element means that
the Information System not only processes data into information, but also
stores data for future use. Below figure makes this clear :-
Data
Storage
Input
Data
Output
Process
Info
(E) Life cycle Approach :While several organizations are moving to prototyping in developing
Decision Support Systems, certain large projects still demand the System
Development Life Cycle (SDLC) approach. These projects have many users
and require the expertise of several different people. The prototyping
approach does not work in such a case because it is difficult to have the
prototype development by a team of experts and tested by several different
users.
The Life Cycle approach offers a structured, well-defined
methodology to design, test and implement a new system. It structures the
creative process and avoids wastage of time due to excessive testing and
reworking. Control procedures can be laid down for each step of the cycle
and there is a general agreement on the inputs, outputs and processing
methodology.
In general, the SDLC has the following phases :1.
2.
3.
4.
Definition.
Development.
Installation.
Operation.
Definition refers to the phase where the information needs are defined.
Feasibility studies and cost-effectiveness studies are also undertaken at this
stage. These requirements are then translated into a physical system
consisting of input forms, procedures, programs, output reports, etc. this is
the development stage and it consists of using system design, computer
programming and procedure development to construct a new system. Once
constructed and tested, the third phase begins, where the new system is
installed and operators/users are trained to use the new system. Once
training is over, the fourth phase, operation, begins. Operation also consists
of maintenance procedures, where changes may be made to the system based
on new requirements or where existing parts of the system are not as
efficient as they.
ANURAG DADHEECH
SHORT NOTES :(A) Phase in the decision making:The new science of mgt. decision ,postulates a model for decision
making. In this model for decision-making. In his model, decisions-making
follows three distinct phases, Intelligence, Design and Choice. Let us
understand these terms in detail:
1.
Intelligence in this context does not refer to native intelligence (or the
brains one in born with), but with the process of gathering information. It
involves an awareness of the environment, an active attempt to gather
information from the environment. This phase may be a continuous, ongoing
phase or an intermittent effort depending upon the requirements of the
decisions to be made.
2.
Design indicates the generation of alternatives to solve the problem
formulated. This is a creative process. For a long time, it was thought that
creativity is born. However, Management Experts have proved that
creativity can indeed be taught and nurtured. There are several techniques
that can be used for Ideation, including Free Thinking, Analogy,
Brainstorming, Checklists, etc.
3.
Choice: Once the manager has enough ideas or alternatives to work
with, he can apply a rational process to choose the most viable alternative to
work with. He thus makes a choice form the alternatives available to select
the path that would most likely solve the problem at hand.
Top level
Middle
level
Lower
level
(i) Action v/s non action:The information which include action , is called an action the
information which communicate only the status of a situation is non action
info.
(2)Recurring v/s non recurring information:The information generate at regular intervals is a recurring info the
monthly sales report ,the stock statements , materials balance etc are
recurring info the financial analysis or the report on the marketing research
study is non recurring information.
(3)Internal v/s non internal information:The information generated to the internal sources of the org is as
internal information , while the information generated through govt. reports
the industry surveys ,etc is termed as external information .
The bottom line is that the information systems use all of the above to
implement, control, and monitor plans, strategies, tactics, new products, new
business models or new business ventures.
In the expert system approach all of the problem related expertise is encoded
in data structures only; none is in programs.
Several benefits immediately follow from this organization.
An example may help contrast the traditional problem solving program with
the expert system approach.
The example is the problem of tax advice.
In the traditional approach data structures describe the taxpayer and tax
tables, and a program in which there are statements representing an expert
tax consultant's knowledge, such as statements which relate information
about the taxpayer to tax table choices.
It is this representation of the tax expert's knowledge that is difficult for the
tax expert to understand or modify.
In the expert system approach, the information about taxpayers and tax
computations is again found in data structures, but now the knowledge
describing the relationships between them is encoded in data structures as
well.
The programs of an expert system are independent of the problem domain
(taxes) and serve to process the data structures without regard to the nature
of the problem area they describe.
For example, there are programs to acquire the described data values
through user interaction, programs to represent and process special
organizations of description, and programs to process the declarations that
represent semantic relationships within the problem domain and an
algorithm to control the processing sequence and focus.
The general architecture of an expert system involves two principal
components: a problem dependent set of data declarations called the
knowledge base or rule base, and a problem independent (although highly
data structure dependent) program which is called the inference engine
4.Individuals involved with expert systems:
There are generally three individuals having an interaction with expert
systems.
Primary among these is the end-user; the individual who uses the system for
its problem solving assistance.
In the building and maintenance of the system there are two other roles: the
problem domain expert who builds and supplies the knowledge base
providing the domain expertise, and a knowledge engineer who assists the
experts in determining the representation of their knowledge, enters this
if the diner is a wine drinker, and the preferred wine is French, then there is
strong evidence that the restaurant choice should include French.
It is very difficult to implement a general explanation system (answering
questions like Why and How) in traditional systems. The response of the
expert system to the question WHY is an exposure of the underlying
knowledge structure. It is a rule; a set of antecedent conditions which, if
true, allow the assertion of a consequent. The rule references values, and
tests them against various constraints or asserts constraints onto them. This,
in fact, is a significant part of the knowledge structure. There are values,
which may be associated with some organizing entity. For example, the
individual diner is an entity with various attributes (values) including
whether they drink wine and the kind of wine. There are also rules, which
associate the currently known values of some attributes with assertions that
can be made about other attributes. It is the orderly processing of these rules
that dictates the dialog itself.
@ The knowledge engineer
Knowledge engineers are concerned with the representation chosen for the
expert's knowledge declarations and with the inference engine used to
process that knowledge. He / she can use the knowledge acquisition
component of the expert system to input the several characteristics known to
be appropriate to a good inference technique including:
1. A good inference technique is independent of the problem domain.
In order to realize the benefits of explanation, knowledge transparency, and
reusability of the programs in a new problem domain, the inference engine
must not contain domain specific expertise.
2. Inference techniques may be specific to a particular task, such as
diagnosis of hardware configuration. Other techniques may be committed
only to a particular processing technique.
3. Inference techniques are always specific to the knowledge structures.
4. Successful examples of rule processing techniques include:
(a) Forward chaining
(b) Backward chaining
5.The inference rule:
2. Expert systems can process multiple values for any problem parameter.
This permits more than one line of reasoning to be pursued and the results of
incomplete (not fully determined) reasoning to be presented.
3. Problem solving is accomplished by applying specific knowledge rather
than specific technique. This is a key idea in expert systems technology. It
reflects the belief that human experts do not process their knowledge
differently from others, but they do possess different knowledge. With this
philosophy, when one finds that their expert system does not produce the
desired results, work begins to expand the knowledge base, not to reprogram the procedures.
There are various expert systems in which a "rulebase" and an "inference
engine" cooperate to simulate the reasoning process that a human expert
pursues in analyzing a problem and arriving at a conclusion. In these
systems, in order to simulate the human reasoning process, a vast amount of
knowledge needed to be stored in the knowledge base. Generally, the
knowledge base of such an expert system consisted of a relatively large
number of "if then" type of statements that were interrelated in a manner
that, in theory at least, resembled the sequence of mental steps that were
involved in the human reasoning process.
Because of the need for large storage capacities and related programs to
store the rulebase, most expert systems have, in the past, been run only on
large information handling systems. Recently, the storage capacity of
personal computers has increased to a point where it is becoming possible to
consider running some types of simple expert systems on personal
computers.
In some applications of expert systems, the nature of the application and the
amount of stored information necessary to simulate the human reasoning
process for that application is just too vast to store in the active memory of a
computer. In other applications of expert systems, the nature of the
application is such that not all of the information is always needed in the
reasoning process. An example of this latter type application would be the
use of an expert system to diagnose a data processing system comprising
many separate components, some of which are optional. When that type of
expert system employs a single integrated rulebase to diagnose the minimum
system configuration of the data processing system, much of the rulebase is
not required since many of the components which are optional units of the
system will not be present in the system. Nevertheless, earlier expert
systems require the entire rulebase to be stored since all the rules were, in
effect, chained or linked together by the structure of the rulebase.
When the rulebase is segmented, preferably into contextual segments or
units, it is then possible to eliminate portions of the Rulebase containing data
or knowledge that is not needed in a particular application. The segmenting
of the rulebase also allows the expert system to be run with systems or on
systems having much smaller memory capacities than was possible with
earlier arrangements since each segment of the rulebase can be paged into
and out of the system as needed. The segmenting of the rulebase into
contextual segments requires that the expert system manage various
intersegment relationships as segments are paged into and out of memory
during execution of the program. Since the system permits a rulebase
segment to be called and executed at any time during the processing of the
first rulebase, provision must be made to store the data that has been
accumulated up to that point so that at some time later in the process, when
the system returns to the first segment, it can proceed from the last point or
rule node that was processed. Also, provision must be made so that data that
has been collected by the system up to that point can be passed to the second
segment of the rulebase after it has been paged into the system and data
collected during the processing of the second segment can be passed to the
first segment when the system returns to complete processing that segment.
The user interface and the procedure interface are two important functions in
the information collection process.
@The user interface:
The function of the user interface is to present questions and information to
the operator and supply the operator's responses to the inference engine.
Any values entered by the user must be received and interpreted by the user
interface. Some responses are restricted to a set of possible legal answers,
others are not. The user interface checks all responses to insure that they are
of the correct data type. Any responses that are restricted to a legal set of
answers are compared against these legal answers. Whenever the user enters
an illegal answer, the user interface informs the user that his answer was
invalid and prompts him to correct it. As explained in the cross referenced
application, communication between the user interface and the inference
engine is performed through the use of a User Interface Control Block
(UICB) which is passed between the two.
error in either the hardware or software. In general, asking the operator for
information is the most straightforward way for the expert system to gather
information assuming, of course, that the information is or should be within
the operator's understanding. For example, in diagnosing a personal
computer, the expert system must know the major functional components of
the system. It could ask the operator, for instance, if the display is a
monochrome or color display. The operator should, in all probability, be able
to provide the correct answer 100% of the time. The expert system could, on
the other hand, cause a test unit to be run to determine the type of display.
The accuracy of the data collected by either approach in this instance
probably would not be that different so the knowledge engineer could
employ either approach without affecting the accuracy of the diagnosis.
However, in many instances, because of the nature of the information being
solicited, it is better to obtain the information from the system rather than
asking the operator, because the accuracy of the data supplied by the
operator is so low that the system could not effectively process it to a
meaningful conclusion.
In many situations the information is already in the system, in a form of
which permits the correct answer to a question to be obtained through a
process of inductive or deductive reasoning. The data previously collected
by the system could be answers provided by the user to less complex
questions that were asked for a different reason or results returned from test
units that were previously run.
7.Advantages and disadvantages:
Expert systems exercise information technology to acquire and utilize
human expertise. It can be beneficial for organizations that have clear
objectives, rules and procedures. Expert systems can:
Provide consistent answers for repetitive decisions, processes and tasks
Hold and maintain significant levels of information
Reduce employee training costs
Centralize the decision making process
Create efficiencies and reduce time needed to solve problems
Combine multiple human expert intelligences
Reduce the amount of human errors
Give strategic and comparative advantages creating entry barriers to
competitors
Review transactions that human experts may overlook
Description
Software
Database
The database contains all data utilized by application
software. An individual set of stored data is often referred to
as a file. The physical existence of stored data is evidenced
by the physical storage media (computer tapes, hard-disk
drives etc.) used for secondary storage.
Procedures
Operations
personnel
1. Marketing
2. Manufacturing
3. Logistics
4. Personnel
6. Information Processing
7. Top Management
Description
The economical, social , and legal
environment in which the
organization operates
The characteristics and behavior of
the marketplace in which the
organization operates.
The capabilities, strengths,
weaknesses, constraints, and other
USERS VIEW
PROGRAMMERS VIEW
DESIGNERS VIEW
MIS
SQL
4GL PROGRAMMES
DSS
SQL
4GL PROGRAMMES
DSS
Model Building:-
number
of
cases,
user
may
not
be
able
to
formulate
another
approach
to
information
requirements
e a s i l y, b a s e d o n u s e r o r c h a n g e s t o t h e s y s t e m . T h i s i s c a l l e d
" h e u r i s t i c d e v e l o p m e n t " i t i s s i m p l y a n o t h e r t e r m f o r p r o t o t yp i n g .
Although heuristics are employed primarily
for solving ill-structured problems, they can also be used to provide
satisfactory solutions to certain complex, well-structured problems
much more quickly and cheaply than optimization organization. The
main difficulty in using heuristics is that they are not as general as
algorithms. Therefore, they can normally be used only for the
s p e c i f i c s o l u t i o n f o r w h i c h t h e y w h e r e i n t e n d e d . An o t h e r p r o b l e m
with heuristics is that they may produce a poor solution.
Heuristic programming is the approach of using heuristics
to arrive at feasible and good enough solutions to some complex
problems.
Good enough is usually in the range of 90-99.9% of the objective
value of an optimal solution.
Heuristics thinking does not necessarily proceed in direct
m a n n e r. I t i n v o l v e s s e a r c h i n g , l e a r n i n g , e v a l u a t i n g , a n d t h e n r e searching, relearning, and reappraising as exploring and probing
take place. The knowledge gained from success or failure at some
point is fed back to and modifies the search process.
Limitation of Heuristics
I. An optional solution cannot be guaranteed.
Heuristic Approach
Analytical Approach
Dimension
1. Approach to
It employs a planned
sequential approach to
learning
spontaneous action
It uses formal rational
analysis and no
spontaneous actions takes
place
3. Approach to
analysis
Analysis
problem situation.
It reduces the problem to
smaller tasks.
GENERAL APPROACH
OR APPROACH
Problem
Solving
Observation
Statement of a problem
Statement of a
Collection of data
Development of
hypotheses for solution of
the problem.
Evaluation of alternative
hypothesis
problem.
Collection of data
Development and
testing of a model
representing the
problem solution.
Manipulation of the
model to determine the
outcomes of various
Decision
making and
action
input conditions
Selection of best
course of action.
Implementation of
best alternative.
Implementation of the
solution.
Review of results.
Control of the model
by maintaining a
check on its validity as
time goes by.
Answer.
Expert systems attempt to capture the knowledge of human expert and make
it
available through a computer system.
An expert system is:
One which addresses problems normally thought to require human
expertise for their solution.
1) Expert systems are expected to achieve significant actual performance
in a specialized area that normally requires a human expert for
successful performance, e.g. medicine, geology, investment,
counseling, etc
2) Expert systems have been some of the most successful applications
A.I. Since these programs must perform in real world, they encounter
important issues for A.I.:
i)
Lack of sufficient input information.
ii)
Probrabilistic reasonsing.
Q. Differentiate between DSS and MIS?
Answer.
A DSS is a computer based system, using hardware and software. It is used
by decision makers for managerial decision making. It helps to arrange
information for decision making.
DSS is part of MIS. Thus, it forms a subset of MIS, with MIS providing the
general enviroment within which the DSS operates. The spectic difference
between MIS and DSS are as follows:
1. MIS support decision making in both structured and unstructed
problem enviroments. DSS, on the other hand, only works with
structured problems.
2. MIS supports decision-making at all levels of the organisation. DSS is
usually used at the executive level and above to provide a basis for
decision-making.
3. MIS is intended to be integrated into the organization, not standing
alone. A DSS system may even be apart from the general MIS, created
by or for a specific executive.
4. MIS supports all aspects of the decision making process. DSS only
tackles the routine and repetitive decisions and provides data nonrepetive decisions.
5. MIS is composed of people, computer, procedures, database,
interactive query use. While DSS also also has these elements, it does
not face the constraint of having to be made easy to use.
NOTICE
Following is the list of the students along with the questions allotted for
Management Support System Question Bank. The students are
supposed
to
type
the
answers
and
mail
it
to
aquanaveen@rediffmail.com latest by Saturday, i.e, 12th
August, 2006.
Naveen Jain
Vikas
Abhishek
Rastogi
Anurag
Dhadich
Pravin
Gawde
Pravin Yadav
Bhushan
Belsare
Shailesh
Yalkar
Vivek
i)
Open and closed System
ii)
Deterministic and probabilistic system.
What are the functions of MIS.
Instructions:
Font Face: Times New Roman
Font Size = 14
Questions should be bold and italicized.
Underline wherever required.
Put diagrams where needed.
Col V.V.Jadav
Extra:
A White Paper on :
Text Mining and the Knowledge Management Space
Factors Driving Knowledge Management
The biggest business story of the 90's is the tremendous flow, or glut, of
information we face every day. The fundamental nature of information has
changed in terms of volume, availability and importance.
With the Internet, Intranets, email and GroupWare systems more data is
available to the knowledge worker than ever before. Customer comments
and communications, trade publications, internal research reports and
competitor web sites are just a few examples of available electronic data.
Intellectual property and assets are contained within the volumes of
information. Leveraging this value is increasingly important in the
competitive market. Even if you do not make the most of your information,
you can rest assured your competitor will.
Making sense of all this information has become a difficult proposition. What
information is needed and what can be done with it?
Information Publishers need to differentiate their information, enhance its
value and increase revenue. Information Managers need to lower the cost for
tasks that require document analysis, to provide better service to their
customers and improve the quality of management information systems.
Information Users need to have direct access to relevant information, to gain
rapid awareness of content, and to discover new ideas and relationships.
These needs have given rise to a rapidly growing class of software products:
enterprise knowledge management products. Recognizing the enormous
need to manage and control great quantities of textual and other information
that drive businesses, numerous vendors have entered the knowledge
management market with a wide variety of products.
One of the challenges with emerging, dynamic markets is that they are
inherently confusing. Many vendors are eager to be in this market; all
offering a wide range of products covering different parts of the overall
problem. The already overburdened knowledge worker, looking into
knowledge management products, must sort through a range of document
management systems, text scanning products, collaborative workgroup
Knowledge Repository
Until recently, the only technique available for knowledge discovery was data
mining. Data mining works with structured data, often numerical in nature,
stored in a cleansed, static database (the data warehouse or data mart). This
required significant pre-processing and organization efforts.
Text mining is analogous to data mining in that it uncovers relationships in
information. Unlike data mining, text mining works with information stored in
an unstructured collection of text documents.
A specific example of a text mining technique is a discovery engine. Unlike a
search engine, a discovery engine does not require you to know what you
have in order to find valuable information.
Search vs. Discovery
How do search and discovery engines compare? As shown in the diagram
below, both a search engine and a discovery engine deal with words and
documents.
A search engine provides the sole function of locating documents. The end
user must provide key word(s) to begin the search process.
While a discovery engine does provide a means of locating documents, that is
not its primary value. A discovery engine automatically extracts valuable and
relevant textual data and then provides a graphical, dynamic and navigable
index. The visual presentation of the concepts enables rapid understanding of
the underlying content and structure of textual data, leading to the improved
productivity of the knowledge worker.
Search engines and discovery engines are complementary solutions.
Knowledge Management Market Overview
The following diagram summarizes the different categories of the Knowledge
Management market and gives examples of products in each category. Each
of the categories provides functionality that is essential for a comprehensive
Knowledge Management initiative.
Why Use Text Mining?
Text mining is best suited for "discovery" purposes; learning and discovering
information that was previously unknown. Example reasons for using text
mining include:
While a valuable tool, text mining is not suited to all purposes. Just as you
would not use data mining technology to do a simple query of your database,
text mining is not the most efficient way to isolate a single fact.
Text mining is not an end in itself; it is a support tool. A text mining product
supports and enhances the knowledge worker's creativity and innovation with
open-ended exploration and discovery.
The individual applies intelligence and creativity to bring meaning and
relevance to information, turning information into knowledge. Text mining
advances this process, empowering the knowledge worker to explore and
gain knowledge from the knowledge base.
Text mining is particularly relevant today because of the enormous amount of
knowledge, either within an organization or outside of it, that resides in text
documents. The advent of online publishing has enormously increased the
amount of textual information. Our most frequent and most comfortable form