Professional Documents
Culture Documents
net/publication/224129737
CITATIONS READS
9 3,740
6 authors, including:
Some of the authors of this publication are also working on these related projects:
Employing Takaful Islamic Banking through State of the Art Blockchain: A Case Study of Saudi Arabia View project
All content following this page was uploaded by Mohammad Abdeen on 06 February 2017.
Abstract—A mind map is a diagram used to represent words, central keyword. As an example, 0 depicts a mind map of
ideas, or other items linked to and arranged around a central Google tools.
keyword or idea. Mind maps are used to generate, visualize,
structure, and classify ideas, and as an aid in organization, study,
project management, problem solving, decision making, and
writing. It has been long used in brainstorming and as an
effective educational tool.
The parser returns all possible parse trees for each sentence in
the text according to the English grammar rules in effect. A
filtering process takes place in which the grammatically correct
parse trees are chosen for each sentence. The Syntax Analyzer
is the module which produces the final correct parse trees of
the input text. Further filtering is done based on a score given
to each parse tree based on its internal structure. The Semantic
Analyzer receives correct parse trees of the text, selects correct
meaning for each word and produces a new Text Meaning
Representation (TMR) or uses it to update an existing one. The Figure 3 The class diagram for the Mind Map conversion module
Semantic Analyzer consists of three sub-modules: the
Discourse Analyzer, the Word Sense Disambiguation and the
Text Meaning Representation. The Discourse Analyzer is
concerned with assigning each pronoun to the noun which this
pronoun refers to. The Word Sense Disambiguation is
concerned with assigning the most proper sense for each word
according to the formulation of the sentence. For example, the
word "ball" has several senses, including a round object used in
games, a formal dance, and a pitch in baseball that is not a
strike.
96
III. IMPLEMENTATION
As mentioned previously, the tool consists of five modules.
In this section we present the design, implementation notes and
testing of the modules. For the morphological analysis module,
we used the Microsoft WordNet 2.1 analyzer [6]. Generating a
mind map requires the knowledge of the verb tense (for verbs)
while for adjectives it is needed to know if it is comparative or
superlative. The following modifications had to be introduced
to the WordNet so that it can be used in mind map generation:
1. For regular verbs and adjectives, the rules return the
removed suffix to get the morph as additional output.
The verb tense is determined accordingly (past for
“ed” and present for “s”).
2. For irregular verbs and adjectives the reverse of their
exceptional lists were built. That list will later be
searched to determine a verb tense.
In the remainder of this section we show in details, and as
an illustration, the design of the syntactic and the semantic
analyzer modules. Figure 5 A detailed class diagram for the Parser class
97
B. The Semantic analyzer module
Semantic analysis is the process of relating syntactic
structures, from the levels of phrases, clauses, sentences and
paragraphs to the level of the writing as a whole, to their
language-independent meanings, removing features specific to
particular linguistic and cultural contexts. It has three functions
and each is performed by a separate module.
1) Word Sense Disambiguation (WSD):
This module is concerned with assigning the most proper
sense (meaning) for each word according to the structure of the
sentence.
2) Discourse analysis
This module is basically concerned with assigning each
pronoun to the noun which this pronoun refers to. It includes
the implementation of the RAP algorithm [12] that is Figure 7 The output of the discourse analysis module of the tool
concerned with assigning an antecedent noun to each pronoun.
It depends on:
3) Meaning Representation class
• Selecting the nouns in the context.
The verbs, adverbs, nouns and adjectives in each sentence are
• Eliminating the nouns which do not agree with the extracted along with any relations (case roles) between words.
pronoun in the person, number and gender. This sub-module is totally dependent on the correctly chosen
parse trees from the Syntax Analyzer. A Case Role is an
• Adding salience values to each noun according to certain
argument or typical role that a predicate can take. It appears as
criteria.
a property of an event (verb) in a TMR. For example, in the
• Selecting the noun with the maximum salience as the sentence "Shakespeare was born in 1564," "born" is the event
anaphora (antecedent) of the pronoun. (verb), the verb is passive therefore "Shakespeare" is the
theme case role (the party that was affected by the verb),
We have tested this module with the following text excerpt:
"1564" is the time case role of the event. There are several
“Shakespeare is a great writer in the history of literature. other case roles including: Agent (The party performing the
He was born in 1564 and he had 3 children. He was earning action), Source, Destination, Location, etc.
his living from buying and selling the agricultural products. He
lived in Stratford. He died in 1616” The following Figure is the Text Meaning Representation for
The output is as follows: the Shakespeare excerpt that was used above.
IV. RESULTS
To demonstrate the tool’s capabilities we have run a simple
test on a text defining Shakespeare, the great writer. The text is
the one listed in the previous section.
98
The following Figure is the final mind map. it is presented as a separate mind map with that certain node as
the central node or main topic. We also have current work in
this area.
In addition to that, we are working on improving the image
production module to be able to synthesize images form a very
small and manageable database other than using Google image
search facility. This has the advantage of allowing the tool to
work offline and to eliminate any network delays during the
search process.
ACKNOWLEDGMENT
The authors would like to thank Mr. M. Hamdy for his help
and support during the course of the work.
REFERENCES
Figure 9 Final mind map of the Shakespeare text [1] “The Mind Map Book: How to Use Radiant Thinking to Maximize Your
Brain's Untapped Potential” Tony and Barry Buzan, E P Dutton 1994
It is to be noted that some of the images assigned to the [2] David A. Watkins & John B. Biggs. “Teaching the Chinese learner:
Psychological and pedagogical perspectives” Hong Kong : Comparative
nodes are not accurately representing them. This is essentially Education Research Centre 2001.
due to the way Google image search is tuned to perform image [3] http://www.wisemapping.com
retrieval. There are two approaches to solving this problem. [4] http://www.imindmap.com/
The first is to choose tighter search criteria for Google that [5] “An API for Google Image Search”, http://www.codeproject.com/
eliminates any ambiguity in the search. The second is to create KB/IP/google_image_search_api.aspx
a complete image database. The latter option is what we [6] George A. Miller, Christiane Fellbaum ,WordNet 2.1 Documentation,
adopted in the improvements we are currently performing on Cognitive Science Laboratory Princeton University
this tool. [7] Bell C., Bowden M., and Trott A. “Implementing flexible learning.
London” Kogan page Limited, 1997.
V. CONCLUSIONS AND FUTURE WORK [8] "Lexical Semantics & Word Sense Disambiguation", CMSC 35100,
May 15, 2003
In this paper, we presented M2Gen, a software tool that [9] Diana McCarthy, Rob Koeling, Julie Weeds, John Carroll, "Using
automatically generates mind maps from input text. To the best Automatically Acquired Predominant Senses for Word Sense
of our knowledge, this is the first time this tool is presented. Disambiguation", Department of Informatics, University of Sussex, 2003
Due to the size of the English language and the variety of rules [10] Mikyoeng Kim, "A Study on the Algorithms for Dialogue
within it, the current version of the tool covers a relatively Segmentation", School of Knowledge Science, Japan Advanced Institute
of Science and Technology (JAIST), March 2000.
small subset of the English language. More work is currently
being done to expand the scope of the tool to include a larger [11] H. Stahl, J. Muller, M. Lang, “An efficient top-down parsing algorithm
for understanding speech by using stochastic syntactic and semantic
subset of the language. On the other hand, the current version models”, in the proceedings of IEEE International Conference on
only supports single layer mind map and does not produce a Acoustics, Speech, and Signal Processing 1996.
hierarchical (multileveled) map (i.e., maps within maps). This [12] Long Qiu, Min-Yen Kan, Tat-Seng Chua, "A Public Reference
is because not all the information extracted from the text is Implementation of the RAP Anaphora Resolution Algorithm", In
displayed in one mind map, but only the most important nouns Proceedings of Language Resources and Evaluation Conference (LREC
2004), Lisbon, Portugal, 2004.
and verbs are. This means that if further information needs to
be obtained about a certain node, if there is more information,
99
View publication stats