Welcome to Scribd, the world's digital library. Read, publish, and share books and documents. See more
Download
Standard view
Full view
of .
Save to My Library
Look up keyword
Like this
2Activity
0 of .
Results for:
No results containing your search query
P. 1
An Intelligent Agent Based Text-Mining System: Presenting Concept through Design Approach

An Intelligent Agent Based Text-Mining System: Presenting Concept through Design Approach

Ratings: (0)|Views: 266 |Likes:
Published by ijcsis
Text mining is a variation on a field called data mining and refers to the process of deriving high-quality information from unstructured text. In text-mining the goal is to discover unknown information, something that may not be known by people. Now here the aim is to design an intelligent agent based text-mining system which reads on the text (input) and based on the keyword provide the matching documents (in the form of links) or options (statements) according to the user’s query. In this paper the effort is to depict design approach for intelligent agent based text mining system.
Text mining is a variation on a field called data mining and refers to the process of deriving high-quality information from unstructured text. In text-mining the goal is to discover unknown information, something that may not be known by people. Now here the aim is to design an intelligent agent based text-mining system which reads on the text (input) and based on the keyword provide the matching documents (in the form of links) or options (statements) according to the user’s query. In this paper the effort is to depict design approach for intelligent agent based text mining system.

More info:

Published by: ijcsis on May 11, 2011
Copyright:Attribution Non-commercial

Availability:

Read on Scribd mobile: iPhone, iPad and Android.
download as PDF, TXT or read online from Scribd
See more
See less

03/27/2014

pdf

text

original

 
An Intelligent Agent Based Text-MiningSystem: Presenting Concept through DesignApproach
1
Kaustubh S. Raval
2
Ranjeetsingh S. Suryawanshi
3
Professor DevendraM. Thakore
M.Tech. (Computer Engineering)M.Tech. (Computer Engineering)Department of Computer Engineeringraval_kaustubh@yahoo.co.inranjeetsuryawanshi06@gmail.com dmthakore@bvucoep.edu.in
1, 2, 3
Bharati Vidyapeeth Deemed University,College of Engineering, Pune 411043.
 Abstract
Text mining is a variation on a fieldcalled data mining and refers to the process of deriving high-quality informationfrom unstructuredtext. In text-mining the goal is to discover unknowninformation, something that may not be known bypeople.Now here the aim is to design an intelligentagentbasedtext-mining systemwhich reads on thetext (input) and based on the keyword provide thematching documents (in the form of links) or options(statements) according to the user’s query.In thispaper the effort is to depict design approach forintelligent agentbased text mining system.
 Keywords –Data Mining, Text Mining, Intelligent agent.
I.I
NTRODUCTION
First of all, we need basic information aboutvarious terms on which this work is to be carriedout.
 Data Mining:
Data mining is the analysis of (often large) observational data sets to findunsuspected relationships and to summarize thedata in novel ways that are both understandable anduseful to the data owner.It derives businessintelligence from the data warehouse by usingadvanced analytical techniques such as neuralnetwork heuristics, fuzzy logic, statistical analysisetc.
 Automated Data Mining:
Using automated dataminingwe can sweep through databases anddiscover previously unknown patterns.In theirpaper [1], Dr. V. Saravanan and J. Rajanproposedan automated data mining system which compassesfamiliar data mining algorithms. According to themthe system willautomatically select the appropriatedata mining techniqueand select the necessary fieldneeded from the databaseat the appropriate timewithout expecting the users to specify the specifictechniques and the parameters.
Text Mining:
Text-mining is a variation on afield called data-mining and refers to the process of deriving high-quality information from theunstructuredtext.‘High quality’ intext-mining
(IJCSIS) International Journal of Computer Science and Information Security,Vol. 9, No. 4, April 2011112http://sites.google.com/site/ijcsis/ISSN 1947-5500
 
usually refers to some combination of relevance,noveltyand interestingness.[3]
 Intelligent Agents:
Intelligent agents aresoftware entities that carry out some set of operations on behalf of a user with some degree of independence or autonomy, and in doing so,employ some knowledge or representation of theuser’s goals or desires.Software agents are usefulin automating repetitive tasks, finding and filteringinformation, intelligently summarizing complexdata, and so on, but more importantly, just like theirhuman counterparts, intelligent agents can havecapability to learn from the managers and evenmake recommendations to them regarding aparticular course of action.Agents haveseveralcommon characteristics, such as their ability tocommunicate, cooperate, and coordinate with otheragents in system. Each agent iscapable of actingautonomously, cooperatively, and collectively toachieve the collective goal of a system. Thecoordination capability helps manage problemsolving so that co-operating agents work together asa singleteam.[9]
 Motivation
The literature study of various research papersand my interest in the field of ‘Data Mining’motivated me to take up this as my dissertationtopic for post-graduation.Study of existing biomedical text miningsystem, named, ‘PolySearch’ also provide theinsights to overall ‘text mining system’ and thuslead me to take up ‘Intelligent Software AgentBased Text Mining’ as my dissertation topic.Working scenario of ‘Google Search Engine’also has been the motivational factor to take up thistopic as my dissertation work. ‘Google SearchEngine’ is the best example of optimized intelligentsoftware agent based text-mining systemencompassing a very large domain of web.II.S
YSTEM
D
ESIGN
System designincludes use-case diagram andsequence diagram.Use-case diagram depictshowthe user interactswith the proposed intelligentagentbased systemwhereas the sequence diagramdepicts how the flow of actions carried out bydifferent agents in the system.Fig. 1User Interacting with systemAs shown in the Fig.1userwill typethe textthen text miner agent 1, which is keyphrase-based,
(IJCSIS) International Journal of Computer Science and Information Security,Vol. 9, No. 4, April 2011113http://sites.google.com/site/ijcsis/ISSN 1947-5500
 
willdecide the keyword then intelligent agent willdecide the context for that ‘keyword’ then textminer agent 2, which is keyword based, will decidethe meaning of the keyword in particular context,find out related documents, calculate weight matrixvalue and then attach that value to the document.Then intelligentagent will rank the documentsbased on weight-matrix values.Fig. 2Sequence DiagramFig. 2shows the sequence diagram of thesystem interaction diagram between differentagents of the system.III.S
YSTEM
D
ESCRIPTION
System description is the context whichincludes the details about the overall working of theexisting or proposed system.
Why Agents?
Text mining mainly includes the field of information retrieval which means the finding of documents which contain answers to questions andnot the finding of answers itself and for this toachieve statistical measures and methods are used.By using statistical measures and methodsautomatic processing of text data and comparison togiven question is performed. But the issue here ishow to automate the processing of text data? Andthat is where ‘Agents’ come into picture.
System Architecture
Fig.5 shows the architectural diagram forintelligent agentbased text-mining system. Itincludes all the components required to make thesystem workable and the relationship andinteraction between them. There are mainly threeagents, one dataset, the user category, and onecache/log component.Working of the Intelligent Agent in two phases::
Phase 1:
Takes the input from Text Miner Agent 1 (thatis key-phrase/keyword).
Find out the contexts(documents)for key-phrase word.
Phase 2:
Takes input from Text Miner Agent 2 that islinks and their associated weight matrix values.
Compare the weight matrix values of variouslinks and decide which one is the ‘close-to-best-match’ for user’s query.
The link with the highest weight matrix valueranked first, the link with second highestweight matrix value ranked second, the link 
(IJCSIS) International Journal of Computer Science and Information Security,Vol. 9, No. 4, April 2011114http://sites.google.com/site/ijcsis/ISSN 1947-5500

Activity (2)

You've already reviewed this. Edit your review.
1 thousand reads
1 hundred reads

You're Reading a Free Preview

Download
/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->