Bayes classifier [Peng 2004]; the researchers claimthat they achieved good results.II.
MATERIALS AND METHODS
Words of working type, action oriented,different categories of prepositions, pronouns,adjectives, adverbs, conjunctions and interjections aregiven in Table 1 to Table 3. These words are used asfiltering and as templates. When an email is analyzedfor uniqueness, the extracted features are based onlist of words presented in the tables. Hence,unnecessary words are eliminated and the number of unique words that represent an email is minimum.
TABLE 1 SAMPLE WORDS USED FOR FILTERING
Work (70)Action(524)Preposition _1 (94)Preposition_2(30)
analyze Accelerate Aboard according toannotate Accommodate About ahead of ascertain Accomplish Above as of attend Accumulate Absent as per audit Achieve Across as regards build Acquire After aside fromcalculate Act Against because of consider Activate Along close toconstruct Adapt Alongside due tocontrol Add Amid except for TABLE 2 SAMPLE WORDS USED FOR FILTERING
Preposition _3 (16)Preposition _4 (9)Pronoun(77)Adjectives(395)
as far as apart from All earlyas well as but Another abundant by means of except Any adorablein accordancewith plus anybody adventurousin addition to save Anyone aggressivein case of concerning anything agreeablein front of considering Both alertin lieu of regarding Each alivein place of worth each other amusedin point of Either ancientTABLE 3 SAMPLE WORDS USED FOR FILTERING
Adverbs (331) Conjunctions (25) Interjections (77)
Abnormally And Absolutelyabsentmindedly But AchooAccidentally For Ack Acidly Nor AgreedActually Or AhaAdventurously So AhemAfterwards Yet AhhAlmost after AhoyAlways although Alack Angrily as Alas
To avoid misinterpretation, work words will analyze how an author writes his emailand what clarity he has in the mail. The number of work words will indicate performance task requirements in a neat, unambiguous manner byusing the work words that translate exactly what anauthor has in his mind. Action words: It indicatessome actions during an expressing in the email.Preposition, adjectives, adverbs, conjunctions andinterjections have their standard meanings.The total number of words used as basic dictionary is1648 (work + action + prepositions + adjectives +adverbs + conjunctions + Interjections). The numbersmentioned in the paranthesis are the total in eachcategory whereas, only few words are shown in thetables for understanding.A schematic diagram for implementation of the proposed work is presented din Figure 1.
Fig.1 (a) Training the systemFig.1 (b) Testing the system
Email: The email received in the systemExtract words: all the words in the email arearranged.Filter words: The words given in Table 1-3 aresearched in the extracted words. Subsequently, theword frequencies are found.Author matrix: A matrix with column as authors andvertical rows with word frequencies.Training patterns: The columns of the matrix are usedas training patterns and labeling are introduced.Emails
Extract wordsFilter wordsusingtemplateFind thefrequencyand thewords for eachTrain RBFand storefinalweightsCreateauthor matrixIdentifytheauthor Emails
Extract wordsFilter wordsusingtemplatewords givenFind thefrequency andthe words for each categoryProcesswith finalweights
(IJCSIS) International Journal of Computer Science and Information Security,Vol. 9, No. 1, January 201169http://sites.google.com/site/ijcsis/ISSN 1947-5500