[Diagram: Window, HashOpen, File Operations, Word, HashEntry, Document]
File Operation
• Reading text files: BufferedReader
• Eliminating stopwords
• Punctuation: regex
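The three file-operation steps above can be sketched in Java as follows. This is only a minimal illustration, not the project's actual code: the class name `TextCleaner`, the tiny stopword set, and the punctuation regex are all assumptions, since the slides do not show them.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

class TextCleaner {
    // Hypothetical stopword list; the project's real list is not given in the slides.
    static final Set<String> STOPWORDS = Set.of("a", "an", "and", "the", "of", "is", "to");

    // Reads the text line by line, strips punctuation, lowercases, and drops stopwords.
    static List<String> tokenize(BufferedReader reader) throws IOException {
        List<String> words = new ArrayList<>();
        String line;
        while ((line = reader.readLine()) != null) {
            // Assumed regex: anything that is not a letter, digit, or whitespace becomes a space.
            String cleaned = line.replaceAll("[^\\p{L}\\p{Nd}\\s]", " ").toLowerCase(Locale.ROOT);
            for (String token : cleaned.trim().split("\\s+")) {
                if (!token.isEmpty() && !STOPWORDS.contains(token)) {
                    words.add(token);
                }
            }
        }
        return words;
    }

    public static void main(String[] args) throws IOException {
        // A StringReader stands in for a FileReader so the sketch runs without a file on disk.
        String sample = "The cat, and the hat!\nIs it a hash-table?";
        try (BufferedReader reader = new BufferedReader(new StringReader(sample))) {
            System.out.println(tokenize(reader)); // prints [cat, hat, it, hash, table]
        }
    }
}
```

To read a real document, wrap a `java.io.FileReader` in the `BufferedReader` instead of the `StringReader`.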
Indexing
Key generation :
Hash Functions:
File Operations
• 35 seconds
Indexing
• For Chaining => 8 seconds
• For Linear Probing => 9 seconds
• For Quadratic Probing => 7 seconds
• For Double Hashing => 6 seconds
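The three open-addressing strategies timed above differ only in which slot they try on the i-th attempt after a collision; chaining instead keeps a list of entries in each slot. The sketch below shows one common textbook form of each probe sequence. The class `ProbeSequences` and the choice of second hash are assumptions, because the slides do not give the project's hash functions.

```java
class ProbeSequences {
    // Home slot of a key: its hash reduced to the range [0, capacity).
    static int home(int hash, int capacity) {
        return Math.floorMod(hash, capacity);
    }

    // Linear probing: the i-th attempt is i slots after the home slot.
    static int linear(int hash, int i, int capacity) {
        return (home(hash, capacity) + i) % capacity;
    }

    // Quadratic probing: the i-th attempt is i*i slots after the home slot.
    static int quadratic(int hash, int i, int capacity) {
        return (home(hash, capacity) + i * i) % capacity;
    }

    // Double hashing: the step size comes from a second hash of the key.
    // Assumed second hash: 1 + (hash mod (capacity - 1)), which is never zero.
    // Requires capacity >= 2; a prime capacity lets every step visit all slots.
    static int doubleHash(int hash, int i, int capacity) {
        int step = 1 + Math.floorMod(hash, capacity - 1);
        return (home(hash, capacity) + i * step) % capacity;
    }

    public static void main(String[] args) {
        int hash = 10, capacity = 7;
        for (int i = 0; i < 4; i++) {
            System.out.println(i + ": linear=" + linear(hash, i, capacity)
                + " quadratic=" + quadratic(hash, i, capacity)
                + " double=" + doubleHash(hash, i, capacity));
        }
    }
}
```

With hash 10 and capacity 7, the first four attempts are 3, 4, 5, 6 for linear probing, 3, 4, 0, 5 for quadratic probing, and 3, 1, 6, 4 for double hashing.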
Query Execution
TF (Term Frequency)
• TF = frequency / totalWord
EUCLID
• sqrt(pow((Frequency*IDF/TF),2)) + sqrt(pow(tf,2))
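As a small illustration of the TF formula above, the following sketch computes frequency / totalWord for every distinct word of one document. The class `TermFrequency` is hypothetical, and `java.util.HashMap` is used here only as a stand-in for the project's own hash tables.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class TermFrequency {
    // Returns TF = frequency / totalWord for each distinct word in the document.
    static Map<String, Double> compute(List<String> words) {
        // Count how often each word occurs.
        Map<String, Integer> counts = new HashMap<>();
        for (String word : words) {
            counts.merge(word, 1, Integer::sum);
        }
        // Divide each count by the total number of words in the document.
        int totalWord = words.size();
        Map<String, Double> tf = new HashMap<>();
        for (Map.Entry<String, Integer> entry : counts.entrySet()) {
            tf.put(entry.getKey(), (double) entry.getValue() / totalWord);
        }
        return tf;
    }

    public static void main(String[] args) {
        Map<String, Double> tf = compute(List.of("hash", "table", "hash", "probe"));
        System.out.println(tf.get("hash"));  // prints 0.5
        System.out.println(tf.get("table")); // prints 0.25
    }
}
```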
Sorting
The sorting algorithm runs in the background and orders the
documents by the ranking values computed for the user's query.
The results are then returned to the user, so the most relevant
documents are presented quickly and efficiently.
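A minimal sketch of this ranking step is shown below. The slides do not name the sorting algorithm, so Java's built-in `List.sort` is used as a stand-in, and the class `RankedResults` is hypothetical. The sketch also assumes that a higher ranking value means a more relevant document; if the ranking value is a distance, the comparator should not be reversed.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

class RankedResults {
    // Returns document names ordered from the highest ranking value to the lowest.
    static List<String> sortByScore(Map<String, Double> scores) {
        List<Map.Entry<String, Double>> entries = new ArrayList<>(scores.entrySet());
        // Assumption: larger value = more relevant, so sort in descending order.
        entries.sort(Map.Entry.<String, Double>comparingByValue().reversed());
        List<String> names = new ArrayList<>();
        for (Map.Entry<String, Double> entry : entries) {
            names.add(entry.getKey());
        }
        return names;
    }

    public static void main(String[] args) {
        Map<String, Double> scores = Map.of("a.txt", 0.2, "b.txt", 0.9, "c.txt", 0.5);
        System.out.println(sortByScore(scores)); // prints [b.txt, c.txt, a.txt]
    }
}
```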
PROBLEMS ENCOUNTERED
o One of the project members did not have an internet
connection. Because of this, all project members
had to meet regularly in the department.