
Prepared by:

2014510013 SEREN BOLAT
2014510043 ÖZGÜR HEPSAĞ
2014510091 ABDULSAMET İLERİ
11.8.2016
OUTLINE
INTRODUCTION
PROGRESS SUMMARY
PROBLEMS ENCOUNTERED
CONCLUSION
QUESTIONS
REFERENCES
INTRODUCTION
* The name of the project is "A Text-Based Search Engine".

* 670 text files are searched for the user's queries.

* The most important part of the project is presenting the most relevant documents quickly for user queries, saving the user's time.
PROGRESS SUMMARY
Class Design
[Class diagram: HashTable, Window, HashOpen, File Operations, Word, HashEntry, Document]
File Operations
• Reading text files: BufferedReader
• Eliminating stopwords
• Removing punctuation: Regex
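
The file-operation steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the class name `TokenizerSketch`, the short stopword list, and the punctuation regex are all assumptions made for the example.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

class TokenizerSketch {
    // Hypothetical stopword list; the project's real list is not shown in the slides.
    static final Set<String> STOPWORDS =
            new HashSet<>(Arrays.asList("the", "a", "an", "is", "of", "and"));

    // Reads the text line by line, replaces punctuation with spaces, drops stopwords.
    static List<String> tokenize(BufferedReader reader) {
        List<String> words = new ArrayList<>();
        try {
            String line;
            while ((line = reader.readLine()) != null) {
                // Assumed regex: anything that is not a letter, digit, or whitespace.
                String cleaned = line.toLowerCase(Locale.ROOT)
                        .replaceAll("[^\\p{L}\\p{Nd}\\s]", " ");
                for (String token : cleaned.trim().split("\\s+")) {
                    if (!token.isEmpty() && !STOPWORDS.contains(token)) {
                        words.add(token);
                    }
                }
            }
        } catch (IOException e) {
            // Wrapped so callers do not need a checked exception in this sketch.
            throw new UncheckedIOException(e);
        }
        return words;
    }
}
```

For a file on disk, the reader could be created with `new BufferedReader(new FileReader(path))`, where `path` is a placeholder for one of the text files.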

Indexing

A hash table has been used for indexing.

Both open addressing methods and the chaining method have been used.

Key generation:
Hash functions:

Rehashing for open addressing
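
As an illustration of open addressing with rehashing, here is a minimal linear-probing sketch. The project's own key generation and hash functions are not reproduced on the slide, so the base-31 polynomial hash, the 0.5 load-factor limit, and the class name `LinearProbingIndexSketch` are assumptions chosen only for this example.

```java
class LinearProbingIndexSketch {
    private String[] keys;
    private int[] counts;
    private int size;

    LinearProbingIndexSketch(int capacity) {
        int cap = Math.max(2, capacity); // keep at least one free slot for probing
        keys = new String[cap];
        counts = new int[cap];
    }

    // Assumed hash: base-31 polynomial over the characters, reduced modulo the capacity.
    private static int hash(String word, int capacity) {
        long h = 0;
        for (int i = 0; i < word.length(); i++) {
            h = (31 * h + word.charAt(i)) % capacity;
        }
        return (int) h;
    }

    // Records one occurrence of the word; on collision, steps to the next slot.
    void add(String word) {
        if ((size + 1) * 2 > keys.length) {
            rehash(); // keep the load factor at or below 0.5 (assumed limit)
        }
        int i = hash(word, keys.length);
        while (keys[i] != null && !keys[i].equals(word)) {
            i = (i + 1) % keys.length;
        }
        if (keys[i] == null) {
            keys[i] = word;
            size++;
        }
        counts[i]++;
    }

    // Returns how many times the word was added, or 0 if it is absent.
    int count(String word) {
        int i = hash(word, keys.length);
        while (keys[i] != null) {
            if (keys[i].equals(word)) {
                return counts[i];
            }
            i = (i + 1) % keys.length;
        }
        return 0;
    }

    // Grows the table and reinserts every entry at its new position.
    private void rehash() {
        String[] oldKeys = keys;
        int[] oldCounts = counts;
        keys = new String[oldKeys.length * 2 + 1];
        counts = new int[keys.length];
        for (int j = 0; j < oldKeys.length; j++) {
            if (oldKeys[j] != null) {
                int i = hash(oldKeys[j], keys.length);
                while (keys[i] != null) {
                    i = (i + 1) % keys.length;
                }
                keys[i] = oldKeys[j];
                counts[i] = oldCounts[j];
            }
        }
    }

    int size() { return size; }

    int capacity() { return keys.length; }
}
```

Quadratic probing and double hashing would differ only in how the next slot index is computed inside the probe loops.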


Performance Comparison

File Operations
• 35 seconds

Indexing
• For Chaining => 8 seconds
• For Linear Probing => 9 seconds
• For Quadratic Probing => 7 seconds
• For Double Hashing => 6 seconds
Query Execution

Query execution is based on ranking, to save the user's time.


Ranking

TF (Term Frequency)
• TF = frequency / totalWord

IDF (Inverse Document Frequency)
• IDF = 1 + log_e(totalWord / numberOfDocumentsWithTerm)

EUCLID
• sqrt(pow(frequency * IDF / TF, 2)) + sqrt(pow(TF, 2))
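
The three ranking formulas can be written out directly in Java. The sketch below transcribes them exactly as they appear on the slide; the class name `RankingSketch` and the method names are assumptions introduced for illustration.

```java
class RankingSketch {
    // TF = frequency / totalWord
    static double tf(int frequency, int totalWord) {
        return (double) frequency / totalWord;
    }

    // IDF = 1 + log_e(totalWord / numberOfDocumentsWithTerm), as written on the slide
    static double idf(int totalWord, int numberOfDocumentsWithTerm) {
        return 1.0 + Math.log((double) totalWord / numberOfDocumentsWithTerm);
    }

    // EUCLID = sqrt(pow(frequency * IDF / TF, 2)) + sqrt(pow(TF, 2))
    static double euclid(int frequency, double idf, double tf) {
        return Math.sqrt(Math.pow(frequency * idf / tf, 2)) + Math.sqrt(Math.pow(tf, 2));
    }
}
```

For example, a term that occurs 3 times in a document of 100 words has `tf(3, 100)` equal to 0.03.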
Sorting
According to the ranking values for the user's query, a sorting
algorithm runs in the background and the results are given to the
user. In this way the most relevant documents are presented quickly
and efficiently.
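
A minimal sketch of the sorting step is shown below. The slide does not name the sorting algorithm, so Java's built-in `List.sort` is used here as a stand-in. The `ScoredDocument` type and the convention that a higher ranking value means a more relevant document are also assumptions.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

class ResultSortingSketch {
    // Hypothetical pairing of a document name with its ranking value.
    static class ScoredDocument {
        final String name;
        final double score;

        ScoredDocument(String name, double score) {
            this.name = name;
            this.score = score;
        }
    }

    // Returns up to `limit` document names, highest ranking value first.
    static List<String> topDocuments(List<ScoredDocument> scored, int limit) {
        List<ScoredDocument> sorted = new ArrayList<>(scored); // leave the input untouched
        sorted.sort(Comparator.comparingDouble((ScoredDocument d) -> d.score).reversed());
        List<String> names = new ArrayList<>();
        for (int i = 0; i < Math.min(limit, sorted.size()); i++) {
            names.add(sorted.get(i).name);
        }
        return names;
    }
}
```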
PROBLEMS ENCOUNTERED
o One of the project members did not have an internet
connection. Because of this, all project members had to
meet regularly in the department.

o Another member's computer was accidentally broken in the
first week, so all project members had to use the same computer.
CONCLUSION
All project members worked perfectly and the
project was delivered on time.

SOS was created by the project members!
QUESTIONS
REFERENCES
[1] https://janav.wordpress.com/2013/10/27/tf-idf-and-cosine-similarity/
[2] http://www.regular-expressions.info/quickstart.html
