You are on page 1of 1

Measuring the similarity between documents is an important operation in the text processing field.

designed by using a similarity function to measure the closeness between the text objects. The mo
function which is used commonly in the text domain is the cosine similarity function. Computation
fundamental problem in information retrieval. Although most of the work in information retrieval ha
the similarity of a keyword query and a text document, rather than the similarity between two doc
similarity functions can also be applied to optimize the similarity function for clustering. The proble
widely studied in the data mining, machine learning, database, and information retrieval communi
number of diverse domains, such as target marketing, medical diagnosis, news group filtering, and

You might also like