research paper from May 8, 2007
"The machine-generated nature of splogs makes them good candidates for statistical assessment. This assessment can include looking at:
1. Post Time: Two measures that capture regularity in posting time: a micro view (e.g. posts that go live in the morning, before the blogger's "real" job) and a macro view (e.g. a long gap in posting due to a vacation).
2. Post Content: A measure of the blogger's topic drift. A blogger usually stays focused on one topic but will sometimes write about others.
3. Post Links: The links on a blog can be telling. A large proportion pointing to one particular domain, for example, suggests a relationship with that domain.
"
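The three signals listed above can be illustrated with a small Python sketch. This is not the paper's self-similarity method; it is a simplified stand-in that assumes a blog is available as a list of post timestamps, a list of post texts, and a list of outbound URLs. All function names here are hypothetical, and word-set Jaccard overlap is swapped in as a crude proxy for topic similarity.

```python
from collections import Counter
from datetime import datetime
from statistics import pstdev
from urllib.parse import urlparse


def hour_of_day_spread(timestamps):
    """Micro view: std. dev. of posting hour. Low values mean regular timing.

    Simplification: ignores wrap-around at midnight.
    """
    hours = [t.hour + t.minute / 60 for t in timestamps]
    return pstdev(hours) if hours else 0.0


def largest_gap_days(timestamps):
    """Macro view: longest pause between consecutive posts, in days."""
    ordered = sorted(timestamps)
    gaps = [(b - a).total_seconds() / 86400 for a, b in zip(ordered, ordered[1:])]
    return max(gaps) if gaps else 0.0


def jaccard(a, b):
    """Word-set overlap between two texts (assumed proxy for topic similarity)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    union = sa | sb
    return len(sa & sb) / len(union) if union else 1.0


def topic_drift(posts):
    """Mean dissimilarity between consecutive posts: 0 = identical, 1 = disjoint."""
    pairs = list(zip(posts, posts[1:]))
    if not pairs:
        return 0.0
    return sum(1 - jaccard(a, b) for a, b in pairs) / len(pairs)


def top_domain_share(urls):
    """Most-linked domain and the fraction of all links that point to it."""
    domains = [urlparse(u).netloc.lower() for u in urls]
    domains = [d for d in domains if d]  # drop URLs with no host part
    if not domains:
        return None, 0.0
    domain, count = Counter(domains).most_common(1)[0]
    return domain, count / len(domains)
```

A blog with very low posting-hour spread, near-zero topic drift, and a top-domain share close to 1 would look suspiciously machine-like under these toy measures, though any real threshold would have to be learned from labeled data.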
8 Pages
Date Added: 01/09/2009
Category: Uncategorized