Professional Documents
Culture Documents
About The BookID™ Copyright Protection System - Scribd Help Center
About The BookID™ Copyright Protection System - Scribd Help Center
How it works
BookID algorithmically analyzes computer-readable text for semantic data that it then
encodes into a digital "fingerprint.” BookID stores the fingerprints of known copyrighted
works on a secure server that is inaccessible to the Internet and the general public.
BookID scans every document uploaded to Scribd and removes those that have the
same, or a substantially similar, fingerprint. BookID intermittently scans the entire Scribd
library to remove matching content that was uploaded prior to fingerprinting. BookID’s
approach reduces misidentifications and enables the detection of infringing works even
if they have been altered to some degree.
Limitations
BookID relies on computer-readable text, which is not necessarily the same as text that
is readable by humans. Content scanned from paper sources may not contain computer-
readable text, which makes those sources unsuitable for fingerprinting. Similarly, text
that is encoded with optical character recognition (OCR) technology may contain garbled
or partial data. These conditions make it very difficult, if not impossible, to detect
matches.
BookID’s fingerprint scanner cannot detect specific keywords, titles, names, copyright
notices, or other disclaimers that are part of a document's text. In other words, BookID
cannot be programmed to block all documents that contain a book’s title. Likewise,
BookID cannot translate different languages. If BookID fingerprints an English-language
document, it can only detect subsequent uploads that are also in English.
BookID cannot detect images, illustrations, and sheet music at this time.
False positives
BookID contains fingerprints of educational textbooks and other works that contain long
excerpts of classic literature, religious texts, legal documents, and government
publications in the public domain. This occasionally results in the temporary removal of
non-copyrighted, authorized, or public domain material from Scribd.com and the mobile
app.
Comments (0)
Submit a request