Professional Documents
Culture Documents
Inverted Index
Another option is to build some data structures (called indices) from the
document collection to speed up retrieval or search.
2
Inverted Index
The inverted index, which has been shown superior to most other
indexing schemes, is a popular one. It is perhaps the most
important index method used in search engines.
Fig. (A) is a simple version, where each term is attached with only an
inverted list of IDs of the documents that contain the term.
Each inverted list in Fig (B) is more complex as it contains additional
information, i.e., the frequency count of the term and its positions in
each document.
Index Construction
Let us build an inverted index for the three documents in previous
Example.
To build the index efficiently, the trie is usually stored in memory. However,
in the context of the Web, the whole index will not fit in the main memory.