Professional Documents
Culture Documents
• In small farms, this role can be shared on a server with the application server Host A Host B
Web Servers
Crawl and component processes role. Web Server Web Server
The crawl and content processing architecture includes the crawl component, crawl Application server with search components Index Index component Index replicas Web Server Web Server
database and content processing component. Both search components can be scaled out The index component is the logical representation • Each index partition holds one or more index
based on crawl volume and performance requirements. • Hosts all of the search components if only one server is configured. of an index replica. replicas that contain the same information. Office Web Apps Server Office Web Apps Server *
Otherwise, it holds components associated with the server, as configured by • You have to provision one index component for
1 About the crawl component Index partitions each index replica.
the administrator.
• The crawl component is responsible for crawling content sources. It delivers crawled items • You can divide the index into discrete portions, • To achieve fault tolerance and redundancy,
– both the actual content as well as their associated metadata – to the content processing • Holds the entire search index if only one index partition is configured. each holding a separate part of the index. create additional index replicas for each index * Office Web Apps Server VMs can share the same host as a SharePoint Web or application
server.
component. • An index partition is stored in a set of files on a partition and distribute the index replicas over
Otherwise, it holds portions of the index that are associated with the index
• The crawl component invokes connectors or protocol handlers that interact with content disk. multiple application servers.
partitions as configured by the administrator. Host C Host D Host E Host F
Application Servers
sources to retrieve data. Multiple crawl components can be deployed to crawl • The search index is the aggregation of all index
simultaneously. q The query processing component routes incoming queries to index partitions. Application Server Application Server Application Server Application Server
• The content processing component is placed between the crawl component and the index any given time. All other Application Server
Roles
All other Application Server
Roles
component. It processes crawled items and feeds these items to the index component. Application Server Application Server
• The content processing component transforms crawled items into artifacts that can be Content Processing Content processing component Analytics Analytics
included in the search index by carrying out operations such as document parsing and Carries out various processes on the crawled Content processing Content processing
property mapping. items, such as document parsing and property
• Both the content processing component and the query processing component perform Analytics Analytics processing component mapping.
Application Server Application Server
linguistics processing. Examples of linguistics processing during content processing are Carries out search analytics and usage analytics. Admin Admin
Crawl Crawl
language detection and entity extraction. Content processing Content processing
• The content processing component writes information about links and URLs to the link
database. Database server Search Admin DB Search administration database
Stores search configuration data. Only one search
• Hosts search databases. administration database per Search service Host C Host D
Database Servers
Crawl DB Crawl database
Index and query processes • Can host other SharePoint Server 2016 or SharePoint Server 2019 databases. • Stores the crawl history.
application. All SharePoint Databases
Search admin DB
All SharePoint Databases
query volume, and performance requirements. • To increase performance and capacity, consider adding disks to the database crawl components associated with it. Crawl DB
Analytics DB
server or adding database servers (depending on the bottleneck).
4 About the index component All other SharePoint
Databases
Link DB Link database
• An index component is the logical representation of an index replica. In the search Stores the information extracted by the content
architecture, you have to provision one index component for each index replica. processing component and also stores click-
Paired hosts for fault tolerance
• The index component receives processed items from the content processing component Analytics DB Analytics reporting database through information.
and writes those items to an index file. Stores the results of usage analytics.
• The index component receives queries from the query processing component and
provides results sets in return.
• Queries are sent to the index replicas through the query processing component. The
system routes and load balances the incoming queries to the index replicas.
About the index partition
• An index partition is a logical portion of the entire search index. The search index is the Analytics processes
• The query processing component is between the search front-end and the index 3 About the analytics processing component
component. Content sources
• The query processing component analyzes and processes search queries and results. • The analytics processing component performs two types of analyses: search analytics
• Both the query processing component and the content processing component perform and usage analytics. This component uses information from these analyses to improve
linguistics processing. Examples of linguistics processing during query processing are search relevance, create search reports, and generate recommendations and deep links. HTTP
word-breaking and stemming. • Search analytics is about extracting information such as - links, the number of times an
• When the query processing component receives a query from the search front-end, it item is clicked, anchor text, data related to people, and metadata – from the link Content is fed to the search index in this direction Queries are sent to the search index in this direction
analyzes and processes the query to attempt to optimize precision, recall, and relevancy. database. This information is important to relevance. File shares
The processed query is then submitted to the index component. • Usage analytics is about analyzing usage log information received from the front-end
• The index component returns a result set based on the processed query back to the query via the event store. Usage analytics generates usage and statistics reports. Query Front-end
processing component, which in turn processes that result set before sending it back to • The results from the analyses are added to the items in the search index. In addition, SharePoint
the search front-end. results from usage analytics are stored in the analytics reporting database.
1 2 4 5
B About the link database User profiles
Content
Search administration • The link database stores information extracted by the content processing component.
Crawl
component
processing
component
Index component Query processing component
Client application
Search administration is composed of the search administration component and its In addition, it stores information about search clicks; the number of times people click Exchange
corresponding database. on a search result from the search result page. This information is stored unprocessed,
to be analyzed by the analytics processing component.
6 About the search administration component Lotus Notes
C About the analytics reporting database
• The search administration component is responsible for running a number of system
processes that are essential to search. • The analytics reporting database stores the results of usage analytics. A
3
• This component carries out provisioning, which is to add and initialize additional instances • In addition, the analytics reporting database also stores statistics information from the Documentum Crawl database
Analytics B
of the other search components. analyses. SharePoint uses this information to create Excel reports that show different processing Link database
component
statistics.
D About the search administration database
E About the event store
• The search administration database stores search configuration data, such as the
topology, crawl rules, query rules, and the mappings between crawled and managed • The event store holds usage events that are captured on the front-end, such as the Custom
Index file store on disk
6
D
Search
administration Search
component administration database Event store
© 2016 Microsoft Corporation. All rights reserved. To send feedback about this documentation, please write to us at ITSPdocs@microsoft.com.