Professional Documents
Culture Documents
Content Server
Version 6.6
EMC Corporation
Corporate Headquarters:
Hopkinton, MA 01748-9103
1-508-435-1000
www.EMC.com
EMC believes the information in this publication is accurate as of its publication date. The information is subject to change
without notice.
The information in this publication is provided as is. EMC Corporation makes no representations or warranties of any kind
with respect to the information in this publication, and specifically disclaims implied warranties of merchantability or fitness
for a particular purpose. Use, copying, and distribution of any EMC software described in this publication requires an
applicable software license.
For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com. All other trademarks
used herein are the property of their respective owners.
© Copyright 2010 EMC Corporation. All rights reserved.
Table of Contents
Preface ................................................................................................................................. 7
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 3
Table of Contents
Overview ......................................................................................................... 49
Adding the full-text indexing system after upgrading from a 5.3
repository ....................................................................................................... 49
Upgrading a full-text system on Content Server upgraded from 6.5 or
later ................................................................................................................. 50
Upgrading a full-text system on Content Server upgraded from 5.3
SP6, 6.0, or 6.0 SP1 ............................................................................................ 51
4 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Table of Contents
List of Figures
Figure 1. The first available document processor handles each incoming document ................ 33
Figure 2. Mapping file stores to collections to index columns ................................................. 41
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 5
Table of Contents
List of Tables
6 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Preface
Intended audience
This manual is intended for the person installing Content Server and the full-text indexing software.
Typically, a system administrator installs the software.
Related documentation
• The EMC Documentum Content Server Full-text Indexing Administration Guide contains instructions
about configuring and administering your full-text indexing system.
• The Content Server Installation Guide contains information on installing Content Server.
• The Documentum Administrator online help system contains instructions for managing the index
queue and for starting and stopping the index server and index agent, in basic, consolidated,
and high-availability deployments. No Documentum Administrator support is provided for
multinode deployments.
• The EMC Documentum Search Development Guide contains complete information on querying.
Revision history
The following revisions have been made to this document:
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 7
Preface
Revision History
Date Description
May 2010 Initial publication for version 6.6.
8 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Chapter 1
Prepare for Installation
Use the information in this chapter to prepare your installation for the full-text indexing component
installation.
These topics are included:
• Pre-installation procedure, page 9
• Preinstallation checklist, page 13
Pre-installation procedure
1. Install Content Server and configure a repository.
Use the instructions in the Content Server Installation Guide.
2. Host names
You need to identify the host where the index server and index agent are installed by a fully
qualified domain name (FDQN). For example, the host name isolde.documentum.com is
acceptable, but an IP address, for example, 172.04.8.275 is not acceptable.
3. Ports to use for the index agent
The index agent runs in the application server container. When an index agent instance is
configured, you need to designate two ports for the index agent and application server to use.
The default ports for the first index agent on a host are 9200 and 9080. The default for any other
index agents are 90220 for index agent 2, 9240 for index agent 3, and so on. If the index agent is
on the Content Server host, ensure that the ports are not the ports used for the application server
instance in which the Java method server and ACS server run.
4. Ports to use for the index server
The index server requires a contiguous range of 4000 free ports. You must designate which ports
to use during installation. The default range is from 13000 to 17000.
5. Index server operating system and host
Note: Installing the index server installs Java on the host. The installed version of Java is the same
version as the Java version installed with Content Server.
You need to install the index server on a supported operating system. EMC Documentum
recommends using a host on which a clean installation of the operating system has been
performed.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 9
Prepare for Installation
Install the index server on a disk partition separate from the system partition and larger than
the system partition.
Constraints on third-party software on the index server host — The following restrictions
apply to the index server host:
• Do not run network security scanning software on a host where the index server is installed.
Network security scanners might lock index server processes, which can create intermittent
search and indexing failures.
• Do not run backup utilities while the index server is running.
Backup utilities might lock indexing processes.
• Do not run antivirus software on the %FASTSEARCH% directory, where the index server and
indexes are stored.
Antivirus software interferes with index server startup and proper functioning. Antivirus
software might quarantine log and other frequently-changed files.
EMC Documentum recommends testing any third-party monitoring tools on a development
system before the tools are deployed to a production system where the index server is installed.
Windows host requirements for the index server — The following restrictions apply to
Windows hosts:
• Do not run the Windows Index System on the index server host.
• On 32-bit Windows hosts, do not set the /3GB option in the boot.ini file.
• Disable automatic Windows updates on index server hosts.
• Do not install the index server on a domain controller.
Note: If a 5.2.x repository is running on a Windows host and you are performing a pre-upgrade
index migration, you must install the index agent and index server on a host other than the
Content Server host. For more information, refer to Chapter 3, Upgrade.
6. Host time settings
Set the time zone on the host where the index server runs to Greenwich Mean Time (GMT) or
Universal Time Coordinated (UTC). On Windows hosts, clear Automatically adjust clock for
daylight saving changes.
7. Ensuring correct network configuration
If you are installing the indexing software on a host other than the Content Server host, ensure
that the domain name service (DNS) entries for the two machines are correct so that they are able
to locate each other on the network.
To verify the DNS entries:
a. On the index server host, look up the Content Server host:
nslookup FQDN_of_Content_Server_host
where FQDN_of_Content_Server_host is the FQDN of the Content Server host.
This returns one or more IP addresses for the Content Server host.
10 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Prepare for Installation
$DOCUMENTUM/fulltext/
fast
$DOCUMENTUM_SHARED/
dfc
$DOCUMENTUM_SHARED/
IndexAgents/ftintegrity
FASTSEARCH Location of the index server $DOCUMENTUM/fulltext/
IndexServer
DISPLAY Controls the display localhost:0.0
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 11
Prepare for Installation
10. Ensuring that the index server environment is correct on UNIX and Linux hosts
The index server installation includes a script that sets required environment variables for
running the index server. The script is setupenv.sh or setupenv.csh, depending on the shell from
which you run, and it is located in the indexserver_install_dir/bindirectory. You can source this
script to ensure that the environment variables are correct.
11. The deprecated DFC_DATA environment variable on UNIX hosts
The DFC_DATA environment variable was deprecated after the 5.1 EMC Documentum
release, but it is still used by Documentum installers for backward compatibility. If you are
installing the indexing software on a UNIX host where older EMC Documentum software
required setting DFC_DATA, the installer uses the value of DFC_DATA to create the /config
directory ($DFC_DATA/config). However, the startupIndexAgent.sh script expects to
find the $DOCUMENTUM_SHARED variable set and expects the /config directory to be
$DOCUMENTUM_SHARED/config.
If the /config directory is not $DOCUMENTUM_SHARED/config, edit the startupIndexAgent.sh
script so that it points to the valid /config directory path on the index agent host. Replace these
lines:
CLASSPATH=$DOCUMENTUM_SHARED/dctm.jar:$DOCUMENTUM_SHARED/config:
$DOCUMENTUM_SHARED/dfc/dfc.jar:$DOCUMENTUM_SHARED/dfc/dfcbase.jar:
$DOCUMENTUM_SHARED/dfc/log4j.jar
with:
CLASSPATH=$DOCUMENTUM_SHARED/dctm.jar:$DOCUMENTUM/config:
$DOCUMENTUM_SHARED/dfc/dfc.jar:$DOCUMENTUM_SHARED/dfc/dfcbase.jar:
$DOCUMENTUM_SHARED/dfc/log4j.jar
12 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Prepare for Installation
Preinstallation checklist
Use following the checklist to ensure that you have performed all required tasks before installing
or upgrading the full-text indexing software.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 13
Prepare for Installation
14 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Chapter 2
Install
Overview
This chapter contains instructions for installing the full-text indexing software and creating
full-text indexes, whether you are upgrading from an earlier Documentum release or creating new
repositories.
The high-level procedure for installing the full-text indexing server and full-text indexing components
for a new installation is:
1. Install the index server and index agent configuration program.
Use the instructions in Installing the index server and the index agent configuration program,
page 17.
2. Configure the index agent.
Configuring the index agent, page 24 has instructions.
3. Start the index agent in normal mode.
To install a consolidated deployment, install the index agent and configure an index agent for each
repository, then index each repository.
To install a high availability configuration, you need to install the indexing software on the first host,
perform steps on the Content Server host, install the indexing software on the second host, and
perform additional steps on the Content Server host.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 15
Install
1. If you do not already have an existing full-text indexing system, install the full-text indexing
software on the first host and configure an index agent.
Note: On Windows, you need to log on as the same user, in the same domain, as the user who
installed the Content Server installation.
Use the instructions in Installing the index server and the index agent configuration program,
page 17 and Configuring the index agent, page 24.
2. Log on to the Content Server host as the installation owner.
3. Ensure that no users are connected to the repository.
4. Shut down the first index agent.
5. Navigate to the directory where the create_fulltext_objects_ha.ebs script is located:
• On Windows, the %DM_HOME%\install\admin folder
• On UNIX or Linux, the $DM_HOME/install/admin directory
6. If you are upgrading the system, run the following command:
dmbasic -f create_fulltext_objects_ha.ebs
-e HACleanupBeforeUpgradeStep --
repository_name
Superuser_name Superuser_password
where repository_name is the name of the repository, Superuser_name is the user name of a user
with superuser privileges in the repository, and Superuser_password is the superuser’s password
7. Run the create_fulltext_objects_ha.ebs script using this syntax, where repository_name is the name
of the repository, Superuser_name is the user name of a user with Superuser privileges in the
repository, and Superuser_password is the Superuser’s password:
dmbasic -f create_fulltext_objects_ha.ebs
-e HAPreInstallStep -- repository_name
Superuser_name Superuser_password
8. Install the full-text indexing software on the second indexing host and configure an index agent,
using the instructions in Installing the index server and the index agent configuration program,
page 17 and Configuring the index agent, page 24.
Do not start the new index agent. The repository now contains two full-text index objects, two
ft index agent config objects, and two ft engine config objects.
9. Log on to the Content Server host as the Content Server installation owner.
10. Navigate to the directory where the create_fulltext_objects_ha.ebs script is located:
• On Windows, the %DM_HOME%\install\admin folder
• On UNIX or Linux, the $DM_HOME/install/admin directory
16 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
11. Run the create_fulltext_objects_ha.ebs script using this syntax, where repository_name is the name
of the repository, Superuser_name is the user name of a user with superuser privileges in the
repository, and Superuser_password is the superuser’s password:
dmbasic -f create_fulltext_objects_ha.ebs
-e HAPostInstallStep -- repository_name
Superuser_name Superuser_password
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 17
Install
Installing the index server also installs Java on the index server host. The Java version level is the
same as the version level of Java installed with Content Server.
The installer installs the index agent configuration program, which you use to configure an index
agent instance. If you do not configure the index agent immediately after installing the configuration
program, you can do so at a later time.
To install the index server and the index agent configuration program:
1. On HPUX-IA11.23, use these commands to unset the SHLIB_PATH variable:
• csh: unsetenv SHLIB_PATH
• sh: unset SHLIB_PATH
Otherwise, you might get the following error message:
Error occurred during initialization of VM. Unable to load native library: some symbol required
This is a known issue. (CS-14260)
After installation completes, you can user these commands to reset the SHLIB_PATH variable:
• csh: setenv SHLIB_PATH
• sh: set SHLIB_PATH
2. On HP-UX, you must set the following parameter values:
• Set maxdsiz or data seg size at 2 GB (0x80000000)
• Enable Largefiles
• Set maxusers to 256 or higher
• Set max_thread_proc to 256 or higher
• Set maxfiles to 1024 or higher
3. Ensure that the repository (for which you are installing the index server and index agent)
is running.
4. Log on to the index server and index agent host as the Content Server installation owner.
On Windows, this means you need to log on as the same user, in the same domain, as the user
who installed the Content Server installation.
5. Copy the installation files from the EMC Documentum download site or distribution CDs to
a temporary location on the host.
6. Start the installation program.
• On Windows, double-click fulltextWinSuiteSetup.exe.
• On UNIX and Linux, type
% fulltextoperatingsystemSuiteSetup.bin
and press Enter, where operatingsystem is the operating system on which you are installing.
A Welcome dialog box is displayed.
7. Click Next.
The license agreement dialog box is displayed.
8. Click I accept the terms of the license agreement and click Next.
A dialog box is displayed that lists the products you can install.
18 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 19
Install
Choosing the parts of speech to be indexed can reduce the size of the indexes and the disk
space required for maintaining the indexes. You can enable grammatical normalization only
for the languages listed on the dialog box. If you enable grammatical normalization, it is
enabled by default for Japanese and Korean and you cannot disable it.
Note: Content files in languages that are not chosen or that are not available for normalization
are still indexed.
e. Choose the languages for which to perform grammatical normalization and the parts of
speech to be indexed by selecting the appropriate check boxes.
The recommended choice is to normalize only nouns.
f. Accept the default directory for the full-text indexes or type the name of a different directory,
then click Next.
The default on Windows is %DOCUMENTUM%. The default on UNIX or Linux is
$DOCUMENTUM. If you choose another directory, the name cannot contain any blank
spaces. The installer creates the directory \data\fulltext (/data/fulltext on UNIX or Linux)
under the location you designate.
On HP-UX (B.11.23 U 9000/800), you cannot install the Index Server in a directory that
contains an “_d” in the directory path.
Note: If the DOCUMENTUM environment variable is set on the host, you need to accept the
default directory. The installation program uses the value of the DOCUMENTUM variable to
generate the default directory and will not allow you to change the directory.
A summary dialog box is displayed, listing the products that will be installed.
16. Click Next.
The products are installed and a panel is displayed indicating success when the installation is
completed.
17. Ensure that the index server starts.
• On Windows, select Yes, restart my computer., then click Next.
— If the computer does not restart automatically, click Start > Shutdown > Restart and restart
the computer manually.
— If the index server does not automatically start, click Start > Programs > Administrative
Tools > Services and start the FAST InStream service.
— If the system restarts, the index server starts automatically as a Windows service.
• On UNIX and Linux, navigate to the $DOCUMENTUM/fulltext/IndexServer/bin directory
(the installation location), type startup.sh, and press Enter.
The index server is started.
18. On HPUX-IA11.23, after installation completes, you can user these commands to reset the
SHLIB_PATH variable (that you unset in Step 1):
• csh: setenv SHLIB_PATH
• sh: set SHLIB_PATH
Caution: Do not run backup utilities while the index server is running, because they may lock
indexing processes.
20 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 21
Install
22 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
No default value for the format_class property exists. You need to set it manually to designate
whether a format is always indexed or is the preferred format for indexing.
By default, the first content file in a format whose can_index property is set to true is indexed. Other
renditions of the object are not indexed. If the primary content of an object is not in an indexable
format, create a rendition in an indexable format. The Full-text Indexing Administration Guide,
“Supported and Unsupported Formats” appendix, contains a complete list of indexable formats.
If the content file associated with a SysObject exists in a no-indexable format, its properties are
still indexed. To index the content, create a rendition of the SysObject in an indexable format. Use
Documentum Content Transformation Services or third-party client applications to create the
rendition.
See the Full-text Indexing Administration Guide, “Supported and Unsupported Formats” appendix,
for a list of the formats considered indexable by the index server.
Some formats in the appendix are not represented in the repository by a format object. The
formats.cvs file, which is located in $DM_HOME/install/tools, contains a complete list of supported
mime_types and the formats with which they are associated. If a supported mime_type is not
represented by a format object, create a format object in the repository and map the supported
mime_type to the format.
2. Set the param_value property at the corresponding index position to the list of error codes
representing the additional errors you wish to catch.
The error codes are the four-digit codes returned in the error message. For example, here is an
error message with the four-digit error code highlighted in bold:
[DM_FULLTEXT_E_SEARCH_NEW_FAIL]error:"dmFTSearchnew
failed with error: QRServer Error (1012):
Resource limit exceeded, Error from QRServer, error code: -2"
Separate multiple errors codes with commas.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 23
Install
24 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 25
Install
If the paths to the content files are different, do not modify the value of <all_filestores_local>, but
instead, create a file store map within the <exporter> element.
For example, if Content Server is on a host called Dandelion where filestore_01 is physically
located in the directory /Dandelion/Documentum/data/repository_name/content_storage_01 and
the index agent and index server on a host from which the drive on the Content Server host is
shared as /mappingtoDandelion/repository_name/content_storage_01, create an alias as follows:
<local_filestore_map>
<local_filestore>
26 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
<store_name>filestore_01</store_name>
<local_mount>/mappingtoDandelion/
repository_name
/content_storage_01</local_mount>
</local_filestore>
<!-- and so on for each filestore --!>
</local_filestore_map>
If you are indexing content stored on a NAS device or a Windows 2003 Server host, you may see
the following error message in the dmi_queue_item’s message attribute:
DocumentRetriever :ERROR Retrieval error: Couldn’t open file
<file path/name> ERROR Processor error status:
DataNotAvailable Not read permission
To resolve this error, edit the <local_mount> element or elements in the IndexAgent.xml file that
reference the storage area or areas on the NAS device. Add two back slashes immediately after
the opening <local_mount> element. For example, assume the following references a storage
are on an NAS device:
<local_mount>\\100.2.4.32\share3\c\data_for_example
\content_storage_1</local_mount>
After editing, it is now:
<local_mount>\\\\100.2.4.32\share3\c\data_for_example
\content_storage_1</local_mount>
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 27
Install
The InstallProfile.xml file lists the host machines for each node and specifies which index server
processes to install on each host. See Creating the InstallProfile.xml Configuration File, page 28.
2. Run the index server installation program on each host machine, as described in Installing a
multinode configuration, page 36.
3. Install an index agent to provide communication with Content Server.
The index agent is usually installed on the host machine where the index server’s administrative
processes are running, or on the Content Server host. See Configuring the index agent, page 37.
4. If you plan to use directed routing, configure the index agent and index server to provide the
desired mapping of documents to index columns.
See Configuring directed routing, page 40.
Host section
The host section of the InstallProfile.xml configuration file identifies the hosts in the multinode
deployment and defines some basic configuration parameters for each host. To complete this section,
you need the following information:
• The fully-qualified domain name for each host computer
28 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
The items you should update with the details for your configuration are shown below in italic font.
Do not change any other values, such as the remote-cmd property or its value.
Add an additional <host> section for each host in the configuration, following the pattern shown
below. All <host> sections need to appear inside the <host-set> element.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 29
Install
Note: On Windows hosts, the value of the property username needs to start with a period and
backslash, for example, .\dmadmin. On UNIX hosts, the value would be dmadmin.
<host-set>
<host id="Fully_qualified_host_name">
<property name="hostname" value=
"Fully_qualified_host_name"/>
<property name="platform" value=
"platform"/>
<property name="username" value=
"installation_owner_name"/>
<property name="install-dir" value=
"installation_directory"/>
<property name="temp-dir" value=
"temporary_directory"/>
<property name="remote-cmd" value="SSH2"/>
<property name="baseport" value=
"base_port_number"/>
</host> </host-set>
The administrative services section identifies which host will server as the administrative node
and the index server components that are installed on the administrative node of the multinode
deployment. To complete this section, you need the fully-qualified name of the host where the
administrative node will be installed.
Below is a sample of the administration section of the InstallProfile.xml file, in which
norway1.performance.documentum.com is identified as the host where the administrative services
are installed:
<administration-set>
<configuration-service id="CS1" host-ref="norway1.
performance.documentum.com">
</configuration-service>
<log-service id="LGS1" host-ref="norway1.performance.
documentum.com"></log-service>
<administration-gui id="ADM1" host-ref="norway1.
performance.documentum.com"></administration-gui>
<vespa id="VSP1" host-ref="norway1.performance.
documentum.com"></vespa>
<storage-service id="STOR1" host-ref="norway1.
performance.documentum.com" admin="root" adminpassword=
"d4tAs34rch" user=“dmadmin" userpassword=“d3v3l0p">
</storage-service>
<name-service id="NAME1" host-ref="norway1.
performance.documentum.com">
</name-service>
</administration-set>
The items you should update with the details for your configuration are shown below in italic font.
Do not change any other values; for example, do not change the admin user name and admin
password in the <storage-service> element.
Note: The user attribute value cannot contain a period. On Windows hosts, the user attribute
value cannot start with a period and backslash (.\). You are not required to use the Documentum
installation owner user name and password for the user and userpassword attributes.
<administration-set>
30 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
The content distributor section of the InstallProfile.xml file identifies the host on which the content
distributor service runs. This is typically the administrative node, which is the node you identified
in the Administrative services section, page 30. To complete this section, you need to know the
fully-qualified name of the computer where the administrative services are installed.
Below is a sample of this section, in which the content distributor is running on a host called
norway1.performance.documentum.com:
<content-distributor-set>
<content-distributor id="CD1" host-ref="norway1.
performance.documentum.com"/>
</content-distributor-set>
The item you should update with the details for your configuration is shown below in italic font.
<content-distributor-set>
<content-distributor id="CD1" host-ref=
"Fully_qualified_host_name"/>
</content-distributor-set>
The status server section identifies the host where the status server is installed. This is typically the
administrative node, which is the node you identified in the Administrative services section, page
30. To complete this section, you need to know the fully-qualified name of the computer where the
administrative services are installed.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 31
Install
Below is an example of this section, in which the status server is running on a host called
norway1.performance.documentum.com:
<status-service>
<status-server id="STATUS1" host-ref="norway1.
performance.documentum.com">
</status-server>
</status-service>
The item you should update with the details for your configuration is shown below in italic font.
<status-service>
<status-server id="STATUS1" host-ref=
"Fully_qualified_host_name">
</status-server>
</status-service>
The search engine section identifies the hosts on which the indexers and search servers are installed.
Install both components on each host in the multinode deployment.
To complete this section, you need to know the fully-qualified names of the computers in the
deployment. You provided these names in the Host section, page 28.
Below is a sample of this section, in which the indexer and search server are running on hosts called
norway1.performance.documentum.com and norway2.performance.documentum.com:
<search-engine-set>
<search-engine id="RTS1" host-ref="norway1.
performance.documentum.com">
<property name="search" value="true"/>
<property name="index" value="true"/>
</search-engine>
<search-engine id="RTS2" host-ref="norway2.
performance.documentum.com">
<property name="search" value="true"/>
<property name="index" value="true"/>
</search-engine>
</search-engine-set>
The items you should update with the details for your configuration are shown below in italic font.
Do not change any other values. Do not change the property names indicating that the search server
and indexer are installed or change the value true.
Add an additional <search-engine> section for each host in the configuration, following the pattern
shown below. All <search-engine> sections need to appear inside the <search-engine-set> element.
<search-engine-set>
<search-engine id="RTS1" host-ref=
"Fully_qualified_host_name1">
<property name="search" value="true"/>
<property name="index" value="true"/>
</search-engine>
<search-engine id="RTS2" host-ref=
"Fully_qualified_host_name2">
<property name="search" value="true"/>
<property name="index" value="true"/>
</search-engine>
</search-engine-set>
32 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
The document processor section of the InstallProfile.xml file defines the hosts on which document
processors are installed. To complete this section, you need to know the fully-qualified names of the
computers in the deployment. You provided these names in the Host section, page 28.
Below is a sample in which the document processors are running on hosts called
norway1.performance.documentum.com and norway2.performance.documentum.com:
<document-processor-set>
<document-processor id="DP1" host-ref="norway1.
performance.documentum.com"/>
<document-processor id="DP2" host-ref="norway2.
performance.documentum.com"/>
</document-processor-set>
A common configuration has one document processor on each host. However, you can install
multiple document processors on the same host or install one or more document processors on host
separate from the nodes where the indexers and columns reside. When the Content Distributor
routes a document, it routes it to the first non-busy document processor, regardless of which host
the document processor is running on. The document processor passes the document to the proper
indexer.
Figure 1. The first available document processor handles each incoming document
The items you should update with the details for your configuration are shown below in italic font.
Do not change any other values.
Add an additional <document-processor> section for each document processor you want to create,
following the pattern shown below. All <document-processor> sections need to appear inside the
<document-processor-set> element. All host names need to match hosts listed in the Host section,
page 28.
<document-processor-set>
<document-processor id="DP1" host-ref=
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 33
Install
"Fully_qualified_host_name1"/>
<document-processor id="DP2" host-ref=
"Fully_qualified_host_name2"/>
<document-processor id="DPN" host-ref=
"Fully_qualified_host_nameN"/>
</document-processor-set>
The query server section of the InstallProfile.xml file defines the host on which the Query and Results
Server is installed. This is typically the administrative node, which is the node you identified in the
Administrative services section, page 30. of the multinode deployment. Do not install more than one
Query and Results Server.
The section also specifies which linguistic rules are used to process search queries and which
language dictionaries are installed. The default-language property sets the (single) language whose
linguistic rules are used to perform grammatical normalization, spell-checking, and so on. The
languages property sets the list of languages whose dictionaries are installed.
To complete this section, you need to know the fully-qualified name of the computer where the
administrative services are installed and the two-character language codes for the languages you
want to enable. For a list of the available language codes, see the Content Server Fulltext Indexing
Installation and Administration Guide.
Below is an example in which the Query and Results Server is running on a host called
norway1.performance.documentum.com, the default language is English (en), and the English and
French (fr) dictionaries are installed:
<query-result-processor-set>
<query-result-processor id="QRP1" host-ref="norway1.
performance.documentum.com">
<property name="default-language" value="en"/>
<property name="languages" value="en,fr"/>
</query-result-processor>
</query-result-processor-set>
The items you should update with the details for your configuration are shown below in italic font. If
you include multiple language codes for the language property, separate them with commas.
<query-result-processor-set>
<query-result-processor id="QRP1" host-ref=
"Fully_qualified_host_name">
<property name="default-language" value=
"language_code"/>
<property name="languages" value=
"language codes"/>
</query-result-processor>
</query-result-processor-set>
Webcluster section
The webcluster section defines which index columns are associated with which search engines.
34 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
The Search engine section, page 32 provides an ID for the search engine running on each host (as
the value of the id parameter of the <search-engine> element). They have sequential names based
on which node they are running on: RTS1 for node 1, RTS2 for node 2, and so on. The webcluster
section identifies the index columns and associates each column with the search engine running on
the same node. The IDs for the columns are also sequential, however the numbering starts at zero.
Therefore, as shown in the sample below, column 0 is associated with search engine RTS1, column 1
is associated with search engine RTS2, and so on.
<search-engine-cluster-set>
<search-engine-cluster id="webcluster">
<property name="use-memsearch" value="false"/>
<property name="index-profile-type" value="custom"/>
<property name="index-profile-filename"
value="..\config\datasearch\index-profiles
\index-profile-dctm.xml"/>
<search-engine-column id="col0">
<search-engine id-ref="RTS1"/>
</search-engine-column>
<search-engine-column id="col1">
<search-engine id-ref="RTS2"/>
</search-engine-column>
<query-result-processors>
<query-result-processor id-ref="QRP1"/>
</query-result-processors>
</search-engine-cluster>
</search-engine-cluster-set>
The items you should update with the details for your configuration are shown below in italic font. Do
not modify anything else in the webcluster section. Do not change the value of the index-profile-type.
It needs to remain “custom.”
Add an additional <search-engine-column> section for each host in the configuration, following
the pattern shown below. All <search-engine-column> sections need to appear inside the
<search-engine-cluster> element. The value of the <search-engine-column id> needs to be the string
“col” followed by the next sequential number after the previous column. The number needs to match
the partition_id for the column as specified in the CSConfig.xml file; see Adding a node, page 45.
The value of the <search-engine id-ref> property for each node needs to match the value specified for
that host in the Search engine section, page 32. Similarly, the value of <query-result-processor id-ref>
needs to match the ID provided in the Query server section, page 34.
<search-engine-cluster-set>
<search-engine-cluster id="webcluster">
<property name="use-memsearch" value="false"/>
<property name="index-profile-type" value="custom"/>
<property name="index-profile-filename"
value="..\config\datasearch\index-profiles
\index-profile-dctm.xml"/>
<search-engine-column id="col0">
<search-engine id-ref=
"search-engine ID for node 1"/>
</search-engine-column>
<search-engine-column id="col1">
<search-engine id-ref=
"search-engine ID for node 2"/>
</search-engine-column>
<query-result-processors>
<query-result-processor id-ref="QR Server ID"/>
</query-result-processors>
</search-engine-cluster>
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 35
Install
</search-engine-cluster-set>
Other sections
The configuration file also contains data-source, anchor-service, status-service, and license-services
sections. Do not modify the data-source section. In the anchor-service, status-service, and
license-service sections, provide the fully-qualified name of the administrative node, which is the
node you identified in the Administrative services section, page 30. Do not modify any other values.
<anchor-service>
<anchor-server id="anch1" host-ref="Fully_qualified_host_name">
<property name="dispatcher" value="true"/>
</anchor-server>
</anchor-service>
<status-service>
<status-server id="STATUS1" host-ref="Fully_qualified_host_name">
</status-server>
</status-service>
<license-services>
<license-service id="LCS1" host-ref="Fully_qualified_host_name">
</license-service>
</license-services>
and press Enter, where operatingsystem is the operating system on which you are installing.
4. Select index server installation only.
36 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
The installation program detects the InstallProfile.xml file and asks you to confirm that you
require multinode installation.
5. Click Yes.
The correct index server processes are installed and configured on the host.
6. After the installation is completed, ensure that the index server starts.
• On Windows, select Yes, restart my computer., then click Next.
— If the computer does not restart automatically, click Start > Shutdown > Restart and restart
the computer manually.
— If the index server does not automatically start, click Start > Programs > Administrative
Tools > Services and start the FAST InStream service.
— If the system restarts, the index server starts automatically as a Windows service.
• On UNIX and Linux, run $DOCUMENTUM/fulltext/jboss4.2.0/IndexServer/bin/startup.sh.
The index server is started.
7. Install and configure an index agent as described in Configuring the index agent, page 37.
8. To configure directed routing, perform the steps described in Configuring directed routing,
page 40 on the administrative host.
9. Repeat Step 1 through Step 6 on each additional node.
10. Confirm that the index server processes are running correctly.
• On the administrative node, type dsadmin listmodules at a command prompt.
The running processes, their version numbers, host name, and port numbers are displayed.
• On each nonadministration node, use the nctrl command to verify that the processes are
running correctly:
$ cd $FASTSEARCH/bin
$ ../setupenv.sh
$ nctrl sysstatus
Each running module name is listed, as well as the process name, process ID, and status.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 37
Install
The administrative node is identified in the Administrative services section, page 30 of the
InstallProfile.xml file. It is the host identified by the <configuration-service> element.
2. Ensure that the index server is running.
3. Navigate to the directory %DOCUMENTUM_SHARED\IndexAgents (on Windows) or
$DOCUMENTUM_SHARED/IndexAgents (on UNIX).
4. Use a text editor to create a file named setup.ini with these two lines as its content:
[INDEX_AGENT]
AUTO_START_INDEX_AGENT=false
5. Start the index agent configuration program using the appropriate command for your platform.
• Windows: IndexAgent_Configuration_Program.exe –config setup.ini
• AIX: IndexAgent_Configuration_Program.aix –config setup.ini
• Solaris: IndexAgent_Configuration_Program.bin –config setup.ini
• HP-UX: IndexAgent_Configuration_Program.hp –config setup.ini
• On Linux, IndexAgent_Configuration_Program.linux –config setup.ini
A Welcome dialog box is displayed.
6. Click Next.
7. On Windows, type in the installation owner’s password, then click Next.
8. Type in the port used to communicate with application server for administration purposes,
then click Next.
The default port, 9200, is for the first index agent on the host. The default port for any other index
agents are 9220 for index agent 2, 9240 for index agent 3, and so on.
9. Select the repository for which the index agent will prepare documents, then click Next.
The drop-down list contains the repositories that project to the connection brokers listed in
the dfc.properties file on the host.
10. Type in the user name and password for the Superuser account that the index agent will use to
connect to the repository.
11. Indicate whether to run the index agent in normal mode or migration mode.
12. Type in the host where the index server for this index agent is running and the base port number
for the index server, then click Next.
A summary dialog box is displayed.
13. Click Next.
14. To exit from the configuration program, click Finish.
15. To complete the installation process:
• If you are mapping the file stores, complete the instructions in Modifying the IndexAgent.xml
file to map file stores, page 26.
• If you are using directed routing, complete the instructions in Configuring directed routing,
page 40.
16. On Windows, to start the index agent manually, start the IndexAgent service to start IndexAgent
instance.
38 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
If you have shared or mounted the drives containing the repository’s file stores and installed the
indexing software, the index agent configuration file needs to be manually edited to indicate that the
drives are shared. The changes depend on whether the file system paths to the content are identical
on the Content Server host and index server host.
Using shared or mounted drives improves performance, because the index agent does not need to
copy documents from Content Server to a staging area. See the Documentum System Planning Guide
for more information about sharing file store drives.
4. If the paths to the content files are different, do not modify the value of <all_filestores_local>, but
instead, create a file store map within the <exporter> element.
For example, if Content Server is on a host called Dandelion where filestore_01 is physically
located in the directory /Dandelion/Documentum/data/repository_name/content_storage_01 and
the index agent and index server on a host from which the drive on the Content Server host is
shared as /mappingtoDandelion/repository_name/content_storage_01, create an alias as follows:
<local_filestore_map>
<local_filestore>
<store_name>filestore_01</store_name>
<local_mount>/mappingtoDandelion/
repository_name
/content_storage_01</local_mount>
</local_filestore>
<!-- and so on for each filestore --!>
</local_filestore_map>
If you are indexing content stored on a NAS device or a Windows 2003 Server host, you may see
the following error message in the dmi_queue_item’s message attribute:
DocumentRetriever :ERROR Retrieval error: Couldn’t open file
<file path/name> ERROR Processor error status:
DataNotAvailable Not read permission
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 39
Install
To resolve this error, edit the <local_mount> element or elements in the IndexAgent.xml file that
reference the storage area or areas on the NAS device. Add two backslashes immediately after
the opening <local_mount> element. For example, assume the following references a storage
are on an NAS device:
<local_mount>\\100.2.4.32\share3\c\data_for_example
\content_storage_1</local_mount>
After editing, it is now:
<local_mount>\\\\100.2.4.32\share3\c\data_for_example
\content_storage_1</local_mount>
40 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
All documents submitted for indexing are assigned to a collection, which is a logical set of data to
which the index server applies the same indexing rules. In a basic implementation, all documents
are typically assigned to a single default collection that is created automatically as part of the index
server installation process.
With directed routing, you want the index server to treat documents differently depending on which
file stores contain their content files — specifically, you want the index server to route them to
different nodes for inclusion in different columns. To enable the index server to route documents to
different nodes, you need to assign the documents to different collections.
The first step is to create additional collections. You create one collection for each node.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 41
Install
Once you have created the necessary full-text collections, you associate them with file stores in the
indexagent.xml configuration file, which resides on the index agent host machine.
42 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
<partition>
<storage_name>name_of_file_store</storage_name>
<collection_name>name_of_collection</collection_name>
</partition>
</partition_config>
The name_of_default_collection needs to match the value of the <fds_collection> element that
appears later in the indexagent.xml file. It is the name of the collection created during installation.
Include a <partition> element for each file store whose documents you want to assign to a specific
collection. The name_of_file_store needs to match the file store name from the repository, and
the name_of_collection needs to match the collection name you specified when you created the
collection in Creating full-text collections, page 41.
For example, the <partition_config> element below assigns documents from filestore_01,
filestore_02, filestore_03, and filestore_04 to three collections (repb01, repb02, and repb03).
Documents from other file stores are assigned to repb01, which is designated as the default
partition. Notice also that filestore_03 and filestore_04 are assigned to the same collection.
<partition_config>
<default_partition>
<collection_name>repb01</collection_name>
</default_partition>
<partition>
<storage_name>filestore_01</storage_name>
<collection_name>repb01</collection_name>
</partition>
<partition>
<storage_name>filestore_02</storage_name>
<collection_name>repb02</collection_name>
</partition>
<partition>
<storage_name>filestore_03</storage_name>
<collection_name>repb03</collection_name>
</partition>
<partition>
<storage_name>filestore_04</storage_name>
<collection_name>repb03</collection_name>
</partition>
</partition_config>
</indexer>
When using directed routing, the index agent assigns documents to collections based on which file
store contains their content files. The index server needs to route the documents to different nodes
based on which collection they belong to. To configure the index server to do this, you create a routing
configuration file (routing.cfg) and update the Status Server configuration file (NodeConf.xml) so
that it refers to the routing configuration file.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 43
Install
44 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
Adding a node
Use these instructions to add an additional node to an existing multinode configuration.
d. Within the <search_clusters><cluster> element, add a new <column> element for the new
node. The <column> element has this format:
<column host="fully_qualified_host_name"
port="base_port_number" mode="NORMAL"
partition_id="partition_number"
ft_mode="0" docapiport="15500">
The partition_number should be the integer specified as part of the <search-engine-column>
ID in the Webcluster section, page 34 of the IndexProfile.xml file; if the IndexProfile.xml file
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 45
Install
The port is the index server’s base port number plus the constant 3099. If the index server
is using the default base port 13000, then the port value is 16099. The first integer after
’webcluster’ is the partition_id of the column hosted on the node.
f. Rebuild the index by running the FIXML feeder on each node. Enter this command:
cobra fixmlfeeder.py -i path_to_temp_dir
The path_to_temp_dir is the path to the directory where you backed up the FIXML files
at step a.
g. When the feeding is complete, use the FAST InStream administration tool to reset all
collections to use the DFTXML (webcluster) pipeline (see step d above).
h. Resume indexing on each node. Enter this command to resume indexing:
rtsadmin localhost port webcluster
partition_id 0 resetindex
9. Restart the index server on each node and restart the index agent.
46 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Install
Removing a node
Use these instructions to remove a node from an existing multinode configuration.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 47
Install
48 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Chapter 3
Upgrade
Overview
This chapter provides the instructions for upgrading the full-text indexing components.
Depending on the version from which you are upgrading, you may need to reindex:
• If you are upgrading from 5.3 SP6, 6.0, or 6.0 SP1, you must reindex.
• If you are upgrading from 6.5 or later, you are not required to reindex.
In a consolidated deployment, a single index server provides indexing services to multiple
repositories. In any indexing configuration, the indexing software and Content Servers must have the
same version number. Therefore, to upgrade a consolidated deployment, the indexing software and
all repositories must be upgraded simultaneously.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 49
Upgrade
4. Use the Index Agent Configuration program to remove your existing Index Agent and configure
a new one in the new mode that you want (Migration to Normal mode or vice versa).
Caution: Do not use this procedure if the deployment is a multinode deployment. Contact
Professional Services (or qualified third-party integrators) for instructions on upgrading a
multinode deployment.
50 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Upgrade
Caution: Do not use this procedure if the deployment is a multinode deployment. Contact
Professional Services (or qualified third-party integrators) for instructions on upgrading a
multinode deployment.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 51
Upgrade
52 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Chapter 4
Uninstall
Uninstall order
Use a particular order to uninstall Content Server, a repository, the index agent, and the index server.
To uninstall an index agent, the repository its servers need to be running. To uninstall an index server,
the repository needs to be shut down. If the index server is on the Content Server host, additional
issues arise because of shared libraries in the software installations.
To uninstall a multinode configuration, stop all running processes on all hosts.
Uninstall the software components in this order:
1. Shut down and uninstall the index agent.
2. Shut down the repository.
3. Shut down and uninstall the index server.
4. Delete the repository, if required.
5. Uninstall the Content Server software, if required.
6. Uninstall the Index Agent Configuration Program, if required.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 53
Uninstall
Use the instructions in the Full-text Indexing Administration Guide, “Starting and stopping
the Index Agent” topic, or in Documentum Administrator online help, depending on which
repository version the index agent is running against.
3. Start the Index Agent Configuration Program.
• On Windows, click Start > Programs > Documentum > Index Agent Configuration Program.
• On UNIX and Linux, navigate to $DOCUMENTUM_SHARED/IndexAgents and start the
configuration program for your operating system:
— On AIX, IndexAgent_Configuration_Program.aix
— On Solaris, IndexAgent_Configuration_Program.bin
— On HP-UX, IndexAgent_Configuration_Program.hp
— On Linux, IndexAgent_Configuration_Program.linux
A Welcome dialog box is displayed.
4. Click Next.
5. Select Delete index agent and click Next.
6. Read the information and click Next.
The index agent is deleted.
7. To run the configuration program again, check the check box.
8. Click Next.
9. If you checked the check box to run the configuration program again, skip to Step 5; otherwise,
the program exits.
The index agent software and configuration program are still on the host.
54 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Uninstall
6. Click Finish.
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 55
Uninstall
56 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Index
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 57
Index
H new repositories, 15
high-availability deployments
installing, 16 L
host names, 9 log files, 27
host requirements, 10
HP-UX directory constraint, 20
HP-UX parameter values required to N
install, 18 network configuration, 10
I O
index agent object types, disabling indexing of, 21
configuring, 24, 37
DFC_DATA environment variable
(deprecated), 12
P
installing configuration program, 17 permissions, checking against
required ports, 9 repository, 24
uninstalling, 53 ports
uninstalling components, 53 index agent, 9
index agent configuration program index server, 9
deleting, 54 preinstallation requirements
index routing environment variables, 11 to 12
index server configuration full-text indexing, 9
requirements, 43 host names, 9
index server HP-UX parameter values, 18
antivirus software, 10 index agent, 9
constraints on Windows hosts, 10 index server, 9
deleting, 55 operating system and host, 9
disk partition requirement, 10 upgrading, 49
host time zone settings, 10 user accounts, 11
installing, 17
Java installation, 9 Q
query retries, configuring, 23
qtf-config.xnl file, modifying, 21
required ports, 9
query processing
indexable formats, 22 to 23
batches, configuring use of, 24
indexagent.xml file, 26, 39
query results
install directory constraint for HP-UX, 20
acl_check_db parameter, 24
installation
query retries, configuring, 23
basic deployments, 15
querying
consolidated deployments, 15
configuring permission checking, 24
constraints on Windows hosts, 10
synonym searches, enabling, 21
high-availability deployments, 16
index agent configuration program, 17
index server, 17 R
log files, 27 repositories
upgrading 5.3 full-text system, 51 full-text indexing installation, 15
VMware constraint, 10
installation accounts, 11
installation logs, 27
S
installation order synonym searches, enabling, 21
58 EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide
Index
U
uninstalling full-text indexing software, 53
EMC Documentum Content Server Version 6.6 Full-text Indexing Deployment Guide 59