Professional Documents
Culture Documents
SAP® Solutions
OpenText™ Business Center Capture for
SAP® Solutions
Customizing Guide
®
OpenText™ Invoice Capture Center for SAP Solutions
®
OpenText™ Business Center Capture for SAP Solutions
Customizing Guide
Rev.: V7.5 SP4 / V16 (2016-03-21)
Open Text SA
40 Avenue Monterey, Luxembourg, Luxembourg L-2163
Tel: 35 2 264566 1
Open Text Corporation
275 Tompa Drive, Waterloo, Ontario, Canada, N2L 0A1
Tel: +1-519-888-7111
Toll-Free Canada/USA: +1-800-499-6544, International: +800-4996-5440
Fax: +1-519-888-0677
Support: http://support.opentext.com
For more information, visit: http://www.opentext.com
Copyright 2016 Open Text SA and/or Open Text ULC. All Rights Reserved.
Open Text is a trademark or registered trademark of Open Text SA and/or Open Text ULC. The list of trademarks is not exhaustive of
other trademarks, registered trademarks, product names, company names, brands and service names mentioned herein are property of
Open Text SA and/or Open Text ULC or other respective owners.
RecoStar is a registered trademark of Open Text Document Technologies GmbH
ACKNOWLEDGEMENTS
Portions copyright 2003-2015 AllMyPapers.
Portions copyright 2015 Artifex Software Inc.
This software is based in part on the work of the Independent JPEG group.
Portions copyright 2001 URW++. All rights reserved.
Portions copyright 2015 Bejing Wintone Science & Technology Corporation Ltd.
Portions copyright 2015 Tulpep.
Portions copyright 2015 Newtonsoft
Portions ([incr Tcl]) copyright The Regents of the University of California
Portions ([incr Tcl]) copyright Sun Microsystems, Inc.
Portions ([incr Tcl]) copyright Lucent Technologies, Inc.
This work uses Boost C++ libraries.
SAP®, R/3® and SAP CRM® are registered trademarks of SAP AG.
Microsoft®, Microsoft SQL®, and Microsoft SharePoint® are either registered trademarks or trademarks of Microsoft Corporation in the United States
and/or other countries.
Artifex®, the Artifex logo, Ghostscript®, and the Ghostscript logo are registered trademarks of Artifex Software Inc.
PostScript® is a trademark of Adobe Systems Incorporated.
PCL® is a trademark of Hewlett Packard Company.
All other products or company names are used for identification purposes only, and are trademarks of their respective owners.
Disclaimer
No Warranties and Limitation of Liability
Every effort has been made to ensure the accuracy of the features and techniques presented in this publication. However, Open Text
Corporation and its affiliates accept no responsibility and offer no warranty whether expressed or implied, for the accuracy of this
publication.
Contents
1 Introduction 1
1.1 Functional overview .................................................................................................... 1
1.2 ICC/BCC runtime scenarios ........................................................................................ 3
1.3 Components of ICC/BCC ............................................................................................ 3
1.4 Customizing................................................................................................................. 4
1.5 Hot Spots ..................................................................................................................... 5
1.6 Achieving optimum recognition results....................................................................... 6
7 Index 213
ICC/BCC Recognition extracts the essential data (for example date, amount, order number etc. for invoices) from the
scanned documents in the archive.
For recognition of invoices, it uses a knowledge base that has been trained on samples of thousands of invoices from
many countries around the world. In addition, it uses fuzzy database matching: ICC/BCC Recognition maps the read data
– such as names, addresses, vendor numbers, purchase order numbers etc. – to SAP ERP vendor master data and SAP
ERP purchase order data during analysis.
The extracted information is passed on to VIM, or BC, respectively.
During application development, you work mainly with the Application view.
Note that the Application tab does not provide view buttons. There is only one view for the Application tab. Selecting
the Application tab automatically opens the respective view.
The views and tools of the General tab are needed for the technical configuration and for monitoring of an ICC/BCC
production system. You might need these facilities during testing of your applications.
All tools of the Application tab work on the currently opened application whereas the General tab permits to show
global data of the Recognition Server – independently of the opened application.
The tools and views of the General tab and the Validation tools of the Application tab are not discussed in detail in
this guide. For more information about monitoring and technical configuration, see ICC/BCC - Administrator`s Guide;
for more information about Validation, see ICC/BCC - User`s Guide. This guide concentrates on application
development.
New Application
Creates a new application. A wizard starts. Follow the instructions of the wizard and click Create.
Application creation takes some seconds. When the application has been created, click Close.
Note: Normalization for invoice applications has been modified incompatibly with ICC 6.0. If you open an
invoice application created with ICC 5.2 that uses scripting or custom normalization, you are asked whether the
application should be migrated. If you answer with No, the application is not upgraded and the documentation of
ICC 5.2 is still valid with regard to normalization for this application. If you answer with Yes, you have to adapt
your scripting code. Applications that use neither scripting nor custom normalization are always upgraded.
Note: Normalization for invoice applications has been modified incompatibly with ICC 6.0. If you open an
invoice application created with ICC 5.2 that uses scripting or custom normalization, you are asked whether the
application should be migrated. If you answer with No, the application is not upgraded and the documentation of
ICC 5.2 is still valid with regard to normalization for this application. If you answer with Yes, you have to adapt
your scripting code. Applications that use neither scripting nor custom normalization are always upgraded.
To close the current application, click Close Application on the application menu, and then select the appropriate
application on the submenu.
2. Select a system from the System drop-down list and enter client, user name, and password.
The Language field does not take effect with this release; you can leave this field empty.
3. Optionally click Check Login to check whether the login data are valid for the specified SAP system.
4. Click OK.
Depending on the system configuration, you may be able to use SSO ticket handling.
To log in using SSO ticket handling:
1. Select the SSO ticket handling check box.
The dialog box changes and shows a Ticket issuer url box.
2. Enter client and ticket issuer url and then click OK.
Tip: You can use this command to export only the learning data to a file by clearing all check boxes except the
Learning data check box. For a BCC application, you can export only feedback data by clearing all check boxes
except the CSA data check box.
The available check boxes depend on the data contained in the selected archive file, that is, a check box is
available if the corresponding check box was selected in the Select Application dialog box during export. If
you want to import an application and the Settings check box is not available, you have selected an unsuitable
archive file; in this case terminate import by clicking Cancel and try again with a suitable archive file.
4. If available data should not be imported, clear the respective check boxes.
5. Click OK to start import.
6. If one of the applications on the Recognition Server has the same name as the application in the archive file, a
message is shown. In this case you can either cancel the import, or overwrite the application on the Recognition
Server with the application from the archive file.
When you import an application from file, up to three Hot Spots are created. These Hot Spots have to be configured for
the current Recognition Server. For more information about Hot Spots, see "Hot Spots (SAP links)" on page 16.
If a new application is created during import, databases for learning data are created; for an invoice application,
additionally databases for download data are created. If no learning data or no download data are available or if you
cleared the corresponding check box, the respective database is empty. If no download data are imported for an invoice
application, ensure that the vendor master data and purchase order data database are automatically filled with the correct
download values after configuration and activation of the SAP Download Link Hot Spots.
Usually you fill the learning databases by performing new training. If you want to use the learning databases of the
source system, you can create backups of the learning databases on the source system and restore them on the current
system. For more information, see “Backups of the Database” in the ICC/BCC - Administrator’s Guide.
Tip: The Import Application command is also used to import learning data for an existing application or to
import feedback data for a BCC application. Either you import from an archive that contains only the respective
type of data, or you clear the other check boxes if they are present so that only either the Learning data, or the
CSA data check box remains selected. If you clear the Settings check box or if it is not present, you can select
the target application by clicking the desired application in the application list at the top of the dialog box.
Important
The application is deleted locally on the Recognition Server. It is not deleted in SAP ERP. For information
about deleting an invoice application in SAP ERP, see “Deleting an ICC application from SAP tables” in
OpenText Vendor Invoice Management for SAP Solutions - Administration Guide.
Important
For all SAP ERP downloads, the data size is restricted to 5 GB if you work with VIM 5.2 SP3 or higher. In this
case, a download is terminated as soon as 5 GB data have been downloaded.
After creation of the application, all Hot Spots are disabled. After configuring the Hot Spots, start those Hot Spots that
are used by your application.
All Hot Spots are created with a default scheduling behavior. Each Hot Spot checks its input channel regularly for new
data according to its scheduling configuration. For each Hot Spot you can configure separately whether it should check
its input channel once a day at a specified time, or regularly after waiting a specified interval.
All types of SAP Hot Spots fetch their data directly from SAP ERP using a specified SAP ERP user account. For each
Hot Spot you can configure the SAP ERP user account separately. By default, the SAP ERP configuration of the Hot
Spots is set to the SAP ERP user account data that the user has entered during application creation.
Configuration of a SAP Extraction Link and a SAP Feedback Link comprises the archive document types processed by
the application.
The configuration of a SAP Download Link provides additional settings specifying the download data.
If you later modify archive document types using the Settings dialog box, or if SAP ERP login data change, you must
adapt the Hot Spot configurations manually.
Caution
Within ICC/BCC, Hot Spots are always created and deleted automatically together with the corresponding
application. You should not create Hot Spots manually, and you should never delete a Hot Spot of an
application manually.
2.3.1.1 Scheduling
For all Hot Spot types, you can configure the polling cycle in the Scheduling area of the Properties dialog box. You
have the following options to specify the days:
All days: Select to poll every day.
Workdays: Select to poll only on workdays (Monday through Friday).
Custom: Select and specify the days.
You have the following options to specify the time:
Every minute: Specify the period in minutes.
Every hour: Specify the period in hours.
Fixed time: Specify a time. The Hot Spot is polled only once a day at the specified time.
Caution
Do not use daily full download on SAP ERP side, and daily delta download on ICC side. This would lead to
fast growth of the database on the ICC/BCC side with many duplicate entries.
Tip. To determine how long a download takes, start the Trace Viewer, go to the DOKuStar Load Manager
section of the trace file list and look into traces files whose name starts with DOKuStarClusterNode. There
trace records permit to determine how long the three steps of a download took: downloading the data, writing the
data to the database, and updating the vendor index file.
Note: The scripting code is executed even if you have not compiled it. Therefore always make sure that the
scripting code is correct and executable when you close the Settings dialog box.
2.4.3.4 IScriptHelper
An IScriptHelper object is passed as parameter to each Validation scripting method. You can use this object to set the
status of a field to error and to show a message in the status line of Validation.
2.4.3.5 Tracing
An ITrace object is passed as parameter to each scripting method. You can use this object to write trace messages to the
ICC/BCC trace folder.
For more information about tracing, see ICC/BCC - Administrator's Guide.
2.4.3.6 Cultures
Recognition and Validation use an internal format for dates, amounts, and numbers:
dates: de-DE
amounts and numbers: en-US with decimal separator (.), but without thousands separator (,)
Be sure to use these internal formats in the scripting entry points.
If your application uses a custom field with amount or date values, make sure that the field values are also in these
internal formats if standard rules at the action point Export SAP are used.
If an application uses custom rules for the standard amount and date fields, the rules should also convert the field values
to these internal formats.
Important
When the Validation Client user changes the user interface language, the culture of the Validation Client
program changes. This may cause some .NET methods to behave differently. Therefore take care that your
scripting code for Validation entry points is culture-independent. In addition, you should test your code with
different language settings in the Validation Client.
Note: If the user selects an entry in a Lookup List control, the scripting method is called for each data field
belonging to the Lookup List.
Note: The scripting method will not be called for data fields belonging to a Lookup List because these fields never
have the focus and never lose the focus.
//Parameters:
//DataPool data
//ITrace trace
…
}
}
Note: Modifications of the Document Extraction project do not take effect until you close the Settings dialog box
with OK afterward. It is not sufficient to save the project in the Document Extraction Design Studio.
Error Description
Wrong field To specify a field at a wrong logical position is a severe error during training,
because this yields an extraction error each time the sample is used for extraction.
In this case Adaptive Recognition always extracts the field at a wrong position.
Field not present on the Some fields may not be present on all documents of a specific layout. If a
training document candidate does not contain such a field, it is not suitable for training. Wait for a
better candidate that contains all fields where Adaptive Recognition is used.
Wrong logical field size Sometimes the tooltip contains an additional character or word that is printed
closely to the field value. For example a VAT rate could contain a following
percent sign, or a value could include a following comma. In this case, hold
CTRL down while you draw a rectangle around the field value so that the
captured value only contains the desired characters.
On the left side of the Adaptive dialog box you find a tree structure presenting the ART and ACT repositories
of the current application.
2. Select the respective tree node to display the corresponding repository on the right side of the Adaptive dialog
box.
Adaptive Recognition is used for the following:
For some optional fields (Supply Date, Freight Amount, or Handling Charges). These fields are not used
by default. They have to be activated manually (see "Activating optional fields" on page 107).
For any of the standard fields of applications with country setting China (Traditional), Japan, Korea, and
Thailand.
Adaptive Recognition can be used for custom fields (see "Adaptive Recognition" on page 28).
Adaptive Recognition can be activated for some standard invoice fields (see "Using Adaptive Recognition for
standard invoice fields" on page 134).
Adaptive Recognition fields have to be trained (see "Training of Adaptive Recognition fields" on page 31). Learning
data are kept in different repositories within a database. There is a single Adaptive Recognition (ART) repository for all
fields of an application that use the Adaptive Recognition automation method.
If you do not use Adaptive Recognition fields in an application, the ART repository of the application is empty.
Adaptive Classification can also be used for custom fields (see "Adaptive Classification (invoice applications)" on page
30). Adaptive Classification is not used by default. Adaptive Classification fields have to be trained (see "Training of
Adaptive Classification fields" on page 33). Learning data are kept in different repositories within a database. There is a
separate Adaptive Classification repository (ACT) for each field of an application that uses the Adaptive Classification
automation method. If you do not use Adaptive Classification fields in an application, there will not be ACT repositories
to administer.
For each sample (trained document) of Adaptive Recognition, the trained field values are shown in addition to
administrative data.
Usage Count
Indicates how often the sample has been used.
If you click Reset usage counters ( ) in the toolbar, last usage date and usage count of all samples are being
reset.
State
The state of a sample, indicated by the State column, affects recognition and training.
A sample in state confirmed is never removed automatically.
State unconfirmed is the default state for newly trained samples. A sample in state unconfirmed may
be removed during training of a new sample if it has been trained more than 30 days ago, its usage count is
zero, and repository size has reached its maximum size of 2000 samples.
A sample in state excluded is not used for data extraction. In addition, it is not removed automatically.
Newly trained samples always have state unconfirmed. The first three buttons of the toolbar permit to
assign a different state to selected samples. You can use this to prevent removal of samples by setting their
state to confirmed after you have checked that they are useful and have been trained correctly.
It is recommended to let the state unconfirmed unchanged for all samples. Thus samples that are no longer
needed can be removed automatically during training if the repository has reached its maximum size. If you
want to change the state of a sample to confirmed, you should be sure that the sample has been trained
correctly and that it will not become obsolete in the future.
Note: Take notice that samples that have been trained with ICC 6.0 or earlier, show state confirmed.
When you select samples, the context menu and corresponding toolbar buttons permit to delete the selected samples from
the repository, or to save the images to a folder.
2. Click Show samples on the context menu or the corresponding toolbar button ( ).
3. If you selected several samples, you can use the arrow buttons to browse through the samples.
4. On the left side, the trained field values are shown. When you select a field result, and the respective box
coordinates are available, the result is highlighted on the document. When you position the pointer on a field,
additional data, such as state and confidence value are shown.
For all samples in the repository it shows, amongst others, the identification number, which is used to identify the
sample, the date when it was added to the repository, the file name of the image file used for training, and the class.
Each possible field value of an ACT field is a class. When you select a field value for an ACT field in the Validation
Client or in the Adaptive dialog box, you classify the invoice.
The context menu of an entry permits to open or remove a sample, or to change its document class (that is, the field
value of the ACT field) directly:
Delete
Removes the sample from the repository. Use this command to delete incorrectly trained samples.
Change Class
Permits to change the document class of the selected samples directly. A Change Class dialog box opens
where you can select the new document class. Use this command to correct incorrectly trained samples.
The toolbar of the view permits to refresh data, and to start training.
For each document of the repository, the Analysis tab displays, how this document would have been classified if it
were not trained. Thus this view shows how Adaptive Classification would classify unknown documents using the
documents in the repository. Thus, if you have a set of documents, there is no need to divide it into a set of training
candidates and a second set you use to test classification afterwards. You can train all your documents and still you can
then test classification and get a detailed analysis about the quality of classification for the different classes.
Test classification may yield different results: For example, a document may not be classified at all if there are no similar
documents in the repository. It may be rejected if its confidence value is below a threshold, or it may be classified to its
trained class (correct) or to another class. These different states are represented by traffic light colors:
Green: Documents that have been classified to their trained class (correct)
Yellow: Documents that have been rejected or could not be classified at all (reject)
Red: Documents that have been classified to another class (incorrect)
Note: If only a single document has been trained for a class, it is necessarily classified to a different class in this
test classification. But you should train additional samples for such a class anyway.
The Analysis tab shows the classification results of all documents in two ways:
The bar on the right visualizes the state of all documents in the repository.
In the center, the states are displayed in a map per result class. The result classes are sorted by the number of
documents assigned to the respective class. The width of a column indicates the number of documents.
Thus every area unit in the map represents the same number of documents. Thus it provides a view to the contents of the
repository.
You have the following options:
The map can be zoomed, if necessary.
When you move the pointer over the map or the bar, a tooltip appears describing the details of the area.
Now horizontal lines appear in the reject and incorrect areas, separating areas of documents rejected or
misclassified to another class. In green yellow, you can also see the documents that are correct, but rejected.
The documents in these areas can be selected in the same way as described above.
Refresh
The first button of this group starts a refresh of the data shown in the view. Use this button if you modified the
repository.
The second button permits to restrict the analysis to the last trained samples. When you click the button, a dialog box
opens where you can specify a date. When you confirm the dialog box with OK, the view is refreshed with the samples
that were trained after the specified date. If new samples are trained, after your application has already been released and
is running in production, you can see how the new samples fit the classifier. If you select all substitutions, you can see
only the problematic samples added since the last released version of the repository.
Zooming
With these buttons you can enlarge or reduce the representation shown in the view horizontally or vertically. As soon as
you enlarge, a corresponding scroll bar is displayed, which permits to scroll to parts of the view that are not visible. With
the last button you can return to the default representation with a single click.
Mode selection
The first button shows the distribution of rejects and incorrect samples. If the button is not pressed, all rejects and
substitutions for a class are shown as a single area. If the button is pressed, the yellow areas tell not only the rate of
documents that have been classified to the wrong class, but which classes were substituted how often. The green-yellow
areas show, how many samples were classified correctly, but were rejected as unreliable.
The second button toggles between two different representations of the classification results in the Analysis tab. The
view either shows classification rates per trained class, or classification rates per result class.
The first representation indicates the quality of the different trained classes. The second representation shows you how
reliable the result is if a specific class has been found.
Selecting
The first two buttons of this group permit to select all documents that have been classified to the wrong class during
analysis, or the documents that could not be classified at all, respectively. The last button removes all previous
selections.
Color
This button toggles between color and black-and-white representation of the view.
Edit Samples
Opens all selected samples, that is, samples of the areas shown hatched in the view, on a Training tab so that
they can be edited.
Show Bars
The Analysis tab indicates the number of samples trained for a class through the size of the respective area.
By default the bars for the different classes all have full height but different width according to the number of
samples. If this command is marked, all bars are shown with same width but different height.
Save Report
Saves the result statistics of the data currently shown in the view to a text file. All data items in the files are
separated by tabs to make it easy to import the data into other programs. The following example shows a
report file:
Important
Normalization has been modified incompatibly with ICC 6.0. If you open an invoice application created with
ICC 5.2 that uses scripting or custom normalization, you are asked whether the application should be migrated.
If you answer with No, the application is not upgraded and the documentation of ICC 5.2 is still valid with
regard to normalization for this application. If you answer with Yes, you must adapt your scripting code.
Applications that use neither scripting nor custom normalization are always upgraded.
1) The scripting entry point Custom Extraction, the component ICC Processing, and the Action Point Export
Validation are only present for invoice applications and applications of type Custom documents. For applications
of type Custom and for Solution Accelerator applications, these items are not present.
After data extraction, the Extraction action point can be used to specify custom rules that take effect before
the Custom Extraction scripting entry point and ICC/BCC Processing. Then, two scenarios have to be
distinguished.
After the Data Extraction scripting entry point, the Extraction step provides data for export to SAP ERP in case
that the document is not sent to Validation. Here, at an action point Export SAP, default normalization rules
for standard fields are applied, which convert the standard fields to the format expected by SAP ERP.
For export to Validation, separate rules can be specified at the Export Validation action point.
In Validation there is an action point Display, which converts the internal language-independent format into
the format that should be displayed. At a second action point Manual Input data captured by the Validation
operator are checked and normalized. Before the data are exported to SAP ERP in Validation, again the
normalization rules of the action point Export SAP are applied.
In the project tree of the Settings dialog box, all normalization rules can be found under a Rules node; at an invoice
application, it is a subnode of the Fields node. The Rules node has a subnode for each action point:
At the field definition nodes, a Rules subnode with the same structure is shown. But here only those rules are
shown that are attached to the respective field. The following example shows the InvoiceDate field node
where you can again see that a custom rule has been attached at the Display action point, which replaces the
standard rule:
Select the rules that should be exported, and then click OK.
3. In the file dialog box select a folder, enter a file name, and then click Save.
The selected rules are written to an XML file with the extension .rules.
2.8.8.7 Checksum
This action checks the check sum of the field value according to the specified check sum algorithm. If the check sum is
not correct, the field state is set to Error. Three different check sum algorithms are available:
modulo10_FI
modulo10_NO_SE
modulo11_NO_SE
To modify the action list, right-click an item in the action column; if you just created a new sequence, use the empty first
line that is shown. The following context menu appears:
Note: If you click a rule that is a subnode of a Predefined Rules node, its action list is shown, but the context
menu is not available because the default action sequences cannot be modified.
If your invoice application does not need to use the new extended customizing, you do not need to migrate the
application. In this case, the application runs in legacy mode and normalization behaves as in ICC 5.2. If you open such
an application in the Customizing Client, answer with No when you are asked whether the application should be
migrated.
If you want to use the new extended customizing features or if you want to write country-independent scripting code,
your application must be migrated. If your application uses scripting or if you modified normalization using the
Extended Customizing dialog box, you must check and adapt your application. The following table gives an
overview how you have to proceed depending on your application:
When an application is loaded, ICC/BCC performs some checks and decides how to proceed. Three cases have to be
distinguished:
If the application has been created with ICC 6.0 SP1 or later, nothing has to be done.
If the application has been created with an earlier version, but it contains neither scripting nor extended
customizing (that is, no modifications were made in the Extended Customizing dialog box), automatic
migration is performed.
If the application has been created with an earlier version, and contains scripting in Validation or extended
customizing, the application runs in legacy mode, that is, normalization behaves as in version 5.2. If such an
application is opened in the Customizing Client, you are asked whether the application should be migrated.
Before you answer with Yes, take care to keep a copy of your application for later references.
With this in mind, check your Validation scripting and adapt it, if necessary. In Validation, field values are now
formatted for display and are controlled after manual input by extended customizing.
For countries of the second list the application is preconfigured with regard to character recognition, culture settings
controlling data conversion and formatting, and VAT rates are specified. For some of these countries, data extraction
also yields good results if the respective country uses the same keywords and phrases as one of the countries of the first
list. For some countries, additional customizing is needed to achieve high recognition rates. For more information, see
"Invoice application for new country" on page 78. The second country list contains the following countries:
Select the required countries and click Add until the list on the right side contains all required receiving countries.
On this page you specify the vendor languages, that is, the languages occurring on incoming documents, and
corresponding data formats. Select the required items in the list on the left side and click Add until the list on the right
side contains all needed languages.
Some countries have a special date format. For example in the United States a date starts with the month whereas in
other countries with language English, the date starts with the day. Therefore some items in the list show a country in
parentheses.
Due to possible ambiguities, order in which formats are checked can be important. Therefore you can change the order of
the items in the list on the right side. To move an item, select it and then click Move up or Move down.
To add an archive document type, click Add, type the name of an existing SAP ArchiveLink document type in the
appearing dialog box, and confirm with OK.
The archive document types must be the same as in VIM.
For more information about configuring ArchiveLink document types in SAP ERP, see "Configuring ArchiveLink" in
OpenText Vendor Invoice Management for SAP Solutions - Configuration Guide.
The connection to SAP ERP is established by Hot Spots (for more information, see "Hot Spots (SAP links" on page 16).
The specified archive document type list is used to pre-configure the SAP Extraction Link Hot Spot. The configuration
of the Hot Spots can be changed later (see "Configuring Hot Spots" on page 17).
Note: You can modify the archive document types later using the Settings dialog box (for more information, see
"Configuring an invoice application" on page 82).
Three different methods are available to determine the company code. You can use:
A single, fixed company code for all documents of the current application
A separate company code for each archive document type
Automatic company code detection
To be able to use automatic company code detection, you need to prepare a database in text file format (see "Recipient
data" on page 136), which contains the address data of the recipients together with their company codes and SAP ERP
target systems.
To use a fixed company code:
1. In the Company code detection list, click Use a fixed company code.
2. Enter the company code and the name of the target SAP ERP system all documents shall be transferred to.
To use different company codes for the different archive document types:
1. In the Company code detection list, click Use a company code per archive document type. The
dialog box shows a list of all archive document types that you specified on the Archive document types
page.
2. For each shown archive document type, select the first archive document type, click Edit, enter the company
code and target SAP ERP system name for the document type, and confirm with OK.
To use automatic detection:
1. In the Company code detection list, click Automatic company code detection.
2. Enter a company code and target SAP ERP system name, which is used if the recipient of a document cannot be
detected.
3. Click Import recipients, select the prepared database file containing recipient data with the corresponding
company codes and target SAP ERP system names in the appearing file dialog box and confirm with Open.
Tip: If you process invoices for one company code only, choose Use a fixed company code.
Note: You can modify company code detection later using the Settings dialog box. For more information, see
"Configuring an invoice application" on page 82. The company code field is shown in Validation only if automatic
company code detection has been specified. This setting cannot be modified, that is, selecting or clearing the
Visible check box on the Presentation tab in the Settings dialog box does not take effect for the company code
field.
Click Check login to test whether you have specified all needed values correctly.
Tip: If a saplogon.ini file is present, you can select a suitable SAP ERP system in the System list of the
Connection to SAP page before you click Advanced. The SAP Connection dialog box then shows the
parameters of the selected SAP ERP system, and you can modify values instead of typing everything in.
When you have entered all values correctly, type a suitable name in the System name box and then click Apply to
close the dialog box. The new SAP ERP connection appears in the System list.
The new connection is used for the Hot Spot connections of the current application but its data are not stored
separately. If you switch to another connection and close the Customizing Client, the connection specified in the
SAP Connection dialog box no longer appears in the System list afterward and the corresponding data get lost.
Click Next to proceed to the next page.
Note: You can modify the customizing settings using the Settings dialog box. For more information, see
"Configuring an Invoice Application" on page 82.
Note: The values of these fields are not checked, neither after Recognition nor after Validation.
If you use a different syntax or length for purchase order numbers, you have to change PO number settings. Mark Use
Regular Expression, and then click Edit in order to enter a regular expression:
For more information about regular expressions, see "Using regular expression" on page 91.
The official Microsoft homepage for regular expressions is: http://msdn.microsoft.com/en-us/library/hs600312.aspx.
As long as the Use Regular Expression check box is selected, the standard format and previously specified number
ranges are not checked. Therefore the number ranges are not shown and you cannot add or remove a number range in
this case.
If the specified regular expression accepts PO numbers with less than 10 digits, you must change to Manual Input rule.
It must accept PO numbers with leading zeroes, and it must add leading zeroes to PO numbers with less than 10 digits in
order to return only 10-digit numbers.
In the example code, fixed strings are assigned for the two fields. Alternatively, the application could use barcodes. In
this case add a custom field of type BarCode and use scripting to extract system and company code from the result of
this field.
The fields must not be empty. Therefore take care to provide a default value if necessary. Also take care to assign a value
to both fields; setting only one of the fields does not take effect.
The value of the CompanyCode field must comprise four digits. The value of the System field is a string.
The following example code shows the scripting code needed to preset a value for the field InvoiceVendorNumber:
The example code assigns a fixed value. In a real application, a value would be supplied in a variable by preceding code.
A vendor ID supplied by scripting is still be checked against downloaded vendor master data. If the supplied vendor ID
is not present in the master data, it does not take effect and the InvoiceVendorNumber field remains empty.
The following example shows scripting code that could be used to preset a value for the field ListPO:
The example code assigns the value of a field PreListPO, which could have been extracted using a custom automation
method. The field in the document PresetInvoice must have the name PreListPO; it may contain additional POs
as alternatives. If the field PreListPO exists in the document PresetInvoice, its content is used in ICC
exclusively. This is also true if PreListPO is empty.
The following example shows scripting code that could be used to preset a value for the field ListDN:
The example code assigns the value of a field PreListDN, which could have been extracted using a custom automation
method. The field in the document PresetInvoice must have the name PreListDN; it may contain additional DNs
as alternatives. If the field PreListDN exists in the document PresetInvoice, its content is used in ICC
exclusively. This is also true if PreListDN is empty.
Note: The Settings button is only available if an application is open. The current application is shown in the title
bar of the Customizing Client. To open an application, click Open Application on the application menu, and
then click the appropriate application.
On the left side, the Settings dialog box shows all configurable items of the current application in a tree view. To
modify an item, you open the respective nodes and click the desired item. The right side of the dialog box then shows the
properties of the selected item.
At the uppermost level, four nodes are shown:
The General node permits to modify name and description of the application at its Name subnode.
The Configuration node permits to specify archive document types and to control training in the Validation
Client.
The Fields node permits to specify all settings controlling data extraction.
The Advanced node permits to enter scripting code, and to control which pages should be excluded from data
extraction.
Note: For applications created with ICC 5.2 Service Pack 4 or earlier, changing the application name in ICC only
changes the field description in SAP ERP, whereas the SAP ERP application name is still JobClass<n>
(<n>=1..10).
Important
If Adaptive Recognition is used for applications with country setting China (Mainland), you should only
train invoice layouts that are not supported by ICC. Adaptive Recognition results are discarded for invoice
layouts that are supported by ICC.
3. To add a type, click Add, type the name of an existing SAP ArchiveLink document type in the appearing dialog
box and confirm with OK.
4. To modify a defined type, select it and click Edit. You can then modify the name in the appearing dialog box.
Be sure to use the name of an existing ArchiveLink document type in SAP ERP.
5. To remove a type, select it and click Remove.
For information about configuring ArchiveLink document types in SAP ERP, see "Configuring Archive Link" in
OpenText Vendor Invoice Management for SAP Solutions- Configuration Guide.
3. To add an email address, click Add, type a complete email address (john.smith@company.com), or an entire
domain (@company.com) in the appearing dialog box, and confirm with OK.
4. To modify an entry, select it and click Edit. You can then modify the email address in the appearing dialog box.
5. To remove an entry, select it and click Remove.
2. If additional information contained in the line items, such as additional costs and discounts, should be extracted,
select the Enable line item qualifying and filtering check box.
3. If the line items should not be extracted, clear the Extract line items check box.
4. If purchase order data are downloaded, and the invoice line items should be matched against these data, select
the Check line items against downloaded data check box.
Note: PO line number, PO number, and delivery note number are not returned by the recognition step. Therefore
these fields are empty in the invoice item data if the Check line items against downloaded data check box is
not selected.
Note: Download comprises items with ITEMTYPE 3 (planned delivery costs), and ITEMTYPE 4 (goods receipt
for planned delivery costs). These items are not used at the moment.
If the sum of the recognized item amounts equals the recognized invoice amount and only a single invoice item could not
be mapped to a single remaining purchase order item, this item is mapped without regard to the field values.
If the unit price or the amount of an invoice item on the invoice differ(s) slightly from the unit price or amount values in
the corresponding purchase order item in the downloaded data, line item mapping will not find the corresponding
purchase order item in the downloaded data by default. To allow mapping despite of slightly different values, enter a
tolerance value under Set line item mapping tolerance. To specify an absolute tolerance, accept the default option
Absolute; otherwise select Percent. If the difference is in the specified tolerance range, line item mapping finds the
corresponding purchase order item. A percentage value may not work as expected due to the nature of the algorithm,
which is optimized for fast search.
By default, only PO numbers found on the invoice are returned. If you select the If no PO number found, determine
PO number from DB check box, a PO number found in the database is returned if the PO number could not be found
on the invoice.
To prevent high runtimes or erroneous results, line item mapping is not performed if no PO number could be found and
more than 200 line item candidates were detected.
Note: New item units only bring about that the specified values are accepted in the Validation Client; specifying
additional item units does not affect Recognition.
3. If found rows for additional qualifier types should also be returned, select the check box in the Deliver column
for the respective item.
4. To define additional application-specify qualifier types, right-click into the table, click Add key word row,
type a name in the Qualifier column, and add phrases to the phrase list of the new item. If the detected items
should be removed from the result, clear the check box in the Deliver column.
5. In the Phrases column, click the ellipsis button for the respective item. In the Phrase dialog box opens add,
edit, or remove phrases.
6. The ExclusionPhrases list contains phrases that are similar to some item qualifiers but may occur in other
types of items lines. Therefore they must not be found to avoid confusion.
Caution
Do not use the Delete key word row command on the context menu on the predefined items; otherwise the
preconfigured phrase list gets lost.
Note: If you select the Enable line item qualifying and filtering check box, the Visible check box at the
ItemQualifier column of the InvoiceItems field is also selected automatically so that this column is shown as
Qualifier column in the table view of the Validation Client. This column is a list where the Validation operator
can select either an empty item or one of the qualifiers whose Deliver check box is selected on the ItemQualifier
tab.
Note: The PO number ranges (and the regular expression) only affect recognition. Their purpose is to prevent that
other numbers found on the invoices, such as phone numbers, are interpreted as PO numbers. If PO numbers are
checked against downloaded data, or during line item mapping, PO numbers may be returned that are contained in
downloaded data but are outside the specified number ranges (or do not match the regular expression).
To modify the number ranges:
Open the Settings dialog box, expand the nodes Fields > Predefined Fields > PO Numbers.
7. To add a range, click Add, type the first two digits of the purchase order number range, and confirm with OK.
8. To modify an entry, select it and click Edit. You can then modify the first two digits in the appearing dialog
box.
9. To remove an entry, select it and click Remove.
If your purchase order numbers have a different structure that cannot be determined by number ranges, you can use a
regular expression to determine the purchase order numbers (see below).
Note: This check box only affects the returned header PO number. To get PO numbers for line items, you need to
select the Check line items against downloaded data check box at the Extraction subnode of the
InvoiceItems field node.
Note: To check the purchase order numbers, not all entries of the downloaded purchase order data are used.
Invoice Capture Center only uses the entries whose VENDORID field value matches the found vendor number,
whose COMPANYCODE field value matches the found company code, and whose SYSTEM field value matches the
found SAP ERP target system.
4. To test your regular expression, type a test string in the Sample input data box and click Test. If the test
string matches the regular expression, the Result color box changes to green. If the regular expression uses
3. If the first method for Vendor ID determination should not be used, clear the Use PO number for vendor
detection check box.
4. If company code and SAP system should not be checked within the second method, select the Ignore
company code and SAP logical system at vendor detection check box.
This is typically used by companies using identical vendor IDs in all company codes and in all subsystems, or in
a single-system environment. It permits to reduce download data volume considerably by downloading PO data
only for a single company code.
The list on the right side has an item for each currency supported by ICC/BCC; each currency is represented by
its three-letter ISO code. At the currencies that are preconfigured for the current application the Active check
box is selected; these currencies are shown on top.
3. To modify the set of valid currencies, select or clear Active check boxes as needed.
4. To show the supported notations, click the ellipsis button in the respective currency row. A Phrase dialog box
opens where you can add, modify, and delete notations as needed.
Tip: To specify the default currency, go to the InvoiceCurrency field, and click a currency in the Default
value list on the Data tab.
To add a new currency:
1. Right-click the item unit list, click Add translation line, and type a name in the appearing new line.
2. Click the ellipsis button to open the Phrase dialog box and specify phrases for the new currency.
Note: New currencies only bring about that the specified values are accepted in the Validation Client; specifying
additional currencies does not affect Recognition. Only selecting and clearing of the Active check box for standard
currencies affects Recognition. Modification of notations and added new currencies only affect manual input and
Single Click Entry.
Note: If a dollar sign is found, no currency is returned because it cannot be determined whether the dollar sign
indicates currency AUD, CAD, HKD, or USD.
3. To add a VAT rate, click Add, type the VAT rate in the appearing dialog box and confirm with OK.
As VAT rate, you can enter an integer, or a decimal with up to two decimal places and a dot as decimal
separator. The value must be between 0 and 100.
4. To modify a VAT rate, select the respective entry, click Edit, and type the new value in the appearing dialog
box.
5. To remove a VAT rate, select the respective entry and click Remove.
With country setting Canada, the VAT rates box looks different and permits to enter valid tax rates of the different tax
types (Provincial Sales Taxes, Goods and Services Tax, and Harmonized Sales Tax):
With country setting India, the VAT rates box also looks different and permits to specify values for Indian tax types:
Note: The VAT Rates node is not available for applications with country setting USA.
Note: Take notice that the company code field is shown in Validation only if automatic company code detection
has been specified. This setting cannot be modified, that is, selecting or clearing the Visible check box on the
Presentation tab of the Settings dialog box does not take effect for this field.
For each company code detection method, the company code is connected with a SAP ERP target system. At runtime, if
ICC/BCC has detected the company code of an invoice, it also knows the SAP ERP target system for the invoice. When
ICC/BCC exports the recognition results to Vendor Invoice Management, it also exports the found target SAP ERP
system. Thus ICC/BCC supports SAP ERP multi-backend systems.
To prepare the recipients data file, you can use transaction /OPT/VIM_COMP_DN and report
/OPT/VIM_COMPCODE_DOWNLOAD of Vendor Invoice Management which creates a file in CSV format containing
address data for all company codes. To optimize company code detection, insert additional lines with all address variants
that may occur on invoices for each company code manually before you import the file.
When you later export the application settings to a file or to SAP ERP, the recipient data are be included automatically so
that they are present when you import an application on the production system.
3. Use the buttons Add, Edit, and Delete to modify the list of phrases as needed.
The Phrase list tab contains phrases in different languages, which are used to detect a credit memo. To
optimize recognition you could remove phrases if invoices of the respective language do not occur at your
application, or you could add phrases of countries that are not supported by ICC/BCC.
4. If certain phrases result in an erroneous detection of a credit memo, click the Exclusion list tab, and add the
respective phrases.
For example, the word Credit appears in the company names of several banks. If such a company name
occurs on an invoice, this invoice would be erroneously classified as credit memo. This can be prevented by
adding the respective company names to the exclusion list.
Caution
Do not modify format specifications in the Formats tab of the Region and Language dialog box and the
Customize Format dialog box of the Windows Control Panel because this would affect checks and
formatting within ICC/BCC.
In the Format interpretation list, you can modify the list of cultures used for conversion. You can do the following:
Reorder the cultures, for example if invoices from some foreign country occur often in your input.
Delete cultures of languages that do not occur at your application.
Add a culture of a country that is not supported by ICC/BCC.
The culture list always contains the culture of the country of your application as first item, and you do not modify this
first item in normal applications, but you do modify this item if you customize an application for a country that is not
supported by ICC/BCC.
Important
The available cultures depend on the operating system version. If you want to add cultures, make sure that the
specified culture is supported on your operating system, and the operating system of the intended production
system. On the Internet you can find lists of the supported cultures of the different operating system versions on
the MSDN website..
You must not modify format specifications in the Formats tab of the Region and Language dialog box
and the Customize Format dialog box of the Windows Control Panel because this would affect checks and
formatting within ICC/BCC.
If all invoices do not exceed a certain number of pages, specify that number at Stop extraction after number
pages. Extraction then stops reading after the specified number of pages if no other condition is fulfilled earlier.
If the page following the last page of each invoice is marked with a barcode label, click the respective barcode type in
the on page with barcode list. For this feature to work reliably, the barcodes should not be too short and should be
placed horizontally without too much skew. By default, any barcode of the specified type causes extraction to stop. If
you select the with specific content check box, extraction only stops if the barcode contains the specified value. If
you append ".*" to the specified string, each barcode whose value starts with the specified string causes extraction to
stop. Take notice that the barcode may only contain Latin characters and digits.
If you select the OR on page with phrases check box, you can specify phrases. Extraction stops if one of the
specified strings is found on a page. To make sure that a phrase is always found even if not all characters could be
recognized correctly, the algorithm will allow small differences between a specified phrase and a found string. To
prevent that text occurring within the invoice data inadvertently stop extraction, you can also specify a list of exclusion
strings. If a string matches a specified phrase but also matches an exclusion string, it will not stop extraction. To specify
the strings, click Edit phrases and enter phrases and exclusions in the Phrase List dialog box. The specified phrases
and exclusion strings may only contain Latin characters; Asian characters cannot be used.
Caution
Make sure that you do not specify too short strings as phrases. Otherwise similar strings occurring within the
invoice could erroneously terminate extraction so that important invoice data could get lost.
For each group, the dialog box shows a rectangle highlighted with the currently used color.
For each group, the color used for highlighting is a lighter version of a dark base color. Below the colored rectangle of
each group, a slider controls the color used for highlighting. If you move the slider to the left, the color becomes lighter
until it becomes white if you move the slider to the left end. If you move the slider to the right, the color becomes darker
until it equals the dark base color if you move the slider to the right end.
To select a different base color for a group, click the respective rectangle. The Color dialog box opens. In this dialog
box you can select a color from a set of basic colors or specify your own color.
Note: The Settings button is only available if an application is open. The current application is shown in the title
bar of the Customizing Client. To open an application, click Open Application on the application menu, and
then click the appropriate application.
On the different tabs of the field settings you:
Activate optional fields.
Change the order of the fields or columns for Validation.
Activate Adaptive Recognition for standard invoice fields.
Set limits for date fields.
Caution
It is recommended to change field settings of an application only in ways described in this guide. Changing
field settings without expert knowledge may deteriorate recognition results.
These fields are adaptive recognition fields. The Server and the Recognition Nodes must learn how to extract data for
these fields. The learning data for these fields have to be provided by training (see "Training of Adaptive Recognition
fields" on page 31).
For Austria, Germany, and Switzerland, Supply Date is a standard field, which is read by a special automation method.
To make an optional field visible:
1. Open the Settings dialog box and expand the nodes Field >Predefined Fields.
2. Click the respective field to show its properties.
3. On the Presentation tab, select the Visible check box:
Note: The invoice items table is always shown as last field without regard to its tab order value. Therefore it is not
meaningful to assign a lower tab order value to the InvoiceItems field.
To modify column order of the invoice items table:
1. Open the Settings dialog box and expand the nodes Fields > Predefined Fields > InvoiceItems > Elements.
2. Select the different column nodes and on the Presentation tab modify the values of the Tab order property
as needed.
To improve the recognition result, you have to perform training. For more information, see "Training of Adaptive
Recognition fields" on page 31.
If the limits are set to a negative value, only date values are found and returned that are smaller (that is earlier) than the
date of processing.
If Lower limit is set to an absolute date, only date values are found and returned that are greater (that is later) than or
equal to the specified date. If Upper limit is set to an absolute date, only date values are returned that are smaller (that is
earlier) than or equal to the specified date.
3.5.6 Showing additional tax rate and VAT amount fields in Validation
In order to support processing of invoices with several different tax rates, there are four pairs of additional fields for
additional tax rates InvoiceVatRate1 .. InvoiceVatRate4, and InvoiceVatAmount1 .. InvoiceVatAmount4.
These fields are processed if additional tax data can be found on an invoice, but by default these fields are not shown in
Validation.
To make these fields visible in Validation, open the Settings dialog box, and for each desired field, select the field in
the field list in the tree view, click the Presentation tab, and select the Visible check box.
The Lookup list column determines the assigned database column. Do not modify this property.
The Column confidence property takes effect if Deliver OCR results or Deliver database entry
checked by OCR is selected. If you type a value in the Column confidence box, a value is only returned
for the column if the value read from the invoice matched the database column value with a higher confidence.
If Deliver OCR results is selected, the value read from the invoice is returned in this case. If Deliver
database entry checked by OCR is selected, the column value from the database is returned; this makes
sure that the value is returned in the correct format for ICC/VIM.
If you set Column confidence to 100, a value is only returned if the read value was identical to the column
value. For some standard fields, this property has the default value 100.
For some fields, Deliver database entry checked by OCR is selected by default, whereas for most fields
Deliver OCR results is selected.
If Deliver database entry is selected, the value of the respective database column is returned even if it is not
present on the invoice.
If the Refresh from base field check box is selected, the field value is updated if another database entry is
being selected by the Validation user. If the field value should not be updated during Validation, clear the check
box.
By default, the field is positioned within a group together with the corresponding base field and its tab order is
modified accordingly. If the additional field should be the last field in tab order, clear the Place near base
field check box. If you select the check box again later, modify the tab order to position the field as desired.
4. In the Field name box, type the name that identifies the field.
5. Enter a default value if necessary; otherwise the field remains empty if it is not captured.
6. Select the Field cannot be empty check box if the field is mandatory. Validation then indicates an error if
the user tries to submit the invoice while the field is still empty.
If you select ComboBox, the dialog box changes, and shows an empty value list, and buttons Add and
Remove:
Click Add to add a value; a text box appears where you can enter the text. The list shows the already
specified values. To remove a value, select it and click Remove.
For the values, you can use uppercase and lowercase letters, but during processing lowercase letters are used.
If you add the same name with different spelling, this variant is ignored. Captured data may use any spelling,
that is, any combination of uppercase and lowercase letters. The returned result value uses the spelling of the
value you specified in the value list.
To achieve a special order of the items, you can either move a selected item up or down or click Sort to sort
the items alphabetically.
LookupList
Select LookupList to configure database support for the field.
ListBox
If you select ListBox, the field value must be a comma-separated list of strings which is shown as a list in
the Validation Client:
If values are captured for such a field, the field remains active in contrast to a textbox field. Thus the
Validation operator can capture several list items using the mouse or the keyboard. In addition, list items can
be selected and removed.
When the button is pressed, the code defined at the scripting entry point Validation (Field Changed) is
called.
If the button should show a fixed label, specify this text at the Default value property on the Data tab.
2. If you configure database support for the first time, the list is empty. In this case, click the ellipsis button behind
the Lookup list column list. The Manage Lookup List dialog box opens:
With this dialog box you can configure a list of all database files used by the current application.
3. To add a database, click Import. The Import a Table dialog box opens:
10. Click Import. The database file is imported, and appears in the file list of the Manage Lookup List dialog
box:
Tip: If your database file changes frequently, an automatic reload can be configured using the program
LookupDatabaseUpdater.exe. For more information, see section "Configuring Lookup Lists in" the chapter
“Technical Operations” in Invoice Capture Center - Administrator’s Guide..
The Lookup list column list on the Presentation tab shows all available columns of all loaded database files as
<filename>.<columnname>. In the following example, a database file MyDatabase.txt has been imported:
To assign a database column to the current field, click the desired column.
If you specify LookupList as control for further fields and assign columns of the same database file, the fields are
shown as a group in the Validation Client. As first control in the group, a search box is shown, which is used to
determine the matching database record manually. If you type a search string in this box, or capture an according value
from the document, the list of the control then shows all matching database records. As soon as you select an item in the
list, all fields of the group are filled with the respective column values.
If you want to specify automation for such fields, you must use the automation method Generic SnapMatch for all
fields using columns of the same database file. SnapMatch then uses the respective database file to find a database record
that matches the current document, that is, where several column values of the result record could be found on the
document.
Take notice that it is of no relevance for SnapMatch, for which columns of the database you configure fields. Just
configure fields for those columns that should be exported in the invoice result. If you want to export a column whose
content is of no use for the validation operator, you can clear the Visible check box on the Presentation tab for this
field. The field is then filled and exported, but it is not shown in the Validation Client.
The Method list contains four generic automation methods, which can be used for custom fields:
Generic DatabaseColumn method: this method permits to export additional recipient or vendor data.
Generic Snapmatch method: this method permits to find a record in a database that matches data on the
invoice. Therefore this method is best suited if you have a database and expect that some data of one of its
records should be present on the invoice. For more information, see "Configuring SnapMatch automation
method" on page 121.
Generic Adaptive recognition method: with this method you first perform training in order to store typical
invoices and the position of the index field on these invoices as templates in a database. During extraction, a
matching sample is determined for the current document, and the field is then extracted at the position where it
was trained for this sample. This method is suitable for many kinds of fields. For more information, see
"Adaptive Recognition " on page 28.
Generic Adaptive classification method: this method compares the whole textual content of the invoice
with previously trained samples in a database and returns a class value for the best matching samples in the
database. This method is suitable for fields that should determine a property of the whole invoice; for example it
would be suitable to determine the language the invoice is written in. For more information, see "Adaptive
Classification (invoice applications)" on page 30.
In addition, you can create your own rule-based method using the Document Extraction Design Studio. There a
lot of field types are available. For example you can search date or amount values, regular expressions in a
specified area of the invoice, search a field relative to the position of another field, and much more. For more
information, see "Rule-based Recognition automation methods" on page 124. The respective method names are
preceded by Custom.
Note: For applications with language setting China (Traditional), Japan, Korea, and Thailand, Adaptive
Recognition is also used for extraction of the line item table.
Adaptive Recognition cannot be used for:
Line item fields. Exceptions: for China (Mainland), China (Traditional), Japan, Korea, and Thailand Adaptive
Recognition can be used for line items.
Fields using download data, such as vendor and recipient fields
Amount fields of applications with country setting India
Fields occurring at different positions on invoices of the same vendor; therefore extraction of amount fields
does not benefit from training for vendors where the amount is positioned behind the last line item row so that
its position depends on the number of line items.
Note: Take notice that for the standard amount fields values found by Adaptive Recognition are only taken into
account if the standard rule-based method did not return a result.
Note: For China (Mainland), Adaptive Recognition results are only used for invoice layouts that are not yet
supported by ICC. For invoice layouts supported by ICC, Adaptive Recognition results are discarded. Therefore it
is not useful to train samples for these layouts.
For more information about rule-based methods, see "Rule-based methods (Document Extraction)" on page 28 and
OpenText Capture Document Extraction - Developer’s Guide.
Note: Modifications of the Document Extraction project do not take effect until you close the Settings dialog
box with OK afterwards. It is not sufficient to save the project in the Document Extraction Design Studio.
If you add your custom method to this node, it replaces the standard method only for documents of the respective
document type whereas the standard method takes effect for all other documents.
You can even replace the standard field for several user-defined document types by using the same field name at all
document type link nodes.
4. Type the desired column name in the Field name box on the Data tab.
4. Invoice Capture Center takes care of the specified VAT rates only for the standard fields. Therefore take care to
specify the valid VAT rates at the Vat Rates property of the Invoice Items Custom field.
For applications with country setting Hungary or India, special preconfigured Invoice Items Custom fields have to be
used, which are delivered as custom operator files that have to be loaded as global fields in the Design Studio.
For applications with country setting Hungary or India:
1. In the Document Extraction Design Studio, click the Globals tab in the Project Explorer view.
2. Right-click the view, and click Add Custom Operator on the context menu:
3. A file dialog box opens. Go to the CustomItems subfolder of the ICC/BCC installation folder. For Hungary
select the file CeItems_HU.ics, for India select the file CeItems_IN.ics, and then click Open.
4. Click the Classes tab, and under the Unknown node, right-click the IndexFields node.
Note: Modifications of the Document Extraction project do not take effect until you close the Settings dialog
box with OK afterwards. It is not sufficient to save the project in the Document Extraction Design Studio.
2. A new column node appears. Replace the default name with the column name you entered in the Settings
dialog box of the Customizing Client; it is important that these names are identical!
3. Open the new column node, select its index field subnode, and modify the field properties as needed. For more
information about field properties, see OpenText Capture Document Extraction - Developer’s Guide or the
online help of the Design Studio. In the following example, a Regular Expression column with the name
MyCustomColumn has been added; the node has been selected, and the Properties view shows the field
properties:
4. When you click OK to close the Settings dialog box, the application is updated.
Note: Modifications of the Document Extraction project do not take effect until you close the Settings dialog
box with OK afterwards. It is not sufficient to save the project in the Document Extraction Design Studio.
5. The Lookup list column list shows all database columns of the current vendor and recipient database files.
Click the desired item in the list.
6. The Column confidence property takes effect if Deliver OCR results or Deliver database entry
checked by OCR is selected. If you specify a value at the Column confidence box, the column value is
only returned if the value read from the invoice matched the database column value with a higher confidence. If
you set it to 100, the value is only returned if the read value was identical to the column value.
7. By default, the field returns the data that were read from the invoice for this column if it could be found. To
return the value of the respective database column, click Deliver database entry. To deliver the database
column value only if the value could be found on the invoice, click Deliver database entry checked by
OCR; this makes sure that the value is returned in the correct format for ICC/VIM.
8. If the Refresh from base field check box is selected, the field value is updated if another database entry is
being selected by the Validation user. If the field value should not be updated during Validation, clear the check
box.
9. By default, the field is positioned within a group together with the corresponding base field and its tab order is
modified accordingly. If the additional field should be the last field in tab order, clear the Place near base
field check box. If you select the check box again later, modify the tab order to position the field as desired.
Take notice that Deliver OCR results and Deliver database entry checked by OCR can only be used if the
respective database column is searched on the documents. In addition to the columns used by the standard fields, these
are the following columns:
STREET and POBOX of the Recipients database and
PHONE, FAX, EMAIL, WWW, SWIFT, RESERVE1, and RESERVE2 of the Vendors database.
If you use these options with one of the remaining columns, the field always returns an empty result.
For vendors with country India„ in VIM system with India localization, origin of VAT ID data is as follows (download
program for India):
Note: For download you should use a SAP ERP system with current vendor and purchase order data in order to
achieve good recognition results on matching invoices.
5. At the Index Column list, click the ellipsis button. The Manage Lookup List dialog box opens.
5. At the Index Column list, click the ellipsis button. The Manage Lookup List dialog box opens.
6. Select the Show internal tables check box.
7. Select the list entry Positions, and then click Reload. In the appearing file dialog box select the file that you
want to load (PO_Header.txt).
8. Wait until a dialog box indicates that import has been finished. Close the dialog box with OK.
9. Select the list entry PositionDetails, and then click Reload. In the appearing file dialog box select the file
that you want to load (PO_Items.txt).
10. Wait till the Import finished dialog box opens. Close the dialog box with OK. Then close the Manage
Lookup List dialog box with Close.
11. Close the Settings dialog box with OK.
The application is now being updated.
6. Click Import recipients. In the appearing file dialog box select the file that you want to import and then click
Open.
7. Wait till the Import finished dialog box opens. Close the dialog box with OK.
8. Close the Settings dialog box with OK.
The application is now being updated.
You can now check the recognition results. Fields that could be extracted correctly and empty fields that are
optional are highlighted with green color; empty mandatory fields are highlighted with red color.
5. Click Submit or Cancel to close the Validation view. The recognition results are deleted.
Note: Vendor master data and purchase order data have to be available for local testing to get good recognition
results.
Tip: By default, the Open command is not available. To be able to import invoice from a SAP ERP system, click
ICC Options on the application menu, and then select the Activate Validation mode in Customizing Client
check box.
3.13.2 Validation
After opening an invoice within the Customizing Client, the Application tab shows the invoice image and the extracted
invoice data.
The Validation Client and the Customizing Client provide the same validation facilities. Thus, for more information
about the Validation user interface, see ICC/BCC - User’s Guide. Note that the validation view of the Validation Client
shows the original, scanned invoice image whereas the Customizing Client shows a preprocessed image.
3.13.4 Monitoring
During application testing you may want to monitor the system. For information about monitoring, see ICC/BCC -
Administrator’s Guide.
Description:
Shows by which user and at which date the application was created. You can also enter a comment describing
application details.
OCR character set:
This property specifies the character set used for character recognition. Thus it also determines the languages
of the processed documents. The default character set Western Europe (CP 1252 WinLatin1) is suitable for
languages of Western European and American countries and also for many other countries where the
supported languages are also used.
Important
Do not change the character set after you have added rule-based methods (Document Extraction) to your
application. Otherwise the character sets of these methods become invalid and they may not work
correctly.
Format interpretation list:
Specify default languages (cultures) used for conversion at the normalization rules of the types Check type
and Check type and formatting if the property Parse Culture List is empty at the respective rule. It
should contain the languages of the documents that should be processed by the application. Typically, the first
item specifies the language of the most frequent documents.
At the rules, the current field value is checked against the cultures in the specified order. If a culture returns a
successful result, checking is terminated, the field state is set to OK, and depending on the rule type, the value
may be converted according to the determined culture.
3. In the Regional format list, select in which format culture-specific data such as amounts or dates should be
displayed in the Validation Client.
Normally, the value of this property must equal the first item in the Format interpretation list in the general
application settings. If these values are different, you are asked for confirmation when you click OK in the
Settings dialog box.
4. You can change the colors used to highlight different groups of items in the Validation Client.
For each group, the color used for highlighting is a lighter version of a dark base color. Below the colored
rectangle of each group, a slider controls the color used for highlighting. If you move the slider to the left, the
color becomes lighter until it becomes white if you move the slider to the left end. If you move the slider to the
right, the color becomes darker until it equals the dark base color if you move the slider to the right end.
To select a different base color for a group, click the respective rectangle. The Color dialog box opens. In this
dialog box you can select a color from a set of basic colors or specify your own color.
5. Select the Activate "Mark for Training" in Validation check box if the command Mark for training
should be available in the Validation Client. If the application uses Adaptive Recognition (ART), select this
check box if the Validation user should be able to mark documents that should be trained later. For applications
that do not use ART, selecting this check box is not meaningful.
6. Clear the Activate "Reject" in Validation check box if the Reject command should not be available.
7. If pages that are excluded from extraction at all fields should still be shown in the Validation Client, select the
Show all pages in validation client check box. By default, those pages are not shown.
Note: For Solution Accelerators, names of preconfigured fields are by default shown in the language specified at
the Regional format property if this language is supported by BCC.
Note: During classification with Document Extraction, only the first page of a document is used. The options in
the Used pages area do not take effect for classification with Document Extraction.
Note: Adaptive Classification will only return results after documents of at least two different classes have been
processed.
Note: The algorithm is able to cope with more difficult scenarios, for example strings with varying lengths or
values that are arranged in two separate columns containing string with a different syntax. Therefore you should try to
use this extraction methods even in complex list types and test whether data are extracted correctly after some training.
To specify a rule-based method using Document Extraction:
1. Select the CDE check box.
2. Click Edit rule-based method and configure a suitable field using the Document Extraction Design Studio.
You can use fields that have an Occurrences property. At the Occurrences property, select the value all so
that the field can return several values.
3. Save the project and exit the Design Studio.
4. Click the field prepared in step 2 in the Operator list.
If you select both extraction methods, CSA will be used first. If it does not find a value, CDE will be called.
By default, selected automation methods will process all pages of the document. If they should only process the first or
last pages of the document, click Extract on these pages and enter a number of pages at the First and Last
properties. For example, to extract the field on the first three pages, set First to 3 and Last to 0.
Note: If you want to use the F9 function key in the Validation view of a BCC application containing a Partner
function during local testing with the command Open from File, you must log in to the respective SAP ERP
system so that the required staging table can be accessed.
Note: If you want to use the F9 function key in the Validation view of a BCC application containing a
LookupList field during local testing with the command Open from File, you must log in to the respective SAP
ERP system if a staging table has been specified.
Note: If Address view is selected on the Validation tab, the conflicting properties at the column nodes of the
project tree (Display label, Tab order, and the Display options) are ignored.
After you have imported a database file or defined columns manually, the names of all columns are shown in
the column list in the Compact columns area.
3. Specify the columns for which compactness is required: Either click Specific columns and select the check
boxes at the respective columns, or click All columns.
To add a filter:
1. Click Add. A new filter row is shown.
2. To specify a column as mandatory for this filter, click the down-arrow at the respective column, and click a
confidence value in the list:
In this example, matching data satisfy this filter only if a value for the CITYNAME column could be found on
the document with a confidence greater than 80.
3. Repeat step two for all columns that must appear together.
Display label
Field name that is shown in the Validation Client.
If it is empty at a predefined field of a Solution Accelerator, the field name is displayed in the language
specified by the Regional format property at the Validation node if this language is supported by BCC. If
it is empty for a custom field, the function name is used.
Tab order
Field order in the field list in the Validation Client is specified by increasing values of this property, that is,
the field with the lowest tab order value is displayed first. If you want the field to be displayed at a different
position, enter a value for Display tab order that lies between that of the desired predecessor and successor.
Visible,
Read-only
Hidden
By default, a field is displayed in the Validation Client and can be modified by the Validation operator.
If the Validation operator should not be able to modify the value, click Read-only.
If the field should not be displayed in the Validation Client, click Hidden.
Set error if empty
Select this check box if the field is mandatory. The Validation Client then indicates an error if the user tries to
submit the invoice while the field is still empty.
Note: The same general properties are available at each column of a Table function and a LookupList function.
Caution
If you change the name or a property of a CSA function, all previously collected templates will be deleted as
soon as the first document is processed with the modified application.
Important
If you use Table Auto-Completion at a Table field during initial training, check all table rows and delete all
rows that are not correct. When you click Submit, it is not harmful if some correct table rows are missing, but
the captured data must not contained erroneous rows.
Note: If you want to use the F9 function key in the Validation view of a Delivery Note Solution Accelerator or a
Sales Order Solution Accelerator during local testing with the command Open from File, you must log in to the
respective SAP ERP system so that the required staging table can be accessed.
Note: If an entry exists for VIM internal name, the field value is visible in VIM baseline screens or can be made
visible by configuration. It can be included in the business rule Missing mandatory information by
configuration.
Statements under VIM process focus on the Business Rules or Check Rules. If there are no business rules
for a field but the field value is used elsewhere, this is mentioned. If nothing is stated under VIM process, the
VIM baseline makes no use of this field except for showing it in the screens or checking it in business rule
Missing mandatory information.
5.1.31 PO Number
Internal name Countries Validation Export VIM internal name
Visible Mandatory
InvoiceOrderNumber All Yes Yes Yes EBELN
5.1.69.1 House No
Internal name Countries Validation Export VIM internal name
Visible Mandatory
RemitHouseNumber US Yes No Yes REMIT_HSNM1
5.1.69.3 City
Internal name Countries Validation Export VIM internal name
Visible Mandatory
RemitCity US Yes No Yes REMIT_AD_CITY1
5.1.69.4 ZIP
Internal name Countries Validation Export VIM internal name
Visible Mandatory
RemitZIP US Yes No Yes REMIT_PSTLZ
5.1.69.5 Country
Internal name Countries Validation Export VIM internal name
Visible Mandatory
RemitCountry US Yes No Yes REMIT_LAND1
5.1.70.1 House No
Internal name Countries Validation Export VIM internal name
Visible Mandatory
ShipHouseNumber US Yes No Yes SHIPTO_HSNM1
5.1.70.2 Street
Internal name Countries Validation Export VIM internal name
Visible Mandatory
ShipStreet US Yes No Yes SHIPTO_STREET
5.1.70.3 City
Internal name Countries Validation Export VIM internal name
Visible Mandatory
ShipCity US Yes No Yes SHIPTO_AD_CITY1
5.1.70.5 Country
Internal name Countries Validation Export VIM internal name
Visible Mandatory
ShipCountry US Yes No Yes SHIPTO_LAND1
5.1.70.6 Region
Internal name Countries Validation Export VIM internal name
Visible Mandatory
ShipRegion US Yes No Yes SHIPTO_REGIO
5.2.2 PO Number
Internal name Countries Validation Export VIM internal name
Visible Mandatory
ItemOrderNumber All Yes No Yes EBELN
5.2.4 Amount
Internal name Countries Validation Export VIM internal name
Visible Mandatory
ItemAmount All Yes No Yes WRBTR
5.2.5 Quantity
Internal name Countries Validation Export VIM internal name
Visible Mandatory
ItemQuantity All Yes No Yes MENGE
Application
An application bundles all settings that are needed to process documents of a certain kind, for example which fields have
to be extracted by which extraction method, or which fields should be visible in the Validation Client.
ART
Abbreviation of Adaptive Recognition Technology
BCC
Abbreviation of Business Center Capture.
Cluster
See Recognition Cluster
Cluster Node
A Cluster Node is a component of a Recognition Cluster performing job processing. The Recognition Cluster contains
one Cluster Node for each processor of the Recognition Cluster. The Load Manager performs load balancing and
distributes recognition jobs among all Cluster Nodes of the Recognition Cluster. If there is only a Recognition Server
and no Recognition Nodes, there will be as many Cluster Nodes as processors in the Recognition Server machine. If the
Content-Based Classification
See Adaptive Classification
Customizing
ICC/BCC provides various settings to adapt the system to customer needs. These customizing settings are bundled in
applications. To customize ICC/BCC for customer needs you have to create applications.
Customizing Client
The Customizing Client is a Windows program that is used on the development system to customize ICC/BCC, that is, to
create and configure applications. It is also used to test applications, to export them to a SAP ERP system, and to import
them on a test or production System. In addition, it is used to monitor and administer an ICC/BCC system.
Document Reader
In an ICC/BCC system the actual data extraction from document images and super-ordinate tasks, like splitting of pages
or export of results, are performed by a component named Document Reader.
Hot Spot
The Recognition Server uses Hot Spots to fetch recognition jobs, download data, and learning data from its SAP ERP
system. A Hot Spot becomes active regularly after expiring of a specified interval or at a specified time on specified days
and then tries to fetch new data. Each application uses Hot Spots of two different types. A SAP Extraction Link is used
to fetch new recognition jobs from SAP ERP. A SAP Download Link is used to fetch new data, such as vendor master
data and purchase order data.
ICC
Abbreviation of Invoice Capture Center
Learning
While special, optimized methods are used to extract the standard fields, for adaptive recognition fields and adaptive
classification fields ICC/BCC has to learn how to fill the field values. The learning data have to be provided by training.
The Recognition Server keeps the learning data in its local database.
Link
Links are used to download recognition jobs, download data, and feedback data from SAP ERP. Links are implemented
as so-called Hot Spots. The Hot Spots used for the Links connect to SAP ERP regularly at specified times and fetch
available jobs or data.
Load Manager
ICC/BCC uses load balancing in order to distribute recognition jobs on several processors of the Recognition Server and
on additional Recognition Nodes. Load balancing is performed by the Load Manager. During configuration of the
ICC/BCC system, you configure the Load Manager on the Recognition Server and create computer nodes for all
additional Recognition Nodes and Runtime Nodes for all processors on each Recognition Node.
Master
On a Recognition Cluster, the Load Manager performs load balancing over all processors of the Server and the
Recognition Nodes. The Load Manager on the Server acts as Master. It transfers recognition jobs that are fetched from
SAP ERP regularly to free Cluster Nodes for processing.
Recipient Data
For an invoice application, ICC/BCC can derive the company code and target SAP ERP system automatically using
recipient address data found on the invoice. During application configuration, a database containing recipient data
together with the corresponding company codes and SAP ERP system names has to be supplied if automatic company
code detection shall be used.
Recognition
Recognition comprises the extraction of essential data, such as date, amount, order number etc., from the scanned
documents and the intelligent verification of these data. Recognition is performed by the Recognition Cluster.
Recognition Cluster
The Recognition Cluster is used for automatic data extraction. On smaller systems with low throughput it consists of the
Recognition Server - typically a computer with several processors - that also hosts the Customizing Client. On large
systems with high throughput, further Recognition Nodes can be added. The Load Manager performs load balancing and
distributes recognition jobs among all processors of the Server and the Recognition Nodes.
Recognition Node
For huge systems the Recognition Server can be supported by additional machines, installed as Recognition Nodes. The
Load Manager on the Recognition Server fetches jobs from SAP ERP using a SAP Extraction Link and distributes the
jobs among all processors of the Server and the different Recognition Nodes. By adding Recognition Nodes, the
processing power of the whole Recognition Cluster can be adapted to the required throughput.
Recognition Server
Document processing is performed on a Recognition Cluster consisting of a Recognition Server and possibly additional
Recognition Nodes. The Recognition Server comprises the Customizing Client that is used to customize, configure, and
monitor the ICC/BCC system. It also comprises the databases containing vendor master data, purchase order data,
feedback data, and learning data. It also hosts the Hot Spots that are used to fetch jobs, download data, and feedback data
from the SAP ERP system regularly. The Load Manager of the Recognition Server is the master of the recognition
cluster. It contains the cluster configuration and distributes recognition jobs on the processors of the Server and
additional Recognition Nodes.
Slave
On a Recognition Cluster, the Load Manager performs load balancing over all processors of the Server and the
Recognition Nodes. The Load Manager acts as a Slave on each Recognition Node. When a Runtime Node terminates a
job, it waits until the Load Manager on the Server that acts as Master supplies it with a new job.
Server
See Recognition Server
Standalone System
For development and testing of applications, all components of ICC/BCC can be installed on a standalone system, that is,
on a single computer.
Training
To be able to extract adaptive recognition fields or adaptive classification fields automatically, some training has to be
performed first. Training can be performed manually by the user on the Customizing Client.
Validation Client
The Validation Client is a Windows program that is used by a clerk to check and correct data that have been extracted
automatically and to capture additional fields that could not be read automatically. The corrected and completed data are
then passed on to SAP ERP.
VIM
Abbreviation of Vendor Invoice Management