Professional Documents
Culture Documents
AWS Comprehend User Guide
AWS Comprehend User Guide
USER GUIDE
Version: 6.4
Document Revision: 1.0
The information contained in this document is the proprietary and confidential information of Blue Prism Limited and should not be
disclosed to a third party without the written consent of an authorised Blue Prism representative. No part of this document may be
reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying without the written
permission of Blue Prism Limited.
© Blue Prism Limited, 2001 – 2016
®Blue Prism is a registered trademark of Blue Prism Limited
All trademarks are hereby acknowledged and are used to the benefit of their respective owners.
Blue Prism is not responsible for the content of external websites referenced by this document.
Blue Prism Limited, Centrix House, Crow Lane East, Newton-le-Willows, WA12 9UY, United Kingdom
Registered in England: Reg. No. 4260035. Tel: +44 870 879 3000. Web: www.blueprism.com
As the market for RPA grows, also grows the interest of what RPA can do and how easy it can integrate with every
ecosystem available. With the advent of Artificial Intelligence (AI) to the marketplace, interest has grown in
capabilities that provide integrations with different pre-trained AI services in the cloud.
This document focuses on the design of the integration between Blue Prism® and Amazon’s Comprehend Service.
Amazon provides these in the form of web services, which are consumed via RESTful APIs.
Amazon Comprehend is a natural language processing (NLP) service that uses machine learning to find insights and
relationships in text. No machine learning experience required.
There is a treasure trove of potential sitting in your unstructured data. Customer emails, support tickets, product
reviews, social media, even advertising copy represents insights into customer sentiment that can be put to work
for your business. The question is how to get at it? As it turns out, Machine learning is particularly good at
accurately identifying specific items of interest inside vast swathes of text (such as finding company names in
analyst reports) and can learn the sentiment hidden inside language (identifying negative reviews, or positive
customer interactions with customer service agents), at almost limitless scale.
Amazon Comprehend uses machine learning to help you uncover the insights and relationships in your
unstructured data. The service identifies the language of the text; extracts key phrases, places, people, brands, or
events; understands how positive or negative the text is; analyzes text using tokenization and parts of speech; and
automatically organizes a collection of text files by topic.
https://aws.amazon.com/comprehend/features/
Amazon recommends using Identity and Access Management (IAM) for fine grained control of AWS resources.
Information about IAM is here:
https://aws.amazon.com/iam/
Details for setting up identity-based policies for Amazon Comprehend are found here:
https://docs.aws.amazon.com/comprehend/latest/dg/auth-and-access-control.html
AWS Authorization
AWS employs a propriety algorithm for generating an authorization token, when is provided in the HTTP header.
This algorithm involves creating a signature based on several criteria, including the HTTP request body. The Blue
Prism Comprehend VBO includes the ability to create this authorization token, so there is no need to install the
AWS CLI tools.
AccessID
Access Key ID assigned when setting up IAM
SecretKey
Secret access key assigned when setting up IAM
Amazon Comprehend Detect Dominant Language feature can detect the following languages:
Response
Parameter Data Type Description
Detect Entities
An entity is a textual reference to the unique name of a real-world object such as people, places, and commercial
items, and to precise references to measures such as dates and quantities.
Response
Parameter Data Type Description
Response
Parameter Data Type Description
Detect Sentiment
You can determine if the sentiment is positive, negative, neutral, or mixed. For example, you can use sentiment
analysis to determine the sentiments of comments on a blog posting to determine if your readers liked the post.
Request
Parameter Data Type Description
Response
Parameter Data Type Description
Detect Syntax
Analyze your documents to parse the words from the document and return the part of speech, or syntactic
function, for each word in the document. You can identify the nouns, verbs, adjectives and so on in your document.
Use this information to gain a richer understanding of the content of your documents, and to understand the
relationship of the words in the document.
For example, you can look for the nouns in a document and then look for the verbs related to those nouns. In a
sentence like "My grandmother moved her couch" you can see the nouns, "grandmother" and "couch," and the
verb, "moved." You can use this information to build applications for analyzing text for word combinations that you
are interested in.
Request
Parameter Data Type Description
Response
Parameter Data Type Description
Response Collection Collection of text elements and the part of speech they are
categorized as.
ADJ Adjective
Words that typically modify nouns.
ADP Adposition
The head of a prepositional or postpositional phrase.
ADV Adverb
Words that typically modify verbs. They may also modify adjectives and other adverbs.
AUX Auxiliary
Function words that accompanies the verb of a verb phrase.
DET Determiner
Articles and other words that specify a particular noun phrase.
INTJ Interjection
Words used as an exclamation or part of an exclamation.
NOUN Noun
Words that specify a person, place, thing, animal, or idea.
NUM Numeral
Words, typically determiners, adjectives, or pronouns, that express a number.
O Other
Words that can't be assigned a part of speech category.
PRON Pronoun
Words that substitute for nouns or noun phrases.
PUNCT Punctuation
Non-alphabetical characters that delimit text.
SYM Symbol
Word-like entities such as the dollar sign ($) or mathematical symbols.
VERB Verb
Words that signal events and actions.