Professional Documents
Culture Documents
The following instructions make the software very simple to use, and
feature many screen-shots to illustrate a step-by-step introduction to
both the software and the concordancing process.
i
Table of Contents
Opening a file or corpus . . . . . . . . . . . . . . . . . . . . . . . . .1
Search Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3
Making a corpus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .7
PDF documents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .8
ii
How to use A d TAT
The Adelaide Text Analysis Tool explained
To use this software you will need a corpus, or collection of text. A corpus can be a
single text file or a group of text files (.txt format). When you have some text for your
corpus, open the A d TAT package to begin. You should see a screen like this:
Double-click on a
folder to open it. 1
The Adelaide Text Analysis Tool explained
One of the simplest functions of A d TAT is a basic word search. This software will
search a loaded corpus, finding every occurrence of the search term to show you
other words that appear around it: the collocates used by the writers of the text in your
corpus.
Search Results
If your keyword is found, This process helps you to Select a concordance and
the program will generate see which words are this panel shows where
a concordance list. commonly grouped the word appears in the
together (collocates) in source text, giving the
the text(s) being original context.
searched.
3
The Adelaide Text Analysis Tool explained
This example
shows the results
of a search for all
words that
contain “differ”.
5
The Adelaide Text Analysis Tool explained
Making a corpus
The best way to use the concordancing process is first to gather a collection of
articles which are relevant to the kind of writing you want to investigate. If you want to
examine research articles in a particular discipline, for example, a useful corpus would
consist of published articles from that discipline. This would allow you to search for
language features that are commonly used in this kind of writing.
The size of a corpus depends on the searches you intend doing. There are drawbacks
to having too little text in your corpus as you may not find enough examples of little-
used terms and expressions. Similarly, a corpus which is too large can result in too
many examples, especially of common words, to allow an easy evaluation of
language features.
The following steps will help you to contruct your own corpus quickly and easily:
1. Make sure the documents you want to use are written in current English, with
standard usage of prepositions, articles, verb tenses and other grammatical
features. This requirement can be covered by selecting articles for which at
least some of the authors are likely to be “native speakers” of English, and
ensuring that the articles are from a reputable source – check author and
publisher information, as well as the text itself, for guidance on this.
2. Obtain electronic copies of the articles and save only the text (sentences and
paragraphs, no page numbers, headers or footers, tables and figures), and
save them as text files (.txt format). Your sources may be web pages, PDF
documents or word processor files. See the following section, Preparing text
for concordancing, for more details on coverting text from these sources.
3. Save all the .txt files in a single folder on your computer.
If you are copying text from web pages, select all the text, copy and paste
immediately into a word processor document. If you are using a hard copy, you need
to scan the document and save it as a text only document.
7
The Adelaide Text Analysis Tool explained
PDF documents
When using PDF-based texts, software such as Omnipage or PDF2Text can be used
for conversion. If you use this method, follow the instructions provided wth the
software.
The easiest method in the long term is to copy text from one page or column at a time
and paste each ‘capture’ into the new word processor file. Repeat this process, then
later ‘repair’ the text so that it is restored to its original continuous flow by ensuring
that there are appropriate breaks between words, sentences and paragraphs. This
avoids copying the unwanted parts from the beginning of the process and also
provides a text document which will display correctly in A d TAT .
The process of text conversion and editing seems complex at first, but becomes an
almost mechanical routine with practice.
Windows, MS Word and Microsoft Word are registered trade marks of Microsoft Corporation.
Macintosh is a registered trade mark of Apple Inc.
Omnipage is a registered trade mark of Nuance Communications, Inc.
PDF2Text is a registered trade mark of Retsina Software Solutions.
Adobe Reader is a registered trade mark of Adobe Systems Incorporated.
The above products are referred to in this documentation for information purposes only.
AdTAT is not affiliated with, nor has it been authorized, sponsored, or otherwise approved by
the organisations responsible for the products mentioned. The developers of AdTAT likewise
do not intend such references to be taken as endorsement of these products. 9