Professional Documents
Culture Documents
AN-T80020
Application Note
Diagnostics Procedure
This document provides:
• A procedure for first-line diagnostics
This explains how to gather the appropriate data for each situation. It is easy to lose evidence after a
module or system failure. This procedure explains how to collect the evidence to give to a support
engineer, who can use the rest of this document to diagnose the problem.
• Details of all error codes produced by the I/O modules
For I/O module faults, the first-line diagnostics will usually provide a fault code. This section explains
the fault that has been found and advises on a course of action.
• A guide to the Analysis Tool
The Analysis Tool is a program which helps with the collection and analysis of diagnostic data from
the system. It includes advice on all the error codes and will warn about problems found on the
system.
Issue Record
Issue Date Revised by Technical Authorised Modification
Number Check by
6 March Nick Owens Pete Stock Gerry Derived from 552936 rev 05.
09 Creech Fault codes revised, converted
to AN, advice rewritten.
7 May 09 Nick Owens Pete Stock Gerry Added action on processor
Creech shutdown
8 July 09 Nick Owens Andy Holgate Pete Stock Revised Analysis Tool manual
9 Jan 10 Nick Owens Andy Holgate Pete Stock Revised Analysis Tool manual
10 May 10 Nick Owens Andy Holgate Pete Stock Analysis Tool v3.6 manual
11 Oct 10 Nick Owens Andy Holgate Pete Stock Analysis Tool v4.0 manual
12 Feb 11 Nick Owens Andy Holgate Pete Stock Analysis Tool v4.1 manual
13 Apr 19 Nick Owens Analysis Tool v7.021 manual
WDOG/PWRFAIL error codes
Diagnostics Procedure
Table of Contents
Diagnostics Procedure .............................................................................................................................2
Table of Contents...................................................................................................................................2
First-line Diagnostics ................................................................................................................................4
Diagnostics Flowchart ..............................................................................................................................5
LED Interpretation ....................................................................................................................................6
Toolset Diagnostics ..................................................................................................................................9
Other data on Equipment Definitions ...................................................................................................13
Analysis Tool ..........................................................................................................................................14
Online ...................................................................................................................................................15
Online Options ...................................................................................................................................17
Auto set time/date .............................................................................................................................17
I/O Module Options ...........................................................................................................................17
Erase Logs ........................................................................................................................................19
Analyse Data .....................................................................................................................................19
System Graphic / Online Tree Window .............................................................................................20
Manual Command Entry....................................................................................................................21
Offline ...................................................................................................................................................22
File Menu ...........................................................................................................................................23
View Menu .........................................................................................................................................23
Log View ............................................................................................................................................24
Find Menu..........................................................................................................................................24
Analysis View ....................................................................................................................................24
Bookmarks ........................................................................................................................................27
Module Versions................................................................................................................................31
System Health ...................................................................................................................................32
Bookmarks ........................................................................................................................................33
Main Processor System Logs.................................................................................................................34
Clearing the MP Non-Volatile RAM (NVRAM) Memory .........................................................................37
GALPAT errors (TN20014) ...............................................................................................................37
Processor build 122 and System.INI changes (TN20061)................................................................37
Clearing the non-volatile RAM ..........................................................................................................37
Action on Processor Shutdown ..............................................................................................................38
Normal Shutdown Action ...................................................................................................................38
Processor LED States..........................................................................................................................38
First-line Diagnostics
Every day, check the processor’s System Healthy LED. If this is green, there are no system faults.
There may still be communications problems and field wiring problems.
If the System Healthy LED is flashing red, there is a system fault. Look at the other diagnostic LEDs in
the table on the next few pages.
Each module has ‘Healthy’ LEDs, one for each slice of the module’s circuitry. The Communications
Interfaces (8151 or 8151B) are not triplicated and so only have one LED.
Do not press the main processor reset pushbutton or remove and reinsert a module unless specifically
advised to do so in the procedures below. Pressing the reset pushbutton may clear important
diagnostic information. Removing and reinserting a module may cause shutdowns and will also clear
some fault information.
Keep a logbook for recording error codes from I/O modules. Record the error code, module position
(chassis and slot or reference number), date and time. If the advice in this document for that error
code is to act only if it is persistent (returns later after pressing Reset), use the logbook to look for
earlier records of the same fault.
The following pages give a flowchart for diagnostics and some interpretations of LED colours.
Diagnostics Flowchart
LED Interpretation
LED colour Reason Procedure
Processor Processor Collect the processor’s current system log. The procedure is
'Healthy' module fault. described in a section below.
LEDs are Obtain a replacement processor module of the same or later build.
red Swap to the replacement module.
If the fault appeared after the processor was restarted, it is likely that
a memory corruption has occurred because the education process
had not completed. Refer to TN20014.
Processor No application The Standby Processor takes a few minutes to synchronise with the
'Educated' Active Processor. It should not be removed during this time. If it has,
LED is not memory corruption may occur. Refer to TN20014.
steady The Active Processor may have no application loaded.
green
The Standby Processor may not have started or completed its
education from the Active processor.
Active Application not The 'Run' light is always steady green on the Standby processor.
Processor running The Active processor should show a flashing green 'Run' LED.
'Run' light
is not The 'Run' LED is off when the application in the Active processor is
flashing stopped.
green If the ‘Run’ LED has stopped when it should not have, then the
system has detected a fault of some kind. This should be reported to
Technical Support. Refer to the section below describing Action on
Processor Shutdown. This describes how to collect diagnostic
information which may be lost during attempts to restart.
Processor The processor The 'Inhibit' LED flashes green when any input or output is locked, as
'Inhibit' cannot be hot a warning. This LED also flashes green when the current Standby
LED is swapped Processor has an incompatible system configuration. A changeover
flashing from the Active to the Standby processor will not work if the Inhibit
green LED is flashing. To enable a swap in this second case, remove and
reinsert the Standby processor to load the system configuration.
Expander No comms These three LEDs monitor the communications on the three cables
Processor from the Expander Interface. Note that no communications will be
Tx/Rx shown whilst the Expander Processor is in Standby, the Expander
LEDs not Interface is not operating or the system is starting up. If one
flickering communications link LED is off, check the cable and connections for
yellow that link.
I/O module A fault has been Collect the error code using the Toolset debugger, as described in a
'Healthy' detected on the section below. Look up the advice in the Error Code Descriptions
LED slice but the below. Note the fault code, module reference, date and time in a log
flashing slice is still book and press Reset.
red operating
I/O module A slice of the You cannot clear these faults or restart the slice by pressing Reset.
'Healthy' triplicated You cannot get any logs from this slice without restarting it.
LED module has Obtain a replacement module of the same or later build. Swap to the
steady red been set offline replacement module. Remove the faulted module.
and has been
disconnected. Insert the faulted module into an unused slot (one for which scanning
is not disabled in the system configuration, but which is not
connected to an I/O cable or hot-swap cable).
If the slice fails to start, return the faulty module for repair.
If the faulted slice starts, collect the I/O module log from that slice
(described in a section below). Read the end of the log for error
codes. Look up the advice in the Error Code Descriptions below.
I/O module Field fault Channel LED settings are system specific and may be configured in
Channel the System Configuration. Check the meaning of the LED colour and
LEDs are investigate the channel wiring.
not off or
steady
green
Toolset Diagnostics
If you need to find an I/O module fault code, use the Toolset to access the data in the system. The
data also includes further data on the system, e.g. temperature, voltages, currents etc.
Open the Toolset, using either your desktop shortcut or the Start menu ( Start | All Programs | Trusted
| Toolset ).
Open the application running in the system by double-clicking on its name.
It is possible to connect to the system using either the processor’s front panel serial port or over
Ethernet via a communications interface. Before connecting to the system, check that the
communications port settings are correct. Select Debug | Link Setup.
If you are using a TC-304 maintenance cable to connect to the processor’s front panel serial port,
check that ‘Communication port:’ is set to COM1 (or whichever serial port you are using on the PC).
Check that the maintenance cable is plugged into the PC serial port and the processor’s front panel
port.
If you are using Ethernet, check that TMR System is selected. This option is at the bottom of the list
and you need to scroll down to see it. Click on Setup and check the IP address is set. Hopefully for
existing site systems, the communications settings will already be set up. Check that the system and
PC are connected to the Ethernet network with addresses on the same subnet.
Ensure the processor keyswitch is turned to ‘Maintain’ if using a Toolset before build 103. Toolsets
before build 103 will not communicate with the system if the keyswitch is in the ‘Run’ position.
Toolsets from build 103 will communicate read-only if the keyswitch is in the ‘Run’ position.
Select Debug | Debug. A long thin window entitled IEC1131 TOOLSET – (application) – Debugger
should appear. This will have a bold black line of text giving the state of the application. This window is
called the Debugger window and it is the key to all online controls. To disconnect from the system,
close this window and all other online windows will also close.
If the black line of text says RUN, you are connected to the system. Go on to the next page.
If the black line of text does not say RUN, and the system is clearly running (flashing ‘Run’ LED on the
processor), then there is likely to be a communications problem.
Using a serial port, you will see ‘Disconnected’. Check that the keyswitch is set to Maintain and the
maintenance cable is connected. Then try to connect again. Using Ethernet, the online session will
abort with the message ‘Cannot install the communication’. You will need to close down all Toolset
windows to reset this error.
Check the keyswitch position. Check that the Ethernet network is connected by sending a ‘ping’
command to the communications interface port using the following command (with the appropriate IP
address) in a command window. Then try to connect again.
Open the I/O connection table by clicking on the icon shown above or menu Project | I/O connection.
Each module in the system has an equipment definition. Imagine this as a marshalling terminal rail,
with several blocks of terminals. These terminals are shown as icons appearing like screws, in several
different terminal blocks called boards. Each board is used to send data to or from the module, and
some of it is useful for diagnostics.
Each equipment definition is described in the product description for the module. Note that equipment
definition ttmrp is the processor (see PD-T8110B) and tci is the communications interface (but there is
no data on the tci boards).
Each equipment definition is allocated to a chassis and slot position where the module is. Click on the
first board in the definition (for the 8403 shown, click on DI). At the top of the data on the right is the
chassis and slot position.
If you are looking for an error code, find the definition with the same chassis and slot number as the
faulty module. Then click on the HKEEPING board and scroll down to the last three channels. In the
example above, the module is healthy. All three slices are reporting a zero error code.
If any of the last three channels is not zero, note the number.
Select ‘Dec’ (decimal) as shown above. Type the error code number. Select ‘Hex’ (hexadecimal). The
calculator will convert the number.
Look up the hexadecimal number in the table in this manual. The error codes are listed in
hexadecimal number order, which is like decimal but has six extra digits:
0,1,2,3,4,5,6,7,8,9,A,B,C,D,E,F,10,11,...
In this case, 52?? Indicates an input channel fault. It is on channel 16, because 10 in hexadecimal is
16 in decimal.
Follow the advice given for the error code.
You can avoid having to convert the error code if an integer variable is wired to each channel, and its
format is set to 1A2B (four digit hexadecimal). The toolset will convert the value into hexadecimal for
you.
It is important to check for error codes before pressing the main processor reset pushbutton, because
pressing reset will clear all fault indications and fault filter counters. It is possible that a rare or slow
fault has occurred, due to a genuine problem, which may not occur again for some time.
Once you have collected all fault codes and noted them, you may press the processor Reset
pushbutton to clear them.
Analysis Tool
This application can collect command line diagnostics online from a live system. It can analyse the
collected data and provide advice and reports. It can also analyse logs taken by the macro program
Dumptrux in the same way. It can erase the system logs.
The program installs itself by default into the same directory as the Toolset and other 8000 series
software. It also provides the option of a desktop icon and a Quick Launch icon.
On opening, you can either go online (Online Comms Setup ...) or choose an existing file to analyse
(File Open file). At each point, the bottom banner shows the options available to you.
Open file will open both saved log files and analysed data. A log file has the raw text from the system,
and analysed data files have the log and all reports in it. After opening a log file, you can save the
analysed data. Analysed data can be opened quicker than a log file because the analysis work has
already been done.
Online
The online comms setup allows Ethernet or serial connection. Choose the Ethernet IP address or
serial port number as appropriate.
Choose the automatic diagnostic collection option. If in doubt, use Not I/O.
1) None
This just makes a connection and opens the terminal.
2) Choose
This collects enough data to discover the modules fitted in the system and takes only a few seconds.
It then provides a picture or tree of the system which can be clicked on to get data from each module.
2) Not I/O
This collects all diagnostic data from the processor(s), communication interfaces, expander modules
and chassis, which will only take one or two minutes. This information is often the most important, and
is necessary even if the fault is in an I/O module. It also provides a picture or tree of the system, so
that data can be gathered from any I/O module.
3) All
This collects all the above data, but also collects from all the I/O modules. This will take some time,
especially if the I/O module logs are collected. It may be appropriate to ask a user to collect all data, to
avoid needing to explain which data is required.
For a serial connection, use a TC-304 maintenance cable. Check that the maintenance cable is
plugged into the PC serial port and the processor’s front panel port. Ensure the processor keyswitch is
turned to ‘Run’. The Analysis Tool cannot communicate if the keyswitch is in the ‘Maintain’ position. If
the processor front panel port does not seem to work, the Analysis Tool can also be connected to a
communication interface front panel port.
If you are using Ethernet, check that the system and PC are connected to the Ethernet network with
addresses on the same subnet. The keyswitch can be in either position.
On clicking OK, you should see a title line (which assists the analysing code) and a prompt (ci:? for
Ethernet and mp:? for serial). If you only see an empty window or the program reports it cannot make
a connection, there is no communications. For a serial connection, check the cable and the keyswitch
position, then press Enter to request a prompt. For an Ethernet connection, check that the Ethernet
network is connected by sending a ‘ping’ command to the communications interface port using the
‘ping’ command (with the appropriate IP address) in a command window as shown in the Toolset
diagnostics section above. Then try Online | Comms Setup again.
Online Options
Once the online collection has finished, more options are available on the Online menu.
Auto set time/date sets the time automatically from the computer’s clock. It firstly measures the
delay between sending a new line and receiving a prompt, and then plans a moment to set the time
every ten seconds. On setting the time, the new line is sent at a calculated moment before the second
to attempt the most accurate time synchronization. Serial is the fastest medium with latencies of a few
milliseconds; Ethernet latencies are at least twenty times longer. This still cannot be accurate to the
millisecond, and only IRIG can provide true millisecond timestamping.
The animation shows the current time and the planning for the next event to set the time. This
animation takes significant processing time, and so the time setting will be more accurate without the
animation.
Use the Time Offset to adjust the time to be set, in case the system must be in a different time zone to
the computer.
I/O Module Options provides the diagnostic privileged access password and chooses the data to
be collected from I/O modules. The Analysis Tool must have the diagnostic password entered before
collecting data from I/O modules. The options dialog will appear if you try collecting or erasing I/O
modules without having entered the password.
Enter the diagnostic password. The password may be changed using the System.INI configuration file;
if in doubt, contact your technical support.
You can also choose:
Collect logs: whether to get the event logs from the I/O modules plus general data, or just the
general data. This can be used to speed up collection if you aren't interested in the logs; they can take
a long time to collect and can be very large.
If you de-select Erase all logs in system, you can choose which module logs to erase. You can
select one I/O module from a drop-down list.
The Analysis Tool will collect the first 90 lines of each I/O module log before erasing it. It checks these
lines for manufacturing test entries, and puts these test entries back into the log after erasing it. This
assists with module diagnosis and repair. Note that Dumptrux does not preserve these test entries
and should no longer be used for erasing logs.
Analyse Data passes the collected data to the analysing side of the program, which will prepare
reports. Choose this option when you have collected all the data you want, and you want to analyse it.
After this, the program behaves exactly as if you had opened an existing file through File Open file.
You can still go back online however, using menu option Online Terminal. The analysis side of the
tool is described later.
Offline
You can either:
1) analyse some data you have collected online (see above) or
2) open an existing log file (collected using the Analysis Tool, Dumptrux or a terminal program)
or
3) open a file of analysed data previously made by this program.
If you want to analyse your online data, use Online | Analyse Data.
If you want to analyse a saved file, use File | Open log file.
The analyser will read each log command and gather its data, then prepare the data for the reports.
This process can take time on large logs. If it takes forever, you can cancel it by clicking the menu
option Click here to stop. The reports will only contain information from the data analysed so far.
Sometimes the Analysis Tool will be unable to open the log. If it fails, it will give the message shown
above. Some logs are corrupted due to communications noise, and sometimes the I/O modules will
not store their logs properly (the logging process is not the highest priority task and it can be
interrupted). In some cases, the system may contain modules which reply with message formats that
have not been tested (or accounted for) with the Analysis Tool, causing the code to abort. There is
little that can be done to cope with corrupted logs, but it may be possible to account for an unknown
reply format. Please send the log as instructed.
File Menu
The File menu provides basic tools as follows, depending on which view is shown:
• Open file: chooses a new file to open. (Ctrl-O will also work)
• Save analysed data: saves a file containing the log and all report data. (Ctrl-S will also work)
This can be opened later by the Open analysed data option above. You also have the option
to make a compressed file. This will be easier to email.
• Save Versions CSV: saves a text file of serial numbers and firmware versions for each
module.
• Save this report: saves the currently displayed report to a file.
• Copy selection: copies the selected text for pasting into other documents. (Ctrl-C will also
work)
• Print this view: sends the current view to the printer. (Ctrl-P will also work)
• Exit: closes the Analysis Tool.
View Menu
The View menu lets you choose the different reports and displays that the Analysis Tool provides. The
contents of this menu will change depending on the report view you have chosen. The reports are
described on the next pages.
Log View
This shows the initial view of the whole file, as if it was in a text editor. Experienced engineers can
search through the data for details that may not be captured by the reports. Use the Find menu to help
with searching.
Find Menu
The Find menu has the following options for manual searching through the log.
• Text … : simple text search as in Notepad, with
similar options. (Ctrl-F will also work)
• Current Log: goes to next ls d command in the log
(from the active processor, standby processor or
communications interface)
• Backup Log: goes to next ls b command in the log
• Chassis/slot ... : goes to the next I/O module prompt for the given
chassis, slot and slice.
Analysis View
This view shows a representation of the system and provides detailed reports on parts of the system.
It will show the active processor’s individual report when opened. A navigational window lets you
choose a report.
This shows a simple picture of the system, and provides access to the
reports described below.
Hovering the mouse over a module will show its description in the title,
and an explanation of the colour.
Each I/O module has up to seven reports. Left-click on the module for the main report. This provides
serial numbers, versions and the error codes currently reported on each module. These are the same
error codes as found through the Toolset Diagnostics.
Right clicking provides the system event logs on each slice (if the file includes them), and two or three
data reports. The system event logs include a lot of data, so the main events have been extracted for
the report.
Hold the mouse pointer over a row in the table to get some advice on an event. All faults with fault
codes have advice for further diagnostics and remedy. This is taken from the advice in the Error
Codes Descriptions in Appendix A of this document, and includes all its maintenance advice.
You can filter this data using the menu option View | I/O Log Options.
• You can see only recent events by selecting Only from and choosing a date.
• You can chose to see only important categories by selecting Only at/above and choosing a
level. Events at this category or higher will be shown.
To see your new choice of data, click OK and open the log report again.
Bookmarks
If you click on a log event, you can add a bookmark and a comment by choosing the Bookmark menu
option. The event in the log then has a bookmark icon. Bookmarks and their comments are saved with
the report data when you choose ‘Save analysed data’, so that you can send the analysed data file for
further investigation.
When there are bookmarks, the View menu has an extra option ‘Bookmarks’. This shows all the
bookmarks in the analysis, so a support engineer can find them later.
• Channel Data is useful for comparing the measurements or states of the three slices. Note
that log data collection results in snapshots from the three slices at different times, so
differences may just reflect changing channel states.
• Threshold Data provides the input state measurement thresholds operating in the module.
Module Versions
This is a simple report of all the module firmware versions in the system. It is useful for collecting
serial numbers, firmware versions and module types and gives a quick idea of the shape of the
system.
System Health
This report collects all the most important advice that the Analysis Tool has provided into one report. It
is a global report of the health and state of the system. You can use this report as a single point of
advice for the whole system. It reports all current faults on I/O modules, advises on firmware upgrade
needs, reports communication configuration problems and I/O module configuration problems.
Entries are categorised and coloured in a similar way to the I/O module logs and graphical layout (see
page 22).
Hover the mouse over an entry for more advice.
Bookmarks
If you click on an entry, you can add a bookmark and a comment by choosing the Bookmark menu
option. The entry then has a bookmark icon. Bookmarks and their comments are saved with the report
data when you choose ‘Save analysed data’, so that you can send the analysed data file for further
investigation.
When there are bookmarks, the View menu has an extra option ‘Bookmarks’. This shows all the
bookmarks in the analysis, so a support engineer can find them later.
These logs show date-stamps at the start of each day (e.g. MON 2009-03-23 above). Each entry has a
time stamp, the number and name of the task that wrote the entry (e.g. 26 IMB:) and a text description
of the entry. Most entries will be due to normal operation. To find the current state of the system, read
the log from the bottom, up to the first ‘fault reset’ entry. Some possible entries are shown below.
NIO: Disabling interface (Rack 1 Slot A module slice is being set offline by the processor;
5), slice C
this will appear in the I/O module as an 0x8741 fault.
Check the First-Line Diagnostics advice for a steady
red I/O module 'Healthy' LED
NIO: CLI error (FAIL) - Chassis 3 Slot 7 The I/O module slice has stopped communicating
Slice C
NIO: Lost (Rack 3 Slot 1) … A module has been removed
NIO: Simulating … The system is running without the module
NIO: Module(s) The processor is warning that it is still running without
removed/unconfigured/simulated
all modules
IMB: Expander FCR fault set - … Communications via expander modules has failed
NIO: Channel 6 Discrepancy (Rack 6 Slot I/O point measurement discrepancy; this will appear in
5), slice C
the I/O module as a 0x70nn fault. Check the First-Line
Diagnostics advice for a flashing red I/O module
'Healthy' LED
CLI Response Error FAIL - Chassis 2 Slot A module is missing on startup
11
FPS: [Manual] System fault reset The reset pushbutton has been pressed
NIO: Linked (or) Unlinked Chassis 3 Slot Two I/O modules have been partnered for a hot swap
7 - Chassis 3 Slot 12
(linked) or the partnership has been broken (unlinked)
NIO: Impending module removal set (or) The ejector tabs on this module have been opened (or)
cleared - Chassis 2 Slot 9
closed. If the log reports alternating set/cleared
messages, the ejector switches are faulty.
NIO: Slice fault - Chassis 4 Slot 9 A module has a fault but is still running. Check the
First-Line Diagnostics advice for a flashing red I/O
module 'Healthy' LED
NIO: Module not properly configured - The I/O module has rejected its System.INI
Chassis 3 Slot 2
configuration. It will have shut down on starting the
system.
NIO: Illegal 'standby' module state The I/O module was set Active but went back to
(fatal) - Chassis 4 Slot 1
Standby. This probably indicates that it has rejected its
System.INI configuration.
IMB: Permanent minor fault (MBCU) Indicates a permanent problem with system
Chassis 4 Slot 13 FCR C
communication to the given chassis, slot and slice.
IMB: Permanent fatal fault (MBCU) - The I/O module has shut down
Chassis 3 Slot 2
NIO: Slice state discrepancy - Chassis 2 This slice is in a different state to the other two
Slot 11 Slice A
Self Test: FCR A(B,C) BACKGROUND monitor A confirmed discrepancy was detected between
- permanent fault
memory data on the processor’s three slices.
‘Transient’ faults indicate that the fault has been found
but has not been confirmed yet
Self Test: FCR (A,B,C) MBIU SAFETY LAYER The voting circuits on the processor’s interface to the
COMMON test - permanent fault
system bus are faulty. This is often caused by inserting
an I/O module into one of the processor slots, which
damages the voter ICs.
I2K: Peer connection lost I2K is ICS2000 interface; irrelevant if not used
UART: Port 2 not supported by hardware - An old (8110) processor has been upgraded with new
config. Ignored
firmware; this is not a problem unless the nonexistent
IRIG and serial facilities are needed
CFS: Overflow in SOE buffer - Chassis 0 Event data may have been lost during MP startup
Slot 0
CFS: Overflow in SOE buffer - Chassis 1 Event data may have been lost during MP hotswap
Slot 8
IMB: Expander FCR fault set - chassis 1 A fault has been reported on an expander interface
slot 1 FCR B
24 IMB faulted The IMB comms has been starved of processing time.
24 IMB trip watchdog … If a few seconds after application load, on a system
with no native I/O, then the sleep period has been set
too short. 32ms is the recommended default.
ISaGRAF: Scanning started (or) stopped The application has started (or) stopped
IMB: Signal discarded due to slow A communication interface is not responding
connection
PIO: Stopped Peer Comm … Peer to Peer communications stopped
PIO: Bad chassis/slot for … board Incorrect Peer to Peer configuration in this or another
system
PIO: Received an invalid ack. to board The standby processor has failed to acknowledge the
from standby peer
transfer of the Peer to Peer board setup. When a new
application or online update is loaded, it is transferred
to the standby processor. This message may indicate
that the standby processor has failed to educate.
Check the standby processor system log.
A/S: Standby processor static education The standby processor has finished receiving and
completed
saving the new application.
A/S: Standby processor reporting ill The standby processor is not responding to the
health
education process. Check the standby processor log.
The usual cause is that the ejector tabs are not closed
or the ejector switches are faulty (see TN20016).
A/S: Handover inhibited - ISaGRAF Variables have been locked or unlocked in the
variables locked
application. The processors will not hot-swap if there
A/S: ISaGRAF variables released
are locked variables (the Inhibit LED will flash on the
active processor).
IRIG: Maximum update interval exceeded The system is configured to receive IRIG-B time
signals but is not receiving a signal. This is a common
cause of the System Healthy LED flashing red with no
other indication of fault.
FPS: Module ejectors open The processor’s ejector tabs are open (or the ejector
switches are faulty)
SYS: Module power fail The processor has been turned off or removed (usually
the last entry in a backup log)
Here, a discrepancy between slice A (‘FCR’, Fault Containment Region) and the other two slices was
detected in address F76CA0 hex. This was seen five times, which is the count required to declare a
permanent fault. At this point, the slice was disabled. As a result, the IMB later detected a
communications fault. This is much less common after firmware build 115 because errors are
corrected as they are found.
The MP should be swapped to a spare MP, to allow operation to continue. Then the memory can be
cleared to allow a fresh start.
Cold start
Attempting auto boot, press <ESC> to abort ...
At this point, press the Escape key (ESC) at the terminal program. The MP will report Auto Boot
Aborted and show the boot prompt ‘>’. There is now no application running in the MP; it is only
running the basic low-level boot system.
Type the following:
> envram
This command ‘Erases the Non-Volatile RAM’. This deletes the FAT table that addresses the flash
memory storing the INI and user application, and also deletes the logs and retained variables. The
processor is then unaware of its INI or application.
Restart the MP by cycling power. Load the system.ini file, restart and load the application.
Paste this into the Windows Paint program and save the file.
Start the application if it reports “No Application” (this may restart the system). If it reports
‘Disconnected’, the Toolset was not able to connect.
Close the debugger. The procedure continues on the next page.
Shutdown Flowchart
This section provides detailed descriptions for each of the error codes reported by I/O modules,
including what is wrong and what to do.
Error codes are described in four digit hexadecimal numbers. The first two digits describe the category
of error, and for many categories the last two digits are a “subcode”, narrowing down which part of the
module was noted as faulty. Fault codes above 0x8000 cannot be cleared by pressing Reset.
“Subcodes” specifiy a faulty channel number, group number, or other attribute. Group number always
designates an output power group. Channel number can designate an input/output channel or a
housekeeping data channel.
“Action” specifies how the slice state or channel state is affected by the fault. The slice can either:
• Continue running. The Healthy LED will flash red. The slice will still communicate, so the fault
is reported in the Toolset debugger and the slice system log can be collected. Note that the
Healthy LED also flashes red when the slice is still in its boot mode and has not started the
firmware.
• Be turned off (OFFLINE). The Healthy LED will be steady red. The slice is not communicating,
so the only way to diagnose it is to swap or remove it, restart the module and collect the slice
system log.
Keep a logbook for recording error codes from I/O modules. Record the error code, module position
(chassis and slot or reference number), date and time. If the advice in this document for that error
code is to act only if it is persistent (returns later after pressing Reset), use the logbook to look for
earlier records of the same fault.
Glossary
Codes 0x0001 to 0x01FF These fault codes report firmware coding errors and should not be
Modules: All seen in the field. Any example of these fault codes should be reported
to Technical Support. The module will fail to start.
Codes 0x0200 to 0x02FF These indicate faults in the Flash memory. Return the module for
Modules: All repair.
Codes 0x0400 to 0x04FF These indicate faults in programming the host interface ASIC. The
Modules: All except 8480 (see module will fail to start. Return the module for repair.
note) The faults are reported by the 8480 as 0x8400, 0x8401 or 0x8402.
IMB FATAL ERROR Detects communication errors on the IMB, including the chassis
0x07nn (to 0x073F) backplane. It also detects faults in the HIA.
Modules:All Transient errors may occasionally occur in normal operation, so the
firmware logs these faults to check for a pattern. Single cases can
nn: Error flags in 6 bit word: therefore be recorded and then ignored, but repetitive cases indicate a
1: BSU Finite State Machine error module fault.
2: Timeout error If faults appear on more than one modules in a chassis, then the
4: Slot error expander processor is faulty. If the module goes offline, replace the
module.
8: Framing error
Action:
10: Symbol error
5 occurrences => enable logging (limited to 20 entries)
20: Packet error
50 occurrences => disable logging, slice OFFLINE
IMB_DOUT_RESET The processor has requested a slice reset. This provides a
0x0740 mechanism to reset the slice without removing / inserting the module.
The slice goes into a dormant state (less severe than 0x8740 because
Modules: 8442 the 8442 must hold its last output states) then resets itself.
This error is simply a by-product of the reset process; the I/O module
The other modules report this log or processor log may indicate the reason in earlier events.
fault as 0x8740. Action:Slice in ‘Shutdown’ state (same as red Active LED on inserting
a module in an active slot), then it should automatically restart.
IMB_DOUT_DISABLE This provides a means for the processor to disable a slice. Some slice
0x0741 faults can only be detected by the processor. In this case the
processor must have a mechanism for turning off a faulty slice. The
Modules: 8442 action is less severe than 0x8741 because the 8442 must hold its last
output states.
The other modules report this This fault is always a secondary symptom of an earlier fault; the I/O
fault as 0x8741. module log or processor log will indicate the primary fault.
Action:Slice in ‘Shutdown’ state (same as red Active LED on inserting
a module in an active slot), will reactivate on pressing Reset.
FIA_INVALID_CALIBRATION Detects invalid calibration data stored in FLASH. The calibration data
0x0804 will also be invalid if the module has never been calibrated. Return the
module for repair.
Modules: All except 8442, 8472,
8473 Action:Slice OFFLINE on transititon to ACTIVE
Some of these codes refer to ‘upstream’ and ‘downstream’ slices. The definitions depend on the circuit
concerned, but for the 0x1000 and 0x2000 codes relating to inter-slice communications, the definitions
are:
On slice A: ‘upstream’ = slice B, ‘downstream’ = slice C
On slice B: ‘upstream’ = slice A, ‘downstream’ = slice C
On slice C: ‘upstream’ = slice A, ‘downstream’ = slice B.
FLASH_VPP_LOW_ERROR Detects a bad FLASH device, interface, or low supply voltage whilst
0x1002 erasing or writing firmware. Check the system supply voltage. If the
supply is healthy, return the module for repair.
Modules:All when starting
Action:Slice will not boot
HIU_OVI_FAULT Checks the operation of the power supply over-current (OVI) trip.
0x1004 Return the module for repair.
Modules:All except 8424, 8442, Action:Slice fault
8480
HIU_OVV_FAULT Checks the operation of the power supply over-voltage (OVV) trip.
0x1005 Check the system supply voltage, else return the module for repair.
Modules:All except 8424, 8442, Action:Slice fault
8480
HIU_HKAD_TIMEOUT This is a timeout on fetching the HIU HKAD data. Detects a faulty
0x1006 Housekeeping A/D or faulty interface to the HIA. Return the module for
repair.
Modules:All
Action:Slice fault
HIU_ISL_NO_SYNC Checks for two fault conditions -
0x1007 1) test loop synchronization with neighbor slices via ISL (increments
Modules:Output except 8480 fault filter on failure)
2) test loop stall (4.37 minutes max)
Detects faults in the ISL and/or a dead slice that prevents test
synchronization. Also detects a test loop stall, possibly caused by
excessive switch command transitions that cause test abortion.
If another slice is offline, ignore this fault. Failing this, check for heavy
switching demand. Failing this, return the module for repair.
This fault can lead to a module shutdown on firmware before release
3.5 (see TN20056).
Action:Slice fault
HIU_ISL_CRC_ERR This indicates a data check failure on data from the other slices over
0x1008 the inter-slice link. Detects a corrupted ISL link, including faulty ISL
RAM in the sending or receiving slice.
Modules:All except 8442, 8472,
8473, 8480 If a slice is offline, these faults are common in the remaining two slices
and can be ignored. If all slices are online, and the fault is persistent,
return the module for repair.
Action:Slice fault
HIU_ISL_STUCK_U_ERR This indicates stalled communications with the ‘upstream’ slice.
0x1009 If the fault is persistent, return the module for repair.
Modules:Input Action:Slice fault
HIU_ISL_STUCK_D_ERR This indicates stalled communications with the ‘downstream’ slice.
0x100A If the fault is persistent, return the module for repair.
Modules:Input Action:Slice fault
HIU_HKAD_ERR This test verifies min/max limits on the host interface (HIU)
0x101n Housekeeping A/D (HKAD) channels. Detects faulty operation of the
HIU power system and overload faults on the HIU and front panel unit.
Modules:All It also detects errors in the Housekeeping A/D and its serial link to the
Subcode: n = HKAD channel host interface ASIC. The HKAD data can be checked with the
5V HIU, n = 0 to 7 command ‘get reg HKAD’.
3.3V HIU, n = 0 to 10 These faults may be tripped by severe genuine power supply voltage
or temperature excesses (at 37 volts or 90 degC). Due to the likely
damage and ageing, the advice remains:
Return the module for repair.
Action:Slice OFFLINE
HIU_SSL_ERR This test compares the received SSL (Smart Slot Link) tag from the
0x1020 partner module, to the value that the MP says should be received.
Modules:All Verifies that the SSL connection between partnered modules is
operational. It detects opens and shorts on one or more of the smart
slot links.
In order for this fault to be detected, the module must already be
partnered. If the module is inserted without the Smart Slot jumper
cable, the MP will not partner the module and the fault will not be
detected.
The fault is not consistently indicated on both the ACTIVE and
STANDBY modules, although it is always reported on the STANDBY
module.
During a hot swap, if the new module indicates faults, try another
module. If the second module is successful, send the first for repair.
When swapping back from a Smart Slot to a default slot, always
remove the Smart Slot module from the chassis before disconnecting
the cable. If the cable is removed first, this fault will occur and the
processor will swap back to the Smart Slot module, which is now
disconnected from the field.
Action:Slice fault
HIU_ISL2_SYNC_TST Detects latent faults in the inter-slice link voter logic and discrepancy
0x103n detection logic by injecting discrepant data every minute.
Modules:8442,8472, 8473 If a slice has gone offline, this fault may appear on the other two slices
and may be ignored. Otherwise return the module for repair.
Subcode: n = discrepant slice ID
Action:Slice fault
(0, 1, 2 for slice A,B,C)
HIU_ISL2_SYNC_ERR Detects a failed slice or a faulty ISL bus connection between slices.
0x104n If a slice has gone offline, this fault may appear on the other two slices
Modules:Output and may be ignored. Otherwise return the module for repair.
Subcode: n = Sync Error bit 0 = fault on this slice
position (0, 1, 2) 1 = fault on upstream slice
2 = fault on downstream slice
Action:Slice fault
HIU_PWRFAIL_DISC Detects a discrepant Power Fail signal from the processor.
0x1050 The Power Fail signal seen on this slice differs from that seen by the
Modules: All from firmware 201 other two slices. The Power Fail signal reports loss of power in the
processor and also fatal faults in the expander interface and expander
processor. 2oo3 of these signals put the I/O module into Standby.
This requires a sequence of elimination.
• If there are more than one I/O modules with firmware 201 or
later in the system, in different expander chassis, and they all
report the fault, then the processor or the expander interface
module has the fault.
• If only the modules in one expander chassis report the fault
and others with firmware 201 or later in other chassis do not
report the fault, then the expander processor has the fault.
• If only one I/O module reports the fault and others with
firmware 201 or later do not report the fault, then the I/O
module has the fault.
Action: Slice fault
HIU_WDOG_DISC Detects a discrepant Watchdog signal from the processor.
0x1051 The Watchdog signal seen on this slice differs from that seen by the
Modules: All from firmware 201 other two slices. The Watchdog signal is wired from the processor’s
own hardware watchdog circuits and indicates a processor hardware
fault or stalled interrupt.
If there are more than one I/O modules with firmware 201 or later in the
system, and they all report the fault, then the processor has the fault. If
only one I/O module reports the fault and others with firmware 201 or
later do not report the fault, then the I/O module has the fault.
Action: Slice fault
Some of these codes refer to ‘upstream’ and ‘downstream’ slices. The definitions depend on the circuit
concerned, but for the 0x1000 and 0x2000 codes relating to inter-slice communications, the definitions
are:
On slice A: ‘upstream’ = slice B, ‘downstream’ = slice C
On slice B: ‘upstream’ = slice A, ‘downstream’ = slice C
On slice C: ‘upstream’ = slice A, ‘downstream’ = slice B.
HIA_INVALID_IMAGE This test checks the Host Interface ASIC (HIA) programming file in
0x2000 FLASH on either loading the boot code or the application firmware.
Detects a FLASH memory fault or missing HIA boot or application
Modules:BOOT CODE code. The module will stay in boot mode. Return the module for repair.
Action:Slice will not boot
ISL_STARTUP_ERR The slice has failed to establish synchronisation with the other slices
0x2000 after 20 seconds from starting up. Return the module for repair.
Modules: 8480 Action:Slice OFFLINE
HIA_INVALID_IMAGE_CRC This test performs a CRC check on the HIA programming file in FLASH
0x2001 on loading the boot code or application firmware (only the one being
loaded is checked). Detects a FLASH memory fault or corrupted HIA
Modules:BOOT CODE boot or application code. The module will stay in boot mode. Return
the module for repair.
Action:Slice will not boot
HIA_CONFIGURE_ERROR This test reports on errors seen whilst programming the Host Interface
0x2002 ASIC. Detects a faulty HIA or a faulty interface between the HIA and
the DSP. The module will stay in boot mode. Return the module for
Modules:BOOT CODE repair.
Action:Slice will not boot
HIA_HKAD_TIMEOUT This is a timeout on fetching the HIU HKAD data. Detects a faulty
0x2003 Housekeeping A/D or faulty interface to the HIA. Return the module for
repair.
Modules:BOOT CODE (5V only)
Action:Slice will not boot
(See 0x1006 for 3.3V modules)
ISL_MULTI_ERR Communications status errors detected with both of the other two
0x2070 slices.
Modules: 8480 Return the module for repair.
Action:Slice OFFLINE
ISL_CRC_ERR CRC data check failed on communications with another slice.
0x208n Return the module for repair.
Modules: 8480 Action:Slice fault
Subcode: n = discrepant slice
(1 = ‘upstream’, 2 =
‘downstream’)
FIA_INVALID_IMAGE This test checks the application firmware in FLASH on loading the
0x3000 Field Interface firmware. Detects a FLASH memory fault or missing
firmware. The module will stay in boot mode. Return the module for
Modules:All repair.
Action:Slice will not boot
FIA_INVALID_IMAGE_CRC This test performs a CRC check on the firmware in FLASH on loading
0x3001 the Field Interface firmware. Detects a FLASH memory fault or
corrupted firmware. The module will stay in boot mode. Return the
Modules:All module for repair.
Action:Slice will not boot
FIA_CONFIGURE_ERROR The field interface adapter controllers couldn’t be initialized. The
0x3002 module will stay in boot mode. Return the module for repair.
Modules:All Action:Slice will not boot
FIA_NOT_PRESENT The field interface adapter did not respond when turned on. The
0x3003 module will stay in boot mode. Return the module for repair.
Modules:All Action:Slice will not boot
FIA_POWERUP_ERROR The field interface adapter did not draw the expected current when
0x3004 turned on. The module will stay in boot mode. Return the module for
repair.
Modules:All
Action:Slice will not boot
FIA_CONFIGURE_TIMEOUT The field interface adapter controllers’ initialization took too long. The
0x3005 module will stay in boot mode. Return the module for repair.
Modules:All Action:Slice will not boot
FIA_HIA_SYNC_FAULT Synchronisation fault on the Host Interface adapter. Return the module
0x3006 for repair.
Modules:BOOT CODE Action:Slice fault
FIA_FIA_SYNC_FAULT Synchronisation fault on the Field Interface adapter. Return the module
0x3007 for repair.
Modules:BOOT CODE Action:Slice fault
FIA_CHANNEL_FAULT Detects a faulty channel on the field interface adapter (see also
0x31nn 0x52nn)
Subcode:nn = output channel Swap the module and test it in an unused unconnected slot. If it still
indicates the same fault, return the module for repair. If its replacement
0x01 to 0x28 (1 to 40) has the same fault, check the field circuits and earthing.
Modules:8480 Action:Slice fault
APP_INVALID_IMAGE This test checks the application firmware in FLASH on either loading
0x4000 the boot code or the application firmware. Detects a FLASH memory
fault or missing firmware. The module will stay in boot mode. If loading
Modules:All application firmware, erase and try again, else return the module for
repair. If loading boot firmware, return the module for repair.
Action:Slice will not boot
APP_INVALID_IMAGE_CRC This test performs a CRC check on the firmware in FLASH on either
0x4001 loading the boot code or the application firmware. Detects a FLASH
memory fault or corrupted firmware. The module will stay in boot
Modules:All mode. If loading application firmware, erase and try again, else return
the module for repair. If loading boot firmware, return the module for
repair.
Action:Slice will not boot
APP_BOOT_ERROR Checks to see if the firmware application failed to boot the previous
0x4002 time. A code is stored in memory when the boot process fails. This
code prevents further boot attempts. If the module fails to start, return it
Modules:All for repair, otherwise ignore this fault.
Action:Slice will not boot
APP_STACK_FAULT This test checks the stack in the background. Checks for S/W errors
0x4003 that corrupt the stack. Report to Technical Support.
Modules:All Action:Slice OFFLINE
APP_SELF_TEST_FAULT Declares a fault if the test task does not execute within 2 minutes
0x4004 (input modules) or 30 minutes (output modules). Also set if one of the
checkpoints in the code is not run, i.e. not all expected code has run.
Modules:All
This is common after a poor startup or a bypass timer lockout and is
usually a secondary symptom of these or other faults. If it occurs with
no recent fault that would have caused the slice to turn off, report to
Technical Support.
Action:Slice OFFLINE
SFIU_PI_LATE The pulse input testing has been delayed too long. This is caused by
0x4020 abnormal processing burden but may indicate a task scheduling
problem. Report to Technical Support.
Modules:8442
Action:Slice fault
FIU_FIA_SYNC_FAULT Detects noise in the IFIU comms link, also in the HIA and FIA. Return
(input modules) the module for repair.
0x5002 Action:Slice OFFLINE
Modules:Input
FIU_FIA_SYNC_FAULT Detects serious errors (or noise) in the OFIU comms link such as a
(output modules) faulty optocoupler or a dead output group. Return the module for
repair.
0x5002
Action:Slice OFFLINE
Modules:Output except
8472,8442
FIU_REF_DRIFT Checks internal reference channel 41 for drifting. This channel is used
0x5004 to monitor the live channels. If the fault is persistent, return the module
for repair.
Modules:Input
Action:Slice fault
OFIU_FIA_SYNC_FAULT This is similar to 500n above for 8472, but shows that all four
0x5004 quadrants are faulty. This may be a slice fault, so the slice is set
offline. Return the module for repair.
Modules:8442,8472
Action:Slice OFFLINE
FIU_REF_OUTOFBOUNDS Checks internal reference channel 41 for passing limits. This channel
0x5005 is used to monitor the live channels. Return the module for repair.
Modules:Input Action:Slice OFFLINE
HSIU_RAM_TEST_FAULT Detects any fault in free RAM. Return the module for repair.
0x5005 Action:Slice OFFLINE
Modules:8442
OFIU_IMON_BAL_FAULT Checks for imbalance in output circuit current measurement. If the fault
0x5005 is persistent, return the module for repair.
Modules:8472 Action:Slice fault
IFIU_FREQ_FAULT Checks the HIU and FIU logic that is used to control and generate the
0x5006 FIU operating frequency. The operating frequencies are critical for
input channel fault detection. Return the module for repair.
Modules:Input
Action:Slice OFFLINE
HSIU_RAM_PAGE_FAULT Detects any fault in free RAM that is related to RAM paging. Return the
0x5006 module for repair.
Modules:8442 Action:Slice OFFLINE
OFIU_PLL_FAULT Detects faults in the field interface timing control circuitry (the test
0x5006 schedule is locked to the AC cycle). These faults may be a secondary
effect of other faults. If the fault is persistent, return the module for
Modules:8472 repair.
Action:Slice fault
IFIU_FIUCTRL_TEST_FAULT Detects serious failures in the FIU comms link path between the IHIA
0x5007 and IFIA, using an echoed token. Return the module for repair.
Modules:Input Action:Slice OFFLINE
OFIU_ISLSEQ_FAULT Detects failures in the FIU comms link path between slices using a
0x5007 sequential counter. Return the module for repair.
Modules:8472 Action:Slice fault
FIU_DSCRP_TST_LATE The slice discrepancy data collection has been delayed so that the
0x5008 data from each slice is now too far apart to be relevant. Check for other
faults that may explain the delay; otherwise return the module for
Modules: 8442, 8472 repair if the fault appears again after pressing Reset.
Action:Slice fault
HSIU_RELAY_CMD_ Detects errors in the commands to output relays. Return the module
CHECKSUM ERR for repair.
0x5009 Action:Slice OFFLINE
Modules:8442
OFIU_XMON_CNT_ALL_ All channel PICs have failed to respond properly on one quadrant
FAULT suggesting a common fault. Similar to 56mn (indicates a single PIC at
fault). Return the module for repair.
0x501n
Action:Slice fault
Subcode: n = quadrant (0 to 3)
Modules:8472
OFIU_ISL_SW_TEST_DISCREP Software test register is discrepant between this slice and the other
0x5014 two. This fault may be a secondary effect of other faults. If the fault is
persistent, return the module for repair.
Modules:8472
Action:Slice fault
SFIU_IO_TEST_REG_SLICE_ Test control register is discrepant between this slice and the other two.
DISCREP If the fault is persistent, return the module for repair.
0x5014 Action:Slice fault
Modules:8442
SFIU_IO_TEST_ERR Test control register is different to other slices but no slice fault
0x5015 indicated. If the fault is persistent, return the module for repair.
Modules:8442 Action:Slice fault
SFIU_TEST_CTRL_ERR Incorrect data in test control register. If the fault is persistent, return the
0x5016 module for repair.
Modules:8442 Action:Slice fault
SFIU_TRIP_AUTO_RESET Auto-reset of a 1oo3 trip that persisted for too long. This may be the
0x502n by-product of other faults. If the fault is persistent, return the module for
repair.
Subcode: n = output (0 to 5)
Action:Slice fault
Modules:8442
SFIU_TRIP_AUTO_TRIP 2oo3 trip that co-erced a 3oo3 trip to maintain congruency. This may
0x503n be the by-product of other faults. If the fault is persistent, return the
module for repair.
Subcode: n = output (0 to 5)
Action:Slice fault
Modules:8442
SFIU_TRIP_AUTO_RE_TRIP Re-trip after an auto-reset to prevent a spurious drive signal. This may
0x504n be the by-product of other faults. If the fault is persistent, return the
module for repair.
Subcode: n = output (0 to 5)
Action:Slice fault
Modules:8442
HSIU_ISLDATA_AUX_ERR Background checks on the inter-slice change-over data (‘sanity check’
0x505n data word is wrong, CRC mismatch, sequence error). This may be the
by-product of other faults. If the fault is persistent, return the module for
Subcode: n = slice (see text) repair.
Modules:8442 1 = fault on upstream slice
2 = fault on downstream slice
On slice A: ‘upstream’ = slice B, ‘downstream’ = slice C
On slice B: ‘upstream’ = slice A, ‘downstream’ = slice C
On slice C: ‘upstream’ = slice A, ‘downstream’ = slice B.
Action:Slice fault
IFIU_CHAN_INDEP_FAULT Detects crosstalk between channels. This could be a channel to
0x51nn or 0x51mn channel external short or interaction, or a short or interaction inside the
module.
Modules:Input, 8442
For inputs: subcode nn = channel, 0x00 to 0x29 (0 to 41)
(channels 0 and 41 are internal reference channels.)
For 8442: subcode mn, m = group 0 – 2, n = speed input channel 0 –
8.
Swap the module and test it in an unused unconnected slot. If it still
indicates the same fault, return the module for repair. If its replacement
has the same fault, check the field circuits and earthing.
Action:Slice fault
IFIU_CHANNEL_FAULT Detects a faulty input channel. Faults could be on the field interface
0x52nn or 0x52mn adapter or due to field effects; later firmware is more robust to field
faults.
Modules:Input, 8442, 8480
For inputs and 8480: subcode nn = channel, 0x00 to 0x29 (0 to 41)
(channels 0 and 41 are internal reference channels.)
For 8442: subcode mn, m = quadrant 0 – 2, n = speed input channel 0
– 8.
Swap the module and test it in an unused unconnected slot. If it still
indicates the same fault, return the module for repair. If its replacement
has the same fault, check the field circuits and earthing.
For 8480 firmware before 110, this fault can occur during some output
ramps; upgrade to latest firmware.
For 8442, if there are faults reported in the log on more than one
quadrant, the fault is probably outside the module (T8846 or field
input).
Action:Slice fault
IFIU_CHAN_PIN_FAULT Detects a short at the IFIA PRM DAC drive pins and the IFIA sigma
0x53nn delta feedback drive pins. This includes shorts between adjacent pins
and shorts to the power rails. Return the module for repair.
Modules:Input
Subcode nn = channel, 0x00 to 0x29 (0 to 41)
(channels 0 and 41 are internal reference channels.)
Action:Slice fault
SFIU_SPEED_CHAN_QUAD_ Detects a discrepancy between measured tooth periods between
DISCREP quadrants. If the fault is persistent, return the module for repair.
OFIU_IMON_AV_DC_ERR Detects offset errors in the current monitor op-amp. This can be
0x57mn caused by a gain/offset resistor fault or op-amp fault. If the fault is
persistent, return the module for repair.
Modules:8472
Action:Slice fault
Subcode:
m = quadrant (0 to 3),
n = channel (0 to F)
OFIU_VMON_AV_DIFF_ERR Detects a leaky MOSFET in the back-to-back MOSFET pair that
0x58mn comprises each AC switch. It also detects a fault in the VMON sense
resistor for each MOSFET drain. Leakage currents will cause a VMON
Modules:8472 phase imbalance as the AC voltage changes direction. These faults
Subcode: have been seen in modules that may have been contaminated at
m = quadrant (0 to 3), manufacture. If the fault is persistent, return the module for repair.
OFIU_GFSS_BIAS_FAULT Detects a stuck bias signal in the Group Fail Safe Switch (GFSS). This
0x603n fault may also be caused by channel current noise, which may be
caused by poor zero volt referencing or nonlinear loads (the test is less
Modules:Output (not 8442,72 or sensitive from TUV 3.5 firmware). For persistent faults, check the load
80) linearity and try a replacement module.
Subcode:n = output group Action:Slice fault
8471: n = 0 to 3
Others: n = 0 to 4
OFIU_GFSS_EN_FAULT Detects a stuck FETin the Group Fail Safe Switch (GFSS). This also
0x604n includes a stuck GFSS_BIAS_ENABLE in the owner slice, such that
the bias cannot be turned off. This fault may also be caused by
Modules:Output (not 8442,72 or channel current noise, which may be caused by poor zero volt
80) referencing or nonlinear loads (the test is less sensitive from TUV 3.5
Subcode:n = output group firmware). For persistent faults, check the load linearity and try a
8471: n = 0 to 3 replacement module.
OFIU_STUCK_ON_FAULT Checks for an output channel that is stuck ON, i.e the channel has
0x605n been commanded OFF and there is a problem with the load voltage or
load current that suggests the load may still be powered. This is a
Modules:Output (not 8480) critical fault for a safety related output that must be able to de-
Subcode: energise.
8471: n = output group 0 to 3 There are two main sets of fault conditions:
8472: n = channel (0 to F) 1) Output commanded off, but current AND voltage on output
8442: n = output (0 to 5) 2) Output commanded off, and channel is not off, the slice recently
Others: n = output group 0 to 4 changed state and another slice went offline.
See AN-80004 for details on channel state definitions.
Batches of all possible 0x609n, An, Bn and Cn faults have been seen in 8472s with many 58nn faults;
these may be the by-products of manufacturing problems. Return the module for repair.
OFIU_LINK_DSCRP_CMD_TST Detects latent faults in the Field Interface Adapter (FIA) command
0x609n voter logic and discrepancy detection logic. These faults are likely to
be internal to the FIA unless accompanied by 0x60Bn.
Modules:8472, 8442
If a single fault code is persistent, return the module for repair.
Subcode: n = quadrant (0 to 3)
If faults are raised on all quadrants, it may be caused by a dead slice.
If there is not a slice offline, return the module for repair.
Action:Slice fault
OFIU_ LINK_DSCRP_CFG_TST Detects latent faults in the FIA config clock and data voter logic and
0x60An discrepancy detection logic. These faults are likely to be external to
the FIA.
Modules:8472, 8442
If a single fault code is persistent, return the module for repair.
Subcode: n = quadrant (0 to 3)
If faults are raised on all quadrants, it may be caused by a dead slice.
If there is not a slice offline, return the module for repair.
Action:Slice fault
The RPM and acceleration discrepancy faults below have incorrect subcodes. The last digit is always
the input channel number within the speed group. However, the speed group number (0,1,2) has been
OR-masked with the third digit, so these faults will be spread over 60Dn to 60Fn. This error is present
in 8442 firmware builds up to 136 and has not been fixed at the time of writing. Review the other
messages in the log to find the true fault and group number.
8472: mn; m = quadrant (0 to 3), For 8472, the test covers shorts to each switch as indicated by the
quadrant. If the fault is persistent, return the 8472 module for repair.
n = channel (0 to F)
Action:Slice fault
8442: see below
SFIU_RELAY_CONTACT_TST This test exercises the module’s output channel FETs as above but
0x61mn indicates a fault according to the relay that the channel is driving. If the
fault is persistent, return the module for repair. If the fault indicates
Modules:8442 random channels, check the field supply voltage.
Subcode: mn; m = quadrant (0 to Action:Slice fault
3), n = output (0 to 5)
OFIU_CHANNEL_TYPE_ERR A fault is reported if the channel is not configured as an INPUT or an
0x62nn OUTPUT. Since there is no normal cause for this fault, report it to
Technical Support.
Modules:8448
Action:Slice fault
Subcode:nn = channel, 0x00 to
0x27 (1 to 40)
SFIU_RELAY_CONTACT_ERR An output relay contact has failed on the Speed Output FTA. Replace
0x62mn and return the FTA when possible.
Modules:8442 Note that on firmware release 3.5 this error occurs with no SOFTA
present even if marked as absent in the INI; in this instance a firmware
Subcode: mn = quadrant (0 to 3), upgrade is recommended.
output (0 to 5)
Action:Slice fault
SFIU_RELAY_IMON_TST This tests the current flowing in a relay on a speed output FTA during
0x63mn switching tests. Replace and return the FTA when possible.
Modules:8442 Action:Slice fault
Subcode: mn:
m = quadrant (0 to 3) + current
monitor (0 or 4 for #1 or 2),
n = output (0 to 5)
SFIU_RELAY_IMON_ERR Current is measured in a relay coil when it should be de-energised or
0x64mn vice versa on the Speed Output FTA. Replace and return the FTA
when possible.
Modules:8442
Note that on firmware release 3.5 this error occurs with no SOFTA
Subcode: mn: present even if marked as absent in the INI; in this instance a firmware
m = quadrant (0 to 3) upgrade is recommended.
n = output (0 to 5) Action:Slice fault
OFIU_HWOVC_FAULT Detects a faulty overcurrent (OVC) detector. An OVC has been
0x640n detected but it did not lead to a de-energised load, and the OVC alarm
can’t be reset. Return the module for repair.
Modules:8472
Action:Slice fault
Subcode: n = channel (0 to F)
OFIU_OVP_FAULT Detects a faulty overvoltage (OVP) detector. If the fault is persistent,
0x65nn return the module for repair.
Modules:Output (not This error code does not apply to 8448 and 8449 channels that are
8472,8442,8480) configured as inputs.
Subcode:nn = output channel Action:Slice fault
8471: 0x00 to 0x1F (1 to 32) 8442: see below.
Others: 0x00 to 0x27 (1 to 40)
SFIU_RELAY_DRIVE_TST A relay command or diagnostic failed a test for latent faults. This could
0x65mn indicate a module fault or a speed output FTA fault. Replace the
module; if the fault appears on the new module, the fault is on the
Modules:8442 speed output FTA. Action:Slice fault
Subcode: mn = quadrant (0 to 3),
output (0 to 5)
OFIU_LINK_ERR Detects serious errors in field interface comms link such as faulty
0x66nn optocouplers or a dead output group. Also detects channel address
errors in host interface RAM. Return the module for repair.
Modules:Output (not 8472,
8442,8480) Action:Slice OFFLINE
Subcode:nn = output channel 8442: see below.
8471: 0x00 to 0x1F (1 to 32)
Others: 0x00 to 0x27 (1 to 40)
SFIU_RELAY_DRIVE_ERR A relay command or diagnostic failed on the Speed Output FTA.
0x66mn Replace the module; if the fault appears on the new module, the fault
is on the speed output FTA.
Modules: 8442
Note that on firmware release 3.5 this error occurs with no SOFTA
Subcode: mn = quadrant (0 to 3), present even if marked as absent in the INI; in this instance a firmware
output (0 to 5) upgrade is recommended.
Action:Slice fault
OFIU_DATA_FAULT This test provides a high level of integrity for all ADC data
0x67nn (voltage/current monitoring and HKAD). It detects faults in the ADCs
(excluding the input multiplexer) and the data path between the ADCs
Modules:8448 and HIU RAM. This test is necessary on an 8448 because the voltage
Subcode:nn = channel, 0x00 to and current monitoring is used for inputs. Return the module for repair.
0x27 (1 to 40) Action:Slice OFFLINE
SFIU_RELAY_CONTACT_ This test detects crosstalk between relay contacts, group to group.
XTALK_FAULT Replace and return the Speed Output FTA when possible.
0x67mn Action:Slice fault
Modules:8442
Subcode: mn = quadrant (0 to 3),
output (0 to 5)
FIU_HKAD_ERR Field interface housekeeping measurement out of range. This test
0x68mn verifies min/max limits on all field housekeeping A/D channels.
Modules:All The fault can be triggered by noise but is usually indicating a module
fault.
Inputs:
For 8442, check the field supply voltage, else return the module for
680n, n = HKAD channel, 0 to 7 repair.
(see right)
For 8461, hardware revision L, see TN20049.
In all other cases, return the module for repair.
Outputs including 8480 but not
8442, 8472: Action:Slice OFFLINE except 8472 (slice fault)
68mn, m = output group (0 to 3 Input HKAD channels 0 to 7 in order: Condensation, FIU internal
for 8471, 0 to 4 for others), supply voltage, DAC_X2, FIU unregulated input voltage, FIU board
temperature, DAC_X2, DAC_X3, FIU internal supply current.
n = HKAD channel, 0 to 7 (see
right) Output (except 8480) HKAD channels 0 to 7 in order: CHFSS bias
voltage, GFSS bias voltage, Field zero volts voltage, FIU internal
supply voltage, Field supply voltage, FIU board temperature, FIU
8472: 68m0, m = quadrant (0 to unregulated input voltage, FIU internal supply current.
3) 8480 HKAD channels 0 to 7 in order: CHFSS bias voltage, GFSS bias
8442: voltage, Field zero volts voltage, FIU internal supply voltage, FIU board
m=0: quadrant current, temperature, Field supply voltage below ‘top rail’, FIU unregulated
n=quadrant (0 – 3) input voltage, FIU internal supply current.
m=1: 24v 1, n=group (0 – 2) Note that field voltages are measured downwards from the group
common ‘top rail’ so some numbers are negative; see AN-80004.
m=2: 24v 2, n=group (0 – 2)
On the 8472, this test checks the current drawn by the field interface
m=3: 15v 1, n=group (0 – 2) unit, divided into four quadrants. The 8442 also gives quadrant current
m=4: 15v 2, n=group (0 – 2) if subcode m is 0.
0x6B0n Note that on firmware build 130 this error occurs with no SOFTA
present even if marked as absent in the INI; in this instance a firmware
Modules: 8442 upgrade is recommended. If there is a SOFTA fitted on that group,
Subcode: n = group 0 – 2 replace the SOFTA.
Action:Slice fault
OFIU_SW_INDEP All modules except 8472:
0x6Cnn Detects crosstalk between output channels in the same power group.
Modules:Output (except 8442, Also detects shorted switches.
8480) The test is tripped by either a 25% change in current on another
Subcode:nn = output channel channel in the power group during the test, or a current on the channel
greater than NLTHRESH when it had been switched off.
8471: 0x00 to 0x1F (1 to 32)
The load current on all channels in the same power group must be
Others: 0x00 to 0x27 (1 to 40) relatively constant during the test. Beacons and flashing LEDs can
8472: mn: m = quadrant (0 to 3), cause test failures. These devices may require parallel loads or
n = channel (0 to F) smoothing/soft-start circuits to reduce current disturbance.
Check the current fluctuations on all channels in the same power
group. The log may show a channel which is changing current. A very
low NLTHRESH setting may also trip this fault. 8448s and 8461s will
show the cause in an extra line in the log (“NLThresh” or “25pc
change”)
If there is no evidence of noisy loads, return the module for repair.
Firmware from release 3.5 (build 130) is more robust to nonlinear
loads.
8472 only:
8472 is a very different design and this test is a clear indication of a
module fault. Each switch is inverted in turn and the change in state is
monitored on this and other switches. Return the module for repair.
Action:Slice fault
If the channel should be able to drive the input, return the module for
repair.
Action:Slice fault
SFIU_CONTACT_QUAD_ Detects a discrepancy between the two timers that measure the length
DISCREP of a contact change on the SOFTA diagnostic trace link.
0x6F0n Return the module for repair.
Modules: 8442 Action:Slice fault
Subcode: n = group (0 to 2)
MP_CHAN_DISCREP_ERROR This fault indicates that the channel state or value reported by this slice
0x70nn is discrepant with respect to the other two slices. The front panel
channel LED will not indicate the fault unless two or more slices
Modules:All receive the same fault code. A channel discrepancy may be due to a
Subcode:nn = input/output faulty IO channel on that slice or to field conditions.
channel (1-based) For inputs, builds 37 and later detect discrepancy by actual reading,
8471: 0x01 to 0x20 (1 to 32) but earlier builds detect by state. A common cause in earlier builds is
8472: 0x01 to 0x10 (1 to 16) where one slice is on the other side of a state threshold to the other
two channels. This may lead to a slice discrepancy which will result in
Others: 0x01 to 0x28 (1 to 40) the slice being set offline (0x7100)
For inputs from build 37, noisy signals and drifting calibration may
create discrepancies. The discrepancy thresholds may be increased in
the INI if precision is not necessary; see PD-8110B.
For outputs, discrepancies are still detected by state because the
states are more defined, e.g. short circuit, de-energised. A common
cause of output discrepancies is loads near the minimum current,
where one slice may be starved of current and reporting open circuit.
Consider adding resistors to increase the load, or change the no-load
threshold for the group. Collect similar loads on each group. See
TN20031 for 8461. Very noisy loads may also create discrepancies;
check for logged transitions in and out of fault states on that channel.
Check the channel conditions as above. If there is no external reason
for discrepancies, try a replacement module.
Action:Slice fault, channel fault, channel state = 15
MP_SLICE_DISCREP_ERROR This fault indicates that this slice reported a slice state that is
0x7100 discrepant with respect to the other two slices. An example would be
two slices in ACTIVE and one slice in STANDBY. A slice state
Modules:All discrepancy is primarily due to a fault within the slice that either forced
or inhibited a state change.
This fault is almost always a secondary effect of a slice going offline.
Check the log history for the initial cause.
Action:Slice OFFLINE
MP_LRAM_ERROR This fault indicates an LRAM (Local RAM) test failure. The processor
0x7200 occassionally transmits a command containing an ASCII test pattern
(which can often be seen near the beginning of logs). The I/O module
Modules:All calculates a check word for the pattern which is verified by the
processor. The LRAM is a storage buffer in the I/O module which is
part of the interface with the processor.
The I/O module cannot detect external memory access faults and
therefore relies on the processor. Return the module for repair.
Action:Slice OFFLINE
MP_CONFIGURATION_ERROR This fault indicates that the configuration data (from the System.INI file)
0x7300 on this slice is discrepant with respect to the other two slices. After
loading the configuration data, the processor compares the CRC data
Modules:All checks returned from the three slices. If they are different, this fault is
raised on the discrepant slice (or all slices if there is a three-way
discrepancy).
It may be a side-effect of other IMB communications faults causing
data corruptions.
It will occur soon after going into Standby mode. Restart the module to
load the configuration again. If it keeps failing, return the module for
repair.
Action:Slice fault
MP_SLICE_NOT_RESPONDING This fault indicates that the I/O module slice is not communicating with
0x7350 the processor. The fault code may be seen on the HKEEPING board
even though the other channels for that slice are zero (because there
Modules:All is no communications). This fault can never appear in a log.
The likely reason is that the slice has not started. The slice is still in
boot mode, indicated by the slice Healthy LED flashing red (the same
indication as for a slice fault).
A large system log inside the I/O module will delay it from starting up
on all firmware up to 130. This is fixed in later issues of firmware, some
of which are released. If you can successfully restart the module,
erase its logs.
If the slice will not start and go healthy in an unused slot, return the
module for repair.
Action:None; slice stays in boot mode
MP_SAFETY_LAYER_ERROR This tests the I/O module’s Safety Layer, primarily the IMB voting and
0x7400 fault detection circuits in the host interface. The MP occasionally
requests a test packet. The response packet contains a test pattern
Modules:All which is generated by exercising the voter/fault detector logic in the
safety layer. The processor compares the response packet to the
expected pattern to determine the health of the safety layer.
Faults appearing on the same slice on several modules in a chassis
indicates expander module or bus faults.
If the fault appears on only one module, and it returns repeatedly after
pressing Reset, return the module for repair.
Action:Slice fault
MP_PACKET_ERROR This indicates a fault in the Error Packet Generator logic. The
0x7401 processor occasionally sends a packet with a faulty error code and
verifies that the module signals a Packet Error. Return the module for
Modules:All repair.
Action:Slice fault
MP_BIU_TRANSIENT_ERROR This indicates a bus discrepancy due to an I/O module fault. It may
0x7402 also be caused by expander module or bus faults.
Modules:All Faults appearing on the same slice on several modules in a chassis
indicates expander module or bus faults.
If the fault appears on only one module, and it returns repeatedly after
pressing Reset, return the module for repair.
Action:Slice fault
MP_FCR_DECODE_ERROR This fault indicates that the processor detected a slice ID error. The
0x7403 processor writes a different data pattern to each slice and verifies that
the slice returns its own data. Return the module for repair.
Modules:All
Action:Slice OFFLINE
Codes 0x8400 to 0x8402 These indicate faults in programming the host interface ASIC. The
Modules: 8480 module will fail to start. Return the module for repair.
The faults are reported by all other modules as 0x0400, 0x0401 or
0x0402.
FIA_INVALID_IMAGE Validates the field interface adapter code file prior to and during
0x8500 loading. Detects flash memory fault or missing file. The slice will not
boot. Return the module for repair.
Modules:All
FIA_INVALID_IMAGE_CRC Validates the field interface adapter code file prior to loading. Detects
0x8501 flash memory fault or corrupted file. The slice will not boot. Return the
module for repair.
Modules:All
APP_MEMORY_FAULT Detects RAM errors. Return the module for repair.
0x8601 Action:Slice OFFLINE
Modules:All
APP_TIMEOUT The field interface unit has not serviced its watchdog within a timeout
0x8666 (35ms on input modules, 30ms on 8442, 200ms on output modules
except 600ms on 8472).
Modules:All
This has been seen on input modules and indicates that the field
interface unit has stalled or failed to start or failed to warm-start when
requested. All examples were on firmware before build 130.
If the module had recently been started (in the last minute), restart it
and let it try again.
If the module had been running for some time, swap the module to
another and swap back. If it fails again, return for repair.
Action:Slice OFFLINE
IMB_DOUT_RESET The processor has requested a slice reset. This provides a
0x8740 mechanism to reset the slice without removing / inserting the module.
The slice goes offline but resets itself.
Modules:All except 8442
This error is simply a by-product of the reset process; the I/O module
The 8442 reports this fault as log or processor log may indicate the reason in earlier events.
0x0740.
Action:Slice OFFLINE then it should automatically restart.
IMB_DOUT_DISABLE This provides a means for the processor to disable a slice. Some slice
0x8741 faults can only be detected by the processor. In this case the
processor must have a mechanism for turning off a faulty slice.
Modules:All except 8442
This fault is always a secondary symptom of an earlier fault; the I/O
The 8442 reports this fault as module log or processor log will indicate the primary fault.
0x0741.
Action:Slice OFFLINE
FIA_CONFIGURE_ERROR Appears on startup. Verifies that the field interface adaptor (FIA) has
0x8800 been configured. Detects serious errors in the field interface comms
link such as a faulty optocoupler or a dead FIA. The slice will not boot.
Modules:All
Re-insert the module; if it fails again, return the module for repair.
FIA_OVERCURRENT The field interface drew too much current on one quadrant.
0x8806 The slice will not boot.
Modules: 8442, 8472 Return the module for repair.
HIU_DSP_CORE_FAULT Checks the operation of the DSP Core using its self-test functions.
0x90nn Detects internal DSP faults.
Modules:All Return the module for repair.
Subcode:nn = returned DSP test Action:Slice OFFLINE
result register
HIU_MEMORY_ACCESS_FLT Detects faulty memory in the interface to the IMB.
0x9001 Return the module for repair.
Modules:All Action:Slice OFFLINE
HIU_FCRID_FAULT Detects a fault in the hard-wired slice ID code.
0x9002 Return the module for repair.
Modules:All Action:Slice OFFLINE
APP_STACK_FAULT Checks for application firmware errors that corrupt the stack.
0xC003 Report to Technical Support.
Modules:All Action:Slice OFFLINE
APP_TASK_TIMEOUT Application firmware failed to reset the hardware watchdog (e.g. it is
0xC005 locked in a loop). The hardware watchdog will turn off the slice.
Modules:All except 8480 Report to Technical Support.
Action:Slice halted
Count
Fault Threshold
decremen
Pass
Pass
Pass
Fail
Fail
Fail
Fail
Fail
Fail
Test
interval Tests