Monitoring Line of Business Applications
2
Introduction
Typically, organizations select a monitoring solution for their line of businessapplications based on feedback from a few individuals charged with evaluating products.This process is not perfect since the few individual evaluators may not adequatelyrepresent the total population of users or those who will benefit from the monitoringsolution. For example, if the evaluators are from Operations, their requirements may be toselect a solution that will notify them of impending system problems. On the other hand,evaluators from Development will be more likely to select a solution that providesdetailed diagnostic information about unexpected problems in their code.The question remains: what is the best monitoring approach for ensuring your line of business application achieves optimal performance and uptime?
Application Problems Defined
Before answering this question, we should define application problems as any issue thatcauses an application to deviate from its intended design – whether the issue is related tosystem failures like network or operating system errors, or application failures in which arequired task cannot be executed.In addition, problems are categorized as
expected
or
unexpected
. Expected problemsare those which have been planned and coded for during the design phase, or for whichworkarounds have been identified during the QA phase. Expected issues generally avoidcatastrophic failures by providing enough information to enable an end user to respondappropriately in order to remedy the problem. A common expected problem is a “Printerout of paper” error. This failure does not cause the text editor to stop working; rather, adeveloper has specifically anticipated the potential for this error to occur and has writtencode to appropriately handle it. The error message provides enough information for theend user to resolve the problem.While expected problems are typically well structured and correspond to unsuccessfuluse-case scenarios, unexpected problems have no clear context and do not result from aset pattern of user behavior. For example, when an application leaks memory, iteventually fails with an “Out of memory” error. The only thing a developer can do underthis circumstance is to make the code fail gracefully with a “Server unavailable”message, and to try to collect as much information as possible for later analysis.Typically, unexpected or unhandled problems are the most time consuming andcomplicated to track down and resolve, and thus the most costly to a business.
Add a Comment