defining case based reasoning cases with xml

2 downloads 0 Views 474KB Size Report
Literature has, however, indicated that Case Based Reasoning (CBR) is a suitable technique for use in the interpretation of UNDT data. This paper supports the ...
IADIS International Conference Applied Computing 2007

DEFINING CASE BASED REASONING CASES WITH XML E. Asemota*†, S. Gallagher†, G. McRobbie† and S. Cochran‡ MicroScale Sensors, School of Computing† and School of Engineering and Science‡, University of Paisley, Paisley, PA1 2BE, UK

ABSTRACT The interpretation of ultrasonic non-destructive destructive testing (UNDT) data is, largely, a human-centric activity which requires a significant level of expertise. As such, it is difficult to clearly define the rules used for interpreting this data. Many computational techniques have been both proposed and deployed to assist in the interpretation of this data: from procedural language developments through to advanced Artificial Intelligence-based solutions involving Expert Systems and Artificial Neural Networks. However, literature indicates that due to the ill-defined nature of the interpretation of UNDT data, solutions developed using the aforementioned techniques are often considered as being less than complete and robust solutions. Literature has, however, indicated that Case Based Reasoning (CBR) is a suitable technique for use in the interpretation of UNDT data. This paper supports the notion of using CBR in this domain with one caveat. That is, that within any such CBR solution for this domain, the use of rigid case definition is inappropriate. Rigid case definitions make it more likely that some degree of redevelopment of the CBR system will be required when a case definition changes. As it is expected that case definition will change in an ill-defined domain such as the interpretation of UNDT data, an alternative approach that manages such changes in case definition, is required. Such an approach would be able to handle the CBR case changes without necessitating any degree of system redevelopment, i.e. is one which is both flexible and configurable at runtime. This paper outlines the development of a run-time configurable case definition technique which enables a flexible case definition. The paper presents the pragmatic implementation of an XML-based, run-time configurable case definition solution. The body of work presented herein forms part of an overall program of research for the automatic interpretation of UNDT data. KEYWORDS CBR, XML, Ultrasonic NDT, ill-define domain

1. INTRODUCTION Once designed and manufactured, many engineering systems require periodic, in-situ, monitoring; reasons for this include optimum performance and safety. To assist this monitoring, non-destructive testing (NDT) techniques are often employed of which UNDT is one such technique (Pyzdek et al, 2003; Jayakumar and Thavasimuthu, 2002; Achenbach, 1999). UNDT is both an efficient and an effective technique, as it permits the examination of the internal and surface properties these engineering systems. In order to achieve this, UNDT transmits generated ultrasonic energy via an ultrasonic probe into the system under investigation. The technique yields significant volumes of data, which requires a high-level of expertise and, often, many manhours for its interpretation. Currently in industry, such interpretations are primarily performed by human inspectors (Ma et al, 2002; Simpson and Blitz, 1996). However, as literature identifies, such “human-centric” interpretations are often error prone and as these interpretation often determine the safety of such systems, there is a need to develop alternative approaches for this interpretation (Alexiev and Mihovski, 2000; Murgatroyd et al, 1990; Smith, 1998). Although the application of computer-based technique presents an alternative approach to the data interpretation, the complex nature of the data concerned and the lack of clearly defined interpretation rules makes it difficult to apply solutions such as expert systems within this domain (Jarmulak, 2001). Similarly, if an artificial neural network is utilised, the number of patterns such a system will be required to recognise will be impractically large (McNabb and Cornwell, 1995). Hence, case based reasoning (CBR) has been proposed

449

ISBN: 978-972-8924-30-0 © 2007 IADIS

as an appropriate computer-based technique to be employed in the computer-based interpretation of UNDT data as it does not require interpretation rules or data pattern for its training (Jarmulak, 2001). In addition, a CBR system is able to “learn” through the addition of new cases and the removal “irrelevant” (no longer relevant or appropriate) cases from the case library (Watson, 1997). Paradoxically, the lack of clearly defined rules for interpreting UNDT data (Drury, 1978) means that developing a CBR system for this domain is considered a difficult task with respect to the design of the cases. Traditionally, a CBR case defines all the features which the CBR system will utilise during its operations (retrieval, matching and adaptation) (Shiu and Pal, 2004; Riesbeck and Schank, 1989). As such, inaccurately designed cases may adversely affect the operation and performance of the entire system (Watson, 1997). However, as this domain (UNDT data interpretation) is ill-defined, the exact content of the contained cases is difficult to ascertain during the development process. As such, if the case definition changes after the system has been developed, the system may need to be redeveloped in order to support the new case definition. To mitigate these concerns, a run-time configurable case definition infrastructure has been implemented within this program of research. The implementation of this infrastructure is rooted in the use of extensible markup language (XML). As XML provides inbuilt structure and extensibility, other processes within the CBR operation can be abstracted to operate on the structure defined via the XML as opposed to the specifics of the features of a particular case. This paper presents the details of this infrastructure.

2. CASE BASED REASONING CBR problem resolution involves mapping current problems to previously encountered problems. To achieve this, a knowledge base (case library) which consists of a series of “cases” is employed. Each of these cases contains a “case-problem”, a “case-solution” and, optionally, a “case-outcome” and defines a specific experience within the domain which the CBR solution is designed to resolve. Within this experience, a “snap shot” of the problem is stored. That is, the relevant factors which define this experience (problem). Cases can be represented in a variety of formats. For example, they can be represented as simple text, video, etc appropriate to the specific domain. In this way, cases can be used to represent numerous experiences (Shiu and Pal, 2004; Kolodner, 1993; Schank, 1982; Hammond, 1988). The operation of a typical CBR system is depicted in Figure 1. Within this figure, an input problem is entered into the system. If “similarity” is established, the system then retrieves similar cases from the case library from which it reuses the best matching case’s solution as the solution of the input problem. The reuse of the retrieved best matching case’s solution is achieved through the case adaptation operation. If no similarity is established, then the input problem is categorised as a new case which may be added to the case library (Watson, 1997; Lebowitz, 1983; Kolodner, 1993; Koton, 1989; Leake, 1996; Hammond, 1988). The problem resolution capability of a CBR solution depends not only on the number of different cases stored within the case library but also on the content of the cases. The storage of cases which sufficiently describe the problem domain enhances the capability of the system (Watson, 1997).

Figure 1. High-level depiction of a typical CBR operation

450

IADIS International Conference Applied Computing 2007

3. AN XML-BASED CBR CASE DEFINITION INFRASTRUCTURE Within the current programme of research, an XML-based run-time configurable CBR case definition infrastructure was developed. In this infrastructure, each case contains a “case-problem”, a “case-solution” and a “case-outcome” (see Section 3). The case-problem captures the inspection variables (material, probe and signal characteristics, the signal type, the probe operational frequency and the B-Scan image). These variables affect the pattern of the resultant data from an inspection (Drury, 1978). The solution applied to the case-problem is recorded via the case-solution. The case-solution is defined by a Boolean value (true or false) specifying whether the feature is a defect or not. The case-outcome records the outcome of an interpretation. The case-outcome is also a Boolean value with “true” indicating a successful interpretation and “false” indicating the converse. In the developed CBR system, cases stored in the case library are represented using Extensible Markup Language (XML) (Means and Harold, 2002; Marchal, 2001). Figure 2 shows the Markup for a typical, implemented case in the case base. Within this application, no case-solution can exist unless there is a caseproblem to which the case-solution is applied. Also, more than one case-problem can have the same casesolution. That is, two (or more) cases within the case library may contain case-problems which are, in themselves, different but have the same case-solutions. For example, these case-solutions may have the value of “true”.

Figure 2. A typical implemented case from the case library

Rather than store this solution twice, the physical storage of the cases is implemented in a manner that the three parts of a case (case-problem, case-solution and case-outcome) are stored in different data stores. Each case-problem has a “ProblemID” node which is a globally unique identifier (GUID) (Hamilton, 2003). Each case-solution and case-outcome also has a “ProblemID” node which records all case-problems that the specific case-solution or case-outcome applies to. The resultant effect of this implementation is that only one version of a particular case-solution is ever physically stored within the case library. Thus, the sizes of both the case-solution and case-outcome data stores are kept at minimum which in turn, enhances retrieval operations from these data stores. The above relationship between a case and its constituent parts is depicted

451

ISBN: 978-972-8924-30-0 © 2007 IADIS

in Figure 3. Within this diagram, each case comprises a case-problem which is associated with a casesolution and a case-outcome via the case-problem “ProblemID”.

Figure 3. Relationship between constituent parts of a case

The developed CBR application is part of an overall solution which, at its core, contains a controller module that monitors the various parts of the solution (Asemota et al., 2006). This controller generates the CBR input problem’s Markup (XML case definition) and transfers this problem to the CBR application. In order to generate this Markup, the controller accumulates data from all modules activated within the solution. However, the number of modules activated varies depending on the particular requirements of an inspection. In addition, the controller may interface with external systems and in such scenarios, the case definition must reflect the impact of these external systems on the UNDT data which the CBR application is designed to interpret. Thus, the need for this implemented flexible and configurable case definition structure. Figure 4 illustrates this overall solution indicating the modules contained therein and their interactions with the controller (framework controller).

Figure 4. High-level architecture of the overall research solution

4. CONCLUSION This paper has identified the fact that as UNDT data interpretation is both complex and ill-defined in nature, the application of CBR in this domain requires a flexible infrastructure for case definition. Within the current program of research, a CBR system which utilises XML to define cases in its case library was developed. As

452

IADIS International Conference Applied Computing 2007

XML is both structured and extensible, the case definition infrastructure is able to manage any required changes to case definition at run-time without necessitating any redevelopment of the application. The developed infrastructure forms part of an implemented research solution for the automatic interpretation of UNDT data which is able to integrate external systems. The flexibility offered by this implemented XML-based case structure allows the framework to handle data from these external systems without requiring any redevelopment of the CBR-Based application.

REFERENCES Achenbach J.D. (2000). Quantitative nondestructive evaluation. International Journal of Solids and Structures 37 (2000) 13-27 Alexiev Al. and Mihovski M. (2000). Human Reliability in Ultrasonic Inspection. In proceedings of 15th World Conference on NDT. Rome, Oct 2000, p 713. Asemota, E., Gallagher, S., Lees, B. and Cochran, S. (2006). Towards the automatic interpretation of ultrasonic nondestructive testing data through the application of image threshold and region growing segmentation. Proc. IMechE Vol. 220 Part B: J. Engineering Manufacture, 220(B6), 1011-1016. Drury, J. C. (1978). Ultrasonic Flaw Detection for Technicians. OIS plc, Cleveland. Hamilton B. (2003) ADO.NET Cookbook. O'Reilly. ISBN: 0596004397. Hammond K.J. (1988). Case Based Planning: Viewing planning as a Memory Task. In proceedings of the DARPA CaseBased Reasoning Workshop, edited by J.L. Kolodner. San Francisco: Morgan Kaufmann Publishers. Jarmulak, J., Kerckhoffs, Eugene J.H., vant’t Veen, P., 2001. Case-based Reasoning for Interpretation of Data from Nondestructive Testing. Engineering Applications of Artificial Intelligence 14 (2001), 401 -417. Jayakumar T. and Thavasimuthu M. (2002). Practical Non-destructive Testing. Woodhead Publishing. ISBN:1855736004 Kolodner, J.L. (1993). Case Based Reasoning. San Francisco: Morgan Kaufmann Publishers. Koton, P. (1989). Using Experience in Learning and Problem Solving. Ph.D. thesis, MIT/LCS/TR-441, Laboratory of Computer Science, Massachusetts Institute of Technology. Leake, D.B., (edited). (1996). Case-Based Reasoning-Experiences, Lessons and Future Directions. Cambridge, MA: AAAI Press / MIT Press. Lebowitz, M. (1983). Generalisation from natural language text. Cognitive Science, 7(1). Ma H. W., Zhang X. H. and Wei J. (2002). Research on an ultrasonic NDT system for complex surface parts. Journal of Materials Processing Technology Volume 129, Issues 1-3 , 11 October 2002, Pages 667-670 McNab, A., Cornwell, I., (1995). Review of Artificial Intelligence Applied to Ultrasonic Defect Evaluation. INSIGHT, Vol. 37, No. 1. Marchal B. (2001). Xml by Example. Que Publishing. ISBN: 0789725045 Means W. S. and Rusty E. (2002). XML in a Nutshell, 2nd Edition O'Reilly. ISBN: 0596002920. Murgatroyd, R. A, Seedm, H, Crutze, S. and Van den Bergh, R. 1990. Human Reliability Studies in PISC III. 10th International Conference on NDE in the Nuclear and Pressure Vessel Industries, ed. M.J. Whittle, J.E. Doherty, K. Lida. ASM International, 1990. ISBN 087170-393-9 Pyzdek T., Keller P. A. and Pyzdek P. (2003). Quality Engineering Handbook. Marcel Dekker. ISBN: 0824746147 Riesbeck, C. K., and Schank, R. C. (1989). Inside Case-Based Reasoning. Nortvale, NJ: Lawrence Erlbaum Associates. Schank, R. C. (1982). Dynamic Memory: A Theory of Learning in Computers and People. New York: Cambridge University Press. Shiu S. and Pal S. K. (2004). Foundations of Soft Case-Based Reasoning. Wiley-Interscience. ISBN: 0471086355 Simpson G. and Blitz J. (1996).Ultrasonic Methods of Non-Destructive Testing. Springer. ISBN: 0412604701 Smith, P.1998. A Recording and Presentation System For Manual Ultrasonic Inspections Using A Speech Recognition Interface. PhD Thesis University of Paisley Watson, I. (1997). Applying Case-Based Reasoning: Techniques for Enterprise Systems. San Francisco: Morgan Kaufmann Publishers.

453