THIS IS A PRE-PRINT OF THE AUTHOR'S VERSION. Please refer to the following for the original printed version: Turetken, O. (2013). Towards a Maintainability Model for Business Processes. 1st International Workshop on Communicating Business Process and Software Models (CPSM). IEEE, pp. 30–33.
Towards a Maintainability Model for Business Processes: Adapting a Software Maintainability Model (Position Paper)

Oktay Turetken
Information Systems Group, School of Industrial Engineering
Eindhoven University of Technology (TU/e), Eindhoven, Netherlands
[email protected]

Abstract—This position paper introduces the initial steps towards a business process maintainability model. The proposed model is grounded on a software maintainability model that has been empirically validated and is currently being used in practice. The paper briefly discusses the structure of the model, options for the base measures that can be applied in the business process domain, and the challenges and future research directions for the construction and validation of the proposed model.

Keywords—business process maintainability; maintainability model; software maintainability; business process quality metrics
I. INTRODUCTION
With the presumption that the amount of effort needed to maintain a software system is related to the technical quality of its source code, several approaches and models based on internal quality metrics have been developed to quantify the maintainability of software systems (see [1] for a survey). These metrics have been successfully used as guidance for improving software designs to make them easier to maintain. Software quality metrics have long been exploited in the business process management (BPM) field due to the commonalities shared by software applications and (business) processes¹. In recent years, several studies have been performed on measuring the quality of BPs ([2], [3], [4], [5]). However, despite the several base metrics available in the literature, work on the maintainability of BP models is limited. Adopting the maintainability definition from the software domain [6], we can define BP maintainability as "the ease with which a single business process or a collection of business processes can be modified to correct faults, improve performance or other attributes, or adapt to a changed environment".

Existing research typically focuses on the quality of a single process model (e.g., [7], [8], [9]). However, changes in the business environment are likely to impact a collection of interrelated process models rather than merely a single model. Understanding the relationships between process models, in order to better analyze the impact of a change and to ensure that the integrity of the entire collection is maintained, is of utmost importance, as it constitutes a key part of the maintenance process.

¹ See [35] and [36], as well as [37], for discussions on the commonalities and particularities of software applications and process models.
Therefore, in quantifying the maintainability of BP models, we should also consider the maintainability of an entire collection of business process models. This requires the integration of intra-process and inter-process metrics and indicators at both the process and the collection level.

In this paper, we discuss the applicability of a software maintainability model for indicating the maintainability of a collection of interrelated business process models. We have chosen the SIG (Software Improvement Group) Maintainability Model [10] as the basis, since:

• It has been empirically validated ([10], [11]).
• Unlike many other maintainability models, it allows for the analysis of root causes of poor performance and guides improvement efforts.
• It is pragmatic and relatively easy to use and automate.
• It has a clear reference to an agreed standard, i.e., ISO/IEC 9126 [12].

II. SIG MAINTAINABILITY MODEL
The ISO/IEC 9126 standard for software product quality [12] distinguishes three perspectives on product quality: internal quality, external quality, and quality-in-use. Internal quality represents the totality of characteristics of the software product from an internal view; source code metrics serve as good indicators for determining it. External product quality refers to the quality observed when the software is executed. Quality-in-use represents the user's view of the quality of the software product when it is used in a specific environment and a specific context of use. For internal and external quality, ISO/IEC 9126 categorizes software quality attributes into six characteristics, which are further subdivided into 20 sub-characteristics [12]. One of the six main characteristics is maintainability, which is further broken down into analyzability, changeability, stability, and testability.

The SIG maintainability model is a pragmatic model that operationalizes the quality definitions of ISO/IEC 9126 by mapping a selection of source code metrics to the maintainability characteristic and its sub-characteristics [11]. Being based on static analysis of source code, the model measures maintainability from the perspective of internal product quality. It combines a set of source code metrics into a single rating of the maintainability of a software product.
The thresholds used by the model are founded on the statistical analysis of a set of representative systems [10], expert opinion and works in the literature [13].
A. Mapping Source Code Metrics to Product Properties

The SIG model has a layered structure (Fig. 1) for measuring and rating the quality of a software system in terms of the quality characteristics of ISO/IEC 9126. The first layer consists of measures based on source code analysis of the software system. The analysis involves basic metrics, such as lines of code (LOC), duplicated LOC, the McCabe complexity metric, parameter counts, and dependency counts. These metrics are programming-language independent and are collected at the level of lines of code, units (e.g., methods or functions), and modules (e.g., files or classes).

On the second layer, the source code measures are aggregated and mapped onto ratings for product properties at the level of the entire software product, such as volume, duplication, and unit complexity. The ratings are on a unitless ordinal scale ranging from 1 to 5 (stars). Such a rating scheme facilitates the communication and benchmarking of product quality at the level of the entire software product. For the volume and duplication properties, LOC-based metrics are translated into ratings based on predefined threshold values. For example, a system with less than 3% code duplication is considered well designed and rated '*****' (five stars). For the mapping to the remaining product properties, the model applies a risk profiling technique. Accordingly, the system is partitioned volume-wise into four risk categories: low, moderate, high, and very high risk. For example, a software unit is considered 'high risk' in terms of unit complexity if its McCabe number is between 21 and 50. Once the risk level for each unit is determined, complexities are first aggregated per risk level by counting, for each level, what percentage of lines of code falls within units categorized at that level. For example, if, for a system with 10,000 LOC, the high-risk units together amount to 1,200 LOC, then the aggregate number computed for that risk category is 12%. Next, given the risk profile of a system for a particular product property, the rating for that property can be computed based on a mapping schema. Following the same example, a system with 10% to 15% of its code categorized as high risk is rated '**' (two stars out of five).
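As an illustration of this second layer, the following is a minimal sketch of the risk-profiling and rating computation in Python. The threshold bands and the rating schema below are rough approximations in the spirit of [10], [13], not SIG's calibrated values.

```python
# Minimal sketch of SIG-style risk profiling (layer 2).
# Thresholds and the rating schema are illustrative approximations.

# McCabe complexity upper bounds -> risk category (illustrative)
RISK_BANDS = [(10, "low"), (20, "moderate"), (50, "high")]  # above 50: very high

def risk_category(mccabe):
    for upper, category in RISK_BANDS:
        if mccabe <= upper:
            return category
    return "very high"

def risk_profile(units):
    """units: list of (loc, mccabe) pairs. Returns % of total LOC per risk category."""
    total_loc = sum(loc for loc, _ in units)
    loc_per_category = {"low": 0, "moderate": 0, "high": 0, "very high": 0}
    for loc, mccabe in units:
        loc_per_category[risk_category(mccabe)] += loc
    return {cat: 100.0 * loc / total_loc for cat, loc in loc_per_category.items()}

# Mapping schema: maximum % of LOC allowed in the moderate / high /
# very-high categories for each star rating (illustrative values).
RATING_SCHEMA = [
    (5, (25, 0, 0)),
    (4, (30, 5, 0)),
    (3, (40, 10, 0)),
    (2, (50, 15, 5)),
]

def rate(profile):
    for stars, (max_moderate, max_high, max_very_high) in RATING_SCHEMA:
        if (profile["moderate"] <= max_moderate
                and profile["high"] <= max_high
                and profile["very high"] <= max_very_high):
            return stars
    return 1

# The example from the text: a 10,000-LOC system whose high-risk
# units amount to 1,200 LOC (12%) is rated '**' for unit complexity.
units = [(8650, 8), (150, 4), (1200, 35)]   # (LOC, McCabe) per unit
profile = risk_profile(units)
print(profile)        # {'low': 88.0, 'moderate': 0.0, 'high': 12.0, 'very high': 0.0}
print(rate(profile))  # 2
```

Unit size and module coupling profiles follow the same pattern, with unit LOC or incoming invocations taking the place of the McCabe number.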
B. Mapping Product Properties to ISO/IEC 9126 Maintainability Sub-characteristics

At the third layer, the model relates the ISO/IEC 9126 maintainability sub-characteristics to the product properties with which they are conceptually associated. To arrive at a system-level score, a weighted average is computed over the ratings of the product properties that are relevant according to the cross-association between product properties and sub-characteristics (as depicted in Fig. 1). For instance, the changeability sub-characteristic is associated with the duplication, unit complexity, and module coupling product properties. A system rated '**' for duplication, '***' for unit complexity, and '***' for module coupling will have a weighted average of '***' for the changeability sub-characteristic. Details regarding the mapping scheme in the SIG model are available in [11] and [13].
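Concretely, under the Fig. 1 association and with equal weights (a simplifying assumption; the actual SIG weighting may differ), the third-layer computation reduces to:

```python
# Layer 3 sketch: a sub-characteristic rating as the (rounded) average
# of the ratings of its associated product properties (cf. Fig. 1).
# Equal weights are an illustrative assumption.
CHANGEABILITY = ("duplication", "unit complexity", "module coupling")

def sub_characteristic_rating(property_ratings, associated_properties):
    relevant = [property_ratings[p] for p in associated_properties]
    return round(sum(relevant) / len(relevant))

ratings = {"duplication": 2, "unit complexity": 3, "module coupling": 3}
print(sub_characteristic_rating(ratings, CHANGEABILITY))  # 3, i.e., '***'
```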
Fig. 1. SIG Maintainability Model [10] maps source code measurements onto ISO/IEC 9126 [12] quality characteristics.
C. Model Validation

For the validation of the model, empirical studies have been conducted to demonstrate a statistical correlation between the quality of software products as measured by the SIG model and the speed at which software issues (of both the defect and the enhancement type) are resolved by the development and/or maintenance teams of these products [10], [11]. The studies indicated a significant predictive effect of the model on the efficiency of development and maintenance tasks, such as the resolution of defects and the implementation of enhancements.

III. ADOPTING THE SIG MODEL FOR BUSINESS PROCESSES
There are several challenges in adopting the SIG model for BP maintainability. At this stage, two of them can be considered vital for an effective adoption. First, representative and well-established measures of BP models should be applied to replace the source code measures used in the SIG model. Second, SIG uses several thresholds derived from the analysis of a large set of representative software systems, expert opinions, and the related literature; establishing such threshold values for process models requires extensive empirical studies and evidence, which is currently very limited in the BPM domain (particularly when compared with the maturity of the measurement field in software engineering). The first challenge is a prerequisite for the efforts to cope with the second one. Therefore, at this stage, we focus on the first challenge: selecting and applying the most effective base metrics of BP models to represent the product properties.

Here, the product that we consider as subject matter is a collection of interrelated BP models (e.g., related through control flow, message exchange, or aggregation). We should also note that, in the following discussions, a software unit (module, method, function, etc.) translates into a process model for our purposes; the sketch below fixes this terminology. The remaining subsections discuss the base measures that can be applied to represent the properties of a BP model collection (i.e., volume, duplication, unit complexity, unit size, unit interfacing, and coupling).
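The following speculative sketch (the class layout and measure names are our illustrative assumptions, not part of the SIG model or of any BP standard) shows the "unit becomes process model, system becomes collection" translation:

```python
# Speculative sketch of the adapted structure: a 'unit' becomes a
# process model, a 'system' becomes a collection of process models.
from dataclasses import dataclass

@dataclass
class ProcessModel:
    """A 'unit' of the adapted model; field names are illustrative."""
    name: str
    activities: int   # size in number of activities (subsections A, D)
    cfc: int          # a complexity measure, e.g., Cardoso's CFC [24]

collection = [
    ProcessModel("Handle claim", 24, 18),
    ProcessModel("Register customer", 7, 3),
]

# Volume of the collection as an aggregation of unit sizes (subsection A);
# per-model measures would then feed a risk profiling step analogous to
# Section II, with (activities, cfc) taking the place of (LOC, McCabe).
volume = sum(m.activities for m in collection)
print(volume)  # 31
```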
A. Volume

The total size of a system is a typical factor with a heavy influence on any measure of maintainability, which is why it usually emerges as a significant predictor in such models [1]. The larger the system, the more effort its maintenance requires, as it becomes harder to understand and analyze. This view is also supported in the BPM domain [14]. Although there are several variations, the predominant metric used to quantify the size of a process model is the total number of activities ([15], [16], [17], [18]). Mendling et al. also propose the total number of nodes and the diameter (the length of the longest path from a start node to an end node in the process model) as alternative size (volume) measures [3]. Following the SIG approach, the overall volume can be an aggregation of the sizes of the units (single process models).

B. Duplication

Unlike volume, duplication is hardly considered as a measure in software maintainability models [1], which makes the SIG model distinctive in that respect. A certain degree of duplication is considered typical for systems; however, unnecessary amounts are damaging to maintainability [19]. Discovering duplicate code is fairly straightforward, whereas it can be challenging in the BPM domain, and existing work on determining the degree of duplication between process models is limited. The measures proposed by Dijkman et al. [20] can be exploited to quantify different aspects of duplication through the notion of process model similarity. Likewise, the techniques proposed by Ekanayake et al. [21] for retrieving clusters of approximate BP clones, taking into account measures such as cluster quality, should be considered.

C. Unit Complexity

Complexity is a well-studied property of systems, frequently referred to as a factor influencing several aspects of maintainability [22], [23]. Various metrics have been proposed in the BPM field to quantify the complexity of process models. For example, Cardoso proposes the Control Flow Complexity (CFC) metric for measuring the control-flow complexity of process models [24]. Vanderfeesten et al. [25] introduce the cross-connectivity metric, which aims to capture the cognitive effort needed to understand the relationship between any pair of process model elements. The study in [26] surveys other important works on the complexity of process models, including [3], [27], and [15].

D. Unit Size

Apart from the overall size (volume) of the BP model collection and the complexity per unit (a single business process model), the size of individual units may also explain the level of maintainability. It is fairly intuitive that the larger the units, the more difficult it is to analyze and test them. The metrics discussed in subsection A are readily applicable for quantifying unit size; a sketch of these size and complexity measures follows below.
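The following is a minimal sketch of the size and complexity measures discussed above, assuming a process model captured as a directed acyclic graph of typed nodes (the graph encoding and node-type names are our illustrative assumptions):

```python
# Sketch of size (number of activities/nodes), diameter [3], and
# Cardoso's Control Flow Complexity (CFC) [24] for a process model
# encoded as a directed acyclic graph of typed nodes (illustrative).
from collections import defaultdict

# node -> type ('activity', 'xor', 'or', 'and', 'start', 'end')
nodes = {"s": "start", "A": "activity", "g1": "xor",
         "B": "activity", "C": "activity", "e": "end"}
edges = [("s", "A"), ("A", "g1"), ("g1", "B"), ("g1", "C"),
         ("B", "e"), ("C", "e")]

succ = defaultdict(list)
for u, v in edges:
    succ[u].append(v)

size = sum(1 for t in nodes.values() if t == "activity")  # number of activities
num_nodes = len(nodes)

def diameter(node):
    """Length of the longest path from 'node' to an end node (acyclic models)."""
    if nodes[node] == "end":
        return 0
    return 1 + max(diameter(v) for v in succ[node])

def cfc():
    """CFC: each XOR-split adds its fan-out, each OR-split adds 2^n - 1,
    and each AND-split adds 1."""
    total = 0
    for n, t in nodes.items():
        fan_out = len(succ[n])
        if fan_out > 1:
            if t == "xor":
                total += fan_out
            elif t == "or":
                total += 2 ** fan_out - 1
            elif t == "and":
                total += 1
    return total

print(size, num_nodes, diameter("s"), cfc())  # 3 6 4 2
```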
E. Unit Interfacing

In the SIG model, unit interfacing is measured in terms of the number of parameters declared in the interface of each unit. It is assumed that a larger interface means more parameters to keep track of, which in turn influences maintainability, in particular stability. Unit interfacing would translate into the interfacing of a process model, i.e., the number (and possibly the complexity) of the data inputs/outputs that are exchanged with other process models. The existing literature on the measurement of business process quality mostly considers the control-flow aspect of process models and rarely emphasizes the data/information aspect (which incorporates data elements usually referred to as work products, messages, business objects, information elements, etc.). The metrics proposed by Canfora et al. [17] (such as NDWPIn, the number of input dependences of the work products with the activities in the process) and by Rolón et al. [28] offer constructs for defining metrics that quantify the unit interfacing property of process models.

F. Coupling

Coupling is another important property that frequently appears among the factors influencing the maintainability of systems [1], [22]. It measures the number of interconnections among software units; the SIG model uses the number of incoming invocations for each module (a delimited group of units, e.g., a class or a file). Canfora et al. [17] define an activity coupling metric as the ratio of the number of activities to the number of precedence dependences between activities. The coupling metric developed by Reijers et al. [29] (further enhanced in [7]) also takes the data aspect into account: it counts the overlap of data elements for each pair of activities using a static description of the product, where two activities are 'coupled' if they contain one or more common data elements (see the sketch below). In addition to these metrics that consider activity coupling, our model should also consider the coupling of process models, taking works such as [30] into account for this purpose.
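The following sketch illustrates such a data-overlap coupling measure in the spirit of [29]; the activity names, data elements, and the normalization by the number of activity pairs are our illustrative assumptions, not the exact metric definition:

```python
# Data-overlap coupling sketch: two activities are 'coupled' if they
# share at least one data element; the measure below is the fraction
# of coupled activity pairs (illustrative normalization).
from itertools import combinations

# activity -> set of data elements it reads or writes (hypothetical)
data_use = {
    "Register claim":  {"claim", "customer"},
    "Assess claim":    {"claim", "policy"},
    "Notify customer": {"customer", "decision"},
}

def coupling(data_use):
    pairs = list(combinations(data_use, 2))
    coupled = sum(1 for a, b in pairs if data_use[a] & data_use[b])
    return coupled / len(pairs) if pairs else 0.0

print(coupling(data_use))  # 0.666..., i.e., 2 of the 3 pairs share a data element
```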
IV. DISCUSSION AND FUTURE WORK
A BP maintainability model would be of interest to different BPM stakeholders, both in practice and in research. It would provide guidance to BPM tool vendors for integrating such metrics and for presenting aggregated views from different perspectives. Quantifying maintainability at the model and repository levels would help business process designers and practitioners, in particular, to gain insight into the quality of their process models in general, to understand deficiencies as well as strengths, and to pinpoint the parts where improvements are possible. The maintainability model can also be used to predict the efficiency of modeling and maintenance tasks, such as the resolution of defects and the implementation of enhancements. The predictive power of such models can be further improved by taking several other factors into consideration, such as BP designer characteristics (e.g., experience, knowledge level) and the modeling notation and tool.

Nevertheless, the development and validation of the model require research in several directions. The applicability of current BP quality metrics should be thoroughly analyzed to make sure that they accurately represent the BP properties, and new BP metrics may have to be proposed for better coverage. Additional quality properties, such as structuredness (which has been validated to play a part in the understandability of BP models [31]), can be incorporated to strengthen the correlation.
Empirical studies should be conducted, first, to establish the threshold values and the values used for risk profiling, and second, for the overall validation of the model. This requires carefully designed multiple case studies and/or experiments that involve process repositories of different characteristics being subjected to diverse types of maintenance tasks. The experiments can be conducted to analyze the effectiveness and efficiency of the maintenance tasks (structurally similar to studies such as [32], [8]) and may employ statistical analysis techniques to uncover possible correlations between the effectiveness and efficiency of maintenance tasks and the BP quality metrics as measured by the BP maintainability model. Regarding the determination of thresholds, Mendling et al. [9] propose threshold values for diverse BP quality metrics for discriminating models of high and low error probability. Similarly, Sánchez-González et al. [33] propose threshold values for the complexity of BPs. These values, as well as the statistical methods used for determining such thresholds, can be adapted in future experiments.

The proposed structure of the BP maintainability model is based merely on internal process quality metrics and does not consider the role of cognitive psychology in the understandability of process models. Future work should also consider underpinnings of the cognitive dimension by following studies such as [34].

REFERENCES

[1] M. Riaz, E. Mendes, and E. Tempero, "A systematic review of software maintainability prediction and metrics," in ESEM'09, 2009, pp. 367–377.
[2] I. T. P. Vanderfeesten, J. Cardoso, J. Mendling, H. A. Reijers, and W. M. P. van der Aalst, "Quality metrics for business process models," in 2007 BPM and Workflow Handbook, 2007, pp. 179–190.
[3] J. Mendling, Metrics for Process Models, LNBIP 6. Berlin, Heidelberg: Springer-Verlag, 2008.
[4] S. Overhage, D. Q. Birkmeier, and S. Schlauderer, "Quality Marks, Metrics, and Measurement Procedures for Business Process Models," BISE, vol. 4, no. 5, pp. 229–246, Sep. 2012.
[5] H. A. Reijers, J. Mendling, and J. Recker, "Business Process Quality Management," in Handbook on Business Process Management 1, Springer, 2010, pp. 167–185.
[6] IEEE Computer Society, "IEEE Std. 610.12-1990. Standard Glossary of Software Engineering Terminology," Los Alamitos, CA, 1993.
[7] I. T. P. Vanderfeesten, J. Cardoso, and H. A. Reijers, "A Weighted Coupling Metric for Business Process Models," in Workshop Proceedings of CAiSE 2007, 2007, pp. 41–44.
[8] H. A. Reijers and J. Mendling, "A Study Into the Factors That Influence the Understandability of Business Process Models," IEEE Transactions on Systems, Man, and Cybernetics, vol. 41, no. 3, pp. 449–462, 2011.
[9] J. Mendling, L. Sánchez-González, F. García, and M. La Rosa, "Thresholds for error probability measures of business process models," Journal of Systems and Software, vol. 85, no. 5, pp. 1188–1197, 2012.
[10] D. Bijlsma, M. A. Ferreira, B. Luijten, and J. Visser, "Faster issue resolution with higher technical quality of software," Software Quality Journal, vol. 20, no. 2, pp. 265–285, May 2011.
[11] R. Baggen, J. P. Correia, K. Schill, and J. Visser, "Standardized code quality benchmarking for improving software maintainability," Software Quality Journal, vol. 20, no. 2, pp. 287–307, May 2011.
[12] International Organization for Standardization (ISO), "ISO/IEC 9126-1: Software engineering - product quality - part 1: Quality model," 2001.
[13] I. Heitlager, T. Kuipers, and J. Visser, "A Practical Model for Measuring Maintainability," in QUATIC'07, 2007, pp. 30–39.
[14] J. Mendling, H. A. Reijers, and W. M. P. van der Aalst, "Seven process modeling guidelines (7PMG)," Information and Software Technology, vol. 52, no. 2, pp. 127–136, Feb. 2010.
[15] V. Gruhn and R. Laue, "Approaches for Business Process Model Complexity Metrics," in Technologies for Business Information Systems, Springer, 2007, pp. 13–24.
[16] J. Cardoso, J. Mendling, G. Neumann, and H. A. Reijers, "A Discourse on Complexity of Process Models," in BPM Workshops 2006, LNCS 4103, 2006, pp. 117–128.
[17] G. Canfora, F. García, M. Piattini, F. Ruiz, and C. A. Visaggio, "A family of experiments to validate metrics for software process models," Journal of Systems and Software, vol. 77, no. 2, pp. 113–129, Aug. 2005.
[18] A. Dikici, O. Turetken, and O. Demirors, "A Case Study on Measuring Process Quality: Lessons Learned," in Euromicro Conf. on Software Engineering and Advanced Applications (SEAA), 2012, pp. 294–297.
[19] C. Kapser and M. Godfrey, "'Cloning Considered Harmful' Considered Harmful," in Working Conf. on Reverse Engineering, 2006, pp. 19–28.
[20] R. Dijkman, M. Dumas, B. van Dongen, R. Käärik, and J. Mendling, "Similarity of business process models: Metrics and evaluation," Information Systems, vol. 36, no. 2, pp. 498–516, Apr. 2011.
[21] C. C. Ekanayake, M. Dumas, L. García-Bañuelos, M. La Rosa, and A. H. M. ter Hofstede, "Approximate clone detection in repositories of business process models," in BPM 2012, LNCS 7481, 2012, pp. 1–16.
[22] J. Saraiva, "A Roadmap for Software Maintainability Measurement," in ICSE 2013, 2013, pp. 1453–1455.
[23] L. S. González, F. G. Rubio, F. R. González, and M. P. Velthuis, "Measurement in business processes: a systematic review," Business Process Management Journal, vol. 16, no. 1, pp. 114–134, 2010.
[24] J. Cardoso, "Business Process Control-Flow Complexity," International Journal of Web Services Research, vol. 5, pp. 49–76, 2008.
[25] I. T. P. Vanderfeesten, H. A. Reijers, J. Mendling, W. M. P. van der Aalst, and J. Cardoso, "On a Quest for Good Process Models: The Cross-Connectivity Metric," in CAiSE'08, LNCS 5074, Springer, 2008, pp. 480–494.
[26] G. M. Muketha, A. A. A. Ghani, M. H. Selamat, and R. Atan, "A Survey of Business Process Complexity Metrics," Information Technology Journal, vol. 9, no. 7, pp. 1336–1344, 2010.
[27] K. B. Lassen and W. M. P. van der Aalst, "Complexity metrics for Workflow nets," Information and Software Technology, vol. 51, no. 3, pp. 610–626, Mar. 2009.
[28] E. Rolón, L. Sánchez, F. García, F. Ruiz, M. Piattini, D. Caivano, and G. Visaggio, "Prediction Models for BPMN Usability and Maintainability," in IEEE Conference on CEC'09, 2009, pp. 383–390.
[29] H. A. Reijers and I. T. P. Vanderfeesten, "Cohesion and coupling metrics for workflow process design," in BPM 2004, LNCS 3080, 2004, pp. 290–305.
[30] M. Perepletchikov and C. Ryan, "A Controlled Experiment for Evaluating the Impact of Coupling on the Maintainability of Service-Oriented Software," IEEE Transactions on Software Engineering, vol. 37, no. 4, pp. 449–465, Jul. 2011.
[31] M. Dumas, M. La Rosa, J. Mendling, R. Mäesalu, H. A. Reijers, and N. Semenenko, "Understanding Business Process Models: The Costs and Benefits of Structuredness," in CAiSE'12, LNCS 7328, 2012, pp. 31–46.
[32] J. Mendling, M. Strembeck, and J. Recker, "Factors of process model comprehension: Findings from a series of experiments," Decision Support Systems, vol. 53, no. 1, pp. 195–206, Apr. 2012.
[33] L. Sánchez-González, F. García, F. Ruiz, and J. Cardoso, "Towards Thresholds of Control Flow Complexity Measures for BPMN models," in SAC'11, 2011, pp. 1445–1450.
[34] S. Zugal, J. Pinggera, and B. Weber, "Assessing Process Models with Cognitive Psychology," in EMISA'11, 2011, pp. 177–182.
[35] L. J. Osterweil, "Software Processes are Software Too," in International Conference on Software Engineering (ICSE'87), 1987, pp. 2–13.
[36] L. J. Osterweil, "Software Processes are Software Too, Revisited," in ICSE'97, 1997, pp. 539–548.
[37] V. Gruhn and R. Laue, "What business process modelers can learn from programmers," Science of Computer Programming, vol. 65, pp. 4–13, 2007.