Oct 6, 2008 - School of Civil, Environmental and Mining Engineering, University of ... briefly examines the areas of geotechnical engineering to which ANNs ...
th
The 12 International Conference of International Association for Computer Methods and Advances in Geomechanics (IACMAG) 1-6 October, 2008 Goa, India
Future Challenges for Artificial Neural Network Modelling in Geotechnical Engineering M. B. Jaksa, H. R. Maier School of Civil, Environmental and Mining Engineering, University of Adelaide, Adelaide, Australia
M. A. Shahin Department of Civil Engineering, Curtin University of Technology, Perth, Australia Keywords: artificial neural networks, artificial intelligence ABSTRACT: Artificial neural networks (ANNs) are a form of artificial intelligence and, since the mid-1990s, ANNbased models have been successfully applied to virtually every problem in geotechnical engineering. This paper briefly examines the areas of geotechnical engineering to which ANNs have been applied, provides a brief overview of the operation of ANN models, and highlights and discusses four important issues which require further attention in the future. These are model robustness, transparency and knowledge extraction, extrapolation, and uncertainty. For ANN models to be more effective and useful in the future, it is essential that further work be undertaken in these four areas, particularly in the context of geotechnical engineering.
1 Introduction Since the early 1990s, artificial neural networks (ANNs) have been applied to almost every problem in geotechnical engineering, including: blasting (Lu, 2005); dams (Kim and Kim, 2007); earth retaining structures (Goh et al., 1995; Kung et al., 2007); environmental geotechnics (Shang et al., 2004); ground anchors (Shahin and Jaksa, 2004; 2005a; 2005b; 2006); liquefaction (Goh, 1994b; 1996a; Agrawal et al., 1997; Ali and Najjar, 1998; Najjar and Ali, 1998; Ural and Saka, 1998; Juang and Chen, 1999; Goh, 2002; Javadi et al., 2006; Young-Su and Byung-Tak, 2006; Goh and Goh, 2007); pile foundations (Goh, 1994a; 1995b; 1996b; Chan et al., 1995; Lee and Lee, 1996; Teh et al., 1997; Abu-Kiefa, 1998; Nawari et al., 1999; Rahman et al., 2001; Hanna et al., 2004; Goh et al., 2005; Das and Basudhar, 2006); rock mechanics (Gokceoglu et al., 2004); site characterisation (Zhou and Wu, 1994; Basheer et al., 1996; Najjar and Basheer, 1996a; Rizzo et al., 1996); shallow foundations (Sivakugan et al., 1998; Shahin et al., 2002a; 2002b; 2003a; 2003b; 2003c; 2004a; 2005a; 2005b; Provenzano et al., 2004; Padmini et al., 2007; Rezania and Javadi, 2007; Samui, 2007); slope stability (Ni et al., 1996; Ferentinou and Sakellariou, 2007; Zhao, 2007); soil properties and behaviour (Ellis et al., 1992; Agrawal et al., 1994; Gribb and Gribb, 1994; Penumadu et al., 1994; Basheer and Najjar, 1995; Cal, 1995; Ellis et al., 1995; Goh, 1995a; 1995c; Najjar and Basheer, 1996b; Najjar et al., 1996a; 1996b; Romero and Pamukcu, 1996; Penumadu and Jean-Lou, 1997; Basheer and Najjar, 1998; Ghaboussi and Sidarta, 1998; Sidarta and Ghaboussi, 1998; Tutumluer and Seyhan, 1998; Zhu et al., 1998a; 1998b; Najjar et al., 1999; Penumadu and Zhao, 1999; Kurup and Dudani, 2002; Habibagahi and Bamdad, 2003; Lee et al., 2003; Shahin and Indraratna, 2006; Erzin, 2007; Hung and Ni, 2007; Najjar and Huang, 2007); and tunnels and underground openings (Lee and Sterling, 1992; Moon et al., 1995; Shi et al., 1998; Shi, 2000; Yoo and Kim, 2007). The interested reader is referred to Shahin et al. (2001) where the pre-2001 papers are reviewed in some detail. As geotechnical engineering deals with materials which, by their very nature, can exhibit extreme variability, ANNs are particularly amenable to modelling the complex behaviour of these materials and have generally demonstrated superior predictive performance when compared with traditional methods. The aim of this paper is to highlight and discuss a number of important issues associated with the development of ANN models which require further attention in the future.
2 Brief overview of artificial neural networks Whilst a comprehensive description of ANNs is beyond the scope of this paper, and is provided by several authors (e.g. Hecht-Nielsen, 1990; Maren et al., 1990; Zurada, 1992; Fausett, 1994; Ripley, 1996), in the context of this paper, it is useful to provide a brief overview. ANNs are a form of artificial intelligence which attempt to mimic, in a very simplistic fashion, the behaviour of the human brain and nervous system. Typically, the
1710
architecture of ANNs consists of a series of processing elements (PEs), or nodes, that are usually arranged in layers: an input layer, an output layer and one or more hidden layers, as shown in Figure 1. x
f(I j
0
x1
Ij
w j0 w j1
. . .
. .
x2
. . .
w j2 Weights
Hidden Input
w jn
Output
yj
Sum
Output path Processing Element
xn Figure 1. Typical structure and operation of an ANN model. (After Maier and Dandy, 1998). The input from each PE in the previous layer xi is multiplied by an adjustable connection weight wji. At each PE, the weighted input signals are summed and a threshold value θj is added. This combined input Ij is then passed through a non-linear transfer function f(.) to produce the output of the PE yj. The output of one PE provides the input to the PEs in the next layer. This process is summarised in Eqns (1) and (2) and illustrated in Figure 1.
Ij =
∑w
ji x i
+θ j
y j = f (I j )
summation
(1)
transfer
(2)
The propagation of information in an ANN starts at the input layer where the input data are presented. The network adjusts its weights on the presentation of a training data set and uses a learning rule to find a set of weights that will produce the input/output mapping that has the smallest possible error. This process is called ‘learning’ or ‘training.’ Once the training phase of the model has been successfully accomplished, the performance of the trained model needs to be validated using an independent testing set. The main steps involved in the development of an ANN, as suggested by Maier and Dandy (2000), are illustrated in Figure 2. Several of these steps are discussed in some depth in the following section.
Choice of model architecture
Choice of performance criteria
Connection type (e.g. feedforward, feedback) Degree of connectivity (e.g. fully connected) Number of layers Number of nodes per layer (trial & error, constructive or pruning methods)
Training Speed Processing speed during recall Prediction accuracy
Choice of data sets
Choice of stopping criteria
Number of data sets (e.g. two, three, holdout method) Method for data division
Fixed number of iterations Training error Cross-validation
Data pre-processing
Choice of optimization method Scaling Transformation to normality Removal of non-stationarities
Local first order (e.g. back-propagation) Local second order (e.g. Levenberg-Maquardt , Conjugate Gradient) Global (e.g. simulated annealing, genetic algorithm)
Choice of model inputs
Validation Choice of variables Choice of lags
Figure 2. The main steps involved in ANN model development. (After Maier and Dandy, 2000). As described above, ANNs learn from data examples presented to them and use these data to adjust their weights in an attempt to capture the relationship between the model input variables and the corresponding
1711
outputs. Consequently, ANNs need no prior knowledge regarding the nature of the relationship between the input and output variables. This is one of the main benefits of ANNs when compared with most empirical and statistical methods.
3 Issues requiring further attention in the future In the context of modelling environmental systems, Maier and Dandy (2000; 2001) highlighted the lack of guidance when developing ANN models and identified several issues that need to be considered. These include data transformation, the determination of appropriate model inputs and network geometry, the optimisation of connection weights and validation of model performance. These issues are equally relevant in the development and application of ANN models in geotechnical engineering, and interested readers are referred to these papers for a detailed treatment on each of these topics. Figure 2, presented earlier, provides a clear summary of the ANN development process recommended by Maier and Dandy (2000). Shahin et al. (2004b) provided guidance in relation to the second step of choice of data sets, using a geotechnical engineering example, and recommended the use of three, statistically consistent but independent data sets, one for each of training, testing and validation. Key issues which have received recent attention and also require further research in the future include developing approaches that: (i) ensure the development of robust models; (ii) increase model transparency and enable knowledge to be extracted from trained ANNs; (iii) improve extrapolation ability; and (iv) deal with uncertainty. Each of these is treated below.
3.1 Model robustness Kingston et al. (2005b) stated for “ANNs are to become more widely accepted and reach their full potential…, they should not only provide a good fit to the calibration and validation data, but the predictions should also be plausible in terms of the relationship modelled and robust under a wide range of conditions.” In addition, the authors stated that while ANNs validated against error alone may produce accurate predictions for situations similar to those contained in the training data, they may not be robust under different conditions unless the relationship by which the data were generated has been adequately estimated. Hence, it is also important to assess the relationship that has been modelled in the validation of an ANN, rather base it on an error measure alone. Shahin et al. (2005c) presented a geotechnical engineering case study of settlement data of shallow foundations on granular soils using two different ANN models. Both models were developed using the same software (NeuralWare, 1997), model parameters and architecture (5 inputs, one hidden layer with two nodes, and a single output), except that they were optimised with different sets of random starting weights. It can be seen from Table 1 that both models perform very well when assessed against traditional measures such as the coefficient of correlation, r; root mean square error, RMSE; and mean absolute error, MAE. In the absence of any further information, one would normally adopt one or other of the two models and use it for predictive purposes within the range of the input data used to train the models. Table 1. Performance of the ANN models developed. Model No.
Data set
1
Training
0.92
Testing
0.94
8.4
5.8
Validation
0.88
12.9
9.8
2
r
RMSE [mm] 10.8
MAE [mm] 7.4
Training
0.94
9.1
6.3
Testing
0.94
9.1
6.8
Validation
0.89
11.8
9.6
Figure 3 shows the results of sensitivity analyses performed to assess the generalisation ability of both models. The results were obtained by adopting the approach used by Goh (1995a), where all input variables, except one, were fixed to the mean values used for training and a set of synthetic data (whose values lie between the minimum and maximum values used for model training), were generated for the single input that was allowed to vary. The synthetic data were generated by increasing their values in increments equal to 5% of the total range between the minimum and maximum values. These input values were then entered into both ANN models and the corresponding outputs obtained. The robustness of the models was then determined by examining how well the predicted outputs, in this case footing settlement, agree with the known underlying physical processes over
1712
the range of inputs examined. It can be seen that the results obtained for Model-1 agree with those that one would expect based on the known physical behaviour of settlement of shallow foundations on granular soils. For example, in Figures 3(a), (b) and (d), there is an increase in the predicted settlement, in a relatively consistent and smooth fashion, as the footing width, footing net applied pressure and footing geometry, respectively, increase. On the other hand, in Figures 3(c) and (e), the predicted settlement decreases, also in a consistent and smooth fashion, as the average SPT 35
40 35
Predicted settlement (mm)
Predicted settlement (mm)
45 Model-1 Model-2
30 25 20 15 10 5
30
Model-1
25
Model-2
20 15 10 5 0
0 0
5
10
15
20
25
30
35
40
0
45
100
200
300
Footing width (m)
500
600
700
(b)
(a) 12
70
Model-1
60
Model-2
Predicted settlement (mm)
80 Predicted settlement (mm)
400
Footing net applied pressure (kPa)
50 40 30 20 10 0
11
Model-1
10
Model-2
9 8 7 6 5
0
5
10 15 20 25 30 35 40 45 50 55 60
1
2
Average SPT blow count
3
4
5
6
7
8
Footing geometry
(c)
(d)
Predicted settlement (mm)
10 Model-1
9
Model-2 8 7 6 5 0
0.5
1
1.5
2
2.5
3
Footing embedment ratio
(e) Figure 3. Results of the sensitivity analysis to test the robustness of the ANN models. (After Shahin et al., 2005c). blow count and footing embedment ratio, respectively, increase. In contrast, it can be seen from Figure 3 that the results obtained for Model-2 have an unexpected shape that is difficult to justify from a physical understanding of footing settlement. For example, there are abrupt changes in the predicted settlement in some instances and no change in predicted settlement for a range of inputs in others.
1713
Shahin et al. (2005c) argued that since cross-validation (Stone, 1994) was adopted during the model development phase and an independent validation set was used to test the predictive ability of both models, the only plausible explanation for the different behaviour exhibited by both models was the connection weights included in each model. Garson (1991) suggested the connection weights be examined as part of the interpretation of ANN model behaviour and, as a result, these are shown in Figure 4. It can be seen that the values of the weights obtained for Model-1 are more consistent than those of Model-2. Some of the weight values for Model-2 are significantly larger than the others, which can often indicate a problem with the model (Bailey and Thompson 1990), and result in erratic behaviour. The large values of the weights of Model-2 generally cause the node activations to be large, and as a result, the nodal outputs can become trapped in the flat spots at the extreme values of the transfer functions used in the hidden and output layers of Model-2.
20 Weight absolute value
18
Model-1 Model-2
16 14 12 10 8 6 4 2 0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Weight No
Figure 4. Bar charts of the weights obtained for both ANN models. (After Shahin et al., 2005c). However, individual weights cannot be associated with any physical meaning. This is in contrast with the relative input contributions, which can be associated with physical meaning and hence the use of such measures can facilitate the assessment of a trained ANN and enable constraints to be applied to the objective function during calibration to direct the search for the optimum weight vector into physically feasible regions of the search space (Kingston et al., 2005b). In their study, Kingston et al. (2005b) adopted the Connection Weight Approach of Olden et al. (2004) to determine the input contributions in order to assess the relationship modelled by the ANNs. The authors concluded that this approach provided the best overall methodology for quantifying ANN input importance in comparison to other commonly used methods, albeit with a few limitations. They went on to state that an improved measure for quantifying input contributions is needed.
3.2 Model transparency and knowledge extraction Closely related to the previous issue, and one of the major criticisms often levelled at ANNs, is their lack of transparency; that is, they are often referred to as ‘black-box’ models because they do not consider nor explain the underlying physical processes. Jain et al. (2004), with respect to hydrological engineering, addressed this issue by examining whether or not the physical processes in a watershed were inherent in a trained ANN rainfallrunoff model. They assessed the strengths of the relationships between the distributed components of the ANN model, in terms of the responses from the hidden nodes, and the deterministic components of the hydrological process, computed from a conceptual rainfall runoff model, along with the observed input variables, using correlation coefficients and scatter plots. They concluded that the trained ANN in fact captured different components of the physical process and a careful examination of the distributed information contained in the trained ANN can inform one about the nature of the physical processes captured by various components of the ANN model. Again in the context of hydrological engineering, Sudheer (2005) performed perturbation analysis to assess the influence of each individual input variable on the output variable and found it to be an effective means of identifying the underlying physical process inherent in the trained ANN. Sudheer and Jain (2004) and Kingston et al. (2006) also addressed this issue of model transparency and knowledge extraction. In terms of knowledge extraction, Shahin et al. (2002b) and Shahin and Jaksa (2005a) expressed the results of the trained ANNs in the form of relatively straightforward equations. This was possible due to the relatively small number of input and output variables, and hidden nodes. More recently, Rezania and Javadi (2007), using genetic programming techniques, improved upon and greatly simplified the formula developed by Shahin et al. (2002b), further enhancing the transparency of the ANN model.
1714
Neurofuzzy applications are another means of knowledge extraction that facilitate model transparency. Examples of such applications in geotechnical engineering include Ni et al. (1996), Shahin et al. (2003a, b, c), Gokceoglu et al. (2004) , Provenzano et al. (2004), Shahin et al. (2005b), and Padmini et al. (2007).
3.3 Extrapolation It is generally accepted that ANNs perform best when they do not extrapolate beyond the range of the data used for calibration (Flood and Kartam, 1994; Minns and Hall, 1996; Tokar and Johnson, 1999). Whilst this is not unlike other models, it is nevertheless an important limitation of ANNs, as it restricts their usefulness and applicability. Extreme value prediction is of particular concern in several areas of civil engineering, such as hydrological engineering, when floods are forecast, as well as in geotechnical engineering when, for example, liquefaction potential and the stability of slopes are assessed. Sudheer et al. (2003) highlighted this issue and proposed a methodology, based on the Wilson-Hilferty transformation, for enabling ANN models to predict extreme values with respect to peak river flows. Their methodology yielded superior predictions when compared with those obtained from an ANN model using untransformed data.
3.4 Uncertainty Finally, a further limitation of ANNs is that the uncertainty in the predictions generated is seldom quantified (Maier and Dandy, 2000). Failure to account for such uncertainty makes it impossible to assess the quality of ANN predictions, which severely limits their efficacy. In an effort to address this, a few researchers have applied Bayesian techniques to ANN training (e.g. Buntine and Weigend, 1991; MacKay, 1992; Neal, 1992; and Kingston et al., 2005a in the context of hydrological engineering; and Goh et al., 2005 with respect to geotechnical engineering). Goh et al. (2005) observed that the integration of the Bayesian framework into the backpropagation algorithm enhanced the neural network prediction capabilities and provided assessment of the confidence associated with the network predictions. Shahin et al. (2005a, b) also incorporated uncertainty in the ANN process by developing a series of design charts expressing the reliability of settlement predictions for shallow foundations on cohesionless soils. Research to date has demonstrated the value of Bayesian neural networks, although further work is needed in the area of geotechnical engineering.
4 Conclusions This paper has presented a brief overview of artificial neural networks (ANNs) and their application in the field of geotechnical engineering. In addition, four issues have been highlighted, namely model robustness, transparency and knowledge extraction, extrapolation, and uncertainty, as areas which require further research attention in the future. Together, improvements in these four areas will greatly enhance the usefulness of ANN models with respect to geotechnical engineering applications.
5 References Abu-Kiefa, M.A. 1998. General regression neural networks for driven piles in cohesionless soils, Journal of Geotechnical and Geoenvironmental Engineering, 124(12), 1177-1185. Agrawal, G., Chameau, J.A., Bourdeau, P.L. 1997. Assessing the liquefaction susceptibility at a site based on information from penetration testing. In Artificial neural networks for civil engineers: fundamentals and applications, N. Kartam, I. Flood, J.H. Garrett, eds., New York (USA), 185-214. Agrawal, G., Weeraratne, S., Khilnani, K. 1994. Estimating clay liner and cover permeability using computational neural st networks. Proc.1 Congress on Computing in Civil Engineering, Washington (USA). Ali, H.E., Najjar, Y.M. 1998. Neuronet-based approach for assessing liquefaction potential of soils, Transportation Research Record No. 1633, 3-8. Bailey, D.L., Thompson, D.M. 1990. Developing neural-network applications, AI Expert, 5(9), 34-41. th
Basheer, I.A., Najjar, Y.M. 1995. A neural-network for soil compaction. Proc., 5 Int. Symp. on Numerical Models in Geomechanics, G. N. Pande and S. Pietruszczak, eds., Balkema, Rotterdam (The Netherlands), 435-440. Basheer, I.A., Najjar, Y.M. 1998. Modeling cyclic constitutive behavior by neural networks: Theoritical and real data. Proc. 12 Engineering Mech. Conf., La Jolla (USA), 952-955.
th
Basheer, I.A., Reddi, L.N., Najjar, Y.M. 1996. Site characterisation by neuronets: An application to the landfill siting problem, Ground Water, 34, 610-617. Cal, Y. 1995. Soil classification by neural-network, Advances in Engineering Software, 22(2), 95-97. Chan, W.T., Chow, Y.K., Liu, L.F. 1995. Neural network: An alternative to pile driving formulas, Computers and Geotechnics, 17, 135-156.
1715
Das, S.K., Basudhar, P.K. 2006. Undrained lateral load capacity of piles in clay using artificial neural network, Computers and Geotechnics, 33, 454–459. Ellis, G.W., Yao, C., Zhao, R. 1992. Neural network modeling of the mechanical behavior of sand. Proc. Engineering Mechanics, ASCE, 421-424. Ellis, G.W., Yao, C., Zhao, R., Penumadu, D. 1995. Stress-strain modelling of sands using artificial neural networks, Journal of Geotechnical Engineering, 121(5), 429-435. Erzin, Y. 2007. Artificial neural networks approach for swell pressure versus soil suction behaviour, Canadian Geotechnical Journal, 44(10), 1215-1223. Fausett, L.V. 1994. Fundamentals neural networks: Architecture, algorithms, and applications, Prentice-Hall, Englewood Cliffs (USA). Ferentinou, M.D., Sakellariou, M.G. 2007. Computational intelligence tools for the prediction of slope performance, Computers and Geotechnics, 34(5), 362-384. Flood, I., Kartam, N. 1994. Neural networks in civil engineering I: Principles and understanding. Journal of Computing in Civil Engineering, 8(2), 131-148. Garson, G.D. 1991. Interpreting neural-network connection weights, AI Expert, 6(7), 47-51. Ghaboussi, J., Sidarta, D.E. 1998. New nested adaptive neural networks (NANN) for constitutive modelling, Computers and Geotechnics, 22(1), 29-52. Goh, A.T.C. 1994a. Nonlinear modelling in geotechnical engineering using neural networks, Australian Civil Engineering Transactions, CE36(4), 293-297. Goh, A.T.C. 1994b. Seismic liquefaction potential assessed by neural network, Journal of Geotechnical and Geoenvironmental Engineering, 120(9), 1467-1480. Goh, A.T.C. 1995a. Back-propagation neural networks for modeling complex systems, Artificial Intelligence in Engineering, 9, 143-151. Goh, A.T.C. 1995b. Empirical design in geotechnics using neural networks, Geotechnique, 45(4), 709-714. Goh, A.T.C. 1995c. Modeling soil correlations using neural networks, Journal of Computing in Civil Engineering, 9(4), 275-278. Goh, A.T.C. 1996a. Neural-network modeling of CPT seismic liquefaction data, Journal of Geotechnical Engineering, 122(1), 70-73. Goh, A.T.C. 1996b. Pile driving records reanalyzed using neural networks, Journal of Geotechnical Engineering, 122(6), 492495. Goh, A.T.C. 2002. Probabilistic neural network for evaluating seismic liquefaction potential, Canadian Geotechnical Journal, 39, 219-232. Goh, A.T.C., Goh, S.H. 2007. Support vector machines: Their use in geotechnical engineering as illustrated using seismic liquefaction data, Computers and Geotechnics, 34(5), 410-421. Goh, A.T.C., Kulhawy, F.H., Chua, C.G. 2005. Bayesian neural network analysis of undrained side resistance of drilled shafts, Journal of Geotechnical and Geoenvironmental Engineering, 131(1), 84-93. Goh, A.T.C., Wong, K.S., Broms, B.B. 1995. Estimation of lateral wall movements in braced excavation using neural networks, Canadian Geotechnical Journal, 32, 1059-1064. Gokceoglu, C., Yesilnacar, E., Sonmez, H., Kayabasi, A. 2004. A neuro-fuzzy model for modulus of deformation of jointed rock masses, Computers and Geotechnics, 31, 375-383. nd
Gribb, M.M., Gribb, G.W. 1994. Use of neural networks for hydraulic conductivity determination in unsaturated soil. Proc. 2 Int. Conf. on Ground Water Ecology, J.A. Stanford and H.M. Valett, eds., Bethesda MD: Amer, Water Resources Assoc., 155163. Habibagahi, G., Bamdad, A. 2003. A neural network framework for mechanical behavior of unsaturated soils, Canadian Geotechnical Journal, 40, 684-693. Hanna, A.M., Morcous, G., Helmy, M. 2004. Efficiency of pile groups installed in cohesionless soil using artificial neural networks, Canadian Geotechnical Journal, 41, 1241-1249. Hecht-Nielsen, R. 1990. Neurocomputing, Addison-Wesley Publ. Co., Boston (USA). Hung, C.-C., Ni, S.-H. 2007. Using multiple neural networks to estimate the screening effect of surface waves by in-filled trenches, Computers and Geotechnics, 34(5), 397-409. Jain, A., Sudheer, K.P., Srinivasulu, S. 2004. Identification of physical processes inherent in artificial neural network rainfall runoff models, Hydrological Processes, 118(3), 571–581. Javadi, A.A., Rezania, M., Mousavi Nezhad, M. 2006. Evaluation of liquefaction induced lateral displacements using genetic programming, Computers and Geotechnics, 33, 222-233. Juang, C.H., Chen, C.J. 1999. CPT-based liquefaction evaluation using artificial neural networks, Computer-Aided Civil and Infrastructure Engineering, 14(3), 221-229. Kim, Y.-S., Kim, B.-T. 2007. Prediction of relative crest settlement of concrete-faced rockfill dams analyzed using an artificial
1716
neural network model, Computers and Geotechnics, 34(6), 423-434. Kingston G.B., Lambert M.F, Maier H.R. 2005a. Bayesian parameter estimation applied to artificial neural networks used for hydrological modeling, Water Resources Research, 41, W12409, doi:10.1029/2005WR004152. Kingston, G.B., Maier, H.R., Lambert, M.F. 2005b. Calibration and validation of neural networks to ensure physically plausible hydrological modelling, Journal of Hydrology 314 (2005) 158–176. Kingston G.B., Maier H.R., Lambert M.F. 2006. A probabilistic method to assist knowledge extraction from artificial neural networks used for hydrological prediction, Mathematical and Computer Modelling, 44(5-6), 499-512. Kung, G.T.C., Hsiao, E.C.L., Schuster, M., Juang, C.H. 2007. A neural network approach to estimating deflection of diaphragm walls caused by excavation in clays, Computers and Geotechnics, 34(5), 385-396. Kurup, P.U., Dudani, N.K. 2002. Neural networks for profiling stress history of clays from PCPT data, Journal of Geotechnical and Geoenvironmental Engineering, 128(7), 569-579. Lee, C., Sterling, R. 1992. Identifying probable failure modes for underground openings using a neural network, Int. Journal of Rock Mechanics and Mining Science & Geomechanics Abstracts, 29(1), 49-67. Lee, I.M., Lee, J.H. 1996. Prediction of pile bearing capacity using artificial neural networks. Computers and Geotechnics, 18(3), 189-200. Lee, S.J., Lee, S.R., Kim, Y.S. 2003. An approach to estimate unsaturated shear strength using artificial neural network and hyperbolic formulation, Computers and Geotechnics, 30, 489-503. Lu, Y. 2005. Underground blast induced ground shock and its modelling using artificial neural network, Computers and Geotechnics, 32, 164-178. Maier, H.R., Dandy, G.C. 1998. The effect of internal parameters and geometry on the performance of back-propagation neural networks: An empirical study, Environmental Modelling & Software, 13, 193-209. Maier, H.R., Dandy, G.C. 2000. Neural networks for the prediction and forecasting of water resources variables: A review of modelling issues and applications, Environmental Modelling & Software, 15(2000), 101-124. Maier H.R., Dandy G.C. 2001. Neural network based modelling of environmental variables: A systematic approach, Mathematical and Computer Modelling, 33(6-7), 669-682. Maren, A., Harston, C., Pap, R. 1990. Handbook of neural computing applications, Academic Press, San Diego (USA). Minns, A.W., Hall, M.J. 1996. Artificial neural networks as rainfall-runoff models. Hydrological Sciences Journal, 41(3), 399-417. Moon, H.K., Na, S.M., Lee, C.W. 1995. Artificial neural-network integrated with expert-system for preliminary design of tunnels th and slopes. Proc. 8 Int. Congress on Rock Mechanics, T. Fujii, ed., Rotterdam: Balkema, 901-905. th
Najjar, Y.M., Ali, H.E., Basheer, I.A. 1999. On the use of neurons for simulating the stress-strain behavior of soils. Proc. 7 Int. Symp. on Numerical Models in Geomechanics, NUMOG VII, Graz (Austria), 657-662. Najjar, Y.M., Ali, H.E. 1998. CPT-based liquefaction potential assessment: A neuronet approach. Geotechnical Special Publication, ASCE, 1, 542-553. Najjar, Y.M., Basheer, I.A. 1996a. Neural network approach for site characterization and uncertainty prediction. Geotechnical Special Publication, ASCE, 58(1), 134-148. Najjar, Y.M., Basheer, I.A. 1996b. Utilizing computational neural networks for evaluating the permeability of compacted clay liners, Geotechnical and Geological Engineering, 14, 193-221. Najjar, Y.M., Basheer, I.A., McReynolds, R. 1996a. Neural modeling of Kansan soil swelling, Transportation Research Record No. 1526, 14-19. Najjar, Y.M., Basheer, I.A., Naouss, W.A. 1996b. On the identification of compaction characteristics by neuronets, Computers and Geotechnics, 18(3), 167-187. Najjar, Y.M., Huang C. 2007. Simulating the stress–strain behavior of Georgia kaolin via recurrent neuronet approach, Computers and Geotechnics, 34(5), 346-361. Nawari, N.O., Liang, R., Nusairat, J. 1999. Artificial intelligence techniques for the design and analysis of deep foundations, Electronic Journal of Geotechnical Engineering, http://geotech.civeng.okstate.edu/ejge/ppr9909/index.html. NeuralWare. 1997. NeuralWorks Predict Release 2.1, NeuralWare Inc., Pittsburgh (USA). Ni, S.H., Lu, P.C., Juang, C.H. 1996. A fuzzy neural network approach to evaluation of slope failure potential. Journal of Microcomputers in Civil Engineering, 11, 59-66. Olden, J.D., Joy, M.K., Death, R.G., 2004. An accurate comparison of methods for quantifying variable importance in artificial neural networks using simulated data, Ecological Modelling, 178(3–4), 389–397. Padmini, D., Ilamparuthi, K., Sudheer, K.P. 2007. Ultimate bearing capacity prediction of shallow foundations on cohesionless soils using neurofuzzy models, Computers and Geotechnics, in press. Penumadu, D. Jean-Lou, C. 1997. Geomaterial modeling using artificial neural networks, Artificial Neural Networks for Civil Engineers: Fundamentals and Applications, ASCE, 160-184. Penumadu, D., Zhao, R. 1999. Triaxial compression behavior of sand and gravel using artificial neural networks (ANN), Computers and Geotechnics, 24, 207-230.
1717
Penumadu, D., Jin-Nan, L., Chameau, J.-L., Arumugam, S. 1994. Rate dependent behavior of clays using neural networks. th Proc. 13 Conf. Int. soc. Soil Mech. & Found. Engrg., New Delhi (India), 4, 1445-1448. Provenzano, P., Ferlisi, S., Musso, A. 2004. Interpretation of a model footing response through an adaptive neural fuzzy inference system, Computers and Geotechnics, 31, 251-266. Rahman, M.S., Wang, J., Deng, W., Carter, J.P. 2001. A neural network model for the uplift capacity of suction caissons, Computers and Geotechnics, 28, 269-287. Rezania, M., Javadi, A.A. 2007. A new genetic programming model for predicting the settlement of shallow foundations, Canadian Geotechnical Journal (in press). Ripley, B.D. 1996. Pattern recognition and neural networks, Cambridge University Press, Cambridge (UK). Rizzo, D.M., Lillys, T.P., Dougherty, D.E. 1996. Comparisons of site characterization methods using mixed data. Geotechnical Special Publication, ASCE, 58(1), 157-179. Romero, S., Pamukcu, S. 1996. Characterization of granular meterial by low strain dynamic excitation and ANN, Geotechnical Special Publication, ASTM-ASCE, 58(2), 1134-1148. Samui, P. 2007. Support vector machine applied to settlement of shallow foundations on cohesionless soils, Computers and Geotechnics, in press. Shahin, M.A., Indraratna, B. 2006. Modeling the mechanical behavior of railway ballast using artificial neural networks, Canadian Geotechnical Journal, 43, 1144-1152. Shahin, M.A., Jaksa, M.B. 2004. Probabilistic assessment of the uncertainty associated with the pullout capacity of marquee th anchors. Proc. 9 Australia New Zealand Conf. on Geomechanics, Auckland (NZ), Vol. 2, 917-921. Shahin, M.A., Jaksa, M.B. 2005a. Neural network prediction of pullout capacity of marquee ground anchors, Computers and Geotechnics, 32(3), 153-163. Shahin, M.A., Jaksa, M.B. 2005b. Modelling the pullout capacity of marquee ground anchors using neurofuzzy technique. Proc. Int. Congress on Modelling and Simulation, MODSIM 2005, Melbourne (Australia), 66-72. Shahin, M.A., Jaksa, M.B. 2006. Pullout capacity of small ground anchors by direct CPT methods and neural networks, Canadian Geotechnical Journal, 43(6), 626-637. Shahin, M.A., Jaksa, M.B., Maier, H.R. 2001. Artificial neural network applications in geotechnical engineering, Australian Geomechanics, 36(1), 49-62. Shahin, M.A., Jaksa, M.B., Maier, H.R. 2002b. Artificial neural network–based settlement prediction formula for shallow foundations on granular soils, Australian Geomechanics, 37(4), 45-52. Shahin, M.A., Jaksa, M.B., Maier, H.R. 2003a. Neurofuzzy networks applied to settlement of shallow foundations on granular th soils. Proc. 9 Int. Conf. on Applications of Statistics and Probability in Civil Engineering, San Francisco (USA), Vol. 2, 1379-1383. Shahin, M.A., Jaksa, M.B., Maier, H.R. 2004a. Invited theme paper: Applications of neural networks in foundation engineering. Proc. e-Conference on Modern Trends in Foundation Engineering: Geotechnical Challenges and Solutions (http://www.ecms.adelaide.edu.au/civeng/staff/pdf/e-Conf_2004.pdf) Shahin, M.A., Jaksa, M.B., Maier, H.R. 2005a. Neural network based stochastic design charts for settlement prediction, Canadian Geotechnical Journal, 42(1), 110-120. Shahin, M.A., Jaksa, M.B., Maier, H.R. 2005b. Stochastic simulation of settlement prediction of shallow foundations based on a deterministic artificial neural network model. Proc. Int. Congress on Modelling and Simulation, MODSIM 2005, Melbourne (Australia), 73-78. Shahin, M.A., Maier, H.R., Jaksa, M. B. 2003b. Neural and neurofuzzy techniques applied to modelling settlement of shallow foundations on granular soils. Proc Int. Congress on Modelling and Simulation, MODSIM 2003, Vol. 4, 1886-1891. Shahin, M.A., Maier, H.R., Jaksa, M.B. 2002a. Predicting settlements of shallow foundations using artificial neural networks, Journal of Geotechnical and Geoenvironmental Engineering, 128(9), 785-793. Shahin, M.A., Maier, H.R., Jaksa, M.B. 2003c. Settlement prediction of shallow foundations on granular soils using B-spline neurofuzzy models, Computers and Geotechnics, 30(8), 637-647. Shahin, M.A., Maier, H.R., Jaksa, M.B. 2004b. Data division for developing neural networks applied to geotechnical engineering, Journal of Computing in Civil Engineering, 18(2), 105-114. Shahin, M.A., Maier, H.R., Jaksa, M.B. 2005c. Investigation into the robustness of artificial neural network models for a case study in civil engineering. Proc. Int. Congress on Modelling and Simulation, MODSIM 2005, Melbourne (Australia), 79-83. Shang, J.Q., Ding, W., Rowe, R.K., Josic, L. 2004. Detecting heavy metal contamination in soil using complex permittivity and artificial neural networks, Canadian Geotechnical Journal, 41, 1054-1067. Shi, J.J. 2000. Reducing prediciton error by transforming input data for neural networks, Journal of Computing in Civil Engineering, 14(2), 109-116. Shi, J., Ortigao, J.A.R., Bai, J. 1998. Modular neural networks for predicting settlement during tunneling Journal of Geotechnical and Geoenvironmental Engineering, 124(5), 389-395. Sidarta, D.E., Ghaboussi, J. 1998. Constitutive modeling of geomaterials from non-uniform material tests, Computers and
1718
Geotechnics, 22(1), 53-71. Sivakugan, N., Eckersley, J.D., and Li, H. 1998. Settlement predictions using neural networks, Australian Civil Engineering Transactions, CE40, 49-52. Stone, M. 1974, Cross-validatory choice and assessment of statistical predictions, Journal of Royal Statistical Society, B 36, 111-147. Sudheer, K.P. 2005. Knowledge extraction from trained neural network river flow models, Journal of Hydrologic Engineering, 10(4), 264-269. Sudheer, K.P., Jain, A. 2004. Explaining the internal behaviour of artificial neural network river flow models, Hydrological Processes, 118(4), 833–844. Sudheer, K.P., Nayak, P.C., Ramasastri, K.S. 2003. Improving peak flow estimates in artificial neural network river flow models. Hydrological Processes, 17, 677–686. Teh, C.I., Wong, K.S., Goh, A.T.C., Jaritngam, S. 1997. Prediction of pile capacity using neural networks, Journal of Computing in Civil Engineering, 11(2), 129-138. Tokar, S.A., Johnson, P.A. 1999. Rainfall-runoff modeling using artificial neural networks. Journal of Hydrologic Engineering, 4(3), 232-239. Tutumluer, E., Seyhan, U. 1998. Neural network modeling of anisotropic aggregate behavior from repeated load triaxial tests. Transportation Research Record 1615, National Research Council, Washington, D.C. Ural, D.N., Saka, H. 1998. Liquefaction assessment by neural networks, Electronic Journal of Geotechnical Engineering, http://geotech.civen.okstate.edu/ejge/ppr9803/index.html. Yoo, C., Kim, J.-M. 2007. Tunneling performance prediction using an integrated GIS and neural network, Computers and Geotechnics, 34, 19-30. Young-Su, K., Byung-Tak, K. 2006. Use of artificial neural networks in the prediction of liquefaction resistance of sands, Journal of Geotechnical and Geoenvironmental Engineering, 132(11), 1502-1504. Zhao, H.-B. 2007. Slope reliability analysis using a support vector machine, Computers and Geotechnics, in press. Zhou, Y., Wu, X. 1994. Use of neural networks in the analysis and interpretation of site investigation data, Computers and Geotechnics, 16, 105-122. Zhu, J.H., Zaman, M.M., Anderson, S.A. 1998a. Modeling of soil behavior with a recurrent neural network.” Canadian Geotechnical Journal, 35(5), 858-872. Zhu, J.H., Zaman, M.M., Anderson, S.A. 1998b. Modelling of shearing behaviour of a residual soil with recurrent neural network. Int. Journal of Numerical and Analytical Methods in Geomechanics, 22(8), 671-687. Zurada, J.M. 1992. Introduction to artificial neural systems, West Publ. Co., St. Paul (USA).
1719