JOURNAL OF GEOPHYSICAL RESEARCH: EARTH SURFACE, VOL. 119, 138–152, doi:10.1002/2013JF002981, 2014
A statistical framework to quantify spatial variation in channel gradients using the integral method of channel profile analysis Simon M. Mudd,1 Mikaël Attal,1 David T. Milodowski,1 Stuart W. D. Grieve,1 and Declan A. Valters 1 Received 12 September 2013; revised 5 December 2013; accepted 6 December 2013; published 6 February 2014.
[1] We present a statistical technique for analyzing longitudinal channel profiles. Our
technique is based on the integral approach to channel analysis: Drainage area is integrated over flow distance to produce a transformed coordinate, χ, which has dimensions of length. Assuming that profile geometry is conditioned by the stream power law, defined as E = KAmSn where E is erosion rate, K is erodibility, A is drainage area, S is channel gradient, and m and n are constants, the slope of a transformed profile in χ-elevation space should reflect the ratio of erosion rate to channel erodibility raised to a power 1/n; this quantity is often referred to as the channel steepness and represents channel slope normalized for drainage area. Our technique tests all possible contiguous segments in the channel network to identify the most likely locations where channel steepness changes and also identifies the most likely m/n ratio. The technique identifies locations where either erodibility or erosion rates are most likely to be changing. Tests on a simulated landscape demonstrate that the technique can accurately retrieve both the m/n ratio and the correct number and location of segments eroding at different rates where model assumptions apply. Tests on natural landscapes illustrate how the method can distinguish between spurious channel convexities due to incorrect selection of the m/n ratio from those which are candidates for changing erodibility or erosion rates. We also show how, given erosion or uplift rate constraints, the method can be used to constrain the slope exponent, n. Citation: Mudd, S. M., M. Attal, D. T. Milodowski, S. W. D. Grieve, and D. A. Valters (2014), A statistical framework to quantify spatial variation in channel gradients using the integral method of channel profile analysis, J. Geophys. Res. Earth Surf., 119, 138–152, doi:10.1002/2013JF002981.
1.
Introduction
[2] Fluvial incision is one of the fundamental geomorphic processes that drive the evolution of eroding landscapes. Early geomorphologists recognized that channel gradients vary systematically with drainage area and reasoned that all else being equal, steeper channels should erode more rapidly [e.g., Davis, 1899]. This reasoning has motivated much research into bedrock rivers because it raises the possibility of using channel gradients to infer erosion rates. Howard and Kerby [1983] proposed that in bedrock rivers, erosion should be proportional to the power the river expends on its bed. Stream power depends on the flux of water in the channel, which is a function of drainage area, and the channel gradient. Thus, in areas of uniform lithology and climate a characteristic relationship between slope and drainage area can be expected for a given erosion rate. Later workers found 1
School of GeoSciences, University of Edinburgh, Edinburgh, UK.
Corresponding author: S. M. Mudd, School of GeoSciences, University of Edinburgh, Drummond St., Edinburgh EH8 9XP, UK. (
[email protected]) ©2013. The Authors. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. 2169-9003/14/10.1002/2013JF002981
that the geometry of bedrock river profiles is broadly consistent with a class of models similar to that first suggested by Howard and Kerby [1983] and extended by others [e.g., Seidl and Dietrich, 1992; Howard, 1994]. Subsequently, so-called slope-area analysis has frequently been used to reveal patterns of erosion and, by inference, tectonic activity in channel networks [Stock and Montgomery, 1999; Kirby and Whipple, 2001; Kirby et al., 2003; Snyder et al., 2003; Wobus et al., 2006; Miller et al., 2007; DiBiase et al., 2010; Kirby and Ouimet, 2011; Kirby and Whipple, 2012]. [3] Topographic data are, however, inherently noisy. The noise arises from uncertainties in the data collection [e.g., Hodgson and Bresnahan, 2004; Eckert et al., 2005] but also from the natural heterogeneity that characterizes geomorphic systems: Channel beds are often uneven and local topographic features (e.g., steps in a river profile due to fractures in bedrock; accumulation of boulders at the base of a rock face) may not reflect regional erosion rates. Because slope is the spatial derivative of topography, slope data contain yet more noise, so analysis of channel slopes requires a degree of smoothing and data binning that may reveal broad patterns in the landscape [e.g., Wobus et al., 2006] but will fail to identify subtler spatial and/or temporal variations in erosion rates. Perron and Royden [2013], building on earlier work by Royden et al. [2000], offered an elegant solution to this problem by using an integral transformation of the river
138
MUDD ET AL.: CHANNEL SEGMENT FITTING
profile’s horizontal coordinate into a variable χ (chi), which has units of distance but accounts for longitudinal variations in drainage area. Plots of channel elevations against this transformed length variable (or “chi plots;” Perron and Royden [2013]) can be used to reveal the steepness of river reaches without having to calculate channel slopes. This form of channel analysis, which Perron and Royden [2013] have called the integral method, has a number of attractive features. Critically, as the integral method does not involve differentiation, there is a significant reduction in noise relative to slope-area analysis. Consequently, estimates of channel steepness using the integral method have reduced uncertainty, and there is no longer a requirement to log-bin the data, which permits analysis of much finer details in the channel network (see Perron and Royden [2013] for a full discussion of the advantages and potential drawbacks of this method). The integral method has the potential to provide precise quantitative information about erosion rates and their spatial and temporal variations over areas for which topographic data are available, which could then be used to gain insight into the spatial and temporal variations in the controlling external forces, namely tectonics and climate. To achieve this, however, a transparent, robust, and objective method for discriminating between river reaches with different normalized steepness values must exist. In the following sections we (i) outline the theoretical principles of the integral method and discuss its application in steady state and transient and/or heterogeneous landscapes, (ii) develop a statistical framework with which to apply this method, and (iii) demonstrate the utility of this technique using a series of examples based on both numerical simulations and analysis of real landscapes.
2. The Integral Method of Channel Profile Analysis
2.1. Chi Analysis: Steady State Example [5] To introduce the integral method of channel analysis, we consider a simplified system in which K and U are constant in space and time, and uplift is balanced by erosion such that the elevation of the channel does not change in time (a so-called steady state channel): 1=n dz ¼ U AðxÞm=n : dx K
[6] Equation (2) implies that plotting slope (|dz/dx|) against drainage area in log-log space should produce a straight line with a gradient-m/n (often referred to as the channel concavity or concavity index, e.g., Whipple and Tucker [1999]), whereas the vertical position of the line in the plot should be indicative of the ratio of uplift to erodibility (U/K) raised to the power 1/n. This relationship has been exploited by numerous authors to infer both erodibility [e.g., Stock and Montgomery, 1999] and tectonic uplift rates (see reviews by Wobus et al. [2006] and Kirby and Whipple [2012]). However, the noise inherent in slope data requires smoothing and averaging of slope-area data before they can be used to make inferences about erosion or tectonics [Wobus et al., 2006]. In addition, slope-area data tend to show discontinuities, due to the fact that drainage area does not increase steadily downstream: Abrupt increases in drainage area at tributary junctions cause clustering of the data and create large uncertainties on regressions. An alternative approach is to avoid differentiation and thus eliminate the main source of noise [e.g., Royden et al., 2000; Pritchard et al., 2009; Perron and Royden, 2013]. One means of retaining drainage area and elevation information is to separate and then integrate equation (2) [e.g., Royden et al., 2000; Harkins et al., 2007; Perron and Royden, 2013]:
[4] The evolution of a channel experiencing uplift and erosion driven by stream power can be stated as follows [e.g., Howard and Kerby, 1983; Whipple and Tucker, 1999]: n ∂z ∂z ¼ U ðx; t Þ þ K ðx; t ÞAm ; ∂t ∂x
(1)
where z (dimensions length; dimensions are hereafter denoted in brackets and abbreviated as [L]ength, [M]ass, and [T]ime) is the elevation of the channel, x [L] is a longitudinal coordinate, A [L2] is the drainage area, U [L T1] is the uplift rate, and K [T1 L(12m)] is an erodibility coefficient which encapsulates the influence of climate, lithology, and other factors. The exponents m and n are empirically derived coefficients, although values of n have been proposed based on the mechanics of bedrock incision, ranging between 2/3 and 7/3 [Whipple et al., 2000]. Both U and K may be functions of space and time, and K may also be a function of sediment supply [e.g., Sklar and Dietrich, 2004; Cowie et al., 2008; Hobley et al., 2011] and incorporate the influence of an erosion threshold and channel width adjustment [e.g., Snyder et al., 2003; Lague, 2010; Attal et al., 2011]. The stream power approach, its limitations, and alternatives are reviewed by Lague [2013]. The integral method devised by Perron and Royden [2013] (see also Royden and Perron [2013]) is based on the assumption that the evolution of the studied channels can be described by equation (1).
(2)
U ∫dz ¼ ∫ KAm
1=n dx:
(3)
[7] In a system in which K and U are constant in space and time, equation (3) may be integrated upstream from some base level at xb to arrive at zðxÞ ¼ zðxb Þ þ
U KAm 0
1=n χ;
(4a)
where χ¼
x ∫xb
A0 m=n dx; AðxÞ
(4b)
and A0 is a reference drainage area, introduced to ensure the integrand in equation (4b) is dimensionless. The transformed coordinate, χ, has dimension of length and the elevation, z(x), is a linear function of χ in this situation (K and U constant in space and time) according to equation (4a). Thus, a channel that obeys equation (2) will appear as a straight line in a chi plot, and the slope of the line will be a function of the uplift to erodibility ratio (U/K) raised to the power 1/n. Perron and Royden [2013] pointed out several additional features that make chi plots an attractive alternative to slope-area plots. Importantly, the coordinate χ depends on the m/n ratio,
139
Channels lie on same χ curve, but not linear.
Elevation
Channels linear, but not collinear.
Elevation
Elevation
MUDD ET AL.: CHANNEL SEGMENT FITTING Channels linear and collinear.
χ
χ
χ
Figure 1. Sketch illustrating examples of linearity or nonlinearity of channels in χ-elevation space. Channel networks with spatially homogenous uplift and erodibility that are at steady state should have channels that are not just linear but collinear.
2.2. Chi Plots in Transient and Spatially Heterogeneous Landscapes [8] Royden and Perron [2013] demonstrated that chi plots could be used to analyze steady state channels in landscapes characterized by uniform uplift and erodibility, both spatially and temporally. They demonstrated further that the coordinate transformation of equation (4b) can be used to find families of analytical solutions for transient channel behavior. Using these solutions, Royden and Perron [2013] demonstrated the presence of what they termed “slope patches,” which are mathematical entities that migrate upstream in χ-elevation space. If n = 1, slope patches move upstream in χ space at a fixed rate, regardless of the uplift rate, and the χ-elevation plot is composed entirely of linear segments (Figure 2). If n > 1, slope patches move quicker if they are generated by an increase in uplift rate; if n < 1 slope patches generated by a decrease in uplift rate move faster. This behavior is consistent with theoretical work [Rosenbloom and Anderson, 1994; Weissel and Seidl, 1998; Tucker and Whipple, 2002] which showed that in river systems obeying equation (2), the celerity (C) of perturbations migrating up the channel is C ¼ KAm S n1 :
(5)
[9] For n = 1, the celerity of perturbation is independent of slope (and thus of uplift rate); if n > 1, the celerity of the perturbation is greater on steeper slopes and vice versa. When n ≠ 1 there exists a possibility that an upstream slope patch may move faster than its downstream neighbor, resulting in nonlinear sections Royden and Perron [2013] termed
“stretch zones.” In addition, when n ≠ 1 downstream patches may move quicker than upstream patches leading to a loss of information along the channel network; Royden and Perron [2013] termed these entities “consuming knickpoints.” In the case of time invariant but spatially varying uplift, Royden and Perron [2013] demonstrated that transformed channel profiles are made up of piecewise linear segments which reflect local uplift rates.
1800
t = 0.5Ma after uplift increase to 0.9 mm yr
1600 1400
Elevation (m)
which is not known a priori but has been fixed in most slopearea analyses through the use of a reference concavity index: A value of 0.45–0.5 is frequently used, representing the concavity that river profiles should theoretically achieve should erosion rates be proportional to the specific stream power or shear stress [e.g., Wobus et al., 2006]. However, Perron and Royden [2013] proposed two independent criteria to estimate and test the most likely m/n ratio for a steady state channel experiencing uniform uplift and incising rocks with uniform erodibility (equation (2)): (i) individual channels in the network should be linear in χ-elevation space and (ii) all channels in the network should be collinear in χ-elevation space (Figure 1). Therefore, the best fitting m/n value can be found by iterating through a range of m/n values and maximizing the R2 value of the elevation data in χ space for the main stem channel and then for its tributaries.
1200 t = 0Ma after uplift increase to 0.9 mm yr
1000 800 t = 0.5 Ma
600 400 Slope patch representing the 1Ma long phase of uplift = 0.6 mm yr
200 0 0
50
100
150
200
Figure 2. Illustration of upstream migration of channel segments in χ-elevation space. Profiles generated using the CHILD model (see text) with n = 1. A channel in steady state with an uplift rate of 0.3 mm yr1 experiences two consecutive step increases in uplift rate 1 Ma apart, to 0.6 and then 0.9 mm yr1. The black curves show chi profiles at the time of the second step change and every 0.1 Ma thereafter. The blue, green, and red lines are parallel to portions of the channel adjusted to the different uplift rates 0.3, 0.6, and 0.9 mm yr1, respectively. The dashed red line is parallel to the solid red line: It is drawn to illustrate how a line crossing the intersection of the segments formed by uplift rates of 0.6 and 0.3 mm yr1 is parallel to the segment formed by an uplift rate of 0.9 mm yr1. This demonstrates that when n = 1, channel segments representing a given phase during which external forcing was constant (or “slope patches,” see schematic chi plot in inset) maintain a constant length and slope in χ-elevation space. When n does not equal to 1, information can be lost as segments consume other segments, but as long as segments are not “erased,” they will retain some steepness information that can, in principle, be related to the uplift rates or climatic conditions under which they formed.
140
MUDD ET AL.: CHANNEL SEGMENT FITTING
b. = 0.3 = 0.99213
c. = 0.5 = 0.95290
maximum
at
= 0.3
a.
Figure 3. (a) Plot shows the R2 values of a linear fit to channel data generated by the transient simulation described in Figure 2 as a function of the m/n ratio. (b) Plot shows the profile in χ-elevation space with an m/n ratio of 0.3, which has the highest R2 for a linear fit to the channel data. (c) The true m/n is 0.5, and the χ- elevation profile for this m/n ratio is shown. 2.3. Need for a Statistical Approach to Create Chi Plots From Topographic Data [10] To construct chi plots, the m/n ratio must be assigned. Perron and Royden [2013] suggest performing linear regression for a range of m/n ratios and selecting the ratio with the minimum value of R2. However, as discussed in the previous section, the results of Royden and Perron [2013] imply that, in transient landscapes, spatial or temporal changes in uplift rates could lead to channel profiles that are piecewise linear. This hinders attempts to find the most likely m/n value. For example, a channel that has undergone stepwise increases in uplift rates will be composed of two or more segments and will thus appear convex up (with a low R2 value) in the chi plot with the true m/n ratio (Figure 3). Perron and Royden [2013] also suggested that in simple transient cases the collinearity of tributaries can be used to identify the correct m/n ratio, highlighting the need for statistical tests that go beyond minimization of an R2 value. Thus, if our aim is to identify slope patches that are indicative of spatial or temporal changes in boundary conditions (uplift or climate) or rock properties, we need an objective, statistically robust, and reproducible method for doing so. In this contribution we present one such method.
3. A Statistical Method to Identify the Most Likely Combination of Segments in χ- Elevation Space [11] The method begins with a channel network for which elevation (z), flow distance (x), and drainage area (A) have been extracted. Our approach tests every possible combination of linear channel segments to find the most likely m/n ratio and the most likely piecewise linear fit to channel profile data in χ-elevation space. 3.1. Integration of the Channel Profile [12] The first step is to calculate the χ coordinate by integrating equation (4b) for each node in the channel network. For integration, we use the rectangle rule because increases in drainage area are focused at tributary junctions; the rectangle rule better captures the stepped nature of changes in drainage area along channels than other methods such as the trapezoid rule. 3.2. Segment Fitting Algorithm [13] Our method relies on an algorithm that separates data into the most likely combination of piecewise linear segments.
We favor a technique that uses piecewise fitting over other approaches such as spline fitting [e.g., Hansen and Kooperberg, 2002], which joins segments that share end nodes, otherwise called “knots.” We do not tie segments because we want a technique that can capture the two types of knickpoints described by Kirby and Whipple [2012] as “vertical step” knickpoints, where there is a step between adjacent nodes in the channel network, as well as those they describe as “slope break” knickpoints where channel steepness differs above and below the knickpoint. 3.2.1. Partitioning the Data [14] The first step in the segment fitting algorithm is to “partition” the data. Partitioning involves finding every combination of positive integers that sum to some integer N (e.g., 5 can be partitioned as 5; 4 + 1; 3 + 2; and 3 + 1 + 1). In our application, N is the number of nodes in χ-elevation space that we use for a particular channel, channel segment, or channel network. We use the partitioning algorithm described in Skiena [1990] and implemented by Frank Ruskey (http://theory.cs.uvic.ca/ inf/nump/NumPartition.html). To allow the user to define a minimum number of nodes in a segment, we have modified the Skiena [1990] algorithm to reject partitions that have less than the minimum number of nodes. The user is allowed to select a minimum number of nodes in a segment because we reason that allowing segments made up of two nodes, while always resulting in a perfect fit, will not be useful in reconstructing channel dynamics. We explore the sensitivity of results to the minimum segment length in sections 4.1 and 4.3. [15] The partition function, often denoted by P(N), describes how many partitions there are for a given integer N. This function is highly nonlinear. Example values of P(N) are P(10) = 42; P(100) = 190,569,292; P(200) = 397,299,029,388 [Andrews, 1998]. The nonlinearity of the partition function precludes analysis of more than a few hundred nodes due to computational expense. We address this impediment to sampling every node in the network by using a Monte-Carlo sampling approach; each iteration samples a subset of nodes at reduced computational expense. This procedure is repeated until all nodes are sampled multiple times (section 3.2.2). [16] Partitioning alone does not sample all possible channel segments, because segments could be arranged in different orders. Consider a channel with five nodes, further constrained by a minimum segment length of two; the possible partitions are therefore 5 and 3 + 2; however, the possible segment
141
MUDD ET AL.: CHANNEL SEGMENT FITTING
combinations are 5, 3 + 2, and 2 + 3. We therefore permute all partitions to sample all possible segment combinations. [17] A linear regression is performed on each segment. The slope, intercept, residuals, and R2 of each segment are stored. Royden and Perron [2013] showed that under certain conditions (e.g., transience caused by changes in uplift rate with n ≠ 1), linear segments of the channel profile in the chi plot will be connected by nonlinear segments (or “stretch zones”). Rather than performing both a linear and nonlinear fit on every segment, we perform only a linear fit. The rationale for this is that (i) we do not wish to overconstrain the fit, (ii) selecting the incorrect m/n ratio results in nonlinear sections in χ-elevation space, and we do not wish to “reward” nonlinearity as this could lead to poor constraints on the m/n ratio, and (iii) there is a simple relationship between the linear segments’ slope and erosion rate in χ-elevation space which does not apply in the stretch zones [see Royden and Perron, 2013, equation (16)]. [18] As the number of nodes in the channel grows, there is an ever increasing likelihood that some segments will be sampled more than once. For example, consider a channel with nine nodes and a minimum segment length of three. The partitions would then be 9; 6 + 3; 5 + 4; and 3 + 3 + 3. The segment starting on the first node and ending on the third node will be used in the partition 3 + 6 as well as the partition 3 + 3 + 3. To reduce computational expense, we precede partitioning by regressing all segments and storing the slope, intercept, R2, and a maximum likelihood estimator (MLE) of the regressed χ-elevation data in sparse matrices where the row represents the starting node of the segment and the column represents the finishing node of the segment. 3.2.2. Determining the Most Likely Combination of Segments [19] The partition algorithm finds every possible combination of segment lengths, and permuting through these segment lengths results in every possible combination of segments along a thinned channel network. Because the partition function is highly nonlinear, it is necessary to limit the number of nodes considered by the segment fitting algorithm. The algorithm allows the user to select the maximum number of nodes to be analyzed in any given iteration of the segment fitting algorithm. Whereas partitioning 100 nodes requires 109 linear regressions, partitioning 200 nodes requires ~1012 regressions. Beyond 200 nodes the computational expense is prohibitive; we recommend setting the maximum number of nodes (or “target nodes”) to 80–150. This creates a problem of resolution however because, in high-resolution data, channels may be made of more than 150 nodes; furthermore, we want to avoid a situation in which the resolution of the analysis depends on the number of nodes in the channels. We address this through an iterative approach which, for computational efficiency, samples a subset of data in each iteration but through iterations samples every data point in the channel network. [20] We use what we call a “skipping” algorithm to select a subset of nodes from a data set. First, the number of nodes in the data set is compared to the target nodes to determine the mean number of nodes that must be “skipped” between two selected nodes to retain the correct number of target nodes in the thinned data set. For example, if the data set has 300 nodes and target nodes = 100, then for every node retained, the algorithm will skip two nodes. Skipping of data, however, is done probabilistically, although the first and last
nodes are always selected. After a node is selected, the code selects the next number of nodes to skip from a uniform distribution centered on the mean skip value with a range twice the mean skip value; for example, if the mean skip value is 2, skip values will be chosen in the range 0 to 4. [21] The skip interval is determined by the number of nodes in the data set, but the user selects a “target skip” value which is the final resolution of the data at which the user wants to perform the analysis. Our iterative approach breaks the data set into smaller and smaller pieces until the mean skip (determined by dividing the number of data nodes by the number of target nodes) matches that of the target skip value. [22] To iterate to the target skip value, all the data are thinned using the probabilistic skipping algorithm, and these data are then passed to the segment fitting algorithm. The fitting algorithm then determines the most likely segments for the data and returns the segment number associated with each node (Figure 4). This is recorded for each node in the analysis, and the process is repeated. Because the skipping algorithm is probabilistic, a different subset of nodes is analyzed in each iteration. After many iterations, each node will be associated with a data set comprising the number of the segment to which it will have been attributed in each iteration (Figure 4). The mean of these numbers is then calculated for each node. For nodes that lie unambiguously in a particular segment (for example, the first node is always in the first segment), the mean segment number will be an integer. Between unambiguous segments lie noninteger mean segment numbers. If the mean skip is greater than the target skip, the data are “broken” at natural segment breaks at nodes nearest to half an integer value (i.e., 1.5, 2.5, and 3.5). After breaking the data, each smaller data set is passed back to the segment fitting algorithm. This process is repeated until all of the “breaks” are analyzed at the resolution determined by the user-defined target skip value. [23] Because a probabilistic approach is used to select data nodes repeatedly, each node records the slope and intercept of the segments in χ-elevation space, as well as the segment number and R2 value. The mean and standard deviation of these data for each node is then calculated. Of these, the slope in χ-elevation space, which we call Mχ, is perhaps the most useful because it is related to the ratio between erosion rate (E) and erodibility (K): Mχ ¼
E KA0 m
1=n :
(6)
[24] The intercept in χ-elevation space, which we call Bχ, can be used to identify steps in the channel network (e.g., segments that have the same Mχ but are separated by a vertical waterfall). Thus, by constraining both Mχ and Bχ we are able to identify two types of knickpoints: a change in Bχ with no change in Mχ above and below the knickpoint indicates a vertical step knickpoint, whereas a change in Mχ reveals a slope break knickpoint. [25] The next step is to calculate an MLE for each segment: " # N 1 ðri Þ2 MLE ¼ ∏ pffiffiffiffiffi exp 2 ; 2σ i¼1 2π σ
(7)
where σ [L] is the standard deviation of the measured elevations. Many topographic data sets contain metadata
142
MUDD ET AL.: CHANNEL SEGMENT FITTING
a.
(i.e., fracture patterns in rocks, bedrock sedimentary bedding, and boulders in the channel) that adds roughness to the channel profile and clouds the landscape-scale geometry of the river profile which is indicative of regional tectonics, lithology, or climate. The choice of σ will therefore depend on the application, and it is therefore crucial that users of the software report the value of this parameter. In section 4.1 we report on how much of an impact changing the value of σ can have on the number of segments selected by the software. Note that the MLE is multiplicative so that its value for a sequence of segments will be the product of the MLE of the individual segments. [26] For a fixed number of segments, the most likely combination of these segments simply has the lowest MLE value. However, increasing the number of segments always results in a higher likelihood. Using the MLE alone is thus insufficient to differentiate models with different numbers of segments. [27] An established statistical method for selecting a model that balances goodness of fit against model complexity is the Akaike Information Criterion (AIC) [Akaike, 1974]:
Elevation
Raw Data
0
5
10
15
20
25
30
Node number (with equally spaced χ)
0
5
10
15
20
25
node 30
node 27
node 21
node 17
node 12
Elevation
b.
30
Node number (with equally spaced χ)
c.
0
5
10
15
20
25
30
Node number (with equally spaced χ)
d.
AICc ¼ AIC þ
5
10
15
20
node 30
node 21
node 7
node 17
Elevation 0
25
(8)
where k is the number of parameters. Each linear segment has two parameters, the slope of the line and its intercept, so k = 2s where s is the number of segments. The model with the minimum value of the AIC is deemed the best model. Burnham and Anderson [2002] suggest that a correction should be made to the AIC if there are a finite number of samples; we therefore employ a corrected AICc [Hurvich and Tsai, 1989] to select the best model:
node 30
node 27
node 21
node 7
node 12
Elevation
AIC ¼ 2k 2 lnðMLEÞ;
30
Node number (with equally spaced χ)
Figure 4. Cartoon depicting the skipping process and the selection of segment numbers. (a) A hypothetical data set with 31 data points is shown. (b–d) Plots show data after skipping. In this case the mean skip = 1, so skip values are randomly selected between 0, 1, and 2. In this hypothetical example, there are three iterations of the skipping algorithm. We highlight several nodes with yellow dots. Node 0 and node 7 are always in the first segment, so the mean segment number of these nodes will be = 1. The mean segment number of node 12 will be 1.5 since it is in segment 1 in the first iteration and segment 2 in the second iteration. Node 17 has an integer mean segment number, whereas nodes 21, 27, and 30 all have noninteger mean segment numbers. describing the uncertainty of the elevations, which can guide the choice of σ. However, we also suggest considering “geomorphic noise,” that is, variation in the channel profile due to heterogeneity on scales of meters to tens of meters
2k ðk þ 1Þ ; N k1
(9)
where N is the sample size, or in this case the number of channel nodes used in the analysis. The AICc penalizes over fitting in small data sets, and Burnham and Anderson [2002, p. 445] suggest using AICc when N/k < 40. Because of the restriction on the number of nodes due to the computational expense of partitioning, almost any fitting of χ-elevation data will meet this criterion, and AIC converges to AICc when N is large so in the rare instances when the criterion fails (i.e., N/k > 40) AICc and AIC will take similar values. Thus, there is little reason to risk biasing the result by using the AIC rather than AICc. Figure 5 illustrates this partitioning process. We do not penalize segments whose fitted downstream node is lower than the elevation of the fitted upstream node; this is because if sections are connected by stretch zones, a linear fit to a nonlinear stretch zone can result in segments that do not always exhibit end nodes of monotonically increasing elevations. [28] The choice of standard deviation (σ) in equation (7) will affect the output of the fitting algorithm. For a fixed number of segments, the most likely segments will always be the same, regardless of the value of σ. That is, the numeric value of the MLE will change but the most likely combination will not. Modifying the numeric value of the MLE, however, changes the second term of equation (8) but does not affect the first term of this equation. The result is that greater
143
MUDD ET AL.: CHANNEL SEGMENT FITTING
Figure 5. Illustration of the partitioning process (minimum partition length = five nodes in this case). values of σ result in greater “penalties” for extra segments and in general the lower the value of σ, the more segments will be selected. 3.3. Selection of the m/n Ratio [29] To differentiate between profiles generated using different m/n ratios, we use the MLE and AICc information. Again, the MLE is not sufficient to discriminate fitted channel segments against each other because different m/n ratios may generate different numbers of best fit segments. We therefore store the cumulative AICc values for all m/n values. The m/n ratio that produces the minimum cumulative AICc value is considered the most likely. We perform two tests to determine the most likely m/n ratio: a collinearity test and a test for individual channels. [30] In the collinearity test, the data from the entire channel network are pooled. This pooled data set undergoes the breaking processes described in section 3.2.2, but the AICc values reported are cumulative for all the data rather than on a channel by channel basis. Our testing based on numerical simulations (see section 4.1) indicates this test can better identify the m/n ratio when analyzing data based on numerical simulations. Our tests on channels in natural environments (e.g., section 4.2) indicate however that there can be pitfalls to this method. If a tributary is hanging, for example, the data from this tributary will be lying above the main
stem’s data in χ-elevation space (by definition); because the algorithm will seek the m/n value that best superimposes data from the tributary and the main stem, the collinearity test may give spurious results for the best fit m/n ratio. [31] The second test calculates the cumulative value of the AICc for each individual channel. In this analysis, all tributaries are extended from their source to the drainage outlet. The rationale for this is that transient signals of uplift rate are translated through the channel network with celerity proportional to drainage area (see equation (5)) [Whipple and Tucker, 1999]. This means that any upstream propagation of uplift signals will slow down significantly as it enters small tributaries: steepened reaches in small tributaries will consequently often have limited spatial extent. The result is that if tributary channels are extended only to the junction with the main stem, the algorithm will not be able to find the short transient segment. The segment can be found, however, if the analyzed channel data include the tributary data and the main stem data downstream of the confluence. [32] As described in section 3.2.2, the mean AICc values are a result of many iterations of the segment fitting algorithm. The code also returns the standard deviation of the AICc values for each m/n ratio, for the cumulative data (used in the collinearity test), and for each individual channel. The variability is due to the algorithm being performed on a different subset of data, chosen at random by the skipping algorithm,
144
MUDD ET AL.: CHANNEL SEGMENT FITTING
The second is from a landscape in the Apennines in Italy that has been subject to a previously constrained change in tectonic uplift rate [e.g., Whittaker et al., 2008].
Figure 6. Flowchart for the segment fitting algorithm. during individual iterations. The most likely m/n ratio is that with the minimum AICc value, but the standard deviation of the AICc values can be used to infer the range of plausible m/n values for the channel network. We recommend that authors interpret the m/n ratios with AICc values that are within 1 standard deviation of the minimum AICc values as being “plausible.” Figure 6 shows a flowchart of the segment fitting algorithm. We have made the code available at the Community Surface Dynamics Modeling System website: http://csdms.colorado.edu/wiki/Model:Chi_analysis_tools.
4.
Examples
[33] We illustrate the technique using three examples. The first example uses topography generated from the ChannelHillslope Integrated Landscape Development (CHILD) model [Tucker et al., 2001]. This allows us to test whether the technique can identify known segments and retrieve a known m/n ratio. The next two examples are from natural landscapes. The first is from the Appalachian Plateau in Pennsylvania, U.S., where the combination of relatively homogenous sandstone bedrock and tectonic quiescence is thought to result in steadily eroding channels [Hack, 1960].
4.1. A Numerical Example: A Simulated Three-Stage Acceleration in Uplift [34] Our first example deploys our chi analysis method on a landscape simulated with the CHILD landscape evolution model. We have chosen to run the method on a numerical simulation because it allows us to enforce equation (1) and to prescribe both the uplift history and m/n ratio. Our simulation involves three development stages. The model domain is a 6 × 6 km2 with an average node spacing of 50 m. Throughout the simulation, one boundary is set to a fixed elevation (defined as z = 0 m), and the other boundaries are no flux boundaries. We use model parameters adapted from Attal et al.’s [2011] study of the evolution of catchments responding to tectonics in the Apennines using the CHILD model. Erosion is purely detachment limited: Erosion rates are calculated according to equation (1); furthermore, there is neither threshold for erosion nor adjustment in channel geometry [Attal et al., 2011]. The parameters used in the model and the calculation of erosion rates are detailed in the Appendix A. The hillslope transport coefficient is set to 0.001 m2 yr1, channel erodibility (K) is set to 1.67 × 106 yr1, m is set to 0.5 and n is set to 1 (Appendix A). In the first stage, a fixed uniform uplift rate of 0.3 mm yr1 is prescribed. This uplift rate is maintained until the model domain achieves dynamic steady state, that is, erosion rates match the uplift rate of 0.3 mm yr1 throughout the entire landscape. At this stage, rivers are characterized by smooth concave up river profiles. The uplift rate is then increased to 0.6 mm yr1 for 1 Ma and then further increased to 0.9 mm yr1 for a further 0.5 Ma. In response to the successive increases in uplift rate, steepened reaches develop upstream of the base level, separated from the upstream reaches by profile convexities that migrate upstream through time, representing the propagation of erosional waves that is diagnostic of detachment-limited systems [e.g., Whipple and Tucker, 1999; Attal et al., 2011]. [35] The CHILD model runs on a triangulated irregular network (TIN), and channel profiles are extracted directly from the TIN. We use equation (1) to calculate erosion rates along the channels at the end of the run and find three sections with distinct erosion rates separated by transition zones (Figure 7a). We infer that these transition zones are present because CHILD uses an explicit numerical method (i.e., it uses model topographic parameters to calculate topographic changes at the following time step; Tucker et al. [2001]), which is susceptible to numerical dispersion [e.g., Fagherazzi et al., 2002]. At the end of our simulation, the main stem river profile features two prominent convexities delineating three reaches (Figure 7b): upstream of the uppermost convexity, the river profile is adjusted to the initial uplift rate of 0.3 mm yr1 and has not responded to any of the increases in uplift rate; between the two convexities, the river profile has adjusted to the uplift rate of 0.6 mm yr1; downstream of the lowermost convexity, the steep reach is in equilibrium with the final uplift rate of 0.9 mm yr1. The two convexities are migrating upstream in concert at a celerity given by equation (5). Because n = 1 in the model, the celerity is independent of slope and thus of uplift rate so the convexities never meet, and the intermediate segment is
145
MUDD ET AL.: CHANNEL SEGMENT FITTING
Figure 7. Results from CHILD model runs. (a) Erosion rates calculated along the channel (b, c) denoted by a star using equation (1), which is prescribed in the model. Figure 7b denotes channel longitudinal profiles colored by the extracted Mχ values. Figure 7c shows Mχ values as a function of χ; each tributary is denoted by a different color. The tributary depicted in Figure 7a (star in Figure 7b) is plotted in black. (d) Chi plot of fitted and transformed data of all channels. In Figures 7a, 7c, and 7d the red- and green-shaded bars denote the transition zones between domains adjusted to the different phases of uplift. The horizontal red, green, and blue lines in Figure 7c represent the Mχ values corresponding to erosion rates of 0.9 mm yr1, 0.6 mm yr1, and 0.3 mm yr1, respectively, calculated using equation (6). Figures 7b–7d were generated with a mean skip value of 1, a minimum segment length of 12, 100 target nodes, and a σ value of 3 m. preserved until the wave of erosion has propagated all the way to the top of the network. [36] Results from segment extraction with the best fit m/n ratio of 0.5 (see next paragraph) demonstrate that the algorithm is able to detect the sections of the channel network responding to different uplift rates (Figures 7b–7d). In addition to being able to identify distinct channel segments, the algorithm is also able to retrieve Mχ values extremely close to the theoretical values predicted by equation (6) (Figure 7c). [37] Our segment fitting algorithm is able to tightly constrain the most likely m/n ratio for these simulations. Figures 8a and 8b show that plausible values of the m/n ratio (those that have AICc values within 1 standard deviation of the m/n ratio with the minimum AICc value) clustered tightly around 0.5 (in Figures 8a and 8b, m/n ratios of 0.475–0.525 are considered plausible). The method correctly identifies the m/n ratio of 0.5 as the most likely. 4.1.1. Sensitivity Analysis and Recommended Default Parameters [38] We performed a sensitivity analysis on the effect of changing the number of target nodes, the minimum segment length, the value of σ and the mean skip value using results from the CHILD simulations. We varied σ from 1 to 9 m, the mean skip value from 1 to 4, the minimum segment length from 6 to 20 nodes, and the number of target nodes from 60 to 100. Parameters generated similar results over a wide range of values for the main stem. The minimum
segment length and number of target nodes, however, can significantly alter computational time. Running the method on the CHILD data with target nodes = 100 and minimum segment length = 20 took a few minutes, whereas the run with target nodes = 100 and minimum segment length = 5 took 4 days on our Linux servers (i.e., 5–6 orders of magnitude longer). This strong dependence on computation time is due to the nonlinearity of the partition function. Sensitivity analysis shows that (i) increasing σ, the minimum segment length or the mean skip decreases the number of segments; (ii) one consequence of reducing the number of segments is that tributaries will be fit with single segment and information about changing steepness will be lost; and (iii) the number of target nodes can have a large influence on computational time but does not have a commensurate effect on the extracted segments. The least conclusive results were obtained when high values of mean skip (≥ 3) or minimum segment length were used. [39] In every analysis performed by the method on the CHILD model runs, the most likely m/n ratio has been calculated as 0.5. The best fit m/n ratio has similarly retuned 0.5 as the most likely m/n ratio for all individual channels except in the cases where minimum segment length or mean skip parameters have resulted in