1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44
Predicting changes in housing prices from two transport investments: A case study of two fixed link projects in rural areas in Norway Øyvind Lervik Nilsen, Corresponding Author Department of Civil and Transport Engineering Norwegian University of Science and Technology/Rambøll Norway 7491 Trondheim, Norway/Fjordgaten 13, 3103 Tønsberg Tel: 0047-45280334 Email:
[email protected] Petter Arnesen Department of Safety and mobility, SINTEF S.P. Andersens veg 5, 7031 Trondheim, Norway Tel: 0047-98083886 Email:
[email protected] Stig Nyland Andersen Norwegian University of Science and Technology/Norwegian Public Roads Administration 7491 Trondheim, Norway/Askedalen 4, 6863 Leikanger, Norway Tel: 0047-90958569 Email:
[email protected] María Díez Gutiérrez Department of Civil and Transport Engineering Norwegian University of Science and Technology 7491 Trondheim, Norway Tel: 0047-47735050 Email:
[email protected]
Trude Tørset Department of Civil and Transport Engineering Norwegian University of Science and Technology 7491 Trondheim, Norway Tel: 0047-97038649 Email:
[email protected]
(working paper) Word count:
6645 words including references 5663 words excluding references Abstract count: 194 words
Submission Date
26.03.2018
Nilsen, Arnesen, Andersen, Tørset
1 2
2
ABSTRACT The first signs of increased attractiveness for residential location and population
3
growth can usually be seen in a growth in housing prices. Several studies have shown a
4
positive relationship between accessibility and housing prices often using hedonic regression
5
techniques. However, most of these studies have mainly focused on urban areas and have not
6
tested the model’s predictions abilities. This paper will contribute in filling this knowledge
7
gap by investigating the relationship between housing prices and accessibility in a more rural
8
context with special emphasis on islands connected by fixed links. By developing spatially
9
and non-spatially dependent regression models based on approximately 8,500 actual housing
10
sales on the western coast of Norway we test the model’s ability to fit the data set and their
11
prediction capabilities. Our results show that the spatially dependent model outperforms the
12
non-spatially dependent model for model fitting, while we cannot conclude the same for the
13
model’s predictive power. Furthermore, the models show that accessibility to the main
14
regional centre seems to be an important part also in a more rural setting. This suggests that
15
rural areas that get improved accessibility via transport investments might expect a growth in
16
housing prices.
17 18
Keywords: hedonic regression modelling, housing prices, accessibility, fixed link projects
Nilsen, Arnesen, Andersen, Tørset
1
3
1. INTRODUCTION
2
One main concern with transport investments is to promote infrastructure projects that
3
contribute to regional cohesion and economic growth. Many large infrastructure projects have
4
been proposed on the very idea that they reduce distances between settlements and trigger
5
economic growth making the areas affected more attractive for residential and firm location.
6
However, it varies substantially to what degree these projects have had a positive impact on
7
population and economic development (Louw, Leijten, & Meijers, 2013; Melo, Graham, &
8
Brage-Ardao, 2013; Nilsen, Díez Gutiérrez, Andersen, & Tørset, 2017; Tveter, Welde, &
9
Odeck, 2017). Hence, a deeper understanding of how transport investments affect firm and
10
residential location is essential. This is a relationship that is seldom taken into account in
11
today’s transport planning (Acheampong & Silva, 2015; Waddell, Ulfarsson, Franklin, & Lobb,
12
2007), even though it is of crucial importance in understanding the long-term effects for
13
transport investments and of increased interest of policy makers (Vickerman, 2016).
14
The first signs of changes in the attractiveness for residential location are usually seen
15
in the housing market (Wegener, 2004) . Where people choose to live is highly dependent on
16
the real-estate market (Coppola & Nuzzolo, 2011; Ho & Hensher, 2014; Waddell, 2011). The
17
real-estate market is influenced by both internal and external housing characteristics. The
18
proximity to job opportunities, public amenities, schools etc. are important factors in explaining
19
the selling price for a house and the differences between them. The relationship between
20
housing prices and accessibility has been verified in a growing number of empirical studies
21
often using hedonic regression techniques (Efthymiou & Antoniou, 2013; Á. Ibeas, Cordera,
22
Dell’Olio, & Coppola, 2013; Liv Osland, 2010). Hedonic models assume that price is
23
determined by the internal and external characteristics of the goods being sold (Rosen, 1974).
24
In a real estate market context, this method is appealing as it deals with heterogeneous goods
25
such as houses. Even though the methodology is widely used, it has some challenges when it
Nilsen, Arnesen, Andersen, Tørset
4
1
comes to spatial autocorrelation and spatial heterogeneity which could lead to biased or
2
inefficient parameters (LeSage & Pace, 2009). These issues are often dealt with by using spatial
3
econometric techniques such as the simultaneous autoregressive model (SAR), spatial Durbin
4
model (SDM) (A. Ibeas, Cordera, Dell’Olio, Coppola, & Dominguez, 2012; L Osland &
5
Thorsen, 2013) or geographically weighted regression (GWR) (Gadziski & Radzimski, 2015).
6
Several empirical studies have investigated the relationship between changes in
7
accessibility and the housing market (Andersson, Shyr, & Fu, 2010; Geng, Bao, & Liang, 2015;
8
Mulley, Ma, Clifton, Yen, & Burke, 2016). However, most of these studies are using models at
9
a city level mainly focusing on fitting a given data set of housing transactions into the model,
10
and not on their predicting performance. A knowledge gap exists in terms of using these kinds
11
of models in a more rural context and testing their predictive performance after a transport
12
investment is made. These two issues are essential in the modelling of changes in the housing
13
market due to large infrastructure projects, and will be addressed in this paper.
14
The ability to estimate the increase in real-estate prices from changes in accessibility is
15
important for several reasons. Firstly, by developing models for areas that are heterogenous in
16
nature, we contribute in the research on using hedonic regression models which mainly have
17
been used within homogenous areas (L Osland & Thorsen, 2013) . Secondly, if a proposed
18
project significantly increases the real-estate prices in one area, it might trigger development
19
effects that might alter the residential location pattern. When these changes are used to
20
recalculate the residential location pattern, this might be important input in transport analysis,
21
which is highly dependent on accurate population forecasts. Today, population prognoses are
22
often treated as an exogenous variable in the transport models. Thus, they are normally not
23
depedent of changes in accessibility from transport projects (Ortúzar & Willumsen, 2011). This
24
has been the case for transport analysis done on fixed link projects along the Western Coast of
25
Norway (Tørset, Nilsen, & Spilsberg, 2013). Thirdly, by testing the models’ predictive power
26
on actual transport projects we shed light on their actual use. This is an issue that is scarcely
Nilsen, Arnesen, Andersen, Tørset
1
5
studied in the literature, but is essential for practitioners.
2
This paper empirically investigates the relationship between accessibility and housing
3
prices for two heterogenous counties on the western coast of Norway. By developing spatially
4
and non-spatially dependent regression models we shed light on the importance of accessibility
5
in a more rural context. How well the models explain the changes in housing prices is then
6
tested for two large infrastructure projects and for the counties they are located in. Fixed link
7
projects that replace ferries are chosen as infrastructure projects as these projects represent a
8
substantial change in accessibility. Thus, potential changes in housing prices should be
9
expected.
10
The structure of the paper is as follows. Section 2 describes the area studied. Section 3
11
describes the methodology, including the estimation of the accessibility measure and the
12
hedonic regression models used in the analysis. In Section 4, the results are presented while
13
Section 5 discusses the findings. Section 6 summarizes the main findings from our
14
investigation.
15 16 17
2. STUDY AREA AND DATA
18
This paper focuses on two large infrastructure projects on the western coast of Norway.
19
The cases are both fixed link projects, involving subsea tunnels replacing ferry services that
20
connect islands to the mainland. Figure 1 shows the two cases studied, including the fixed links.
Nilsen, Arnesen, Andersen, Tørset
6
1 2 3
Figure 1. Location of cases studied, (a) Eiksundsambandet, (b) Finnfast.
Table 1 describes the characteristics of the connected islands before and after the
Nilsen, Arnesen, Andersen, Tørset
7
1
opening of the fixed links, while Table 2 describes the two counties. All places experienced a
2
growth in traffic volumes to/from the connected islands.
3
Table 1: Characteristics of the connected islands
Opening year Travel time to the nearest city before the fixed link (including waiting time for ferry) Travel time to the nearest city after the fixed link Ferry ticket price Toll
Eiksundsambandet 2008 70 min.
Finnfast
50 min.
50 min.
EUR 7 EUR 0 (EUR 8 until 2014)
EUR 8 EUR 17 (EUR 23 until 2011) 1500
Number of jobs that can be reached 16,500 within 45 min, before the fixed link Number of jobs that can be reached 22,000 within 45 min, after the fixed link Traffic the year before the opening 800 (AADT) Traffic five years after the opening 2200 (AADT) Average square meter price before 690 EUR in 2004-2007 (N=103) opening (single unit house) Average square meter price after opening (single unit house) 970 EUR in 2008-2011 (N=158)
4
2009 90 min
150,000 400 700 980 EUR in 2006-2009 (N=26) 1230 EUR in 2010-2013 (N=46)
To estimate the changes in housing prices from these two fixed link projects, regression
5 6
models were developed for the counties the fixed links were located in.
7
Table 2: Description of the characteristics for the five counties analyzed
County 1 (Rogaland)
County 2 (Møre og Romsdal)
Number of inhabitants
470,000
265,000
Workforce in employment
237,000
132,000
4.7 %
3.4 %
9,400 km2
15,100 km2
Unemployment rate Size 8 9
The two counties, Møre og Romsdal and Rogaland, have relatively low population
10
density (see Table 2). This is especially the case for Møre og Romsdal which only has a few
11
cities in the range of 10,000 – 50,000 inhabitants.
Nilsen, Arnesen, Andersen, Tørset
8
1 2
3
3. METHOD
4
In a hedonic modelling framework, it is essential to distinguish between two categories
5
of attributes. One is the physical attributes of the dwelling and the other is related to the spatial
6
structure of the neighborhood and accessibility to job and recreational opportunities. The
7
housing price will be described by a function of these two factors. In general, the hedonic price
8
function can be written as:
9 10
Pit = f (Dit , A it )
11 12 13
Equation 1: The general form of the hedonic price function
Pit is the selling price of house i in year t. Dit is the physical attributes of the house i while Ait is its spatial characteristics.
14 15
3.1. Accessibility measure
16
The accessibility measure should generally reflect the degree of economic and
17
employment opportunities in one area as a function of the distance to and from other nearby
18
areas. It should address how accessibility decay with distance, suggesting that areas with many
19
activities in close proximity have a higher weight than areas further away. Based on Hansen
20
(1959) the generalized cost of travel often appears through a negative exponential function in
21
a measure of accessibility. Numerous accessibility measures have been developed after
22
Hansen's (1959) pioneering work. In this article an accessibility measure is used based on the
23
work of Cascetta (2001) and Nuzzolo & Coppola (2007).
24
Nilsen, Arnesen, Andersen, Tørset
1
{
Ai = å E aj 1 * exp[a 2 * GCij]
9
}
j
2
Equation 2: Accessibility measure
3
where the accessibility, Ai, of a given zone i is a proxy of the opportunity of reaching
4
activities in other nearby zones j. Ej is the number of jobs within zone j, GC is the generalized
5
travel cost of travelling between i and j and α1 and α2 are parameters to be estimated. The
6
general travel costs are calculated using time weights from the Norwegian time study (Farideh,
7
Flügel, Samstad, & Killi, 2010) between all areas. The travel time includes simplified
8
calculations of congestion and waiting time for ferries (half the ferry frequency). These
9
calculations were made in the Norwegian transport model (Tørset, Malmin, Bang, & Bertelsen,
10
2013). By doing a logarithmic transformation of Equation 2 and using number of trips from
11
zone i to j as a proxy for accessibility the parameters α1 and α2 can be estimated using ordinary
12
least squares (Coppola, Ibeas, Dell’Olio, & Cordera, 2013) (see equation 3).
13 14 15
log(TRIPSij) = a1 * log(Ej) + a 2 * GCij Equation 3: Estimation of α1 and α2
16
The estimated values of the parameters were for α1 0,18 and for α2 -0,04. The number
17
of trips from zone i was obtained from a transport model. The accessibility measure suggested
18
in Equation 2 should in principle address the importance of the distance to the central business
19
district and more local subcentres/towns, in house price calculation. However, very often these
20
two variables are included in addition to the accessibility measure (see (Liv Osland, 2010)) as
21
these centres might include other factors not captured by the more general accessibility measure
22
(presence of universities, hospitals, cultural facilities etc.).
23
The accessibility measure was calculated for 3452 basic statistical units for two
24
counties on the Western coast of Norway (see Figure 2). Each of these units included the number
Nilsen, Arnesen, Andersen, Tørset
10
1
of jobs within different sectors and number of inhabitants. Statistics Norway produced this data
2
set. By combining this information with the generalized cost from the Norwegian transport
3
model, it was possible to estimate accessibility measures for each of these statistical units.
4 5
Figure 2: Accessibility for Rogaland and Møre og Romsdal
Nilsen, Arnesen, Andersen, Tørset
11
1
The housing data used was on a coordinate level. Therefore, the housing data were
2
aggregated to match the transport model zones used to calculate the accessibility measures.
3
The same calculated accessibility for all housing sales within each statistical unit was used in
4
the development of the regression models.
5 6
3.2. The data used in developing the model
7
In total around 2,800 housing transactions were used to estimate the model for Møre
8
og Romsdal and 5,700 for Rogaland. These sales were from 2007. The model’s predictive
9
performance was then tested on approximately 2,500 housing transactions from 2010 in Møre
10
og Romsdal and 5,100 in Rogaland. The housing data were collected from the Land Registry
11
and Cadastre, and the data consisted of actual housing sales in the open market. Their detailed
12
level made it possible to conduct disaggregated level analysis of each sale. The sales data
13
consist of various housing specific variables and x,y coordinates and street name. For each sale,
14
the internal and external characteristics were obtained (see Table 3).
15 16
Table 3: Localization and housing attributes
External housing characteristics ACC GC_SUBC GC_RegCen Internal housing characteristics PRICE SIZE GARAGE AGE BATH ROOMS LOT_SIZE HOUSING_TYPE
17
Definition Accessibility measure The generalized cost in travelling to nearest subcentre/town by car The generalized cost in travelling to nearest regional centre by car Selling price of house Living area measured in square meteres Number of garages Age of the building Number of bathrooms Number of rooms Lot size measured in square meteres 1=single unit dwelling, 2= semidetached house, 3=Row house, 4=apartment
Nilsen, Arnesen, Andersen, Tørset
1
12
3.3. Hedonic regression models
2
Spatial data are characterized by spatial autocorrelation and spatial heterogeneity
3
(LeSage & Pace, 2009). While spatial heterogeneity refers to the violation of the assumption
4
in ordinary least squares (OLS) about constant variance in the residuals, spatial autocorrelation
5
is the violation of the assumption about uncorrelated error terms. In a housing market context,
6
cross-sectional data from housing transactions often have these issues to overcome, in order to
7
get unbiased estimates of the parameters (Liv Osland, 2010).
8
To evaluate the inclusion on spatial dependency within the modelling of housing prices
9
we will test three models. A simple multinomial linear regression model (MLR) with zero mean
10
Gaussian independent noise , used as a baseline and two spatial simultaneous autoregressive
11
lag models; the autoregressive error model (SAR) and the spatial durbin model (SDM) (LeSage
12
& Pace, 2009). For the regression model we choose the following functional form after some
13
exploratory investigation of the data sets:
14 15 16 17 18
log(
_
)= log(
+ =
+
log(GC_RegCen) + )+
log(
sHouse +
)+
log(
log(
)+
_
detHouse + β
)+
log(
_
ℎ+
)+
19 20
Equation 4: functional form of the regression model
21
ROOMS were taken out of the model due to high correlation with SIZE. Furthermore, the
22
variable GARAGE had to be taken out because many observations were missing.
23
The simultaneous autoregressive model (SAR) and the spatial Durbin model (SDM)
24
are often proposed as suitable approaches to deal with spatial autocorrelation. By adding the
25
average-neighbor values of the dependent variable to the linear model the SAR model deals
Nilsen, Arnesen, Andersen, Tørset
1
13
with the issue of spatial autocorrelation (see Equation 5).
2 3 4
log(
)=
(
+
) + ε
Equation 5: The Simultaneous Autoregressive model (SAR)
5
is the internal and external housing characteristics similar to the linear model and
6
is the spatial weights matrix that specifies the spatial dependence structure between the
7
houses. In this case, houses within the same statistical unit, and neighboring statistical units,
8
are assumed to be spatially dependent. Finally,
9
parameter. If ρ is zero, there is no spatial dependency between the observations. Then normal
10
MLR can be used. Positive ρ values indicate positive dependency. In the spatial Durbin model,
11
the spatial lagging of the explanatory variable,
12
means that the characteristics of neighboring houses could have an influence on the price of
13
each house in the sample.
is referred to as the spatial correlation
( )
, is also included (see Equation 6). This
14 15 16 17
log(
)=
(
)+
W is the spatial matrix and ( )
19
while ρ and
20
model is equal to the SAR model.
18
+
( )
+
Equation 6: The Spatial Durbin model (SDM)
is the internal and external housing characteristics
are spatial autocorrelation parameters. In contrast to the SAR model there is
also a spatial correlation parameter for the independent variables. Thus, if
( )
= 0, the Durbin
21 22
3.4. Evaluation of model predictions
23
To evaluate the differences between the spatially independent regression model and the spatial
24
simultaneous autoregressive lag models a measure of the predicted residual error sum of
Nilsen, Arnesen, Andersen, Tørset
14
1
squares (PRESS) (see equation 6) was estimated. Both the independent and the dependent
2
spatial regression models are based on housing transactions from 2007. These models are used
3
to predict housing transaction prices for 2010 for the same two counties.
4
(pricei - · pricei ) 2 PRESS = åi=1 n n
5 6 7
Equation 7: Predicted error sum of squares (PRESS)
8
is the predicted price for house and pricei is the actual selling price for house
9
i. The ability to predict changes in housing prices for areas that experience a significant change
10
in accessibility is also tested by isolating PRESS values for the two fixed link projects presented
11
in section 2. In additional to the PRESS values, the percentage of estimations with prediction
12
error less than 30 % is presented.
13 14 15
4. RESULTS
16
The results from the three different hedonic regression models estimated are shown in TABLE
17
4 and TABLE 5. TABLE 4 is for the county of Møre og Romsdal while TABLE 5 is for the
18
county of Rogaland. The sign of the coefficient (β) indicate whether the variable will have a
19
positive or negative impact on housing prices. For example, an increase in the generalized
20
travel cost from the regional centre would decrease the selling price of a house, when all other
21
variables are held constant. This is because the coefficient for RegCen is negative. ρ indicates
22
spatial autocorrelation in residuals while the Moran I’s values indicate clustering or dispersion
23
of the residuals in the model. A statistical significantly positive Moran I value, typically implies
24
a tendency of clustering of similar values, while a negative Moran I value means dispersion
Nilsen, Arnesen, Andersen, Tørset
15
1
and values close to 0 mean that the residuals are random. The AIC is the Akaike information
2
criterion, and it indicates the quality of the model and is helpful in model selection. Lower AIC
3
values are usually preferred over high values.
4 5 6
Nilsen, Arnesen, Andersen, Tørset
1
16
TABLE 4: Estimated parameters for MLR model, SAR and SDM model, Møre og Romsdal
β 12.65**
MLR S.E 0.08
t-value 159.9
β 7.91**
Constant Internal housing characteristics Log of housing age -0.14** 0.01 -14.24 -0.12** Log of lot size 0.01** 0.01 3.60 0.01** Log of housing size 0.39** 0.02 22.45 0.38** Number of bathrooms 0.03** 0.01 4.29 0.03** Apartment (reference group) Single unit house -0.11** 0.03 -4.39 -0.12** Detached house -0.05** 0.01 -2.88 -0.06** lag. of housing age lag. of lot size lag. of housing size lag. of number of bathrooms lag. of single unit house lag. of detached house External housing characteristics log of accessibility 0.14** 0.00 9.54 0.08** log of generalized cost to regional centre -0.09** 0.00 -7.79 -0.09** log of generalized cost to subcentre/town -0.03** 0.00 -3.16 -0.03** lag. of accessibility lag. of generalized cost to Regional centre lag. of generalized cost to subcentre/town ρ Number of observations, N 2282 Global Morans’I 12.33** AIC 1561 ** Significant at 99 % confidence interval, * significant at the 95 % confidence interval 2
SAR S.E 0.558
z-value 14.18
β 7.13**
SDM S.E 0.73
z-value 9.70
0.01 0.00 0.02 0.01 0.02 0.02
-12.25 3.53 22.01 4.04 -4.84 -3.35
-0.13** 0.01** 0.38** 0.03** -0.11** -0.05** 0.03 -0.01 -0.03 -0.03 -0.08 -0.03
0.01 0.00 0.02 0.01 0.03 0.02 0.03 0.01 0.15 0.04 0.09 0.06
-10.72 3.97 21.82 4.48 -3.87 -2.23 0.81 -1.69 -0.20 -0.82 -0.85 -0.43
0.01 0.01 0.01
-7.65 -3.76 -3.76
0.12** -0.08 -0.05 -0.05 0.02 0.03
0.05 0.06 0.03 0.05 0.08 0.04 0.41** 2282 -0.82 1494
2.43 -1.48 -1.39 -0.87 0.30 0.79
0.35** 2282 1.115 1492
Nilsen, Arnesen, Andersen, Tørset
1
17
TABLE 5: Estimated parameters for MLR model and SAR model, Rogaland
β 12.44**
MLR S.E 0.08
t-value 159.9
β 4.96**
Constant Internal housing characteristics Log of housing age -0.06** 0.00 -7.61 -0.02** Log of lot size 0.01 0.00 -0.128 -0.01 Log of housing size 0.53** 0.01 48.89 0.50** Number of bathrooms -0.01 0.01 -0.87 -0.01 Apartment (reference group) Single unit house -0.28** 0.02 -17.91 -0.26** Detached house -0.13** 0.01 -12.12 -0.13** lag. of housing age lag. of lot size lag. of housing size lag. of number of bathrooms lag. of single unit house lag. of detached house External housing characteristics log of accessibility 0.06** 0.00 9.14 0.06** log of generalized cost to regional centre -0.11** 0.00 -23.24 -0.06** log of generalized cost to subcentre/town -0.02** 0.00 -3.32 0.01 lag. of accessibility lag. of generalized cost to regional centre lag. of generalized cost to subcentre/town ρ Number of observations, N 5688 Residuals Morans’I 40.70** AIC 3351 ** Significant at 99 % confidence interval, * significant at the 95 % confidence interval 2
SAR S.E 0.33
z-value 15.22
β 4.25**
SDM S.E 0.36
z-value 11.76
0.01 0.00 0.01 0.01 0.01 0.01
-2.00 -0.98 45.20 -1.25 -17.06 -12.66
-1.48 -0.01** 0.51** -0.01** -0.23** -0.10** 0.02 0.01 -0.23** 0.05** 0.05** 0.01
0.01 0.00 0.01 0.01 0.02 0.01 0.02 NA 0.02 0.02 0.02 NA
-1.71 -5.42 47.86 -2.25 -15.61 -10.18 -1.05 NA -12.03 2.44 2.41 NA
0.01 0.01 0.01
6.68 -10.50 -0.02
0.10** 0.04 -0.01** -0.06** -0.08 0.02
0.03 0.00 0.01 0.03 0.04 0.01 0.62** 5688 4.63 2876
3.97 0.94 -2.27 -2.26 -1.78 1.79
0.51** 5688 5.78** 2958
Nilsen, Arnesen, Andersen, Tørset
18
1
From TABLE 4 and TABLE 5 the MLR and SAR model gives theoretically correct signs and
2
all coefficients are significant except number of bathrooms and lot size in the model developed
3
for Rogaland. The SDM model show some less significant variables with some unexpected
4
signs. The low and insignificant Morans’I values indicates that the SAR and SDM models
5
explain all the spatial dependency for Møre og Romsdal while there seem to be some spatial
6
dependence left in the SAR model for Rogaland. The results suggest that we need to introduce
7
the extra spatial term in the SDM model to reduce the Moran’s test to be insignificant for
8
Rogaland.
9
In terms of AIC values, the SAR model outperforms the MLR for both counties.
10
Regarding the interpretation of the results it is important that the SAR model considers the lag
11
of the dependent variable, as simultaneous feedback exists because changes in one observation
12
affect the neighboring observations. Therefore, in the case of SAR and SDM models, the
13
estimated parameters should be interpreted as a state of equilibrium in the modelling process.
14
The estimated parameters for SAR models showed similar signs as the MLR model. Moreover,
15
the variables found significant in the SAR model remained the same as in the MLR model.
16
In terms of AIC values, the SDM model outperforms the MLR for both counties. It
17
also seems to have lower AIC values than the SAR model for Rogaland but not for Møre og
18
Romsdal. However, the SDM model has less significant variables than the other two models.
19
The spatial lag of the independent variables was in most cases insignificant. Interestingly, these
20
findings are similar to Ibeas et al. (2012) who also found insignificantly lagged independent
21
variables when using the SDM model for the metropolitan area of Santander.
22
The SDM model does not suggest an increase in model fit for the county of Møre og
23
Romsdal and just slightly for Rogaland, and the lack of significant variables in the SDM model
24
makes the SAR model more suitable for the prediction calculations. Therefore, the SAR model
25
is preferred over the SDM in the future analysis of model predictions in this paper.
26
Nilsen, Arnesen, Andersen, Tørset
19
1 2
4.1. How well the models predict the housing sales
3
To evaluate the performance of the MLR and SAR model, we have considered the PRESS
4
statistics on data from 2010. We have considered the whole data set consisting of the whole
5
county, and a subset of the data, corresponding to all housing sales on the islands that have
6
been connected to the main land with a tunnel between the years of the two data sets (2007 and
7
2010). The PRESS statistics are shown in Table 6.
8
Table 6: Prediction calculations, PRESS values and prediction error in % for 2007
Model estimation 2007 Møre og Romsdal (PRESS value) Prediction error less than.. 30 % 20 % 10 % 5% Rogaland (PRESS value) Prediction error less than.. 30 % 20 % 10 % 5% Fixed link projects Eiksundsambandet (PRESS value) Prediction error less than.. 30 % 20 % 10 % 5% Finnfast (PRESS value) Prediction error less than.. 30 % 20 % 10 % 5% 9
MLR
SAR
4.87*10^11
4.72*10^11 (-3.1 %)
72 % 56 % 35 % 19 % 5.85*10^11
72 % 57 % 35 % 20 % 5.41*10^11 (-7.5%)
74 % 54 % 32 % 20 %
76 % 59 % 35 % 21 %
2.60*10^11
2.40*10^11 (-7.7%)
74 % 65 % 42 % 28 % 2.95*10^11
72 % 59 % 33 % 20 % 2.98*10^11 (+1.01%)
N/A N/A N/A N/A
N/A N/A N/A N/A
Table 7:Prediction calculations, PRESS values and prediction error in % for 2010
Model prediction for 2010 Møre og Romsdal (PRESS value) Prediction error less than.. 30 % 20 % 10 % 5%
MLR
SAR
2.24*10^12
2.28*10^12 (+1.8%)
59 % 36 % 15 % 8%
56 % 33 % 14 % 7%
Nilsen, Arnesen, Andersen, Tørset
Rogaland (PRESS value) Prediction error less than.. 30 % 20 % 10 % 5% Fixed link projects Eiksundsambandet (PRESS value) Prediction error less than.. 30 % 20 % 10 % 5% Finnfast (PRESS value) Prediction error less than.. 30 % 20 % 10 % 5%
20
1.19*10^12
1.23*10^12 (+3.3%)
68 % 49 % 25 % 14 %
67 % 49 % 26 % 14 %
5.93*10^11
6.42*10^11 (+7.6%)
55 % 29 % 9% 4% 1.64*10^12
51 % 20 % 5% 3% 1.88*10^12 (+12.8%)
N/A N/A N/A N/A
N/A N/A N/A N/A
1 2
The PRESS values on the 2007 data set show better model performance for the SAR model
3
than the MLR for both counties. For the county of Møre og Romsdal the model shows a 3.1 %
4
increase in model fit, while it was 7.5 % for Rogaland. For the two areas connected by a fixed
5
link, the SAR models seem to outperform the MLR for Eiksundsambandet, while the results
6
are similar for Finnfast. However, only 17 observations were available at Finnfast.
7
Interestingly, the MLR seems to outperform the SAR model when it comes to model
8
prediction in 2010. For the county of Møre og Romsdal the values of PRESS show 1.8 %
9
reduction in predictive performance for SAR compared to MLR. This seems also to hold for
10
Rogaland where the SAR models have a reduction of 3.3 % in predicative performance
11
compared to the MLR. The same trend can be seen when isolating the two fixed link projects.
12
Based on the 175 house sales connected to the main land by Eiksundsambandet the predictive
13
performance is reduced with 7.6 % in the SAR model. For Finnfast the predictive performance
14
was 12.8 % lower for SAR than MLR. Even though it is not uncommon for simpler models to
15
outperform more advanced models in terms of prediction it, is a bit surprising, especially since
16
the model fit is better with the more advanced SAR model. In this case, explaining the data and
17
to model the spatial dependence is best done by using SAR, while for prediction purposes the
Nilsen, Arnesen, Andersen, Tørset
1
21
simpler MLR model framework cannot be shown to be less useful than SAR.
2
Regarding their ability to estimate the housing prices on the islands before and after
3
the ferry replacement project was built, the models seem to perform better in a situation before
4
the fixed link was built. This seems to hold for the ferry replacement projects in both Møre og
5
Romsdal and Rogaland.
6 7 8
5. DISCUSSION
9
Increased housing prices after a transport investment is usually a sign of increased
10
attractiveness for residential location and an indication of possible population growth in the
11
future. Numerous studies have investigated the relationship between changes in accessibility
12
and housing prices by developing different kinds of hedonic regression models. However, few
13
of these studies have tested the models developed to predict future housing prices after a
14
transport investment is made in a more rural setting. This paper sheds light on how spatially
15
dependent and non-dependent regression models preform regarding both fitting data and
16
predicting housing sales after changes in the transport network is made. Interestingly the spatial
17
regression model preforms the best for model fitting, while for prediction we cannot say that
18
the spatial regression model outperforms the standard linear regression model. Similar results
19
were found in a study by Bourassa et al. (2007) of real-estate transactions in New Zeeland
20
where the MLR outperforms the SAR models in model prediction. However, there are also
21
studies showing that models that deal with spatial heterogeneity outperform MLR in terms of
22
prediction (Füss & Koller, 2016). Thus, further research is needed to give any clear
23
recommendations regarding model selection.
24
The results indicate that both internal and external characteristics are important for the
25
price of a house also in a more rural setting. The price seems to increase with the size of the
Nilsen, Arnesen, Andersen, Tørset
22
1
house and accessibility, and decrease with housing age and distance to the main regional centre.
2
This seems to hold across both counties and two of the three models investigated. These
3
findings are in line with earlier studies done within the field (Ahlfeldt, 2013; L Osland &
4
Thorsen, 2013). The results indicate that a reduction in the distance to the regional centre
5
through a transport investment may be associated with an increase in housing prices. Distances
6
to regional centres seem to be more important than subcentres or smaller towns in determining
7
the selling price of a house. To further improve the models, other spatial characteristics might
8
be included such as distances to schools, a dummy for rural lot size, and crime rate (see (Chung,
9
2015; Ho & Hensher, 2014; L Osland & Thorsen, 2013)). However, most areas in Norway have
10
good access to schools and low crime rates, which sugges that these variables might not be of
11
major importance in improving the model in a Norwegian context.
12
The hedonic regression models tested in this paper show the same tendencies as for
13
models that are made for urban, homogenous areas. However, the models developed seem to
14
have a lower predictive power than models that are made for more homogenous areas. As the
15
areas studied in this paper consist of several different labour and housing markets often
16
connected by ferry services it is not surprising. Ferry services are often bottlenecks in the
17
transport network and the inconvenience of being dependent on a ferry is often substantial
18
(Bråthen & Hervik, 1997). Often beyond the waiting time measure used in the calculations of
19
the accessibility and generalized travel cost in this article. Thus, further research is needed
20
regarding the size of the inconvenience cost to obtain better estimates for the impact of ferry
21
replacement projects on housing prices.
22
It is important to this field of research to focus on developing models useful for
23
prediction. In that respect, applying other types of modelling framework could be crucial to
24
obtain good results. Gaussian Markov random fields (GMRF) (see (Rue & Held, 2005)) have
25
been developed to represent spatial dependence and it would be interesting to test that approach
26
within this field. Moreover, as the data set becomes larger, algorithms from the field of artificial
Nilsen, Arnesen, Andersen, Tørset
1
23
intelligent (AI) could bring value to such analysis in the future.
2
The results from this investigation show that changes in housing prices may be
3
expected from transport investments also in more rural areas, especially, for projects that
4
substantially reduce the generalized travel cost to regional centres and increase accessibility.
5
This could lead to increased attractiveness for residential location and a population growth.
6
Population prognosis is an essential part of transport analysis and the calculations of transport
7
demand. Ignoring potential population growth from these transport investments could lead to
8
biased traffic calculations, which in turn could affect the decisions regarding road standard and
9
the cost benefit ratio of the projects proposed. In order to obtain indications of the degree to
10
which it affects the population growth and pattern, the housing market model may be included
11
as part of the residential relocation models often used within a Land Use Transport Interaction
12
(LUTI) model framework (Cordera, Ibeas, Dell’Olio, & Alonso, 2018). However, this is
13
outside the scope of this article and left for future research.
14 15
6.
CONCLUSION
16
Many large infrastructure projects are often proposed on the very idea that they reduce
17
distances between settlements and widen the labour market making the areas more attractive
18
for firms and residential location. During the last 30 years, a substantial number of ferry
19
replacement projects have been built connecting people to the main land in Norway.
20
A first sign of increased attractiveness for residential location can usually be seen in
21
increased housing prices. Thus, developing tools that estimate to what degree transport
22
investments increase the housing prices is important to get indications of whether a growth in
23
population might be expected or not. By developing three different hedonic regression models
24
at a county level, this article investigates the ability these models have to predict housing prices
25
after a transport investment is made.
Nilsen, Arnesen, Andersen, Tørset
24
1
Our results provide valuable insights into how changes in accessibility in a more rural
2
setting affect housing prices. The results from the models developed, suggest that internal
3
housing and external characteristics do matter in terms of housing prices. The distance to the
4
main regional centre seems to be more important than the distance to smaller towns and
5
subcentres. Thus, areas that experience a major change in accessibility are likely to get an
6
increase in housing prices and attractiveness for residential location.
7
The results from the study show that spatially dependent models seem to outperform
8
traditional non-spatial regression models regarding data fitting. However, this is not found
9
regarding the model’s prediction abilities. The overall prediction abilities seem to be somewhat
10
lower than for similar models developed for more homogenous areas, even though, the same
11
variables seem to be of importance.
12 13
Acknowledgements
14
This research was supported by the Norwegian University of Science and Technology, the
15
Norwegian Public Roads Administration, Rambøll Norway and the Research Council of
16
Norway.
17
18
7. REFERENCES
19
Acheampong, R. A., & Silva, E. A. (2015). Land use – transport interaction modeling : A review
20
of the literature and future research directions. Journal of Transport and Land Use, 8(3),
21
1–28. http://doi.org/http://dx.doi.org/10.5198/jtlu.2015.806
22
Ahlfeldt, G. M. (2013). If we build it, will they pay? predicting property price effects of
23
transport
24
http://doi.org/10.1068/a45429
25
innovations.
Environment
and
Planning
A,
45(8),
1977–1994.
Andersson, D. E., Shyr, O. F., & Fu, J. (2010). Does high-speed rail accessibility influence
Nilsen, Arnesen, Andersen, Tørset
25
1
residential property prices? Hedonic estimates from southern Taiwan. Journal of
2
Transport Geography, 18(1), 166–174. http://doi.org/10.1016/j.jtrangeo.2008.10.012
3
Bourassa, S. C., Cantoni, E., & Hoesli, M. (2007). Spatial dependence, housing submarkets,
4
and house price prediction. Journal of Real Estate Finance and Economics, 35(2), 143–
5
160. http://doi.org/10.1007/s11146-007-9036-8
6 7 8 9
Bråthen, S., & Hervik, A. (1997). Strait crossings and economic development. Transport Policy, 4(4), 193–200. http://doi.org/10.1016/S0967-070X(97)00017-6 Cascetta, E. (2001). Transportation Systems Engineering: Theory and Methods (Vol. 49). http://doi.org/10.1007/978-1-4757-6873-2
10
Chung, I. H. (2015). School choice, housing prices, and residential sorting: Empirical evidence
11
from inter-and intra-district choice. Regional Science and Urban Economics, 52, 39–49.
12
http://doi.org/10.1016/j.regsciurbeco.2015.01.004
13
Coppola, P., Ibeas, Á., Dell’Olio, L., & Cordera, R. (2013). LUTI Model for the Metropolitan
14
Area of Santander. Journal of Urban Planning and Development, 139(3), 153–165.
15
http://doi.org/10.1061/(ASCE)UP.1943-5444.0000146
16
Coppola, P., & Nuzzolo, A. (2011). Changing accessibility, dwelling price and the spatial
17
distribution of socio-economic activities. Research in Transportation Economics, 31(1),
18
63–71. http://doi.org/10.1016/j.retrec.2010.11.009
19 20
Cordera, R., Ibeas, Á., Dell’Olio, L., & Alonso, B. (2018). Land Use transport interaction models. CRC Press Taylor & Francis Group.
21
Efthymiou, D., & Antoniou, C. (2013). How do transport infrastructure and policies affect
22
house prices and rents? Evidence from Athens, Greece. Transportation Research Part A:
23
Policy and Practice, 52, 1–22. http://doi.org/10.1016/j.tra.2013.04.002
24 25 26
Farideh, R., Flügel, S., Samstad, H., & Killi, M. (2010). Value of time, safety and environment in passenger transport – Time. Oslo. Füss, R., & Koller, J. A. (2016). The role of spatial and temporal structure for residential rent
Nilsen, Arnesen, Andersen, Tørset
26
1
predictions.
International
Journal
of
2
http://doi.org/10.1016/j.ijforecast.2016.06.001
Forecasting,
32(4),
1352–1368.
3
Gadziski, J., & Radzimski, A. (2015). The first rapid tram line in Poland: How has it affected
4
travel behaviours, housing choices and satisfaction, and apartment prices? Journal of
5
Transport Geography, 54, 451–463. http://doi.org/10.1016/j.jtrangeo.2015.11.001
6
Geng, B., Bao, H., & Liang, Y. (2015). A study of the effect of a high-speed rail station on
7
spatial variations in housing price based on the hedonic model. Habitat International, 49,
8
333–339. http://doi.org/10.1016/j.habitatint.2015.06.005
9 10 11
Hansen, W. G. (1959). How Accessibility Shapes Land Use. Journal of the American Institute of Planners, 25(2), 73–76. http://doi.org/10.1080/01944365908978307 Ho, C., & Hensher, D. (2014). Housing prices and price endogeneity in tenure and dwelling
12
type
choice
models.
Case
Studies
13
http://doi.org/10.1016/j.cstp.2014.08.001
on
Transport
Policy,
2(3),
107–115.
14
Ibeas, Á., Cordera, R., Dell’Olio, L., & Coppola, P. (2013). Modelling the spatial interactions
15
between workplace and residential location. Transportation Research Part A: Policy and
16
Practice, 49, 110–122. http://doi.org/10.1016/j.tra.2013.01.008
17
Ibeas, A., Cordera, R., Dell’Olio, L., Coppola, P., & Dominguez, A. (2012). Modelling
18
transport and real-estate values interactions in urban systems. Journal of Transport
19
Geography, 24, 370–382. http://doi.org/10.1016/j.jtrangeo.2012.04.012
20 21
LeSage, J. P., & Pace, K. (2009). An Introduction to Spatial Econometrics. Revue d’économie industrielle (Vol. 123). http://doi.org/10.4000/rei.3887
22
Louw, E., Leijten, M., & Meijers, E. (2013). Changes subsequent to infrastructure investments:
23
Forecasts, expectations and ex-post situation. Transport Policy, 29, 107–117.
24
http://doi.org/10.1016/j.tranpol.2013.04.012
25
Melo, P. C., Graham, D. J., & Brage-Ardao, R. (2013). The productivity of transport
26
infrastructure investment: A meta-analysis of empirical evidence. Regional Science and
Nilsen, Arnesen, Andersen, Tørset
1
27
Urban Economics, 43(5), 695–706. http://doi.org/10.1016/j.regsciurbeco.2013.05.002
2
Mulley, C., Ma, L., Clifton, G., Yen, B., & Burke, M. (2016). Residential property value
3
impacts of proximity to transport infrastructure: An investigation of bus rapid transit and
4
heavy rail networks in Brisbane, Australia. Journal of Transport Geography, 54, 41–52.
5
http://doi.org/10.1016/j.jtrangeo.2016.05.010
6
Nilsen, Ø. L., Díez Gutiérrez, M., Andersen, S. N., & Tørset, T. (2017). Do fixed links affect
7
the local labour market and industries? A case study of three fixed link projects in Norway.
8
Case
9
http://doi.org/10.1016/j.cstp.2017.01.006
10 11
Studies
on
Transport
Policy,
5(2),
233–244.
Nuzzolo, A., & Coppola, P. (2007). Accessibility and socioeconomic activities location. In European Transport Conference (pp. 1–23).
12
Ortúzar, & Willumsen, L. G. (2011). Modelling transport (4th ed.). John Wiley & Sons, Ltd.
13
Osland, L. (2010). An Application of Spatial Econometrics in Relation to Hedonic House Price
14
Modeling.
15
http://doi.org/10.5555/rees.32.3.d4713v80614728x1
16
Journal
of
Real
Estate
32(3),
289–320.
Osland, L., & Thorsen, I. (2013). Spatial Impacts, Local Labour Market Characteristics and
17
Housing
18
http://doi.org/10.1177/0042098012474699
19
Research,
Prices.
Urban
Studies,
50(42),
2063–2083.
Rosen, S. (1974). Hedonic Prices and Implicit Markets : Product Differentiation in Pure
20
Competition.
21
http://www.jstor.org/stable/1830899 Accessed : 30-03-2016 04 : 57 UTC Your use of th
22 23
The
University
of
Chicago
Press.
Retrieved
from
Rue, H., & Held, L. (2005). Gaussian Markov random fields: theory and application. CRC Press Taylor & Francis Group.
24
Tveter, E., Welde, M., & Odeck, J. (2017). Do Fixed Links Affect Settlement Patterns: A
25
Synthetic Control Approach. Research in Transportation Economics, 63, 59–72.
26
http://doi.org/10.1016/j.retrec.2017.07.002
Nilsen, Arnesen, Andersen, Tørset
28
2
Tørset, T., Malmin, O. K., Bang, B., & Bertelsen, D. (2013). CUBE - Regional
3
Tørset, T., Nilsen, Ø., & Spilsberg, E. (2013). Can we estimate the impacts of fixed links using
4
current transport models? Experiences from concept evaluations of fixed links on E39. In
5
Strait Crossings 2013. 16-19 June. Statens Vegvesen (pp. 206–223). Bergen: Strait
6
Crossings 2013. 16-19 June. Statens Vegvesen.
1
7
persontransportmodell versjon 3.
Vickerman, R. (2016). Beyond cost-benefit analysis; the search for a comprehensive evaluation
8
of
transport
investment.
Research
9
http://doi.org/10.1016/j.retrec.2017.04.003
in
Transportation
Economics,
1–8.
10
Waddell, P. (2011). Integrated Land Use and Transportation Planning and Modelling:
11
Addressing Challenges in Research and Practice. Transport Reviews, 31(2), 209–229.
12
http://doi.org/10.1080/01441647.2010.525671
13
Waddell, P., Ulfarsson, G. F., Franklin, J. P., & Lobb, J. (2007). Incorporating land use in
14
metropolitan transportation planning. Transportation Research Part A: Policy and
15
Practice, 41(5), 382–410. http://doi.org/10.1016/j.tra.2006.09.008
16
Wegener, M. (2004). Overview of land-use transport models. In D. A. Hensher & K. Button
17
(Eds.), Handbook of transport geography and spatial systems (Vol. Handbook 5, pp. 127–
18
146). Pergamon/Elsevier Science. http://doi.org/10.1007/s10654-011-9614-1
19