In particular, model-based methods such as Classical. Seasonal Decomposition (CSD), Holt-Winters (HW) exponential smoothing and Autoregressive Integrated ...
Comparison of trend extraction methods for calculating performance loss rates of different photovoltaic technologies Alexander Phinikarides, George Makrides, Nitsa Kindyni, George E. Georghiou FOSS Research Centre for Sustainable Energy, Photovoltaic Technology Laboratory, Department of Electrical and Computer Engineering, University of Cyprus 75 Kallipoleos Avenue, P.O. Box 20537, Nicosia, 1678, Cyprus Abstract — In this work, the performance loss rates of eleven grid-connected photovoltaic (PV) systems of different technologies were evaluated by applying linear regression (LR) and trend extraction methods to Performance Ratio, RP, time series. In particular, model-based methods such as Classical Seasonal Decomposition (CSD), Holt-Winters (HW) exponential smoothing and Autoregressive Integrated Moving Average (ARIMA), as well as non-parametric filtering methods such as LOcally wEighted Scatterplot Smoothing (LOESS) were used to extract the trend from monthly RP time series of the first five years of operation of each PV system. The results showed that applying LR on the time series produced the lowest performance loss rates for most systems, but with significant autocorrelations in the residuals, signifying statistical inaccuracy. The application of CSD and HW significantly reduced the residual autocorrelations as the seasonal component was extracted from the time series, resulting in comparable results for eight out of eleven PV systems, with a mean absolute percentage error (MAPE) of 6.22 % between the performance loss rates calculated from each method. Finally, the optimal use of multiplicative ARIMA resulted in Gaussian white noise (GWN) residuals and the most accurate statistical model of the RP time series. ARIMA produced higher performance loss rates than LR for all technologies, except the amorphous Silicon (a-Si) system. The LOESS non-parametric method produced directly comparable results to multiplicative ARIMA, with a MAPE of -2.04 % between the performance loss rates calculated from each method, whereas LR, CSD and HW showed higher deviation from ARIMA, with MAPE of 25.14 %, -13.71 % and -6.39 %, respectively. Index Terms — degradation, photovoltaic systems, performance, statistical methods.
I. INTRODUCTION Various outdoor performance studies of grid-connected photovoltaic (PV) systems demonstrated that both crystalline silicon (c-Si) and thin-film PV technologies exhibit strong seasonality and performance variations [1]–[3]. More specifically, for thin-film technologies the performance variations are further correlated with a tendency for performance loss with increasing outdoor exposure, with the Staebler-Wronski effect [4] and thermal annealing [5] which apply strictly to amorphous Silicon (a-Si) [6], and with the transient response of Cadmium Telluride (CdTe) technologies in regards to light soaking [7]. Although performance loss is usually directly observable from the monthly performance ratio, RP, time series, as a downward trend, further insight into the performance loss rate
of each PV technology can be obtained by extracting the trend using statistical methods [8]. Model-based methods such as Linear Regression (LR), Classical Seasonal Decomposition (CSD) [9], [10], Holt-Winters (HW) exponential smoothing [11], [12] and Autoregressive Integrated Moving Average (ARIMA) [13] assume the specification of a stochastic time series model. The non-parametric filtering methods, such as LOcally wEighted Scatterplot Smoothing (LOESS) [14], [15] do not require specification of a model and are popular because of their simplicity. In the past, CSD, LOESS and multiplicative ARIMA have been successfully used to forecast the RP of grid-connected PV systems of different technologies [16] and LR, CSD and LOESS were used to calculate their performance loss rates [17], [18]. Due to the fact that the LR and CSD methods fit a fixed model, particular characteristics of each time series are therefore not captured and this results in significant autocorrelations in the model residuals. Another model-based method is HW, in which triple exponential smoothing is applied to the time series. Triple exponential smoothing takes into account seasonal changes, as well as trends, through the minimization of the squared one-step ahead prediction error, in contrast to CSD, which bases the calculation of trend, seasonal component and residuals on a centred moving average. Another advanced model-based method used in this evaluation was multiplicative ARIMA, as the method further allowed the calculation of the optimal model orders for each time series. The ARIMA method is more flexible than classical methods since it can effectively deal with seasonal variations, random errors, outliers and level shifts and can therefore be used to specify a model which removes all autocorrelations in the model residuals. Previous investigations showed that multiplicative ARIMA could be used to model monthly PVUSA metrics [19], to calculate degradation on even a relatively short time series, reduce the impact of outliers and other errors on the degradation rate [20] and offer an improvement in forecasting the RP of c-Si technologies over classical methods [16]. Due to the complex nature of ARIMA, it is entirely implemented in software [21], [22]. One of the most widely used software packages, X-13ARIMA [23], developed by the U.S. Census Bureau, was used to apply the parametric X-11 ARIMA method [24], [25] on
the time series and to automatically detect additive outliers and level shifts in the data. In this work, the performance loss rates from the first five years of operation of different technology grid-connected PV systems installed in Nicosia, Cyprus were evaluated using different methods of trend extraction applied on monthly DC RP time series for each system. The trend of the RP time series of each PV technology was extracted using LR, CSD, HW, LOESS and ARIMA and the resulting performance loss rates were calculated from the gradient of the trend. It is important to note that the linear, monthly performance loss rate at the DC side includes the modelling error of each technique, PV array loss factors such as module degradation, soiling, shading, mismatch and initial power stabilization. Additionally, since PV module production is continuously improved, the performance loss rates obtained from this study might be affected by the specific model and batch of modules. II. METHODOLOGY A. Outdoor PV system testing High-resolution measurements from the first five years of operation of different technology, 1 kWp, grid-connected PV systems were used to assess the performance loss rate of each technology. The PV systems were installed in June 2006 at the Photovoltaic Technology test site of the University of Cyprus, in Nicosia, Cyprus and are listed in Table I. Operational and meteorological measurements were recorded every second and averaged every fifteen minutes [26]. The PV tehcnologies range from mono-c-Si, multi-c-Si, Heterojunction with Intrinsic Thin Layer (HIT), Edge defined Film-fed Growth (EFG), Multi-crystalline Advanced Industrial cells (MAIN) to a-Si, Cadmium Telluride (CdTe) and Copper Indium Gallium Diselenide (CIGS).
Fifteen minute average measurements of the DC power at the maximum power point (MPP), PMPP, were used in conjunction with irradiation measurements from a calibrated Kipp & Zonen CM21 pyranometer, mounted on the plane-ofarray (POA), in order to construct monthly RP ratings for the period June 2006 – June 2011. The RP is defined as the ratio of the final energy yield of the PV system, Yf, and the reference yield, Yr [27]:
RP =
Yf
Yr
(1)
where Yf is defined as the monthly yield of the PV array normalized by the maximum power, Pmax of the PV array, provided by the manufacturer and Yr is calculated by dividing the total monthly POA irradiation by the reference irradiance of 1000 W/m2. The RP time series for all PV systems, over the period June 2006 – June 2011 is demonstrated in Fig. 1. The seasonal pattern of all technologies is evident, with differing amplitudes for each technology. Especially the First Solar CdTe and MHI a-Si systems show a weaker seasonal component than the c-Si and CIGS systems, mainly due to the lower temperature coefficients compared to c-Si technologies [28].
TABLE I DESCRIPTION OF PV SYSTEMS UNDER STUDY Manufacturer
Model
Technology
Solon Sanyo Atersa SolarWorld
P220/6+ HIP-205NHE1 A-170M 24V SW165
Sunpower
STM 200 FW
multi-c-Si mono-c-Si (HIT) mono-c-Si multi-c-Si mono-c-Si (back-contact) multi-c-Si (EFG) mono-c-Si (Saturn) multi-c-Si (MAIN) CIGS CdTe a-Si
Schott Solar BP Solar Schott Solar Würth Solar First Solar MHI
ASE-260-DG-FT BP7185S ASE-165-GTFT/MC WS 11007/75 FS60 MA100T2
PMPP (kWp) 1.540 1.025 1.020 0.990 1.000 1.000 1.110 1.020 0.900 1.080 1.000
Fig. 1. Monthly Performance Ratio, RP, of each PV system, over the period June 2006 – June 2011.
By applying temperature correction using the temperature coefficients, specified in the manufacturer datasheets [29], the seasonal component was reduced for all PV technologies, except the a-Si. Fig. 2 shows the monthly RP for each PV
system, using temperature corrected fifteen minute average measurements of DC PMPP and back sheet module temperature, Tmod. Evidently, there were still seasonal variations on the RP, even after normalization of the two major seasonal factors: a) the irradiation, via the RP and, b) the module temperature via temperature correction. This fact signifies the need for trend extraction, even after performing irradiance and temperature corrections, to eliminate the impact of the seasonal variations on the performance loss rate of each PV technology.
relevant seasonal component, S is the seasonal period, and are given in (5), (6) and (7), respectively.
yˆ n +l n = mn + bn + c n − S +l
l = 1,2,K
(4)
mt = α 0 ( y t − c t − S ) + (1 − α 0 )( mt −l + bt −l ) (5) bt = α 1 ( mt − mt −l ) + (1 − α 1 )bt −l ct = α 2
yt + (1 − α 2 )ct − S mt
(6) (7)
with α0, α1 and α2 lying between 0 and 1. The general model for multiplicative ARIMA is given in (8) and is abbreviated as ARIMA(p,d,q)(P,D,Q), where p is the AR (auto-regressive) order, d is the differencing order, q is the MA (moving average) order, P is the seasonal AR order, D is the seasonal differencing order and Q is the seasonal MA order.
Φ(T )Φ s (T s )∇ d ∇ sD y t = θ (T )θ s (T s )et
Fig. 2. Temperature corrected monthly Performance Ratio, RP, of each PV system, over the period June 2006 – June 2011.
B. Model-based trend extraction methods Model-based methods require the definition of a model that fits the data. In the following models, yˆ represents the fitted values. LR requires the linear model in (2), where α and β are the regression coefficients for the slope and intercept respectively.
yˆ = α ∗ t + β
(2)
CSD requires the additive model in (3), where Tt is the trend, St is the seasonal component and et are the residuals.
yˆ = Tt + S t + et
(3)
HW requires the general additive model in (4), where mn is the level component, bn is the slope component and cn-S+l is the
(8)
In order to find the optimal ARIMA model, the time series was initially checked for stationarity and then transformed using differencing to achieve stationarity, as necessary. The lags p, q, P, Q of the model were determined from the autocorrelation function (ACF) and the partial autocorrelation function (PACF) [16]. The model selection procedure yielded multiple models that fit the data well. The optimum model is the one with the lowest order (i.e. parsimonious), with the lowest mean-square-error (MSE) and the minimum value of the corrected Akaike information criterion (AICc). In order to validate the goodness of fit, the residuals were checked for Gaussian white noise (GWN) properties (i.e. un-correlated, normally distributed). The optimal multiplicative ARIMA models for the five year RP time series of all systems are presented in Table II. TABLE II OPTIMAL MULTIPLICATIVE ARIMA MODELS ARIMA model PV system (p,d,q)(P,D,Q) Solon (P220/6+) (0 0 0)(0 1 1) Sanyo (HIP-205NHE1) (0 1 1)(1 1 0) Atersa (A-170M 24V) (1 0 1)(0 1 1) SolarWorld (SW165) (0 1 1)(0 1 2) Sunpower (STM 200 FW) (1 0 1)(0 1 1) Schott Solar (ASE-260-DG-FT) (0 1 1)(1 1 0) BP Solar (BP7185S) (0 1 2)(0 1 1) Schott Solar (ASE-165-GT-FT/MC) (0 1 1)(0 1 2) Würth Solar (WS 11007/75) (0 1 1)(0 1 1) First Solar (FS60) (0 1 1)(0 1 1) MHI (MA100T2) (0 1 1)(0 1 1)
C. Non-parametric filtering methods Non-parametric filtering methods are different than modelbased methods in the sense that an explicit model is not specified. One such method is LOESS, which extracts the trend from locally weighted polynomial fitting. LOESS provides robust estimates of the trend and seasonal components that are not distorted by outliers and missing values [30]. For Rp time series this is especially important since the ratings are prone to distortions due to system outages, sensor faults and uncertainty, missing values and irregular time series. III. RESULTS In the case of LR, the annual performance loss rates were calculated by multiplying the slope coefficient by 12 (12 months in a year). In the case of CSD, HW, ARIMA and LOESS, the rates were calculated by first applying linear regression on the extracted trend and then multiplying the gradient again by 12 to obtain the annual performance loss rate, with positive values defining a gradual performance loss.
provide different performance loss rates, especially in comparison to the simple LR. The manufacturer’s stated annual performance loss limit of 0.8–1.0 %/year was surpassed by all thin-film systems, which exhibited performance loss rates around 2.3 %/year on average for CIGS and CdTe and 1.72 %/year on average for a-Si, whereas the cSi technologies exhibited lower performance loss rates, in the range of 0.25 to 1.37 %/year. In addition, the results showed that the application of LR on the time series produced the lowest performance loss rates for nine out of eleven systems but with significant autocorrelations in the residuals, detected from the residual ACF and PACF plots, signifying model inadequacy. The application of CSD and HW significantly improved the residual autocorrelations as a seasonal component was extracted from the time series, resulting in comparable results between these two methods for eight out of eleven PV technologies, with a mean absolute percentage error (MAPE) of 6.22 % between the performance loss rates calculated from each method. The application of HW also improved the variation of the rates for the Solon system, but performed worse for the BP Solar system, which was partially shaded for short periods of time due to foliage, during the evaluation. Unlike LR, all other trend extraction methods produced very similar performance loss rates between them for each PV system, except for the BP Solar and Solon systems. This, along with the more accurate modelling results, in comparison to LR, validates the statistical significance and superiority of trend extraction approaches for calculating performance loss rates from the RP time series of outdoor exposed, gridconnected PV systems. Finally, the optimal use of multiplicative ARIMA resulted in Gaussian white noise (GWN) residuals and the most accurate statistical model of the RP time series. ARIMA produced higher performance loss rates than LR for all technologies, except the a-Si system. The LOESS nonparametric method produced directly comparable results to multiplicative ARIMA, with a MAPE of -2.04 % between the performance loss rates calculated from each method, whereas LR, CSD and HW showed higher deviation from ARIMA, with MAPE of 25.14 %, -13.71 % and -6.39 %, respectively. IV. CONCLUSIONS
Fig. 3. Performance loss rates for each PV system using different trend extraction methods, over the period June 2006 – June 2011.
Fig. 3 shows the distribution of calculated performance loss rates for all PV systems using the various methods. The results clearly demonstrate that different trend extraction methods
In this work, trend extraction methods were presented in the context of calculating performance loss rates from monthly RP time series of grid-connected PV systems of different technologies. In contrast to the commonly used LR, trend extraction methods were able to separate the time series into their trend, seasonal and residual components. The complexity of each trend extraction method was directly correlated to its modelling accuracy and subsequently to its robustness in calculating statistically accurate performance loss rates. The exception to this rule was offered by the non-parametric
LOESS method, which, despite its easier usage, has shown nearly identical results to the more advanced ARIMA method. Finally, although LR produced the lowest performance loss rates for nine out of eleven PV systems in this study, its application was very inaccurate since there was a prevalent seasonal component in the RP time series, which could not be modelled with a linear fit. It has also been shown that temperature correction of the DC PMPP based on the manufacturer’s temperature coefficients and measurements of module temperature and then normalization with respect to irradiance could not remove the seasonal component in its entirety. Statistically accurate performance loss rates were then calculated using multiplicative ARIMA and the comparison between ARIMA and LOESS has shown that LOESS was also able to calculate statistically accurate results, despite the methods’ differing nature. ACKNOWLEDGEMENTS This work has been co-financed by the European Regional Development Fund and by the Republic of Cyprus in the framework of the project ‘Reliable Assessment of Degradation on new thin-film photovoltaic technologies’ with grant number ΤΕΧΝΟΛΟΓΙΑ/ΕΝΕΡΓ/0311(ΒΙΕ)/12. REFERENCES [1] G. Makrides, B. Zinsser, G. E. Georghiou, M. Schubert, and J. H. Werner, “Outdoor efficiency of different photovoltaic systems installed in Cyprus and Germany,” in 33rd IEEE PVSC, 2008, pp. 1–6. [2] A. J. Carr and T. L. Pryor, “A comparison of the performance of different PV module types in temperate climates,” Sol. Energy, vol. 76, no. 1–3, pp. 285–294, Jan. 2004. [3] T. A. Huld, M. Suri, R. P. Kenny, and E. D. Dunlop, “Estimating PV performance over large geographical regions,” in 31st IEEE PVSC, 2005, pp. 1679–1682. [4] D. L. Staebler and C. R. Wronski, “Optically induced conductivity changes in discharge-produced hydrogenated amorphous silicon,” J. Appl. Phys., vol. 51, no. 6, p. 3262, 1980. [5] D. Wagner and P. Irsigler, “On the annealing behaviour of the Staebler-Wronski effect in a-Si:H,” Appl. Phys. A Solids Surfaces, vol. 35, no. 1, pp. 9–12, Sep. 1984. [6] G. Makrides, B. Zinsser, A. Phinikarides, M. Schubert, and G. E. Georghiou, “Temperature and thermal annealing effects on different photovoltaic technologies,” Renew. Energy, vol. 43, pp. 407–417, Jul. 2012. [7] C. Deline, J. Del Cueto, D. Albin, C. Petersen, and G. Tamizhmani, “Transient Response of Cadmium Telluride Modules to Light Exposure,” in 37th IEEE PVSC, 2011. [8] T. Alexandrov, S. Bianconcini, E. B. Dagum, P. Maass, and T. S. McElroy, “A Review of Some Modern Approaches to the Problem of Trend Extraction,” Econom. Rev., vol. 31, no. 6, pp. 593–624, Nov. 2012. [9] S. G. Makridakis, S. Wheelwright, and R. J. Hyndman, Forecasting: methods and applications. Wiley, 1998. [10] R. J. Hyndman and G. Athanasopoulos, Forecasting: principles and practice. OTexts, 2012.
[11] C. C. Holt, “Forecasting seasonals and trends by exponentially weighted moving averages,” Int. J. Forecast., vol. 20, no. 1, pp. 5–10, Jan. 2004. [12] P. R. Winters, “Forecasting Sales by Exponentially Weighted Moving Averages,” Manage. Sci., vol. 6, no. 3, pp. 324–342, Apr. 1960. [13] G. E. P. Box, G. M. Jenkins, and G. C. Reinsel, Time series analysis: forecasting and control. Prentice Hall, 1994. [14] W. S. Cleveland, “Robust locally weighted regression and smoothing scatterplots,” J. Am. Stat. Assoc., vol. 74, no. 368, pp. 829–836, 1979. [15] W. S. Cleveland and S. J. Devlin, “Locally weighted regression: an approach to regression analysis by local fitting,” J. Am. Stat. Assoc., vol. 83, no. 403, pp. 596–610, 1988. [16] A. Phinikarides, G. Makrides, N. Kindyni, A. Kyprianou, and G. E. Georghiou, “ARIMA modeling of the performance of different photovoltaic technologies,” in 39th IEEE PVSC, 2013. [17] A. Phinikarides, G. Makrides, and G. E. Georghiou, “Initial performance degradation of an a-Si/a-Si tandem PV array,” in 27th EU-PVSEC, 2012. [18] A. Phinikarides, G. Makrides, and G. E. Georghiou, “Comparison of analysis methods for the calculation of degradation rates of different photovoltaic technologies,” in 28th EU-PVSEC, 2013, pp. 3973–3976. [19] A. N. Dunea, D. N. Dunea, V. I. Moise, and M. F. Olariu, “Forecasting methods used for performance’s simulation and optimization of photovoltaic grids,” in IEEE Porto Power Tech Proceedings, 2001, p. 5. [20] D. C. Jordan and S. R. Kurtz, “Analytical improvements in PV degradation rate determination,” in 35th IEEE PVSC, 2010, pp. 2688–2693. [21] R. J. Hyndman and Y. Khandakar, “Automatic Time Series Forecasting: The forecast Package for R,” J. Stat. Softw., vol. 27, no. 3, pp. 1–22, 2008. [22] R. H. Shumway and D. S. Stoffer, Time Series Analysis and Its Applications: With R Examples, 3rd ed. Springer, 2011. [23] B. C. Monsell, “The X-13ARIMA-SEATS Seasonal Adjustment Program,” in Proceedings of the Federal Committee on Statistical Methodology Research Conference, 2007. [24] D. P. Lytras, R. M. Feldpausch, and W. R. Bell, “Determining Seasonality: A Comparison of Diagnostics From X-12ARIMA,” in Proceedings of the International Conference on Establishment Surveys III, 2007. [25] D. Ladiray and B. Quenneville, “Seasonal Adjustment With the X-11 Method,” Technometrics, vol. 43, no. 4, pp. 500–500, Nov. 2001. [26] G. Makrides, B. Zinsser, M. Schubert, and G. E. Georghiou, “Performance loss rate of twelve photovoltaic technologies under field conditions using statistical techniques,” Sol. Energy, vol. 103, pp. 28–42, May 2014. [27] IEC 61724:1998, Photovoltaic system performance monitoring Guidelines for measurement, data exchange and analysis, 1st ed. Geneva, Switzerland: IEC, 1998. [28] G. Makrides, B. Zinsser, G. E. Georghiou, M. Schubert, and J. H. Werner, “Temperature behaviour of different photovoltaic systems installed in Cyprus and Germany,” Sol. Energy Mater. Sol. Cells, vol. 93, no. 6–7, pp. 1095–1099, Jun. 2009. [29] IEC 61853:2010, Photovoltaic (PV) module performance testing and energy rating - Part 1: Irradiance and temperature performance measurements and power rating, 1st ed. Geneva, Switzerland: IEC, 2010. [30] R. Cleveland, W. Cleveland, J. McRae, and I. Terpenning, “STL: A seasonal-trend decomposition procedure based on Loess,” J. Off. Stat., vol. 6, no. 1, pp. 3–73, 1990.