• ### Combining predictive distributions for statistical post-processing of ensemble forecasts(1607.08096)

July 13, 2017 stat.ME, stat.AP
Statistical post-processing techniques are now widely used to correct systematic biases and errors in calibration of ensemble forecasts obtained from multiple runs of numerical weather prediction models. A standard approach is the ensemble model output statistics (EMOS) method, a distributional regression approach where the forecast distribution is given by a single parametric law with parameters depending on the ensemble members. Choosing an appropriate parametric family for the weather variable of interest is a critical but often non-trivial task, and has been the focus of much recent research. In this article, we assess the merits of combining predictive distributions from multiple EMOS models based on different parametric families. In four case studies with wind speed and precipitation forecasts from two ensemble prediction systems, we study whether state-of-the-art forecast combination methods are able to improve forecast skill.
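As a rough illustration of the EMOS idea described above (a single parametric law whose parameters depend on the ensemble members), the sketch below uses a Gaussian predictive distribution. The Gaussian family, the affine link functions and the coefficient names `a`, `b`, `c`, `d` are assumptions made for illustration only; the abstract itself concerns wind speed and precipitation, for which other families are used. The closed-form continuous ranked probability score (CRPS) of a normal distribution is included because it is a proper scoring rule commonly used to fit and compare such models.

```python
import math


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def normal_pdf(z):
    """Standard normal probability density function."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def emos_normal_params(ensemble, a, b, c, d):
    """Map an ensemble to (mu, sigma) of a Gaussian predictive law.

    Assumed (hypothetical) links: mu = a + b * ensemble mean,
    sigma^2 = c + d * ensemble variance.
    """
    m = len(ensemble)
    mean = sum(ensemble) / m
    var = sum((f - mean) ** 2 for f in ensemble) / (m - 1)
    return a + b * mean, math.sqrt(c + d * var)


def crps_normal(mu, sigma, y):
    """Closed-form CRPS of N(mu, sigma^2) for the observation y."""
    z = (y - mu) / sigma
    return sigma * (
        z * (2.0 * normal_cdf(z) - 1.0)
        + 2.0 * normal_pdf(z)
        - 1.0 / math.sqrt(math.pi)
    )


# Example: a three-member ensemble and made-up coefficients.
mu, sigma = emos_normal_params([1.0, 2.0, 3.0], a=0.0, b=1.0, c=0.0, d=1.0)
score = crps_normal(mu, sigma, y=2.5)
```

In practice the coefficients would be estimated by minimizing the mean score over a training period; that optimization step is omitted here.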
• ### D-optimal designs for complex Ornstein-Uhlenbeck processes(1704.05719)

April 19, 2017 math.ST, stat.TH
Complex Ornstein-Uhlenbeck (OU) processes have various applications in statistical modelling. They play a role, e.g., in the description of the motion of a charged test particle in a constant magnetic field or in the study of rotating waves in time-dependent reaction diffusion systems, whereas Kolmogorov used such a process to model the so-called Chandler wobble, a small deviation in the Earth's axis of rotation. In these applications parameter estimation and model fitting are based on discrete observations of the underlying stochastic process; however, the accuracy of the results strongly depends on the observation points. This paper studies the properties of D-optimal designs for estimating the parameters of a complex OU process with a trend. We show that, in contrast with the case of the classical real OU process, a D-optimal design exists not only for the trend parameter but also for joint estimation of the covariance parameters; moreover, these optimal designs are equidistant.
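To give a feel for why equidistant designs arise, the sketch below treats the simpler classical real OU process mentioned above, not the complex process studied in the paper. It assumes a constant trend, unit variance and a known covariance $\exp(-\lambda|s-t|)$; the function name `trend_information` and the parameter `rate` (for $\lambda$) are hypothetical. Under these assumptions the Fisher information on the trend parameter is $\mathbf{1}^\top C^{-1}\mathbf{1} = 1 + \sum_i \tanh(\lambda d_i/2)$, where $d_i$ are the gaps between consecutive design points, and since $\tanh$ is concave on the positive half-line, equal gaps maximize it for a fixed interval length.

```python
import math


def trend_information(points, rate=1.0):
    """Fisher information on a constant trend of a real OU process.

    Assumes unit variance and covariance exp(-rate * |s - t|), so that
    the information equals 1 + sum over gaps d of tanh(rate * d / 2).
    """
    pts = sorted(points)
    gaps = [t1 - t0 for t0, t1 in zip(pts, pts[1:])]
    return 1.0 + sum(math.tanh(rate * d / 2.0) for d in gaps)


# Two four-point designs on the interval [0, 1].
equidistant = [0.0, 1.0 / 3.0, 2.0 / 3.0, 1.0]
clustered = [0.0, 0.1, 0.2, 1.0]

info_equidistant = trend_information(equidistant)
info_clustered = trend_information(clustered)
```

Comparing `info_equidistant` with `info_clustered` shows the equidistant design carrying more information in this simplified setting.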
• ### K-optimal designs for parameters of shifted Ornstein-Uhlenbeck processes and sheets(1604.05489)

Oct. 18, 2016 math.ST, stat.TH
Continuous random processes and fields are regularly applied to model temporal or spatial phenomena in many different fields of science, and model fitting is usually done with the help of data obtained by observing the given process at various time points or spatial locations. In these practical applications sampling designs which are optimal in some sense are of great importance. We investigate the properties of the recently introduced K-optimal design for temporal and spatial linear regression models driven by Ornstein-Uhlenbeck processes and sheets, respectively, and highlight the differences compared with the classical D-optimal sampling. A simulation study displays the superiority of the K-optimal design for large parameter values of the driving random process.
• ### Censored and shifted gamma distribution based EMOS model for probabilistic quantitative precipitation forecasting(1512.04068)

April 1, 2016 stat.ME
Nowadays, all major weather prediction centres provide forecast ensembles of different weather quantities which are obtained from multiple runs of numerical weather prediction models with various initial conditions and model parametrizations. However, ensemble forecasts often show an underdispersive character and may also be biased, so that some post-processing is needed to account for these deficiencies. Probably the most popular modern post-processing techniques are the ensemble model output statistics (EMOS) and the Bayesian model averaging (BMA), which provide estimates of the density of the predictable weather quantity. In the present work an EMOS method for calibrating ensemble forecasts of precipitation accumulation is proposed, where the predictive distribution follows a censored and shifted gamma (CSG) law with parameters depending on the ensemble members. The CSG EMOS model is tested on ensemble forecasts of 24 h precipitation accumulation of the eight-member University of Washington mesoscale ensemble and on the 11-member ensemble produced by the operational Limited Area Model Ensemble Prediction System of the Hungarian Meteorological Service. The predictive performance of the new EMOS approach is compared with the fit of the raw ensemble, the generalized extreme value (GEV) distribution based EMOS model and the gamma BMA method. According to the results, the proposed CSG EMOS model slightly outperforms the GEV EMOS approach in terms of calibration of probabilistic and accuracy of point forecasts and shows significantly better predictive skill than the raw ensemble and the BMA model.
• ### Mixture EMOS model for calibrating ensemble forecasts of wind speed(1507.06517)

Dec. 11, 2015 stat.AP
Ensemble model output statistics (EMOS) is a statistical tool for post-processing forecast ensembles of weather variables obtained from multiple runs of numerical weather prediction models in order to produce calibrated predictive probability density functions (PDFs). The EMOS predictive PDF is given by a parametric distribution with parameters depending on the ensemble forecasts. We propose an EMOS model for calibrating wind speed forecasts based on weighted mixtures of truncated normal (TN) and log-normal (LN) distributions where model parameters and component weights are estimated by optimizing the values of proper scoring rules over a rolling training period. The new model is tested on wind speed forecasts of the 50-member European Centre for Medium-Range Weather Forecasts ensemble, the 11-member Aire Limitée Adaptation dynamique Développement International-Hungary Ensemble Prediction System ensemble of the Hungarian Meteorological Service and the eight-member University of Washington mesoscale ensemble, and its predictive performance is compared to that of various benchmark EMOS models based on single parametric families and combinations thereof. The results indicate improved calibration of probabilistic and accuracy of point forecasts in comparison with the raw ensemble and climatological forecasts. The mixture EMOS model significantly outperforms the TN and LN EMOS methods; moreover, it provides better calibrated forecasts than the TN-LN combination model and offers an increased flexibility while avoiding covariate selection problems.
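The weighted TN-LN mixture described above can be sketched as a density function. This is only a minimal illustration: the parameter names (`weight`, `mu`, `sigma`, `m`, `s`) are hypothetical, the truncation point is assumed to be zero, and the parameters are passed in directly, whereas in the model they depend on the ensemble forecasts and are estimated over a rolling training period.

```python
import math


def normal_pdf(z):
    """Standard normal probability density function."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def truncated_normal_pdf(x, mu, sigma):
    """Density of N(mu, sigma^2) truncated from below at zero."""
    if x < 0.0:
        return 0.0
    return normal_pdf((x - mu) / sigma) / (sigma * normal_cdf(mu / sigma))


def log_normal_pdf(x, m, s):
    """Density of a log-normal law; m and s are the mean and the
    standard deviation of log(X)."""
    if x <= 0.0:
        return 0.0
    return normal_pdf((math.log(x) - m) / s) / (x * s)


def mixture_pdf(x, weight, mu, sigma, m, s):
    """Weighted mixture: weight * TN + (1 - weight) * LN."""
    return (weight * truncated_normal_pdf(x, mu, sigma)
            + (1.0 - weight) * log_normal_pdf(x, m, s))


# Example: density at a wind speed of 4 (units and values made up).
density = mixture_pdf(4.0, weight=0.6, mu=5.0, sigma=2.0, m=1.5, s=0.4)
```

Both components place all their mass on non-negative values, so the mixture is a valid wind speed density for any weight between 0 and 1.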
• ### Similarity-based semi-local estimation of EMOS models(1509.03521)

Sept. 11, 2015 physics.ao-ph, stat.AP
Weather forecasts are typically given in the form of forecast ensembles obtained from multiple runs of numerical weather prediction models with varying initial conditions and physics parameterizations. Such ensemble predictions tend to be biased and underdispersive and thus require statistical postprocessing. In the ensemble model output statistics (EMOS) approach, a probabilistic forecast is given by a single parametric distribution with parameters depending on the ensemble members. This article proposes two semi-local methods for estimating the EMOS coefficients where the training data for a specific observation station are augmented with corresponding forecast cases from stations with similar characteristics. Similarities between stations are determined using either distance functions or clustering based on various features of the climatology, forecast errors, ensemble predictions and locations of the observation stations. In a case study on wind speed over Europe with forecasts from the Grand Limited Area Model Ensemble Prediction System, the proposed similarity-based semi-local models show significant improvement in predictive performance compared to standard regional and local estimation methods. They further allow for estimating complex models without numerical stability issues and are computationally more efficient than local parameter estimation.
• ### Bivariate ensemble model output statistics approach for joint forecasting of wind speed and temperature(1507.03479)

July 27, 2015 stat.AP
Forecast ensembles are typically employed to account for prediction uncertainties in numerical weather prediction models. However, ensembles often exhibit biases and dispersion errors, thus they require statistical post-processing to improve their predictive performance. Two popular univariate post-processing models are the Bayesian model averaging (BMA) and the ensemble model output statistics (EMOS). In the last few years increased interest has emerged in developing multivariate post-processing models that incorporate dependencies between weather quantities, for example a bivariate distribution for wind vectors or an even more general setting that allows any types of weather variables to be combined. In line with a recently proposed approach to model temperature and wind speed jointly by a bivariate BMA model, this paper introduces a bivariate EMOS model for these weather quantities based on a truncated normal distribution. The bivariate EMOS model is applied to temperature and wind speed forecasts of the eight-member University of Washington mesoscale ensemble and of the eleven-member ALADIN-HUNEPS ensemble of the Hungarian Meteorological Service, and its predictive performance is compared to the performance of the bivariate BMA model and a multivariate Gaussian copula approach, post-processing the margins with univariate EMOS. While the predictive skills of the compared methods are similar, the bivariate EMOS model requires considerably lower computation times than the bivariate BMA method.
• ### Optimal designs for the methane flux in troposphere(1404.1839)

June 1, 2015 math.ST, stat.TH
The understanding of methane emission and methane absorption plays a central role both in the atmosphere and on the surface of the Earth. Several important ecological processes, e.g., ebullition of methane and its natural microergodicity, call for better designs for observations in order to decrease variability in parameter estimation. Thus, a crucial task before the measurements are taken is to give an optimal design of the sites where observations should be collected in order to stabilize the variability of estimators. In this paper we introduce a realistic parametric model of covariance and provide theoretical and numerical results on optimal designs. For parameter estimation the D-optimality criterion is used, while for prediction the integrated mean square error and entropy criteria are used. We illustrate the applicability of the obtained benchmark designs for increasing/measuring the efficiency of the engineering designs for estimation of methane rate in various temperature ranges and under different correlation parameters. We show that in most situations these benchmark designs have higher efficiency.
• ### Log-normal distribution based EMOS models for probabilistic wind speed forecasting(1407.3252)

July 11, 2014 physics.ao-ph, stat.ME
Ensembles of forecasts are obtained from multiple runs of numerical weather forecasting models with different initial conditions and are typically employed to account for forecast uncertainties. However, biases and dispersion errors often occur in forecast ensembles: they are usually under-dispersive and uncalibrated and require statistical post-processing. We present an Ensemble Model Output Statistics (EMOS) method for calibration of wind speed forecasts based on the log-normal (LN) distribution, and we also show a regime-switching extension of the model which combines the previously studied truncated normal (TN) distribution with the LN. Both presented models are applied to wind speed forecasts of the eight-member University of Washington mesoscale ensemble, of the fifty-member ECMWF ensemble and of the eleven-member ALADIN-HUNEPS ensemble of the Hungarian Meteorological Service, and their predictive performances are compared to those of the TN and generalized extreme value (GEV) distribution based EMOS methods and to the TN-GEV mixture model. The results indicate improved calibration of probabilistic and accuracy of point forecasts in comparison to the raw ensemble and to climatological forecasts. Further, the TN-LN mixture model outperforms the traditional TN method and its predictive performance is able to keep up with the models utilizing the GEV distribution without assigning mass to negative values.
• ### Joint probabilistic forecasting of wind speed and temperature using Bayesian model averaging(1404.3681)

April 14, 2014 stat.ME
Ensembles of forecasts are typically employed to account for the forecast uncertainties inherent in predictions of future weather states. However, biases and dispersion errors often present in forecast ensembles require statistical post-processing. Univariate post-processing models such as Bayesian model averaging (BMA) have been successfully applied for various weather quantities. Nonetheless, BMA and many other standard post-processing procedures are designed for a single weather variable, thus ignoring possible dependencies among weather quantities. In line with recent research to develop multivariate post-processing procedures, e.g., BMA for bivariate wind vectors, or flexible procedures applicable for multiple weather quantities of different types, a bivariate BMA model for joint calibration of wind speed and temperature forecasts is proposed based on the bivariate truncated normal distribution. It extends the univariate truncated normal BMA model designed for post-processing ensemble forecasts of wind speed by adding a normally distributed temperature component with a covariance structure representing the dependency between the two weather quantities. The method is applied to wind speed and temperature forecasts of the eight-member University of Washington mesoscale ensemble and of the eleven-member ALADIN-HUNEPS ensemble of the Hungarian Meteorological Service, and its predictive performance is compared to that of the general Gaussian copula method. The results indicate improved calibration of probabilistic and accuracy of point forecasts in comparison to the raw ensemble, and the overall performance of this model is able to keep up with that of the Gaussian copula method.
• ### Comparison of BMA and EMOS statistical calibration methods for temperature and wind speed ensemble weather prediction(1312.3763)

Dec. 13, 2013 stat.AP
The evolution of the weather can be described by deterministic numerical weather forecasting models. Multiple runs of these models with different initial conditions and/or model physics result in forecast ensembles which are used for estimating the distribution of future atmospheric variables. However, these ensembles are usually under-dispersive and uncalibrated, so post-processing is required. In the present work we compare different versions of Bayesian Model Averaging (BMA) and Ensemble Model Output Statistics (EMOS) post-processing methods in order to calibrate 2m temperature and 10m wind speed forecasts of the operational ALADIN Limited Area Model Ensemble Prediction System of the Hungarian Meteorological Service. We show that compared to the raw ensemble both post-processing methods improve the calibration of probabilistic and accuracy of point forecasts and that the best BMA method slightly outperforms the EMOS technique.
• ### Optimal designs for parameters of shifted Ornstein-Uhlenbeck sheets measured on monotonic sets(1312.0099)

Nov. 30, 2013 stat.ME
Measurement on sets with a specific geometric shape can be of interest for many important applications (e.g. measurement along the isotherms in structural engineering). In the present paper the properties of optimal designs for estimating the parameters of shifted Ornstein-Uhlenbeck sheets, that is, Gaussian two-variable random fields with exponential correlation structures, are investigated when the processes are observed on monotonic sets. Substantial differences are demonstrated between the cases when one is interested only in trend parameters and when the whole parameter set is of interest. The theoretical results are illustrated by computer experiments and simulated examples from the field of structural engineering. From the design point of view the most interesting finding of the paper is the loss of efficiency of the regular grid design compared to the optimal monotonic design.
• ### Probabilistic wind speed forecasting using Bayesian model averaging with truncated normal components(1305.1184)

May 7, 2013 stat.ME
Bayesian model averaging (BMA) is a statistical method for post-processing forecast ensembles of atmospheric variables, obtained from multiple runs of numerical weather prediction models, in order to create calibrated predictive probability density functions (PDFs). The BMA predictive PDF of the future weather quantity is the mixture of the individual PDFs corresponding to the ensemble members, and the weights and model parameters are estimated using ensemble members and validating observations from a given training period. In the present paper we introduce a BMA model for calibrating wind speed forecasts, where the component PDFs follow a truncated normal distribution with cut-off at zero, and apply it to the ALADIN-HUNEPS ensemble of the Hungarian Meteorological Service. Three parameter estimation methods are proposed and each of the corresponding models outperforms the traditional gamma BMA model both in calibration and in accuracy of predictions. Moreover, since here the maximum likelihood estimation of the parameters does not require numerical optimization, modelling can be performed much faster than in the case of gamma mixtures.
• ### Probabilistic temperature forecasting with statistical calibration in Hungary(1303.2133)

March 8, 2013 stat.AP
Weather forecasting is mostly based on the outputs of deterministic numerical weather forecasting models. Multiple runs of these models with different initial conditions result in forecast ensembles which are used for estimating the distribution of future atmospheric variables. However, these ensembles are usually under-dispersive and uncalibrated, so post-processing is required. In the present work Bayesian Model Averaging (BMA) is applied for calibrating ensembles of temperature forecasts produced by the operational Limited Area Model Ensemble Prediction System of the Hungarian Meteorological Service (HMS). We describe two possible BMA models for temperature data of the HMS and show that BMA post-processing significantly improves calibration and probabilistic forecasts, although the accuracy of point forecasts remains largely unchanged.
• ### Probabilistic wind speed forecasting in Hungary(1202.4442)

Jan. 7, 2013 stat.AP
Prediction of various weather quantities is mostly based on deterministic numerical weather forecasting models. Multiple runs of these models with different initial conditions result in ensembles of forecasts which are applied for estimating the distribution of future weather quantities. However, the ensembles are usually under-dispersive and uncalibrated, so post-processing is required. In the present work Bayesian Model Averaging (BMA) is applied for calibrating ensembles of wind speed forecasts produced by the operational Limited Area Model Ensemble Prediction System of the Hungarian Meteorological Service (HMS). We describe two possible BMA models for wind speed data of the HMS and show that BMA post-processing significantly improves the calibration and precision of forecasts.
• ### Testing stability in a spatial unilateral autoregressive model(1203.4346)

March 20, 2012 math.ST, stat.TH
The least squares estimator of the stability parameter $\varrho := |\alpha| + |\beta|$ for a spatial unilateral autoregressive process $X_{k,\ell}=\alpha X_{k-1,\ell}+\beta X_{k,\ell-1}+\varepsilon_{k,\ell}$ is investigated. Asymptotic normality with a scaling factor $n^{5/4}$ is shown in the unstable case, i.e., when $\varrho = 1$, in contrast to the AR(p) model $X_k=\alpha_1 X_{k-1}+ \cdots +\alpha_p X_{k-p}+ \varepsilon_k$, where the least squares estimator of the stability parameter $\varrho :=\alpha_1 + \cdots + \alpha_p$ is not asymptotically normal in the unstable, i.e., in the unit root case.
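The recursion and the least squares estimator above can be tried out numerically with a short simulation. The sketch below makes assumptions that the abstract does not state: zero boundary values for $k = 0$ or $\ell = 0$, standard normal innovations, and hypothetical function names (`simulate_spatial_ar`, `least_squares`). It estimates $(\alpha, \beta)$ by solving the $2 \times 2$ normal equations and then forms $\hat\varrho = |\hat\alpha| + |\hat\beta|$.

```python
import random


def simulate_spatial_ar(n, alpha, beta, seed=0):
    """Simulate X[k][l] = alpha*X[k-1][l] + beta*X[k][l-1] + eps on an
    n x n grid, assuming zero boundary values and N(0, 1) innovations."""
    rng = random.Random(seed)
    x = [[0.0] * (n + 1) for _ in range(n + 1)]
    for k in range(1, n + 1):
        for l in range(1, n + 1):
            x[k][l] = (alpha * x[k - 1][l] + beta * x[k][l - 1]
                       + rng.gauss(0.0, 1.0))
    return x


def least_squares(x):
    """Least squares estimates of (alpha, beta) from the normal equations."""
    n = len(x) - 1
    s11 = s12 = s22 = r1 = r2 = 0.0
    for k in range(1, n + 1):
        for l in range(1, n + 1):
            u, v, y = x[k - 1][l], x[k][l - 1], x[k][l]
            s11 += u * u
            s12 += u * v
            s22 += v * v
            r1 += u * y
            r2 += v * y
    det = s11 * s22 - s12 * s12
    alpha_hat = (s22 * r1 - s12 * r2) / det
    beta_hat = (s11 * r2 - s12 * r1) / det
    return alpha_hat, beta_hat


# Stable example (rho = 0.7); values chosen only for illustration.
field = simulate_spatial_ar(80, alpha=0.3, beta=0.4, seed=1)
alpha_hat, beta_hat = least_squares(field)
rho_hat = abs(alpha_hat) + abs(beta_hat)
```

Choosing coefficients with $|\alpha| + |\beta| = 1$ instead gives a sample from the unstable case that the abstract is concerned with.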
• ### Parameter estimation in linear regression driven by a Gaussian sheet(1111.2205)

Nov. 9, 2011 math.ST, stat.TH
The problem of estimating the parameters of a linear regression model $Z(s,t)=m_1g_1(s,t)+ \cdots + m_pg_p(s,t)+U(s,t)$ based on observations of $Z$ on a spatial domain $G$ of special shape is considered, where the driving process $U$ is a Gaussian random field and $g_1, \ldots, g_p$ are known functions. Explicit forms of the maximum likelihood estimators of the parameters are derived in the cases when $U$ is either a Wiener or a stationary or nonstationary Ornstein-Uhlenbeck sheet. Simulation results are also presented, where the driving random sheets are simulated with the help of their Karhunen-Loève expansions.
• ### Parameter estimation in a spatial unit root autoregressive model(1102.3318)

Oct. 2, 2011 math.ST, stat.TH
The spatial unilateral autoregressive model $X_{k,\ell}=\alpha X_{k-1,\ell}+\beta X_{k,\ell-1}+\gamma X_{k-1,\ell-1}+\epsilon_{k,\ell}$ is investigated in the unit root case, that is, when the parameters are on the boundary of the domain of stability that forms a tetrahedron with vertices $(1,1,-1), \ (1,-1,1),\ (-1,1,1)$ and $(-1,-1,-1)$. It is shown that the limiting distribution of the least squares estimator of the parameters is normal and the rate of convergence is $n$ when the parameters are in the faces or on the edges of the tetrahedron, while on the vertices the rate is $n^{3/2}$.
• ### On the variances of a spatial unit root model(1006.5730)

Sept. 3, 2010 math.ST, stat.TH
The asymptotic properties of the variances of the spatial autoregressive model $X_{k,\ell}=\alpha X_{k-1,\ell}+\beta X_{k,\ell-1}+\gamma X_{k-1,\ell-1}+\epsilon_{k,\ell}$ are investigated in the unit root case, that is, when the parameters are on the boundary of the domain of stability that forms a tetrahedron in $[-1,1]^3$. The limit of the variance of $n^{-\varrho}X_{[ns],[nt]}$ is determined, where on the interior of the faces of the domain of stability $\varrho=1/4$, on the edges $\varrho =1/2$, while on the vertices $\varrho =1$.
• ### On the least squares estimator in a nearly unstable sequence of stationary spatial AR models(0803.2486)

March 17, 2008 math.PR, math.ST, stat.TH
A nearly unstable sequence of stationary spatial autoregressive processes is investigated, when the sum of the absolute values of the autoregressive coefficients tends to one. It is shown that after an appropriate norming the least squares estimator for these coefficients has a normal limit distribution. If none of the parameters equals zero, then the typical rate of convergence is $n$.