MLE Fitting of Normal Bulk and GPD for Both Tails with Single Continuity Constraint at Both Thresholds Extreme Value Mixture Model
MLE Fitting of Normal Bulk and GPD for Both Tails with Single Continuity Constraint at Both Thresholds Extreme Value Mixture Model
Maximum likelihood estimation for fitting the extreme value mixture model with normal for bulk distribution between thresholds and conditional GPDs for both tails with continuity at thresholds. With options for profile likelihood estimation for both thresholds and fixed threshold approach.
phiul: probability of being below lower threshold (0,1) or logical, see Details in help for fgng
phiur: probability of being above upper threshold (0,1) or logical, see Details in help for fgng
ulseq: vector of lower thresholds (or scalar) to be considered in profile likelihood or NULL for no profile likelihood
urseq: vector of upper thresholds (or scalar) to be considered in profile likelihood or NULL for no profile likelihood
fixedu: logical, should threshold be fixed (at either scalar value in ulseq/urseq, or estimated from maximum of profile likelihood evaluated at sequence of thresholds in ulseq/urseq)
pvector: vector of initial values of parameters or NULL for default values, see below
std.err: logical, should standard errors be calculated
method: optimisation method (see optim)
control: optimisation control list (see optim)
finitelik: logical, should log-likelihood return finite value for invalid parameters
...: optional inputs passed to optim
nmean: scalar normal mean
nsd: scalar normal standard deviation (positive)
ul: scalar lower tail threshold
xil: scalar lower tail GPD shape parameter
ur: scalar upper tail threshold
xir: scalar upper tail GPD shape parameter
log: logical, if TRUE then log-likelihood rather than likelihood is output
ulr: vector of length 2 giving lower and upper tail thresholds or NULL for default values
Returns
Log-likelihood is given by lgngcon and it's wrappers for negative log-likelihood from nlgngcon
and nlugngcon. Profile likelihood for both thresholds given by proflugngcon. Fitting function fgngcon returns a simple list with the following elements
call :
optim call
x :
data vector x
init :
pvector
fixedu :
fixed thresholds, logical
ulseq :
lower threshold vector for profile likelihood or scalar for fixed threshold
urseq :
upper threshold vector for profile likelihood or scalar for fixed threshold
nllhuseq :
profile negative log-likelihood at each threshold pair in (ulseq, urseq)
optim :
complete optim output
mle :
vector of MLE of parameters
cov :
variance-covariance matrix of MLE of parameters
se :
vector of standard errors of MLE of parameters
rate :
phiu to be consistent with evd
nllh :
minimum negative log-likelihood
n :
total sample size
nmean :
MLE of normal mean
nsd :
MLE of normal standard deviation
ul :
lower threshold (fixed or MLE)
sigmaul :
MLE of lower tail GPD scale (estimated from other parameters)
xil :
MLE of lower tail GPD shape
phiul :
MLE of lower tail fraction (bulk model or parameterised approach)
se.phiul :
standard error of MLE of lower tail fraction
ur :
upper threshold (fixed or MLE)
sigmaur :
MLE of upper tail GPD scale (estimated from other parameters)
xir :
MLE of upper tail GPD shape
phiur :
MLE of upper tail fraction (bulk model or parameterised approach)
se.phiur :
standard error of MLE of upper tail fraction
Details
The extreme value mixture model with normal bulk and GPD for both tails with continuity at thresholds is fitted to the entire dataset using maximum likelihood estimation. The estimated parameters, variance-covariance matrix and their standard errors are automatically output.
See help for fnormgpd and fgngfor details, type help fnormgpd and help fgng. Only the different features are outlined below for brevity.
The GPD sigmaul and sigmaur parameters are now specified as function of other parameters, see help for dgngcon for details, type help gngcon. Therefore, sigmaul and sigmaur should not be included in the parameter vector if initial values are provided, making the full parameter vector The full parameter vector is (nmean, nsd, ul, xil, ur, xir) if thresholds are also estimated and (nmean, nsd, xil, xir) for profile likelihood or fixed threshold approach.
If the profile likelihood approach is used, then a grid search over all combinations of both thresholds is carried out. The combinations which lead to less than 5 in any datapoints beyond the thresholds are not considered.
Note
When pvector=NULL then the initial values are:
MLE of normal parameters assuming entire population is normal; and
lower threshold 10% quantile (not relevant for profile likelihood for threshold or fixed threshold approaches);
upper threshold 90% quantile (not relevant for profile likelihood for threshold or fixed threshold approaches);
MLE of GPD shape parameters beyond threshold.
Acknowledgments
See Acknowledgments in fnormgpd, type help fnormgpd. Based on code by Xin Zhao produced for MATLAB.
Scarrott, C.J. and MacDonald, A. (2012). A review of extreme value threshold estimation and uncertainty quantification. REVSTAT - Statistical Journal 10(1), 33-59. Available from http://www.ine.pt/revstat/pdf/rs120102.pdf
Zhao, X., Scarrott, C.J. Reale, M. and Oxley, L. (2010). Extreme value modelling for forecasting the market crisis. Applied Financial Econometrics 20(1), 63-72.
Mendes, B. and H. F. Lopes (2004). Data driven estimates for mixtures. Computational Statistics and Data Analysis 47(3), 583-598.