Goodness-of-fit tests for copulas: A review and a power study

https://doi.org/10.1016/j.insmatheco.2007.10.005Get rights and content

Abstract

Many proposals have been made recently for goodness-of-fit testing of copula models. After reviewing them briefly, the authors concentrate on “blanket tests”, i.e., those whose implementation requires neither an arbitrary categorization of the data nor any strategic choice of smoothing parameter, weight function, kernel, window, etc. The authors present a critical review of these procedures and suggest new ones. They describe and interpret the results of a large Monte Carlo experiment designed to assess the effect of the sample size and the strength of dependence on the level and power of the blanket tests for various combinations of copula models under the null hypothesis and the alternative. To circumvent problems in the determination of the limiting distribution of the test statistics under composite null hypotheses, they recommend the use of a double parametric bootstrap procedure, whose implementation is detailed. They conclude with a number of practical recommendations.

Introduction

Consider a continuous random vector X=(X1,,Xd) with joint cumulative distribution function H and margins F1,,Fd. The copula representation of H is given by H(x1,,xd)=C{F1(x1),,Fd(xd)}, where C is a unique cumulative distribution function having uniform margins on (0,1). A copula model for X arises when C is unknown but assumed to belong to a class C0={Cθ:θO}, where O is an open subset of Rp for some integer p1. The books of Joe (1997) and Nelsen (2006) provide handy compendiums of the most common parametric families of copulas.

Copula modeling has found many successful applications of late, notably in actuarial science, survival analysis and hydrology; see, e.g., Frees and Valdez (1998), Cui and Sun (2004) and Genest and Favre (2007) and references therein. However, nowhere has the methodology been adopted and used with greater intensity than in finance. Ample illustrations are provided in the books of Cherubini et al. (2004) and McNeil et al. (2005), notably in the context of asset pricing and credit risk management.

Given independent copies X1=(X11,,X1d),,Xn=(Xn1,,Xnd) of X, the problem of estimating θ under the assumption H0:CC0 has already been the object of much work; see, e.g., Genest et al. (1995), Shih and Louis (1995), Joe, 1997, Joe, 2005, Tsukahara (2005) or Chen et al. (2006). However, the complementary issue of testing H0 is only beginning to draw attention.

The situation is evolving rapidly but at this point in time, the literature on the subject can be divided broadly into three groups:

  • (1)

    Procedures developed for testing specific dependence structures such as the Normal copula (Malevergne and Sornette, 2003) or the equally popular Clayton family, also referred to as the gamma frailty model in survival analysis (Shih, 1998, Glidden, 1999, Cui and Sun, 2004).

  • (2)

    Statistics that can be used to test the goodness-of-fit of any class of copulas but whose implementation involves:

    • (a)

      an arbitrary parameter, as in the rank-based statistic due to Wang and Wells (2000);

    • (b)

      kernels, weight functions and associated smoothing parameters, as in Berg and Bakken (2005), Fermanian (2005), Panchenko (2005) and Scaillet (2007);

    • (c)

      ad hoc categorization of the data into a multiway contingency table in order to apply an analogue of the standard chi-squared test, along the lines of Genest and Rivest (1993), Klugman and Parsa (1999), Andersen et al. (2005), Dobrić and Schmid (2005) or Junker and May (2005).

  • (3)

    “Blanket tests”, i.e., those applicable to all copula structures and requiring no strategic choice for their use. Included in this category are variants of the Wang–Wells approach due to Genest et al. (2006), but also the procedures investigated or used by Breymann et al. (2003), Genest and Rémillard (in press) and Dobrić and Schmid (2007).

And then there are authors who, in applied work, use standard goodness-of-fit statistics as a tool for choosing between several copulas, but without attempting to formally test whether the selected model is appropriate, in the light of a P-value. See, e.g., the analysis of stock index returns by Ané and Kharoubi (2003).

The purpose of this paper is to present a critical review of the blanket goodness-of-fit tests proposed to date, to suggest variants or improvements, and to compare the relative power of these procedures through a Monte Carlo study involving a large number of copula alternatives and dependence conditions. After some general considerations given in Section 2, existing tests are described in Section 3 and new statistics are proposed in Section 4. Listed in Section 5 are the factors considered in the study designed to assess the level and compare the power of the selected tests. Results are reported and discussed in Section 6. Finally, various observations and methodological recommendations are made in the Conclusion.

Section snippets

General considerations

There is a fundamental difference between the problem of estimating the dependence parameter of a copula model C0={Cθ:θO} and the complementary issue of testing the validity of the null hypothesis H0:CC0 for some class C0 of copulas. The distinction is spelled out below, as it helps to understand the technical challenges associated with goodness-of-fit testing in this context.

“Blanket tests” currently available

This section describes five rank-based procedures that have been recently proposed for testing the goodness-of-fit of any class of d-variate copulas. Of all the tests listed in Section 1, these are the only ones that qualify as “blanket”, in the sense that they involve no parameter tuning or other strategic choices.

New procedures based on Rosenblatt’s transform

One avenue not covered by Breymann et al. (2003) or Dobrić and Schmid (2007) consists in working directly with the process, using the full power of Rosenblatt’s transform. The idea is not new, as it appeared in Klugman and Parsa (1999) for bivariate censored data. These authors propose a Pearson chi-square statistic computed from E1,,En. However, their P-value calculation is incorrect, because it assumes wrongly that the limiting distribution is chi-square. The fact that the margins were

Experimental design

A large-scale Monte Carlo experiment was conducted to assess the finite-sample properties of the proposed goodness-of-fit tests for various choices of dependence structures and degrees of association. Two characteristics of the tests were of interest: their ability to maintain their nominal level, arbitrarily fixed at 5% throughout the study, and their power under a variety of alternatives.

To curtail the computational effort, comparisons were limited to the bivariate case and to three degrees

Results

Table 1, Table 2, Table 3 report the level and power of the blanket tests from Sections 3 “Blanket tests” currently available, 4 New procedures based on Rosenblatt’s transform. Each table corresponds to a specific combination of τ{0.25,0.50,0.75} and n=150. Each line of a table shows the percentage of rejection of H0:CC0 associated with the different tests, given a choice of C0 and a true underlying copula family C.

As an example, Table 1 shows that when testing for the Frank copula from a

Observations and recommendations

Based on the experience gained from carrying out this comparative power study of the existing blanket goodness-of-fit tests for copula models, the following general observations and specific recommendations can be made.

  • I.

    General observations:

    • (a)

      In goodness-of-fit testing as in any other inferential context, the greater the sample size, the better. Large data sets not only help to distinguish between copula models but play a role in the reliability of the parametric bootstrap procedures used to

Acknowledgments

Partial funding in support of this work was granted by the Natural Sciences and Engineering Research Council of Canada, by the Fonds québécois de la recherche sur la nature et les technologies, and by the Institut de finance mathématique de Montréal. The authors gratefully acknowledge the GERAD (Montréal) and the Salle des marchés at Université Laval for extensive use of their computing facilities.

References (55)

  • O. Scaillet

    Kernel based goodness-of-fit tests for copulas with fixed smoothing parameters

    Journal of Multivariate Analysis

    (2007)
  • B. Abdous et al.

    Dependence properties of meta-elliptical distributions

  • P.K. Andersen et al.

    A class of goodness of fit tests for a copula based on bivariate right-censored data

    Biometrical Journal

    (2005)
  • T. Ané et al.

    Dependence structure and risk measure

    Journal of Business

    (2003)
  • Beaudoin, D., 2007. Estimation de la dépendance et choix de modèles pour des données bivariées sujettes à censure et à...
  • Berg, D., Bakken, H., 2005. A goodness-of-fit test for copulae based on the probability integral transform. Technical...
  • W. Breymann et al.

    Dependence structures for multivariate high-frequency data in finance

    Quantitative Finance

    (2003)
  • T. Burzykowski et al.

    The validation of surrogate end points by using data from randomized clinical trials: A case-study in advanced colorectal cancer

    Journal of the Royal Statistical Society Series A

    (2004)
  • X. Chen et al.

    Efficient estimation of semiparametric multivariate copula models

    Journal of the American Statistical Association

    (2006)
  • U. Cherubini et al.

    Copula Methods in Finance

    (2004)
  • D.G. Clayton

    A model for association in bivariate life tables and its application in epidemiological studies of familial tendency in chronic disease incidence

    Biometrika

    (1978)
  • R.D. Cook et al.

    A family of distributions for modelling nonelliptically symmetric multivariate data

    Journal of the Royal Statistical Society Series B

    (1981)
  • S. Cui et al.

    Checking for the gamma frailty distribution under the marginal proportional hazards frailty model

    Statistica Sinica

    (2004)
  • P. Deheuvels

    La fonction de dépendance empirique et ses propriétés: Un test non paramétrique d’indépendance

    Académie Royale de Belgique. Bulletin de la Classe des Sciences, 5e Série

    (1979)
  • J. Dobrić et al.

    Testing goodness of fit for parametric families of copulas: Application to financial data

    Communications in Statistics. Simulation and Computation

    (2005)
  • D. Faraggi et al.

    Competing risks with frailty models when treatment affects only one failure type

    Biometrika

    (1996)
  • J.-D. Fermanian et al.

    Weak convergence of empirical copula processes

    Bernoulli

    (2004)
  • Cited by (1050)

    • Copula modeling from Abe Sklar to the present day

      2024, Journal of Multivariate Analysis
    View all citing articles on Scopus
    View full text