On the use of a local $\hat{R}$ to improve MCMC convergence diagnostic: Multivariate examples using OpenTurns

Théo Moins, Julyan Arbel, Anne Dutfoy, Stéphane Girard

In all the examples we suppose that the margins are uniform to analyse $\hat{R}_\infty$ only as a function of the copulas.

1. Normal Copula

We consider bivariate Normal copulas on our $m$ chains:

\begin{equation*} C_j(\boldsymbol{u}) = \Phi_{\boldsymbol{R}_j}(\Phi^{-1}({u_1}), \ldots, \Phi^{-1}({u_d})), \quad \text{with} \quad \Phi_{\boldsymbol{R}_j} \text{ c.d.f of } \mathcal{N}(0,\boldsymbol{R}_j) \end{equation*}

The choice of $\boldsymbol{R}_j$ is the following:

\begin{align*} \boldsymbol{R}_j &= \begin{pmatrix} 1 & 0\\ 0 & 1 \end{pmatrix} \quad \text{if} \quad j \in \{1, \ldots, m-1\}, \\ \text{And } \boldsymbol{R}_m &= \begin{pmatrix} 1 & \rho_m\\ \rho_m & 1 \end{pmatrix} \quad \text{with} \quad \rho_m \in (-1, 1) \end{align*}

Example with $\rho_m = 0.8$:

Replication of $\hat{R}_\infty$ for different value of $\rho_m$

With different value of $\rho_m$, we replicate 10 times the experiment and report the corresponding value of $\hat{R}_\infty$.

The theoretical counterparts can be computed thanks to the computeCDF method in OpenTURN.

This illustates the influence of the dependence direction in the sensitivity of $\hat{R}_\infty$ (see Section 3.3): in dimension two, $\hat{R}_\infty$ is more sensitive in the negative dependence of the two variables than in the positive one.

When $m=2$, the two theoretical value when $|\rho_m| \to 1$ are known and are equal to $\left(\sqrt{7/6} , \sqrt{1/2 + 1/\sqrt{3}}\right) \approx (1.08, 1.04)$ for the negative and positive comonotonic dependence.

2. Mixture of Copulas : Min and Independant copula

In this example we suppose that all the chains are distributed according to an independent copula, except the last which is a mixture of the min and independant copula:

\begin{align*} C_j(\boldsymbol{u}) &= \Pi(\boldsymbol{u}), \quad \forall j \in \{1, \ldots, m-1\} \\ C_m(\boldsymbol{u}) &= \theta. M(\boldsymbol{u}) + (1-\theta). \Pi(\boldsymbol{u}), \quad \theta \in (0,1) \end{align*}

Copula plot for the last chain for different value of $\theta$

Replication of $\hat{R}_\infty$ for different value of $\theta$

We find again a convergence to the theoretical value of $\sqrt{1/2 + 1/\sqrt{3}} \approx 1.04$ when $\theta \to 1$.

However, the variance of the estimator $\hat{R}_\infty$ of $R_\infty$ seems to increase as the dissimilarity between the chains increases.

3. Gumbel copula

In this example we consider Gumbel copulas for the chains

\begin{equation*} C_j(u_1, u_2) = \exp(-((-\log(u_1))^{\theta_j} + (-\log(u_2))^{\theta_j}))^{1/\theta_j}), \quad \forall j \in \{1, \ldots, m\} \end{equation*}

The parameter $\theta \geq 1$ is fixed by default to $2$ for all the chains except the last one: \begin{align*} \theta_j &= 2, \quad \forall j \in \{1, \ldots, m-1\}\\ \theta_m &\geq 2 \end{align*}

Example of Gumbel copulas

Replication of $\hat{R}_\infty$ for different value of $\theta_m$

We can see here that $\hat{R}_\infty$ is quite insensitive to this difference in distribution. Considering $\hat{R}^{-}_\infty$ instead which the computation on $\mathbb{I}\{\theta_1^{(\cdot)} \leq x_1, \theta_2^{(\cdot)} \geq x_2\}$ (see Section 3.3) allows to be more sensitive as we are in dimension two: