Questions on convergence diagnostics
Posted: Thu Aug 28, 2014 2:14 am
Dear all,
I have some questions regarding convergence diagnostics, especially how to read the univariate and multivariate plots.
1. In the attached file, you may find six plots. "convergence_udiag1_short" and "convergence_udiag2_short" are for univariate measures and "convergence_mdiag_short" is for multivariate measures. These are based 400000 draws. According to the relevant documents, for example "Pfeifer (2014): An Introduction to Graphs in Dynare at https://sites.google.com/site/pfeiferecon/dynare", the parameters in "convergence_udiag1_short" seem to converge because the two lines seem to stabilize horizontally and be close to each other. While in "convergence_udiag2_short", the parameters seem to be problematic as the two lines have not stabilized horizontally after 400000 draws, although the two lines are quite close to each other. In "convergence_mdiag_short", the multivariate measures seem to behave well. In order to see if the problem arises because the chain is not long enough, I run MCMC with "load_mh_file" and the chain now contains 1 million draws. The plots are "convergence_udiag1_long", "convergence_udiag2_long" and "convergence_mdiag_long". As for the short chain, "convergence_udiag1_long" and "convergence_mdiag_long" seem OK. But for "convergence_udiag2_long", I am not quite sure. It seems that the parameters in this plot mix very slowly.
My first question is how to understand these plots and more generally, how to judge the convergence of MCMC. In this case, the multivariate plot seems ok but the univariate plots do not. Do I need even more draws to ensure all the chains to converge?
2. To provide further evidence of the model, I also included two mode_check plots and two prior_posterior plots. I found that two persistence parameters of shock processes are very high, e.g. 0.998 for rho_d and rho_g. So the model itself seems not to absorb the persistence from the data and the shock processes do that.
My second question is that if the slow or non convergence is related to the potentially problematic parameter estimates?
Thanks very much in advance!
I have some questions regarding convergence diagnostics, especially how to read the univariate and multivariate plots.
1. In the attached file, you may find six plots. "convergence_udiag1_short" and "convergence_udiag2_short" are for univariate measures and "convergence_mdiag_short" is for multivariate measures. These are based 400000 draws. According to the relevant documents, for example "Pfeifer (2014): An Introduction to Graphs in Dynare at https://sites.google.com/site/pfeiferecon/dynare", the parameters in "convergence_udiag1_short" seem to converge because the two lines seem to stabilize horizontally and be close to each other. While in "convergence_udiag2_short", the parameters seem to be problematic as the two lines have not stabilized horizontally after 400000 draws, although the two lines are quite close to each other. In "convergence_mdiag_short", the multivariate measures seem to behave well. In order to see if the problem arises because the chain is not long enough, I run MCMC with "load_mh_file" and the chain now contains 1 million draws. The plots are "convergence_udiag1_long", "convergence_udiag2_long" and "convergence_mdiag_long". As for the short chain, "convergence_udiag1_long" and "convergence_mdiag_long" seem OK. But for "convergence_udiag2_long", I am not quite sure. It seems that the parameters in this plot mix very slowly.
My first question is how to understand these plots and more generally, how to judge the convergence of MCMC. In this case, the multivariate plot seems ok but the univariate plots do not. Do I need even more draws to ensure all the chains to converge?
2. To provide further evidence of the model, I also included two mode_check plots and two prior_posterior plots. I found that two persistence parameters of shock processes are very high, e.g. 0.998 for rho_d and rho_g. So the model itself seems not to absorb the persistence from the data and the shock processes do that.
My second question is that if the slow or non convergence is related to the potentially problematic parameter estimates?
Thanks very much in advance!