Choosing the right estimation method
Posted: Sun Jan 27, 2013 6:42 pm
Hi,
I am having trouble choosing the best estimation method, and because different methods lead to different conclusions in my case, I wonder what solution to put more trust in.
I have a 3-level binary random intercepts logit regression, and am under the impression the data is underdispersed. I think the PLQ2 allowing for extra-binomial distribution would be the best option. Certainly because I have very small cluster sizes and many singletons. However, this does not converge, even for the simplest model, even when first estimating PLQ1 and then going 'more' on PLQ2, or first binomial than 'more' on extra-binomial, or when allowing negative values everywhere and changing the convergence tolerance, and changing IGLS to RIGL and all these changes together.
So the dilemma is to now choose between:
- PQL2 and not allow for extra-binomial. Note: the variation on level one goes to 0.365 when estimating in PLQ1, so I feel like have severe underdispersion.
- PLQ1 and allow for extra-binomial distribution. However the second level estimation is preferred by RodrÃguez G. and Goldman N. (2001; Improved estimation procedures for multilevel models with binary response: a case-study. Journal of the Royal Statistical Society, Series A, 164, 339-355.)
- MCMC however this takes a very long time, and does not seem to allow for extra-binomial distribution.
What would be the 'least harm' option, and based on which principles. Or are there no such principles yet. My personal preference goes to PQL1 extra-binomial, but more for subjective reasons.
I am having trouble choosing the best estimation method, and because different methods lead to different conclusions in my case, I wonder what solution to put more trust in.
I have a 3-level binary random intercepts logit regression, and am under the impression the data is underdispersed. I think the PLQ2 allowing for extra-binomial distribution would be the best option. Certainly because I have very small cluster sizes and many singletons. However, this does not converge, even for the simplest model, even when first estimating PLQ1 and then going 'more' on PLQ2, or first binomial than 'more' on extra-binomial, or when allowing negative values everywhere and changing the convergence tolerance, and changing IGLS to RIGL and all these changes together.
So the dilemma is to now choose between:
- PQL2 and not allow for extra-binomial. Note: the variation on level one goes to 0.365 when estimating in PLQ1, so I feel like have severe underdispersion.
- PLQ1 and allow for extra-binomial distribution. However the second level estimation is preferred by RodrÃguez G. and Goldman N. (2001; Improved estimation procedures for multilevel models with binary response: a case-study. Journal of the Royal Statistical Society, Series A, 164, 339-355.)
- MCMC however this takes a very long time, and does not seem to allow for extra-binomial distribution.
What would be the 'least harm' option, and based on which principles. Or are there no such principles yet. My personal preference goes to PQL1 extra-binomial, but more for subjective reasons.