Removing Multivariate Outliers
Posted: Wed Mar 22, 2017 9:11 pm
Hello,
I have a data set of 20,000 students, nested in 1000 classrooms, nested in 300 schools.
I have saved my standardized residuals for level 1, level 2, and level 3. The only thing random in my model is the intercept.
I want to remove the standardized residuals that are >/=2 or </=(-2) (at first level, second level, and third level) for a sensitivity analysis.
I can store these residuals, however, the second and third level residuals only provide me with 1 residual per level-unit (i.e. 1000 at class level and 300 at school level) as opposed to a higher-level residual associated with every observation (i.e. level 1, students). When I export my dataset to SPSS, the second and third level residuals are not matching up with my level 2 and level 3 IDs, so I cannot aggregate the values.
In the MLwiN manual, the only thing I can seem to find about sensitivity analyses/removing outliers, suggested manually pointing-and-clicking every outlying observation in the residual plot and "removing from analysis" by hand. This is not feasible given the size of my dataset.
Does anyone know how to remove multivariate outliers in MLwiN?
Thanks in advance,
Jillian
I have a data set of 20,000 students, nested in 1000 classrooms, nested in 300 schools.
I have saved my standardized residuals for level 1, level 2, and level 3. The only thing random in my model is the intercept.
I want to remove the standardized residuals that are >/=2 or </=(-2) (at first level, second level, and third level) for a sensitivity analysis.
I can store these residuals, however, the second and third level residuals only provide me with 1 residual per level-unit (i.e. 1000 at class level and 300 at school level) as opposed to a higher-level residual associated with every observation (i.e. level 1, students). When I export my dataset to SPSS, the second and third level residuals are not matching up with my level 2 and level 3 IDs, so I cannot aggregate the values.
In the MLwiN manual, the only thing I can seem to find about sensitivity analyses/removing outliers, suggested manually pointing-and-clicking every outlying observation in the residual plot and "removing from analysis" by hand. This is not feasible given the size of my dataset.
Does anyone know how to remove multivariate outliers in MLwiN?
Thanks in advance,
Jillian