large datasets
-
- Posts: 19
- Joined: Thu Sep 03, 2009 1:39 pm
large datasets
I am doing 2-level logistic modeling using MLwiN. My data has >250,000 individuals within >700 clusters. I have less than 10 covariates.
-
- Posts: 16
- Joined: Tue Oct 13, 2009 9:36 am
Re: large datasets
Try using a random sub-sample of your data. With only 2 levels, 700 level 2 units and 10 covariates you do not need nearly as many as 250000 observations to get precise estimates. You will be able to work a lot more quickly with a smaller sample and therefore be able to explore more avenues and potential models etc
-
- Posts: 19
- Joined: Thu Sep 03, 2009 1:39 pm
Re: large datasets
Basically, it doesn't work for the whole data on MLwiN (even single level logistic) although it works for a subset smaller data (about 5000). I don't realize any problems for importing the entire whole data.
-
- Posts: 16
- Joined: Tue Oct 13, 2009 9:36 am
Re: large datasets
You can check whether the entire dataset has been imported properly by comparing the summary statistics: are they the same in MLwiN as in your other stats package
-
- Posts: 19
- Joined: Thu Sep 03, 2009 1:39 pm
Re: large datasets
Errors: There are three types of errors. One shows me 'out of memory'. And the 2nd one shows me 'no sufficient worksheet size. The 3rd one is that MLwiN is just crashed and turn off. I have two questions. 1)can MLwiN handle such a huge data?
-
- Posts: 16
- Joined: Tue Oct 13, 2009 9:36 am
Re: large datasets
Yes. But depends of course on the spec of your computer. See 'How do I know MLwiN can handle the model based on my own data?' (http://www.cmm.bristol.ac.uk/MLwiN/tech ... html#capac)
-
- Posts: 19
- Joined: Thu Sep 03, 2009 1:39 pm
Re: large datasets
How to re-configure/specify worksheet size from the 'option'?
-
- Posts: 16
- Joined: Tue Oct 13, 2009 9:36 am
Re: large datasets
Go to options>worksheet then increase worksheet size to a large amount (experiment)