A subsampled double bootstrap for massive data

dc.contributor.authorSengupta, Srijan
dc.contributor.authorVolgushev, Stanislav
dc.contributor.authorShao, Xiaofeng
dc.date.accessioned2015-08-04T09:10:56Z
dc.date.available2015-08-04T09:10:56Z
dc.date.issued2015
dc.description.abstractThe bootstrap is a popular and powerful method for assessing precision of estimators and inferential methods. However, for massive datasets which are increasingly prevalent, the bootstrap becomes prohibitively costly in computation and its feasibility is questionable even with modern parallel computing platforms. Recently Kleiner, Talwalkar, Sarkar, and Jordan (2014) proposed a method called BLB (Bag of Little Bootstraps) for massive data which is more computationally scalable with little sacrifice of statistical accuracy. Building on BLB and the idea of fast double bootstrap, we propose a new resampling method, the subsampled double bootstrap, for both independent data and time series data. We establish consistency of the subsampled double bootstrap under mild conditions for both independent and dependent cases. Methodologically, the subsampled double bootstrap is superior to BLB in terms of running time, more sample coverage and automatic implementation with less tuning parameters for a given time budget. Its advantage relative to BLB and bootstrap is also demonstrated in numerical simulations and a data illustration.en
dc.identifier.urihttp://hdl.handle.net/2003/34180
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-7812
dc.language.isoende
dc.relation.ispartofseriesDiscussion Paper / SFB 823;27/2015en
dc.subjectbig dataen
dc.subjectresamplingen
dc.subjectsubsamplingen
dc.subjectcomputational costen
dc.subject.ddc310
dc.subject.ddc330
dc.subject.ddc620
dc.titleA subsampled double bootstrap for massive dataen
dc.typeTextde
dc.type.publicationtypeworkingPaperde
dcterms.accessRightsopen access

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
DP_2715_SFB823_Sengupta_Volgushev_Shao.pdf
Size:
444.99 KB
Format:
Adobe Portable Document Format
Description:
DNB
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
3.12 KB
Format:
Item-specific license agreed upon to submission
Description: