刘展, 潘莹丽. 基于超总体伪设计与组合样本的候选者数据库网络调查的推断研究[J]. 应用概率统计, 2019, 35(3): 221-232. DOI: 10.3969/j.issn.1001-4268.2019.03.001
引用本文: 刘展, 潘莹丽. 基于超总体伪设计与组合样本的候选者数据库网络调查的推断研究[J]. 应用概率统计, 2019, 35(3): 221-232. DOI: 10.3969/j.issn.1001-4268.2019.03.001
LIU Zhan, PAN Yingli. Research on Inference of Candidate Database Web Surveys Based on Superpopulation Pseudo Design and the Combined Sample[J]. Chinese Journal of Applied Probability and Statistics, 2019, 35(3): 221-232. DOI: 10.3969/j.issn.1001-4268.2019.03.001
Citation: LIU Zhan, PAN Yingli. Research on Inference of Candidate Database Web Surveys Based on Superpopulation Pseudo Design and the Combined Sample[J]. Chinese Journal of Applied Probability and Statistics, 2019, 35(3): 221-232. DOI: 10.3969/j.issn.1001-4268.2019.03.001

基于超总体伪设计与组合样本的候选者数据库网络调查的推断研究

Research on Inference of Candidate Database Web Surveys Based on Superpopulation Pseudo Design and the Combined Sample

  • 摘要: 候选者数据库网络调查的推断问题是网络调查发展中迫切需要解决的问题. 基于此, 提出基于超总体伪设计与组合样本的非概率抽样推断方法:对网络候选者数据库的调查样本建立超总体模型来构造伪权数,并根据网络候选者数据库的调查样本和概率样本的组合样本计算总体均值的估计,最后根据超总体模型的方差估计理论推导出目标总体均值估计的方差估计式,同时采用Bootstrap与Jackknife方法来估计总体均值估计的方差,并比较不同方差估计方法的效果. 研究结果表明: 基于超总体伪设计与组合样本的总体均值估计效率高于仅使用概率样本的估计和仅使用网络候选者数据库的调查样本加权的估计,估计效果较好; 方差估计方面, 采用VM1、VM2与VM3方法计算的方差估计相比而言更好.

     

    Abstract: How to solve the inference problem of candidate database web surveys is an urgent problem to be solved in the development of web survey. In order to solve this problem, the inference method of non-probability sampling based on superpopulation pseudo design and the combined sample is proposed. A superpopulation model is firstly built up to construct pseudo weights for a survey sample of the web candidate database. The estimator of the population mean is then computed according to the combined sample composed of the survey sample of the web candidate database and a probability sample. The variance estimator of the population mean estimator is lastly derived according to the variance estimation theory of the superpopulation model. The Bootstrap and Jackknife methods are also used to compute the variance estimator. And all these variance estimation methods are compared. The research results show that the population mean estimator based on superpopulation pseudo design and the combined sample is better, and has higher efficiency than the estimator only using the probability sample and the weighted estimator only using the survey sample of the web candidate database. The variance estimator computed by using the VM1, VM2 and VM3 method are relatively better.

     

/

返回文章
返回