曹萍, 夏志明. 海量数据中均值变点的快速估计方法[J]. 应用概率统计, 2020, 36(5): 493-508. DOI: 10.3969/j.issn.1001-4268.2020.05.005
引用本文: 曹萍, 夏志明. 海量数据中均值变点的快速估计方法[J]. 应用概率统计, 2020, 36(5): 493-508. DOI: 10.3969/j.issn.1001-4268.2020.05.005
CAO Ping, XIA Zhiming. A Fast Estimation Method for Mean Change Point in Massive Data Sets[J]. Chinese Journal of Applied Probability and Statistics, 2020, 36(5): 493-508. DOI: 10.3969/j.issn.1001-4268.2020.05.005
Citation: CAO Ping, XIA Zhiming. A Fast Estimation Method for Mean Change Point in Massive Data Sets[J]. Chinese Journal of Applied Probability and Statistics, 2020, 36(5): 493-508. DOI: 10.3969/j.issn.1001-4268.2020.05.005

海量数据中均值变点的快速估计方法

A Fast Estimation Method for Mean Change Point in Massive Data Sets

  • 摘要: 当样本容量为N时,均值变点的最小二乘估计的计算复杂度为O(N^2),在海量数据情形下亟需降低计算复杂度. 本文针对均值变点估计问题,提出了一种两阶段快速扫描算法,并证明该方法与均值变点的最小二乘估计具有相同的收敛速度和极限分布,且新算法的最佳复杂度为O(N^4/3\cdot b_n^2/3).我们从计算时间和估计效率方面进行了充足的数据实验,结果表明新老方法估计效率相似, 但我们的方法计算时间明显缩短.

     

    Abstract: When the sample size is N, the computational complexity of the least squares estimate of mean change point is O(N^2), and it's necessary to reduce the computational complexity in the case of huge data. In this paper, a two-stage fast scanning algorithm is proposed for the estimation of mean change point, and it is proved that this method has the same convergence speed and limiting distribution as the least squares estimation of mean change point, and the optimal complexity of the new algorithm is O(N^4/3\cdot b_n^2/3). We have conducted sufficient data experiments in terms of computation time and estimated efficiency, and the results show that the estimated efficiency of the new and old methods is similar, but the computation time of our method is obviously shortened.

     

/

返回文章
返回