ZHANG XiaoQin, CHENG YuYing. Imputation of Missing Values for Compositional Data Based on Random Forest[J]. Chinese Journal of Applied Probability and Statistics, 2017, 33(1): 102-110.
Citation: ZHANG XiaoQin, CHENG YuYing. Imputation of Missing Values for Compositional Data Based on Random Forest[J]. Chinese Journal of Applied Probability and Statistics, 2017, 33(1): 102-110.

Imputation of Missing Values for Compositional Data Based on Random Forest

  • Dealing with the missing values is an important object in the field of data mining. Besides, the properties of compositional data lead to that traditional imputation methods may get undesirable result if they are directly used in this type of data. As a result, the management of missing values in compositional data is of great significant. To solve this problem, this paper uses the relationship between compositional data and Euclidean data, and proposes a new method based on Random Forest for missing values in compositional data. This method has been implemented and evaluated using both simulated and real-world databases, then the experimental results reveal that the new imputation method can be widely used in various types of data sets and has good performance than other methods.
  • loading

Catalog

    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return