王军, 刘三民, 刘涛. 具有噪声的动态数据流集成分类方法[J]. 内江师范学院学报, 2017, (8): 51-55. DOI: 10.13603/j.cnki.51-1621/z.2017.08.012
    引用本文: 王军, 刘三民, 刘涛. 具有噪声的动态数据流集成分类方法[J]. 内江师范学院学报, 2017, (8): 51-55. DOI: 10.13603/j.cnki.51-1621/z.2017.08.012
    WANG Jun, LIU Sanmin, LIU Tao. Integrated Classification Method for Dynamic Data Flows with Noise[J]. Journal of Neijiang Normal University, 2017, (8): 51-55. DOI: 10.13603/j.cnki.51-1621/z.2017.08.012
    Citation: WANG Jun, LIU Sanmin, LIU Tao. Integrated Classification Method for Dynamic Data Flows with Noise[J]. Journal of Neijiang Normal University, 2017, (8): 51-55. DOI: 10.13603/j.cnki.51-1621/z.2017.08.012

    具有噪声的动态数据流集成分类方法

    Integrated Classification Method for Dynamic Data Flows with Noise

    • 摘要: 提出一种基于分类器相似性加权和差异性集成的数据流分类方法. 用最新基分类器作为参照分类器,代表数据流中即将出现的概念,基于此分类器通过 Gower相似系数求出基分类器之间的相似性,并以相似性作为基分类器权值进行加权多数投票;同时采用 Q-statistic方法计算出参照分类器与其他基分类器之间的差异性,并根据差异性大小淘汰较弱基分类器保持集成分类模型多样性. 最终构建的集成模型在标准仿真数据集上进行实验仿真.结果表明:在对隐含噪声的动态数据流进行分类时,该方法分类准确率比传统集成分类方法约提高 11 % ,具有良好的分类准确率和抗噪稳定性.

       

      Abstract: A new method of data stream classification based on similarity weighting and differential integration of classifiers is proposed. The method uses the latest base classifier as the reference classifier, representing the upcoming concept in the data stream. Based on this classifier, the similarity between the base classifiers is worked out by use of the Gower’s similarity coefficient, and the similarity is used as the base classifier weights to conduct weighted majority vote. At the same time, Q- statistic method is adopted to calculate the difference between referenced classifiers and other base classifiers, and according to the size of the difference, the relatively weak base classifiers were eliminated so that the diversity of the integrated classification model can be kept. Lastly, simulation experiment is carried out on standard simulation dataset, and the results show that the
      classification accuracy of the presented method is about11% higher than that of the traditional integrated classification method when used to classify dynamic data flow with noise, indicating the method is of good classification accuracy and anti-noise sta- bility.

       

    /

    返回文章
    返回