深度模型融合数据合成机制的长尾目标识别
作者:
作者单位:

华南农业大学电子工程学院(人工智能学院),广州 510642

作者简介:

蔡润基,E-mail:13600374676@163.com

通讯作者:

徐初东,E-mail: cd79cd@126.com

中图分类号:

TP391.41;TP311.13

基金项目:

广东省自然科学基金项目(2020A1515010634)


Synthetic samples combined model-based recognition of long-tailed target
Author:
Affiliation:

College of Electronic Engineering(College of AI),South China Agricultural University,Guangzhou 510642,China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为解决昆虫数据集数据分布不均衡导致识别模型在数据量少的尾部类别的识别性能差的问题,提出1种融合数据合成的卷积识别网络模型(synthetic samples combined model,SSCM),该模型包含图像分割与重组模块、主干网络模块和数据纠正分支模块等3个模块。通过图像分割与重组模块对训练的图片进行分割并重组,得到新的训练数据并加入训练集;再使用ResNet-50作为网络主干提取图片的特征,同时数据纠正分支模块采用均方误差与交叉熵计算合成图像与原图像之间的误差,以减少合成图像对尾部数据的不利影响。构建包含300个蝴蝶类别共26 045张图片的数据集验证模型性能,结果显示,SSCM模型在该数据集上的准确率较DRC、BBN、RIDE等主流长尾目标识别模型分别高3、2.14、2.71个百分点。采用公开昆虫数据集IP102进一步验证SSCM算法的有效性,结果显示,SSCM模型准确率比DRC、BBN、RIDE等模型分别高18.94、3.02、3.36个百分点。

    Abstract:

    Insects are the most diverse animal group in nature. Some species are difficult to collect, which makes datasets often highly heterogeneous with long-tailed distributions. This article proposed a convolution recognition network model based on synthetic samples combined model (SSCM) to solve the problem that the uneven distribution of insect datasets leads to the poor recognition performance of recognition models in tail categories with less data. The model contains three modules including image segmentation and shuffle module, backbone network module and data fix branch module. Through the image segmentation and shuffle module, the training image was segmented and shuffled to obtain new training data and added to the training set. ResNet-50 was used as the network backbone to extract features of image. At the same time, the data fix branch module combined the mean square error and cross-entropy to calculate the error between the synthetic samples and the original image to reduce the adverse effect of the synthetic samples on the tail data. A butterfly dataset containing a total of 26 045 images of 300 species was constructed to evaluate the performance of the model proposed. The results showed that the accuracy of SSCM model was 3, 2.14 and 2.71 percentages higher than that of DRC, BBN and RIDE in the butterfly dataset, respectively. In addition, the validity of the SSCM in the public IP102 insect dataset was verified on the public insect dataset IP102. The results showed that the accuracy of SSCM model was 18.94, 3.02 and 3.36 percentages higher than that of DRC, BBN and RIDE, respectively.

    参考文献
    相似文献
    引证文献
引用本文

蔡润基,江方湧,郑涛涛,刘东霖,徐初东.深度模型融合数据合成机制的长尾目标识别[J].华中农业大学学报,2023,42(3):271-280

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2022-09-02
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2023-06-20
  • 出版日期: