基于机器学习方法的母猪高低产分类模型研究
DOI:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家自然科学基金项目(31572375);分子育种与繁殖新技术研发与推广合作协议(70711818605)


Research on sow high and low yield classification model based on machine learning method
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为帮助猪场管理者更好地对母猪进行繁殖管理、预测母猪的高低产、及时淘汰低产母猪,收集和整理包含出生场地、分娩栏位、品种和不同胎次、初生窝重信息的3个母猪群体的生产数据集,制定母猪高低产的分类标准,使用R软件中的Boruta包筛选出影响母猪高低产的重要特征,使用4种不同的机器学习方法——逻辑回归(logistic regression,LOG)、决策树(decision tree,DT)、随机森林(random forest,RF)和支持向量机(support vector machine,SVM)构建母猪高低产的分类模型,并进行决策树视图分析探究影响母猪最高产的相关因素。结果显示:4种机器学习方法构建母猪高产分类模型的分类准确率均在71%左右,最高可达84%,并且发现SVM作为最佳建模方法在所有数据集和不同分类标准下出现的频率最高,其次是LOG和DT。决策树视图显示出生场地、品种和初生窝重是划分最高产母猪的重要叶节点,利用这些特征预测最高产母猪准确率可达73%~82%。以上结果表明在未来的养猪生产中,利用机器学习方法实现母猪高低产的早期预测将会是一个不错的选择。

    Abstract:

    In order to help the pig farm managers better carry out the reproductive management of sows, predict the high and low yield sows, and timely eliminate the low yield sows, in this study, we collected and sorted out the dataset of three sow populations, including birth herd, farrow herd, breed and birth weight of different parities, formulated the classification standard of sow high and low yield, and used boruta package in R software to screen out the important characteristics affecting high and low yield of sows. Four different machine learning methods, logistic regression (LOG), decision tree (DT), random forest were (RF) and support vector machine (SVM) were used to construct the classification model of high and low yield sows, and the decision tree view analysis was carried out to explore the related factors affecting the highest yield of sows. The results showed that the classification accuracy of the four machine learning methods for sow high yield classification model was about 71%, and the highest was 84%. It was also found that SVM as the best modeling method appears most frequently across all data sets and different classification criteria, followed by LOG and DT. The decision tree view showed that birth herd, breed and birth weight of different parties were important leaf nodes for dividing the highest yield sows, and these characteristics can be used to predict the most productive sows, with an accuracy of 73%-82%. These results indicated that it will be a good choice to use machine learning method to predict the high and low yields of sows in the future.

    参考文献
    相似文献
    引证文献
引用本文

李喜阳,李信颉,赵志超,李长春,刘向东.基于机器学习方法的母猪高低产分类模型研究[J].华中农业大学学报,2021,40(3):221-229

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2020-09-11
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2021-06-07
  • 出版日期: