Issue |
MATEC Web Conf.
Volume 355, 2022
2021 International Conference on Physics, Computing and Mathematical (ICPCM2021)
|
|
---|---|---|
Article Number | 03008 | |
Number of page(s) | 14 | |
Section | Computing Methods and Computer Application | |
DOI | https://doi.org/10.1051/matecconf/202235503008 | |
Published online | 12 January 2022 |
Research on dairy products detection based on machine learning algorithm
1 Hebei University of Technology, School of artificial intelligence, Tianjin 300130, China
2 Hebei University of Technology, School of artificial intelligence, Tianjin 300130, China
3 Hebei Province livestock breeding workstation, Shi jia zhuang, 050061, China
4 Beijing Goke Agriculture Machinary Co. Ltd, Beijing, 100000, China
5 Lekang Muyi Hebei Technology Co. Ltd, Shi jia zhuang, China
6 Hebei tianniu Agricultural Technology Development Co.Ltd, Shi jia zhuang, China
* Corresponding author: zhanglei@hebut.edu.cn
In this study, an electronic nose model composed of seven kinds of metal oxide semiconductor sensors was developed to distinguish the milk source (the dairy farm to which milk belongs), estimate the content of milk fat and protein in milk, to identify the authenticity and evaluate the quality of milk. The developed electronic nose is a low-cost and non-destructive testing equipment. (1) For the identification of milk sources, this paper uses the method of combining the electronic nose odor characteristics of milk and the component characteristics to distinguish different milk sources, and uses Principal Component Analysis (PCA) and Linear Discriminant Analysis , LDA) for dimensionality reduction analysis, and finally use three machine learning algorithms such as Logistic Regression (LR), Support Vector Machine (SVM) and Random Forest (RF) to build a milk source (cow farm) Identify the model and evaluate and compare the classification effects. The experimental results prove that the classification effect of the SVM-LDA model based on the electronic nose odor characteristics is better than other single feature models, and the accuracy of the test set reaches 91.5%. The RF-LDA and SVM-LDA models based on the fusion feature of the two have the best effect Set accuracy rate is as high as 96%. (2) The three algorithms, Gradient Boosting Decision Tree (GBDT), Extreme Gradient Boosting (XGBoost) and Random Forest (RF), are used to construct the electronic nose odor data for milk fat rate and protein rate. The method of estimating the model, the results show that the RF model has the best estimation performance( R2 =0.9399 for milk fat; R2=0.9301for milk protein). And it prove that the method proposed in this study can improve the estimation accuracy of milk fat and protein, which provides a technical basis for predicting the quality of dairy products.
Key words: Electronic nose(E-nose) / Milk / Equivalent detection / Cattle identification
© The Authors, published by EDP Sciences, 2022
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.