Wednesday, June 14, 2017

Abstract-Identification of genetically modified cotton seeds by terahertz spectroscopy with MPGA-SVM

  • a School of Mechano-Electronic Engineering, Xidian University, Xi’an, Shanxi 710071, China
  • b School of Electronics and Communication Engineering, Yulin Normal University, Yulin, Guangxi 537000, China
  • c Guangxi Key Laboratory of Automatic Detecting Technology and Instruments, School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China
  • d SZU-NUS Collaborative Innovation Center for Optoelectronic Science & Technology, Key Laboratory of Optoelectronic Devices and Systems of Ministry of Education and Guangdong Province, College of Optoelectronic Engineering, Shenzhen University, Shenzhen 518060, China

The purpose of this paper was to propose a method that combines support vector machine (SVM) and multi-population genetic algorithm (MPGA) for identifying Genetically modified (GM) cotton seeds with THz spectroscopy. The parameters of SVM were tuned by using MPGA. Comparing previous reports of THz-TDS for GM crops detection, we used a larger sample size and more evaluation criterions (i.e. confusion matrix, average accuracy and 95th percentile of accuracy) in the experiment. Principal component analysis (PCA) was utilized to reduce dimensions of THz absorbance spectra, and then used the result of PCA as the input of different classifiers. When the input dimensionality of MPGA-SVM was 12, recall, precision and F-score were more than 97.9%, 96% and 0.9796, respectively, and accuracy was 99%. To further study the performance of MPGA-SVM with different input dimensionality, different number of principal components (ranged from 2 to 16 at intervals of 2) was selected. In addition, the proposed method was compared with traditional classifiers (decision trees (DT), k-nearest neighbor (KNN) and discriminant analysis (DA)). All the results showed that MPGA-SVM has better performance than other classifier. Thus, MPGA-SVM combined with THz spectroscopy is a potential identification tool for GM cotton seeds detection.

No comments: