Received 09 Nov 2014, Accepted 20 Jan 2015
First published online 20 Jan 2015
Coupled with terahertz time-domain spectroscopy (THz-TDS) technology, the feasibility of diagnosis of cervical carcinoma using support vector machines (SVM) and partial least squares-discriminant analysis (PLS-DA) had been studied. The terahertz spectra of 52 specimens of cervix were collected. The performance of preprocessing methods of multiplicative scatter correction (MSC), Savitzky-Golay (SG) smoothing and first derivative, principal component orthogonal signal correction (PC-OSC) and emphatic orthogonal signal correction (EOSC) were investigated for PLS-DA and SVM models, respectively. The effects of the different pretreatments methods with respect to classification accuracy were compared. The PLS-DA and SVM models were validated using the bootstrapped Latin-partition method. The SVM and PLS-DA models optimized with the combination of SG first derivative and PC-OSC preprocessing had the best predictive results with classification rates of 94.0 ± 0.4% and 94.0 ± 0.5%, respectively. The proposed procedure proved that terahertz spectroscopy combined with classifiers provides a technology which has potential as a new diagnosis method for cancer tissue.