Loading...
Prediction of gas chromatographic retention indices of a diverse set of toxicologically relevant compounds
Garkani Nejad, Z ; Sharif University of Technology | 2004
200
Viewed
- Type of Document: Article
- DOI: 10.1016/j.chroma.2003.12.003
- Publisher: Elsevier , 2004
- Abstract:
- For a set of 846 organic compounds, relevant in forensic analytical chemistry, with highly diverse chemical structures, the gas chromatographic Kovats retention indices have been quantitatively modeled by using a large set of molecular descriptors generated by software Dragon. Best and very similar performances for prediction have been obtained by a partial least squares regression (PLS) model using all considered 529 descriptors, and a multiple linear regression (MLR) model using only 15 descriptors obtained by a stepwise feature selection. The standard deviations of the prediction errors (SEP), were estimated in four experiments with differently distributed training and prediction sets. For the best models SEP is about 80 retention index units, corresponding to 2.1-7.2% of the covered retention index interval of 1110-3870. The molecular properties known to be relevant for GC retention data, such as molecular size, branching and polar functional groups are well covered by the selected 15 descriptors. The developed models support the identification of substances in forensic analytical work by GC-MS in cases the retention data for candidate structures are not available. © 2004 Elsevier B.V. All rights reserved
- Keywords:
- Feature selection ; Forensic analysis ; Mathematical modelling ; Molecular descriptors ; Regression models ; Retention indices ; Retention prediction
- Source: Journal of Chromatography A ; Volume 1028, Issue 2 , 2004 , Pages 287-295 ; 00219673 (ISSN)
- URL: https://linkinghub.elsevier.com/retrieve/pii/S0021967303022556
