Seleção de variáveis para categorização de amostras químicas
Main Article Content
Abstract
This paper presents a method to select the best variables to categorize chemical samples into two classes, say conforming or non-conforming. For that matter, PLS regression is combined with a data mining tool, the k-Nearest Neighbor classification technique, through an iterative variable selection process. The recommended subset of variables is chosen based on several criteria: sensitivity, specificity and percent of retained variables. When applied to two datasets related to wine analysis and one associated to QSAR, the proposed method significantly reduced the number of variables required for classification, while yielding superior categorization performance when compared to using all original variables.
Metrics
Article Details
This work is licensed under a Creative Commons Attribution 4.0 International License.
The corresponding author transfers the copyright of the submitted manuscript and all its versions to Eclet. Quim., after having the consent of all authors, which ceases if the manuscript is rejected or withdrawn during the review process.
When a published manuscript in EQJ is also published in other journal, it will be immediately withdrawn from EQ and the authors informed of the Editor decision.
Self-archive to institutional, thematic repositories or personal webpage is permitted just after publication. The articles published by Eclet. Quim. are licensed under the Creative Commons Attribution 4.0 International License.