nearest neighbors (KNN) are known as one of the simplest nonparametric classifiers but in high dimensional setting accuracy of KNN are affected by nuisance features. In this study, we proposed the important neighbors (KIN) as a novel approach for binary classification in high dimensional problems. To avoid the curse of dimensionality, we implemented smoothly clipped absolute deviation (SCAD) logistic regression at the initial stage and considered the importance of each feature in construction of dissimilarity measure with imposing features contribution as a function of SCAD coefficients on Euclidean distance. The nature of this hybrid dissimilarity measure, which combines information of both features and distances, enjoys all good properties of SCAD penalized regression and KNN simultaneously. In comparison to KNN, simulation studies showed that KIN has a good performance in terms of both accuracy and dimension reduction. The proposed approach was found to be capable of eliminating nearly all of the noninformative features because of utilizing oracle property of SCAD penalized regression in the construction of dissimilarity measure. In very sparse settings, KIN also outperforms support vector machine (SVM) and random forest (RF) as the best classifiers.
from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2yZGJY5
via IFTTT
Εγγραφή σε:
Σχόλια ανάρτησης (Atom)
Δημοφιλείς αναρτήσεις
-
A study using a new computer-aided detection (CAD) algorithm found breast cancers... Read more on AuntMinnieEurope.com Related Reading: ...
-
Predictors of postoperative pneumonia in patients undergoing oral cancer resections and its management p. 69 Ridhi Sood, Jerry Paul, Sunil...
-
This work presents the nonlinear dynamical analysis of a multilayer piezoelectric macrofiber composite (MFC) laminated shell. The effects of...
-
Dicyclohexyl phthalate (DCHP) is one of the phthalate plasticizers. The objective of the present study was to investigate the effects of DCH...
-
This study analyzes 24 climate extreme indices over North Thailand using observed data for daily maximum and minimum temperatures and total ...
-
This study proposes an improved metabolism grey model [IMGM] to predict small samples with a singular datum, which is a common phenomenon in...
-
<span class="paragraphSection">Much public health research seeks to identify harm and then use that knowledge as the basis f...
-
Most prescribers and patients in Ghana now opt for the relatively expensive artemether/lumefantrine rather than artesunate-amodiaquine due t...
-
Antineutrophil cytoplasmic antibody- (ANCA-) associated vasculitis (AAV) is a multisystem autoimmune disease affecting mainly microscopic bl...
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου