Algorithms and techniques for efficient and effective nearest neighbours classification

Abstract

Although the k-NN classifier is considered to be an effective classification algorithm, it has some major weaknesses that may render its use inappropriate for some application domains and / or datasets. The first one is the high computational cost involved (all distances between each unclassified item and all training data must be computed). Although nowadays systems are equipped with powerful processors, in cases of large datasets, this drawback renders the classification a time-consuming and in some cases a prohibitive procedure. Another weakness is the high storage requirements for maintaining the training data. Eager classifiers (e.g., decision tress, neural networks) can discard the training data after the construction of the classification model in order to save space. In contrast, the k-NN classifier must have all the training data always available. Moreover, the classification accuracy achieved by the classifier depends on the quality of the available training data. Noisy and ...
show more

All items in National Archive of Phd theses are protected by copyright.

DOI
10.12681/eadd/34608
Handle URL
http://hdl.handle.net/10442/hedi/34608
ND
34608
Alternative title
Αλγόριθμοι και τεχνικές για αποδοτική και αποτελεσματική κατηγοριοποίηση εγγυτέρων γειτόνων
Author
Ougiaroglou, Stefanos (Father's name: Anestis)
Date
2014
Committee members
Ευαγγελίδης Γεώργιος
Δέρβος Δημήτριος
Aldama Montes Jose Francisco
Μαργαρίτης Κωνσταντίνος
Σαμαράς Νικόλαος
Κολωνιάρη Γεωργία
Παπαδόπουλος Απόστολος
Discipline
Natural SciencesComputer and Information Sciences
Keywords
Nearest neighbours; Classification; Clustering; Data reduction / Condensing; Prototype selection and abstraction; Data streams / Dynamic environments; Time-series; Editing (noise removal)
Country
Greece
Language
English
Description
247 σ., tbls., fig., ch.
Rights and terms of use
Το έργο παρέχεται υπό τους όρους της δημόσιας άδειας του νομικού προσώπου Creative Commons Corporation:
Usage statistics
VIEWS
Concern the unique Ph.D. Thesis' views for the period 07/2018 - 07/2023.
Source: Google Analytics.
ONLINE READER
Concern the online reader's opening for the period 07/2018 - 07/2023.
Source: Google Analytics.
DOWNLOADS
Concern all downloads of this Ph.D. Thesis' digital file.
Source: National Archive of Ph.D. Theses.
USERS
Concern all registered users of National Archive of Ph.D. Theses who have interacted with this Ph.D. Thesis. Mostly, it concerns downloads.
Source: National Archive of Ph.D. Theses.
Related items (based on users' visits)