Algorithms and techniques for efficient and effective nearest neighbours classification

doi:10.12681/eadd/34608

Abstract

Although the k-NN classifier is considered an effective classification algorithm, it has some major weaknesses that may render it inappropriate for some application domains and/or datasets. The first is its high computational cost: the distances between each unclassified item and all training data must be computed. Although today's systems are equipped with powerful processors, for large datasets this drawback makes classification time-consuming and, in some cases, prohibitive. Another weakness is the high storage requirement for maintaining the training data. Eager classifiers (e.g., decision trees, neural networks) can discard the training data after constructing the classification model in order to save space. In contrast, the k-NN classifier must keep all the training data available at all times. Moreover, the classification accuracy achieved by the classifier depends on the quality of the available training data. Noisy and mislabelled data, as well as outliers and overlaps between data regions of different classes, may mislead the algorithm and reduce classification accuracy. These weaknesses constitute an active research problem. The dissertation is motivated by them and contributes novel algorithms and techniques that deal with them effectively. In other words, it proposes algorithms and techniques for efficient and effective k-NN classification.
The contributions fall into three main categories: (i) new data reduction techniques that deal with all the weak points of the classifier and avoid the limitations and disadvantages of existing data reduction techniques, (ii) novel hybrid algorithms that combine different types of speed-up techniques and can effectively reduce the computational cost of the classifier, and (iii) improvements to, and experiments with, existing algorithms. The proposed algorithms, techniques and improvements are evaluated on several datasets and experimentally compared to state-of-the-art methods. The experimental measurements are validated by statistical tests of significance. The results illustrate that the proposed methods satisfy the goals for which they were developed and lead to improved classification in terms of accuracy, preprocessing cost and computational cost.
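
To make the cost and storage weaknesses described in the abstract concrete, the following is a minimal sketch of a plain brute-force k-NN classifier. It is not one of the algorithms proposed in the dissertation. The function name `knn_classify`, the toy training set and the choice of Euclidean distance are assumptions made purely for illustration.

```python
import math
from collections import Counter


def knn_classify(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training items.

    `train` is a list of (feature_vector, label) pairs (hypothetical layout).
    Every call computes one distance per training item, which is the
    computational cost the abstract refers to, and the whole training set
    must stay in memory, which is the storage cost.
    """
    # One Euclidean distance per training item: O(n) distance computations.
    distances = [(math.dist(features, query), label) for features, label in train]
    # Keep the k closest items and take a majority vote over their labels.
    distances.sort(key=lambda pair: pair[0])
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]


# Toy data, invented for this example only.
train = [
    ((0.0, 0.0), "a"),
    ((0.1, 0.2), "a"),
    ((0.2, 0.1), "a"),
    ((1.0, 1.0), "b"),
    ((0.9, 1.1), "b"),
]

print(knn_classify(train, (0.15, 0.1)))
```

Under these assumptions, a data reduction technique would shrink `train` before classification, which lowers both the number of distance computations per query and the amount of data that has to be stored.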

All items in the National Archive of PhD Theses are protected by copyright.

DOI	10.12681/eadd/34608
Handle URL	http://hdl.handle.net/10442/hedi/34608
ND	34608
Alternative title	Αλγόριθμοι και τεχνικές για αποδοτική και αποτελεσματική κατηγοριοποίηση εγγυτέρων γειτόνων
Author	Ougiaroglou, Stefanos (Father's name: Anestis)
Date	2014
Committee members	Ευαγγελίδης Γεώργιος; Δέρβος Δημήτριος; Aldama Montes Jose Francisco; Μαργαρίτης Κωνσταντίνος; Σαμαράς Νικόλαος; Κολωνιάρη Γεωργία; Παπαδόπουλος Απόστολος
Discipline	Natural Sciences ➨ Computer and Information Sciences
Keywords	Nearest neighbours; Classification; Clustering; Data reduction / Condensing; Prototype selection and abstraction; Data streams / Dynamic environments; Time-series; Editing (noise removal)
Country	Greece
Language	English
Description	247 pp., tbls., fig., ch.
Rights and terms of use	The work is provided under the terms of the public licence of the legal entity Creative Commons Corporation: Attribution 3.0 (CC-BY)
