Τεχνικές εξόρυξης σύνθετων τύπων δεδομένων

During the last years, the research field of data mining has presented significant advances. The developments in the fields of automatic data collection, very large databases, and data warehouses from heterogeneous data sources, resulted to very large volumes of data. The analysis of such volumes is not feasible without the aid of the efficient and semi-automatic methods of data mining. Recently, there has been developed new databases for more complex forms of data, compared to relational ones. E.g., customer-transaction, object oriented, spatial/temporal, and sequence databases, or various collections of Web data. The main characteristics of the aforementioned databases are: (i) the form of their data, which differs significantly from that of relational data, and (ii) their large size, both due to their complex type and their large volumes. Therefore, there emerges the need for new data mining techniques for this kind of databases, which comprises the motivation of the present dissertation. The contribution of the dissertation focuses on the following subjects. In Chapter 2 we examine the problem of mining patterns from models that have a graph-structure representation (for instance, web-logs). In such models, users navigate via the links of the graph. We present three algorithms, one of which is level-wise, and the two others that are non-level-wise. Moreover, we examine the fact that random accesses (noise) can be interleaved with patterns. The definition of the mined pattern is extended to take this fact into account. The performance of the algorithms and their sensitivity with respect to several parameters are examined experimentally. In Chapter 3 we propose a new technique for similarity searching queries in transactions databases, which find important applications in cases like recommendation systems. We develop a new representation method, for which we prove that it produces correct results. We also propose new algorithms for processing similarity queries. Extended experimental results indicate the superiority of the proposed method. In Chapter 4 we focus on the development of methods for the storage and searching large collections of sequential patterns, an operation that is useful in post-processing data mining results. We describe a family of algorithms that takes into account the ordering of elements within sequential patterns. More-over, we consider the fact that the distribution of elements within sequences is skewed, to propose a new algorithm for approximating the encoding of sequences. Experimental results examine all the proposed algorithms. In Chapter 5 we describe the C2P spatial clustering algorithm. C2P exploits spatial access methods and closest-pair queries. We present extensions for clustering very large spatial databases with noise and clusters of various shapes. Due its characteristics, C2P combines the advantages of existing algorithms without presenting their deficiencies. Its performance is examined with experimental results, which illustrate its good performance with respect to clustering quality and execution time. In Chapter 6 we examine density-biased sampling techniques. This kind of sampling addresses the deficiencies of uniform sampling in cases of spatial databases that contain samples with skewed sizes. It is useful in the pre-processing step of data mining. We develop a new method that exploits spatial indexes and the density information that is preserved within them. The proposed method attains improved sampling quality and reduced execution times. Experimental results indicate its superiority. Finally, Chapter 7 concludes this dissertation, and gives extensions and directions of future work.

περισσότερα

Διαβάστε τη διατριβή (Online)

Κατεβάστε τη διατριβή σε μορφή PDF (4.88 MB) (Η υπηρεσία είναι διαθέσιμη μετά από δωρεάν εγγραφή)

Όλα τα τεκμήρια στο ΕΑΔΔ προστατεύονται από πνευματικά δικαιώματα.

DOI	10.12681/eadd/14777
Διεύθυνση Handle	http://hdl.handle.net/10442/hedi/14777
ND	14777
Συγγραφέας	Νανόπουλος, Αλέξανδρος (Πατρώνυμο: Δημήτριος)
Ημερομηνία	2002
Ίδρυμα	Αριστοτέλειο Πανεπιστήμιο Θεσσαλονίκης (ΑΠΘ). Σχολή Θετικών Επιστημών. Τμήμα Πληροφορικής
Εξεταστική επιτροπή	Μανωλόπουλος Ιωάννης Λάζος Κωνσταντίνος Σελλής Τίμος Μπλέρης Γεώργιος Βλαχάβας Ιωάννης Βακάλη Αθηνά Θεοδωρίδης Ιωάννης
Επιστημονικό πεδίο	Φυσικές Επιστήμες Επιστήμη Ηλεκτρονικών Υπολογιστών και Πληροφορική
Λέξεις-κλειδιά	Εξόρυξη δεδομένων; Σύνθετοι τύποι δεδομένων; Βάσεις δεδομένων; Εξόρυξη ιστού; Εξόρυξη χωρικών δεδομένων
Χώρα	Ελλάδα
Γλώσσα	Ελληνικά
Άλλα στοιχεία	208 σ., εικ.

Στατιστικά χρήσης

ΠΡΟΒΟΛΕΣ

Αφορά στις μοναδικές επισκέψεις της διδακτορικής διατριβής για την χρονική περίοδο 07/2018 - 07/2023.
Πηγή: Google Analytics.

ΞΕΦΥΛΛΙΣΜΑΤΑ

Αφορά στο άνοιγμα του online αναγνώστη για την χρονική περίοδο 07/2018 - 07/2023.
Πηγή: Google Analytics.

ΜΕΤΑΦΟΡΤΩΣΕΙΣ

Αφορά στο σύνολο των μεταφορτώσων του αρχείου της διδακτορικής διατριβής.
Πηγή: Εθνικό Αρχείο Διδακτορικών Διατριβών.

ΧΡΗΣΤΕΣ

Αφορά στους συνδεδεμένους στο σύστημα χρήστες οι οποίοι έχουν αλληλεπιδράσει με τη διδακτορική διατριβή. Ως επί το πλείστον, αφορά τις μεταφορτώσεις.
Πηγή: Εθνικό Αρχείο Διδακτορικών Διατριβών.

Σχετικές εγγραφές (με βάση τις επισκέψεις των χρηστών)

Κουβουκλιώτικα: ένα μικρασιατικό γλωσσικό ιδίωμα

Η αφαίρεση στη νεώτερη ελληνική τέχνη

Ανάκτηση πληροφοριών και εξόρυξη δεδομένων για εξατομίκευση υπηρεσιών παγκόσμιου ιστού

Πολιτισμός και τοπική ανάπτυξη: ο ρόλος των πολιτιστικών και τουριστικών περιοχών στη σύγχρονη πόλη

Οι πηγές της ζωγραφικής αφαίρεσης στην Ελλάδα

Μέθοδοι εξόρυξης και επεξεργασίας ερωτημάτων σε ροές δεδομένων

Ανακάλυψη γνώσης από ακολουθίες και δεδομένα συναλλαγών

Νέες μέθοδοι εκπαίδευσης τεχνητών νευρωνικών δικτύων, βελτιστοποίησης και εφαρμογές

Μέθοδοι μηχανικής μάθησης για αυτόματη ταξινόμηση κειμένων

Ο Πατριάρχης Αλεξανδρείας Θεόφιλος Β' Παγκώστας ο Πάτμιος (1805-1825)

"Τεχνικές εξόρυξης σύνθετων τύπων δεδομένων"
	Πληκτρολογήστε το κείμενο της εικόνας!
Δηλώνω ότι έλαβα γνώση και ανεπιφύλακτα συμφωνώ και αποδέχομαι τους Όρους Χρήσης του Εθνικού Αρχείου Διδακτορικών Διατριβών, καθώς και της .