Non-hierarchical clustering methods on factorial subspaces

Cancella i cookie per la scelta della lingua

Ricerca

Ricerca semplice

Ricerca avanzata

Ultime accessioni

Scorri per

Autore

Settori scientifico-disciplinari

Anno

Tipologia prodotto

Accessibilità del full-text

Informazioni

Policy

Informazioni su fedOA

FAQ

Contatti

Tortora, Cristina (2011) Non-hierarchical clustering methods on factorial subspaces. [Tesi di dottorato] (Inedito)

Anteprima

PDF
tortora_cristina.pdf
Download (1MB) | Anteprima

Tipologia del documento:	Tesi di dottorato
Lingua:	English
Titolo:	Non-hierarchical clustering methods on factorial subspaces
Autori:	Autore Email Tortora, Cristina cristina.tortora@unina.it
Data:	1 Dicembre 2011
Numero di pagine:	181
Istituzione:	Università degli Studi di Napoli Federico II
Dipartimento:	Matematica e statistica
Scuola di dottorato:	Scienze economiche e statistiche
Dottorato:	Statistica
Ciclo di dottorato:	24
Coordinatore del Corso di dottorato:	nome email Lauro, Carlo Natale clauro@unina.it
Tutor:	nome email Marino, Marina mari@unina.it Palumbo, Francesco fpalumbo@unina.it
Data:	1 Dicembre 2011
Numero di pagine:	181
Parole chiave:	Non-hierarchical clustering, Factorial clustering
Settori scientifico-disciplinari del MIUR:	Area 01 - Scienze matematiche e informatiche > INF/01 - Informatica Area 13 - Scienze economiche e statistiche > SECS-S/01 - Statistica Area 13 - Scienze economiche e statistiche > SECS-S/02 - Statistica per la ricerca sperimentale e tecnologica
Depositato il:	15 Dic 2011 18:13
Ultima modifica:	05 Dic 2014 14:36
URI:	http://www.fedoa.unina.it/id/eprint/9002
DOI:	10.6092/UNINA/FEDOA/9002

Abstract

Cluster analysis (CA) aims at finding homogeneous group of individuals, where homogeneous is referred to individuals that present similar characteristics. Many CA techniques already exist, among the non-hierarchical ones the most known, thank to its simplicity and computational property, is k-means method. However, the method is unstable when the number of variables is large and when variables are correlated. This problem leads to the development of two-step methods, they perform a linear transformation of variable into a reduced number of uncorrelated factors and CA is applied on this factors. Two-steps methods minimize two different functions that can be in contrast between them and the first factorial step can in part obscure the clustering structure. Iterative factorial clustering methods overcome these issues; they perform a factorial step and a clustering step iteratively, optimizing a common criterion. In this thesis a new factorial clustering method is proposed: Factorial Probabilistic Distance Clustering (FPDC). It is based on Probabilistic Distance (PD) Clustering that is a non-parametric probabilistic method to find homogeneous groups, PD Clustering seeks for a set of K group centres maximising the empirical probabilities of belonging to a cluster of the n statistical units. As the number of variables tends to be large the solution tends to become unstable. FPDC consists of a two steps iterative procedure: linear transformation of the initial data using Tucker 3 decomposition and PD-clustering on the transformed data. This thesis also shows that Tucker3 decomposition is a consistent transformation to project original data in a subspace defined according to the PD-Clustering criterion. The integration of the PD Clustering and the Tucker3 factorial step makes the clustering more stable and permits to consider datasets with large number of variables and clusters having not elliptical form.

Downloads

Downloads per month over past year

Actions (login required)

Modifica documento

Università di Napoli - Centro di Ateneo per le Biblioteche

fedOA è realizzato con