Treffer: Clustering Algorithm for Zero-Inflated Data
Title:
Clustering Algorithm for Zero-Inflated Data
Authors:
Publication Year:
2020
Collection:
Columbia University: Academic Commons
Subject Terms:
Document Type:
Dissertation
thesis
Language:
English
DOI:
10.7916/d8-kze1-fr94
Availability:
Accession Number:
edsbas.256077EA
Database:
BASE
Weitere Informationen
Zero-inflated data are common in biomedical research. In cluster analysis, the heuristic approach fails to provide inferential properties to the outcome while the existing model-based approach only works in the case of a mixture of multivariate normal. In this dissertation, I developed two new model-based clustering algorithms- the multivariate zero-inflated log-normal and the multivariate zero-inflated Poisson clustering algorithms. I then applied these methods to the questionnaire data and compare the resulting clusters to the ones derived from assuming multivariate normal distribution. Associations between clustering results and clinical outcomes were also investigated.