Please use this identifier to cite or link to this item:
http://arks.princeton.edu/ark:/88435/dsp010z708z75q
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor | Singer, Amit | - |
dc.contributor.advisor | Kpotufe, Samory | - |
dc.contributor.author | Jiang, Hanxi Heinrich | - |
dc.date.accessioned | 2015-06-15T15:53:53Z | - |
dc.date.available | 2015-06-15T15:53:53Z | - |
dc.date.created | 2015-05-04 | - |
dc.date.issued | 2015-06-15 | - |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/dsp010z708z75q | - |
dc.description.abstract | We present a new clustering algorithm based on first estimating the modes of the unknown density f from sample points generated by f. Our notion of mode is generalized to include points with densities similar to that of a local mode of f. Thus, we first identify sample points which can be confidently clustered together. We call these the cluster cores. The next step then consists of assigning the remaining points to cluster cores. We make mild assumptions on the underlying distribution and the shapes of the clusters. We develop mode estimation procedures based on the k-NN density estimate and use high probability finite-sample uniform bounds on the k-NN density estimator to obtain theoretically strong guarantees. The procedure can be implemented in O(nk log n) time. We show that the clustering algorithm is competitive against the state-of-art (including DBSCAN, mean-shift, and spectral clustering) based on simulations and real data experiments. | en_US |
dc.format.extent | 69 pages | en_US |
dc.language.iso | en_US | en_US |
dc.title | A practical clustering algorithm based on k-NN density mode estimation with provable learning bounds | en_US |
dc.type | Princeton University Senior Theses | - |
pu.date.classyear | 2015 | en_US |
pu.department | Mathematics | en_US |
pu.pdf.coverpage | SeniorThesisCoverPage | - |
Appears in Collections: | Mathematics, 1934-2020 |
Files in This Item:
File | Size | Format | |
---|---|---|---|
PUTheses2015-Jiang_Hanxi_Heinrich.pdf | 12.11 MB | Adobe PDF | Request a copy |
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.