Fast modified global k-means algorithm for incremental cluster construction



Yüklə 1,14 Mb.
tarix31.10.2018
ölçüsü1,14 Mb.
#76910


Fast modified global k-means algorithm for incremental cluster construction

  • Adil M.Bagirov, JulienUgon, DeanWebb PR, 2011

  • Presented by Wen-Chung Liao

  • 2011/01/05


Outlines

  • Motivation

  • Objectives

  • Methodology

  • Experiments

  • Conclusions

  • Comments



Motivation

  • The global k-means algorithm and the modified global k-means algorithm are incremental clustering algorithms.

    • allow one to find global or a near global minimizer of the cluster (or error) function.
  • However, these algorithms are memory demanding

    • they require the storage of the affinity matrix .
  • Alternatively, this matrix can be computed at each iteration, however, this extends the computational time significantly.



Objectives

  • A new version of the modified global k-means algorithm is proposed:

    • apply an auxiliary cluster function to generate a set of starting points lying in different parts of the dataset.
    • the best solution is selected as a starting point for the next cluster center.
    • information gathered in previous iterations of the incremental algorithm to avoid computing the whole affinity matrix.
    • the triangle inequality for distances is used to avoid unnecessary computations


Methodology

  • Modified global k-means algorithm [1]

    • Starts with the computation of one cluster center and attempts to optimally add one new cluster center at each iteration.
    • An auxiliary cluster function
    • The k-means algorithm is applied starting from this point to find the k-partition of the dataset.
  • Fast modified global k-means algorithm

    • auxiliary cluster function to generate a set of starting points
    • the best solution is selected
    • avoid computing the whole affinity matrix


Modified global k-means algorithm





Modified global k-means algorithm



Modified global k-means algorithm



Fast modified global k-means algorithm







Computational complexity

  • The modified global k-means algorithm

    • O(mk2T+km2+kmt)
  • The fast modified global k-means algorithm

    • O(p(mk2T+km2+kmt)) (without complexity reduction schemes)
    • O(p(mk2T+km1 2+km1t)) (with complexity reduction schemes)


Numerical experiments







Conclusions

  • Developed a new version of the modified global k-means algorithm

    • Using the k-1 cluster centers from the previous iteration to solve the k-partition problem.
    • does not rely on the affinity matrix to compute the starting point
    • use more than one starting point to minimize the auxiliary function
    • Two schemes to reduce the amount of computational effort
    • no guarantee that it will converge to the global solution.


Comments

  • Advantages

    • Schemes to avoid computational effort.
  • Shortages

    • Determine the set U is not easy.
  • Applications

    • clustering


Yüklə 1,14 Mb.

Dostları ilə paylaş:




Verilənlər bazası müəlliflik hüququ ilə müdafiə olunur ©www.genderi.org 2024
rəhbərliyinə müraciət

    Ana səhifə