The colon dataset of Spellman et. al. (1998)

 

Figure 1: Relative and cumulative variance of the yeast dataset

1.      OKM

Figure 2: BIC values when applying OKM (SVD reduced to 4 dimensions) on the yeast dataset

Figure 3: (zoom in) BIC Values when applying OKM (SVD reduced to 4 dimensions) on the yeast dataset

Figure 4: Comparison of the internal (BIC) and external (Jaccard) criteria of the yeast dataset (OKM)

Figure 5: (zoom in) Comparison of the internal (BIC) and external (Jaccard) criteria of the yeast dataset (OKM)


2.      OQM

Figure 6: Comparison of the internal (BIC) and external (Jaccard) criteria of the yeast dataset (OQC)

 

 

Method

Jaccard

Purity

Efficiency

Raw data

 

 

 

 

K Means (5 clusters)

0.435

0.617

0.596

 

K Means (4 clusters)

0.488

0.64

0.673

 

Fuzzy C Means (5 clusters)

0.425

0.663

0.542

 

Fuzzy C Means (4 clusters)

0.438

0.458

0.912

 

Competitive Neural Network (4 clusters)

0.424

0.53

0.68

 

Quantum Clustering

NA

NA

NA

Preprocessing

 

 

 

 

K means (5 clusters)

0.406

0.636

0.528

 

K means (4 clusters)

0.46

0.626

0.634

 

Fuzzy C means (5 clusters)

0.4

0.63

0.522

 

Fuzzy C means (4 clusters)

0.459

0.624

0.634

 

Competitive Neural Network (5 clusters)

0.33

0.55

0.458

 

Competitive Neural Network (4 clusters)

0.516

0.658

0.706

 

QC after SVD – 2dims (σ =0.595)

0.554

0.664

0.77

 

QC after SVD – 4 dims (σ =0.5)

0.5

XXX

XXX

Table 1: Comparison for algorithms’ performance the for the yeast dataset


 

 

 

 

Text Box: Figure 7: Jaccard scores of four algorithms tested by on the Golub dataset. Left: before compression, Right: following application of the SVD compression step. Note that an improvement is detected for all methods by a preprocessing step