tutorials/10_Group_analysis/component_clustering_tools.md (+12 -2)
@@ -286,7 +286,7 @@ You may call the [pop_clust.m](http://sccn.ucsd.edu/eeglab/locatefile.php?file=p
-Several algorithms are available: *kmeans*, *neural network*, and *affinity* clustering.
+Several algorithms are available: *kmeans*, *optimal kmeans*, *neural network*, and *affinity* clustering.

*Kmeans* requires the MATLAB Statistics Toolbox, while *neural network* clustering uses a function from the MATLAB Neural Network Toolbox. A version of *kmeans* that does not require the MATLAB Statistics Toolbox is also available. *Affinity* clustering does not require any toolbox. We recommend starting with *affinity* clustering, which does not require specifying the number of clusters, and then trying the *kmeans* algorithm if the results are not satisfactory.
@@ -303,10 +303,20 @@ defined as components further than a specified number of standard
deviations (3, by default) from any of the cluster centroids. To turn
on this option, click the upper checkbox on the left. Identified
outlier components will be placed into a designated *Outliers* cluster
-(Cluster 2).
+(Cluster 2).

Press *Ok*. The cluster editing interface detailed in one of the following sections will automatically pop up.

+Optimal Kmeans clustering
+-----------------
+We have recently added the **Optimal Kmeans** algorithm to the `pop_clust` function. This feature finds the optimal number of clusters for your data. It requires the [MATLAB Statistics and Machine Learning Toolbox](https://www.mathworks.com/products/statistics.html).
+
+To use this feature, select the **Optimal Kmeans** option from the **Clustering algorithm** dropdown menu, then enter the range of cluster numbers to test (in the screenshot below, the minimum is set to 10 and the maximum to 30). The algorithm clusters the data once for each number of clusters in the range and keeps the number that maximizes the **silhouette** score, a measure of how similar an object is to its own cluster compared to other clusters. Read more about the **silhouette** score in the [MATLAB documentation](https://www.mathworks.com/help/stats/clustering.evaluation.silhouetteevaluation.html).
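
To make the selection rule concrete, here is a minimal Python sketch of silhouette-based selection over a range of cluster counts. It is not the `pop_clust` implementation and does not call MATLAB or EEGLAB. The function names (`farthest_point_init`, `kmeans`, `silhouette_score`, `choose_k`), the deterministic farthest-point initialization, the Euclidean distance, and the toy 2-D data are all assumptions made for illustration.

```python
import math


def farthest_point_init(points, k):
    """Deterministic seeding (an assumption): start at the first point, then
    repeatedly add the point farthest from the centroids chosen so far."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        )
    return centroids


def kmeans(points, k, max_iter=100):
    """Plain Lloyd iterations; returns one cluster label per point."""
    centroids = farthest_point_init(points, k)
    labels = None
    for _ in range(max_iter):
        # Assign each point to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Move each centroid to the mean of its members (unchanged if empty).
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(x) / len(members) for x in zip(*members))
    return labels


def silhouette_score(points, labels):
    """Mean silhouette value; points in singleton clusters contribute 0."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        return float("-inf")  # undefined for one cluster, so never selected
    total = 0.0
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            continue
        # a: mean distance to the other members of the point's own cluster.
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(math.dist(p, q) for q in other) / len(other)
            for other_lab, other in clusters.items()
            if other_lab != lab
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)


def choose_k(points, k_min, k_max):
    """Cluster once per k in [k_min, k_max]; keep the k with the best score."""
    scores = {
        k: silhouette_score(points, kmeans(points, k))
        for k in range(k_min, k_max + 1)
    }
    return max(scores, key=scores.get), scores


# Toy data: three well-separated groups of four 2-D points each.
points = [
    (0, 0), (1, 0), (0, 1), (1, 1),
    (10, 0), (11, 0), (10, 1), (11, 1),
    (0, 10), (1, 10), (0, 11), (1, 11),
]
best_k, scores = choose_k(points, 2, 5)
print(best_k, scores)
```

On this toy data the three-cluster solution scores highest because every point is much closer to its own group than to either of the other two. Real component measures are higher-dimensional and noisier, so the maximum is usually less pronounced.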
+
+**Recommended number of clusters:** Following the rationale for the estimated number of clusters above, we recommend setting the lower bound of the cluster range to half the average number of components per subject and the upper bound to 1.5 times that average. For example, with 20 components per subject, set the lower bound to 10 and the upper bound to 30. If the returned number of clusters is at its lower or upper bound, consider expanding the range. We also strongly recommend using the option to separate outliers.
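
The arithmetic behind this recommendation can be sketched in a few lines of Python. The helper names (`recommended_cluster_range`, `at_bound`) are hypothetical, and rounding the lower bound down, rounding the upper bound up, and flooring the lower bound at 2 are assumptions that the text does not specify.

```python
import math


def recommended_cluster_range(components_per_subject):
    """Return (lower, upper) bounds for the cluster-count search range.

    lower = 0.5 x mean components per subject, rounded down (assumption)
    upper = 1.5 x mean components per subject, rounded up (assumption)
    """
    mean_n = sum(components_per_subject) / len(components_per_subject)
    # At least 2 clusters (assumption): the silhouette score needs two or more.
    lower = max(2, math.floor(0.5 * mean_n))
    upper = math.ceil(1.5 * mean_n)
    return lower, upper


def at_bound(best_k, lower, upper):
    """True when the selected cluster count sits on an edge of the range,
    which suggests widening the range and running the search again."""
    return best_k in (lower, upper)


# Three subjects with 18, 22, and 20 components: the mean is 20.
lower, upper = recommended_cluster_range([18, 22, 20])
print(lower, upper)
print(at_bound(10, lower, upper), at_bound(17, lower, upper))
```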
+
Other clustering methods
-----------------
The main method to cluster components in EEGLAB is the *PCA clustering method* described in this tutorial. Other methods are the *Measure Projection method* and the *Scalp Correlation method* available in the EEGLAB plugins described below.