Assessing non-mcl cluster quality with mcl tools #29

apelin20 · 2024-03-21T11:21:23Z


Essentially, I want to compare various .gmt files, which are gene annotations in which every pathway is something like a cluster with a defined set of genes in it. The .gmt format is one step away from --abc. The problem is that many annotations contain highly redundant pathways and can at times produce the strangest enrichments. I am curious to try the three metrics you developed in clm (efficiency, mass fraction and area fraction) to assess the quality of various .gmt files as well as to compare them to each other.
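To make the redundancy concern concrete, here is a minimal Python sketch that parses GMT text and lists pairs of pathways with a high Jaccard overlap. It assumes the usual tab-separated GMT layout (set name, description, then gene identifiers). The function names, the toy data and the 0.5 threshold are hypothetical illustrations and are not part of mcl or clm.

```python
from itertools import combinations


def parse_gmt(text):
    """Parse GMT text into {set_name: set_of_genes}.

    Assumes each line is tab-separated: name, description, gene1, gene2, ...
    """
    gene_sets = {}
    for line in text.splitlines():
        fields = line.split("\t")
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        gene_sets[fields[0]] = {gene for gene in fields[2:] if gene}
    return gene_sets


def jaccard(a, b):
    """Jaccard index: size of the intersection divided by size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def redundant_pairs(gene_sets, threshold=0.5):
    """Return (name1, name2, jaccard) for every pair at or above the threshold."""
    pairs = []
    for n1, n2 in combinations(sorted(gene_sets), 2):
        score = jaccard(gene_sets[n1], gene_sets[n2])
        if score >= threshold:
            pairs.append((n1, n2, score))
    return pairs


# Toy data (hypothetical pathway and gene names).
toy_gmt = (
    "PATH_A\tna\tG1\tG2\tG3\n"
    "PATH_B\tna\tG2\tG3\tG4\n"
    "PATH_C\tna\tG5\tG6\n"
)

for n1, n2, score in redundant_pairs(parse_gmt(toy_gmt)):
    print(n1, n2, score)
```

The pairwise comparison is quadratic in the number of pathways, which may be slow for very large annotation collections.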

Answered by micans


micans · 2024-03-21T15:49:12Z

Maintainer

These metrics are really only applicable to clusterings (a partitioning of a dataset into disjoint sets) rather than to a single cluster considered by itself. Those annotations are intrinsically overlapping and have different granularity depending on the scale of pathway considered. The metrics you mention are very basic, and I don't think the notion of, say, giving more significance to the highest value is a good idea. Efficiency tends to reward smaller, more granular clusterings; modularity, for example, does the opposite (see #20).
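As a rough illustration of the partition requirement, the following Python sketch checks whether a collection of gene sets is disjoint, that is, whether it could be treated as a clustering at all. The function name and the toy gene sets are hypothetical and not part of mcl or clm. The sketch takes the union of all sets as the dataset, so only disjointness is tested, not coverage of some larger gene universe.

```python
from collections import Counter


def partition_report(gene_sets):
    """Report whether the given sets are pairwise disjoint.

    gene_sets maps a set name to a set of gene identifiers.
    """
    # Count in how many sets each gene occurs.
    membership = Counter(g for genes in gene_sets.values() for g in genes)
    shared = sorted(g for g, n in membership.items() if n > 1)
    return {
        "n_sets": len(gene_sets),
        "n_genes": len(membership),
        "shared_genes": shared,
        "is_partition": not shared,  # disjoint only if no gene is shared
    }


# Hypothetical examples: overlapping pathways versus a proper clustering.
overlapping = {"PATH_A": {"G1", "G2"}, "PATH_B": {"G2", "G3"}}
disjoint = {"C1": {"G1", "G2"}, "C2": {"G3"}}

print(partition_report(overlapping))
print(partition_report(disjoint))
```

Pathway annotations will typically fail this check, which is the reason the metrics do not carry over directly.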
