Package: mosclust 1.0.2

Jessica Gliozzo

mosclust: Model Order Selection for Clustering

Stability based methods for model order selection in clustering problems (Valentini, G (2007), <doi:10.1093/bioinformatics/btl600>). Using multiple perturbations of the data the stability of clustering solutions is assessed. Different perturbations may be used: resampling techniques, random projections and noise injection. Stability measures for the estimate of clustering solutions and statistical tests to assess their significance are provided.

Authors:Giorgio Valentini [aut], Jessica Gliozzo [cre]

mosclust_1.0.2.tar.gz
mosclust_1.0.2.zip(r-4.7)mosclust_1.0.2.zip(r-4.6)mosclust_1.0.2.zip(r-4.5)
mosclust_1.0.2.tgz(r-4.6-any)mosclust_1.0.2.tgz(r-4.5-any)
mosclust_1.0.2.tar.gz(r-4.7-any)mosclust_1.0.2.tar.gz(r-4.6-any)
mosclust_1.0.2.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
card.svg |card.png
mosclust/json (API)

# Install 'mosclust' in R:

install.packages('mosclust', repos = c('https://anacletolab.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/anacletolab/mosclust/issues

On CRAN:

2.00 score 552 downloads 37 exports 3 dependencies

Last updated from:d605c3e4be. Checks:9 OK. Indexed: yes.

Target	Result	Time
linux-devel-x86_64	OK	145
source / vignettes	OK	137
linux-release-x86_64	OK	140
macos-release-arm64	OK	160
macos-oldrel-arm64	OK	191
windows-devel	OK	121
windows-release	OK	114
windows-oldrel	OK	117
wasm-release	OK	78

Exports:Bernstein.compute.pvalues Bernstein.ind.compute.pvalues Bernstein.p.value Chi.square.compute.pvalues Compute.Chi.sq compute.cumulative.multiple compute.integral compute.integral.from.similarity cumulative.values Do.boolean.membership.matrix do.similarity.noise do.similarity.projection do.similarity.resampling Fuzzy.kmeans.sim.noise Fuzzy.kmeans.sim.projection Fuzzy.kmeans.sim.resampling Hierarchical.sim.noise Hierarchical.sim.projection Hierarchical.sim.resampling Hybrid.testing Hypothesis.testing Intersect Kmeans.sim.noise Kmeans.sim.projection Kmeans.sim.resampling PAM.sim.noise PAM.sim.projection PAM.sim.resampling perturb.by.noise plot_cumulative plot_cumulative.multiple plot_hist.similarity plot_multiple.hist.similarity plot_pvalues sFM sJaccard sM

Dependencies:cluster clusterv MASS

Citation

Development and contributors

Readme and manuals

Help Manual

Help page	Topics
Model order selection for clustering	mosclust-package mosclust
Function to compute the stability indices and the p-values associated to a set of clusterings according to Bernstein inequality.	Bernstein.compute.pvalues Bernstein.ind.compute.pvalues
Function to compute the p-value according to Bernstein inequality.	Bernstein.p.value
Function to compute the stability indices and the p-values associated to a set of clusterings according to the chi-square test between multiple proportions.	Chi.square.compute.pvalues
Function to evaluate if a set of similarity distributions significantly differ using the chi square test.	Compute.Chi.sq
Function to compute the empirical cumulative distribution function (ECDF) of the similarity measures.	compute.cumulative.multiple cumulative.values
Functions to compute the integral of the ecdf of the similarity values	compute.integral compute.integral.from.similarity
Function to compute and build up a pairwise boolean membership matrix.	Do.boolean.membership.matrix
Function that computes sets of similarity indices using injection of gaussian noise.	do.similarity.noise
Function that computes sets of similarity indices using randomized maps.	do.similarity.projection
Function that computes sets of similarity indices using resampling techniques.	do.similarity.resampling
Function to compute similarity indices using noise injection techniques and fuzzy c-mean clustering.	Fuzzy.kmeans.sim.noise
Function to compute similarity indices using random projections and fuzzy c-mean clustering.	Fuzzy.kmeans.sim.projection
Function to compute similarity indices using resampling techniques and fuzzy c-mean clustering.	Fuzzy.kmeans.sim.resampling
Function to compute similarity indices using noise injection techniques and hierarchical clustering.	Hierarchical.sim.noise
Function to compute similarity indices using random projections and hierarchical clustering.	Hierarchical.sim.projection
Function to compute similarity indices using resampling techniques and hierarchical clustering.	Hierarchical.sim.resampling
Statistical test based on stability methods for model order selection.	Hybrid.testing
Function to select significant clusterings from a given set of p-values	Hypothesis.testing
Function to compute the intersection between elements of two vectors	Intersect
Function to compute similarity indices using noise injection techniques and kmeans clustering.	Kmeans.sim.noise
Function to compute similarity indices using random projections and kmeans clustering.	Kmeans.sim.projection
Function to compute similarity indices using resampling techniques and kmeans clustering.	Kmeans.sim.resampling
Function to compute similarity indices using noise injection techniques and PAM clustering.	PAM.sim.noise
Function to compute similarity indices using random projections and PAM clustering.	PAM.sim.projection
Function to compute similarity indices using resampling techniques and PAM clustering.	PAM.sim.resampling
Function to generate a data set perturbed by noise.	perturb.by.noise
Function to plot the empirical cumulative distribution function of the similarity values	plot_cumulative plot_cumulative.multiple
Plotting histograms of similarity measures between clusterings	plot_hist.similarity plot_multiple.hist.similarity
Function to plot p-values for different tests of hypothesis	plot_pvalues
Similarity measures between pairs of clusterings	sFM sJaccard sM