k_means_cluster 0.3.0 k_means_cluster: ^0.3.0 copied to clipboard
A simple library for partitioning small to medium data sets into clusters using the k-means clustering algorithm.
k-means-cluster #
A very simple implementation of k-means clustering.
Usage #
A clustering session typically involves:
- Setting a distance measurement to use.
distanceMeasure = DistanceType.squaredEuclidean; // default
- Creating a
List
ofInstance
s. This is generally done by mapping from a list of whatever data structures is available.
// For example, data might be a List<String> such
// that each String represents an individual instance.
List<Instance> instances = data.map((datum) {
List<num> coordinates = ...;
String id = ...;
return Instance(location: coordinates, id: id);
}).tolist();
- Creating a
List
ofCluster
s. This can be done manually (e.g. create a set of randomly placed clusters). A convenience functioninitialClusters
exists that takes in the list ofInstances
already created and randomly generates clusters from the instances such that instances more distant to the previous cluster are more likely to seed the next cluster.
List<Cluster> clusters = initialClusters(3, instances, seed: 0);
- Running the algorithm using the
kMeans
function. This is a side-effect heavy function that iteratively shifts the clusters towards the mean position of the associated instances and reassigns instances to the nearest cluster.
kMeans(clusters: clusters, instances: instances);
- Inspecting the
instances
property of each cluster.
clusters.forEach((cluster) {
print(cluster);
cluster.instances.forEach((instance) {
print(" - $instance");
});
});
Please file feature requests and bugs at the issue tracker.