PyDataBlog
diff --git a/.gitignore b/.gitignore
Lines changed: 3 additions & 1 deletion
diff --git a/README.md b/README.md
Lines changed: 96 additions & 0 deletions
diff --git a/benchmark/bench01_distance.jl b/benchmark/bench01_distance.jl
Lines changed: 12 additions & 10 deletions
diff --git a/benchmark/extras/README.md b/benchmark/extras/README.md
Lines changed: 59 additions & 0 deletions
diff --git a/benchmark/extras/comparisons.jl b/benchmark/extras/comparisons.jl
Lines changed: 47 additions & 0 deletions
diff --git a/docs/src/index.md b/docs/src/index.md
Lines changed: 14 additions & 1 deletion
@@ -9,4 +9,6 @@
 /benchmark/tune.json
 .benchmarkci/
 .idea/*
-.vscode/*
+.vscode/*
+test/experiments.jl
+/extras/.ipynb_checkpoints/*
@@ -4,3 +4,99 @@
 [![Dev](https://img.shields.io/badge/docs-dev-blue.svg)](https://PyDataBlog.github.io/ParallelKMeans.jl/dev)
 [![Build Status](https://www.travis-ci.org/PyDataBlog/ParallelKMeans.jl.svg?branch=master)](https://www.travis-ci.org/PyDataBlog/ParallelKMeans.jl)
 [![Coverage Status](https://coveralls.io/repos/github/PyDataBlog/ParallelKMeans.jl/badge.svg?branch=master)](https://coveralls.io/github/PyDataBlog/ParallelKMeans.jl?branch=master)
+[![FOSSA Status](https://app.fossa.com/api/projects/git%2Bgithub.com%2FPyDataBlog%2FParallelKMeans.jl.svg?type=shield)](https://app.fossa.com/projects/git%2Bgithub.com%2FPyDataBlog%2FParallelKMeans.jl?ref=badge_shield)
+_________________________________________________________________________________________________________
+**Authors:** [Bernard Brenyah](https://www.linkedin.com/in/bbrenyah/) & [Andrey Oskin](https://www.linkedin.com/in/andrej-oskin-b2b03959/)
+_________________________________________________________________________________________________________
+
+## Table Of Contents
+
+1. [Motivation](#Motivation)
+2. [Installation](#Installation)
+3. [Features](#Features)
+4. [Benchmarks](#Benchmarks)
+5. [Pending Features](#Pending-Features)
+6. [How To Use](#How-To-Use)
+7. [Release History](#Release-History)
+8. [How To Contribute](#How-To-Contribute)
+9. [Credits](#Credits)
+10. [License](#License)
+
+_________________________________________________________________________________________________________
+
+### Motivation
+A funny story actually led to the development of this package.
+What started off as a personal toy project to reconstruct the K-Means algorithm in native Julia blew up into a heated discussion on the Julia Discourse forum after I asked for Julia optimization tips. Long story short, the Julia community is an amazing one! Andrey Oskin offered his help, and together we decided to push the speed limits of Julia with a parallel implementation of the most famous clustering algorithm. The initial results were mind-blowing, so we decided to tidy up the implementation and share it with the world.
+
+Say hello to our baby, `ParallelKMeans`!
+_________________________________________________________________________________________________________
+
+### Installation
+You can grab the latest stable version of this package by running the command below in Julia.
+Don't forget to invoke Julia's package manager with `]` first.
+
+```julia
+pkg> add ParallelKMeans
+```
+
+For the few (and select) brave ones, the current experimental features are available by adding the experimental branch to your development environment after invoking the package manager with `]`:
+
+```julia
+dev git@github.com:PyDataBlog/ParallelKMeans.jl.git
+```
+
+Don't forget to check out the experimental branch, and you are good to go with bleeding-edge features and breakages!
+```bash
+git checkout experimental
+```
+_________________________________________________________________________________________________________
+
+### Features
+
+- Lightning-fast implementation of the K-Means clustering algorithm, even on a single thread, in native Julia.
+- Support for a multi-threaded implementation of the K-Means clustering algorithm.
+- K-Means++ initialization for faster and better convergence.
+- Modified version of Elkan's triangle inequality to speed up the K-Means algorithm.
+
+_________________________________________________________________________________________________________
+
+### Benchmarks
+
+_________________________________________________________________________________________________________
+
+### Pending Features
+- [X] Implementation of Triangle inequality based on [Elkan C. (2003) "Using the Triangle Inequality to Accelerate
+K-Means"](https://www.aaai.org/Papers/ICML/2003/ICML03-022.pdf)
+- [ ] Support for DataFrame inputs.
+- [ ] Refactoring and finalization of API design.
+- [ ] GPU support.
+- [ ] Even faster Kmeans implementation based on current literature.
+- [ ] Optimization of code base.
+
+_________________________________________________________________________________________________________
+
+### How To Use
+
+```Julia
+
+```
+
+_________________________________________________________________________________________________________
+
+### Release History
+
+- 0.1.0 Initial release
+
+_________________________________________________________________________________________________________
+
+### How To Contribute
+
+_________________________________________________________________________________________________________
+
+### Credits
+
+_________________________________________________________________________________________________________
+
+### License
+
+[![FOSSA Status](https://app.fossa.com/api/projects/git%2Bgithub.com%2FPyDataBlog%2FParallelKMeans.jl.svg?type=large)](https://app.fossa.com/projects/git%2Bgithub.com%2FPyDataBlog%2FParallelKMeans.jl?ref=badge_large)
@@ -7,20 +7,22 @@ using Random
 suite = BenchmarkGroup()
 
 Random.seed!(2020)
-X = rand(100_000, 3)
-centroids = rand(2, 3)
-d = rand(100_000, 2)
-suite["100kx3"] = @benchmarkable ParallelKMeans.pairwise!($d, $X, $centroids)
+X = rand(3, 100_000)
+centroids = rand(3, 2)
+d = Vector{Float64}(undef, 100_000)
+suite["100kx3"] = @benchmarkable ParallelKMeans.colwise!($d, $X, $centroids)
 
-X = rand(100_000, 10)
-centroids = rand(2, 10)
-d = rand(100_000, 2)
-suite["100kx10"] = @benchmarkable ParallelKMeans.pairwise!($d, $X, $centroids)
+X = rand(10, 100_000)
+centroids = rand(10, 2)
+d = Vector{Float64}(undef, 100_000)
+suite["100kx10"] = @benchmarkable ParallelKMeans.colwise!($d, $X, $centroids)
 
 # for reference
 metric = SqEuclidean()
-suite["100kx10_distances"] = @benchmarkable Distances.pairwise!($d, $metric, $X, $centroids, dims = 1)
-
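+# Reference: Distances.jl pairwise distances reduced to each point's nearest centroid.
+# The work runs inside the benchmark so that the timing measures it (the helper name is illustrative).
+nearest_sq(metric, X, centroids) = minimum(Distances.pairwise(metric, X, centroids, dims = 2), dims = 2)
+suite["100kx10_distances"] = @benchmarkable nearest_sq($metric, $X, $centroids)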
 end # module
 
 BenchDistance.suite
@@ -0,0 +1,59 @@
+# Skoffer's comparison of Clustering.jl, single-threaded ParallelKMeans, and multi-threaded ParallelKMeans
+
+```julia
+versioninfo()
+
+Julia Version 1.3.1
+Commit 2d5741174c (2019-12-30 21:36 UTC)
+Platform Info:
+  OS: Linux (x86_64-pc-linux-gnu)
+  CPU: Intel(R) Core(TM) i7-7700HQ CPU @ 2.80GHz
+  WORD_SIZE: 64
+  LIBM: libopenlibm
+  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)
+Environment:
+  JULIA_EDITOR = atom  -a
+  JULIA_NUM_THREADS = 4
+```
+
+`TimerOutputs` output for `X = rand(60, 1_000_000); tol = 1e-6`:
+
+```
+                                        Time                   Allocations
+                                ──────────────────────   ───────────────────────
+        Tot / % measured:            1541s / 85.5%           19.5GiB / 99.4%
+
+ Section                ncalls     time   %tot     avg     alloc   %tot      avg
+ ───────────────────────────────────────────────────────────────────────────────
+ Clustering                  1     662s  50.2%    662s   18.6GiB  96.1%  18.6GiB
+   10 clusters               1    92.6s  7.03%   92.6s   2.35GiB  12.1%  2.35GiB
+   9 clusters                1    89.7s  6.81%   89.7s   2.34GiB  12.1%  2.34GiB
+   8 clusters                1    87.1s  6.62%   87.1s   2.33GiB  12.0%  2.33GiB
+   7 clusters                1    85.3s  6.48%   85.3s   2.32GiB  12.0%  2.32GiB
+   6 clusters                1    80.6s  6.12%   80.6s   2.32GiB  12.0%  2.32GiB
+   5 clusters                1    78.3s  5.95%   78.3s   2.31GiB  11.9%  2.31GiB
+   4 clusters                1    76.6s  5.82%   76.6s   2.30GiB  11.9%  2.30GiB
+   3 clusters                1    50.3s  3.82%   50.3s   1.58GiB  8.16%  1.58GiB
+   2 clusters                1    20.9s  1.59%   20.9s    732MiB  3.69%   732MiB
+ PKMeans Singlethread        2     491s  37.3%    245s    208MiB  1.05%   104MiB
+   9 clusters                1     131s  10.0%    131s   22.9MiB  0.12%  22.9MiB
+   10 clusters               1    89.5s  6.80%   89.5s   22.9MiB  0.12%  22.9MiB
+   7 clusters                1    77.3s  5.87%   77.3s   22.9MiB  0.12%  22.9MiB
+   8 clusters                1    59.4s  4.51%   59.4s   22.9MiB  0.12%  22.9MiB
+   6 clusters                1    44.1s  3.35%   44.1s   22.9MiB  0.12%  22.9MiB
+   5 clusters                1    35.1s  2.67%   35.1s   22.9MiB  0.12%  22.9MiB
+   4 clusters                1    32.9s  2.50%   32.9s   22.9MiB  0.12%  22.9MiB
+   3 clusters                1    14.6s  1.11%   14.6s   22.9MiB  0.12%  22.9MiB
+   2 clusters                2    6.52s  0.50%   3.26s   23.3MiB  0.12%  11.7MiB
+ PKMeans Multithread         1     165s  12.5%    165s    575MiB  2.90%   575MiB
+   9 clusters                1    37.2s  2.82%   37.2s   40.1MiB  0.20%  40.1MiB
+   8 clusters                1    33.1s  2.51%   33.1s   23.9MiB  0.12%  23.9MiB
+   10 clusters               1    25.8s  1.96%   25.8s   24.0MiB  0.12%  24.0MiB
+   6 clusters                1    20.9s  1.59%   20.9s   23.6MiB  0.12%  23.6MiB
+   7 clusters                1    16.4s  1.25%   16.4s   23.4MiB  0.12%  23.4MiB
+   5 clusters                1    13.1s  1.00%   13.1s   23.4MiB  0.12%  23.4MiB
+   4 clusters                1    9.90s  0.75%   9.90s   23.4MiB  0.12%  23.4MiB
+   3 clusters                1    4.97s  0.38%   4.97s    370MiB  1.87%   370MiB
+   2 clusters                1    3.26s  0.25%   3.26s   23.2MiB  0.12%  23.2MiB
+ ───────────────────────────────────────────────────────────────────────────────
+```
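+
+As a rough reading aid, the three section totals in the table above can be turned into speedup ratios. The sketch below only reuses the totals printed by `TimerOutputs` (662s, 491s, 165s); the variable and function names are illustrative and not part of any package. Note that the single-threaded section reports `ncalls = 2`, so its total may include an extra run.
+
+```julia
+# Section totals in seconds, copied from the TimerOutputs table above.
+clustering_total   = 662.0   # Clustering.jl
+singlethread_total = 491.0   # PKMeans Singlethread
+multithread_total  = 165.0   # PKMeans Multithread (JULIA_NUM_THREADS = 4)
+
+# Speedup of `candidate` relative to `baseline`: values above 1 mean the candidate is faster.
+speedup(baseline, candidate) = baseline / candidate
+
+println("Singlethread vs Clustering.jl: ", round(speedup(clustering_total, singlethread_total), digits = 2))
+println("Multithread vs Clustering.jl:  ", round(speedup(clustering_total, multithread_total), digits = 2))
+println("Multithread vs Singlethread:   ", round(speedup(singlethread_total, multithread_total), digits = 2))
+```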
@@ -0,0 +1,47 @@
+using Clustering
+using ParallelKMeans
+using Plots
+using BenchmarkTools
+using TimerOutputs
+using Random
+using ProgressMeter
+
+# Create a TimerOutput, this is the main type that keeps track of everything.
+const to = TimerOutput()
+
+Random.seed!(2020)
+X = rand(60, 1_000_000);
+# Timed assignments
+global a = Float64[]
+global b = Float64[]
+global c = Float64[]
+
+p = Progress(9, 10, "Computing clustering...")
+@timeit to "Clustering" begin
+    for i in 2:10
+        @timeit to "$i clusters" push!(a, Clustering.kmeans(X, i, tol=1e-6, maxiter=300).totalcost)
+        next!(p)
+    end
+end
+
+p = Progress(9, 10, "Computing singlethreaded ParallelKMeans...")
+@timeit to "PKMeans Singlethread" begin
+    for i in 2:10
+        @timeit to "$i clusters" push!(b, ParallelKMeans.kmeans(X, i, tol=1e-6, max_iters=300, verbose=false).totalcost)
+        next!(p)
+    end
+end
+
+p = Progress(9, 10, "Computing multithreaded ParallelKMeans...")
+@timeit to "PKMeans Multithread" begin
+    for i in 2:10
+        @timeit to "$i clusters" push!(c, ParallelKMeans.kmeans(X, i, ParallelKMeans.MultiThread(), tol=1e-6, max_iters=300, verbose=false).totalcost)
+        next!(p)
+    end
+end
+
+plot(a, label="Clustering.jl")
+plot!(b, label="Single-Thread ParallelKmeans")
+plot!(c, label="Multi-Thread ParallelKmeans")
+
+print(to)
@@ -1,4 +1,17 @@
-# ParallelKMeans.jl
+# ParallelKMeans.jl Documentation
+
+```@contents
+```
+
+## Installation
+
+
+## Features
+
+
+## How To Use
+
+
 
 ```@index
 ```