Remove UnivariateGMM #844

Open

simonbyrne opened this issue Mar 19, 2019 · 2 comments

@simonbyrne (Member)

I think UnivariateGMM is completely unnecessary: it could just be implemented as an alias of MixtureModel.
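
For example, something like the following (the univariate_gmm name and signature are just illustrative, not an existing Distributions.jl API):

using Distributions

# Hypothetical constructor forwarding the UnivariateGMM arguments to
# MixtureModel, which already represents the same distribution.
univariate_gmm(means, stds, prior::Categorical) =
    MixtureModel(Normal.(means, stds), probs(prior))

d = univariate_gmm([-1.0, 1.0], [0.5, 2.0], Categorical([0.3, 0.7]))
logpdf(d, 0.0)  # log density of the two-component mixture at 0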

If the aim is to support storing the parameters as vectors of means and stds, it would be better to do that via StructArrays.jl.
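
For reference, a small sketch of the StructArrays.jl approach (assuming StructArray can materialize Normal elements from its positional (μ, σ) fields):

using Distributions, StructArrays

mu  = [-1.0, 0.0, 1.0]
sig = [0.5, 1.0, 2.0]

# Internally stores the components as two parallel vectors (μ and σ),
# while still behaving like a Vector{Normal{Float64}}.
comps = StructArray{Normal{Float64}}((mu, sig))
comps[2]  # Normal(0.0, 1.0)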

@simonbyrne (Member, Author)

(also, I apologise for opening this issue the day #615 was merged)

@luiarthur commented Aug 14, 2020

Hi,

Not sure if this is enough motivation to keep UnivariateGMM, but here's a little benchmark that seems to show that logpdf on UnivariateGMM is much more efficient than logpdf on a MixtureModel of Normals.

using Distributions
using StatsFuns
using BenchmarkTools
using Random

# Function to simulate some data for GMM logpdf.
function gendata(nobs, nmix)
  x = randn(nobs)
  mu = collect(range(-3, 3, length=nmix))
  sig = rand(nmix)
  w = let
    _w = rand(nmix)
    _w / sum(_w)
  end
  return mu, sig, w, x
end

# Shorthand for the GMM logpdf: logsumexp of the per-component normal
# log-densities plus log-weights, summed over observations.
gmm_lpdf(mu, sig, w, x; dims) = sum(logsumexp(normlogpdf.(mu, sig, x) .+ log.(w), dims=dims))

# Generate data.
Random.seed!(0);
mu, sig, w, x  = gendata(100, 5)

### Benchmark ###

@btime gmm_lpdf(mu', sig', w',  x[:, :], dims=2)  # V1: 15.5 μs

@btime sum(logsumexp(logpdf.(Normal.(mu', sig'), x[:, :]) .+ log.(w'), dims=2))  # V2: 20.6 μs

@btime sum(logpdf.(UnivariateGMM(mu, sig, Categorical(w)), x))  # V3: 20.9 μs

@btime sum(logpdf.(MixtureModel(Normal.(mu, sig), w), x))  # V4: 317.2 μs

I've timed four things that do the same thing here: compute the sum of the log density of a location-scale mixture of normals, evaluated at a vector of 100 univariate values.

The UnivariateGMM version (V3) is over 10x faster than the MixtureModel one (V4) in this example.

These are all vectorized implementations, so there's definitely room to optimize further here. But I just want to illustrate the utility of having UnivariateGMM.
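
For comparison, here is a non-vectorized sketch of the same computation that works directly on the parameter vectors, roughly the kind of loop that makes UnivariateGMM cheap (an illustration only, not the actual Distributions.jl implementation):

using StatsFuns: logsumexp, normlogpdf

# Scalar mixture log-density: logsumexp over log(w_k) + log N(x; mu_k, sig_k),
# without allocating per-component Normal objects.
mix_lpdf(mu, sig, logw, x::Real) =
    logsumexp(logw[k] + normlogpdf(mu[k], sig[k], x) for k in eachindex(mu))

logw = log.(w)
sum(xi -> mix_lpdf(mu, sig, logw, xi), x)  # same quantity as V1-V4 above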
