example setting the penalty parameter? #4

wqp89324 · 2019-01-18T18:44:03Z

Is there an example about how to set the pen parameter for Pelt?

deepcharles · 2019-01-20T12:54:03Z

Hello,

Finding a proper value for the `pen` parameter heavily depends on the signal at hand. As a rule of thumb, the more noise, samples or dimensions, the larger this parameter should be.
For parametric changes (such as mean-shifts, scale-shifts,...), the Bayesian Information Criterion (BIC) is a good starting point. For instance, detecting mean-shifts with BIC yields:

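import numpy as np
import ruptures as rpt

T, d = signal.shape  # number of samples, dimension (signal is a 2D array)
sigma = ...  # noise standard deviation
bic = sigma*sigma*np.log(T)*d  # BIC-based penalty for mean-shifts
algo = rpt.Pelt().fit(signal)
my_bkps = algo.predict(pen=bic)  # segment end indices; the last one is T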

However, BIC tends to produce penalty values that are too low. When that happens, the simplest procedure is to test several values, as below:

pen_values = np.logspace(0, 3, 10)  # for instance
algo = rpt.Pelt().fit(signal)
bkps_list = [algo.predict(pen=pen) for pen in pen_values]
# then compare elements of bkps_list
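One possible way to compare the elements of `bkps_list` is to count the change points obtained for each penalty and look for a count that stays the same over several penalty values. The sketch below is only an illustration under that assumption: the helper names (`count_change_points`, `summarize_sweep`, `most_stable_count`) and the breakpoint lists are hypothetical, and the "most frequent count" rule is a heuristic, not something ruptures provides. It relies only on the fact that `predict` returns sorted segment end indices, the last of which is the number of samples.

```python
from collections import Counter


def count_change_points(bkps):
    """Number of change points in one ruptures-style result.

    The last index is the end of the signal, so it is not counted as a change.
    """
    return max(len(bkps) - 1, 0)


def summarize_sweep(pen_values, bkps_list):
    """Pair each penalty with the number of change points it produced."""
    return [(pen, count_change_points(bkps)) for pen, bkps in zip(pen_values, bkps_list)]


def most_stable_count(summary):
    """Return the change-point count shared by the most penalty values (heuristic)."""
    counts = Counter(n for _, n in summary)
    return counts.most_common(1)[0][0]


# Hypothetical sweep results for a 500-sample signal (made up for illustration).
pen_values = [1.0, 10.0, 100.0, 1000.0]
bkps_list = [
    [50, 100, 150, 200, 250, 500],  # low penalty: many change points
    [100, 250, 500],
    [100, 250, 500],
    [500],  # high penalty: no change point at all
]

summary = summarize_sweep(pen_values, bkps_list)
for pen, n_changes in summary:
    print(f"pen={pen:g}: {n_changes} change point(s)")

stable = most_stable_count(summary)
print(f"most frequent number of change points: {stable}")
```

With log-spaced penalties, a count that repeats across several values suggests a segmentation that is not very sensitive to the exact penalty, which is usually a reasonable candidate to inspect visually.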

Cheers,

Charles

mylife126 · 2020-10-29T18:15:18Z

Hello, thanks for sharing the BIC theory. However, could you please let me know why the way you calculate BIC is `bic = sigma*sigma*np.log(T)*d`? In the Wiki, I could not find such a derivation. Thanks!

deepcharles · 2020-11-03T08:19:33Z

Hello,

There might be a mistake indeed. The formula should be

$BIC=2\sigma^2\log(T)\times(d+ 1)$

This is only valid for the cost function l2 (see Table 1 of this article).
To verify it, assume the signal is multivariate Gaussian with isotropic variance $\sigma^2$, use the normal formula for BIC and get rid of all terms that do not depend on $K$, the number of change points.
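For convenience, the formula above can be wrapped in a small helper. This is a minimal sketch that simply evaluates the expression as written in this comment; the function name `bic_penalty_l2` and the example numbers are hypothetical, and the noise standard deviation still has to be supplied or estimated by the user.

```python
import math


def bic_penalty_l2(n_samples, n_dims, sigma):
    """Evaluate 2 * sigma^2 * log(T) * (d + 1) for the l2 cost.

    n_samples: number of samples T
    n_dims: signal dimension d
    sigma: noise standard deviation (assumed known or estimated elsewhere)
    """
    if n_samples < 1 or n_dims < 1:
        raise ValueError("n_samples and n_dims must be positive")
    if sigma < 0:
        raise ValueError("sigma must be non-negative")
    return 2.0 * sigma**2 * math.log(n_samples) * (n_dims + 1)


# Hypothetical example: 1000 samples, 3 dimensions, noise std of 0.5.
pen = bic_penalty_l2(n_samples=1000, n_dims=3, sigma=0.5)
print(f"BIC-based penalty: {pen:.3f}")
```

The resulting value would then be passed as `pen` to `predict`, for instance `rpt.Pelt(model="l2").fit(signal).predict(pen=pen)`.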

deepcharles · 2020-11-03T08:20:39Z

~~If you would like to correct it in the docs, do not hesitate to make a pull request ;)~~

oh it is not in the docs yet, never mind then.

DPTPaul · 2021-12-21T09:53:46Z

Hello,

Thanks a lot for this very useful package!
I'm currently working on a signal and have no idea about the number of breakpoints.
I understand that the BIC approximation can only be used with an `l2` cost function. As I am trying to detect both mean and variance shifts, I would like to use an `rbf` cost function.

Is there any trick to set a good penalty value (or at least a sense of scale)?

deepcharles closed this as completed Mar 15, 2019

deepcharles mentioned this issue Jun 8, 2022

how to determine the number of change points using ruptures? #257

Closed
