Replaygain backend ffmpeg #3056

zsinskri · 2018-10-18T19:54:06Z

Add a replaygain backend using the ffmpeg CLI-tool and its ebur128 filter.
It is an alternative to the bs1770gain backend for creating R128_* tags.

Comparison with r128gain backend

#3055 also uses ffmpeg, but with an intermediary python module ("r128gain"). This pull request replicates some of the work done in r128gain, but avoids the (beta-) dependency.

The album gain algorithm implemented in this backend - the mean of all track gains, also used by the bs1770gain backend - gives different results than the implementation in r128gain (calculating the gain for a concatenation of all tracks). Calculating the mean is obviously faster than rescanning the whole album, but might deliver worse results.

This backend also does not implement any kind of threading (a single track is scanned at a time), but r128gain does.

zsinskri · 2018-10-18T19:59:38Z

I'm not really happy with my changes to command_output: it feels wrong to just append both streams. Is there a better way to access stderr with the existing tooling in beets?

zsinskri · 2018-10-18T22:12:58Z

I just found this conversation. It seems like using the mean to calculate the album gain is just plain wrong. I will look into this...

sampsyo

Awesome! Thanks for getting this started. It looks great already; I noted the two issues inline.

We don't currently have a variant of command_output that gets the stderr stream, but doing that is probably a good idea. Maybe it would be best to just refactor that function to always return both streams—and change everywhere that calls it to ignore (or appropriately use) the stderr data?

beets/util/__init__.py

beetsplug/replaygain.py

zsinskri · 2018-10-27T15:33:59Z

beetsplug/replaygain.py

+        # convert db to LUFS according to:
+        # http://wiki.hydrogenaud.io/index.php?title=ReplayGain_
+        #   specification#Reference_level
+        self._target_level = config['targetlevel'].as_number() - 103


How exactly are the options targetlevel and r128 supposed to work together?
The only backend that already supports r128, Bs1770gainBackend, just completely ignores targetlevel. (That might be a bug?)

In Bs1770gainBackend initialising it as an EBU R128 Backend only sets the target level to -23 LUFS. So it is impossible to configure the target level of R128_* tags.
Maybe it would make sense to add a config option r128_targetlevel, defaulting to 80 dB (that corresponds to -23 LUFS according to http://wiki.hydrogenaud.io/index.php?title=ReplayGain_specification#Reference_level).
Then add a parameter target_level to Backend.compute_track_gain and Backend.compute_album_gain that is either targetlevel or r128_targetlevel depending on the filetype. That also makes the whole ReplayGainPlugin.r128_backend_instance obsolete, the same backend can just be called with different target levels.
I guess I should create a separate pull request implementing that change?

Also Bs1770gainBackend's method configuration just changes the target level. Shouldn't that be controlled by targetlevel?

Yes, I like that proposed design a lot—with that change, it would be great to dispense with the distinction between different "backend instances" for classic RG and R128.

I also agree that it seems wrong for Bs1770gainBackend to unilaterally change the target level. It does seem like the configuration should control that.

This has been implemented in #3065.

sampsyo · 2018-10-27T16:45:07Z

This is already looking great! Thank you for carefully considering all the details here.

If it seems appropriate to you, let's merge this PR and work on the refactoring you mentioned in a separate PR?

zsinskri · 2018-10-27T19:16:27Z

Yes, I'm fine with merging.

zsinskri · 2018-11-07T18:50:58Z

Have I missed something, that I should do to get this pull request merged, or is it just waiting for a review?

sampsyo · 2018-11-07T20:56:57Z

It's just in my queue to review. Thanks for your patience; I'll get to it real soon!

zsinskri · 2019-06-17T17:02:33Z

Just merged master and resolved conflicts, no new features in latest push.

zsinskri · 2019-06-19T21:06:57Z

Testing this backend's accuracy (especially the custom album gain calculation) with an unscientific sample size of 1:

| | beets w/ ffmpeg | r128gain (uses ffmpeg) | beets w/ bs1770gain |
|---|---|---|---|
| R128_TRACK_GAIN | -1536 | -1536 | -1526 |
| R128_ALBUM_GAIN | -1662 | -1664 | -1669 |

These are with #3314 merged (otherwise every backend will produce garbage).

r128gain calculates the album gain by truly concatenating all tracks and letting ffmpeg re-analyse all audio. We are only 0.0078125 LU (= (1664-1662)/pow(2,8)) off. Considering the difference between bs1770gain and r128gain is 3.5 times as large (or for the track gain 5 times as large) this seems quite good.
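The arithmetic above can be checked with a short sketch. It assumes, as the division by pow(2,8) implies, that R128_*_GAIN tag values are fixed-point integers with 8 fractional bits. The helper name `q78_to_lu` is made up for illustration and is not part of beets.

```python
def q78_to_lu(tag_value):
    """Convert a fixed-point tag value with 8 fractional bits to LU."""
    return tag_value / 2 ** 8


# Album gain tag values taken from the table in this comment.
album_gain = {"ffmpeg": -1662, "r128gain": -1664, "bs1770gain": -1669}

# Distance between this backend and r128gain, in LU.
diff_ffmpeg = q78_to_lu(album_gain["ffmpeg"] - album_gain["r128gain"])
# Distance between r128gain and bs1770gain, in LU.
diff_bs1770gain = q78_to_lu(album_gain["r128gain"] - album_gain["bs1770gain"])

print(diff_ffmpeg)      # 0.0078125
print(diff_bs1770gain)  # 0.01953125
```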

zsinskri · 2019-06-22T10:41:11Z

Both ffmpeg and bs1770gain can calculate either the sample or true peak. At least to my understanding the sample peak is the highest stored value and thus really fast to compute, while the true peak is the maximum of the resulting waveform and takes somewhat longer.

The ffmpeg backend currently offers the peak configuration option, defaulting to true peak as the played waveform is what we actually care about. (Don't we? Is there any relevant standard?)

But the bs1770gain backend unconditionally chooses the sample peak (by using the -p, not -t, CLI-Option). Should I change the default value of the peak option to sample? Why was the sample peak chosen for the bs1770gain backend?

I guess bs1770gain should also respect peak if we do not decide to remove that option. But that is something for a different PR.
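For reference, here is a rough sketch of how a peak setting could be handed to ffmpeg's ebur128 filter. It only builds the argument list and does not run ffmpeg. The function name `build_ffmpeg_command` and the constant `VALID_PEAK_METHODS` are hypothetical and not taken from the PR; the ffmpeg options used (`-nostats`, `-hide_banner`, `-filter:a`, `-f null`) and the filter's `peak` option are standard ffmpeg features.

```python
# Hypothetical helper, not the code from this PR.
VALID_PEAK_METHODS = ("true", "sample")


def build_ffmpeg_command(path, peak_method="true", ffmpeg="ffmpeg"):
    """Return an argv list that analyses `path` with the ebur128 filter."""
    if peak_method not in VALID_PEAK_METHODS:
        raise ValueError(
            "invalid peak method {!r}, expected one of {}".format(
                peak_method, ", ".join(VALID_PEAK_METHODS)
            )
        )
    return [
        ffmpeg,
        "-nostats",      # no progress output
        "-hide_banner",  # no build information
        "-i", path,
        "-filter:a", "ebur128=peak={}".format(peak_method),
        "-f", "null",    # decode only, discard the audio
        "-",
    ]


command = build_ffmpeg_command("track.flac", peak_method="sample")
print(" ".join(command))
```

Validating the option before building the command keeps a typo in the configuration from turning into an obscure ffmpeg error.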

sampsyo · 2019-06-22T13:39:56Z

Huh, good point. I don’t know why that was chosen as the default behavior for the existing backend. It does seem like the backends should be consistent, but that true peak should be the default. Let’s make it do that—perhaps in a PR just for cleaning up the semantics of the “peak” config option?

zsinskri · 2019-06-22T13:45:20Z

The peak config option is introduced by this PR as an ffmpeg-specific option (as I did not know that other backends could support it too).

cleaning up the semantics of the “peak” config option?

If its semantics are off that should be fixed right here and only bs1770gain's support added in a separate PR to avoid merging suboptimal code. Or should we move the whole peak option to a new PR?

sampsyo · 2019-06-22T16:35:07Z

Aha! Sorry, I misunderstood. No, I think it’s fine to keep it here, but I like your idea of making the other backend respect the option too (and use the same default).

beetsplug/replaygain.py

zsinskri · 2019-06-27T10:14:29Z

Now everything is up to date with master and even the version check has been added.

This branch should be ready to merge once again.
If there is anything I can do to help the review process of this merge request, please let me know.

beets/util/__init__.py

Return a namedtuple CommandOutput(stdout, stderr) instead of just stdout from util.command_output, allowing separate access to stdout and stderr. This change is required by the ffmpeg replaygain backend (GitHub PullRequest beetbox#3056) as ffmpeg's ebur128 filter outputs only to stderr.
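A minimal sketch of what such a function could look like, assuming only the `CommandOutput(stdout, stderr)` shape named in the commit message. This is not the code from #3329; the error handling in particular is a guess at reasonable behaviour.

```python
import subprocess
import sys
from collections import namedtuple

CommandOutput = namedtuple("CommandOutput", ("stdout", "stderr"))


def command_output(cmd):
    """Run `cmd` (an argv list) and return its stdout and stderr as bytes.

    Raises subprocess.CalledProcessError on a non-zero exit status.
    """
    proc = subprocess.Popen(
        cmd,
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        stdin=subprocess.DEVNULL,
    )
    stdout, stderr = proc.communicate()
    if proc.returncode:
        raise subprocess.CalledProcessError(
            proc.returncode, cmd, output=stdout, stderr=stderr
        )
    return CommandOutput(stdout, stderr)


# Usage: a child process that writes to both streams.
result = command_output([
    sys.executable, "-c",
    "import sys; sys.stdout.write('out'); sys.stderr.write('err')",
])
print(result.stdout, result.stderr)
```

Callers that only care about stdout can write `command_output(cmd).stdout`, while the ffmpeg backend would read `.stderr`.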

zsinskri · 2019-07-14T16:44:42Z

I have rebased this branch onto #3329, which is now a prerequisite for these changes (as per fortes' great suggestion to split #3329 off), cleaning up some history in the process.

Previously using EBU R128 forced the use of the bs1770gain backend. This change adds a whitelist of backends supporting R128. When the configured backend is in that list it will also be used for R128 calculations. Otherwise bs1770gain is still used as a default. This should not change the overall behaviour of the program at all, but allow for further R128-supporting backends to be added.
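The selection rule described in this commit message can be sketched as follows. All names here (`R128_BACKEND_NAMES`, `DEFAULT_R128_BACKEND`, `r128_backend_name`) are hypothetical, and the list contents assume that the ffmpeg backend is added to the whitelist by the following commit.

```python
# Hypothetical names; the actual plugin attributes may differ.
R128_BACKEND_NAMES = ["bs1770gain", "ffmpeg"]
DEFAULT_R128_BACKEND = "bs1770gain"


def r128_backend_name(configured_backend):
    """Pick the backend used for R128_* tags.

    The configured backend is used when it supports R128; otherwise
    the previous behaviour (bs1770gain) is kept.
    """
    if configured_backend in R128_BACKEND_NAMES:
        return configured_backend
    return DEFAULT_R128_BACKEND


print(r128_backend_name("ffmpeg"))     # ffmpeg
print(r128_backend_name("gstreamer"))  # bs1770gain
```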

Add replaygain backend using ffmpeg's ebur128 filter.

The album gain is calculated as the mean of all BS.1770 gating block powers. Besides differences in gating block offset, this should be equivalent to a BS.1770 analysis of a proper concatenation of all tracks.

Just calculating the mean of all track gains (as implemented by the bs1770gain backend) yields incorrect results as that would:
- completely ignore track lengths
  - just using length in seconds won't work either (e.g. BS.1770 ignores passages below a threshold)
- take the mean of track loudness, not power

When using the ffmpeg replaygain backend to create R128_*_GAIN tags, the targetlevel will be set to -23 LUFS. GitHub PullRequest beetbox#3065 will make this configurable. It will also skip peak calculation, as there is no R128_*_PEAK tag.

It is checked if the libavfilter library supports replaygain calculation. Before version 6.67.100 that did require the `--enable-libebur128` compile-time-option, after that the ebur128 library is included in libavfilter itself. Thus we require either a recent enough libavfilter version or the `--enable-libebur128` option.
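To illustrate why averaging powers differs from averaging loudness values, here is a simplified sketch. It assumes each track is summarised by its integrated loudness and the number of gating blocks above the threshold, recovers the track's mean block power through the BS.1770 relation L = -0.691 + 10 log10(power), and then weights by block count. It does not re-apply the relative gate at album level, so it is an approximation and not the backend's exact code; all function names are made up.

```python
import math


def loudness_to_power(loudness):
    """Invert L = -0.691 + 10 * log10(power)."""
    return 10 ** ((loudness + 0.691) / 10)


def power_to_loudness(power):
    """BS.1770 loudness (LUFS) of a mean-square power."""
    return -0.691 + 10 * math.log10(power)


def album_loudness(tracks):
    """Combine (loudness in LUFS, gating block count) pairs.

    The result is the loudness of the block-count-weighted mean power.
    """
    tracks = list(tracks)
    total_blocks = sum(blocks for _, blocks in tracks)
    if total_blocks == 0:
        raise ValueError("no gating blocks above the threshold")
    total_power = sum(
        blocks * loudness_to_power(loudness) for loudness, blocks in tracks
    )
    return power_to_loudness(total_power / total_blocks)


tracks = [(-20.0, 100), (-10.0, 100)]
naive_mean = sum(loudness for loudness, _ in tracks) / len(tracks)
combined = album_loudness(tracks)
print(naive_mean)          # -15.0
print(round(combined, 2))  # -12.6
```

With two equally long tracks at -20 and -10 LUFS, the mean of the loudness values is -15 LUFS, while the mean of the powers corresponds to roughly -12.6 LUFS, because the louder track dominates the power sum.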

Add changelog entry for the new ffmpeg replaygain backend.

Use keyword arguments to make the ffmpeg parser more readable.

zsinskri · 2019-07-19T19:58:03Z

Rebasing onto master again so that GitHub does not show changes from #3329 under "Files changed". This should make this PR easier to review.

sampsyo

Just did a full review (at last)—this looks wonderful! Here are just a few suggestions.

sampsyo · 2019-07-20T20:28:24Z

beetsplug/replaygain.py

+            )
+
+        # check that peak_method is valid
+        valid_peak_method = ("true", "sample")


No parentheses are necessary here.

sampsyo · 2019-07-20T20:28:37Z

beetsplug/replaygain.py

+
+# ffmpeg backend
+class FfmpegBackend(Backend):
+    """A replaygain backend using ffmpegs ebur128 filter.


Suggested change

"""A replaygain backend using ffmpegs ebur128 filter.

"""A replaygain backend using ffmpeg's ebur128 filter.

sampsyo · 2019-07-20T20:31:34Z

beetsplug/replaygain.py

+
+    def _analyse_item(self, item, count_blocks=True):
+        """Analyse item. Returns a Pair (Gain object, number of gating
+        blocks above threshold).


Analyse item. Return a pair of a Gain object and the number of gating blocks above the threshold.

sampsyo · 2019-07-20T20:32:42Z

beetsplug/replaygain.py

+
+        line_integrated_loudness = self._find_line(
+            output, b"  Integrated loudness:",
+            start_line=(len(output) - 1), step_size=-1,


No parentheses are necessary here.

sampsyo · 2019-07-20T20:33:26Z

beetsplug/replaygain.py

+    def _parse_float(self, line):
+        """Extract a float.
+
+        Extract a float from a key value pair in `line`.


This extra sentence may not add much?

sampsyo · 2019-07-20T20:35:18Z

docs/plugins/replaygain.rst

+This plugin can use one of many backends to compute the ReplayGain values:
+GStreamer, mp3gain (and its cousin, aacgain), Python Audio Tools or ffmpeg.
+mp3gain can be easier to install but GStreamer, Audio Tools and ffmpeg support
+more audio formats.


I would even say that ffmpeg belongs to the "easy to install" camp! It's available in nearly every package manager ever, and it usually comes as a complete package with everything included.

This commit mostly addresses feedback:
- remove some unused parentheses
- fix a typo
- expand some docstrings
- document that ffmpeg is usually easy to install

Use the POSIX character class instead of `\s` to match all whitespace in a regular expression describing the language of valid inputs, in order to avoid a test failure for the invalid escape sequence `\s` in Python strings.
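As background on the failure mode: in an ordinary Python string literal, `\s` is not a recognised escape sequence, which linters and newer Python versions warn about. Independent of the fix chosen in this commit, the usual ways to write such a pattern in Python code are a raw string or a doubled backslash. The sample input line below is made up.

```python
import re

# Raw string: the backslash reaches the regex engine unchanged.
WHITESPACE = re.compile(r"\s+")

# Equivalent spelling with an explicitly escaped backslash.
WHITESPACE_ESCAPED = re.compile("\\s+")

# Made-up sample line in the style of a "key: value unit" output.
parts = WHITESPACE.split("I:   -15.5 LUFS")
print(parts)  # ['I:', '-15.5', 'LUFS']
```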

sampsyo · 2019-07-21T01:40:39Z

This looks perfect! Thank you again for all your hard work on this—this new backend will be a very useful alternative IMO. 🎉

sampsyo reviewed Oct 18, 2018

This was referenced Oct 27, 2018

Replaygain backend r128gain #3055

Closed

replaygain: target level refactor #3065

Merged

sampsyo mentioned this pull request Jan 24, 2019

replaygain: bs1770gain backend is associated with white nationalism #3127

Closed

sampsyo mentioned this pull request May 12, 2019

make replaygain multithreaded #3265

Closed

zsinskri commented Jun 25, 2019

fortes reviewed Jul 10, 2019

zsinskri mentioned this pull request Jul 14, 2019

util.command_output: return stderr, too #3329

Merged

zsinskri force-pushed the replaygain-backend-ffmpeg branch from 779ded0 to 0c7b299 Compare July 14, 2019 16:44

zsinskri added 4 commits July 19, 2019 21:54

changelog entry: ffmpeg replaygain backend

b589521

Add changelog entry for the new ffmpeg replaygain backend.

replaygain: ffmpeg: increase parser readability

271a3c9

Use keyword arguments to make the ffmpeg parser more readable.

zsinskri force-pushed the replaygain-backend-ffmpeg branch from 4391c40 to 271a3c9 Compare July 19, 2019 19:55

sampsyo reviewed Jul 20, 2019

zsinskri added 2 commits July 21, 2019 01:18

improve wording in the ffmpeg replaygain backend

f9ff56f

This commit mostly addresses feedback:
- remove some unused parentheses
- fix a typo
- expand some docstrings
- document that ffmpeg is usually easy to install

avoid test failure

e5f2fe6

Use the POSIX character class instead of `\s` to match all whitespace in a regular expression describing the language of valid inputs, in order to avoid a test failure for the invalid escape sequence `\s` in Python strings.

sampsyo merged commit bd6a5cf into beetbox:master Jul 21, 2019

snejus mentioned this pull request Dec 5, 2024

Release: Fix changelog formatting #5529

Merged
