First try adding new albuminfo and trackinfo class #3568

dosoe · 2020-04-25T17:52:55Z

New attempt at dealing with #1547 learning from errors in #2650

dosoe · 2020-04-25T18:01:04Z

This new implementation of AlbumInfo and TrackInfo allows flexible tags. A quick test with some of my data seems to work. It has one significant difference: because I overrode __getattr__, hasattr(track, tag) will always return True. To check whether a tag is set, one must check whether track.tag is None. This especially concerns plugins, as they sometimes add new tags, but it might be important elsewhere.
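To make the hasattr point concrete, here is a minimal sketch. The class name AlwaysNone is hypothetical and only imitates the behaviour described above; it is not the code from this PR.

```python
class AlwaysNone:
    """Hypothetical stand-in: missing attributes come back as None."""

    def __init__(self, **fields):
        # Store the given tags as ordinary instance attributes.
        self.__dict__.update(fields)

    def __getattr__(self, name):
        # Only called when normal lookup fails; it never raises.
        return None


track = AlwaysNone(title='one')

# hasattr() reports True for everything, because it only returns False
# when the lookup raises AttributeError.
has_title = hasattr(track, 'title')
has_missing = hasattr(track, 'no_such_tag')

# The only reliable test is comparing the value with None.
missing_is_none = track.no_such_tag is None
```

With a class like this, both has_title and has_missing end up True, so callers cannot use hasattr to detect unset tags.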

dosoe · 2020-04-25T18:36:34Z

This behaviour of __getattr__ is what makes the tests fail: in Python 2, copy.deepcopy(x) relies on __getattr__ raising AttributeError for missing attributes. One solution would be to adapt __getattr__ so that it raises an AttributeError each time we try to get a tag that either does not exist or whose value is None. @sampsyo, could that be a concern or can I implement it? Are there tags that should exist with the value None on an item and be kept as such, or can these tags be removed from the item?

dosoe · 2020-04-25T19:08:44Z

It raises the question: The goal of this is to have some core attributes that are always populated and then some that are flexibly attributed. Which attributes are core attributes? From the definition of AlbumInfo, it seems that only album, album_id, artist, artist_id, tracks are core (as the others are optional) but in the tests it calls for the other values.
Do we want all tags to return None when called (and not populated) or do we want to raise an AttributeError? The latter is compatible with copy.deepcopy in python2, the former isn't.

sampsyo · 2020-04-25T20:05:29Z

Great! Thanks for getting this going!

Here's what I think we should do:

Everything should be flexible, i.e., no built-in attributes.
As a purely transitional matter, maybe we should provide a list of attributes that should produce None when they are missing instead of raising an AttributeError.
When you access any other attribute, like info.foo and there's no value for foo, it should raise an exception rather than silently returning None.
We should probably provide key-style access, like info['foo'] and info.get('foo'). In the future, we can make code that consumes this stuff work more like that.
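The four points above could be sketched roughly as follows. This is an illustration under assumptions, not the class from this PR: the name FlexInfo and the contents of legacy_none_fields are invented for the example.

```python
class FlexInfo(dict):
    """Hypothetical container with no built-in attributes."""

    # Assumed transitional list: these names give None when missing.
    legacy_none_fields = frozenset({'artist_sort', 'year'})

    def __getattr__(self, name):
        # Only called when normal attribute lookup fails.
        if name in self:
            return self[name]
        if name in self.legacy_none_fields:
            return None
        # Any other missing attribute is an error, not a silent None.
        raise AttributeError(name)

    def __setattr__(self, name, value):
        # Attribute assignment stores the value as a dictionary key.
        self[name] = value


info = FlexInfo(album='Some Album')
info.artist = 'Some Artist'

album = info.album                  # attribute-style access
artist = info['artist']             # key-style access
year = info.year                    # transitional field: None when missing
foo = info.get('foo')               # dict.get() also gives None

try:
    info.foo
    raised = False
except AttributeError:
    raised = True
```

Because missing attributes raise AttributeError here, hasattr(info, 'foo') is False again, and lookups of special names such as __deepcopy__ fail in the normal way.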

Does that seem reasonable? Perhaps I'm being too aggressive here about eliminating all built-in attributes, but I think it is probably the right thing to do.

dosoe · 2020-04-25T20:34:07Z

I think in principle, yes, but I'm running into trouble adapting beets/autotag/__init__.py as some tags (like title) are hardcoded, especially in apply_metadata. I'm looking into this.

dosoe · 2020-04-25T21:13:57Z

OK, it's going to be a wild chase to find every if item.attr: and replace it with if attr in item, and to check that each attribute exists before accessing it; otherwise we will get AttributeError all over the place. Maybe you could help me find all instances, @sampsyo? Or maybe there is a better way?

dosoe · 2020-04-25T21:30:10Z

* We should probably provide key-style access, like `info['foo']` and `info.get('foo')`. In the future, we can make code that consumes this stuff work more like that.

This already works.

* When you access any other attribute, like `info.foo` and there's no value for `foo`, it should raise an exception rather than silently returning `None`.

That's already the case.
So far I'm working with everything being flexible, but then there's a lot to replace in the code, you might be able to help as you know the code better than I do.

dosoe · 2020-04-27T09:52:16Z

That's a working prototype. For now I kept all the default values (all the tags that were previously set to None still are), as chasing all their occurrences and replacing every test has proven too difficult for me. However, that could now be done one tag at a time. Also, apply_metadata and apply_item_metadata from __init__.py have not been changed. That could also be done later, but I don't really know what those functions do, so I don't know how to do it properly.

sampsyo

Looking great so far! It's awesome that the "apply" functions do not need to change at all (for now).

One thing to eventually clean up: it looks like the Map class is taken from a Stack Overflow answer? https://stackoverflow.com/a/32107024/39182

It might be a bit "overkill" for what we need. For example, I don't think we're using the *args part of the constructor? And similarly I don't think we're using the __dict__ part of the functionality? Maybe something much simpler like Confuse's AttrDict would suffice?
https://github.com/beetbox/confuse/blob/3bf9680e7d242136cc304232f902db06c4cc8e11/confuse.py#L1659-L1667

dosoe · 2020-04-27T13:10:18Z

I do need some of the methods, because it needs to be hashable and I need a __setstate__ and a __getstate__ for deepcopy operations. As for the rest, I will try to clean up the constructor a bit and check which methods are essential. What do you mean by the __dict__ part of the functionality? **Edit:** I use the __dict__ functionality in the __getstate__ method.

sampsyo · 2020-04-27T13:16:37Z

I see… can you elaborate a little bit on which parts of the code require those two things (hashability and deep copying)?

For hashability, I think we might not want a smart approach that hashes based on the data… we might want every object to be "unequal to" every other unique object. I think that's the default for object but not for dict.

dosoe · 2020-04-27T13:38:00Z

I remember deepcopy being a problem in my previous attempt, especially for Python 2. It seems to be linked to pickle and needs a __getstate__ and a __setstate__ method; that's also mentioned in the comments on this Stack Overflow answer: https://stackoverflow.com/questions/2352181/how-to-use-a-dot-to-access-members-of-dictionary/32107024#32107024 . I also found this error in one of the test logs of my previous attempts to implement flexible classes (#2650):

Traceback (most recent call last):
  File "/home/travis/build/beetbox/beets/test/test_autotag.py", line 436, in test_comp_track_artists_do_not_match
    self.assertNotEqual(self._dist(items, info), 0)
  File "/home/travis/build/beetbox/beets/test/test_autotag.py", line 340, in _dist
    return match.distance(items, info, self._mapping(items, info))
  File "/home/travis/build/beetbox/beets/beets/autotag/match.py", line 251, in distance
    dist.tracks[track] = track_distance(item, track, album_info.va)
TypeError: unhashable type: 'TrackInfo'

This shows the types have to be hashable.

sampsyo · 2020-04-27T13:40:56Z

OK—would you mind double-checking to see whether deep copying is still necessary anywhere? It might work to just delete those various methods and see if any tests fail…

That traceback is an example showing that the type does indeed need to be hashable, but this doesn't necessarily imply that the implementation should use the actual contents instead of the object identity. In fact, I think it would be incorrect to make two objects be considered equal in that context if their contents are equal. I think a default-ish implementation like id(self) would be appropriate?
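For illustration, an identity-based hash on a dict subclass might look like the sketch below. The class name IdentityInfo is hypothetical; this shows the general idea only, not necessarily what the PR ends up doing.

```python
class IdentityInfo(dict):
    """Hypothetical dict subclass hashed by identity, not by contents."""

    def __hash__(self):
        # Plain dict sets __hash__ to None; defining it here makes
        # instances usable as dictionary keys again.
        return id(self)


a = IdentityInfo(title='one')
b = IdentityInfo(title='one')

# Both objects can be keys at the same time although their contents match.
distances = {a: 0.1, b: 0.9}
```

One caveat with this sketch: dict's __eq__ still compares contents, so a == b is True while their hashes differ. Python's documentation asks that objects comparing equal also hash equal, so a stricter version would override __eq__ to compare identity as well.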

dosoe · 2020-04-27T13:46:46Z

grep deepcopy shows it only appears in some tests (namely test_ui and test_autotag), so we should be able to survive without it. I will fix the hash.

sampsyo · 2020-04-28T11:58:37Z

Yep, that's all it does: it only converts bytestrings (i.e., bytes on Python 3) to text strings (str on Python 3). The only question is whether there are fields that want to remain as bytes.

dosoe · 2020-04-28T13:30:54Z

The only ones I could think of are acoustic fingerprints and similar pseudo-strings

dosoe · 2020-05-04T09:14:50Z

Whatever it is, it's probably simpler to just decode all strings into unicode and add a list of exceptions rather than doing it the other way. Is there a reason to convert everything to unicode?

sampsyo · 2020-05-04T11:13:34Z

Yeah: the main reason to use the "positive" rather than "negative" approach is that, with this PR, the list of fields is meant to be extensible. So when someone comes along and adds a new field, perhaps in a plugin, we don't want to have to touch the rest of the code. We would need to choose a "default behavior," and the most sensible default is not to touch any data—to pass it along as the metadata source provided it.

dosoe · 2020-05-05T22:49:39Z

But why do you actually bother to convert anything to unicode? Are regular bytestrings not good enough? Is it for special characters (Cyrillic, Japanese, Chinese, etc.)?

sampsyo · 2020-05-05T23:28:02Z

Yes—eventually, all strings that represent text in beets must be Unicode strings. That's the only way to reliably represent the full range of characters people use in their metadata.

dosoe · 2020-05-06T10:58:52Z

Then it would make sense to make the default behaviour to convert every string to unicode unless stated otherwise.

sampsyo · 2020-05-06T11:28:47Z

But that runs into the problem above: what if a plugin wants to add a field that is supposed to contain bytes? Especially if it's not a built-in beets plugin, it would have no way to instruct beets core to skip the conversion.

dosoe · 2020-05-06T12:39:27Z

Why not? A field gets converted only if it exists.

sampsyo · 2020-05-06T12:46:30Z

Say I write a plugin that provides a fingerprint tag, $myfp. It holds a byte string, intentionally. The beets core has never heard of this before. But the loop you're proposing will look like this:

for field in self.data:
    if isinstance(self[field], bytes) and field not in do_not_convert_these_fields:
        self[field] = self[field].decode('utf8')

Because myfp is a brand-new field I just invented, it can't possibly be in the hard-coded do_not_convert_these_fields list. So it will get converted to text, and there's nothing I (the notional plugin developer) can do about it.
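By contrast, the "positive" approach would look something like the sketch below, where only fields known to hold text are decoded. The names TEXT_FIELDS and decode_text_fields are hypothetical and the field list is invented for the example; this is not code from the PR.

```python
# Assumed allow-list of fields that are known to contain text.
TEXT_FIELDS = ('title', 'artist')


def decode_text_fields(data, codec='utf-8'):
    """Decode only the allow-listed fields; leave everything else alone."""
    for field in TEXT_FIELDS:
        value = data.get(field)
        if isinstance(value, bytes):
            data[field] = value.decode(codec, 'ignore')
    return data


# 'myfp' is the plugin-provided fingerprint field from the example above.
info = {'title': b'caf\xc3\xa9', 'myfp': b'\x00\x01\xff'}
decode_text_fields(info)
```

Here title becomes a text string while myfp stays as bytes, without the core needing to know that myfp exists.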

dosoe · 2020-05-07T14:31:37Z

Then is there anything that holds this PR back from being merged? I'm thinking of adding a method like append(key, value) that, if self.key is already set, does self.key += ', ' + value.decode(...), so that we don't need to make lists for all the tags that have multiple values. That's not really a priority, though; I would like to get this PR ready to be merged first.
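A rough sketch of what such an append helper could look like, written as a standalone function over a plain dict. The name append_value, the separator default and the decoding arguments are all assumptions made for the example, since the comment above leaves them open.

```python
def append_value(info, key, value, sep=', ', codec='utf-8'):
    """Append value to info[key], joining with sep if a value exists."""
    if isinstance(value, bytes):
        # Assumed behaviour: decode bytes before joining.
        value = value.decode(codec, 'ignore')
    current = info.get(key)
    if current:
        info[key] = current + sep + value
    else:
        info[key] = value


tags = {}
append_value(tags, 'performer', 'First Performer')
append_value(tags, 'performer', b'Second Performer')
```

After the two calls, tags['performer'] holds both names in one comma-separated string.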

sampsyo

Looking great overall! Here's one more code review with a few low-level revisions.

sampsyo · 2020-05-07T22:54:07Z

beets/autotag/hooks.py

@@ -138,53 +144,41 @@ def decode(self, codec='utf-8'):
            if isinstance(value, bytes):
                setattr(self, fld, value.decode(codec, 'ignore'))

-        if self.tracks:
+        if 'tracks' in self:


Perhaps tracks is the one thing we should keep non-optional. That is, you must provide a list of track objects, unlike all the other fields, which are different because they are just metadata.

I'm actually not sure why we have this if. The loop just doesn't do anything if the list is empty.

I agree, to me it doesn't make sense to have an album without tracks.

sampsyo · 2020-05-07T22:54:39Z

beets/autotag/hooks.py

            for track in self.tracks:
                track.decode(codec)

+    def dup_albuminfo(self):


Let's just call this copy (because it's obviously a method on AlbumInfo).

sampsyo · 2020-05-07T22:55:24Z

beets/autotag/hooks.py

+            tracks = []
+            for track in self.tracks:
+                tracks.append(track.dup_trackinfo())
+            dupe.tracks = tracks


This loop can be replaced with a list comprehension:

dupe.tracks = [track.copy() for track in self.tracks]
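To show how the two copy methods fit together, here is a minimal sketch. The classes Track and Album are hypothetical stand-ins built on dict, not the real TrackInfo and AlbumInfo from hooks.py.

```python
class Track(dict):
    """Hypothetical stand-in for a track info object."""

    def copy(self):
        # A new Track holding the same key/value pairs.
        return Track(self)


class Album(dict):
    """Hypothetical stand-in for an album info object."""

    def copy(self):
        dupe = Album(self)
        # Copy each track too, so the duplicate does not share them.
        dupe['tracks'] = [track.copy() for track in self['tracks']]
        return dupe


album = Album(album='Some Album', tracks=[Track(title='one')])
dupe = album.copy()

# Changing the duplicate leaves the original untouched.
dupe['tracks'][0]['title'] = 'changed'
```

This gives tests the independence they previously got from copy.deepcopy, as long as the field values themselves are immutable (strings, numbers, None).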

sampsyo · 2020-05-07T22:56:13Z

beets/autotag/hooks.py

@@ -224,6 +219,11 @@ def decode(self, codec='utf-8'):
            if isinstance(value, bytes):
                setattr(self, fld, value.decode(codec, 'ignore'))

+    def dup_trackinfo(self):


Also call this copy?

sampsyo · 2020-05-07T23:06:28Z

test/test_autotag.py

-        trackinfo.append(TrackInfo(u'three', None))
+        trackinfo.append(TrackInfo(title=u'one', track_id=None))
+        trackinfo.append(TrackInfo(title=u'two', track_id=None))
+        trackinfo.append(TrackInfo(title=u'three', track_id=None))


Seems like track_id=None may no longer be necessary?

sampsyo · 2020-05-07T23:06:42Z

test/test_autotag.py

@@ -595,7 +597,8 @@ def item(i, length):
        items.append(item(12, 186.45916150485752))

        def info(index, title, length):
-            return TrackInfo(title, None, length=length, index=index)
+            return TrackInfo(title=title, track_id=None, length=length,


sampsyo · 2020-05-07T23:07:10Z

test/test_autotag.py

@@ -749,13 +752,15 @@ def test_albumtype_applied(self):
        self.assertEqual(self.items[1].albumtype, 'album')

    def test_album_artist_overrides_empty_track_artist(self):
-        my_info = copy.deepcopy(self.info)
+        # make a deepcopy of self.info


This comment is probably not necessary every time?

sampsyo · 2020-05-07T23:07:57Z

test/test_ui.py

+        i2 = library.Item()
+        i2.bitrate = 4321
+        i2.length = 10 * 60 + 54
+        i2.format = "F"


Any particular reason why this is not i2 = self.item.copy()?

I wonder why this wasn't done long ago: a docstring in one of the parent classes explicitly says that deepcopy() doesn't work on these objects.

dosoe · 2020-05-08T14:37:54Z

Should I provide the changelog as well? Should the documentation be altered?

sampsyo · 2020-05-08T22:30:13Z

Awesome! A changelog entry would be great. I can't think of anywhere else in the docs where we mention this stuff, so probably nothing else needs to change.

Thanks for all your work on this!

beets/autotag/hooks.py

Co-authored-by: Adrian Sampson <adrian@radbox.org>

dosoe · 2020-05-09T10:52:14Z

That should be it then.

sampsyo · 2020-05-09T14:53:39Z

Awesome!! Thank you for your careful work on this. This is a long-standing request that will help enable lots of interesting additions in the future. Three cheers!

dosoe · 2020-05-10T15:06:11Z

I'm coming back to this PR: I tried to add new fields, which should be easy now, but I found out we missed an important part in apply_metadata in beets/autotag/__init__.py. New PR incoming to sort this out.

First try adding new albuminfo and trackinfo class

981d4dc

arrange __getattr__ to behave normally

da43ff9

dosoe added 2 commits April 25, 2020 22:24

arrange decode, set all attributes to be flexible

fea6ffc

all attributes are flexible, so no positional arguments when initiati…

53ce6f8

…ng the class

dosoe added 2 commits April 25, 2020 23:13

cleaning up beets/autotag/__init__.py

1b2c839

remove prints for testing

62566ee

dosoe force-pushed the beet_test_new_albuminfo branch from e84c272 to 62566ee Compare April 26, 2020 19:47

dosoe added 6 commits April 27, 2020 11:21

reintroduce default arguments, adapt all occurences of TrackInfo and …

f507f04

…AlbumInfo to the absence of positional arguments

forgot a positional argument

32d81d8

typo in tests

805f4e8

scale back some changes in __init__.py and hooks.py

bd54313

mixed two PR

363cbf8

lines too long

14e1b33

dosoe added 2 commits April 27, 2020 12:01

arrange decoder

8df2e5b

dupe test

98389b6

sampsyo reviewed Apr 27, 2020

forgot to decode all tracks of an album

63df7cf

dosoe added 2 commits April 28, 2020 14:18

create method for deepcopy

d7ed846

forgot a return

370df62

sampsyo requested changes May 7, 2020

cleaning up, renaming dup_XXInfo() to copy()

7c71bb8

sampsyo reviewed May 8, 2020

dosoe and others added 3 commits May 9, 2020 12:29

Update AttrDict docstring

4b7f42d

Co-authored-by: Adrian Sampson <adrian@radbox.org>

changelog entry

8fa103e

Merge branch 'master' into beet_test_new_albuminfo

d07c1de

sampsyo merged commit a907dac into beetbox:master May 9, 2020

dosoe deleted the beet_test_new_albuminfo branch May 9, 2020 14:57

dosoe mentioned this pull request May 10, 2020

Correct beet/autotag/__init__.py to adapt to flexible tags #3587

Merged

This was referenced Jul 10, 2020

Make Tags more flexible to implement #2650

Closed

new generic TrackInfo and AlbumInfo #2654

Closed

sampsyo mentioned this pull request Jul 11, 2020

Create arranger_sort, lyricist_sort, performer and performer_sort tags #2563

Closed
