-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problem using doc array() and bytes() functions together #3012
Comments
Apologies if I might be missing some information.
|
Thanks for the example, this definitely looks suspicious. Would you mind making a pull request with your example as an |
It looks to me like the problem might be with the |
If doc.from_array() was called with say, only entity information, this would cause doc.is_tagged to be set to False, even if tags were set. This caused tags to be dropped from serialisation. The same was true for doc.is_parsed. Closes #3012.
Yep, if we do Fixed now 🎉 |
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
How to reproduce the behaviour
Basically, my use case would be to change some of the tagged entities while keeping other entities. From the small example above, it seems the problem occurs when calling the serialization/deserialization part after entity removal part. When you run it, it shows:
However, commenting out the entity removal part (the call to
to_array
up tofrom_array
) of the code, the serialization works fine and shows:Environment
The text was updated successfully, but these errors were encountered: