Kids carousel query resulting in inappropriate content #2820

Closed
mekarpeles opened this issue Jan 3, 2020 · 4 comments · Fixed by #2826
Assignees
Labels
Affects: Policy, Lead: @mekarpeles, Module: Carousels, Priority: 1, State: Work In Progress, Type: Bug

Comments

@mekarpeles
Member

mekarpeles commented Jan 3, 2020

We have a kids carousel on OpenLibrary.org which is populated by an Archive.org Elasticsearch query. The query includes Roald Dahl, who has written both kids' books and somewhat sensitive adult material. In general, the query should be robust to authors who publish both kids' and potentially adult material:

$:render_template("home/custom_ia_carousel", title="Kids", key="children", query="preset:children", sorts=["loans__status__last_loan_date+desc"], limit=18, test=test)

Evidence / Screenshot (if possible)

image

Relevant url?

https://openlibrary.org - kids carousel
https://openlibrary.org/works/OL45871W/My_Uncle_Oswald - sensitive result

Proposal & Constraints

A solution seems to be to (a) change the carousel's ES query to include a new constraint and (b) add said constraint to the relevant items' metadata on Archive.org. Flagging the item (using Archive.org explicit flags) doesn’t seem like the appropriate direction in this case.

Question: What metadata [mechanism, value] should we use to identify books as “adult”?

Proposal 1: add a new subject tag, e.g. explicit
Proposal 2: change the query to use a subject tag like Juvenile Fiction
https://archive.org/search.php?query=subject%3A"Juvenile%20Fiction"&and[]=loans__status__status%3A"AVAILABLE"
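The two proposals can be combined in one query string. The following is a minimal sketch, not the actual Open Library implementation: the function names are hypothetical, the `explicit` subject tag is only the example value from Proposal 1 (no such tag is known to exist in Archive.org metadata), and the `AND NOT` clause assumes the search backend accepts Lucene-style boolean syntax.

```python
from urllib.parse import urlencode


def build_kids_query(subject="Juvenile Fiction",
                     excluded_subjects=("explicit",),
                     available_only=True):
    """Build a Lucene-style query string for the kids carousel (hypothetical helper)."""
    # Proposal 2: select books by subject tag instead of by author.
    clauses = [f'subject:"{subject}"']
    if available_only:
        # Same availability filter as the search URL in Proposal 2.
        clauses.append('loans__status__status:"AVAILABLE"')
    # Proposal 1: exclude items carrying an "adult" marker tag.
    # The tag name is an assumption taken from the proposal text.
    clauses.extend(f'NOT subject:"{tag}"' for tag in excluded_subjects)
    return " AND ".join(clauses)


def search_url(query):
    """Return an Archive.org search URL for manual inspection of the results."""
    return "https://archive.org/search.php?" + urlencode({"query": query})


if __name__ == "__main__":
    query = build_kids_query()
    print(query)
    print(search_url(query))
```

The resulting string could be passed as the `query` argument of the carousel template in place of `preset:children`, provided the excluded tag is first added to the relevant items' metadata.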

Related files

$:render_template("home/custom_ia_carousel", title="Kids", key="children", query="preset:children", sorts=["loans__status__last_loan_date+desc"], limit=18, test=test)

https://archive.org/metadata/myuncleoswald00dahl

Stakeholders

@bfalling, @Pezbi

@mekarpeles mekarpeles added the Priority: 1, Module: Carousels, and Affects: Policy labels Jan 3, 2020
@mekarpeles mekarpeles added this to the 2020-Q1 Sprint 1 Proposed milestone Jan 3, 2020
@mekarpeles mekarpeles added the Lead: @mekarpeles label Jan 3, 2020
@BrittanyBunk
Contributor

One reason this can happen is that IA has an action for adding subject tags (OL has them too). If someone uses the tag Juvenile Fiction when uploading to IA or adding a new book to OL, the book will appear. I think this happens because a person may not know what counts as a kids' book (or may do a bulk upload without noticing), and it is difficult to correct if the editors don't know either.

To prevent this from happening again, there should be documentation somewhere that explains what juvenile fiction is, with example books, etc. Maybe there could be a warning saying to use the tag only if you know for sure. Also, maybe people shouldn't be able to add the tag unless they also add a reading level (though I don't think that would work, as most people don't know it).

@xayhewalo xayhewalo added the State: Work In Progress and Type: Bug labels Jan 6, 2020
@cdrini cdrini assigned cdrini and tabshaikh and unassigned cdrini Jan 6, 2020
@tfmorris
Contributor

tfmorris commented Jan 7, 2020

The current "kids" preset is:

'preset:children': '(creator:("parish, Peggy") OR creator:("avi") OR title:("goosebumps") OR creator:("Dahl, Roald") OR creator:("ahlberg, allan") OR creator:("Seuss, Dr") OR creator:("Carle, Eric") OR creator:("Pilkey, Dav"))',

which seems pretty ad hoc and arbitrary. Why not go with something more supportable, like books by winners of the international Hans Christian Andersen Award or books that won the ALA Newbery Medal (and equivalents from other countries)?
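To make the weakness of the creator-based preset concrete, here is a small sketch that contrasts it with a subject-based filter on an in-memory list. It assumes hypothetical subject tags for three well-known titles; the real Archive.org records may be tagged differently, and the helper names are illustrative only.

```python
from dataclasses import dataclass, field

# Abbreviated copy of the creators in the preset quoted above.
PRESET_CREATORS = {"Parish, Peggy", "Avi", "Dahl, Roald", "Seuss, Dr"}


@dataclass
class Item:
    title: str
    creator: str
    subjects: set = field(default_factory=set)


# Hypothetical metadata for illustration; not copied from Archive.org.
ITEMS = [
    Item("Matilda", "Dahl, Roald", {"Juvenile Fiction"}),
    Item("My Uncle Oswald", "Dahl, Roald", {"Fiction"}),
    Item("The Cat in the Hat", "Seuss, Dr", {"Juvenile Fiction"}),
]


def by_creator(items, creators):
    """Mimic the preset: match on author only, ignoring the intended audience."""
    wanted = {c.lower() for c in creators}
    return [i.title for i in items if i.creator.lower() in wanted]


def by_subject(items, subject):
    """Match on a subject tag, so each work is judged on its own metadata."""
    return [i.title for i in items if subject in i.subjects]


if __name__ == "__main__":
    print(by_creator(ITEMS, PRESET_CREATORS))   # includes "My Uncle Oswald"
    print(by_subject(ITEMS, "Juvenile Fiction"))
```

An author-level filter returns every work by a matching creator, which is how an adult title can reach the kids carousel. A work-level filter depends instead on the quality of each item's subject tags.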

@tabshaikh
Collaborator

@tfmorris The preset was arbitrary, so we have removed it. Querying Archive.org by the subject "Juvenile Fiction" returns kid-friendly books, which was one of @mekarpeles's proposals.

@BrittanyBunk
Contributor

@tfmorris not every kids' book is going to come from an award. That's why it's better to base it on a repository, such as a bookseller like Amazon or Barnes and Noble: https://www.barnesandnoble.com/b/books/kids/_/N-29Z8q8Ztu1 B&N even separates by age, which is even better, as it's aligned with the other GitHub issue that goes with this one.


6 participants