Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider adding OA collection thoth-archiving-network to OL Import List #9328

Closed
mekarpeles opened this issue May 23, 2024 · 0 comments · Fixed by #9413
Closed

Consider adding OA collection thoth-archiving-network to OL Import List #9328

mekarpeles opened this issue May 23, 2024 · 0 comments · Fixed by #9413
Labels
Affects: Policy Lead: @mekarpeles Issues overseen by Mek (Staff: Program Lead) [managed] Module: Import Issues related to the configuration or use of importbot and other bulk import systems. [managed] Needs: Staff Decision Issues that are blocked on a staff member's decision Needs: Staff / Internal Reviewed a PR but don't have merge powers? Use this. Priority: 3 Issues that we can consider at our leisure. [managed]

Comments

@mekarpeles
Copy link
Member

mekarpeles commented May 23, 2024

params = {
'q': ' AND '.join(
[
'mediatype:texts',
'(%s)'
% ' OR '.join(
f'repub_state:{state}' for state in VALID_READY_REPUB_STATES
),
'scanningcenter:*',
'scanner:*',
'scandate:*',
'!collection:opensource',
'!collection:additional_collections',
'!collection:litigationworks',
'!noindex:true',
'!is_dark:true',
'format:pdf',
f'indexdate:{day}*',
# Fetch back to items added before the day of interest, since items
# sometimes take a few days to process into the collection.
f'addeddate:[{day - 60 * DAY} TO {day + 1 * DAY}]',
# TODO: This seems to be getting more records than expected
*(['format:marc'] if marcs else []),
]
),
'fl': 'identifier,format',
'service': 'metadata__unlimited',
'rows': '100000', # This is the max, I believe
'output': 'json',
}
return f'{IA_BASE_URL}/advancedsearch.php?' + urlencode(params)

Pending policy approval

@mekarpeles mekarpeles added Priority: 3 Issues that we can consider at our leisure. [managed] Module: Import Issues related to the configuration or use of importbot and other bulk import systems. [managed] Lead: @mekarpeles Issues overseen by Mek (Staff: Program Lead) [managed] Affects: Policy Needs: Staff / Internal Reviewed a PR but don't have merge powers? Use this. Needs: Staff Decision Issues that are blocked on a staff member's decision labels May 23, 2024
@mekarpeles mekarpeles changed the title Consider adding thoth-archiving-network to OL Import List Consider adding OA collection thoth-archiving-network to OL Import List May 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Affects: Policy Lead: @mekarpeles Issues overseen by Mek (Staff: Program Lead) [managed] Module: Import Issues related to the configuration or use of importbot and other bulk import systems. [managed] Needs: Staff Decision Issues that are blocked on a staff member's decision Needs: Staff / Internal Reviewed a PR but don't have merge powers? Use this. Priority: 3 Issues that we can consider at our leisure. [managed]
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant