Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ability to Limit the Number of Scrapers Running Concurrently #1664

Merged
merged 2 commits into from
Apr 2, 2024

Conversation

toshski
Copy link
Contributor

@toshski toshski commented Mar 20, 2024

Fixes #1626

This allows users to specify the maximum number of scrapers that can run concurrently. It can be set with the Environment Variable CONCURRENT_SCRAPERS or runtime parameter concurrent_scrapers.

Note: it will run the scrapers in batches, all scrapers in the batch must complete before it will start the next batch, ie if you set the number of scrapers to 10 and 10 are been scraped and one scraper finishes another scraper will not start, all 10 must finish then another 10 will start

@crwxaj crwxaj merged commit 7596564 into xbapps:master Apr 2, 2024
1 check passed
@pops64
Copy link
Contributor

pops64 commented Apr 2, 2024

Did you mean string or integer for the env variable? Thanks for adding this :D

@toshski toshski deleted the Limit_Consecutive_Site_Scraping branch May 31, 2024 18:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
3 participants