Feature: Bulk Downloading\ queue Downloading #10

Closed
manolisky opened this issue Jul 28, 2019 · 7 comments
Labels
enhancement New feature or request

Comments

@manolisky

Downloading files in bulk or from a queue would be pretty useful, especially when journal volumes have links that end in sequential numbers. This could be done either with an external file that contains a list of URLs (like wget), or with an in-program GUI queue.
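
To illustrate the list-file idea, here is a minimal sketch of how such a file could be read, assuming one URL per line, much like the input file accepted by `wget -i`. The function name `parse_url_list` is hypothetical and is not part of this project; the actual file format the tool accepts may differ.

```python
def parse_url_list(text: str) -> list[str]:
    """Return the URLs found in `text`, one per line.

    Assumption: one URL per line. Blank lines are skipped and duplicate
    entries are dropped, keeping the first occurrence of each URL.
    """
    seen = set()
    urls = []
    for line in text.splitlines():
        url = line.strip()
        if not url or url in seen:
            continue
        seen.add(url)
        urls.append(url)
    return urls
```

Each returned URL could then be handed to the downloader in order, which is effectively a simple queue.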

@leovan
Owner

leovan commented Jul 29, 2019

Thanks for the suggestion. Could you please give me an example journal volume link? I will look into how to implement this feature, maybe in a more general way so that you can download files from links listed in a file. Since it may change the UI and some key downloading steps, it may take me some time to get this done, hopefully in the next release.

@leovan leovan added the enhancement New feature or request label Jul 29, 2019
@manolisky
Author

manolisky commented Aug 2, 2019

A journal volume may contain articles that have their urls formatted like this:

https://www.examplesite.org/journal/xxxxx100
https://www.examplesite.org/journal/xxxxx101
https://www.examplesite.org/journal/xxxxx102

And the next volume would be:

https://www.site.org/journal/xxxxx103

And so on.

So maybe a command that would download from a range like .../xxxxx100-.../xxxxx150 would be really helpful.

But even if it just has a file as an input it would be fine, since it is easy to create these ascending lists with a script.
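
A minimal sketch of such a script, assuming the URLs share a fixed prefix and differ only in a trailing integer. The function name `expand_range` is made up for illustration, and the prefix is the placeholder from the example above.

```python
def expand_range(prefix: str, start: int, end: int) -> list[str]:
    """Return prefix + n for every n from start to end, inclusive."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return [f"{prefix}{n}" for n in range(start, end + 1)]


if __name__ == "__main__":
    # Placeholder prefix taken from the example URLs; replace with a real one.
    for url in expand_range("https://www.examplesite.org/journal/xxxxx", 100, 150):
        print(url)
```

Redirecting the output to a file gives a list with one URL per line that could be used as the input file.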

Now, I’m not sure how it will work with captchas. If you can input them just once for verification, that would be really good. If you have to input them for every download, that’s fine too; it still saves time.

@leovan leovan closed this as completed in 42d6db2 Aug 4, 2019
@leovan
Owner

leovan commented Aug 4, 2019

Hi @manolovskij, it now supports downloading with a query list file. It's a more common way to download with multiple queries. Hope it helps with your problem.

@manolisky
Author

manolisky commented Aug 5, 2019

I upgraded to version 3.0.0. The load option for query files started just fine, but it got stuck after I entered the captcha. Then I tried downloading from a single URL, and it also got stuck after the captcha.

This is where the output gets stuck:

[INFO] - Fetching PDF ...
Angel [CAPTCHA] is coming!

Then I tried with version 2.1.2 and it worked great, with this output:

[INFO] - Fetching PDF ...
[WARNING] - Angel [CAPTCHA] is coming!
[INFO] - Angel [CAPTCHA] down!
[INFO] - Saved PDF as: ...

It seems 3.0.0 has a problem with this.
I run the app under macOS Mojave 10.14.5, installed via Homebrew.

@leovan
Owner

leovan commented Aug 6, 2019

Hi @manolovskij, sorry for the bugs. Since it's hard to reproduce the captcha situation, I still could not find the problem even after double-checking the code. I will open a new issue to add a fake server to test all situations. I hope to fix it in the next release.

@leovan
Owner

leovan commented Aug 8, 2019

Hi @manolovskij, the bug is fixed and range patterns in queries are also supported. Try the new release.

@manolisky
Author

Everything works perfectly. Awesome work, thanks.
