-
-
Notifications
You must be signed in to change notification settings - Fork 975
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Realbooru extractor broken #2530
Comments
@mikf |
btw, |
Same in the browser.. their "original" links seem to be broken for many of these. |
404 again, seems realbooru changed something again, but not same like last time. https://realbooru.com/index.php?page=post&s=view&id=813374 haven't dug into the details, maybe later. where is the source code which extract |
You can't reopen this issue, since you didn't create it and are not the repo owner. A new one might have been better, but this works as well.
gallery-dl uses the old gelbooru API from back when realbooru was just a gelbooru fork. It still "works", but, as you said, most URLs are missing their filename extensions. |
Fixed in ac97aca (v1.25.2) by grabbing file URLs from HTML post pages. |
@mikf
['gallery-dl', '-R', '20', '-c', 'C:\\Users\\mo-han\\locallib\\usr\\etc\\gallery-dl.json', '-o', 'base-directory=C:\\Users\\mo-han\\locallib\\usr\\dl\\gldl', '-o', 'cookies-update=true', '-o', 'videos=true', '-o', 'tags=true', '-o', 'filename="{category} {date!S:.10} {id} {md5} ${tags_copyright!S:L40/___/} @{tags_model!S:L80/___/} .{extension}"', '-o', 'directory=["malena_morgan {category} pq"]', '-vv', '--range', '1-10', 'https://realbooru.com/index.php?page=post&s=list&tags=malena_morgan sort:score']
[gallery-dl][debug] Version 1.26.9
[gallery-dl][debug] Python 3.8.10 - Windows-10-10.0.22631-SP0
[gallery-dl][debug] requests 2.31.0 - urllib3 2.1.0
[gallery-dl][debug] Configuration Files ['C:\\Users\\mo-han\\locallib\\usr\\etc\\gallery-dl.json']
[gallery-dl][debug] Starting DownloadJob for 'https://realbooru.com/index.php?page=post&s=list&tags=malena_morgan sort:score'
[realbooru][debug] Using GelbooruV02TagExtractor for 'https://realbooru.com/index.php?page=post&s=list&tags=malena_morgan sort:score'
[cookies][debug] Extracting cookies from C:\Users\mo-han\AppData\Roaming\Mozilla\Firefox\Profiles\4n321zy5.default-release\cookies.sqlite
[cookies][info] Extracted 337 cookies from Firefox
[urllib3.connectionpool][debug] Starting new HTTPS connection (1): realbooru.com:443
[urllib3.connectionpool][debug] https://realbooru.com:443 "GET /index.php?page=dapi&s=post&q=index&tags=malena_morgan+sort%3Ascore&pid=0&limit=100 HTTP/1.1" 200 None
[urllib3.connectionpool][debug] https://realbooru.com:443 "GET /index.php?page=post&s=view&id=755451 HTTP/1.1" 200 None
[realbooru][debug] Active postprocessor modules: [ExecPP]
...
[urllib3.connectionpool][debug] https://realbooru.com:443 "GET /images/cc/9f/cc9f3177b8a185cb33760862cf05cad5.jpeg HTTP/1.1" 404 None
[downloader.http][warning] '404 Not Found' for 'https://realbooru.com/images/cc/9f/cc9f3177b8a185cb33760862cf05cad5.jpeg'
[download][error] Failed to download realbooru 2018-02-19 649682 cc9f3177b8a185cb33760862cf05cad5 $ @aurielee_summers malena_morgan .jpg
...
[urllib3.connectionpool][debug] https://realbooru.com:443 "GET /images/5c/1d/5c1d9cea962d56d1539d1799936c227f.jpeg HTTP/1.1" 404 None
[downloader.http][warning] '404 Not Found' for 'https://realbooru.com/images/5c/1d/5c1d9cea962d56d1539d1799936c227f.jpeg'
[download][error] Failed to download realbooru 2019-09-03 694959 5c1d9cea962d56d1539d1799936c227f $ @aurielee_summers malena_morgan morg .jpg
...
[urllib3.connectionpool][debug] https://realbooru.com:443 "GET /images/3b/2e/3b2e9d686748d3d1f71f606ed131ccf2.jpeg HTTP/1.1" 404 None
[downloader.http][warning] '404 Not Found' for 'https://realbooru.com/images/3b/2e/3b2e9d686748d3d1f71f606ed131ccf2.jpeg'
[download][error] Failed to download realbooru 2019-09-03 694958 3b2e9d686748d3d1f71f606ed131ccf2 $ @aurielee_summers malena_morgan morg .jpg
... |
@mikf |
Realbooru's extractor is broken, and generates 404ing image URLs.
The text was updated successfully, but these errors were encountered: