[bunkr] Broken extractor #5151

Yakabuff · 2024-02-02T06:26:07Z

bunkr now inserts different TLDs into every URL, which breaks the extractor.

Currently, we assume that each URL in a bunkr album either

- uses the same TLD as the album (it starts with /), so we need to extract the CDN URL from the HTML, or
- links directly to the CDN, so the file can be downloaded as-is.

We will need to add a case where the URL in the album does not start with /, is not a CDN URL, but still matches the base pattern.
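To make the three cases concrete, here is a minimal sketch of how they could be told apart. It is not the extractor's actual code: `classify`, `BASE_PATTERN` and `CDN_HOST` are hypothetical names, the base pattern is a simplified stand-in for the real one, and recognising CDN hosts by a `cdn` or `media-files` prefix is an assumption.

```python
import re
from urllib.parse import urlsplit

# Assumption: simplified stand-in for the extractor's base pattern.
BASE_PATTERN = re.compile(r"(?:https?://)?(?:www\.)?bunkr\.[a-z]+(?=/|$)")

# Assumption: CDN hosts start with "cdn" or "media-files".
CDN_HOST = re.compile(r"^(?:cdn|media-files)[\w-]*\.")


def classify(url):
    """Return which album URL case `url` falls into."""
    if url.startswith("/"):
        # Same TLD as the album: the CDN URL must be scraped from the HTML.
        return "relative"
    host = urlsplit(url).netloc
    if CDN_HOST.match(host):
        # Direct CDN link: the file can be downloaded as-is.
        return "cdn"
    if BASE_PATTERN.match(url):
        # New case: absolute file-page URL on a (possibly different) bunkr TLD.
        return "page"
    return "unknown"
```

Under these assumptions, a URL such as `https://bunkr.sk/v/abc` found in an album hosted on another TLD would fall into the new `"page"` case and still need its CDN URL extracted from the file page.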


Yakabuff · 2024-02-02T07:05:01Z

                     else:
                         domain = domain.replace("cdn", "media-files", 1)
                     url = urlunsplit((scheme, domain, path, query, fragment))
+                else:
+                    scheme, domain, path, query, fragment = urlsplit(url)
+                    try:
+                        url = self._extract_file(text.unescape(path))
+                    except Exception as exc:
+                        self.log.error("%s: %s", exc.__class__.__name__, exc)
+                        continue


Yakabuff mentioned this issue Feb 2, 2024

[bunkr] Fix extractor #5153

Closed

mikf added the site:change label Feb 11, 2024

mikf added a commit that referenced this issue Feb 11, 2024

[bunkr] fix extraction (#5088, #5151, #5153)

06cb518

- remove legacy code
- map legacy domains to bunkr.sk
- use input URL domain for newer domains
- update tests (some files got slightly modified or deleted)

mikf closed this as completed Feb 11, 2024
