referrer_title is empty, if the web site is not available #88
Comments
Ensure that URL is recorded if client throws exception when requestin…
Should be fixed in #89
Thanks @dantleech it works perfectly, as can be seen in this freshly crawled example, where
Would it be possible to somehow include in the output that the web site gave no response? Currently, I can't use
EDIT: I can probably check for
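With `referrer_title` available also for `status: null`, my dead link harvest can be increased by 10-15%, so is it worth considering a fresh release to include this new feature? If it is too much work for too little improvement, I can respect that.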
I might try and get some other fixes into the next release, otherwise I'll do it this weekend I guess
On 2 June 2019 22:49:01 CEST, gitressa wrote:
With `referrer_title` available also for `status: null`, my dead link
harvest can be increased by 10-15%, so is it worth considering a fresh
release to include this new feature? If it is too much work for too
little improvement, I can respect that.
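
For anyone wanting to do the same kind of dead link harvest, here is a minimal sketch of the filtering step. It assumes the crawl report is newline-delimited JSON with `url`, `status` and `referrer_title` fields, as described in this thread; the sample entries, the `dead_links` helper and the "400 or above counts as dead" rule are illustrative assumptions, not taken from Fink itself.

```python
import json


def dead_links(lines):
    """Yield (url, status, referrer_title) for entries that look dead.

    Assumption: one JSON object per line, with `url`, `status` and
    `referrer_title` keys. A missing or null `status` is treated as
    "the web site gave no response".
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between entries
        entry = json.loads(line)
        status = entry.get("status")
        # None: no response at all; 400 and above: an HTTP error status
        if status is None or status >= 400:
            yield entry.get("url"), status, entry.get("referrer_title")


# Hypothetical sample report, made up for illustration only.
sample = [
    '{"url": "http://example.com/missing", "status": 404, "referrer_title": "Links"}',
    '{"url": "http://gone.example/", "status": null, "referrer_title": "Old partners"}',
    '{"url": "http://example.com/", "status": 200, "referrer_title": null}',
]

results = list(dead_links(sample))
for url, status, title in results:
    print(f"{status}\t{url}\t(linked from: {title})")
```

With a real report, the same function could be fed an open file handle instead of the `sample` list, since it only iterates over lines.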
That sounds great, thanks. Let me know if you would like me to test any new features.
Thanks for releasing version 0.9.0, which adds I can now use
It looks like Fink doesn't include the `referrer_title` if the web site is no longer online, even though it might be quite useful.
Here is an example of a missing page where everything works as expected, since the web site is available. A `status: 404` is returned, and the `referrer_title` is included:
Here are a few examples where the web site is no longer online, and the `referrer_title` is not included in the result:
Other scenarios of missing referrer_title, like mistyped URL or time-out: