feat(links): scan threats with Google Safe Browsing #376

LoneRifle · 2020-08-07T02:46:41Z

Problem

The Virus Scan API for websites provided by Cloudmersive was far
too aggressive, declaring Google Drive links and even a well-known
international organisation as threats. We need a more effective service
that accurately classifies websites that our users link to.

Fixes #377

Solution

Replace Cloudmersive's URL scanning with Google Safe Browsing,
which is more nuanced and more consistent with the browsing experience
for most users, who are probable Chrome users.

Implement SafeBrowsingService, which vets a given URL against threat
lists maintained by Google. If we have a match, log the match and
declare the URL threat
Drop SafeBrowsingService into inversify, and wire that into
UrlCheckController

Deploy Notes

New environment variables:

SAFE_BROWSING_KEY - the Google Cloud API key for Safe Browsing APIs
REDIS_SAFE_BROWSING_URI - Redis URI for caching Safe Browsing threat matches

src/server/services/SafeBrowsingService.ts

liangyuanruo

Suggest soft launching so that we don't have to rollback in case of issues. Also, before a full rollout, we're required to implement caching to avoid spamming which could easily happen in the case of a frustrated user.

yong-jie · 2020-08-10T14:36:26Z

Will we need to modify our warnings to satisfy Google's guidelines? They list 3 things to fulfil

LoneRifle · 2020-08-11T02:10:15Z

Will we need to modify our warnings to satisfy Google's guidelines? They list 3 things to fulfil

I think we already fulfill the first of the three ("Link is likely to be malicious"). Can I propose that we just tweak the message so that the user gets in touch with us via e-mail if this happens? We would likely be proactive in getting in touch with the user anyway, so we can fulfill the other two parts of the guidelines manually

src/server/inversify.config.ts

liangyuanruo

lgtm!

Replace Cloudmersive's URL scanning with Google Safe Browsing, which is more nuanced and more consistent with the browsing experience for most users, who are probable Chrome users. - Implement SafeBrowsingService, which vets a given URL against threat lists maintained by Google. If we have a match, log the match and declare the URL threat - Drop SafeBrowsingService into inversify, and wire that into UrlCheckController New env vars: `SAFE_BROWSING_KEY` - the Google Cloud API key for Safe Browsing APIs

Cache responses from Google Safe Browsing in Redis, expiring the entries after the duration specified in the response - Declare and implement SafeBrowsingRepository and supporting cast - Rework SafeBrowsingService to retrieve entries from Redis, if absent, query Safe Browsing, adding any matches returned - Drop everything in via inversify

LoneRifle requested a review from liangyuanruo August 7, 2020 02:46

liangyuanruo reviewed Aug 7, 2020

View reviewed changes

src/server/services/SafeBrowsingService.ts Outdated Show resolved Hide resolved

liangyuanruo approved these changes Aug 7, 2020

View reviewed changes

LoneRifle force-pushed the feat/links/safe-browsing branch from 2f36a2b to 3ca6647 Compare August 7, 2020 10:19

LoneRifle requested a review from liangyuanruo August 7, 2020 12:07

liangyuanruo reviewed Aug 11, 2020

View reviewed changes

src/server/inversify.config.ts Show resolved Hide resolved

liangyuanruo approved these changes Aug 11, 2020

View reviewed changes

LoneRifle added 3 commits August 11, 2020 13:25

feat(links): allow Safe Browsing threats to only be logged

a57c194

LoneRifle force-pushed the feat/links/safe-browsing branch from 3ca6647 to c5c7e96 Compare August 11, 2020 05:25

LoneRifle merged commit ee33d62 into develop Aug 11, 2020

LoneRifle deleted the feat/links/safe-browsing branch August 11, 2020 05:45

halfwhole mentioned this pull request Sep 30, 2022

Feat/bulk backend/functionality #1993

Merged

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(links): scan threats with Google Safe Browsing #376

feat(links): scan threats with Google Safe Browsing #376

LoneRifle commented Aug 7, 2020 •

edited

Loading

liangyuanruo left a comment

yong-jie commented Aug 10, 2020

LoneRifle commented Aug 11, 2020

liangyuanruo left a comment

feat(links): scan threats with Google Safe Browsing #376

feat(links): scan threats with Google Safe Browsing #376

Conversation

LoneRifle commented Aug 7, 2020 • edited Loading

Problem

Solution

Deploy Notes

liangyuanruo left a comment

Choose a reason for hiding this comment

yong-jie commented Aug 10, 2020

LoneRifle commented Aug 11, 2020

liangyuanruo left a comment

Choose a reason for hiding this comment

LoneRifle commented Aug 7, 2020 •

edited

Loading