Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

first captcha will be filled automatically #36

Closed
wants to merge 1 commit into from
Closed

first captcha will be filled automatically #36

wants to merge 1 commit into from

Conversation

cmaspi
Copy link

@cmaspi cmaspi commented Jan 26, 2022

closes #35
For the gif included in the issue, I've used the extension with this update.
The extension won't work on https://aims.iith.ac.in/aims/login/loginHome which appears once a user either logs out of their current session and is taken to the given url, or the session fails to load for some reason.
The reason being /login/loginHome is also intermediate page (one with the second captcha)

@angad11121
Copy link

2nd can be filled with OCR

@cmaspi
Copy link
Author

cmaspi commented Jan 26, 2022

2nd can be filled with OCR

I tried using pytesseract (code), it gives rather unsatisfactory results. I tested it for 3 captchas, it gave correct for only 1. A neural net trained specifically for the given dataset would surely outperform pretrained OCR.
I have obtained the dataset from aims using a script to download 30k captcha images from AIMS and then separated them in form of individual characters, the dataset can be found here. However, I have not started work on CNN as of yet.

@cmaspi
Copy link
Author

cmaspi commented Feb 15, 2022

@RachitKeertiDas

@RachitKeertiDas
Copy link
Contributor

Hi,

So, an ideal solution would definitely be filling both endpoints.
Also, the PR introduces content_scripts. This would also require us to have additional justification while uploading our extension, and additional review time.
Another small nitpick is that once a captcha expires, we need to refill it. But that can be handled later.
There are also possibly some administrative issues in adding this functionality officially, but I think they can be worked around.

For filling the second captcha, we can see later. Ideally, we would like all our code to be bundled together, and not call an external server to just infer the captcha from the image for security reasons. So, you might have to look into solutions such as tensorflow.js to allow inference in js in the bundled extension itself.

Good work on obtaining the dataset though.

P.S. I'm not really an active maintainer for this repository, so my suggestionss might not be of much value in getting the PR Merged.

@cmaspi cmaspi closed this by deleting the head repository Dec 22, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Filling first captcha
3 participants