the overlapped identities between LFW and ms1m #24
Comments
Also, there are some overlapping identities between FaceScrub and MS1M. I downloaded the correspondence between MIDs and real names from Freebase. Please check the attachment.
We are running that experiment and the results will be available in our paper soon. Slightly worse, I think (<0.1).
I just wrote a script that checks for matches between the test persons (the subset of FaceScrub that is used in the MegaFace challenge) and the persons in the training set (your cleaned MS1M list). 54 of the 80 test persons are in both the training and test sets.
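The script itself was not posted in the thread. A minimal sketch of this kind of name-matching check, assuming both sides are available as plain lists of real names, could look like the following. The function names `normalize` and `find_overlap` and the normalization rules are hypothetical, not the commenter's actual code.

```python
import unicodedata


def normalize(name: str) -> str:
    """Lowercase, strip accents, and unify separators before comparing names."""
    decomposed = unicodedata.normalize("NFKD", name)
    # Drop combining marks so that accented and unaccented spellings match.
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Treat underscores and hyphens as spaces, then collapse repeated whitespace.
    return " ".join(no_accents.replace("_", " ").replace("-", " ").lower().split())


def find_overlap(test_names, train_names):
    """Return the sorted normalized test names that also occur in the training names."""
    train = {normalize(n) for n in train_names}
    return sorted({normalize(n) for n in test_names} & train)


if __name__ == "__main__":
    # Toy data for illustration only, not taken from either dataset.
    test_names = ["Zoë_Saldana", "Brad Pitt", "Nobody Here"]
    train_names = ["zoe saldana", "BRAD-PITT", "Someone Else"]
    overlap = find_overlap(test_names, train_names)
    print(f"{len(overlap)}/{len(test_names)} test identities found in the training list")
```

Exact matching on normalized names only gives a lower bound: aliases, stage names, and transliterations are missed, so the true overlap can be higher than what such a script reports.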
@azat-d I think it is also very difficult to find ALL overlaps by name matching.
Agreed. But according to my test the overlap is at least 67.5%. I don't trust any results that are based on celebrity datasets. The most reliable test is the NIST FRVT, which is free for all researchers.
@azat-d I have removed 500+ identities from MS1M by comparing it with the FaceScrub dataset in order to test on MegaFace. For reference, FaceScrub has only 530 identities in total. I believe our result is quite reliable.
The MegaFace test uses only 80 identities from FaceScrub, and I checked YOUR training list against those identities.
I found that 54 of those 80 identities are in both the test set and your training set.
I'm talking about this training set: https://pan.baidu.com/s/1eTn6O62
Do you mean that this list was cleaned further?
The 500+ identities were removed in my binary packed dataset, not in this clean list. You can check it in our paper; there is about a 0.3% performance drop (98.3% -> 98.0%).
Ok, thank you!
Great to hear that there are results on removing the overlapping identities. Thank you, guys. I will take a look at this too and may post an update here if I get any new results.
Closing, as this is well discussed here.
Awesome work!
As far as I know, there are some overlapping identities between LFW and MS1M. Has the clean list removed the overlapping identities? This may affect the performance on LFW.