
pairwise_dist drops to 0, loss is near the margin and can't go down #38

Open
xiaomingdaren123 opened this issue Mar 16, 2019 · 9 comments


@xiaomingdaren123

Hi omoindrot,
I have encountered a problem: after training for a while, pairwise_dist drops to 0 and the loss sits near the margin and can't go down. When I visualize the training set embeddings, they are all clustered together. I don't know what caused it.
The learning rate is 0.0001, the network is VGG16, and the output dimension is 128. Data augmentation (random crop and horizontal flip) is used because of the small amount of data. This does not happen if I don't use data augmentation. I hope you can reply, thanks!
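
This is consistent with a fully collapsed embedding: if every input maps to the same point, all pairwise distances are 0 and every triplet term reduces to the margin, which is why the loss plateaus there. A minimal NumPy sketch of that arithmetic (not the repo's code; the margin and embedding size are placeholders):

```python
import numpy as np

# Collapsed case: every embedding is the same point, so d(a, p) = d(a, n) = 0
# and each triplet term is max(0 - 0 + margin, 0) = margin.
margin = 0.5                      # placeholder value
embeddings = np.zeros((3, 128))   # a "collapsed" anchor, positive, negative
anchor, positive, negative = embeddings

d_ap = np.linalg.norm(anchor - positive)  # 0.0
d_an = np.linalg.norm(anchor - negative)  # 0.0
print(max(d_ap - d_an + margin, 0.0))     # 0.5, i.e. exactly the margin
```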

@omoindrot
Owner

It looks like training is collapsing, so you may want to decrease your learning rate.

Or maybe use bigger batches to stabilize training. You can also monitor the average distance between embeddings to see how the collapse happens (suddenly or gradually).
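
A rough sketch of such a monitor in TensorFlow (illustrative only; `embeddings` is assumed to be the [batch_size, embed_dim] output of your network):

```python
import tensorflow as tf

def mean_pairwise_distance(embeddings):
    """Mean Euclidean distance over all pairs in the batch (zero diagonal included)."""
    # ||a - b||^2 = ||a||^2 - 2 a.b + ||b||^2
    dot = tf.matmul(embeddings, embeddings, transpose_b=True)
    squared_norms = tf.linalg.diag_part(dot)
    squared_dist = (tf.expand_dims(squared_norms, 1)
                    - 2.0 * dot
                    + tf.expand_dims(squared_norms, 0))
    squared_dist = tf.maximum(squared_dist, 0.0)  # numerical safety
    return tf.reduce_mean(tf.sqrt(squared_dist + 1e-16))

# Log it so a collapse (value -> 0) is visible in TensorBoard, e.g.:
# tf.summary.scalar('mean_embedding_distance', mean_pairwise_distance(embeddings))
```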

@xiaomingdaren123
Author

I didn't decrease the learning rate (it is 0.001); if I decrease it, training becomes slow. batch_size is set to 96. Can triplet loss be used directly for a classification task, or must the network first be pre-trained with a softmax loss?

@omoindrot
Owner

Maybe pre-training with a softmax loss could help.

@cyrusvahidi

cyrusvahidi commented Jul 14, 2019

Maybe pre-training with a softmax loss could help.

Hi Olivier,

I considered this approach to try learning some supervised representation from the data, then refining it with triplet learning. I have not been able to stabilise training an embedding solely using triplet loss.

Could you elaborate on the utility of the pre-training approach that you suggest?

@omoindrot
Owner

The pretraining approach is just to get a good embedding with a softmax loss, since this loss is very stable and you should be able to converge.

Once you have this good enough representation, the triplet loss may help to further separate the class clusters and get you better performance.

@cyrusvahidi


Is it also important to change the activation of the penultimate layer (the one producing the embedding, just before the label layer) to linear?

@omoindrot
Owner

Yes, so you have two steps (a rough sketch follows the list):

  1. Train with softmax loss. You have the network computing the embedding, then a linear layer with softmax activation
  2. Remove the linear layer. Train with triplet loss using the embedding only
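
A rough tf.keras sketch of these two steps (the repo itself uses tf.estimator; the VGG16 backbone, 128-d embedding and names below are placeholders matching the setup described in this thread, not the repo's code):

```python
import tensorflow as tf

num_classes = 10  # placeholder: number of classes in your dataset

# Step 1: softmax pre-training. The backbone computes the embedding, followed
# by a linear (classification) layer with softmax activation.
base = tf.keras.applications.VGG16(include_top=False, pooling='avg',
                                   weights=None, input_shape=(224, 224, 3))
embedding = tf.keras.layers.Dense(128, name='embedding')(base.output)  # linear, no ReLU
probs = tf.keras.layers.Dense(num_classes, activation='softmax')(embedding)
classifier = tf.keras.Model(base.input, probs)
classifier.compile(optimizer='adam', loss='sparse_categorical_crossentropy')
# classifier.fit(images, labels, ...)

# Step 2: remove the classification layer and keep only the embedding, then
# fine-tune with a triplet loss (e.g. batch_hard_triplet_loss from this repo),
# starting from the weights learned in step 1.
embedding_model = tf.keras.Model(base.input, embedding)
```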

@cyrusvahidi

OK, thanks. I mainly wanted to ask whether the embedding's activation should be linear instead of ReLU, which I have seen mentioned before.

@omoindrot
Owner

omoindrot commented Jul 15, 2019 via email
