feat(clip): add linear probe evaluation script #960

SauravMaheshkar · 2024-08-28T19:57:32Z

Adds a script to perform linear probe evaluation using the mlx.data module for data loading. Mostly a mirror of the Linear-probe evaluation script from the official CLIP repository.

References

CLIP: preprocess #959

angeloskath

Thanks for the addition. I would rename the script to linear_probe.py and add the eval. I am wondering if it would be nicer (since it is an example after all) to train a logistic regression model in MLX instead of using scikit-learn.

angeloskath · 2024-08-29T01:46:19Z

clip/eval.py

+    all_features = []
+    all_labels = []
+
+    for _, batch in enumerate(iter):


Why enumerate? Possibly tqdm would be nice.

Fixed in dee703e

angeloskath · 2024-08-29T01:48:19Z

clip/eval.py

+    tr_iter = tr.to_stream().batch(batch_size)
+
+    test = load_cifar10(root=root, train=False)
+    test_iter = test.to_stream().batch(batch_size)


Since it is a very small in-memory dataset, the stream is unnecessary here. I'd keep it as a buffer so that it is nicer and we have a len as well.

Namely,

train_iter = load_cifar10(root=root).batch(batch_size)

Fixed in dee703e

angeloskath · 2024-08-29T01:49:18Z

clip/eval.py

+
+        image_embeds = model.get_image_features(x)
+        all_features.append(image_embeds)
+        all_labels.append(y)


You need an mx.eval(image_embeds) at some point; otherwise you just build one huge graph for the GPU to compute all at once, which leads to memory problems.

angeloskath · 2024-08-29T01:55:21Z

Another thing to consider would be to have two commands in the linear_probe.py script: one that extracts features and saves them in a safetensors file, and another that trains the logistic regression classifier given that file. The first part might be generally useful, for instance for extracting CLIP features for a dataset.
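One way the two-command layout could look is sketched below with `argparse` subcommands. The command names (`extract`, `train`), the flags and the default file name are assumptions for illustration and are not taken from the PR. The handlers are stubs that only report what they would do; the comments note where `mx.save_safetensors` and `mx.load` would fit.

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser(prog="linear_probe.py")
    sub = parser.add_subparsers(dest="command", required=True)

    # Hypothetical "extract" command: compute CLIP features and save them.
    extract = sub.add_parser("extract", help="extract and save image features")
    extract.add_argument("--output", default="features.safetensors")
    extract.add_argument("--batch-size", type=int, default=256)

    # Hypothetical "train" command: fit the classifier on saved features.
    train = sub.add_parser("train", help="train a classifier on saved features")
    train.add_argument("--features", default="features.safetensors")
    return parser


def run_extract(args):
    # A real implementation would compute the features here and then call
    # mx.save_safetensors(args.output, {"features": ..., "labels": ...}).
    return f"extract -> {args.output}"


def run_train(args):
    # A real implementation would load the arrays with mx.load(args.features)
    # and fit the logistic regression classifier on them.
    return f"train <- {args.features}"


def main(argv=None):
    args = build_parser().parse_args(argv)
    handlers = {"extract": run_extract, "train": run_train}
    return handlers[args.command](args)


print(main(["extract", "--output", "cifar10.safetensors"]))
print(main(["train", "--features", "cifar10.safetensors"]))
```

Keeping extraction separate means the expensive forward passes run once, and the classifier can be retrained on the saved file as often as needed.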

SauravMaheshkar · 2024-09-25T18:48:53Z

> Thanks for the addition. I would rename the script to linear_probe.py and add the eval. I am wondering if it would be nicer (since it is an example after all) to train a logistic regression model in MLX instead of using scikit-learn.

What I had in mind when submitting this PR was to showcase similar performance between the official implementation and the MLX port. Adding an MLX implementation of logistic regression seems like a nice idea, but IMO it should reside in a different directory. Maybe another misc/ or core/ directory that contains implementations of various fundamental models.

SauravMaheshkar · 2024-10-22T13:49:55Z

Gentle ping @angeloskath

feat(clip): add linear probe evaluation script

213a950

angeloskath requested changes Aug 29, 2024

feat: simplify data handling

dee703e

SauravMaheshkar requested a review from angeloskath September 27, 2024 16:22

Fix linear probe script

6d13a14

angeloskath approved these changes Oct 25, 2024

angeloskath merged commit 4971462 into ml-explore:main Oct 25, 2024
1 of 2 checks passed
