Setup ORT for Text and Images #139

HAKSOAT · 2024-10-30T02:36:49Z

Main Changes

This pull requests made the following changes:

Removed dependence on FastEmbed to use ORT solely
Added support for ClipViT Text model
Implements Preprocessors and Postprocessors for Image inputs
Implements Preprocessors and Postprocessors for Text inputs
Brushed up the Python demo file

On removing dependence on FastEmbed

This is because ORT was used in the implementation for ClipViT, meaning other models could as well just use ORT. The reason for using ORT is because FastEmbed takes away control on how preprocessing works from us, taking raw texts and file names as inputs. Fastembed uses ORT under the hood and a lot of their utility methods are private, so can't be called directly.

github-actions · 2024-10-30T02:38:38Z

Test Results

153 tests 153 ✅ 2m 39s ⏱️
8 suites 0 💤
2 files 0 ❌

Results for commit 4f7d123.

♻️ This comment has been updated with latest results.

github-actions · 2024-10-30T03:00:54Z

Benchmark Results

group                                                        main                                   pr
-----                                                        ----                                   --
store_batch_insertion_without_predicates/size_100            1.00    823.5±5.88µs        ? ?/sec    1.00    825.4±4.85µs        ? ?/sec
store_batch_insertion_without_predicates/size_1000           1.02      9.4±0.13ms        ? ?/sec    1.00      9.2±0.08ms        ? ?/sec
store_batch_insertion_without_predicates/size_10000          1.10     94.7±0.66ms        ? ?/sec    1.00     85.9±3.67ms        ? ?/sec
store_batch_insertion_without_predicates/size_100000         1.02   855.9±72.67ms        ? ?/sec    1.00   838.0±76.24ms        ? ?/sec
store_retrieval_no_condition/size_100                        1.00      2.6±0.05ms        ? ?/sec    1.04      2.7±0.06ms        ? ?/sec
store_retrieval_no_condition/size_1000                       1.00     18.7±0.13ms        ? ?/sec    1.04     19.4±0.32ms        ? ?/sec
store_retrieval_no_condition/size_10000                      1.00    186.0±1.60ms        ? ?/sec    1.03    192.5±3.27ms        ? ?/sec
store_retrieval_no_condition/size_100000                     1.00  1893.7±30.97ms        ? ?/sec    1.01  1918.0±20.76ms        ? ?/sec
store_retrieval_non_linear_kdtree/size_100                   1.00      2.8±0.02ms        ? ?/sec    1.03      2.8±0.03ms        ? ?/sec
store_retrieval_non_linear_kdtree/size_1000                  1.00     20.7±0.25ms        ? ?/sec    1.02     21.1±0.19ms        ? ?/sec
store_retrieval_non_linear_kdtree/size_10000                 1.00    206.3±3.98ms        ? ?/sec    1.01    208.9±2.44ms        ? ?/sec
store_retrieval_non_linear_kdtree/size_100000                1.00       2.1±0.03s        ? ?/sec    1.00       2.1±0.03s        ? ?/sec
store_sequential_insertion_without_predicates/size_100       1.01  1715.6±32.70µs        ? ?/sec    1.00  1698.1±35.04µs        ? ?/sec
store_sequential_insertion_without_predicates/size_1000      1.00     17.5±0.35ms        ? ?/sec    1.04     18.2±0.42ms        ? ?/sec
store_sequential_insertion_without_predicates/size_10000     1.00    179.4±2.77ms        ? ?/sec    1.01    181.8±2.92ms        ? ?/sec
store_sequential_insertion_without_predicates/size_100000    1.00  1693.0±37.25ms        ? ?/sec    1.10  1858.7±28.70ms        ? ?/sec

deven96 · 2024-11-15T23:35:59Z

ahnlich/types/src/ai/preprocess.rs

@@ -16,6 +17,7 @@ pub enum StringAction {
 pub enum ImageAction {
    ResizeImage,
    ErrorIfDimensionsMismatch,
+    ModelPreprocessing


Switching to using ModelProcessing and SkipProcessing

ahnlich/ai/src/engine/ai/providers/processors/preprocessor.rs

ahnlich/ai/src/engine/ai/providers/processors/rescale.rs

Rough setup for ClipVit Text

0024610

Ran clippy and typegen

a192e72

HAKSOAT marked this pull request as draft October 30, 2024 03:49

I got the models to work!

40cb9ce

deven96 reviewed Nov 15, 2024

View reviewed changes

ahnlich/ai/src/engine/ai/providers/processors/preprocessor.rs Outdated Show resolved Hide resolved

deven96 reviewed Nov 16, 2024

View reviewed changes

ahnlich/ai/src/engine/ai/providers/processors/preprocessor.rs Outdated Show resolved Hide resolved

deven96 reviewed Nov 16, 2024

View reviewed changes

ahnlich/ai/src/engine/ai/providers/processors/rescale.rs Outdated Show resolved Hide resolved

HAKSOAT added 3 commits November 24, 2024 07:36

Set up ORT for Text, ORT for Image yet to run

0a10b48

Postprocessor working on images

b405579

Preprocessors and Postprocessors now work

92cddd2

HAKSOAT marked this pull request as ready for review November 26, 2024 10:17

HAKSOAT force-pushed the feat/clipvittext branch from de724a6 to 7f58064 Compare November 26, 2024 10:38

HAKSOAT changed the title ~~Rough setup for ClipVit Text~~ Setup ORT for Text and Images Nov 26, 2024

Ran type gen

66b968b

HAKSOAT force-pushed the feat/clipvittext branch from 7f58064 to 66b968b Compare November 26, 2024 10:53

HAKSOAT and others added 3 commits November 26, 2024 11:23

Removed fastembed

cf5ae28

Got rid of some mutexing as we want to lock as little as possible

58a657c

Fix OnnxOutputTransform for Resnet50

9f52737

deven96 force-pushed the feat/clipvittext branch from 0b9d9d2 to 9f52737 Compare November 26, 2024 20:34

deven96 added 2 commits November 26, 2024 21:36

Formatting Python files

033487e

Fix test_ai_store_binary_actions

bf516b0

deven96 force-pushed the feat/clipvittext branch from a555038 to bf516b0 Compare November 26, 2024 20:46

deven96 added 4 commits November 26, 2024 21:55

Fix test_set_in_store_parse

aab36d6

Fixing python tests with previous preprocess modes

c90b72e

Merge branch 'main' into feat/clipvittext

e8b65a9

Fixing python tests with previous preprocess modes on merge

4f7d123

deven96 approved these changes Nov 26, 2024

View reviewed changes

deven96 merged commit b6a1769 into main Nov 26, 2024
5 checks passed

deven96 deleted the feat/clipvittext branch November 27, 2024 16:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Setup ORT for Text and Images #139

Setup ORT for Text and Images #139

HAKSOAT commented Oct 30, 2024 •

edited

Loading

github-actions bot commented Oct 30, 2024 •

edited

Loading

github-actions bot commented Oct 30, 2024 •

edited

Loading

deven96 Nov 15, 2024 •

edited

Loading

Setup ORT for Text and Images #139

Setup ORT for Text and Images #139

Conversation

HAKSOAT commented Oct 30, 2024 • edited Loading

Main Changes

On removing dependence on FastEmbed

github-actions bot commented Oct 30, 2024 • edited Loading

Test Results

github-actions bot commented Oct 30, 2024 • edited Loading

Benchmark Results

deven96 Nov 15, 2024 • edited Loading

Choose a reason for hiding this comment

HAKSOAT commented Oct 30, 2024 •

edited

Loading

github-actions bot commented Oct 30, 2024 •

edited

Loading

github-actions bot commented Oct 30, 2024 •

edited

Loading

deven96 Nov 15, 2024 •

edited

Loading