tflite interpreter for runtime inference #48

Closed
cpmpercussion opened this issue Aug 23, 2024 · 0 comments · Fixed by #50
Labels
enhancement New feature or request

Comments

@cpmpercussion (Owner)

With the capacity to save models as .tflite files, we have experimented with making predictions using TensorFlow Lite.
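
For reference, a minimal sketch of how predictions can be made with the TF Lite interpreter; the model path here is a placeholder (not the actual model file), and the crude timing line is just to illustrate the kind of per-prediction measurement reported below.

```python
import time

import numpy as np
import tensorflow as tf

# Load a saved .tflite model and allocate its tensors (the path is a placeholder).
interpreter = tf.lite.Interpreter(model_path="models/example-mdrnn.tflite")
interpreter.allocate_tensors()
input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()

# One prediction: write the input tensor, invoke the interpreter, read the output back.
frame = np.zeros(input_details[0]["shape"], dtype=input_details[0]["dtype"])
interpreter.set_tensor(input_details[0]["index"], frame)

start = time.perf_counter()
interpreter.invoke()
prediction = interpreter.get_tensor(output_details[0]["index"])
elapsed_ms = (time.perf_counter() - start) * 1000
print(f"output shape: {prediction.shape}, inference time: {elapsed_ms:.2f} ms")
```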

On a powerful laptop, the current Keras model inference averages 4.96 ms per prediction (median 4.98 ms) over 3200 predictions. Over 10000 predictions with a 9d 's'-size network, the tflite inference runner averages 0.17 ms per prediction.

29.17x improvement!

Far too much performance improvement to leave on the table. This issue proposes building out the tflite inference engine so that it is used automatically when a user specifies a .tflite file in config.toml.
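
A sketch of what that dispatch could look like; the config key and file layout are hypothetical here, just to illustrate choosing the backend from the model file's extension.

```python
from pathlib import Path
import tomllib  # Python 3.11+; older versions can use the third-party "toml" package

import tensorflow as tf


def load_predictor(config_path: str = "config.toml"):
    """Pick the inference backend based on the model file's extension."""
    with open(config_path, "rb") as f:
        config = tomllib.load(f)
    # "model.file" is a hypothetical key; adjust to the real config.toml layout.
    model_file = Path(config["model"]["file"])
    if model_file.suffix == ".tflite":
        # Use the TF Lite interpreter sketched above.
        interpreter = tf.lite.Interpreter(model_path=str(model_file))
        interpreter.allocate_tensors()
        return interpreter
    # Otherwise fall back to the existing Keras inference path.
    return tf.keras.models.load_model(str(model_file))
```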

If tflite inference behaves similarly to the Keras model, we should make this the default.

Thanks to Scott H for helping to surface this.

@cpmpercussion cpmpercussion added the enhancement New feature or request label Aug 23, 2024