Skip to content

Releases: roboflow/inference

v0.36.1

31 Jan 16:17
36780c7
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.36.0...v0.36.1

v0.36.0

31 Jan 11:46
59aa9f9
Compare
Choose a tag to compare

🚀 Added

🏋 Workflows Batch Processing in Roboflow cloud

We’re thrilled to introduce early access for Workflows Batch Processing in Roboflow Cloud! Now, you can process entire directories of images or video files using your Workflows - no coding required!

⚡ Feature characteristics

  • Asynchronous batch processing – You kick of a job that run in the background, and you can retrieve results later
  • High throughput - Feature is designed to process a lot of data - so if your machine is not capable enough to process the data you have - this solution may be for you 🎯
  • Fully managed – Runs on our cloud, so you don’t have to worry about infrastructure.
  • No UI yet, but CLI ready – Use the updated inference-cli to access the feature.

📺 Check out the demo to see it in action!

This is an alpha release, and we’re inviting a small group of early adopters to test it out. Let us know if you’d like to be among the first to try it! 🚀

batch_processing_short_demo.mp4

🔍 Select Bounding Boxes Contained Within a Specific Class

We’ve heard your feedback! A common challenge in object detection is distinguishing objects contained within another object - and we’ve got you covered.

For example:

  • 🚗 People inside a car vs. standing on the street
  • 📦 Products on a store shelf vs. misplaced items
  • 🔍 Defects within a specific component vs. background noise

We’re excited to announce a new Workflows operation that lets you filter bounding boxes based on spatial containment within a selected class. This makes it easier than ever to refine detections and extract the insights that matter most!

new_detections_filter.mp4

Check the details here

Auto mode in Stitch OCR Detections Block

Previously, users needed to manually specify the direction of text (e.g., left-to-right, top-to-bottom) before stitching. With the new "auto" mode, text direction is automatically detected as either left-to-right or top-to-bottom.

image

Check the details of @chandlersupple change here

💪 Improved

Full Changelog: v0.35.0...v0.36.0

v0.35.0

24 Jan 20:58
adfca01
Compare
Choose a tag to compare

🚀 Added

PLC Modbus TCP Block

This new block brings Modbus TCP communication to Workflows. The block uses the pymodbus library to read from and write to PLC registers over Modbus TCP. It supports three modes of operation:

read: Reads specified registers from a PLC.
write: Writes specified values to PLC registers.
read_and_write: Performs both reading and writing operations in one execution.

Adding Modbus TCP support allows workflows to interact with a broader range of PLCs and industrial devices that use the Modbus protocol.

Velocity block

New analytics block was added to Workflows to calculate and embed velocities and speeds of tracked objects across video frames.

Key Features:

  • Velocity Calculation: Computes raw and smoothed velocities based on object movements between frames.
  • Speed Calculation: Determines the magnitude of velocity vectors to obtain speed.

This block supports smoothing of velocity measurements using an exponential moving average with configurable smoothing_alpha.
Incorporated unit conversion from pixels to meters via the pixels_per_meter parameter

Add support to run ResNet Classification Model in Inference

The recent update to the Roboflow Inference repository introduces support for ResNet classification models, as described in the seminal paper "Deep Residual Learning for Image Recognition" (He et al., 2015, [arXiv:1512.03385](https://arxiv.org/abs/1512.03385)). This integration enables users to leverage the powerful ResNet architecture for image classification tasks, enhancing the model options available within the inference engine and expanding its utility for diverse computer vision applications.

💪 Improved

Multiple changes enhancing Inference documentation!

Improvements to Workflows Blocks

Security improvements

Improvements to webcam workflow preview

Other changes

Full Changelog: v0.34.0...v0.35.0

v0.34.0

17 Jan 17:19
6b987a7
Compare
Choose a tag to compare

🚀 Added

Introducing Stability AI Image Generation Block v1 🖌️✨

Your gateway to limitless creativity with Stability AI! Block leverages Stability AI’s robust API with an easy-to-use interface. Just plug in your Stability AI API key, and you’re ready to go!

Main features:

  • Text-to-Image Magic - Generate entirely new images from text prompts in seconds.
  • Image Variations Made Easy - Start with an image and let the block transform it into captivating variations. Adjust the influence of your input image with precise control (strength parameter).
  • Positive Prompts: Describe what you want to see.
  • Negative Prompts: Specify what you don’t want to include.
  • Model Selection: Choose from cutting-edge models (core, ultra, sd3) to best suit your creative needs.

💪 Improved

@hansent contributions enhancing Inference documentation!

⚡ Support for Roboflow Instant Models

Roboflow Instant Models are now supported in Inference! While this feature is part of the broader Roboflow Instant Models initiative, Inference now includes the ability to load these models seamlessly.
Roboflow Instant Models leverage the power of box prompting, utilizing your entire dataset as prompts during inference for enhanced performance and smarter predictions.

Other changes

New Contributors

Full Changelog: v0.33.0...v0.34.0

v0.33.0

10 Jan 22:35
46621f8
Compare
Choose a tag to compare

🚀 Added

Llama Vision 3.2 🤝 other VLMs supported in Workflows

We welcome new block bringing Llama Vision 3.2 to workflows ecosystem!

Llama 3.2 is a new generation of vision and lightweight models that fit on edge devices, tailored for use cases that require more private and personalized AI experiences.

Brought by @AHB102 in #866

Related changes:

MQTT Writer Enterprise Workflow Block (added in #930)

This block enables our enterprise users to publish messages to an MQTT broker through Workflows.

MQTT (Message Queuing Telemetry Transport) is a lightweight messaging protocol designed for low-bandwidth, high-latency, or unreliable networks. It's widely used in applications where devices need to communicate with minimal overhead, such as the Internet of Things (IoT).

With this change workflows can communicate with the world through MQTT!

Change introduced by @chandlersupple

Plc EthernetIP Enterprise Workflow Block (added in #905)

This block enables our enterprise users to interface with PLC.
A Programmable Logic Controller (PLC) is an industrial computer specifically designed to automate machinery and processes in manufacturing and other industries. It monitors inputs (e.g., sensors), processes data based on a programmed logic, and controls outputs (e.g., actuators) to perform tasks.
This block is utilizing pylogix library over Ethernet/IP. Block supports three modes of operation:

  • read: Reads specified tags from the PLC.
  • write: Writes specified values to the PLC tags.
  • read_and_write: Performs both read and write operations in a single execution.

This change brings vision capabilities into real-world industrial plants!

Change introduced by @reedajohns

💪 Improved

Documentation improvements

@yeldarby transforms Inference docs with streamlined navigation, styling, and instant rendering!

More contributions enhancing Inference documentation:

Improvements to CI by @alexnorell

Other changes

New Contributors

Full Changelog: v0.32.0...v0.33.0

v0.32.0

20 Dec 15:49
03494f6
Compare
Choose a tag to compare

🚀 Added

👀 Gaze detection in Workflows

@joaomarcoscrs (as part of hist first contribution 🏅) introduced Gaze detection model into Workflows.

Don't know what Gaze Detection is?

Gaze detection is a method to determine where a person is looking by analyzing their eye movements and gaze direction. It typically uses cameras or sensors to track eye position and orientation, identifying the point of focus in real time.

It is commonly used in areas like:

  • Human-Computer Interaction: Controlling devices with eye movements.
  • Behavioral Analysis: Understanding attention and interest.
  • Marketing Research: Measuring what catches a person's attention.

Now - you can apply Workflows in such use-cases. Check out Gaze Detection block 📖 documentation to find more information.

Note

The block is currently not supported on Roboflow Hosted Platform. Check out how to send requests to localhost inference server

🏋️‍♂️ New experimental Workflows blocks enabling new capabilities

@yeldarby prepared whole series of blocks to open-up new capabilities for Workflows, including:

💪 Improved

Florence 2 runs now up to 3x faster

🧙‍♂️ @isaacrob-roboflow did some magic 🪄 and now, all of the sudden Florence2 models deployed in inference could run up to 3x faster 🤯 ❗
See details in #885

🔧 Fixed

Security vulnerability in landing page

We've fixed security issue in inference server landing page: #890

Issue description

If a Next.js application is performing authorization in middleware based on pathname, it was possible for this authorization to be bypassed.
This issue was patched in Next.js 14.2.15 and later.

Caution

We advise all users of older versions of inference server to migrate to version 0.32.0

Other fixes

🚧 What's Changed

🏅 New Contributors

Full Changelog: v0.31.1...v0.32.0

v0.31.1

13 Dec 18:52
be906b2
Compare
Choose a tag to compare

🔧 Fixed

Full Changelog: v0.31.0...v0.31.1

v0.31.0

13 Dec 18:27
ac4787d
Compare
Choose a tag to compare

🚀 Added

📏 Easily create embeddings and compare them in Workflows

Thanks to @yeldarby, we have Clip Embedding and Cosine Similarity Workflows blocks. Just take a look what would now be possible.

💡 Application ideas

  • Visual Search: Match text queries (e.g., "red shoes") to the most relevant images without training a custom model.
  • Image Deduplication: Identify similar or duplicate images by calculating embeddings and measuring cosine similarity.
  • Zero-Shot Classification: Classify images into categories by comparing their embeddings to pre-defined text labels (e.g., "cat," "dog").
  • Outliers detection: Check which images do not match to general trend

gemini-2.0-flash 🤝 Workflows

Check out model card and start using new model, simply pointing new model type in Google Gemini Workflow block 😄 All thanks to @EmilyGavrilenko

🔥 Recent supervision versions are now supported

For a long time we had issue with not supporting up-to-date supervision releases. This is no longer the case thanks to @LinasKo and his contribution #881 🙏

🐕‍🦺 React on changes in Workflows

We have new Delta Filter block that optimizes workflows by triggering downstream steps only when input values change, reducing redundant processing.

📊 Key Features:

  • Value Changes Detection: Triggers actions only on value changes.
  • Flexibility: Hooks up to changes in numbers, strings, and more.
  • Per-Video Caching: Tracks changes using - changes for each video stream or batch element would be traced separately

💡 Use Case:

  • Detect changes (e.g., people count) in video analysis and trigger downstream actions efficiently.

🔧 Fixed

  • confidence threshold was not applied for multi-label classification models. @grzegorz-roboflow fixed the problem in #873
  • Active Learning Data collection finally works for multi-label classification models - see @grzegorz-roboflow work in #874
  • Fixed model_id bug with InferenceAggregator block by @robiscoding in #876
  • Security issue: nanoid from 3.3.7 to 3.3.8 - see #878
  • Fix measurement logic for segmentations in measurement block by @NickHerrig in #872

🚧 Changed

New Contributors

Full Changelog: v0.30.0...v0.31.0

v0.30.0

11 Dec 13:06
25aa233
Compare
Choose a tag to compare

🚀 Added

✨ Paligemma2 support!

Enhanced model support: We’re excited to introduce Paligemma2 integration, a next-generation model designed for more flexible and efficient inference. This upgrade facilitates smoother handling of multi-modal inputs like images and captions, offering better versatility in machine learning applications. Check out the implementation details and examples in this script to see how to get started.

Change added by @probicheaux in #864

Remaining changes

Full Changelog: v0.29.2...v0.30.0

v0.29.2

05 Dec 18:00
6abcb3c
Compare
Choose a tag to compare

ultralytics security issue fixed

Caution

Ultralytics maintainers notified the community, that code in the ultralytics wheel 8.3.41 is not what's in GitHub and appears to invoke mining. Users of ultralytics who install 8.3.41 will unknowingly execute an xmrig miner.
Please see this issue for more details

Remaining fixes

Full Changelog: v0.29.1...v0.29.2