Export snapshot as variable (workaround to ring-mqtt) #75

helicalchris · 2024-10-25T18:58:12Z

Is your feature request related to a problem? Please describe.
TL;DR not really a problem, more how ring-mqtt works, but this addon could help with these problems.

As per the discussions in my recent issue ring-mqtt doesn't present cameras quite how this addon would prefer. The video is an rtsp but the entity_image, and snapshot images, are not real time. They are periodic images taken by the ring platform and not linked to real time viewing or motion events. This means that streaming support in this addon doesn't work with ring-mqtt. Ce la vie.

However it also means that you can't attach a snapshot from the analysed video in any notifications. The only place that has that video is this plugin and the file it reads from (ie it isn't in any HA entity)

Describe the solution you'd like
To return one of the analysed frames (first probably) optionally in the response variable or another variable

Describe alternatives you've considered
Otherwise I need to find a way to take a frame from the mp4 video file itself, but I can't easily see how in an HA supportable way

Additional context
This is of course all due to ring-mqtt and ring itself, and how it works, as it's not a streaming cctv service and doesn't function like Frigate etc. Might be worth a note on this in the docs as it's not immediately obvious (and also you need to know that streaming support doesn't take a streaming video it takes snapshots from entity_image which also isn't immediately obvious)

valentinfrlch · 2024-10-27T09:07:55Z

TL;DR Possible solution for root cause attached below (thread), blueprint inputs need to be reworked.

I agree. Judging by people on the forum it is too complicated. The two modes on the blueprint have different methods of showing the preview which is not ideal.

Probably the best way to fix is this is to store the most recent preview for each camera entity in the www folder. This limits the number of images stored to the amount of cameras.
The downside of course is that only the most recent snapshot for each camera will be available. (For notifications I'm not entirely sure but it's possible that the image is cached on the phone once the notification is received. This way even older notifications could have a preview...)

This would only show a static snapshot, but I guess that's better than no snapshot at all.
Unfortunately even this way the two modes of the blueprint would still need different methods as previewing frigate's event from the notification is very useful.

I also agree that it would be better if stream_analyzer directly tapped into the rtsp feed. This is another compatibility problem as the rtsp address for most camera entities is not exposed. The entity_picture attribute is present on all camera entities (at least to my knowledge) but may not update as frequently.

Btw: I found this thread about configuring a generic camera for live view of ring-mqtt. Maybe you've already seen that, but in case not: https://community.home-assistant.io/t/configuring-generic-camera-for-live-view-of-ring-mqtt/730270/2

In case the solution in the thread actually works, this would actually solve this problem. The blueprint will still need to be simplified.

helicalchris · 2024-10-27T10:38:07Z

Thanks. I've got real time working on generic camera already (that way) but as per original post the real time and snapshots are not linked for ring based implementations. Snapshots are every x minutes (configurable >=1 from memory) regardless of activity.

There is no way to extract a frame from live video rtsp I have found hence it not being possible to get snapshots to use in notifications. All the snapshots (saved using actions or from entity_picture or from still frame (all the same image) are from the periodic capture

valentinfrlch · 2024-10-27T12:12:54Z

I think I finally understand the problem. So correct me if I'm wrong but your camera (the generic camera) works and you get a preview in the notification (but it's the live view) and you'd prefer the snapshot of the actual event. Correct me if that's not the case.

helicalchris · 2024-10-27T14:27:42Z

Yes spot on. Logic was that as you are extracting frames anyway, could you return one of those extracted frames for use as a variable.

There is a wider sense to this in that the AI request only relates to a few images from a stream. Therefore as context it's useful to see what the AI engine has seen. The variable response could even include all the images sent (maybe in a debug mode). That would have been useful as while ago I could have kept them and rerun the prompt with the same images a few times to tune it rather than different images every time.

valentinfrlch · 2024-10-27T15:08:29Z

Ah great to hear that it's at least working somewhat!
The latest version added a fairly complex function to decide which images are important and which are not (based on how much the image changed to a baseline).
Still I understand you want the images. What could be added is an addional key in the response variable which holds a list of base64 encoded images. The problem with this approach is probably decoding them into an image.
What do you think of a 'debug mode' that stores the analyzed frames in a folder?

helicalchris · 2024-10-27T15:13:05Z

It's working perfectly in terms of taking the saved video clip and sending that for analysis. So, in so far as the addon purpose, it's working perfectly 😀

I think that writing images to a folder would be fine actually. Probably best have an option to choose whether it always appends date-time or not (I would have it overwrite the same files each time - otherwise that will get big fast - others may prefer to be able to look back)

helicalchris · 2024-10-27T15:15:05Z

Also happy to write some text for the docs to explain how ring_mqtt works and how to use the addon with it, if that's helpful, which can call out this debug feature too?

valentinfrlch · 2024-10-28T06:58:23Z

Some explainer how to get ring cameras working with this integration would be greatly appreciated!
I will add a boolean parameter expose_images to the action call for all analyzers. In the blueprint there will be a dropdown to choose between 'Live Preview' and 'Snapshot' (this will only apply to camera mode).
When 'Snapshot' is selected it would then set 'expose_images' in the action call and return the first image from the series.

helicalchris · 2024-10-28T08:14:38Z

I think that's perfect.

If I put a write up of Ring on the HA community thread is that easiest or shall I PM it to you?

helicalchris · 2024-10-28T08:18:32Z

(I assume you mean by camera mode it's in video analyser or stream analyser mode?)

valentinfrlch · 2024-10-28T08:37:36Z

If I put a write up of Ring on the HA community thread is that easiest or shall I PM it to you?

The HA community thread would be great! That way more people see it. I'd also put this in the docs if that's ok with you.

Thanks for contributing!

valentinfrlch · 2024-10-28T08:40:01Z

(I assume you mean by camera mode it's in video analyser or stream analyser mode?)

When the blueprint is in Frigate mode then video_analyzer will be called, Camera mode uses stream_analyzer

helicalchris · 2024-10-28T11:38:48Z

Got you. Not that it affects me but would people want to see an example image in frigate mode maybe to work out what is causing the prompt to respond as it is?

(Re ring noted, Of course copy my text into the docs, not a problem)

valentinfrlch · 2024-10-29T08:25:18Z

Thank you for writing such an elaborate guide. Given the popularity of ring I'm sure a lot of people will find this useful.

v1.3 is almost ready I just need to fix a few small things. If you have some time and want to test the beta, feedback would be very welcome.

helicalchris · 2024-10-29T09:22:32Z

Thank you 😃

Very happy to test the beta

valentinfrlch · 2024-10-29T14:51:09Z

The beta is out!
You can find the changelog here: https://github.com/valentinfrlch/ha-llmvision/releases/tag/v1.3-beta.1

To update the blueprint to the v1.3 beta you can use this url to import it: https://github.com/valentinfrlch/ha-llmvision/blob/semantic-index/blueprints/event_summary.yaml

Thanks for taking the time!

helicalchris · 2024-10-29T18:42:44Z

Beta installed...however I am away from home so I need a postman or courier to walk past it to trigger (otherwise the image is always the same and I can't see if I'm getting a frame from the video or an old artefact!)

helicalchris · 2024-10-29T19:41:53Z

This might be a me problem but when I run the automation step there is an error that:


Stopped because an error was encountered at 29 October 2024 at 19:35:44 (runtime: 11.03 seconds)

No image input provided

I write my video file to /share/front-camera-clip.mp4 (and I can load that file and play it myself and it works), and it was working fine until recently.

I don't use the blueprint so not sure if that's an issue (but the new options are showing in the automation - I created a new one from scratch to be sure). Also I think this is actually an issue from the last version as I have been away and haven't been using it much but did see a trace from before I changed to 1.3 beta (I just ignored it thinking it was an anomaly then ... ring is fickle) with the same issue I think (and now I can't look back as there seems to be nothing in logs about this)

Happy to post full automation yaml but it's a bit long!

valentinfrlch · 2024-10-29T19:57:48Z

Thanks for the feedback!
Not sure about your folder structure but can you try /config/share/front-camera-clip.mp4?
I have some test images in /www/tmp but it only works when I add the path as /config/www/tmp

helicalchris · 2024-10-30T07:05:15Z

I will do, but access to folders like share are permitted via configuration.yaml entries (see my ring guide for link). That shouldn't be an issue and it wasn't in 1.x versions of the addon where it worked perfectly

Note the real path is /share on the box. It's not under /config (like /www is) so I will need to make a new folder called share within config to do this

valentinfrlch · 2024-10-30T07:12:30Z

Ah I see... Do you have any images in the www folder so you could test if that works?
iirc there have been no changes to the way local files are handled.

helicalchris · 2024-10-30T09:07:48Z

I suppose I could put any image in there it doesn't matter for the test purposes so yes I'll give that a go

I can view the video on the share so it shouldn't be permissions

helicalchris · 2024-10-30T14:01:07Z

It works in image mode if I point it at an image in /share


action: llmvision.image_analyzer
data:
  provider: (redacted not sure if this is secret or a uid!)
  message: tell me what you see
  image_file: /share/front-snapshot.jpg
  include_filename: false
  max_tokens: 86
  temperature: 0.4

It fails in video mode if I point it at a video in /share


action: llmvision.video_analyzer
data:
  max_frames: 3
  include_filename: false
  target_width: 1280
  detail: low
  max_tokens: 100
  temperature: 0.2
  provider: (redacted not sure if this is secret or a uid!)
  message: tell me what you see
  video_file: /share/front-camera-clip.mp4

valentinfrlch · 2024-10-30T19:33:12Z

That's pretty strange but thanks a lot for testing. I will look into it!
Are there any errors/warnings in the log? Can you enable debug logging? That would help a lot!

You can enable debug logging by adding this to your configuration.yaml:

logger:
  logs:
    custom_components.llmvision: debug

helicalchris · 2024-10-30T20:56:48Z

Sorry I should have posted the logs. When I said earlier there are none I meant none except the one the automation tells me (as per above) but the log does give line numbers etc


Logger: homeassistant.components.automation.gemini_video_analyse
Source: components/automation/__init__.py:763
integration: Automation (documentation, issues)
First occurred: 20:53:58 (1 occurrences)
Last logged: 20:53:58

Error while executing automation automation.gemini_video_analyse: No image input provided

Remember as per above I believe this was also happening with 1.21. Does rollback work with HACS properly I've never tried? I could revert and test

helicalchris · 2024-10-31T18:09:46Z

I can't explain why, (running 1.3beta), but in between multiple failed executions all with same error there's one which worked and ran fully and found the file. I have absolutely no idea as nothing changed and there's five events either side that failed! Just thought I'd mention it!

I'm going to try to roll back to 1.1x and see if that works again

valentinfrlch · 2024-10-31T18:14:58Z

Thank you for your thorough testing!
Are you saving the files in the same automation? Maybe adding a timeout (delay) between saving and analyzing could help. Just a wild guess though.

helicalchris · 2024-10-31T19:17:28Z

Back on 1.1.3 release and it seems to be working fine...

(edit: but don't rely on this just yet as there is a strange ring-ism that watching live doesn't always work, but if motion is detected then that is pretty reliable, so I need some people to go to my door to be sure. At the moment camera.record seems to fail but Llm vision is reading the existing mp4 clip properly which it wasnt before)

helicalchris · 2024-10-31T19:19:55Z

Are you saving the files in the same automation? Maybe adding a timeout (delay) between saving and analyzing could help. Just a wild guess though.

Yes I am saving (camera.record action), changing a switch (to stop live streaming), and then running the Llm vision step.

The fact it worked once is odd, but the file is always there, so it's not like it's not synced and is missing, if that were the case the last clip would be processed instead. I can't see why any locks would exist...

valentinfrlch · 2024-11-01T21:10:31Z

I've looked at the difference of 1.1.3 and 1.3-beta.2 (v1.1.3...v1.3-beta.2). The only noteworthy change is the new ssim function that compares the difference of two images. Maybe there is some blocking call there that I missed but if that was the case then you'd see that in your logs. So I actually have pretty much no idea what could cause this error...

Have you tried the call with a video from another folder? The fact that it worked once means it probably not the folder but still...

helicalchris · 2024-11-02T06:44:03Z

Now I'm home and it's being triggered by motion it's working 100% of the time on 1.1.3....so strange

helicalchris · 2024-11-03T20:20:35Z

Now on 1.3 release, same issue returns. Have tried using /config/www/llmvision for the files, and have tried adding a 10s timer after the file is written, all to no avail

It's not something about the context of files and folders that you run the new image analysis stuff in the addon container is it that it's not /config/www, it's /www or something silly?

Shall I move this to a new bug to let this feature be closed off?

valentinfrlch · 2024-11-04T15:49:49Z

Just checked again, all paths are /config/www/llmvision. I assume a call isn't awaited properly or a blocking call. That might explain why I can't replicate this issue.

Yes, please move this to a new issue.

Thanks for your testing though!

helicalchris assigned valentinfrlch Oct 25, 2024

valentinfrlch closed this as completed Nov 4, 2024

helicalchris mentioned this issue Nov 4, 2024

Image error failure for video files #80

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Export snapshot as variable (workaround to ring-mqtt) #75

Export snapshot as variable (workaround to ring-mqtt) #75

helicalchris commented Oct 25, 2024 •

edited

Loading

valentinfrlch commented Oct 27, 2024

helicalchris commented Oct 27, 2024

valentinfrlch commented Oct 27, 2024

helicalchris commented Oct 27, 2024

valentinfrlch commented Oct 27, 2024

helicalchris commented Oct 27, 2024

helicalchris commented Oct 27, 2024

valentinfrlch commented Oct 28, 2024

helicalchris commented Oct 28, 2024

helicalchris commented Oct 28, 2024

valentinfrlch commented Oct 28, 2024

valentinfrlch commented Oct 28, 2024 •

edited

Loading

helicalchris commented Oct 28, 2024

valentinfrlch commented Oct 29, 2024

helicalchris commented Oct 29, 2024

valentinfrlch commented Oct 29, 2024

helicalchris commented Oct 29, 2024

helicalchris commented Oct 29, 2024 •

edited

Loading

valentinfrlch commented Oct 29, 2024

helicalchris commented Oct 30, 2024 •

edited

Loading

valentinfrlch commented Oct 30, 2024

helicalchris commented Oct 30, 2024

helicalchris commented Oct 30, 2024 •

edited

Loading

valentinfrlch commented Oct 30, 2024 •

edited

Loading

helicalchris commented Oct 30, 2024 •

edited

Loading

helicalchris commented Oct 31, 2024

valentinfrlch commented Oct 31, 2024

helicalchris commented Oct 31, 2024 •

edited

Loading

helicalchris commented Oct 31, 2024

valentinfrlch commented Nov 1, 2024

helicalchris commented Nov 2, 2024

helicalchris commented Nov 3, 2024 •

edited

Loading

valentinfrlch commented Nov 4, 2024

Export snapshot as variable (workaround to ring-mqtt) #75

Export snapshot as variable (workaround to ring-mqtt) #75

Comments

helicalchris commented Oct 25, 2024 • edited Loading

valentinfrlch commented Oct 27, 2024

helicalchris commented Oct 27, 2024

valentinfrlch commented Oct 27, 2024

helicalchris commented Oct 27, 2024

valentinfrlch commented Oct 27, 2024

helicalchris commented Oct 27, 2024

helicalchris commented Oct 27, 2024

valentinfrlch commented Oct 28, 2024

helicalchris commented Oct 28, 2024

helicalchris commented Oct 28, 2024

valentinfrlch commented Oct 28, 2024

valentinfrlch commented Oct 28, 2024 • edited Loading

helicalchris commented Oct 28, 2024

valentinfrlch commented Oct 29, 2024

helicalchris commented Oct 29, 2024

valentinfrlch commented Oct 29, 2024

helicalchris commented Oct 29, 2024

helicalchris commented Oct 29, 2024 • edited Loading

valentinfrlch commented Oct 29, 2024

helicalchris commented Oct 30, 2024 • edited Loading

valentinfrlch commented Oct 30, 2024

helicalchris commented Oct 30, 2024

helicalchris commented Oct 30, 2024 • edited Loading

valentinfrlch commented Oct 30, 2024 • edited Loading

helicalchris commented Oct 30, 2024 • edited Loading

helicalchris commented Oct 31, 2024

valentinfrlch commented Oct 31, 2024

helicalchris commented Oct 31, 2024 • edited Loading

helicalchris commented Oct 31, 2024

valentinfrlch commented Nov 1, 2024

helicalchris commented Nov 2, 2024

helicalchris commented Nov 3, 2024 • edited Loading

valentinfrlch commented Nov 4, 2024

helicalchris commented Oct 25, 2024 •

edited

Loading

valentinfrlch commented Oct 28, 2024 •

edited

Loading

helicalchris commented Oct 29, 2024 •

edited

Loading

helicalchris commented Oct 30, 2024 •

edited

Loading

helicalchris commented Oct 30, 2024 •

edited

Loading

valentinfrlch commented Oct 30, 2024 •

edited

Loading

helicalchris commented Oct 30, 2024 •

edited

Loading

helicalchris commented Oct 31, 2024 •

edited

Loading

helicalchris commented Nov 3, 2024 •

edited

Loading