Feature Request: Support for Sending Images or Audio Files to LLM #334
Replies: 2 comments
-
Thanks for your kind words. Your suggestions and bug spotting has been hugely appreciated over the past couple of months. I've thought about adding vision capabilities to the plugin for a while now. I haven't fully implemented it because there's always been, at least in my opinion, lower hanging fruit to pursue. If you could share your use case so I can understand it better than I'm more likely to bump it up the roadmap. To be open, I plan on adding some features that are similar to Aider in the coming weeks. The ability to create files on the disk, run tests and modify code until the tests pass and make the plugin capable of completing benchmarking challenges so users can better evaluate what models to use. I'd also like to build a |
Beta Was this translation helpful? Give feedback.
-
Thank you.
I totally understand, and I really don’t want to rush it at all.
Here’s my use case for images in two workflows:
In both workflows, I usually use screenshots. For audio, I mainly use it for transcribing and translating, which are far from coding-related tasks. I could probably make a small app for that, so there’s probably no need to include that feature.
Your plan for the plugin is impressive, and I admire that. I wasn’t aware of Aider, but it looks amazing from what I’ve seen so far.
I think this feature would be a major milestone for this plugin, and I can’t wait to try it out when it’s ready. Thank you again. |
Beta Was this translation helpful? Give feedback.
-
Hello,
First of all, thank you for all the hard work on this plugin!
It’s become my go-to tool for interacting with any LLM. I was wondering if it would be possible to add a feature that allows sending images or audio to the LLM (perhaps by providing the file path)?
Thanks again!
Beta Was this translation helpful? Give feedback.
All reactions