Add support for GPT-4 Vision #17

ronaldmannak · 2024-04-13T06:05:08Z

This PR adds support for GPT-4 Vision. Images can be added to a message as a URL or as Data.

The token counter for small images adds the correct amount of tokens. However, large images have a complex way of counting tokens and require the library to download the image. To keep things simple, this PR just adds the maximum number of tokens possible. Maybe a feature could be added in the future that optionally downloads the images and counts the tokens the right way.

let media = [
  MessageContent.text("Count the number of apples in this image"),   
  MessageContent.imageUrl("https://apples.com/apples.jpg"),
 ]
thread.addUserMessage(_ media: media)

or

let imageData = UIImage(named: "apples.jpg").jpegData(compressionQuality: 1.0) 
let media = [
  MessageContent.text("Count the number of apples in this image"), 
  MessageContent.imageUrl(URLDetails(imageData: imageData))]
thread.addUserMessage(_ media: media)

There is one breaking change. The content property in ChatMessage is now an enum and invoking description might be needed to fetch the string, as was the case in one of the unit tests:

  XCTAssertEqual(userMessageContent, chatThread.getNonSystemMessages().first?.content) // old
  XCTAssertEqual(userMessageContent, chatThread.getNonSystemMessages().first?.content?.description) // new

ronaldmannak added 7 commits April 12, 2024 21:07

Add Content

7a4e8d8

Make ChatContent public

e6948a2

Rename ChatContent -> MessageContent

c96f7c6

Update ChatMessage

40c3780

Fix init

85872e2

Add init for converting image data to base64

72d769e

Return emtpy string in case of no text

5c4b24e

btfranklin merged commit cf2f607 into btfranklin:main Apr 13, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for GPT-4 Vision #17

Add support for GPT-4 Vision #17

ronaldmannak commented Apr 13, 2024

Add support for GPT-4 Vision #17

Add support for GPT-4 Vision #17

Conversation

ronaldmannak commented Apr 13, 2024