Introduce experimental FileOutput interface for models that output File and Path types #348

aron · 2024-09-11T14:17:43Z

This PR is a proposal to add a new FileOutput type to the client SDK to abstract away file outputs from Replicate models.

It can be enabled by passing the use_file_output flag to the run() method (this will be moved to the constructor).

replicate.run("black-forest-labs/flux-schnell", input={}, use_file_output=True)

When enabled, any URLs (and soon data-uris) in the output will be converted into a FileOutput object. This is essentially an Iterable[bytes] | AsyncIterable[bytes] with two additions: a url attribute referencing the underlying URL, and a read() method that returns bytes with the file data loaded into memory.

The intention here is to make it easier to work with file outputs and to allow us to optimize the delivery of file assets to the client in future iterations.
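To make the shape of that interface concrete, here is a minimal sketch of an object that is iterable over byte chunks and also exposes url and read(). It is not the SDK's implementation: FileOutputSketch, fetch_chunks, and fake_fetch are hypothetical names, and the transport is faked so the example runs without a network.

```python
from typing import Callable, Iterator


class FileOutputSketch:
    """Hypothetical stand-in for FileOutput; not the SDK's actual class."""

    def __init__(self, url: str, fetch_chunks: Callable[[str], Iterator[bytes]]) -> None:
        self.url = url
        self._fetch_chunks = fetch_chunks

    def __iter__(self) -> Iterator[bytes]:
        # Stream the file chunk by chunk without buffering all of it.
        return self._fetch_chunks(self.url)

    def read(self) -> bytes:
        # Load the whole file into memory by joining every chunk.
        return b"".join(self)


def fake_fetch(url: str) -> Iterator[bytes]:
    # Assumed transport: yields canned chunks instead of making an HTTP request.
    yield b"hello, "
    yield b"world"


output = FileOutputSketch("https://example.com/out.bin", fake_fetch)
chunks = list(output)   # streaming access
data = output.read()    # whole-file access
```

In this sketch each iteration starts a fresh fetch, so streaming and read() can both be used on the same object; whether the real type behaves that way is not stated here.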

Usage is as follows:

output = replicate.run(
  "black-forest-labs/flux-schnell",
  input={"prompt": "astronaut riding a rocket like a horse"},
  use_file_output=True,
)

For most basic cases you'll want to utilize either the url or read() fields depending on whether you want to directly consume the file or pass it on.

To access the file URL:

print(output.url) #=> "https://delivery.replicate.com/..."

To consume the file directly:

with open('output.bin', 'wb') as file:
    file.write(output.read())

Very large files can be streamed instead:

with open(file_path, 'wb') as file:
    for chunk in output:
        file.write(chunk)

Each of these methods has an equivalent asyncio API.

async with aiofiles.open(filename, 'wb') as file:
    await file.write(await output.aread())

async with aiofiles.open(filename, 'wb') as file:
    async for chunk in output:
        await file.write(chunk)

The common web frameworks below all accept iterator types for streaming responses:

Django

@condition(etag_func=None)
def stream_response(request):
    output = replicate.run("black-forest-labs/flux-schnell", input={...}, use_file_output=True)
    return HttpResponse(output, content_type='image/webp')

FastAPI

@app.get("/")
async def main():
    output = replicate.run("black-forest-labs/flux-schnell", input={...}, use_file_output=True)
    return StreamingResponse(output)

Flask

@app.route('/stream')
def streamed_response():
    output = replicate.run("black-forest-labs/flux-schnell", input={...}, use_file_output=True)
    return app.response_class(stream_with_context(output))

mattt

Great work, @aron! I made a couple changes and moved some things around to better match project conventions, but overall I think this is looking great.

bfirsh

I love it. This is really excellent.

A thought about the iterator semantics... are there any existing patterns for iterators on file handles returning chunks of data? Usually file handles return lines or single bytes if you iterate over them, and HTTPX responses have the iter_*() methods.

It feels like we should be consistent with one of those patterns rather than invent a new pattern. If it's the file handle pattern, which feels like the more universal thing to me, then we could also support the length argument to read() to read chunks.

Not a big deal anyway and we can always iterate (zing!) before release.

aron · 2024-09-12T10:16:46Z

@bfirsh this is great feedback, thanks. I started off implementing io.IOBase, but that felt more complex than httpx.Response, which felt cleaner and supported asyncio nicely. I saw the iter_*() methods, which provide decoding utilities, but left them out in favor of just providing the bytes and simplifying the interface.

If adding support for read(size) improves things we can do that.
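For reference, a rough sketch of what read(size) could look like on top of a chunk iterator. This is only an illustration of the file-handle pattern discussed above, not code from this PR; ChunkReader is a hypothetical name.

```python
from typing import Iterable, Iterator


class ChunkReader:
    """Hypothetical adapter giving a chunk iterator file-like read(size) semantics."""

    def __init__(self, chunks: Iterable[bytes]) -> None:
        self._chunks: Iterator[bytes] = iter(chunks)
        self._buffer = b""

    def read(self, size: int = -1) -> bytes:
        if size < 0:
            # No size given: drain everything that is left.
            data = self._buffer + b"".join(self._chunks)
            self._buffer = b""
            return data
        # Pull chunks until the buffer can satisfy the request or the source ends.
        while len(self._buffer) < size:
            chunk = next(self._chunks, None)
            if chunk is None:
                break
            self._buffer += chunk
        data, self._buffer = self._buffer[:size], self._buffer[size:]
        return data


reader = ChunkReader([b"abc", b"defg", b"h"])
first = reader.read(2)   # b"ab"
second = reader.read(4)  # b"cdef"
rest = reader.read()     # b"gh"
```

The buffer holds at most one chunk beyond what was requested, so memory use stays bounded by the chunk size rather than the file size.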

aron · 2024-09-12T10:46:48Z

I've added support for data-uris. This brings it to parity with the JS implementation.
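As an illustration of what reading a data URI involves, here is a minimal standard-library sketch. It is an assumption about the general approach rather than the code in this PR, and decode_data_uri is a hypothetical helper name.

```python
import base64
from urllib.parse import unquote_to_bytes


def decode_data_uri(uri: str) -> bytes:
    """Return the payload bytes of a data URI (RFC 2397 style)."""
    if not uri.startswith("data:"):
        raise ValueError("not a data URI")
    # Split "<mediatype>[;base64]" from the payload at the first comma.
    header, sep, payload = uri[len("data:"):].partition(",")
    if not sep:
        raise ValueError("data URI is missing the ',' separator")
    if header.endswith(";base64"):
        return base64.b64decode(payload)
    # Without ;base64 the payload is percent-encoded.
    return unquote_to_bytes(payload)


binary = decode_data_uri("data:application/octet-stream;base64,aGVsbG8=")
text = decode_data_uri("data:text/plain,hello%20world")
```

Because the bytes are already embedded in the URI, no HTTP request is needed for this case.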

Also, I did a quick skim of common web frameworks (Django, Flask, and FastAPI). All of them support streaming HTTP responses with iterators, so I've updated the PR description with examples. This aspect feels pretty good with the current implementation.

bfirsh · 2024-09-12T19:02:10Z

OK, brilliant. If this is compatible with things that expect to get a binary file handle type thing, that's all I care about.

@mattt Do you have any thoughts about file handle and iterator semantics?

mattt · 2024-09-12T22:00:19Z

@bfirsh What we have now is in line with how I'd expect something like this to work. @aron did a good job surveying how this API is likely to be used.

aron added 4 commits September 11, 2024 15:15

Update lockfile from running rye sync

f9d16db

Update pyproject.toml to configure pyright

57f6f0b

Implement a FileOutput interface

3cc0b86

Implement experimental FileOutput interface

7b2e7f4

aron force-pushed the file-output branch from 9e88cb5 to 7b2e7f4 Compare September 11, 2024 14:26

aron changed the title ~~file output~~ Introduce experimental FileOutput interface for models that output File and Path types Sep 11, 2024

mattt added 10 commits September 11, 2024 10:52

Ignore ANN401 warnings

9b7c82c

Signed-off-by: Mattt Zmuda <mattt@replicate.com>

Refactor transform_output

bd12f1d

Signed-off-by: Mattt Zmuda <mattt@replicate.com>

Fix warning: Boolean default positional argument in function definition

310aea2

Signed-off-by: Mattt Zmuda <mattt@replicate.com>

Rename json module to helpers

69c95e6

Signed-off-by: Mattt Zmuda <mattt@replicate.com>

Move transform_output to helpers module

c238c39

Signed-off-by: Mattt Zmuda <mattt@replicate.com>

Move FileOutput to helpers module

4687e28

Signed-off-by: Mattt Zmuda <mattt@replicate.com>

Inherit from SyncByteStream instead of ByteStream

46d69d1

SyncByteStream is an abstract class; ByteStream is a concrete class that inherits the abstract sync and async byte stream classes.

Signed-off-by: Mattt Zmuda <mattt@replicate.com>

Fix warnings

1ae54d5

Signed-off-by: Mattt Zmuda <mattt@replicate.com>

Remove custom implementation of __repr__

024916e

Signed-off-by: Mattt Zmuda <mattt@replicate.com>

Add docstrings

fe2323a

Signed-off-by: Mattt Zmuda <mattt@replicate.com>

mattt force-pushed the file-output branch from f4f07c0 to fe2323a Compare September 11, 2024 18:36

Rename FileOutput.client to ._client

04be83f

Signed-off-by: Mattt Zmuda <mattt@replicate.com>

mattt marked this pull request as ready for review September 11, 2024 18:39

mattt requested a review from bfirsh September 11, 2024 18:40

mattt approved these changes Sep 11, 2024

bfirsh approved these changes Sep 11, 2024

Add support to FileOutput to read data-uris

c0a4b13

aron force-pushed the file-output branch from a8d11c5 to c0a4b13 Compare September 12, 2024 10:38

mattt merged commit e7f699f into main Sep 16, 2024
7 checks passed

mattt deleted the file-output branch September 16, 2024 07:05

mattt mentioned this pull request Sep 16, 2024

Return file handles if it's a file #67

Closed
