-x/--extract option for returning content of first fenced code block #681

simonw · 2024-12-19T14:17:50Z

Thought of this after writing up my one-shot prompting tricks here: https://simonwillison.net/2024/Dec/19/one-shot-python-tools/

Idea is to be able to do this:

llm 'Python CLI tool for finding all files matching an expression, recursively' --extract

This would run the provided prompt and return just the content of the first fenced code block (```python ...) in the response.

If no fenced code blocks found, returns the whole string.

The text was updated successfully, but these errors were encountered:

simonw · 2024-12-19T14:18:18Z

Used o1-mini for the initial code: https://chatgpt.com/share/67642b13-72a4-8006-8620-f75e62706554

simonw · 2024-12-19T14:40:52Z

Updated documentation: https://github.com/simonw/llm/blob/67d4a9964501071b3d810a76598ea27cc741ec6f/docs/usage.md#extracting-fenced-code-blocks

simonw · 2024-12-19T14:42:27Z

Demo:

llm -m gpt-4o-mini 'javascript function to reverse a string' -x

Output:

function reverseString(str) {
    return str.split('').reverse().join('');
}

// Example usage
const originalString = "Hello, World!";
const reversedString = reverseString(originalString);
console.log(reversedString); // Output: "!dlroW ,olleH"

But running llm logs -c shows the full prompt and response: https://gist.github.com/simonw/0964cdd9d7e3cb46932d9b017083f25a

simonw · 2024-12-19T15:16:13Z

Templates should support this too.

simonw · 2024-12-19T15:17:28Z

This now works:

llm --system 'write a Python function' --extract --save python-function
llm -t python-function 'reverse a string'

Docs here: https://github.com/simonw/llm/blob/000e984def983aa36384a24df42d4dbb558b5bb1/docs/templates.md#configuring-code-extraction

simonw · 2024-12-19T15:17:55Z

From my blog entry: https://simonwillison.net/2024/Dec/19/one-shot-python-tools/

llm -s 'You write Python tools as single files. They always start with this comment:

# /// script
# requires-python = ">=3.12"
# ///

These files can include dependencies on libraries such as Click. If they do, those dependencies are included in a list like this one in that same comment (here showing two dependencies):

# /// script
# requires-python = ">=3.12"
# dependencies = [
#     "click",
#     "sqlite-utils",
# ]
# ///' --save python --extract

simonw · 2024-12-19T15:19:41Z

I tried this:

llm -t python 'starlette server providing /?url= API to extract just text using BeautifulSoup'

And got back this:

# /// script
# requires-python = ">=3.12"
# dependencies = [
#     "starlette",
#     "httpx",
#     "beautifulsoup4",
# ]
# ///

from starlette.applications import Starlette
from starlette.responses import JSONResponse
from starlette.requests import Request
import httpx
from bs4 import BeautifulSoup

app = Starlette()

@app.route("/", methods=["GET"])
async def extract_text(request: Request):
    url = request.query_params.get("url")

    if not url:
        return JSONResponse({"error": "URL parameter is required"}, status_code=400)
    
    try:
        async with httpx.AsyncClient() as client:
            response = await client.get(url)
            response.raise_for_status()
    except httpx.RequestError:
        return JSONResponse({"error": "Failed to fetch the URL"}, status_code=500)
    except httpx.HTTPStatusError as exc:
        return JSONResponse({"error": f"HTTP error occurred: {exc.response.status_code}"}, status_code=exc.response.status_code)

    soup = BeautifulSoup(response.text, 'html.parser')
    text = soup.get_text()

    return JSONResponse({"text": text.strip()})

if __name__ == "__main__":
    import uvicorn
    uvicorn.run(app, host="127.0.0.1", port=8000)

Full transcript here: https://gist.github.com/simonw/b32bb1b9ad3ea36247189f0d66cfc38e

That was with gpt-4o-mini.

cmosguy · 2025-01-04T19:49:58Z

Hey @simonw how do we do this for o1-preview?

llm -m o1-preview-azure -t python-function 'reverse a string'                                                                                                   

Error: Error code: 400 - {'error': {'message': "Unsupported value: 'messages[0].role' does not support 'system' with this model.", 'type': 'invalid_request_error', 'param': 'messages[0].role', 'code': 'unsupported_value'}}

BTW, I had to do this:

llm -m o1-preview-azure 'You are an expert Python software engineer with over 20 years of experience.  create a script that will collect all the IBM LSF information using the necessary commands to tell me which host is available' -x

That seemed to work ok, but I cannot use templates yet with o1.

simonw · 2025-01-23T04:22:46Z

That's because o1-preview doesn't support system messages at all. You can't use templates that set system messages with that model.

Refs #654, #676, #677, #681, #688, #690, #700, #702, #709

cmosguy · 2025-01-24T01:12:26Z

@simonw Yes I suspected that after I wrote that last comment. Thanks for following up on this!

simonw added the enhancement New feature or request label Dec 19, 2024

simonw closed this as completed in 67d4a99 Dec 19, 2024

simonw reopened this Dec 19, 2024

simonw closed this as completed in 000e984 Dec 19, 2024

simonw mentioned this issue Jan 10, 2025

llm logs -x/--extract option #693

Merged

simonw added a commit that referenced this issue Jan 23, 2025

Release 0.20

dc127d2

Refs #654, #676, #677, #681, #688, #690, #700, #702, #709

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

-x/--extract option for returning content of first fenced code block #681

-x/--extract option for returning content of first fenced code block #681

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

cmosguy commented Jan 4, 2025 •

edited

Loading

simonw commented Jan 23, 2025

cmosguy commented Jan 24, 2025

-x/--extract option for returning content of first fenced code block #681

-x/--extract option for returning content of first fenced code block #681

Comments

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

simonw commented Dec 19, 2024

cmosguy commented Jan 4, 2025 • edited Loading

simonw commented Jan 23, 2025

cmosguy commented Jan 24, 2025

cmosguy commented Jan 4, 2025 •

edited

Loading