Merge branch 'main' into vb/add-sync-client

langchain-ai · Sep 23, 2024 · ae2185a · ae2185a
2 parents 985f1b1 + e1a3336
commit ae2185a
Show file tree

Hide file tree

Showing 17 changed files with 1,029 additions and 597 deletions.
diff --git a/docs/docs/concepts/human_in_the_loop.md b/docs/docs/concepts/human_in_the_loop.md
@@ -1,63 +1,278 @@
 # Human-in-the-loop
 
-Agentic systems often require some human-in-the-loop (or "on-the-loop") interaction patterns. This is because agentic systems are still not very reliable, so having a human involved is required for any sensitive tasks/actions. These are all easily enabled in LangGraph, largely due to built-in [persistence](./persistence.md), implemented via checkpointers.
+Human-in-the-loop (or "on-the-loop") enhances agent capabilities through several common user interaction patterns.
 
-The reason a checkpointer is necessary is that a lot of these interaction patterns involve running a graph up until a certain point, waiting for some sort of human feedback, and then continuing. When you want to "continue" you will need to access the state of the graph prior to the interrupt. LangGraph persistence enables this by checkpointing the state at every superstep.
+Common interaction patterns include:
 
-There are a few common human-in-the-loop interaction patterns we see emerging.
+(1) `Approval` - We can interrupt our agent, surface the current state to a user, and allow the user to accept an action. 
 
-## Approval
+(2) `Editing` - We can interrupt our agent, surface the current state to a user, and allow the user to edit the agent state. 
+
+(3) `Input` - We can explicitly create a graph node to collect human input and pass that input directly to the agent state.
+
+Use-cases for these interaction patterns include:
+
+(1) `Reviewing tool calls` - We can interrupt an agent to review and edit the results of tool calls.
+
+(2) `Time Travel` - We can manually re-play and / or fork past actions of an agent.
+
+## Persistence
+
+All of these interaction patterns are enabled by LangGraph's built-in [persistence](./persistence.md) layer, which will write a checkpoint of the graph state at each step. Persistence allows the graph to stop so that a human can review and / or edit the current state of the graph and then resume with the human's input.
+
+### Breakpoints
+
+Adding a [breakpoint](./low_level.md#breakpoints) a specific location in the graph flow is one way to enable human-in-the-loop. In this case, the developer knows *where* in the workflow human input is needed and simply places a breakpoint prior to or following that particular graph node.
+
+### Dynamic Breakpoints
+
+Alternatively, the developer can define some *condition* that must be met for the breakpoint to be triggered. This concept of [dynamic breakpoints](./low_level.md#dynamic-breakpoints) is useful when the developer wants to halt the graph under *a particular condition*. This uses a `NodeInterrupt`, which is a special type of exception that can be raised from within a node based upon some condition.
+
+```python
+def my_node(state: State) -> State:
+    if len(state['input']) > 5:
+        raise NodeInterrupt(f"Received input that is longer than 5 characters: {state['input']}")
+    return state
+```
+
+See [our guide](../how-tos/human_in_the_loop/dynamic_breakpoints.ipynb) for a detailed how-to on doing this!
+
+## Interaction Patterns
+
+### Approval
 
 ![](./img/human_in_the_loop/approval.png)
 
-A basic pattern is to have the agent wait for approval before executing certain tools. This may be all tools, or just a subset of tools. This is generally recommend for more sensitive actions (like writing to a database). This can easily be done in LangGraph by setting a [breakpoint](./low_level.md#breakpoints) before specific nodes.
+Sometimes we want to approve certain steps in our agent's execution. 
+
+We can interrupt our agent at a [breakpoint](./low_level.md#breakpoints) prior to the step that we want to approve.
+
+This is generally recommend for sensitive actions (e.g., using external APIs or writing to a database).
+
+With persistence, we can surface the current agent state as well as the next step to a user for review and approval. 
+
+If approved, the graph resumes execution from the last saved checkpoint, which is saved to the `thread`:
+
+```python
+# Compile our graph with a checkpoitner and a breakpoint before the step to approve
+graph = builder.compile(checkpointer=checkpoitner, interrupt_before=["node_2"])
+
+# Run the graph up to the breakpoint
+for event in graph.stream(inputs, thread, stream_mode="values"):
+    print(event)
+
+# ... Get human approval ...
+
+# If approved, continue the graph execution from the last saved checkpoint
+for event in graph.stream(None, thread, stream_mode="values"):
+    print(event)
+```
 
-See [this guide](../how-tos/human_in_the_loop/breakpoints.ipynb) for how do this in LangGraph.
+See [our guide](../how-tos/human_in_the_loop/breakpoints.ipynb) for a detailed how-to on doing this!
 
-## Wait for input
+### Editing
+
+![](./img/human_in_the_loop/edit_graph_state.png)
+
+Sometimes we want to review and edit the agent's state. 
+
+As with approval, we can interrupt our agent at a [breakpoint](./low_level.md#breakpoints) prior the the step we want to check. 
+
+We can surface the current state to a user and allow the user to edit the agent state.
+
+This can, for example, be used to correct the agent if it made a mistake (e.g., see the section on tool calling below).
+
+We can edit the graph state by forking the current checkpoint, which is saved to the `thread`.
+
+We can then proceed with the graph from our forked checkpoint as done before. 
+
+```python
+# Compile our graph with a checkpoitner and a breakpoint before the step to review
+graph = builder.compile(checkpointer=checkpoitner, interrupt_before=["node_2"])
+
+# Run the graph up to the breakpoint
+for event in graph.stream(inputs, thread, stream_mode="values"):
+    print(event)
+
+# Review the state, decide to edit it, and create a forked checkpoint with the new state
+graph.update_state(thread, {"state": "new state"})
+
+# Continue the graph execution from the forked checkpoint
+for event in graph.stream(None, thread, stream_mode="values"):
+    print(event)
+```
+
+See [this guide](../how-tos/human_in_the_loop/edit-graph-state.ipynb) for a detailed how-to on doing this!
+
+### Input
 
 ![](./img/human_in_the_loop/wait_for_input.png)
 
-A similar one is to have the agent wait for human input. This can be done by:
+Sometimes we want to explicitly get human input at a particular step in the graph. 
+
+We can create a graph node designated for this (e.g., `human_input` in our example diagram).
+
+As with approval and editing, we can interrupt our agent at a [breakpoint](./low_level.md#breakpoints) prior to this node.
+
+We can then perform a state update that includes the human input, just as we did with editing state.
 
-1. Create a node specifically for human input
-2. Add a breakpoint before the node
-3. Get user input
-4. Update the state with that user input, acting as that node
-5. Resume execution
+But, we add one thing: 
 
-See [this guide](../how-tos/human_in_the_loop/wait-user-input.ipynb) for how do this in LangGraph.
+We can use `as_node=human_input` with the state update to specify that the state update *should be treated as a node*.
 
-## Edit agent actions
+The is subtle, but important: 
 
-![](./img/human_in_the_loop/edit_graph_state.png)
+With editing, the user makes a decision about whether or not to edit the graph state.
+
+With input, we explicitly define a node in our graph for collecting human input!
+
+The the state update with the human input then runs *as this node*.
+
+```python
+# Compile our graph with a checkpoitner and a breakpoint before the step to to collect human input
+graph = builder.compile(checkpointer=checkpoitner, interrupt_before=["human_input"])
+
+# Run the graph up to the breakpoint
+for event in graph.stream(inputs, thread, stream_mode="values"):
+    print(event)
+
+# Update the state with the user input as if it was the human_input node
+graph.update_state(thread, {"user_input": user_input}, as_node="human_input")
+
+# Continue the graph execution from the checkpoint created by the human_input node
+for event in graph.stream(None, thread, stream_mode="values"):
+    print(event)
+```
+
+See [this guide](../how-tos/human_in_the_loop/wait-user-input.ipynb) for a detailed how-to on doing this!
+
+## Use-cases
+
+### Reviewing Tool Calls
+
+Some user interaction patterns combine the above ideas.
+
+For example, many agents use [tool calling](https://python.langchain.com/docs/how_to/tool_calling/) to make decisions. 
+
+Tool calling presents a challenge because the agent must get two things right: 
+
+(1) The name of the tool to call 
+
+(2) The arguments to pass to the tool
+
+Even if the tool call is correct, we may also want to apply discretion: 
+
+(3) The tool call may be a sensitive operation that we want to approve 
+
+With these points in mind, we can combine the above ideas to create a human-in-the-loop review of a tool call.
+
+```python
+# Compile our graph with a checkpoitner and a breakpoint before the step to to review the tool call from the LLM 
+graph = builder.compile(checkpointer=checkpoitner, interrupt_before=["human_review"])
+
+# Run the graph up to the breakpoint
+for event in graph.stream(inputs, thread, stream_mode="values"):
+    print(event)
+
+# Review the tool call and update it, if needed, as the human_review node
+graph.update_state(thread, {"tool_call": "updated tool call"}, as_node="human_review")
+
+# Otherwise, approve the tool call and proceed with the graph execution with no edits 
+
+# Continue the graph execution from either: 
+# (1) the forked checkpoint created by human_review or 
+# (2) the checkpoint saved when the tool call was originally made (no edits in human_review)
+for event in graph.stream(None, thread, stream_mode="values"):
+    print(event)
+```
+
+See [this guide](../how-tos/human_in_the_loop/review-tool-calls.ipynb) for a detailed how-to on doing this!
+
+### Time Travel
+
+When working with agents, we often want closely examine their decision making process: 
+
+(1) Even when they arrive a desired final result, the reasoning that led to that result is often important to examine.
+
+(2) When agents make mistakes, it is often valuable to understand why.
+
+(3) In either of the above cases, it is useful to manually explore alternative decision making paths.
+
+Collectively, we call these debugging concepts `time-travel` and they are composed of `replaying` and `forking`.
+
+#### Replaying
+
+![](./img/human_in_the_loop/replay.png)
+
+Sometimes we want to simply replay past actions of an agent. 
+
+Above, we showed the case of executing an agent from the current state (or checkpoint) of the graph.
+
+We by simply passing in `None` for the input with a `thread`.
+
+```
+thread = {"configurable": {"thread_id": "1"}}
+for event in graph.stream(None, thread, stream_mode="values"):
+    print(event)
+```
+
+Now, we can modify this to replay past actions from a *specific* checkpoint by passing in the checkpoint ID.
+
+To get a specific checkpoint ID, we can easily get all of the checkpoints in the thread and filter to the one we want.
+
+```python
+all_checkpoints = []
+for state in app.get_state_history(thread):
+    all_checkpoints.append(state)
+```
+
+Each checkpoint has a unique ID, which we can use to replay from a specific checkpoint.
+
+Assume from reviewing the checkpoints that we want to replay from one, `xxx`.
+
+We just pass in the checkpoint ID when we run the graph.
+
+```python
+config = {'configurable': {'thread_id': '1', 'checkpoint_id': 'xxx'}}
+for event in graph.stream(None, config, stream_mode="values"):
+    print(event)
+```
+
+Importantly, the graph knows which checkpoints have been previously executed. 
+
+So, it will re-play any previously executed nodes rather than re-executing them.
+
+See [this additional conceptual guide](https://langchain-ai.github.io/langgraph/concepts/persistence/#replay) for related context on replaying.
 
-This is a more advanced interaction pattern. In this interaction pattern the human can actually edit some of the agent's previous decisions. This can be done either during the flow (after a [breakpoint](./low_level.md#breakpoints), part of the [approval](#approval) flow) or after the fact (as part of [time-travel](#time-travel))
+See see [this guide](../how-tos/human_in_the_loop/time-travel.ipynb) for a detailed how-to on doing time-travel!
 
-See [this guide](../how-tos/human_in_the_loop/edit-graph-state.ipynb) for how do this in LangGraph.
+#### Forking
 
-## Time travel
+![](./img/human_in_the_loop/forking.png)
 
-This is a pretty advanced interaction pattern. In this interaction pattern, the human can look back at the list of previous checkpoints, find one they like, optionally [edit it](#edit-agent-actions), and then resume execution from there.
+Sometimes we want to fork past actions of an agent, and explore different paths through the graph.
 
-See [this guide](../how-tos/human_in_the_loop/time-travel.ipynb) for how to do this in LangGraph.
+`Editing`, as discussed above, is *exactly* how we do this for the *current* state of the graph! 
 
-## Review Tool Calls
+But, what if we want to fork *past* states of the graph?
 
-This is a specific type of human-in-the-loop interaction but it's worth calling out because it is so common. A lot of agent decisions are made via tool calling, so having a clear UX for reviewing tool calls is handy.
+For example, let's say we want to edit a particular checkpoint, `xxx`.
 
-A tool call consists of:
+We pass this `checkpoint_id` when we update the state of the graph.
 
-- The name of the tool to call
-- Arguments to pass to the tool
+```python
+config = {"configurable": {"thread_id": "1", "checkpoint_id": "xxx"}}
+graph.update_state(config, {"state": "updated state"}, )
+```
 
-Note that these tool calls can obviously be used for actually calling functions, but they can also be used for other purposes, like to route the agent in a specific direction.
-You will want to review the tool call for both of these use cases.
+This creates a new forked checkpoint, `xxx-fork`, which we can then run the graph from.
 
-When reviewing tool calls, there are few actions you may want to take.
+```python
+config = {'configurable': {'thread_id': '1', 'checkpoint_id': 'xxx-fork'}}
+for event in graph.stream(None, config, stream_mode="values"):
+    print(event)
+```
 
-1. Approve the tool call (and let the agent continue on its way)
-2. Manually change the tool call, either the tool name or the tool arguments (and let the agent continue on its way after that)
-3. Leave feedback on the tool call. This differs from (2) in that you are not changing the tool call directly, but rather leaving natural language feedback suggesting the LLM call it differently (or call a different tool). You could do this by either adding a `ToolMessage` and having the feedback be the result of the tool call, or by adding a `ToolMessage` (that simulates an error) and then a `HumanMessage` (with the feedback).
+See [this additional conceptual guide](https://langchain-ai.github.io/langgraph/concepts/persistence/#update-state) for related context on forking.
 
-See [this guide](../how-tos/human_in_the_loop/review-tool-calls.ipynb) for how to do this in LangGraph.
+See see [this guide](../how-tos/human_in_the_loop/time-travel.ipynb) for a detailed how-to on doing time-travel!
diff --git a/docs/docs/concepts/img/human_in_the_loop/approval.png b/docs/docs/concepts/img/human_in_the_loop/approval.png
diff --git a/docs/docs/concepts/img/human_in_the_loop/edit_graph_state.png b/docs/docs/concepts/img/human_in_the_loop/edit_graph_state.png
diff --git a/docs/docs/concepts/img/human_in_the_loop/forking.png b/docs/docs/concepts/img/human_in_the_loop/forking.png
diff --git a/docs/docs/concepts/img/human_in_the_loop/replay.png b/docs/docs/concepts/img/human_in_the_loop/replay.png
diff --git a/docs/docs/concepts/img/human_in_the_loop/wait_for_input.png b/docs/docs/concepts/img/human_in_the_loop/wait_for_input.png