-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathindex.html
155 lines (144 loc) · 8.64 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
<!DOCTYPE html>
<html>
<head>
<!-- Global site tag (gtag.js) - Google Analytics -->
<script async src="https://www.googletagmanager.com/gtag/js?id=G-BFMDYSTN1L"></script>
<script>
window.dataLayer = window.dataLayer || [];
function gtag(){dataLayer.push(arguments);}
gtag('js', new Date());
gtag('config', 'G-BFMDYSTN1L');
</script>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0, shrink-to-fit=no">
<title>MemGPT</title>
<meta name="title" content="MemGPT" />
<meta name="description" content="Memory-GPT (MemGPT) - Towards LLMs as Operating Systems - Teach LLMs to manage their own memory for unbounded context!" />
<meta name="keywords" content="MemGPT, AI, ChatBot, GPT, GPT-4, llm">
<meta property="og:type" content="website" />
<meta property="og:url" content="https://memgpt.ai/" />
<meta property="og:title" content="MemGPT" />
<meta property="og:description" content="Memory-GPT (MemGPT) - Towards LLMs as Operating Systems - Teach LLMs to manage their own memory for unbounded context!" />
<meta property="og:image" content="https://memgpt.ai/assets/img/memgpt-system-diagram.png" />
<meta property="twitter:card" content="summary_large_image" />
<meta property="twitter:url" content="https://memgpt.ai/" />
<meta property="twitter:title" content="MemGPT" />
<meta property="twitter:description" content="Memory-GPT (MemGPT) - Towards LLMs as Operating Systems - Teach LLMs to manage their own memory for unbounded context!" />
<meta property="twitter:image" content="https://memgpt.ai/assets/img/memgpt-system-diagram.png" />
<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/twitter-bootstrap/4.5.0/css/bootstrap.min.css">
<link rel="stylesheet" href="https://fonts.googleapis.com/css?family=Source+Sans+Pro">
<link rel="stylesheet" href="assets/css/Highlight-Clean.css">
<link rel="stylesheet" href="assets/css/styles.css">
<link rel="stylesheet" href="assets/css/Team-Clean.css">
<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.0.0-beta3/css/all.min.css">
<link rel="icon" href="favicon.ico" type="image/x-icon">
<link rel="icon" type="image/png" sizes="32x32" href="/favicon-32x32.png">
<link rel="icon" type="image/png" sizes="16x16" href="/favicon-16x16.png">
</head>
<body>
<div class="highlight-clean" style="padding-bottom: 10px;">
<div class="container">
<div style="display: flex; align-items: center; justify-content: center;">
<img src="assets/img/memgpt_logo_transparent.png" alt="MemGPT Logo" style="height: 3.5em; margin-right: 0.5em;">
<span style="font-size: 3.5em;">MemGPT</span>
</div>
<h2 class="text-center">Towards LLMs as Operating Systems</h2>
</div>
<div><p></p></div>
<div class="container" style="max-width: 768px;">
<div class="row">
<div class="col-md-12">
<h4 class="text-center" style="margin: 0px;">
<a class="text-center" href="https://people.eecs.berkeley.edu/~cpacker/">Charles Packer</a>,
<a class="text-center" href="https://swooders.com/">Sarah Wooders</a>,
<a class="text-center" href="https://people.eecs.berkeley.edu/~kevinlin/">Kevin Lin</a>, <br>
<a class="text-center" href="https://www.vivi.sh/">Vivian Fang</a>,
<a class="text-center" href="https://shishirpatil.github.io/">Shishir G. Patil</a>,
<a class="text-center" href="https://people.eecs.berkeley.edu/~istoica/">Ion Stoica</a>,
<a class="text-center" href="https://people.eecs.berkeley.edu/~jegonzal/">Joseph E. Gonzalez</a>
</h4>
<h4></h4>
<h5 class="text-center">UC Berkeley</h5>
</div>
</div>
</div>
<div><p></p></div>
<div class="buttons" style="margin-bottom: 8px;">
<a class="btn btn-primary" role="button" href="https://arxiv.org/abs/2310.08560">
<i class="fas fa-file-alt" style="margin-right: 5px;"></i> Paper
</a>
<a class="btn btn-primary" role="button" href="https://discord.gg/9GEQrxmVyE">
<i class="fab fa-discord" style="margin-right: 5px;"></i> Discord
</a>
<a class="btn btn-primary" role="button" href="https://github.com/cpacker/MemGPT">
<i class="fab fa-github" style="margin-right: 5px;"></i> GitHub
</a>
<a class="btn btn-primary" role="button" href="https://huggingface.co/MemGPT">
<i class="fas fa-database" style="margin-right: 5px;"></i> Dataset
</a>
</div>
</div>
<div><p></p></div>
<div>
<div class="container" style="max-width: 850px;">
<div class="row">
<div class="col-md-12">
<h3 class="text-center">Teach LLMs to manage their own memory for unbounded context!</h3>
</div>
</div>
</div>
</div>
<div><p></p></div>
<div class="container" style="max-width: 768px;">
<div class="row">
<div class="col-md-12 text-center"><img src="assets/img/memgpt-system-diagram.png"
style="width: 100%;margin-bottom: 20px;margin-top: 10px" alt="MemGPT system overview">
</div>
</div>
</div>
</div>
<div><p></p></div>
<hr style="max-width: 768px;">
<div class="container" style="max-width: 768px;">
<div class="row">
<div class="col-md-12">
<h2>Overview</h2>
<ul>
<li>LLMs are increasingly being used for perpetual chats </li>
<li>Limited context lengths makes perpetual chat challenging</li>
<li>MemGPT manages a <mark>virtual context</mark> (inspired by virtual memory in operating systems) to create unbounded LLM context</li>
<li><mark>With MemGPT, we demonstrate that LLMs can be taught to manage their own memory!</mark>
</li>
</ul>
</div>
</div>
<div><p></p></div>
<hr style="max-width: 768px;">
<div class="container" style="max-width: 768px;">
<div class="row">
<div class="col-md-12">
<h2>Abstract</h2>
<p>
Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis. To enable using context beyond limited context windows, we propose virtual context management, a technique drawing inspiration from hierarchical memory systems in traditional operating systems which provide the illusion of an extended virtual memory via paging between physical memory and disk. Using this technique, we introduce MemGPT (MemoryGPT), a system that intelligently manages different storage tiers in order to effectively provide extended context within the LLM’s limited context window. We evaluate our OS-inspired design in two domains where the limited context windows of modern LLMs severely handicaps their performance: document analysis, where MemGPT is able to analyze large documents that far exceed the underlying LLM’s context window, and multi-session chat, where MemGPT can create conversational agents that remember, reflect, and evolve dynamically through long-term interactions with their users. We release MemGPT code and data for our experiments at <a href="https://memgpt.ai" style="color: inherit;">https://memgpt.ai</a>.
<br>
</p>
</div>
</div>
</div>
<hr style="max-width: 768px;">
<div class="container" style="max-width: 768px;">
<div class="row">
<div class="col-md-12">
<h2>Citation</h2>
<code>
@article{packer2023memgpt,<br>
title={{MemGPT}: Towards LLMs as Operating Systems}, <br>
author={Packer, Charles and Wooders, Sarah and Lin, Kevin and Fang, Vivian and Patil, Shishir G. and Stoica, Ion and Gonzalez, Joseph E.}, <br>
journal={arXiv preprint arXiv:2310.08560},<br>
year={2023}<br>
}
</code></div>
</div>
</div>
</body>
</html>