Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

insertMany nosqlbench workload #784

Merged
merged 4 commits into from
Jan 11, 2024
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
13 changes: 2 additions & 11 deletions nosqlbench/http-jsonapi-vector-insertmany.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,9 +2,7 @@

## Description

The JSON API insertmany Dataset workflow targets Stargate's JSON API using JSON documents from an external dataset.
The [dataset](#dataset) is mandatory and should contain a vector per row that should be used as the input for write, read and update operations.
This workflow is perfect for testing Stargate performance using your own JSON dataset or any other realistic dataset.
The JSON API insertMany(ordered false) performance test workload

In contrast to other workflows, this one is not split into ramp-up and main phases. Instead, there is only the write phase.

Expand All @@ -15,26 +13,19 @@ In contrast to other workflows, this one is not split into ramp-up and main phas
The default scenario for http-jsonapi-vector-insertmany.yaml only has one operation - insert 20 records to the database
a time.

Note that error handling is set to `errors=timer,warn`, which means that in case of HTTP errors the scenario is not stopped.

## Dataset

### Vector Sample

Vector size is 1536 in the nosqlbench file. (openAI embedding vector standard size is 1536)
Sample dataset is in [vector dataset](vector-dataset.txt)

> If you want to test different vector-size, please change [http-jsonapi-vector-crud create-collection op](http-jsonapi-vector-crud.yaml) and [vector dataset](vector-dataset.txt)


## Sample Command

### Against AstraDB

> comment out `create-namespace` op in the [nosqlbench yaml file](http-jsonapi-vector-crud.yaml)

```
nb5 -v http-jsonapi-vector-crud docscount=1000 threads=20 jsonapi_host=Your-AstraDB-Host auth_token=Your-AstraDB-Token jsonapi_port=443 protocol=https path_prefix=/api/json namespace=Your-Keyspace
nb5 -v http-jsonapi-vector-crud docscount=1000 threads=20 jsonapi_host=Your-AstraDB-Host auth_token=Your-AstraDB-Token jsonapi_port=443 protocol=https path_prefix=/api/json keyspace=Your-Keyspace
```

### Against Local JSON API
Expand Down
Loading