Create and use a unique ES API key for each simulated client #1520

michaelbaamonde · 2022-06-15T19:10:52Z

This PR introduces the create_api_keys_per_client client option. If true, the coordinating load driver will create a unique API key per logical client after the benchmark's allocation matrix is created, but before any task execution begins. For any given client, its generated API key will be used for authentication for all of the tasks assigned to it. Upon benchmark completion, the coordinating load driver will delete all API keys that it created initially.

Basic auth credentials are required to create API keys at the start of the benchmark and delete them at the end. We do intend to support using a "global API key" for these administrative operations (see #1067 (comment)) but that will be a follow-up.

Here is an example CLI invocation that you can use to test:

esrally race --distribution-version=8.2.0 --car="defaults,trial-license,x-pack-security" --client-options="use_ssl:true,verify_certs:false,basic_auth_user:'rally',basic_auth_password:'rally-password',create_api_key_per_client:true" --track=geonames --test-mode

It will be used multiple times: rest api check, api key creation, api key deletion.

If the `create_api_key_per_client:true` client option is provided, the coordinating load driver will create a unique API key per logical client after the benchmark's allocation matrix is created, but before any task execution begins. For any given client, its generated API key will be used for authentication for all of the tasks assigned to it. Upon benchmark completion, the coordinating load driver will delete all API keys that it created initially.

esrally/driver/driver.py

pquentin

This looks really great! I yet have to try it, but expect to only leave nits.

docs/command_line_reference.rst

esrally/client/factory.py

esrally/driver/driver.py

Co-authored-by: Quentin Pradet <quentin.pradet@gmail.com>

esrally/client/factory.py

This indicates that ES Security isn't enabled, so we inform the user and fail the benchmark. Since this isn't recoverable, we don't bother retrying.

- Fix a copy/paste error that was calling side_effect on the wrong mock - Ensure that call counts are correct - Be more explicit about what arguments we expect calls to contain

esrally/client/factory.py

pquentin

I'd like to reiterate that I'm a big fan of the amount of care that went into this pull requests and its tests.

I have a final issue: I can't get Rally to show me the error message that you carefully crafted. If I leave out basic auth credentials or don't ask for x-pack then I only get a LaunchError in my console and an unprintable RallyError object in the logs. I should see the error messages instead. (This might be unrelated to this pull request. If yes, we can fix it in another one.)

tests/client/factory_test.py

pquentin · 2022-06-27T11:22:22Z

tests/client/factory_test.py

+    @pytest.mark.parametrize("version", ["7.9.0", "7.10.0"])
+    @mock.patch("elasticsearch.Elasticsearch")
+    def test_raises_exception_when_api_key_deletion_fails(self, es, version):
+        es.info.return_value = {"version": {"number": version}}
+        ids = ["foo", "bar", "baz"]
+        es.security.invalidate_api_key.side_effect = [
+            elasticsearch.TransportError(503, "Service Unavailable"),
+            elasticsearch.TransportError(401, "Unauthorized"),
+            Exception("Whoops!"),
+        ]
+
+        with pytest.raises(exceptions.RallyError, match=re.escape(f"Could not delete API keys with the following IDs: {ids}")):
+            client.delete_api_keys(es, ids)
+


nit: Would you agree that this is subset of the test_legacy_api_key_deletion_reports_only_undeleted_ids_in_exception test?

This actually raised a more substantial issue in my mind, which I've addressed in 8eec4b1:

It's possible for the 7.10.0+ version of the deletion code to fail silently if we rely just on exceptions to catch errors. This is because it's basically a bulk request, which means that the response can contain both successful and unsuccessful deletions but still have an HTTP 200 status code. That commit handles that scenario, modifies the logic for the "legacy" code, and refactors the relevant tests.

esrally/client/factory.py

michaelbaamonde · 2022-06-27T14:57:28Z

I can't get Rally to show me the error message that you carefully crafted. If I leave out basic auth credentials or don't ask for x-pack then I only get a LaunchError in my console and an unprintable RallyError object in the logs. I should see the error messages instead. (This might be unrelated to this pull request. If yes, we can fix it in another one.)

@pquentin I think we may not always fail particularly gracefully in general if the combination of client options and cars provided are somehow invalid (implementing #580 would help uncover some of these scenarios). But in this PR's case, here's how I've forced the main failure modes that are specific to API keys that we're trying to catch:

Basic auth missing

Invocation:

esrally race --distribution-version=8.2.0 --car="defaults,trial-license,x-pack-security" --client-options="create_api_key_per_client:true" --track=geonames --test-mode

Output

    [INFO] Race id is [3a64c6f4-287c-4311-b249-e5d0462c46c4]
    [INFO] Preparing for race ...
    Basic auth credentials are required in order to create API keys.
    Missing basic auth client options are: ['basic_auth_user', 'basic_auth_password']
    Read the documentation at https://esrally.readthedocs.io/en/latest/command_line_reference.html#client-options
    [ERROR] Cannot race. Traceback (most recent call last):
      File "/home/baamonde/code/elastic/rally/esrally/actor.py", line 92, in guard
        return f(self, msg, sender)
      File "/home/baamonde/code/elastic/rally/esrally/driver/driver.py", line 272, in receiveMsg_PrepareBenchmark
        self.coordinator.prepare_benchmark(msg.track)
      File "/home/baamonde/code/elastic/rally/esrally/driver/driver.py", line 677, in prepare_benchmark
        es_clients = self.create_es_clients()
      File "/home/baamonde/code/elastic/rally/esrally/driver/driver.py", line 605, in create_es_clients
        es[cluster_name] = self.es_client_factory(cluster_hosts, cluster_client_options).create()
      File "/home/baamonde/code/elastic/rally/esrally/client/factory.py", line 123, in __init__
        raise exceptions.SystemSetupError(
    esrally.exceptions.SystemSetupError: You must provide the 'basic_auth_user' and
      'basic_auth_password' client options in addition to
      'create_api_key_per_client' in order to create client API keys.

Basic auth incomplete (password missing)

Invocation:

esrally race --distribution-version=8.2.0 --car="defaults,trial-license,x-pack-security" --client-options="create_api_key_per_client:true,basic_auth_user:rally" --track=geonames --test-mode

Output

[INFO] Race id is [b93464a5-105a-484a-acdb-05b527958cfe]
[INFO] Preparing for race ...
Basic auth credentials are required in order to create API keys.
Missing basic auth client options are: ['basic_auth_password']
Read the documentation at https://esrally.readthedocs.io/en/latest/command_line_reference.html#client-options
[ERROR] Cannot race. Traceback (most recent call last):
  File "/home/baamonde/code/elastic/rally/esrally/actor.py", line 92, in guard
    return f(self, msg, sender)
  File "/home/baamonde/code/elastic/rally/esrally/driver/driver.py", line 272, in receiveMsg_PrepareBenchmark
    self.coordinator.prepare_benchmark(msg.track)
  File "/home/baamonde/code/elastic/rally/esrally/driver/driver.py", line 677, in prepare_benchmark
    es_clients = self.create_es_clients()
  File "/home/baamonde/code/elastic/rally/esrally/driver/driver.py", line 605, in create_es_clients
    es[cluster_name] = self.es_client_factory(cluster_hosts, cluster_client_options).create()
  File "/home/baamonde/code/elastic/rally/esrally/client/factory.py", line 123, in __init__
    raise exceptions.SystemSetupError(
esrally.exceptions.SystemSetupError: You must provide the 'basic_auth_user' and
  'basic_auth_password' client options in addition to
  'create_api_key_per_client' in order to create client API keys.

Security not enabled

Invocation:

esrally race --distribution-version=8.2.0 --client-options="create_api_key_per_client:true,basic_auth_user:'rally',basic_auth_password:'rally-password'" --track=geonames --test-mode --kill-running-processes

Output

[INFO] Race id is [e7cedccf-6508-4767-97a7-1e28c2d076f1]
[INFO] Preparing for race ...
[INFO] Racing on track [geonames], challenge [append-no-conflicts] and car ['defaults'] with version [8.2.0].

[ERROR] Cannot race. Traceback (most recent call last):
  File "/home/baamonde/code/elastic/rally/esrally/client/factory.py", line 287, in create_api_key
    return es.security.create_api_key({"name": f"rally-client-{client_id}"})
  File "/home/baamonde/code/elastic/rally/.venv/lib/python3.8/site-packages/elasticsearch/client/utils.py", line 168, in _wrapped
    return func(*args, params=params, headers=headers, **kwargs)
  File "/home/baamonde/code/elastic/rally/.venv/lib/python3.8/site-packages/elasticsearch/client/security.py", line 117, in create_api_key
    return self.transport.perform_request(
  File "/home/baamonde/code/elastic/rally/.venv/lib/python3.8/site-packages/elasticsearch/transport.py", line 458, in perform_request
    raise e
  File "/home/baamonde/code/elastic/rally/.venv/lib/python3.8/site-packages/elasticsearch/transport.py", line 419, in perform_request
    status, headers_response, data = connection.perform_request(
  File "/home/baamonde/code/elastic/rally/.venv/lib/python3.8/site-packages/elasticsearch/connection/http_urllib3.py", line 277, in perform_request
    self._raise_error(response.status, raw_data)
  File "/home/baamonde/code/elastic/rally/.venv/lib/python3.8/site-packages/elasticsearch/connection/base.py", line 330, in _raise_error
    raise HTTP_EXCEPTIONS.get(status_code, TransportError)(
elasticsearch.exceptions.TransportError: TransportError(405, 'Incorrect HTTP method for uri [/_security/api_key] and method [PUT], allowed: [POST]', 'Incorrect HTTP method for uri [/_security/api_key] and method [PUT], allowed: [POST]')

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/baamonde/code/elastic/rally/esrally/driver/driver.py", line 657, in create_api_key
    api_key = client.create_api_key(es, client_id)
  File "/home/baamonde/code/elastic/rally/esrally/client/factory.py", line 291, in create_api_key
    raise exceptions.SystemSetupError(
esrally.exceptions.SystemSetupError: Got status code 405 when attempting to create API keys. Is Elasticsearch Security enabled?

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/baamonde/code/elastic/rally/esrally/actor.py", line 92, in guard
    return f(self, msg, sender)
  File "/home/baamonde/code/elastic/rally/esrally/driver/driver.py", line 277, in receiveMsg_StartBenchmark
    self.coordinator.start_benchmark()
  File "/home/baamonde/code/elastic/rally/esrally/driver/driver.py", line 750, in start_benchmark
    resp = self.create_api_key(self.default_sync_es_client, client_id)
  File "/home/baamonde/code/elastic/rally/esrally/driver/driver.py", line 664, in create_api_key
    raise exceptions.SystemSetupError(e.message)
esrally.exceptions.SystemSetupError: Got status code 405 when attempting to
create API keys. Is Elasticsearch Security enabled?

What did you try? If there's a scenario that's API-key specific that we can handle better, let's do it in this PR. If it's a more generic issue with invalid client options, a follow-up sounds good.

michaelbaamonde · 2022-06-27T16:18:28Z

@elasticmachine test this please

In 7.10.0+, API keys can be invalidated in bulk. Like bulk indexing requests, it's possible for some of the keys specified in the request to be deleted while others fail. In this scenario, we won't actually get an exception, so we need to parse the response ourselves to make sure that all keys were actually deleted. If we don't do this, it's possible that we'd silently ignore errors, leaving API keys behind that we didn't intend to. This commit implements this error handling and also refactors the "legacy" API key deletion code to use the same data structures for tracking which API keys have been deleted and which failed. If there are any un-deleted API keys after we've exhausted our number of attempts, we report their IDs.

pquentin

This works great, thanks! Ship it with or without the change to the except clause. And ignore my other comment. :)

pquentin · 2022-06-29T13:09:19Z

esrally/client/factory.py

+                time.sleep(1)
+            else:
+                raise_exception(remaining, cause=e)
+        except Exception as e:


Should we only catch RallyError here? I think for something else like say a KeyError there's no point in retrying.

esrally/client/factory.py

michaelbaamonde force-pushed the api-keys branch 4 times, most recently from 8e5732f to a81189e Compare June 21, 2022 00:06

Mike Baamonde added 4 commits June 21, 2022 10:32

Add functions for creating and deleting API keys.

ee7bcf4

Optionally create async ES clients with API keys.

f91a0d5

Store the driver's default ES sync client as an attribute.

c514017

It will be used multiple times: rest api check, api key creation, api key deletion.

michaelbaamonde force-pushed the api-keys branch from a81189e to 0c80c1f Compare June 21, 2022 14:33

michaelbaamonde commented Jun 21, 2022

View reviewed changes

esrally/driver/driver.py Show resolved Hide resolved

michaelbaamonde marked this pull request as ready for review June 21, 2022 14:51

michaelbaamonde requested review from DJRickyB and pquentin June 21, 2022 14:52

pquentin requested changes Jun 23, 2022

View reviewed changes

michaelbaamonde and others added 2 commits June 23, 2022 12:09

Update docs/command_line_reference.rst

f2c6e65

Co-authored-by: Quentin Pradet <quentin.pradet@gmail.com>

Update esrally/client/factory.py

f79cb8e

Co-authored-by: Quentin Pradet <quentin.pradet@gmail.com>

pquentin reviewed Jun 23, 2022

View reviewed changes

esrally/client/factory.py Outdated Show resolved Hide resolved

Mike Baamonde added 4 commits June 24, 2022 10:42

Use kwargs when crating an ApiKey namedtuple.

d9d7ec5

Inline version-specific functions for API key deletion.

5b86c7b

Catch Exception, not BaseException.

ea5a9d8

Formatting.

504ec30

pquentin reviewed Jun 24, 2022

View reviewed changes

esrally/client/factory.py Outdated Show resolved Hide resolved

Mike Baamonde added 3 commits June 24, 2022 12:30

Refactor API key deletion.

e8ea1db

Fail immediately if create_api_key gets an HTTP 405 error.

4076a3c

This indicates that ES Security isn't enabled, so we inform the user and fail the benchmark. Since this isn't recoverable, we don't bother retrying.

Clean up unit tests for creating/deleting API keys. Mainly:

45a4e63

- Fix a copy/paste error that was calling side_effect on the wrong mock - Ensure that call counts are correct - Be more explicit about what arguments we expect calls to contain

michaelbaamonde commented Jun 24, 2022

View reviewed changes

esrally/client/factory.py Show resolved Hide resolved

pquentin reviewed Jun 27, 2022

View reviewed changes

Don't repeat the same exception in unit test side effects.

5757b2c

Mike Baamonde added 5 commits June 27, 2022 17:22

Log that we're creating an API key.

89af97b

Remove unused variable.

beec396

Fix control flow for deleting API keys.

2206c40

Chain exceptions appropriately.

0ce0e94

pquentin approved these changes Jun 29, 2022

View reviewed changes

michaelbaamonde added highlight A substantial improvement that is worth mentioning separately in release notes :Load Driver Changes that affect the core of the load driver such as scheduling, the measurement approach etc. enhancement Improves the status quo labels Jun 29, 2022

michaelbaamonde merged commit 9d2dd33 into elastic:master Jun 29, 2022

pquentin added this to the 2.5.1 milestone Jul 4, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create and use a unique ES API key for each simulated client #1520

Create and use a unique ES API key for each simulated client #1520

michaelbaamonde commented Jun 15, 2022 •

edited

Loading

pquentin left a comment

pquentin left a comment

pquentin Jun 27, 2022

michaelbaamonde Jun 27, 2022

michaelbaamonde commented Jun 27, 2022

michaelbaamonde commented Jun 27, 2022

pquentin left a comment

pquentin Jun 29, 2022

Create and use a unique ES API key for each simulated client #1520

Create and use a unique ES API key for each simulated client #1520

Conversation

michaelbaamonde commented Jun 15, 2022 • edited Loading

pquentin left a comment

Choose a reason for hiding this comment

pquentin left a comment

Choose a reason for hiding this comment

pquentin Jun 27, 2022

Choose a reason for hiding this comment

michaelbaamonde Jun 27, 2022

Choose a reason for hiding this comment

michaelbaamonde commented Jun 27, 2022

michaelbaamonde commented Jun 27, 2022

pquentin left a comment

Choose a reason for hiding this comment

pquentin Jun 29, 2022

Choose a reason for hiding this comment

michaelbaamonde commented Jun 15, 2022 •

edited

Loading