Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement datasource metrics #75

Open
kortemik opened this issue Aug 26, 2024 · 1 comment
Open

Implement datasource metrics #75

kortemik opened this issue Aug 26, 2024 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@kortemik
Copy link
Member

kortemik commented Aug 26, 2024

Description
Implement datasource metrics

Use case or motivation behind the feature request
Currently users do not know the progress of a query which is frustrating. This needs to be fixed. Now that the datasource uses Spark 3 APIs it is possible to provide metric information about the datasource progress.

Please create at least following metrics aggregated into JSON data format:
Driver:

  • Current archive offset
  • Kafka offset

Task

  • Amount of records processed
  • Amount of bytes processed
  • Bytes per second
  • Records per second

Please consider implementing a precreated (hourly/automatic) buckets within the driver for earliest-latest span and binning the processed data in the tasks into these created buckets.

Please define JSON schema once initial development is done.

Related issues
teragrep/ajs_01#70 depends on this

Additional context
See example at #74 and close when implemented.

@kortemik kortemik added the enhancement New feature or request label Aug 26, 2024
@kortemik
Copy link
Member Author

this feature replaces "metricsLogger" in DPLParserCatalystContext on pth_10.

@51-code 51-code assigned eemhu and unassigned 51-code Sep 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants