Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[META] API Based Spark Connector and Benchmars #1746

Open
brijos opened this issue Jun 15, 2023 · 1 comment
Open

[META] API Based Spark Connector and Benchmars #1746

brijos opened this issue Jun 15, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@brijos
Copy link

brijos commented Jun 15, 2023

Is your feature request related to a problem?
Machines generating operational data continue to grow in number and storage size and to present new use cases. Companies rely on logs to aid in troubleshooting and to better understand how their digital offerings are being used. Not all data is equal, though. As community members grow, they have to decide what high-value data to store in OpenSearch and what to store in object stores, due to cost/scale. Customers need to search both OpenSearch and object stores using separate tools and do so with the low latency required for log analytics use cases such as observability and security analytics. It is expensive, or impossible, to perform complex joined queries with filtered analytics at scale, not to mention to provide a unified view across use cases. Customers are left incurring costs on data stored elsewhere that they cannot easily put to use.

What solution would you like?
OpenSearch to query object stores using Apache Spark and build indexes based on object store data so that search query latency is suitable for those troubleshooting live problems in their system.

What alternatives have you considered?
Apache Spark has the compute required to power complex queries and aggregations that OpenSearch does not and has a large open source community. The team believes that compute should be modular in the future, but is starting with Apache Spark.

Do you have any additional context?
Add any other context or screenshots about the feature request here.

@brijos brijos added enhancement New feature or request untriaged labels Jun 15, 2023
@brijos
Copy link
Author

brijos commented Jun 15, 2023

#1375

@anirudha anirudha changed the title [META] API Based Spark Connector and Benchmark Tool [META] API Based Spark Connector and Benchmars Jun 29, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants