Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature][Connector V2] Add conditions for mongodb source incremental pull #3567

Closed
2 of 3 tasks
wangsvip opened this issue Nov 24, 2022 · 1 comment · Fixed by #3697
Closed
2 of 3 tasks

[Feature][Connector V2] Add conditions for mongodb source incremental pull #3567

wangsvip opened this issue Nov 24, 2022 · 1 comment · Fixed by #3697
Assignees

Comments

@wangsvip
Copy link

wangsvip commented Nov 24, 2022

Search before asking

  • I had searched in the feature and found no similar feature requirement.

Description

I will regularly pull the incremental data of mongodb into hive every day. At present, the extraction method of mongodb is to pull the full amount, and then filter the incremental data in the transform. This method is not friendly. If this table has 1 billion data, it will increase every day. The amount of 5,000 records requires the source side to pull in 1 billion data, and then filter out 5,000 records on the transform side. DBA disagrees with this approach, saying that it consumes a lot of resources, and it needs to be done once a day. If there are too many, the CPU on the direct line will explode. Therefore, I hope to add a place where the filter conditions can be written on the source side, so that I can directly check the 5000 incremental data and export it.

Usage Scenario

No response

Related issues

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Code of Conduct

@MonsterChenzhuo
Copy link
Contributor

please assign to me; thx

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Done
Development

Successfully merging a pull request may close this issue.

3 participants