-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
(WIP) Updated all calls to get file system through FSUtils.getFs() #191
Conversation
1bcce25
to
ef5a407
Compare
private static DistributedFileSystem dfs; | ||
private static Logger logger = LogManager.getLogger(TestMultipleFSExample.class); | ||
|
||
@Parameter(names={"--table-path", "-p"}, description = "path for Hoodie sample table") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is just a test, and not a CLI right. so why the parameters?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@gekath Have you been able to test this out in production? This is rather big change, so would like to understand how much testing has been done..
63b301d
to
d6af8c9
Compare
@vinothchandar this PR focused on testing that a read from local, and write to HDFS will result in correct count of records, and this functionality should be similar between any two file systems. Our specific goal is to consider use case of reading from HDFS, writing to GCS. Will look at adding additional testing, (changed to WIP). Key updates are in:
|
Sounds good. Will take a closer look sometime this week. |
} | ||
|
||
public static FileSystem getFs(String path, Configuration conf) { | ||
System.out.println(path); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
logger or please remote s.o.pln
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changes look safe to me. Need to look at the stuff at HoodieWrapperFileSystem more closely..
Please ping back once you have tested this more for your use-cases.. I ll pull and test on HDFS at Uber as well and we can proceed from there.
@gekath any update on this? |
Hi @vinothchandar , we're working on integrating Hoodie into our current pipeline and then will proceed to test out performance. |
Sounds good. please keep us posted on progress/blockers. |
@gekath - Do you have any updates on this PR? Thanks. |
@prazanna not much progress on this front yet. |
from what @gekath and I discusssed offline.. Seems this is being actively tested :) |
82e2216
to
3949388
Compare
Merge conflicts Merge conflicts Removed print statement Merge conflicts Merge conflicts Merge conflicts Merge conflict Removed default getFs that takes no arguments.
3949388
to
481c4a5
Compare
@alunarbeach do you want to drive this? |
Closing in favor of #293 |
Co-authored-by: StreamingFlames <18889897088@163.com> Co-authored-by: Nicholas Jiang <programgeek@163.com> Co-authored-by: Y Ethan Guo <ethan.guoyihua@gmail.com> Co-authored-by: Sagar Sumit <sagarsumit09@gmail.com> Co-authored-by: Alexey Kudinkin <alexey@infinilake.com> Co-authored-by: RexAn <bonean131@gmail.com>
Updates all consumers of FSUtils.getFs() to accept a path, and infers scheme based on given path and/or configuration, instead of inferring scheme through
fs.defaultFS
. Calls to get a filesystem go through FSUtils.getFs(), aimed to support multiple filesystems./fix #96
/cc @vinothchandar @prazanna @zqureshi