feat(hbase): support gen HFile for hbase v2 (BETA) #358
…schema to SchemaCache fix apache#333
…schema to SchemaCache fix apache#333
merge master into schemaCache-optimize
…incubator-hugegraph-toolchain into schemaCache-optimize
@JackyYangPassion Is this an improved part?
Codecov Report
@@             Coverage Diff              @@
##             master     #358      +/-   ##
============================================
- Coverage     64.82%   62.44%   -2.38%
- Complexity     1851     1864     +13
============================================
  Files           255      260      +5
  Lines          9081     9462    +381
  Branches        837      874     +37
============================================
+ Hits           5887     5909     +22
- Misses         2810     3169    +359
  Partials        384      384
OK, mark it as to be reviewed as well. Could you also handle the third-party dependencies check?
Resolved review threads (outdated) on:
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/builder/EdgeBuilder.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/builder/VertexBuilder.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/direct/loader/DirectLoader.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/direct/loader/HBaseDirectLoader.java
adjust code style fix apache#357
1. The code style has been adjusted,
fix third-party dependencies error fix apache#357
Thanks. The third-party dependency check seems to have failed; do you need some help?
Thanks for your contribution~
Please also address the other comments at https://github.com/apache/incubator-hugegraph-toolchain/pull/358/files (search for "ago"), as well as the comments on the file LoadOptions.java.
Resolved review threads (outdated) on:
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/direct/loader/HBaseDirectLoader.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/direct/util/SinkToHBase.java
adjust code style fix apache#357
@imbajin Hi, I can help fix the loader CI check failure.
Thanks, I have already adopted the basic code. The current difference is:
expected:
{
"version":"2.0",
"structs":[
{
"id":"1",
"skip":false,
"input":{
"type":"FILE",
"path":"users.dat",
"file_filter":{
"extensions":[
"*"
]
},
"format":"TEXT",
"delimiter":"::",
"date_format":"yyyy-MM-dd HH:mm:ss",
"time_zone":"GMT+8",
"skipped_line":{
"regex":"(^#|^//).*|"
},
"compression":"NONE",
"batch_size":500,
"header":[
"UserID",
"Gender",
"Age",
"Occupation",
"Zip-code"
],
"charset":"UTF-8",
"list_format":null
},
"vertices":[
{
"label":"user",
"skip":false,
"id":null,
"unfold":false,
"field_mapping":{
"UserID":"id"
},
"value_mapping":{
},
"selected":[
],
"ignored":[
"Occupation",
"Zip-code",
"Gender",
"Age"
],
"null_values":[
""
],
"update_strategies":{
},
"batch_size":500
}
],
"edges":[
]
},
{
"id":"2",
"skip":false,
"input":{
"type":"FILE",
"path":"ratings.dat",
"file_filter":{
"extensions":[
"*"
]
},
"format":"TEXT",
"delimiter":"::",
"date_format":"yyyy-MM-dd HH:mm:ss",
"time_zone":"GMT+8",
"skipped_line":{
"regex":"(^#|^//).*|"
},
"compression":"NONE",
"batch_size":500,
"header":[
"UserID",
"MovieID",
"Rating",
"Timestamp"
],
"charset":"UTF-8",
"list_format":null
},
"vertices":[
],
"edges":[
{
"label":"rating",
"skip":false,
"source":[
"UserID"
],
"unfold_source":false,
"target":[
"MovieID"
],
"unfold_target":false,
"field_mapping":{
"UserID":"id",
"MovieID":"id",
"Rating":"rate"
},
"value_mapping":{
},
"selected":[
],
"ignored":[
"Timestamp"
],
"null_values":[
""
],
"update_strategies":{
},
"batch_size":500
}
]
}
]
}
actual:
{
"version":"2.0",
"structs":[
{
"id":"1",
"skip":false,
"input":{
"type":"FILE",
"path":"users.dat",
"file_filter":{
"extensions":[
"*"
]
},
"format":"TEXT",
"delimiter":"::",
"date_format":"yyyy-MM-dd HH:mm:ss",
"time_zone":"GMT+8",
"skipped_line":{
"regex":"(^#|^//).*|"
},
"compression":"NONE",
"batch_size":500,
"header":[
"UserID",
"Gender",
"Age",
"Occupation",
"Zip-code"
],
"charset":"UTF-8",
"list_format":null
},
"vertices":[
{
"label":"user",
"skip":false,
"id":null,
"unfold":false,
"field_mapping":{
"UserID":"id"
},
"value_mapping":{
},
"selected":[
],
"ignored":[
"Occupation",
"Zip-code",
"Gender",
"Age"
],
"null_values":[
""
],
"update_strategies":{
},
"batch_size":500
}
],
"edges":[
]
},
{
"id":"2",
"skip":false,
"input":{
"type":"FILE",
"path":"ratings.dat",
"file_filter":{
"extensions":[
"*"
]
},
"format":"TEXT",
"delimiter":"::",
"date_format":"yyyy-MM-dd HH:mm:ss",
"time_zone":"GMT+8",
"skipped_line":{
"regex":"(^#|^//).*|"
},
"compression":"NONE",
"batch_size":500,
"header":[
"UserID",
"MovieID",
"Rating",
"Timestamp"
],
"charset":"UTF-8",
"list_format":null
},
"vertices":[
],
"edges":[
{
"label":"rating",
"skip":false,
"source":[
"UserID"
],
"unfold_source":false,
"target":[
"MovieID"
],
"unfold_target":false,
"field_mapping":{
"UserID":"id",
"MovieID":"id",
"Rating":"rate"
},
"value_mapping":{
},
"selected":[
],
"ignored":[
"Timestamp"
],
"null_values":[
""
],
"update_strategies":{
},
"batch_size":500
}
]
}
],
"backendStoreInfo":null
}
seems
The configuration of the storage layer that bulkLoad depends on is specified in struct.json, which is why backendStoreInfo was added. A follow-up iteration will obtain the storage-layer configuration from the server instead.
It's fine, just adopt it in the test 😄 (and do the same for any other test problems, if they exist).
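For readers comparing the two JSON dumps above, the only difference is the extra top-level `backendStoreInfo` key in the actual output. The following is a minimal sketch of that comparison in plain Java; the class `MappingKeyDiff` and its helper methods are hypothetical names used only for illustration, not part of hugegraph-loader, and the nested `structs` content is replaced by a placeholder.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical helper, not part of hugegraph-loader.
class MappingKeyDiff {

    // Keys present in `actual` but missing from `expected`, sorted for stable output.
    public static Set<String> addedKeys(Map<String, Object> expected,
                                        Map<String, Object> actual) {
        Set<String> added = new TreeSet<>(actual.keySet());
        added.removeAll(expected.keySet());
        return added;
    }

    // Top-level keys of the expected mapping JSON (nested values elided).
    public static Map<String, Object> expectedTopLevel() {
        Map<String, Object> m = new HashMap<>();
        m.put("version", "2.0");
        m.put("structs", "<elided>");
        return m;
    }

    // Top-level keys of the actual mapping JSON: same keys plus the new field.
    public static Map<String, Object> actualTopLevel() {
        Map<String, Object> m = expectedTopLevel();
        m.put("backendStoreInfo", null); // HashMap permits null values
        return m;
    }

    public static void main(String[] args) {
        // prints [backendStoreInfo]
        System.out.println(addedKeys(expectedTopLevel(), actualTopLevel()));
    }
}
```

Under this reading, updating the test only requires adding `"backendStoreInfo":null` to the expected JSON string.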
Resolved review threads (outdated) on:
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/direct/loader/HBaseDirectLoader.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/direct/util/SinkToHBase.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/executor/LoadOptions.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/spark/HugeGraphSparkLoader.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/util/MappingUtil.java
Thanks. We could handle the third-party dependencies together before the release (to avoid wasting a lot of time on them).
Do I need to fix the failed third-party dependencies check?
OK.
We can downgrade the version if necessary; the hadoop dependency does not seem to be excludable.
Yes, the loader needs the hadoop dependency. Internally, we read data from HDFS and load it into the graph.
close #357
1. Support writing vertices/edges directly to the KV storage
2. Only customString and customNumber IDs are supported for now
3. Submit the loader code that bypasses the server when writing to HBase
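One constraint behind generating HFiles directly is that HBase expects the cells in an HFile to be written in ascending row-key order, compared as unsigned bytes. The sketch below illustrates that ordering for the two supported ID kinds. The byte encodings here (`encodeStringId`, `encodeNumberId`) are assumptions made for illustration and are not the serialization that hugegraph-loader actually uses; the class name `RowKeySortSketch` is likewise hypothetical.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch; the real row-key format in hugegraph-loader may differ.
class RowKeySortSketch {

    // Assumed encoding for a custom string ID: raw UTF-8 bytes.
    public static byte[] encodeStringId(String id) {
        return id.getBytes(StandardCharsets.UTF_8);
    }

    // Assumed encoding for a custom number ID: 8 bytes, big-endian, with the
    // sign bit flipped so that unsigned byte order matches numeric order.
    public static byte[] encodeNumberId(long id) {
        return ByteBuffer.allocate(Long.BYTES).putLong(id ^ Long.MIN_VALUE).array();
    }

    // Returns a copy of the keys sorted in unsigned lexicographic byte order.
    public static List<byte[]> sortRowKeys(List<byte[]> keys) {
        List<byte[]> sorted = new ArrayList<>(keys);
        sorted.sort((a, b) -> Arrays.compareUnsigned(a, b));
        return sorted;
    }

    public static void main(String[] args) {
        List<byte[]> keys = Arrays.asList(
                encodeStringId("b"), encodeStringId("a"), encodeStringId("ab"));
        List<String> decoded = new ArrayList<>();
        for (byte[] key : sortRowKeys(keys)) {
            decoded.add(new String(key, StandardCharsets.UTF_8));
        }
        System.out.println(decoded); // prints [a, ab, b]
    }
}
```

In a Spark job the same comparison would typically be applied per partition before the HFile writer is invoked; how this PR arranges that is not shown here.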