Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Please clarify some documentation here in the Readme #6

Open
alexmc6 opened this issue Mar 7, 2013 · 2 comments
Open

Please clarify some documentation here in the Readme #6

alexmc6 opened this issue Mar 7, 2013 · 2 comments

Comments

@alexmc6
Copy link

alexmc6 commented Mar 7, 2013

I see from the Elastic Search website at
http://www.elasticsearch.org/guide/reference/modules/gateway/hadoop.html

that "The hadoop gateway is deprecated and will be removed in a future version. Please use the local gateway instead."

Can you explain here the pros and cons of using this elasticsearch-hadoop plugin for ES storage? Is the author just saying that all remote gateways are deprecated and should not be used.

I have lots of data in Hadoop/HDFS which I would like to import into ES. Can you please explain whether or not this software will help with that? I assume I really need a Hadoop-River plugin - which doesn't exist yet.

Thanks

@kimchy
Copy link
Member

kimchy commented Mar 7, 2013

  1. yes, all shared gateways are deprecated.
  2. you need to run a map reduce job, or something similar, to index the data from Hadoop to elasticsearch. You can have a look at the wonderdog project to do it, we do hope to provide something that simplifies it even more in the future.

@alexmc6
Copy link
Author

alexmc6 commented Mar 7, 2013

Cheers. Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants