Fixes Spark REST fetcher for client mode applications #193

shkhrgpt · 2017-01-25T18:04:00Z

As mentioned in the issue, #175, the current REST API fetcher fails to parse client mode applications because there is no attemptId in REST response of ApplicationInfo.

In this change, we are modifying the fetcher to not consider attemptId in REST urls when attemptId is not returned by ApplicationInfo response, as it's mentioned in this Spark documentation.

With this change, the fetcher will be able to process both Client and Cluster mode applications.

@akshayrai can you please have a look.

Thank you.

…ations.

shkhrgpt · 2017-01-30T04:19:32Z

@rayortigas , @paulbramsen , @shankar37 can you please review this change. This is a critical and active issue.
Thank you.

shankar37

@rayortigas can you take a look as well ?

rayortigas · 2017-01-30T05:28:07Z

Code looks fine. Just wanted to add a couple of comments:

I'm well aware that the API can serve requests without an attempt ID. However, the old Spark fetcher didn't appear to support yarn-client, considering that it was hardcoding _1 when fetching the Spark event logs. See https://github.com/linkedin/dr-elephant/pull/162/files#diff-24b9ededac05bd3a6096311b696d0830L240 For my education, I'd like to know whether a) I introduced a regression and missed something, or b) if this also wasn't working before Rewrite Spark fetcher/heuristics. #162, and we only expected to support yarn-cluster mode.
I feel the corresponding unit test should be updated. It's probably way more code than the fix provided here, but it'll reinforce/document expectations about supporting both yarn-client and yarn-cluster modes, which the previous unit test didn't appear to do https://github.com/linkedin/dr-elephant/pull/162/files#diff-25f92aef2adc15b2196f1f12b5344549L39

shkhrgpt · 2017-01-30T07:52:44Z

@rayortigas: I am not very familiar with the old Spark fetcher, so I am not sure whether or not it supported yarn-client. Maybe @akshayrai or @shankar37 can provide some input here. We submit Spark applications both in cluster and client mode, and Dr Elephant wasn't able to fetch client mode applications. Later I also found an issue (#175), where other people are also trying to use Dr Elephant to analyze client mode applications and it's failing.

As per your second concern, I have added a unit test to test this change. Please take a look.

Thank you.

rayortigas · 2017-01-30T17:56:21Z

@shkhrgpt LGTM. Thanks for adding the unit test!

Yeah, the other question was primarily for @akshayrai and @shankar37.

shkhrgpt · 2017-02-07T04:06:58Z

@akshayrai and @shankar37 Would you like me to make any more changes in order to merge to this PR? There is still a critical and active issue which requires this fix.
Thank you.

akshayrai · 2017-02-07T04:51:26Z

+1 LGTM

* Removes pattern matching

shkhrgpt added 2 commits January 25, 2017 09:55

Fixes the issue where Spark fetcher fails to fetch client mode applic…

aebe878

…ations.

Removes pattern matching

0ba8ce1

shkhrgpt mentioned this pull request Jan 30, 2017

SparkRestClient.scala fetchData() throws java.lang.IllegalArgumentException for yarn-client Spark jobs #175

Closed

shankar37 approved these changes Jan 30, 2017

View reviewed changes

shkhrgpt added 2 commits January 29, 2017 23:10

Adds unit test

9aa7bd2

Adds unit test

f8c9310

akshayrai merged commit e93d431 into linkedin:master Feb 7, 2017

shkhrgpt mentioned this pull request Feb 20, 2017

Spark REST API has no attempt information #208

Closed

skakker pushed a commit to skakker/dr-elephant that referenced this pull request Dec 14, 2017

Fixes Spark REST fetcher for client mode applications (linkedin#193)

7020584

* Removes pattern matching

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixes Spark REST fetcher for client mode applications #193

Fixes Spark REST fetcher for client mode applications #193

shkhrgpt commented Jan 25, 2017

shkhrgpt commented Jan 30, 2017

shankar37 left a comment

rayortigas commented Jan 30, 2017

shkhrgpt commented Jan 30, 2017

rayortigas commented Jan 30, 2017

shkhrgpt commented Feb 7, 2017

akshayrai commented Feb 7, 2017

Fixes Spark REST fetcher for client mode applications #193

Fixes Spark REST fetcher for client mode applications #193

Conversation

shkhrgpt commented Jan 25, 2017

shkhrgpt commented Jan 30, 2017

shankar37 left a comment

Choose a reason for hiding this comment

rayortigas commented Jan 30, 2017

shkhrgpt commented Jan 30, 2017

rayortigas commented Jan 30, 2017

shkhrgpt commented Feb 7, 2017

akshayrai commented Feb 7, 2017