Skip to content

umich-arc/spark-yarn-shuffle-qa

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

spark-yarn-shuffle-qa

Should be submitted by id with access to sample twitter data

spark-submit --master yarn --queue arcts --num-executors 150 --executor-memory 5g --conf spark.shuffle.service.enabled=true --driver-memory 120g --conf spark.driver.maxResultSize=120g --deploy-mode cluster --executor-cores 4 spark-yarn-shuffle-qa.py

use yarn logs -applicationId application_1628522746056_2297 > spark-shuf-big-1 grep -i shuf spark-shuf-big-1 to verify shuffle usage replace with your app id

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages