gunadhya/Spark-Streaming-Project

Spark-Streaming-Project

Outputs

Kafka Producer

Image of Yaktocat

Kafka Console Consumer

Image of Yaktocat

Progress

Image of Yaktocat Image of Yaktocat

Streaming UI

Image of Yaktocat

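  1. How did changing values on the SparkSession property parameters affect the throughput and latency of the data?
  • Changing these values changed processedRowsPerSecond, the throughput metric in the streaming progress report. A higher value means the job processes more rows per second.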
  2. What were the 2-3 most efficient SparkSession property key/value pairs? Through testing multiple variations on values, how can you tell these were the most optimal?
  • spark.default.parallelism

  • spark.streaming.kafka.maxRatePerPartition

    These were chosen by comparing processedRowsPerSecond across runs with different values: the settings that gave the highest rate were taken as the most efficient.
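
The comparison described above can be sketched in Python. This is only an illustration, not the project's actual code: the helper names (`build_session_conf`, `create_session`, `average_throughput`, `best_config`), the `local[*]` master, and the app name are all hypothetical. It also assumes progress reports arrive as dictionaries, which is how PySpark 3.x exposes them through `query.recentProgress`.

```python
from statistics import mean


def build_session_conf(parallelism, max_rate_per_partition):
    """Return the two SparkSession property key/value pairs under test."""
    return {
        "spark.default.parallelism": str(parallelism),
        "spark.streaming.kafka.maxRatePerPartition": str(max_rate_per_partition),
    }


def create_session(conf, app_name="tuning-sketch"):
    """Build a SparkSession with the given properties (hypothetical helper)."""
    # Imported lazily so the pure-Python helpers below work without Spark.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.master("local[*]").appName(app_name)
    for key, value in conf.items():
        builder = builder.config(key, value)
    return builder.getOrCreate()


def average_throughput(progress_reports):
    """Mean processedRowsPerSecond over a list of progress dictionaries."""
    rates = [
        report["processedRowsPerSecond"]
        for report in progress_reports
        if report.get("processedRowsPerSecond") is not None
    ]
    return mean(rates) if rates else 0.0


def best_config(results):
    """Pick the label whose runs had the highest average throughput.

    `results` maps a label for each tested configuration to the list of
    progress dictionaries collected while that configuration was running.
    """
    return max(results, key=lambda label: average_throughput(results[label]))
```

One way to use it: start the streaming query once per configuration, let it run for several batches, store `query.recentProgress` under a label for that configuration, then call `best_config` on the collected results.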

Reference: https://knowledge.udacity.com/questions/123410
