
[SPARK-49941][CORE] Rename errorClass to condition in errors of the JSON format #48431

Open
wants to merge 3 commits into base: master
Conversation

MaxGekk (Member) commented Oct 12, 2024

What changes were proposed in this pull request?

In the PR, I propose to rename the errorClass field to condition in errors in the JSON formats MINIMAL and STANDARD.

For example:

{
  "condition" : "DIVIDE_BY_ZERO",
  "sqlState" : "22012",
  "messageParameters" : { "config" : "CONFIG"}
}
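To illustrate what the rename means for downstream consumers of these JSON payloads, here is a small sketch (hypothetical, not part of the PR; the actual Spark code is Scala) that reads an error in the new format and falls back to the legacy key for output produced before this change:

```python
import json

def parse_error(payload: str) -> dict:
    """Parse a Spark error JSON payload, accepting both the new
    "condition" key and the legacy "errorClass" key (pre-SPARK-49941)."""
    error = json.loads(payload)
    # Prefer the new key; fall back to the old one for older outputs.
    condition = error.get("condition", error.get("errorClass"))
    return {
        "condition": condition,
        "sqlState": error.get("sqlState"),
        "messageParameters": error.get("messageParameters", {}),
    }

new_style = '{"condition": "DIVIDE_BY_ZERO", "sqlState": "22012", "messageParameters": {"config": "CONFIG"}}'
old_style = '{"errorClass": "DIVIDE_BY_ZERO", "sqlState": "22012"}'
print(parse_error(new_style)["condition"])  # DIVIDE_BY_ZERO
print(parse_error(old_style)["condition"])  # DIVIDE_BY_ZERO
```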

Why are the changes needed?

To follow the new naming convention introduced by #44902.

Does this PR introduce any user-facing change?

Yes. Error outputs in the MINIMAL and STANDARD JSON formats now contain the field "condition" instead of "errorClass".

How was this patch tested?

By running the affected tests:

$ build/sbt "sql/testOnly org.apache.spark.sql.SQLQueryTestSuite"

Was this patch authored or co-authored using generative AI tooling?

No.

MaxGekk changed the title from "[WIP] Rename errorClass to condition in errors of the JSON format" to "[SPARK-49941][CORE] Rename errorClass to condition in errors of the JSON format" on Oct 12, 2024
MaxGekk marked this pull request as ready for review on October 12, 2024 at 19:19
MaxGekk (Member, Author) commented Oct 12, 2024

@srielau @panbingkun @nchammas @cloud-fan could you review the PR, please?

panbingkun (Contributor)

LGTM.
nit: Do we need to update the following?
[screenshot omitted]

MaxGekk (Member, Author) commented Oct 14, 2024

@panbingkun Thank you for the review.

  1. UIUtils: the name errorClass here is a regexp group name, not related to these changes.
  2. SQLJsonProtocolSuite: it is just an example in a test; it can handle the old input.

panbingkun (Contributor) commented Oct 15, 2024

  1. UIUtils

Yeah!

  • Yes, it is indeed the group name of a regular expression. It is not related to these changes, and calling it errorClass will not affect the final functionality.
    (PS: As a follow-up, I'm not sure if we need to rename it to condition to reduce misunderstanding, because from the UT it seems that its purpose is to obtain the value of the condition (in the past, it was called errorClass):
    val e1 = "Job aborted due to stage failure: Task 0 in stage 1.0 failed 1 times, most recent failure: Lost task 0.0 in stage 1.0 (TID 1) (10.221.98.22 executor driver): org.apache.spark.SparkArithmeticException: [DIVIDE_BY_ZERO] Division by zero. Use `try_divide` to tolerate divisor being 0 and return NULL instead. If necessary set \"spark.sql.ansi.enabled\" to \"false\" to bypass this error.\n== SQL (line 1, position 8) ==\nselect a/b from src\n ^^^\n\n\tat org.apache.spark.sql.errors.QueryExecutionErrors$.divideByZeroError(QueryExecutionErrors.scala:226)\n\tat org.apache.spark.sql.errors.QueryExecutionErrors.divideByZeroError(QueryExecutionErrors.scala)\n\tat org.apache.spark.sql.catalyst.expressions.GeneratedClass$GeneratedIteratorForCodegenStage1.processNext(generated.java:54)\n\tat org.apache.spark.sql.execution.BufferedRowIterator.hasNext(BufferedRowIterator.java:43)\n\tat org.apache.spark.sql.execution.WholeStageCodegenEvaluatorFactory$WholeStageCodegenPartitionEvaluator$$anon$1.hasNext(WholeStageCodegenEvaluatorFactory.scala:43)\n\tat org.apache.spark.sql.execution.SparkPlan.$anonfun$getByteArrayRdd$1(SparkPlan.scala:388)\n\tat org.apache.spark.rdd.RDD.$anonfun$mapPartitionsInternal$2(RDD.scala:890)\n\tat org.apache.spark.rdd.RDD.$anonfun$mapPartitionsInternal$2$adapted(RDD.scala:890)\n\tat org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)\n\tat org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:364)\n\tat org.apache.spark.rdd.RDD.iterator(RDD.scala:328)\n\tat org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:93)\n\tat org.apache.spark.TaskContext.runTaskWithListeners(TaskContext.scala:161)\n\tat org.apache.spark.scheduler.Task.run(Task.scala:141)\n\tat org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$4(Executor.scala:592)\n\tat org.apache.spark.util.Utils$.tryWithSafeFinally(Utils.scala:1474)\n\tat org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:595)\n\tat 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)\n\tat java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)\n\tat java.lang.Thread.run(Thread.java:750)\n\nDriver stacktrace:"
    val cell1 = UIUtils.errorMessageCell(e1)
    assert(cell1 === <td>{"DIVIDE_BY_ZERO"}{UIUtils.detailsUINode(isMultiline = true, e1)}</td>)

    (Because after this PR, I believe that in the error log there should only be condition and no errorClass.)
  • Okay.
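The group-name question above can be illustrated with a short sketch. The pattern below is a hypothetical simplification of what is being discussed (the real UIUtils code is Scala and its pattern may differ): a named group called errorClass captures the bracketed condition from an exception message, and renaming the group to condition would change only the name used to retrieve the match, not the matching behavior.

```python
import re

# Hypothetical, simplified pattern: a named group ("errorClass") captures the
# bracketed error condition at the start of an exception message. Only the
# group NAME would change if it were renamed to "condition".
ERROR_CONDITION_RE = re.compile(r"\[(?P<errorClass>[A-Z][A-Z_]*)\]")

def extract_condition(message: str):
    # Return the bracketed condition, or None if the message has no such tag.
    m = ERROR_CONDITION_RE.search(message)
    return m.group("errorClass") if m else None

msg = "org.apache.spark.SparkArithmeticException: [DIVIDE_BY_ZERO] Division by zero."
print(extract_condition(msg))  # DIVIDE_BY_ZERO
```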

nchammas (Contributor) left a comment


LGTM, but is there a summary of why we are going with condition rather than errorCondition?

I remember seeing a discussion somewhere about this, but it's not on the ticket associated with this PR.
