Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-51166 Prepare Apache Spark 4.1.0
  3. SPARK-49386

Add memory based thresholds for shuffle spill

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 4.1.0
    • 4.1.0
    • Spark Core, SQL

    Description

      We can only determine the number of spills by configuringĀ spark.shuffle.spill.numElementsForceSpillThreshold. In some scenarios, the size of a row may be very large in the memory.

      Attachments

        Issue Links

          Activity

            People

              dzcxzl dzcxzl
              dzcxzl dzcxzl
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: