-
- Downloads
[SPARK-20148][SQL] Extend the file commit API to allow subscribing to task commit messages
## What changes were proposed in this pull request? The internal FileCommitProtocol interface returns all task commit messages in bulk to the implementation when a job finishes. However, it is sometimes useful to access those messages before the job completes, so that the driver gets incremental progress updates before the job finishes. This adds an `onTaskCommit` listener to the internal api. ## How was this patch tested? Unit tests. cc rxin Author: Eric Liang <ekl@databricks.com> Closes #17475 from ericl/file-commit-api-ext.
Showing
- core/src/main/scala/org/apache/spark/internal/io/FileCommitProtocol.scala 7 additions, 0 deletions...ala/org/apache/spark/internal/io/FileCommitProtocol.scala
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileFormatWriter.scala 16 additions, 6 deletions...he/spark/sql/execution/datasources/FileFormatWriter.scala
- sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala 30 additions, 1 deletion...rg/apache/spark/sql/test/DataFrameReaderWriterSuite.scala
Please register or sign in to comment