[SPARK-12870][SQL] better format bucket id in file name
For a normal Parquet file without buckets, the file name ends with a jobUUID, which may consist entirely of digits and be mistakenly regarded as a bucket id. This PR improves the format of the bucket id in the file name by using a different separator, `_`, so that the regex is more robust.

Author: Wenchen Fan <wenchen@databricks.com>

Closes #10799 from cloud-fan/fix-bucket.
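To illustrate the ambiguity described above, here is a minimal sketch, not the actual code from `bucket.scala`. The object name, both regexes, the five-digit padding, and the sample file names are assumptions made for illustration only. It compares a hypothetical `-`-separated pattern with a `_`-separated one.

```scala
// Illustrative sketch only: names, regexes, and padding width are assumptions,
// not the contents of bucket.scala.
object BucketFileNames {
  // Hypothetical old-style pattern: bucket id follows the last `-`.
  private val HyphenSeparated = """.*-(\d+)(?:\..*)?$""".r

  // Hypothetical new-style pattern: bucket id follows the last `_`,
  // optionally followed by a file extension starting with `.`.
  private val UnderscoreSeparated = """.*_(\d+)(?:\..*)?$""".r

  // Formats a bucket id as a file name suffix (assumed zero-padded to 5 digits).
  def bucketIdToString(id: Int): String = f"_$id%05d"

  // Extracts the bucket id using the `-` separator (ambiguous for numeric job ids).
  def legacyGetBucketId(fileName: String): Option[Int] = fileName match {
    case HyphenSeparated(id) => Some(id.toInt)
    case _ => None
  }

  // Extracts the bucket id using the `_` separator.
  def getBucketId(fileName: String): Option[Int] = fileName match {
    case UnderscoreSeparated(id) => Some(id.toInt)
    case _ => None
  }
}
```

With a made-up unbucketed name such as `part-r-00000-12345678.parquet`, `legacyGetBucketId` would report a bucket id taken from the trailing job identifier, while `getBucketId` returns `None` because the name contains no `_`. A made-up bucketed name such as `part-r-00000-2dd664f9-d2c4-4ffe-878f-c6c70c1fb0cb_00003.gz.parquet` yields `Some(3)`.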
Showing
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/bucket.scala (10 additions, 4 deletions)
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/json/JSONRelation.scala (1 addition, 1 deletion)
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetRelation.scala (1 addition, 1 deletion)
- sql/hive/src/main/scala/org/apache/spark/sql/hive/orc/OrcRelation.scala (1 addition, 1 deletion)