-
Notifications
You must be signed in to change notification settings - Fork 29k
Pull requests: apache/spark
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[SPARK-52764][ML][CONNECT][TESTS] Restore
test_parity_classification and test_parity_regression
CONNECT
ML
PYTHON
#53504
opened Dec 17, 2025 by
zhengruifeng
Loading…
[SPARK-54730][SQL][CONNECT] Delay failure of dataframe column resolution
SQL
#53503
opened Dec 17, 2025 by
zhengruifeng
Loading…
[SPARK-54729][CORE] Proactively replicate shuffle data to the FallbackStorage
CORE
KUBERNETES
#53502
opened Dec 17, 2025 by
EnricoMi
Loading…
[WIP][CONNECT] Read ChunkedLocalRelation from BlockManager
CONNECT
SQL
#53497
opened Dec 17, 2025 by
hvanhovell
•
Draft
[SPARK-54725][SQL] Add inferring transitive join conditions in CostBasedJoinReorder
SQL
#53496
opened Dec 17, 2025 by
juntaozhang
Loading…
[SPARK-54724][INFRA] Hook worker exception handler to dump report before sending exception
PYTHON
#53495
opened Dec 17, 2025 by
gaogaotiantian
Loading…
[SPARK-54723][PYTHON][TESTS] Add Test for @udf Usage on Arrow Grouped Aggregate Iter UDF
PYTHON
SQL
#53494
opened Dec 17, 2025 by
Yicong-Huang
Loading…
[SPARK-54722][PYTHON][SQL] Register Pandas Grouped Iter Aggregate UDF for SQL usage
PYTHON
SQL
#53493
opened Dec 17, 2025 by
Yicong-Huang
Loading…
[SPARK-54703][PYTHON] Consolidate SQL_GROUPED_AGG_ARROW_ITER_UDF and SQL_GROUPED_AGG_PANDAS_ITER_UDF mapper logic
CORE
PYTHON
#53492
opened Dec 17, 2025 by
Yicong-Huang
Loading…
[SPARK-54721][SQL] Completely disable feature 'nested struct coercion' for MERGE INTO
SQL
#53490
opened Dec 16, 2025 by
szehon-ho
Loading…
[SPARK-54720][SQL] Add SparkSession.emptyDataFrame with a schema
SQL
#53489
opened Dec 16, 2025 by
hvanhovell
Loading…
[SPARK-54713][SQL] Add vector similarity/distance function expressions support
SQL
#53481
opened Dec 16, 2025 by
zhidongqu-db
Loading…
[SPARK-54696][CONNECT] Clean-up Arrow Buffers - follow-up
CONNECT
SQL
#53480
opened Dec 16, 2025 by
hvanhovell
Loading…
[SPARK-46166][PS] Implementation of pandas.DataFrame.any with axis=None
PANDAS API ON SPARK
PYTHON
#53478
opened Dec 15, 2025 by
devin-petersohn
Loading…
[SPARK-54711] Add a timeout for daemon created worker connection
CORE
PYTHON
#53476
opened Dec 15, 2025 by
gaogaotiantian
Loading…
[SPARK-54698][SQL] Support hashing for all data types for array set like operations
SQL
#53468
opened Dec 13, 2025 by
Kimahriman
Loading…
[MINOR][PYTHON] Fix
_create_converter and covert overload signature
PYTHON
SQL
#53467
opened Dec 13, 2025 by
gaogaotiantian
Loading…
[SPARK-54450][INFRA][FOLLOWUP] Support unittest style string in run-tests
DOCS
PYTHON
#53465
opened Dec 13, 2025 by
gaogaotiantian
Loading…
[SPARK-54701] Improve the runnerConf chain for Python workers
CORE
PYTHON
SQL
STRUCTURED STREAMING
#53462
opened Dec 12, 2025 by
gaogaotiantian
Loading…
[SPARK-54700][SQL][WIP] Quote constraint name and columns for dsv2 constraints
SQL
#53460
opened Dec 12, 2025 by
yhuang-db
Loading…
[SPARK-54443][SS] Integrate PartitionKeyExtractor in Re-partition reader
SQL
STRUCTURED STREAMING
#53459
opened Dec 12, 2025 by
zifeif2
Loading…
[SPARK-54586][SQL] Validate UTF-8 when casting Binary to String
SQL
#53458
opened Dec 12, 2025 by
qlong
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.