
Reducing complexity of implementation in order to be able to add Atlas text search token based pagination #1046

Open

kbuma wants to merge 13 commits into materialsproject:main from kbuma:search-pagination

Conversation

kbuma (Collaborator) commented Dec 19, 2025

Summary

Major changes:

  • remove parallelization of server requests
  • re-implement handling of list criteria for parameters in endpoints that do not accept lists (this was tied into the parallelization code previously)
  • re-implement handling of list criteria that is too large for a single request (this was also tied into the parallelization code previously; a sketch of the serial batching fallback follows this list)
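A minimal sketch of what that serial batching fallback can look like. The names here (`split_values`, `base_criteria`, `material_ids`, `_submit_request`) are illustrative assumptions, not the actual mp_api client internals:

```python
# Hypothetical sketch of serially batched requests replacing the old
# parallel fan-out; all names below are illustrative, not the real client.
batch_size = 100  # assumed cap balancing URL length vs. request count
results = []
for start in range(0, len(split_values), batch_size):
    batch = split_values[start : start + batch_size]
    # Re-join the batch into the comma-separated form the endpoint expects
    criteria = {**base_criteria, "material_ids": ",".join(batch)}
    results.extend(_submit_request(criteria))
```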

tsmathis (Collaborator) left a comment:

Not really much to say on my end; I am curious, though, about the performance/execution time of this implementation vs. the parallel approach.

```python
    r -= 1
except MPRestError as e:
    # If we get a 422 or 414 error, or 0 results for comma-separated params,
    # split into batches
    if "422" in str(e) or "414" in str(e) or "Got 0 results" in str(e):
```
A collaborator suggested simplifying the condition:

```python
any(trace in str(e) for trace in ("422", "414", "Got 0 results"))
```
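Equivalently, the check could live in a small helper so the `except` clause stays short. This is a sketch of the suggestion, not the merged code, and `_should_batch` is a hypothetical name:

```python
def _should_batch(e: Exception) -> bool:
    """True when the error indicates the request should be split into batches."""
    return any(trace in str(e) for trace in ("422", "414", "Got 0 results"))
```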

kbuma (Author) replied:

addressed with 18b0b79

```python
]
# Batch the split values to reduce number of requests
# Use batches of up to 100 values to balance URL length and request count
batch_size = min(100, max(1, len(split_values) // 10))
```
A collaborator commented:

Should the batch size be chosen according to the limits we (may) impose on a Query? Or, alternatively, should there be a check on the length of a batch after fixing the batch size? That way, excessively long queries get rejected (e.g., if I query for 1M task IDs, 100 batches would still give me an overly long list of task IDs).
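A sketch of the kind of up-front guard being suggested here. Neither the cap value nor the names (`MAX_CRITERIA_VALUES`, `make_batches`) come from the PR; they are assumptions for illustration:

```python
MAX_CRITERIA_VALUES = 10_000  # hypothetical cap on total values per query

def make_batches(split_values: list[str], batch_size: int) -> list[list[str]]:
    """Split values into batches, rejecting excessively long queries up front."""
    if len(split_values) > MAX_CRITERIA_VALUES:
        raise ValueError(
            f"Query lists {len(split_values)} values; "
            f"at most {MAX_CRITERIA_VALUES} are allowed."
        )
    return [
        split_values[i : i + batch_size]
        for i in range(0, len(split_values), batch_size)
    ]
```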

kbuma (Author) replied:

That would change the existing user interface; I'm trying to avoid doing that as part of this refactor.

kbuma (Author) commented Feb 6, 2026

@esoteric-ephemera @tsmathis I've addressed all outstanding comments. Ready for re-review.

A comment from tsmathis was marked as resolved.

tsmathis (Collaborator) commented Feb 7, 2026

Nothing else to add from my end

kbuma (Author) commented Feb 7, 2026

> still a couple of .extend(...)s
>
> this one should be fine, it's just the final step, right? https://github.com/kbuma/api/blob/858accf6b00d90055eb21d642e0ac4eabf8ba921/mp_api/client/core/client.py#L732
>
> this one is in a while loop: https://github.com/kbuma/api/blob/858accf6b00d90055eb21d642e0ac4eabf8ba921/mp_api/client/core/client.py#L806

@tsmathis addressed with fc25c1e
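For context on the while-loop concern, a hypothetical sketch of draining a token-paginated endpoint by collecting pages and flattening once at the end, rather than calling `.extend` on every iteration. `fetch_page` and the token shape are assumptions, not the actual fc25c1e change:

```python
from itertools import chain

def fetch_all(fetch_page):
    """Drain a token-paginated endpoint; `fetch_page` is a hypothetical
    callable returning (list_of_docs, next_token_or_None)."""
    pages = []
    token = None
    while True:
        docs, token = fetch_page(token)
        pages.append(docs)
        if token is None:
            break
    # Flatten once at the end instead of extending inside the loop
    return list(chain.from_iterable(pages))
```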

