Cache requires-python checks & skip debug logging #13128

ichard26 · 2024-12-26T18:18:48Z

The requires-python check is pretty fast, but when performed for 10000 links, the checks consume a nontrival amount of time. For example, while installing (pre-cached + --dry-run) a pared down list of homeassistant dependencies (117 in total), link evaluation took 15% of the total runtime, with check_requires_python() accounting for half (7.5%) of that.

The cache can be kept pretty small as requires-python specifiers often repeat, and when they do change, it's often in chunks (or between entirely different packages). For example, setuptools has like 1500 links, but only ~30 different requires-python specifiers.

In addition, _log_skipped_link() is a hot method and unfortunately expensive as it hashes the link on every call. Fortunately, we can return early when debug logging is not enabled. In the same homeassistant run, this saves 0.7% of the runtime.

Command: python -m cProfile -o profile.pstats -m pip install -r temp/homeassistant/requirements.txt --dry-run

Before

After

The Link.from_json() classmethod is surprisingly expensive. I'll take a look at speeding that up next.

The `requires-python` check is pretty fast, but when performed for 10000 links, the checks consume a nontrival amount of time. For example, while installing (pre-cached + --dry-run) a pared down list of homeassistant dependencies (117 in total), link evaluation took 15% of the total runtime, with check_requires_python() accounting for half (7.5%) of that. The cache can be kept pretty small as `requires-python` specifiers often repeat, and when they do change, it's often in chunks (or between entirely different packages). For example, setuptools has like 1500 links, but only ~30 different `requires-python` specifiers. In addition, _log_skipped_link() is a hot method and unfortunately expensive as it hashes the link on every call. Fortunately, we can return early when debug logging is not enabled. In the same homeassistant run, this saves 0.7% of the runtime.

ichard26 · 2024-12-26T18:21:27Z

I noticed while writing a janky demo of installing build dependencies in-process that setuptools was still taking forever to download and install. It turns out setuptools' index page is so huge that requires-python evaluation is pretty slow, taking ~100 ms on my reasonably fast machine. (For reference, the HTTP request took ~500ms.)

Command: python -m cProfile -o profile.pstats -m pip install setuptools --ignore-installed

ichard26 added the type: performance Commands take too long to run label Dec 26, 2024

psf-chronographer bot added the bot:chronographer:provided label Dec 26, 2024

ichard26 changed the title ~~perf: cache requires-python checks & skip debug logging~~ Cache requires-python checks & skip debug logging Dec 26, 2024

ichard26 mentioned this pull request Dec 27, 2024

perf: Avoid unnecessary URL processing while parsing links #13132

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache requires-python checks & skip debug logging #13128

Cache requires-python checks & skip debug logging #13128

ichard26 commented Dec 26, 2024

ichard26 commented Dec 26, 2024 •

edited

Loading

Cache requires-python checks & skip debug logging #13128

Are you sure you want to change the base?

Cache requires-python checks & skip debug logging #13128

Conversation

ichard26 commented Dec 26, 2024

ichard26 commented Dec 26, 2024 • edited Loading

ichard26 commented Dec 26, 2024 •

edited

Loading