chore: ci3 post-merge fixes #10825

ludamad · 2024-12-17T20:25:58Z

No description provided.

The test `can simulate public txs while building a block` would, in some cases, build blocks with varying numbers of txs. An example run built blocks with 4, 12, and 12 txs. This totals 28 txs, so the remaining 2 txs never got mined and the test timed out. This fixes it by forcing the sequencer to build every block with exactly 4 txs, and making the total number of txs a multiple of 4.
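The arithmetic behind the failure can be sketched as follows. This is only an illustration: the helper names are hypothetical, the total of 30 txs is inferred from "28 mined, 2 remaining", and the rounded-up total of 32 is an example value rather than the number the test actually uses.

```typescript
// Hypothetical helpers; only the block sizes 4, 12, 12 and the 2 leftover txs
// come from the description above.

/** Number of txs still pending after blocks of the given sizes were mined. */
function pendingAfter(totalTxs: number, blockSizes: number[]): number {
  const mined = blockSizes.reduce((sum, n) => sum + n, 0);
  return totalTxs - mined;
}

/** Rounds a tx count up to the nearest multiple of the fixed block size. */
function roundUpToMultiple(txCount: number, txsPerBlock: number): number {
  return Math.ceil(txCount / txsPerBlock) * txsPerBlock;
}

// Failing run: 4 + 12 + 12 = 28 mined out of an inferred 30, so 2 are stranded.
const leftover = pendingAfter(30, [4, 12, 12]);

// With every block fixed at 4 txs and a total that is a multiple of 4,
// the last block is always full and nothing is left behind.
const txsPerBlock = 4;
const fixedTotal = roundUpToMultiple(30, txsPerBlock);
const fixedBlocks = new Array<number>(fixedTotal / txsPerBlock).fill(txsPerBlock);
const leftoverAfterFix = pendingAfter(fixedTotal, fixedBlocks);
```

Here `leftover` is the 2 stranded txs from the example run, while `leftoverAfterFix` is zero because the total divides evenly into fixed-size blocks.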

Resolves #10371

We now require the *average* of the increase in proven chain across a namespace to be 0 for an *hour* to trigger a slack alert.
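One way to read this condition is sketched below. It is a model of the rule, not the actual alert definition: the `Sample` shape, the function names, and the assumption that each sample carries one increase value per instance in the namespace are all hypothetical.

```typescript
// Hypothetical data shape: at each scrape time, one "increase in proven chain"
// value per instance in the namespace.
interface Sample {
  timestampMs: number;
  increases: number[];
}

const ONE_HOUR_MS = 60 * 60 * 1000;

/** Mean of the values; NaN for an empty list, which never counts as "zero". */
function average(values: number[]): number {
  return values.reduce((a, b) => a + b, 0) / values.length;
}

/**
 * Alert only if we have observed at least a full window of history and the
 * namespace-wide average increase was 0 at every sample inside that window.
 */
function shouldAlert(samples: Sample[], nowMs: number, windowMs: number = ONE_HOUR_MS): boolean {
  const windowStart = nowMs - windowMs;
  const hasFullWindow = samples.some((s) => s.timestampMs <= windowStart);
  const inWindow = samples.filter((s) => s.timestampMs >= windowStart && s.timestampMs <= nowMs);
  if (!hasFullWindow || inWindow.length === 0) return false;
  return inWindow.every((s) => average(s.increases) === 0);
}

// Example: samples every 10 minutes over the last 70 minutes.
const MINUTE_MS = 60 * 1000;
const now = 120 * MINUTE_MS;
const stalled: Sample[] = [];
for (let t = now - 70 * MINUTE_MS; t <= now; t += 10 * MINUTE_MS) {
  stalled.push({ timestampMs: t, increases: [0, 0, 0] });
}
// Same history, but one instance advanced in the most recent sample.
const oneProgressing: Sample[] = stalled.map((s, i) =>
  i === stalled.length - 1 ? { timestampMs: s.timestampMs, increases: [0, 3, 0] } : s,
);
// Only 30 minutes of history: not enough to cover the hour.
const shortHistory: Sample[] = stalled.filter((s) => s.timestampMs >= now - 30 * MINUTE_MS);

const alertStalled = shouldAlert(stalled, now);
const alertProgressing = shouldAlert(oneProgressing, now);
const alertShort = shouldAlert(shortHistory, now);
```

Averaging across the namespace means a single stalled instance does not fire the alert on its own, and the one-hour window filters out short pauses.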

Resolves #10033

Resolves #10374

This PR does the following:
- Witgen handles out-of-gas errors for all opcodes
- all halts (return/revert/exceptional) work as follows:
  - charge gas for the problematic instruction as always, adding a row to the gas trace
  - pop the parent/caller's latest gas from the stack
  - call a helper function on the gas trace to mutate that most recent gas row, returning to the parent's latest gas minus any consumed gas (all gas consumed on exceptional halt)
- `GasTraceEntry` includes a field `is_halt_or_first_row_in_nested_call` which lets us break gas rules on a halt or when starting a nested call because in both cases gas will jump.
- `constrain_gas` returns a bool `out_of_gas` so that opcode implementations can handle out of gas
- `write_to_memory` now has an option to skip the "jump back to correct pc" which was problematic when halting because the `jump` wouldn't result in a next row with the right pc

Explanation on how gas works for calls:
- Parent snapshots its gas right before a nested call in `ctx.*_gas_left`
- Nested call is given a `ctx.start_*_gas_left` and the gas trace is forced to that same value
- throughout the nested call, the gas trace operates normally, charging per instruction
- when any halt is encountered, the instruction that halted must have its gas charged normally, but then we call a helper function on the gas trace to mutate the most recent row, flagging it to eventually become a sort of "fake" row that skips some constraints
- the mutation of the halting row resets the gas to the parent's last gas before the call (minus however much gas was consumed by the nested call... if exceptional halt, that is _all_ allocated gas)

Follow-up work
- properly constrain gas for nested calls, returns, reverts and exceptional halts
- if `jump` exceptionally halts (i.e. out of gas), it should be okay that the next row doesn't have the target pc
- Handle the edge case when an error is encountered on return/revert/call, but after the stack has already been modified
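The gas flow described above can be modelled with a small sketch. This is not the witgen code: it tracks a single gas dimension, and apart from the `is_halt_or_first_row_in_nested_call` flag (written here as `isHaltOrFirstRowInNestedCall`) and the idea of `constrain_gas` returning an out-of-gas bool, every name and signature is hypothetical.

```typescript
interface GasTraceEntry {
  gasLeft: number; // gas remaining after this row
  isHaltOrFirstRowInNestedCall: boolean; // gas may "jump" on these rows
}

class GasTrace {
  readonly rows: GasTraceEntry[] = [];
  private current: number;
  private nextRowStartsNestedCall = false;

  constructor(initialGas: number) {
    this.current = initialGas;
  }

  get gasLeft(): number {
    return this.current;
  }

  /** Charges one instruction, adds a row, and reports whether we ran out of gas. */
  constrainGas(cost: number): boolean {
    const outOfGas = cost > this.current;
    this.current = outOfGas ? 0 : this.current - cost;
    this.rows.push({ gasLeft: this.current, isHaltOrFirstRowInNestedCall: this.nextRowStartsNestedCall });
    this.nextRowStartsNestedCall = false;
    return outOfGas;
  }

  /** Forces the trace to the gas allocated to the nested call. */
  enterNestedCall(allocatedGas: number): void {
    this.current = allocatedGas;
    this.nextRowStartsNestedCall = true;
  }

  /** Mutates the most recent row: parent's snapshot minus what the nested call consumed. */
  haltNestedCall(parentGasLeft: number, allocatedGas: number, exceptional: boolean): void {
    const last = this.rows[this.rows.length - 1];
    if (!last) throw new Error("cannot halt before any instruction was charged");
    // An exceptional halt consumes everything that was allocated.
    const consumed = exceptional ? allocatedGas : allocatedGas - this.current;
    this.current = parentGasLeft - consumed;
    last.gasLeft = this.current;
    last.isHaltOrFirstRowInNestedCall = true;
  }
}

// Normal return: parent has 990 left, nested call gets 300 and uses 70.
const trace = new GasTrace(1000);
trace.constrainGas(10);
const parentSnapshot = trace.gasLeft;
trace.enterNestedCall(300);
trace.constrainGas(50);
trace.constrainGas(20); // the halting instruction is charged like any other
trace.haltNestedCall(parentSnapshot, 300, false);
const afterReturn = trace.gasLeft;

// Exceptional halt: nested call gets 100 and runs out of gas immediately.
const trace2 = new GasTrace(1000);
const parentSnapshot2 = trace2.gasLeft;
trace2.enterNestedCall(100);
const outOfGas = trace2.constrainGas(150);
trace2.haltNestedCall(parentSnapshot2, 100, outOfGas);
const afterExceptionalHalt = trace2.gasLeft;
```

In the first run the parent resumes with its snapshot minus the 70 gas the nested call used; in the second it loses the full 100 it allocated. The flagged rows are exactly the ones where gas does not follow the usual "previous minus cost" rule.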

Attempts two fixes for the e2e epochs test. First, it increases the L1 block time, to account for general CI slowness. Second, it adds more retries to the L1 gas utils getTx, since the e2e epochs test works using the tx delayer, which artificially introduces a delay between a tx being sent and it being available in anvil, so it triggered a timeout in the utils.

**Update**: Increasing the retries caused the error to change: we were now getting a timeout in teardown. This happened because the sequencer got stuck waiting for the tx to be mined for way too long (more than 5 minutes, the afterAll hook timeout), and `node.stop()` waits for the current loops to end before returning.

But what's interesting is _why_ the sequencer got stuck waiting for its tx to be mined. The tx was being delayed by the tx-delayer, which intercepts txs, manually computes their tx hash to return it to the caller immediately, and holds on to the tx to submit it to anvil at a later point in time. What came up is that the tx hash we were manually computing over txs with blobs did not match the tx hash returned by anvil. This has been logged as #10824. However, we can sidestep this issue by just choosing a reasonable value for max attempts so teardown doesn't time out.

---------

Co-authored-by: ludamad <[email protected]>
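The "reasonable value for max attempts" idea can be sketched like this. Only the 5 minute afterAll timeout comes from the description; the polling interval, the safety margin, and both function names are hypothetical and do not reflect the actual L1 utils.

```typescript
/** Largest attempt count whose total sleep time stays within the budget (at least 1). */
function maxAttemptsWithin(budgetMs: number, intervalMs: number): number {
  return Math.max(1, Math.floor(budgetMs / intervalMs));
}

/**
 * Polls `fn` until it yields a value or the attempts run out.
 * Note: the bound covers sleep time only, not the time spent inside `fn`.
 */
async function retryUntil<T>(
  fn: () => Promise<T | undefined>,
  maxAttempts: number,
  intervalMs: number,
): Promise<T> {
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const result = await fn();
    if (result !== undefined) return result;
    if (attempt < maxAttempts) {
      await new Promise<void>((resolve) => setTimeout(resolve, intervalMs));
    }
  }
  throw new Error(`gave up after ${maxAttempts} attempts`);
}

const TEARDOWN_TIMEOUT_MS = 5 * 60 * 1000; // afterAll hook timeout from the description
const POLL_INTERVAL_MS = 1000; // hypothetical polling interval
const RETRY_BUDGET_MS = TEARDOWN_TIMEOUT_MS / 2; // hypothetical margin so loops end well before teardown
const maxAttempts = maxAttemptsWithin(RETRY_BUDGET_MS, POLL_INTERVAL_MS);
```

Because `node.stop()` waits for in-flight loops, any retry loop like `retryUntil` needs a bound that fits inside the teardown timeout; otherwise a tx that never appears turns into a teardown failure instead of a clear "gave up" error.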

Please read [contributing guidelines](CONTRIBUTING.md) and remove this line.

ludamad and others added 30 commits December 17, 2024 20:20

robustness

b76cf31

more sane build-pic prep

d0beea3

more sane build-pic prep

6493903

fix

074e1b1

fix

58cfc23

feat(p2p): attestation pool persistence (#10667)

dacef9f

better startup script

438ab54

generate

1564662

Yarn

2174e1f

fix

346c4f0

chore(avm): radix opcode - remove immediates (#10696)

4ac13e6

Resolves #10371

chore: average alerts across namespace for 1 hour (#10827)

962a7a2

We now require the *average* of the increase in proven chain across a namespace to be 0 for an *hour* to trigger a slack alert.

fix

d529df7

ensure-tester

6146775

files

147de9c

files

4d94837

S3_FORCE_UPLOAD

1a3ced8

ensure-builder fix

072bb7c

start

d740eaf

fix

0e70dfc

gogo

0499b10

fix

703d40a

fix

b51eb14

chore(docs): update migration notes (#10829)

be7cadf

Please read [contributing guidelines](CONTRIBUTING.md) and remove this line.

fix

22d63f4

fix

df2921c

fix

8f66e31

ludamad added 22 commits December 19, 2024 01:20

Update

7bd8e81

hash

0782b44

try it out

ed0085a

more conditionality

c63df90

edge ci

c168db6

refactor vm tests

686a46c

Docs selection of ci

981aa4a

dont make GH runner token mandatory

45f2435

simplify workflows

3f50921

Merge remote-tracking branch 'origin/master' into cl/ci3-fixes

4c8f79b

-

d99e789

update prover client fix

0a23c38

simplify docs

b5945d4

quieter network

52ed4ea

fix nargo

ef785f9

bring back earthly retries

b91a45f

sufferage

d583d8f

lets try again

8608c1c

flakefest

d2a0e8a

use ref name for arm concurrency

4a1d21d

lets try it again

0a5055a

woopsy

1b96ea3

ludamad marked this pull request as draft December 19, 2024 02:55

ludamad added 4 commits December 19, 2024 02:57

arm fixes

835730a

changes

c9461ef

fix

9462caa

fix bench e2e

11f57f3

ludamad marked this pull request as ready for review December 19, 2024 03:10

ludamad merged commit 0832045 into cl/ci3 Dec 19, 2024
34 of 35 checks passed

ludamad deleted the cl/ci3-fixes branch December 19, 2024 03:27
