CHIP 2024-12 OP_EVAL: Function Evaluation

Function Evaluation (OP_EVAL) would let Bitcoin Cash contracts be efficiently factored into reusable functions. Optimizes contract sizes for finite field arithmetic, pairing-based crypto, ZKP systems, homomorphic encryption, post-quantum crypto, and many other applications.

CHIP:

Some discussion:

https://x.com/bitjson/status/1867470838099791966

4 Likes

Please add OP_EXEC to evaluation of alternatives. The main difference is in being able to wall-off parts of parent execution context stack. Rationale on why we shouldn’t have some of this functionality in OP_EVAL would be good.

More details: Brainstorming OP_EVAL

Re. this:

In signature operations, the covered bytecode includes only the active bytecode beginning from the last executed code separator (A.K.A. pbegincodehash ).

This is actually causing problems for things like DSPs, because from signature preimage there’s no way to tell whether it is covering a full redeem (or eval) script or just a slice of it.

Maybe we could use the opportunity to fix it.

From Telegram:

bitcoincashautist, [10/27/24 10:45 AM]
btw, this little OP_CODESEPARATOR bastard makes it so that you need to add whole prevout TXs to DSPs if you want to support P2SH & bare multisig

src/script/interpreter.cpp · master · Bitcoin Cash Node / Bitcoin Cash Node · GitLab

bitcoincashautist, [10/27/24 10:47 AM]
because signature will cover only the part of redeem script after the codeseparator, and the signer can’t tell if some stuff (like opdrop and replacing key&sig with some dummy ones) was executed before

bitcoincashautist, [10/27/24 10:47 AM]
in the signature preimage there’s no indication whether it’s a full redeem script, or sliced due to codeseparator

bitcoincashautist, [10/27/24 10:48 AM]
we could fix this by adding 1 byte in case codeseparator was used

3 Likes

Cool to see this new CHIP! :smiley:

I know the idea of OP_EVAL is really old, dating back from 2011, when it was mainly thought about as something for MAST-like constructions or emulating P2SH. In the motivation you describe it as a way to allow for re-usable function, so the rational for an OP_EVAL opcode changed significantly over time.

That’s why I would also really like to see a comparison/ evaluation of alternatives to understand how it affects the specific opcode implementation.

I think it would be great to also see some simple/minimal “Usage Examples” where just the concept of re-usable functions are demonstrated.

I also posted this thought elsewhere, about how executing user-provided script could lead to a situation where a user might be able to use eval to circumvent the rest of the contract execution:

i know eval in javascript is a big security concern, I wonder how this relates to the blockchain context.
A user should never be able to push data which is then evaluated as script if this can affect the rest of the execution of the program

3 Likes

If eval-script can modify the parent stack then allowing a non-authenticated script to be provided by spender would make the UTXO anyonecanspend.

Consider this trivial example:

OP_IF
    OP_DUP OP_SHA256 <hash_of_function> OP_EQUALVERIFY OP_EVAL
OP_ELSE
    OP_EVAL
OP_ENDIF

The 1st branch can only run the specific eval-script that matches the hash. The 2nd branch can run anything - and if the anything is read from the input script then spender can just provide OP_1 or OP_DROP OP_1 or whatever will make the contract pass.

But what if we wanted to make some covenant where users can specify by themselves how to secure their part of the contract? We’d need a way to wall off the outer context so eval-script has strict limitation on before/after stack state, which is what the alternative, OP_EXEC, would enable.

1 Like

Thanks! Yes, I need to add more supporting material, just wanted to get the technical proposals out as soon as possible so people can start experimenting with it. The draft implementation is also live in Bitauth IDE:

Fibonacci numbers via both OP_BEGIN/OP_UNTIL and OP_EVAL recursion, for anyone interested in experimenting:

“Playing with Fibonacci numbers” - Bitauth IDE

2 Likes

Collecting some other discussion:

https://x.com/bitjson/status/1867614179622084958

[…] it’s useful to clarify here that there are two reasonable ways to implement [OP_EVAL].

Most software will choose to just recursively call the script evaluation function again, but more constrained systems (esp. hardware-implemented Forths) might do what Libauth (intentionally) does: copy the current execution info to a new control stack frame, then swap the currently evaluating instructions in place (resetting ip/pc to 0). When you get to the end of instructions, check the control stack for more work to resume until it’s empty (or contract rejected). (Btw, if you squint, this is like tail-call elimination.)

And a comparison to BIP 12’s OP_EVAL:

https://x.com/bitjson/status/1867611387046113636

Looking at https://en.bitcoin.it/wiki/BIP_0012:

  • Doesn’t save stack frames to the control stack (their proper location in a Forth implementation if you want to maximize implementability e.g. hardware) + limited depth to 2: seriously hampers the overall usefulness by preventing deep re-use of functions. In general, we want functions to be able to call as many sub-functions as they need until the work is done. For now our existing 100-depth control stack is probably plenty, but nothing prevents that from increasing if there’s demand and/or tail-call optimization (though note that loops are almost always more byte efficient in our bytecode, so it’s hard to justify effort or complexity spent on tail-call optimization today).

  • We’ve preserved the usefulness of OP_CODESEPARATORs across past upgrades, this did not (and of course, this didn’t know about OP_ACTIVEBYTECODE); OP_EVAL CHIP also avoids some traps here where signature-covered bytecode includes more than the top evaluation (it’s useful that the function behaves the same regardless of where it’s called; sig checks in a particular function should pass given the same signature across any control stack configuration)

  • IIRC, this cleared the alternate stack? Anti-feature, would waste bytes across functions which need the extra communication bandwidth (you’d just be removing one of two stacks by which code can “call” functions, forcing all code to stack juggling rather than letting subroutines accept e.g. two separate arrays in whatever cases where it’s optimal).

Again, we have tons of new information and clearer constraints today that make these choices easy. It would be unreasonable to expect any design developed before 2024 (VM limits) to have strong answers to these questions.

3 Likes

Some discussion of sandboxing behavior and NEXA’s OP_EXEC:

3 Likes

Regarding OP_EXEC vs OP_EVAL… it all depends on how they end up being used.

OP_EVAL - less “safe” but less overhead; seems like it would be most useful as an internal low-level tool for cashscript or other compilers to use to re-use code… i.e. to call internal functions they 100% control/have generated/have guaranteed known pre and post-conditions.

OP_EXEC - more safe, more overhead; seems like a more generic solution to the problem where both trusted code and “untrusted” code can be exec’d and the caller has guarantees about pre- and post-conditions, since they are enforced. I.e. you know the callee cannot mess with your stack, so you can jump into “untrusted” code, e.g. from a cashtoken commitment or the scriptsig stack or other random places.

So in short – if you envision OP_EVAL only being used for trusted code, ie code that you can verify is what it says it is either because you yourself generated it (compiler), or because you check its hash beforehand (p2sh style), then it’s fine. Less overhead. Lean and mean. But if you ever think it may be used to execute arbitrary contract code – then it’s a landmine waiting to go off.

OP_EXEC enforces a call signature and pre- and post-conditions. Called code cannot touch your stack. All it can do is push results onto your stack but cannot otherwise mess with it. Much safer.

My personal preference is for OP_EXEC. But, I get the efficienty-at-the-expense-of-genericity arguments of OP_EVAL.

Anyway in my ideal world we would have both OP_EVAL and OP_EXEC. This way nobody can make fun of us for being primitive apes.

However if you don’t envision anybody ever needed something more well-defined and secure like OP_EXEC… ever… then OP_EVAL only is fine.

2 Likes

Something missing from the spec: It’s important that an eval’d/exec’d/whatever function not ever end on an imbalanced conditional. Script should fail if the conceptual conditional stack local to the function has any items on it, basically…

The final spec needs to mention that since that could be a source of woe if left unspecified.

Also related/not related: do we need to resurrect OP_RETURN for breaking out of a called function early? (Implicitly also then popping the conditional stack local to that function to 0 as a side-effect)…? Or not worth bothering?

2 Likes

Can we please avoid the Waterfall model - Wikipedia development method and actually try the different approaches on some testnet and actual usecases before we get any proposals for consensus rules?

This should start with a fork of bchn and a lot of throw-away scripting code.

(for the non-programmers here, the waterfall model is widely regarded as leading to failure).

You can literally open ide.bitauth.com right now (pick BCH 2026 VM in top-right corner drop-down) and try a script like:

OP_DUP OP_DUP
OP_SHA256 <0x4ae81572f06e1b88fd5ced7a1a000945432e83e1551e6f721ee9c00b8cc33260> OP_EQUAL OP_SWAP
OP_SHA256 <0x060459fe25666da7ddf2ba22fae9f9f507f7659b8c12875a234b320519242868> OP_EQUAL
OP_BOOLOR OP_VERIFY OP_EVAL

To spend, you can provide eval script 0x51 (OP_1) or eval script 0x007551 (OP_0 OP_DROP OP_1) as the input’s pushed data.
It can be any contract hidden behind each of the hashes. Could be some P2PKH for the 1st hash, and 2nd one would have some secret-until-used recovery path.

If combined with loops CHIP you can do full MAST with locking (or redeem) script size only 53 bytes.

// UNLOCKING SCRIPT:

<0x6344d1334be4f53b993030810f63f0452e9c3c569c40acfad81202a360f478a7>
<0> // swap or not
<0x8de0b3c47f112c59745f717a626932264c422a7563954872e237b223af4ad643>
<1> // swap or not
<2> // depth
<0x52> // leaf

// LOCKING SCRIPT:

// store a copy of leaf
OP_DUP OP_TOALTSTACK
// hash the leaf
OP_SHA256 OP_SWAP

// process merkle path and obtain root
OP_BEGIN
OP_1SUB OP_TOALTSTACK
// --
OP_SWAP OP_IF OP_SWAP OP_ENDIF OP_CAT OP_SHA256
// --
OP_FROMALTSTACK OP_IFDUP OP_NOT
OP_UNTIL

// verify root
<0x2fb1740d8e932861b16c08c5d3a8771adad3c78df60569acb0628c185233b99f>
OP_EQUALVERIFY

// execute leaf
OP_FROMALTSTACK OP_EVAL

The leaves in this example are 0x00, 0x51, 0x52, 0x53.

The alternative, OP_EXEC, exists on mainnet of BU’s chain.

2 Likes

that’s just one way of solving the problem.
There are a buch of other approaches that makes sense and actual developers writing actual bitcoin-scripts should try them. Compare them, get a feel for them. Stuff that any language designer, or indeed library author, will do. You don’t settle on the first one without testing the various other approaches too.

Bitauth is a great playground, yet, I don’t think it can emulate a single transaction with a lot of outputs and introspection to, for instance, have a single output that has a bunch of methods reused by the others.

“jmp” or “eval” or “exec” (or loops) are in essence a sysmic shift in the way that the vm can be used. Nobody will understanding by just writing a spec, they need to write tons of small apps and be able to be creative in the setup based on this experience.

Python 2 → 3 is a great example of them failing to have done that in v2 before committing to something. And the process of maturing that hurt them for 15 years.

It’s a full TX validation and ScriptVM engine, consensus-compatible with actual nodes, it can validate any transaction (and if using current consensus spec: generate it and have it be accepted by the network).

With a little more code, Libauth would become a full Javascript node.

3 Likes

Hi all,

I wrote a long evaluation comparing OP_EVAL to an optimized version of OP_EXEC here:

https://x.com/bitjson/status/1877821412897120387

Can anyone identify any use cases for the stack-isolating behavior of Nexa’s OP_EXEC?

Fully-formed product ideas not needed – even contrived examples to demonstrate an advantage would be great.

Even if we steelman OP_EXEC to be less wasteful per-invocation vs. OP_EVAL (e.g. by accepting the “function signature” in some encoding that is pre-concatenated with the evaluated bytecode, wasting those bytes only once per function definition), I’m having trouble devising any scenarios where the function signature adds value:

  1. The resulting transactions require more bytes than OP_EVAL.
  2. Defining a per-contract, bytecode-based “OP_EXEC API” is complex, error prone, and incompatible with existing contract systems. E.g. for an existing multisig wallet, each multisig signer has to be taught to understand and sign for the new OP_EXEC-based input type rather than simply signing their known input type in new transactions.
  3. Stack isolation adds no security value. When you compare an end-to-end example OP_EXEC-based delegation scheme vs. constructions that we already have today (sibling inputs, introspection, CashTokens, etc.), OP_EXEC systems have more potential security pitfalls with no bandwidth savings.

Some ideas I’ve reviewed:

Contract compression via reusable functions

Transaction compilers should be capable of minimizing any contract by identifying and factoring sequences of bytecode into functions, pushing them to the stack at optimal times and OP_PICK OP_EVAL-ing as needed.

This is the “common case”. Nearly all complex contracts can save at least a few bytes by factoring reused sequences of bytecode into functions. For contracts implementing finite field arithmetic, zero-knowledge proofs, post-quantum cryptography, and other non-trivial computations, these savings are significant enough to enable many currently-impractical applications – cutting KBs or MBs from total transaction sizes.

Reviewing OP_EXEC:

Stack isolation here adds no security and significantly increases compiler and contract complexity. Existing OP_EXEC-like proposals would waste several bytes per function invocation, but the steelman-ed OP_EXEC above could reduce that to only a few bytes per function definition. Depending on contract complexity, this may still result in hundreds or thousands of wasted bytes per transaction.

MAST-like constructions (“Merklized Alternative Script Tree”)

The UTXO commits to the hash of one or more spending paths which get revealed and authenticated at spending time. This has already been possible on Bitcoin Cash since 2023, but OP_EVAL or OP_EXEC would save bytes by avoiding the need for a sibling input.

For less-interactive setups, hashes can be either directly referenced or committed within a merkle tree.

For interactive setups, participants can prepare a data signature over each allowed spending path using an aggregated Schnorr key, eliminating the need for unpacking a particular hash from a merkle tree. Spending then requires only one of the pre-signed scripts and any unlocking material (variable length) + aggregated signature (65 bytes) + aggregated key (33 bytes). Collaborative spending from the address is also “included”, costing no further bytes (like with BTC’s Taproot).

Reviewing OP_EXEC:

Stack isolation offers no additional security for these cases and wastes a few bytes per definition/invocation depending on precise semantics.

Post-funding assigned instructions

In these cases, OP_EVAL or OP_EXEC could be used to avoid committing to all spending paths prior to the UTXO being funded; the contract instead commits to some other method for authenticating later-assigned instructions. For example, a spending path in the contract could accept instructions authorized by some key, a large deposit, a held CashToken, etc.

Note the overlap between these cases and “Mast-like constructions” above. However, rather than aiming to reduce the on-chain footprint of contracts, these cases aim to delegate control over the spending path to whatever mechanism authorizes the new instructions.

Reviewing OP_EXEC:

Again, stack isolation operates at the wrong level to be useful here. If the instruction authorizer is malicious, they can likely lock up funds by committing the contract to unspendable instructions (e.g. OP_RETURN); if the instruction authorizer wants to take the funds, they can always commit the contract to instructions allowing spends from a key they hold. In fact, I’m skeptical that there are any use cases in this category which aren’t either 1) security theater (the authorizer is just a custodian), or 2) more efficient to do with just CashTokens (in cases of delegation to decentralized contract systems).

User-provided pure functions

This is my attempted steelman of an OP_EXEC use case: a decentralized application author wants to allow some user(s) to provide a pure function (in VM bytecode) which accepts some raw input from the contract and computes some result(s) which are used by the contract. To make this plausible, we need a reason to accept a function rather than precomputed results, i.e. the contract needs to 1) save and run the function later using yet-unknown inputs or 2) prove that the same pure function was faithfully performed against multiple sets of inputs.

Contrived example: a decentralized exchange protocol allows users to submit new market making algorithms, and anyone can deposit assets or trade with any market maker.

Ignoring all the data, timing, and incentive questions – can we design a scenario in which OP_EXEC at least saves some bytes vs. OP_EVAL by taking advantage of the “built-in” isolation?

Presumably this is the kind of situation made “safer” by isolating an evaluation: some users are risking funds in contracts which execute arbitrary code provided by different users. If the market maker function has some secret exploit, the function author can exploit it to drain the market maker of other people’s money.

Reviewing OP_EXEC:

Once again, stack isolation is protecting the wrong thing – policing stack usage neither prevents the function from including an underhanded exploit nor can it prevent the function from surprise-bricking the contract (<funds> <100_BCH> OP_GREATERTHAN OP_IF OP_RETURN OP_ENDIF).

Fundamentally, the stack usage of an evaluation is simply not relevant to contract security. If the function uses too many items or produces too few, the rest of the contract will fail and the attempted transaction will be invalid (and if the contract can be frozen by a failing function, we’re doomed with or without an isolated stack). If the stack contains something we don’t want the pure function to modify, we need only rearrange our contract to push it later (or validate it later, if the data comes from unlocking bytecode).

So: stack isolation remains useless while wasting several bytes per function definition or invocation. If it mattered, contracts could easily prevent segments of bytecode from manipulating the stack in unexpected ways, but again, the validation that really matters in this scenario would have to look at the actual contents of the pure function. In reality, the end user('s wallet) is ultimately responsible for verifying the safety and security of the contract they’re using.

In general, I’m very skeptical that user-provided pure functions are the optimal construction for any use case. If a contract system requires on-chain configurability, it’s almost certainly more efficient to “build up” state by expressing the configuration as one or more parameters for fixed contract instructions.

And at a higher level, it’s almost certainly even more efficient to simply use smaller, purpose-built contracts rather than configure (on-chain) a one-size-fits-all contract. E.g. because Jedex (Joint-Execution Decentralized Exchange) is carefully broken into efficient, single-purpose contracts, almost all Jedex inputs are smaller than standard single-signature inputs (P2PKH).


To summarize:

As of now, I’m skeptical that OP_EXEC has any plausible advantages vs. OP_EVAL, and OP_EXEC has serious disadvantages in protocol complexity, contract complexity, and overall transaction sizes.

Please let me know of any other use cases I should review, and please leave a comment if I can answer any questions.

5 Likes

Copying snippet from Brainstorming OP_EVAL - #10 by bitjson :

OP_EVAL vs. word definition

Yes! I definitely need to include a section comparing the OP_EVAL CHIP with “proper” Forth-like word definition (also called OP_DEFINE/OP_UNDEFINE/OP_INVOKE in old Libauth branches).

As you pointed out, a full “word definition” upgrade proposal is quite a bit more involved: how and where we track definitions, any necessary limits for those new data structures, whether or not a word can be undefined (we only have OP_0 to OP_16 + maybe OP_1NEGATE!), what makes a valid identifier (only numbers? any single byte? multi-byte?), should we include Forth OP_FETCH/OP_STORE corollaries for data (some discussion of OP_EVAL vs. a TX-wide “data annex” here), and probably many more details.

Fortunately, we have a great argument for avoiding this bikeshed altogether: we can easily prove that OP_EVAL is the optimal construction for single-use evaluations (as you mentioned). Even if a “word definition” upgrade were hammered out and activated in the future, OP_EVAL would still be the most efficient option for many constructions. (This coincidentally was the same argument that made P2SH32 a strong proposal vs. OP_EVAL – even with OP_EVAL, P2SH32 remains the more byte-efficient construction for those use cases.)

As BCH VM bytecode is a concatenative language, a perfectly-optimizing compiler is quite likely to produce single-use evaluations from common pieces of different words/functions, even if the contract author didn’t deliberately design something requiring OP_EVAL (e.g. MAST-like constructions).

So:

  • OP_EVAL is feature-equivalent to word definition (each enables all the same use cases)
  • OP_EVAL typically requires 1 extra byte per invocation, but sometimes saves 1 byte vs. OP_INVOKE.
    • OP_EVAL is 3 bytes (<index> OP_PICK OP_EVAL) for many calls, but some will be optimized to only 1 byte (just OP_EVAL)
    • OP_INVOKE is always 2 bytes (<identifer> OP_INVOKE).
  • OP_EVAL always saves 1 byte per definition by avoiding the OP_DEFINE.
  • OP_EVAL remains optimal for some uses even if a future upgrade added word definition (as a 1-byte optimization for some function calls).
4 Likes

Jason, I don’t disagree with going OP_EVAL and ignoring OP_EXEC but I am having trouble agreeing with this claim:

I don’t think this is true. One can imagine a scenario where it adds security – if you exec “untrusted” code, the stack isolation, etc, provides perfect security. The untrusted code cannot muck with any state you are tracking at all. All it can do is receive parameters and return 1 or more result(s).

This adds security.

Thus, I think the statement “OP_EXEC stack isolation adds no security” is a false statement. It does add security, demonstrably.

Whether or not that security is of any value to imagined use-cases is another matter. But security it does add, even if the added security is value-less, inconsequential, useless, etc (TBD).

It’s like saying: “Passing-out and sleeping inside a locked, bulletproof tank versus on a park bench adds no security… because I don’t anticipate ever being robbed.”

No, the tank is more secure than you sprawled on a park bench. Whether or not that security makes any difference to you is another matter… but demonstrably one is more secure than the other, even if the additional provided security leads to identical outcomes as the less secure situation in practice…

3 Likes

Here’s an example:

{some trusted code} OP_2DUP <untrusted_code> <2> <1> OP_EXEC OP_VERIFY {some trusted code}

If the sequence between 2 blocks of trusted code passes, from point-of-view of outer code it will just be a NOP which can’t affect stack state around it. If it fails, the whole script fails.

With OP_EVAL you can’t allow the untrusted code in any place because it could mess up the stack and break the code around it, so it could only be ran as last operation in the whole script, and some VERIFY opcode must precede it: {some trusted code ending with OP_VERIFY} <untrusted_code> OP_EVAL

2 Likes

Yep, decent example.

@bitjson Just to be clear – I’m all in favor of simple OP_EVAL! I am not advocating for OP_EXEC! I am perfectly happy with OP_EVAL. I am not a contract author and I have no idea what authors need. To me, the usecase of a compiler optimizing common code into OP_EVAL bits makes tons of sense… even for that reason alone it’s fine by me.

3 Likes

This whole discussion has gotten weird.

People are comparing two ideas that neither is perfect. Notice that here on bchr we don’t actually have the actual proposals in detail so the “it uses less bytes” is a bit hard to follow and likely closes the discussion to a lot of readers.

But there is no point in comparing two, we can make 20 new proposals based on what it is that we actually want. There is no reason to limit us to just those two.

And, as I pointed out in another thread: Brainstorming OP_EVAL - #14 by tom, the comparison is mostly based on false premises. There are not just TWO ways of doing things that pick things like stack protection. Stack protection is an ingredient that can be applied to either propoosal under discussion today.

The real question is not about picking between two options, the real question is to find a good way that uses the various ingredients we actually want.

As far as I can tell, the ingredients you can mix and match are:

  • Stack-protection
  • Requiring a method declaration and then calling it based on a method-index (two opcodes).
  • Being able to put methods in another output on the same transaction.
  • Being able to get the method-code from stack, which implies being able to execute code that is untrusted.
  • Being able to write self-modifying code.

Personally, I’d add stack protection as that is virtually free. It stops script writers doing really nasty things that breaks good programming assumtions.

Having the idea of a method declaration is neat as that saves bytes in calling, as your indexes are always 1 byte. Additionally it provides compiler safety because your compiler can ensure that the method you call has the right number of parameters (see linked post).

Being able to put methods in another output fits very nicely in the rest of our scripting design and allows neat things like being able to do verification of the methods-holding-output by simply hashing it and comparing the hash, all in script.

Being able to call code that at any point is stored on the stack, however, sounds to me like something you really really want to avoid. I mean, we introduced p2sh-32, so maybe it is not sane to introduce a no-checking way to execute a script that can be pushed in the unlocking script. Now, I know you can also check the hash of that, but if you’re supposed to always include that check, then that warrents its own opcode. An op-mast, if you will.
The problem with executing code that was not known at the time of mining the locking script is that this is money that can be unlocked by people coming up with a script to unlock it. That may be neat, but it doesn’t make for good money.

Using the stack to store your code you eval means you can write self-modifying code. This likewise has no place in modern programming strategies. Absolutely double that when it comes to money.

Last reason why storing your code on-stack makes no sense is that stack operations are defined to cost a price to operate on. This will be activated in May. As such either the usage of eval will cost double (once for pushing, once for executing it), or it needs some sort of exception. Which makes the design no longer coherent as a whole.

Further OP_EVAL critisisms;

  • the chip as it is on github right now writes: “the evaluation may modify the stack and alternate stack without limitation”.
    This means you could dirty things like writing a method that does 5 pops.
    This idea goes against 60 years of computer science concept and learning, and I don’t like it. It gives the contract writer not just a gun to shoot its own foot, it likely will leave a crater.

  • " ii. The OP_CODESEPARATOR operation records the index of the current instruction pointer (A.K.A. pc ) within the OP_EVAL -ed bytecode."
    I read that as a way to skip all code from the jumped-to-position to the first codeseparator and start evaluating there.
    This will be useful if you want to write self-modifying code. Which is not something that belongs in money.

I don’t like op-eval. It makes hacky code writing the norm, it provides lots of ways to do extremely dirty things which will make the The Fourth Annual Obfuscated Perl Contest Results - The Perl Journal, Fall 1999 look like easy to understand code.

OP_EXEC misunderstands contract security and usage

I’ll have to mention OP_EXEC in the CHIP’s evaluation of alternatives, and I don’t want there to be any doubt that the rationale treats it fairly.

I’ve tried my best to steelman OP_EXEC, but as of now, my analysis is:

  • OP_EXEC misunderstands contract security and usage. It only seems to make sense if you haven’t tried to use it.
  • OP_EXEC has zero advantages and significant disadvantages vs. OP_EVAL.

If anyone disagrees, it should be easy to provide a counterexample: a contract with an exploit that OP_EXEC could prevent.

The reasoning trap

WARNING: experienced programmers will almost certainly be misled about OP_EVAL and OP_EXEC by their intuitions from other languages/environments.

This is downstream of 1) our unusual environment – the whole transaction is the potentially-abusive input, not just the hypothetical “code” being consumed by OP_EVAL/OP_EXEC (and in another sense, the VM is itself a “sandbox”: malicious code can’t consume excessive resources or take control of the node), and 2) the concatenative programming paradigm. (It’s a deep rabbit hole, see Thinking Forth and maybe 1% the code.)

(@cculianu and @bitcoincashautist please forgive me for nit-picking parts of your posts to explain myself now, I appreciate you guys commenting here. :pray:)

This is the reasoning trap: we’re used to the concept of “untrusted code” elsewhere, so we assume that VM bytecode has a direct analogy. Skip some steps, and stack isolation looks like a solution to some plausible contract security problem – no need for further review.

I’m arguing that 100% of these imagined scenarios actually rely on internal contradictions which can be revealed by fleshing out the example.

  1. Stack “isolation” is superfluous – any contract where it seems to prevent an exploit has other equivalent exploits not prevented by the isolation.
  2. You can already get that “stack isolation” behavior – for zero bytes – today, simply by fixing your code (or, more likely, using a good compiler). If you think you need OP_EXEC to get that effect, your contract is wasting bytes. In fact, a linter could automatically optimize contracts by removing instances of OP_EXEC (replacing them with OP_EVAL and/or compiling them away altogether).

I hope to convince everyone to either:

  1. Try it yourself and prove me wrong by offering a single counterexample, or
  2. acknowledge that stack isolation is an anti-feature, not some imagined security-for-efficiency tradeoff.

“Untrusted” means nothing without context

(Again, thanks for your comments here @bitcoincashautist. I get that you already prefer OP_EVAL and you’re just trying to objectively review OP_EXEC; please forgive me for nit-picking your comment. :pray:)

This code snippet includes OP_EXEC, but it’s missing all the context we need to review. In fact, you even mention that the whole OP_EXEC portion is unnecessary in your description: “from point-of-view of outer code it will just be a NOP”.

Please correct me if you meant to say something else – doesn’t this sentence imply that the computation’s result can simply be pushed as data instead of using OP_EXEC?

My claim about OP_EXEC

“Stack isolation” is a reasoning trap. If you think it might add security value, your mental model is omitting critical context.

If stack isolation offers any security value at all, it should be easy for someone to refute this with a single example. No hand-waving away the context though: that’s where the logical errors are hiding.

For anyone who still cares to defend “stack isolation” as an idea:

  • Can you fill in your “trusted code” and “untrusted code” blanks with a concrete example? Who is using this contract?
  • What potential exploits can be performed by the untrusted code – who loses money and how?

TL;DR

OP_EXEC is like requiring motorcycle helmets in a swimming pool. Given the context, “safer” is not a word that comes to mind.

2 Likes