BIP: 346
Layer: Consensus (soft fork)
Title: OP_TXHASH
Authors: Steven Roose <steven@roose.io>
Brandon Black <freedom@reardencode.com>
Status: Draft
Type: Specification
Assigned: 2024-04-24
License: BSD-3-Clause
Abstract
This BIP proposes a new opcode OP_TXHASH, to be activated as a change to the
semantics of OP_SUCCESS189 in tapscript contexts.
This opcode provides a generalized method for introspecting certain details of the spending transaction, which enables non-interactive enforcement of certain properties of the transaction spending a certain UTXO.
Together with an opcode like OP_CHECKSIGFROMSTACK, this opcode effectively
provides a fully generalized signature hash construction, fully supporting
all existing SIGHASH flags, the proposed sighash flags from
BIP-118 (SIGHASH_ANYPREVOUT) and many other new signature hash
combinations.
The constructions specified in this BIP also open up the way for other potential updates; see Motivation section for more details.
Specification
OP_TXHASH
OP_TXHASH redefines the OP_SUCCESS189 tapscript opcode (0xbd) as a soft
fork upgrade. This opcode is only active in tapscript contexts.
Note that OP_SUCCESS187 is used by BIP-443 (OP_CHECKCONTRACTVERIFY) and
OP_SUCCESS188 was used by BIP-345 (OP_VAULT) at the time of first draft
of this BIP, making OP_SUCCESS189 the next available opcode for this purpose.
It has the following semantics:
- There is at least one element on the stack, fail otherwise.
- The element is interpreted as the TxFieldSelector and is popped off the stack.
- The TxFieldSelector must be valid, fail otherwise.
- The 32-byte TxHash of the transaction at the current input index, calculated using the given TxFieldSelector is pushed onto the stack.
TxFieldSelector
The TxFieldSelector has the following encoding. We will give a brief conceptual summary, followed by a reference implementation of the CalculateTxHash function.
In the following specifications, the | operator is used for the bitwise OR
operation.
- There are two special cases for the TxFieldSelector:
-
the empty value, zero bytes long: it is set equal to
TXFS_SPECIAL_TEMPLATE, the de-facto default value which means everything except the prevouts and the prevout scriptPubkeys and amounts.Special case
TXFS_SPECIAL_TEMPLATEis 4 bytes long, as follows:- 1:
TXFS_VERSION | TXFS_LOCKTIME | TXFS_CURRENT_INPUT_IDX - 2:
TXFS_INPUTS_SEQUENCES | TXFS_INPUTS_SCRIPTSIGS | TXFS_OUTPUTS_ALL - 3:
TXFS_INOUT_NUMBER | TXFS_INOUT_SELECTION_ALL - 4:
TXFS_INOUT_NUMBER | TXFS_INOUT_SELECTION_ALL
- 1:
-
If the TxFieldSelector has exactly 1 byte, we use a short notation. It has its 8 bits assigned as follows, from lowest to highest:
- 2/1: Inputs
- 00:
TXFS_INOUT_SELECTION_NONE - 01:
TXFS_INOUT_SELECTION_CURRENT - 11:
TXFS_INOUT_SELECTION_ALL
- 00:
- 4/3: Outputs
- 00:
TXFS_INOUT_SELECTION_NONE - 01:
TXFS_INOUT_SELECTION_CURRENT - 11:
TXFS_INOUT_SELECTION_ALL
- 00:
- 5:
TXFS_INPUTS_PREVOUTS - 6:
TXFS_INPUTS_PREV_SCRIPTPUBKEYS | TXFS_INPUTS_PREV_VALUES - 7:
TXFS_CURRENT_INPUT_CONTROL_BLOCK | TXFS_CURRENT_INPUT_SPENTSCRIPT | TXFS_CURRENT_INPUT_LAST_CODESEPARATOR_POS - 8:
TXFS_CURRENT_INPUT_IDX
Additionally, it includes
TXFS_VERSION | TXFS_LOCKTIME | TXFS_CONTROL | TXFS_CURRENT_INPUT_TAPROOT_ANNEXas global fields andTXFS_INPUTS_SEQUENCES | TXFS_INPUTS_SCRIPTSIGS | TXFS_OUTPUTS_ALLas input and output fields.These 1-byte selections allow the TxFieldSelector to emulate current signature hashing modes and those defined in BIP-118:
- 2/1: Inputs
-
| BIP-341/118 sighash type | 1-byte TxFieldSelector |
|---|---|
ALL | 0b11111111 |
SINGLE | 0b11110111 |
NONE | 0b11110011 |
| `ALL | ANYONECANPAY` |
| `SINGLE | ANYONECANPAY` |
| `NONE | ANYONECANPAY` |
| `ALL | ANYPREVOUT` |
| `SINGLE | ANYPREVOUT` |
| `NONE | ANYPREVOUT` |
| `ALL | ANYPREVOUTANYSCRIPT` |
| `SINGLE | ANYPREVOUTANYSCRIPT` |
| `NONE | ANYPREVOUTANYSCRIPT` |
-
If the TxFieldSelector is longer than one byte, the first byte of the TxFieldSelector has its 8 bits assigned as follows, from lowest to highest:
- 1: version (
TXFS_VERSION) - 2: locktime (
TXFS_LOCKTIME) - 3: current input index (
TXFS_CURRENT_INPUT_IDX) - 4: current input control block (
TXFS_CURRENT_INPUT_CONTROL_BLOCK) - 5: current input spent script (
TXFS_CURRENT_INPUT_SPENTSCRIPT) - 6: current script last
OP_CODESEPARATORposition (or 0xffffffff) (TXFS_CURRENT_INPUT_LAST_CODESEPARATOR_POS) - 7: current input annex including prefix byte (or empty) (
TXFS_CURRENT_INPUT_TAPROOT_ANNEX) - 8:
TXFS_CONTROL(i.e. include TxFieldSelector into hash)
- 1: version (
-
The highest bit of the first byte (
TXFS_CONTROL), we will call the "control bit", and it can be used to control the behavior of the opcode. ForOP_TXHASH, the control bit is used to determine whether the TxFieldSelector itself has to be included in the resulting hash. (For potential other uses of the TxFieldSelector (like a hypotheticalOP_TX), this bit can be repurposed.) -
The second byte will be used to indicate fields from the inputs and outputs. The 8 bits are assigned the following variables, from lowest to highest:
-
Specifying which fields of the inputs will be selected:
- 1: prevouts (
TXFS_INPUTS_PREVOUTS) - 2: sequences (
TXFS_INPUTS_SEQUENCES) - 3: scriptSigs (
TXFS_INPUTS_SCRIPTSIGS) - 4: prevout scriptPubkeys (
TXFS_INPUTS_PREV_SCRIPTPUBKEYS) - 5: prevout values (
TXFS_INPUTS_PREV_VALUES) - 6: taproot annexes (
TXFS_INPUTS_TAPROOT_ANNEXES)
- 1: prevouts (
-
Specifying which fields of the outputs will be selected:
- 7: scriptPubkeys (
TXFS_OUTPUTS_SCRIPTPUBKEYS) - 8: values (
TXFS_OUTPUTS_VALUES)
- 7: scriptPubkeys (
-
-
We define as follows:
TXFS_ALL = TXFS_VERSION | TXFS_LOCKTIME | TXFS_CURRENT_INPUT_IDX | TXFS_CURRENT_INPUT_CONTROL_BLOCK | TXFS_CURRENT_INPUT_LAST_CODESEPARATOR_POS | TXFS_CONTROLTXFS_INPUTS_ALL = TXFS_INPUTS_PREVOUTS | TXFS_INPUTS_SEQUENCES | TXFS_INPUTS_SCRIPTSIGS | TXFS_INPUTS_PREV_SCRIPTPUBKEYS | TXFS_INPUTS_PREV_VALUES | TXFS_INPUTS_TAPROOT_ANNEXESTXFS_OUTPUTS_ALL = TXFS_OUTPUTS_SCRIPTPUBKEYS | TXFS_OUTPUTS_VALUES
-
For both inputs and then outputs, expect an additional byte as follows:
-
The highest bit (
TXFS_INOUT_NUMBER) indicates whether the "number of in-/outputs" should be committed to. -
For the remaining bits, there are three exceptional values:
- 0x00 (
TXFS_INOUT_SELECTION_NONE) means "no in/outputs" (hence only the number of them as0x80(TXFS_INOUT_NUMBER)). 0x40(TXFS_INOUT_SELECTION_CURRENT) means "select only the in/output of the current input index" (it is invalid when current index exceeds number of outputs).0x3f(TXFS_INOUT_SELECTION_ALL) means "select all in/outputs".
- 0x00 (
-
The second highest bit (
TXFS_INOUT_SELECTION_MODE) is the "specification mode":- Set to 0 it means "leading mode".
- Set to 1 it means "individual mode".
-
In "leading mode", the third highest bit (
TXFS_INOUT_LEADING_SIZE) is used to indicate the "index size", i.e. the number of bytes will be used to represent the number of in/output.- With "index size" set to 0, the remaining lowest 5 bits of the first byte will be interpreted as the number of leading in/outputs to select.
- With "index size" set to 1, the remaining lowest 5 bits of the first byte together with the 8 bits of the next byte will be interpreted as the number of leading in/outputs to select.
-
In "individual mode", the third highest bit (
TXFS_INOUT_INDIVIDUAL_MODE) indicates whether we are passing absolute indices (0) or indices relative to the current input (1), the remaining lowest 5 bits will be interpreted asn, the number of individual in/outputs follow.- In absolute mode (second highest bit is 0), for each of the
nindices, at least one extra byte is expected.- If that byte's highest bit is set to 0, the remaining 7 bits represent the absolute index to select.
- If that byte's highest bit is set to 1, the remaining 7 bits, together with the next byte's 8 bits represent the absolute index to select.
- In relative mode (second highest bit is 1), for each of the
nindices, at least one extra byte is expected.- If that byte's highest bit is set to 0, the remaining 7 bits represent the relative index in two's complement.
- If that byte's highest bit is set to 1, the remaining 7 bits, together with the next byte's 8 bits represent the relative index in two's complement.
- In absolute mode (second highest bit is 0), for each of the
-
Effectively, this allows a user to select
- all in/outputs
- the current input index
- the leading in/outputs up to 8,191
- up to 32 individually selected in/outputs ** using absolute indices up to 32,767 ** using indices relative to the current input index from -16382 to +16383.
TxFieldSelector malleability
It is possible to represent the same selected data using multiple different
TxFieldSelectors. For this reason, users are advised to always set the
TXFS_CONTROL field flag that commits to the TxFieldSelector that was used
to get the hash.
Visualization
- first byte
1 1 1 1 1 1 1 1
| | | | | | | ^ version
| | | | | | ^ locktime
| | | | | ^ current input index
| | | | ^ current input control block
| | | ^ current input spent script
| | ^ current script last OP_CODESEPARATOR
| ^ current input taproot annex
^ control bit (ie. include TXFS in hash)
- second byte
<-> outputs
| | <---------> inputs
1 1 1 1 1 1 1 1
| | | | | | | ^ prevouts
| | | | | | ^ sequences
| | | | | ^ scriptSigs
| | | | ^ prevout scriptPubkeys
| | | ^ prevout values
| | ^ taproot annexes
| ^ scriptPubkeys
^ values
- in/output selector byte
"leading 3 in/outputs"
1 0 0 0 0 0 1 1
| | | <-------> integer 0b00011 == 3
| | ^ index size: single byte
| ^ leading mode
^ commit the number of in/outputs
"leading 257 in/outputs"
1 0 1 0 0 0 0 1 0 0 0 0 0 0 0 1
| | | <------------------------> integer 0b00001 00000001 == 257
| | ^ index size 1: two bytes
| ^ leading mode
^ commit the number of in/outputs
"indices 1 and 3"
0 1 0 0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 1
| | | | <--------------> second idx: 3
| | | | <-------------> first idx: 1
| | | | <-----> selection count: 0b0010 == 2 indices
| | | ^ index size: single byte per index
| | ^ absolute index
| ^ individual mode
^ don't commit the number of in/outputs
- total example
ff ff c2 01 03 83
| | ^ commit number of outputs + leading 3 outputs
| | <------> commit number of inputs + inputs at indices 1 and 3
| ^ all input and output fields
^ all regular fields
Resource limits
Using the same validation budget ("sigops budget") introduced in BIP-0342,
each TransactionHash decreases the validation budget by 25. If this brings the
budget below zero, the script fails immediately.
The following
considerations should be made:
- All fields that can be of arbitrary size are cacheable as TransactionHash always hashes their hashed values.
- In "individual mode", a user can at most commit 32 inputs or outputs, which we don't consider excessive for potential repeated use.
- In "leading mode", a caching strategy can be used where the SHA256 context is stored every N in/outputs so that multiple executions of the TransactionHash function can use the caches and only have to hash an additional N-1 items at most.
Motivation
This BIP specifies a basic transaction introspection primitive that is useful to either reduce interactivity in multi-user protocols or to enforce some basic constraints on transactions.
Additionally, the constructions specified in this BIP can lay the groundwork for some potential future upgrades:
- The TxFieldSelector construction would work well with a hypothetical opcode
OP_TXthat allows for directly introspecting the transaction by putting the fields selected on the stack instead of hashing them together. - The TransactionHash obtained by
OP_TXHASHcan be combined withOP_CHECKSIGFROMSTACK(see BIP-348) to effectively create an incredibly flexible signature hash, which would enable constructions likeSIGHASH_ANYPREVOUT. - The TransactionHash obtained by
OP_TXHASHcan be introduced as a native sighash calculation in a future segwit upgrade, so that signatures using the TransactionHash as their sighash can be used in keyspend context.
Comparing with some alternative proposals
-
This proposal strictly generalizes BIP-119's
OP_CHECKTEMPLATEVERIFY, as the default mode of our TxFieldSelector is semantically the same (though not byte-for-byte identical) as whatOP_CTVaccomplishes, without costing any additional bytes. Additionally, usingOP_TXHASHallows for more flexibility which can help in the case for- enabling adding fees to a transaction without breaking a multi-tx protocol;
- multi-user protocols where users are only concerned about their own inputs and outputs.
-
Constructions like
OP_IN_OUT_VALUEused withOP_EQUALVERIFYcan be emulated by twoOP_TXHASHinstances by using the TxFieldSelector to select a single input value first and a single output value second and enforcing equality on the hashes. Neither of these alternatives can be used to enforce small value differences without the availability of 64-bit arithmetic in Script. -
Like mentioned above,
SIGHASH_ANYPREVOUTcan be emulated usingOP_TXHASHwhen combined withOP_CHECKSIGFROMSTACK:<txfs> OP_TXHASH <pubkey> OP_CHECKSIGFROMSTACKeffectively emulatesSIGHASH_ANYPREVOUT.
Reference Implementation
A reference implementation in Rust is provided attached as part of this BIP together with a JSON file of test vectors generated using the reference implementation.
Design Considerations
This specification in in Draft and there is definitely still room for feedback and improvements. Some considerations that were made but could be revisited:
- The
0b10in/output selector for the shorthand is unused. Could possibly be filled in with "current + next", "current + previous" or any other semantics. - The individual index selection semantics allow for absolute indices up to ~32k and relative ones up to +/- ~16k, which is probably excessive. When removing the second byte there would reduce that to just 256 and +/- 128 respectively.
- Similar to
OP_TEMPLATEHASH(BIP number to be assigned, PR), we could not supportscriptSigsanymore and removeTXFS_INPUTS_SCRIPTSIGS. This field could then possibly be repurposed.
Backwards Compatibility
OP_TXHASH replaces OP_SUCCESS189. The SUCCESS opcodes were
introduced in taproot (BIP-342) to support changing the semantics of opcodes in
ways that do allow the new semantics to change the stack. For this reason,
OP_TXHASH only works in tapscript context. Since it is overriding a SUCCESS
opcode, any older version of the software will always accept any script that
uses the opcode, while the new versions of the software will validate the
scripts according to the semantics outlined in this BIP. As such, this is also a
soft fork change.
Interactions with other BIPs
This proposal interacts with several other BIPs:
-
BIP-118 (SIGHASH_ANYPREVOUT):
OP_TXHASHcan be combined withOP_CHECKSIGFROMSTACK(BIP-348) to emulateSIGHASH_ANYPREVOUTand other signature hash modes defined in BIP-118. The 1-byte TxFieldSelector format explicitly supports these modes. -
BIP-119 (OP_CHECKTEMPLATEVERIFY):
OP_TXHASHwith the empty TxFieldSelector produces a hash semantically equivalent to BIP-119'sOP_CHECKTEMPLATEVERIFY, making this proposal a generalization of BIP-119. -
BIP-347 (OP_CAT): When combined with
OP_CAT,OP_TXHASHenables powerful transaction introspection capabilities. The bit encoding format is designed to be explicit about endianness to ensure correct interaction with concatenation operations. -
BIP-348 (OP_CHECKSIGFROMSTACK): Together with
OP_CHECKSIGFROMSTACK,OP_TXHASHprovides a fully generalized signature hash construction, enabling flexible covenant designs and multi-user protocols.
Implementation
A reference implementation is included as part of the BIP, see here. This implementation focusses on clarity and correctness, not on efficiency. A rudimentary set of test vectors is also generated from this implementation and included here.
Furthermore, following other implementation attempts exist:
- A proposed implementation for Bitcoin Core is available here: https://github.com/bitcoin/bitcoin/pull/29050
- A proposed implementation for rust-bitcoin is available here: https://github.com/rust-bitcoin/rust-bitcoin/pull/2275
NOTE: These implementations are slightly outdated as they were made for an earlier version of this specification. Updates are in progress.
Both of the above implementations perform effective caching to avoid potential denial-of-service attack vectors.
Deployment
This BIP can be deployed using a BIP 9 VersionBits deployment. The specific strategy and bit assignment are left unspecified and can later be amended to the BIP depending on community preference.
Acknowledgement
Credit for this proposal mostly goes to Jeremy Rubin for his work on BIP-119's
OP_CHECKTEMPLATEVERIFY and to Russell O'Connor for the original idea of
generalizing OP_CHECKTEMPLATEVERIFY into OP_TXHASH.
Additional thanks to Andrew Poelstra, Greg Sanders, Rearden Code, Rusty Russell and others for their feedback on the specification.
Copyright
This document is licensed under the 3-clause BSD license.