|
| 1 | +--- |
| 2 | +eip: 8051 |
| 3 | +title: Precompile for ML-DSA signature verification |
| 4 | +description: Precompiled contracts for FIPS-204 ML-DSA signature verification and an EVM-optimized variant |
| 5 | +author: Renaud Dubois (@rdubois-crypto), Simon Masson (@simonmasson) |
| 6 | +discussions-to: https://ethereum-magicians.org/t/eip-8051-ml-dsa-verification/25857 |
| 7 | +status: Draft |
| 8 | +type: Standards Track |
| 9 | +category: Core |
| 10 | +created: 2025-10-15 |
| 11 | +--- |
| 12 | + |
| 13 | +## Abstract |
| 14 | + |
| 15 | +This proposal adds precompiled contracts that perform signature verifications using the NIST module-lattice signature scheme. Two instantiations are supported: |
| 16 | + |
| 17 | +* **ML-DSA** — NIST-compliant version using SHAKE256 (FIPS-204), |
| 18 | +* **ML-DSA-ETH** — EVM-optimized version for cheaper on-chain verification: |
| 19 | + * Uses Keccak-based PRNG instead of SHAKE256 (leverages native KECCAK256 precompile) |
| 20 | + * Stores public-key polynomial `t1` in the NTT domain to skip one NTT during verification (convertible to regular encoding offline) |
| 21 | + |
| 22 | +Two precompile contracts are specified: |
| 23 | + |
| 24 | +* `VERIFY_MLDSA` — verifies a ML-DSA signature compliant to FIPS-204. |
| 25 | +* `VERIFY_MLDSA_ETH` — verifies a ML-DSA-ETH signature replacing SHAKE256 with a more efficient hash function, deviating from FIPS-204. |
| 26 | + |
| 27 | +## Motivation |
| 28 | + |
| 29 | +Quantum computers pose a long-term risk to classical cryptographic algorithms. In particular, signature algorithms based on the hardness of the Elliptic Curve Discrete Logarithm Problem (ECDLP) such as secp256k1, are widely used in Ethereum and threaten by quantum algorithms. This exposes potentially on-chain assets and critical infrastructure to quantum adversaries. |
| 30 | + |
| 31 | +Integrating post-quantum signature schemes is crucial to future-proof Ethereum and other EVM-based environments. It shall be noted that infrastructure for post-quantum signatures should be deployed before quantum adversaries are known to be practical because it takes on the order of years for existing applications to integrate. |
| 32 | + |
| 33 | +Dilithium, a lattice-based scheme standardized by NIST as FIPS-204, offers high security against both classical and quantum adversaries. As the main winner of the standardization, the scheme has been selected as the main alternative to elliptic curve signature algorithms, making it a serious option for Ethereum. |
| 34 | + |
| 35 | +While the signature size (2.4kB) and public key size (1.3kB) are larger than other post-quantum candidates such as Falcon FN-DSA, this scheme is more flexible in terms of parameters. It is thus possible to derive a zero-knowledge version of Dilithium, keeping the security of the scheme together with an efficient in-circuit verification. This EIP does not dig into details this instance. |
| 36 | + |
| 37 | +ML-DSA has a simpler signer algorithm than FN-DSA, making hardware implementation easier. |
| 38 | +Finally, ML-DSA is based on the same mathematical construction as Kyber, the Post-Quantum Key Exchange algorithm standardized by NIST as FIPS-203. |
| 39 | +All these properties make ML-DSA well-suited for blockchain applications. |
| 40 | + |
| 41 | +In the context of the Ethereum Virtual Machine, a precompile for Keccak256 hash function is already available, making ML-DSA verification much faster when instantiated with an extendable output function derived from Keccak than with SHAKE256, as specified in NIST submission. This EIP specifies two version of ML-DSA enabling two important features: one version being fully compliant with the NIST specification, the other deviating in encodings to reduce the gas cost. |
| 42 | + |
| 43 | +## Specification |
| 44 | + |
| 45 | +The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119 and RFC 8174. |
| 46 | + |
| 47 | +The following specification provides two precompiled contract: |
| 48 | + |
| 49 | +|**Precompiled contract**|**1**|**2**| |
| 50 | +|-|-|-| |
| 51 | +|**Name**|`MLDSA_VERIFY`|`MLDSA_VERIFY_ETH`| |
| 52 | +|**Address**| 0x12| 0x13| |
| 53 | +|**Gas cost**| 4500| 4500| |
| 54 | + |
| 55 | +While ML-DSA can be instantiated for three security levels: NIST level II, III and IV, this EIP only covers NIST level II, corresponding to 128 bits of security. |
| 56 | + |
| 57 | +For the two variants of ML-DSA of this EIP, the following parameters are fixed: |
| 58 | + |
| 59 | +* Polynomial degree: `n = 256`, |
| 60 | +* Field modulus: `q = 8380417`, |
| 61 | +* Matrix dimensions: `k=4`, `l=4`, |
| 62 | +* Bounds of rejection: `γ_1 = 2¹⁷`, `γ_2 = (q-1)/88`, |
| 63 | +* Additional parameters: `η = 2`, `τ = 39`, `d = 13`. |
| 64 | + |
| 65 | +These parameters strictly follows NIST. More precisely, `q`, `n`, `k`, `l` and `η` are chosen in order to ensure a hard MLWE related problem, and the remaining parameters are chosen for the hardness of MSIS as well as for the efficiency of the scheme. |
| 66 | + |
| 67 | +In terms of storage, ML-DSA public key can be derived by the verifier, making the overall public key of 1312 bytes. However, this increases the verifier cost, making the on-chain verification too expensive from a practical point of view. In this EIP, the verification algorithm takes the public key in raw format, meaning that the storage for the public key is: |
| 68 | + |
| 69 | +* The full matrix `A_hat` of 16 384 bytes, |
| 70 | +* `tr` is stored in order to save one hash, with 32 bytes, |
| 71 | +* `t1` is stored in the NTT domain in order to save one NTT, with 4096 bytes. |
| 72 | + |
| 73 | +The overall storage for the **public key** is **20512 bytes**. The signature follows the same format as specified in FIPS-204: 32 bytes for `c_tilde`, 2304 bytes for the coefficients of `z`, and 84 bytes for `h`. In total, a **signature** requires **2420 bytes**. |
| 74 | + |
| 75 | +### Sub-algorithms of ML-DSA |
| 76 | + |
| 77 | +#### Number Theoretic Transform |
| 78 | + |
| 79 | +Polynomial arithmetic is computed efficiency using Number Theoretic Transform (NTT). Efficient polynomial multiplication can be implemented following the draft NTT EIP. NTT inverse cost is roughly the same as an NTT: $n\log(n)$ additions and $n/2 \log(n)$ multiplications over the field $\mathbb F_q$, where $q=8380417$ and $n=256$. |
| 80 | + |
| 81 | +#### EXtendable Output Function |
| 82 | + |
| 83 | +The verification algorithm requires an eXtendable Output Function (XOF) made from a hash function. |
| 84 | +This EIP provides two instantiations of a XOF: |
| 85 | + |
| 86 | +* SHAKE256 is the XOF provided in NIST submission, a sponge construction derived from SHA256. Extracting bytes using SHAKE256 calls the `Keccak_f` permutation as described in Section 3.7 of FIPS-204. While this construction is standardized, it is expensive when computed in the Ethereum Virtual Machine because `Keccak_f` has no EVM opcode. |
| 87 | +* Keccak-PRNG is a XOF that is build from a counter-mode PRNG based on Keccak256. Generating new chunks of bytes requires an incrementing counter, as described in NIST SP800-90A revision 1. This XOF has the same interface as SHAKE256, but requires a `flip()` function that initiate a counter to `0`. Then, the `squeeze` function outputs as many bytes as needed using a counter mode as specified in SP800-90A revision 1. A precompile of `Keccak256` is available in the Ethereum Virtual Machine, making this XOF very efficient in the EVM. |
| 88 | + |
| 89 | +#### Hints in ML-DSA |
| 90 | + |
| 91 | +ML-DSA requires some hint computation. More precisely, the function `use_hint` must be implemented following Algorithm 40 of FIPS-204. The output hint is a polynomial with coefficients in {0,1}. Another function `sum_hint` is required, and counts the number of non-zero values of the hint. |
| 92 | + |
| 93 | +#### Sample In Ball Challenge |
| 94 | + |
| 95 | +In ML-DSA, a challenge is computed using a XOF. |
| 96 | +This algorithm `sample_in_ball` outputs a polynomial with τ small coefficients (in {-1,1}). |
| 97 | +The values of the coefficients as well as the position in the coefficients list is obtained using the XOF, as specified in Algorithm 29 of FIPS-204. |
| 98 | + |
| 99 | +### ML-DSA verification algorithm |
| 100 | + |
| 101 | +Verifying a ML-DSA signature follows Algorithm 8 of FIPS-204, with `A_hat` of the public key stored in expanded format, and `t1` stored in the NTT domain. |
| 102 | +```python |
| 103 | +def VERIFY_MLDSA(public_key, message, signature) -> bool: |
| 104 | + A_hat, tr, t1 are decoded from public_key |
| 105 | + c_tilde, z, h are decoded from signature |
| 106 | + if h is not properly encoded, return False |
| 107 | + |
| 108 | + μ = shake_256(tr+m).extract(64) |
| 109 | + c = sample_in_ball(c_tilde, τ) |
| 110 | + # computed in the NTT domain |
| 111 | + # three NTTs for c and z, and t1 |
| 112 | + # one final inverse NTT. |
| 113 | + Az_minus_ct1 = A_hat * z - c * (2^d * t1) |
| 114 | + |
| 115 | + w_prime = h.use_hint(Az_minus_ct1, 2*γ_2) |
| 116 | + |
| 117 | + return check_norm_bound(z, γ_1 - β) and c_tilde == shake_256(μ + w_prime).extract(32) |
| 118 | +``` |
| 119 | + |
| 120 | +### ML-DSA-ETH verification algorithm |
| 121 | + |
| 122 | +The verification of ML-DSA-ETH signatures follows the same algorithm with another hash function, with two differences: |
| 123 | + |
| 124 | +* `t1` from the public key is stored in the NTT domain in order to save one NTT. The multiplication by `2^d` is also precomputed. Note that this change can be seen as a change of representation. |
| 125 | +* A variant of `sample_in_ball` is defined using KeccakPRNG. The only difference from Algorithm 29 of FIPS-204 is that it requires a `flip()` between lines 3 and 4 so that it initializes the counter to `0` before starting squeezing. Note that this can be implemented in `absorb()` and `squeeze()` so that the same interface can be used as in SHAKE256. |
| 126 | +```python |
| 127 | +def VERIFY_MLDSA_ETH(public_key, message, signature) -> bool: |
| 128 | + A_hat, tr, t1 are decoded from public_key |
| 129 | + c_tilde, z, h are decoded from signature |
| 130 | + if h is not properly encoded, return False |
| 131 | + |
| 132 | + μ = keccak_prng(tr+m).extract(64) |
| 133 | + c = sample_in_ball_keccak_prng(c_tilde, τ) |
| 134 | + # computed in the NTT domain |
| 135 | + # two NTTs for c and z |
| 136 | + # one final inverse NTT. |
| 137 | + Az_minus_ct1 = A_hat * z - c * t1 |
| 138 | + |
| 139 | + w_prime = h.use_hint(Az_minus_ct1, 2*γ_2) |
| 140 | + |
| 141 | + return check_norm_bound(z, γ_1 - β) and c_tilde == keccak_prng(μ + w_prime).extract(32) |
| 142 | +``` |
| 143 | + |
| 144 | +### Required checks in ML-DSA(-ETH) verification |
| 145 | + |
| 146 | +* The hint `h` needs to be properly encoded. The malformation of the hint is specified in Algorithm 21 of FIPS-204. |
| 147 | +* The element `z` must have a norm satisfying `||z||_∞ < γ_1 - β`. The norm `||.||_∞` is defined page 6 of FIPS-204. |
| 148 | +* The final hash output must be equal to the signature bytes `c_tilde`. |
| 149 | + |
| 150 | +### Precompiled contract specification |
| 151 | + |
| 152 | +### ML-DSA precompiled contract |
| 153 | + |
| 154 | +The precompiled contract VERIFY_MLDSA is proposed with the following input and outputs, which are big-endian values: |
| 155 | + |
| 156 | +* **Input data** |
| 157 | + * 32 bytes for the message |
| 158 | + * 2420 bytes for ML-DSA signature |
| 159 | + * 20512 bytes for the ML-DSA expanded public key |
| 160 | +* **Output data**: |
| 161 | + * If the algorithm process succeeds, it returns 1 in 32 bytes format. |
| 162 | + * If the algorithm process fails, it returns 0 in 32 bytes format. |
| 163 | + |
| 164 | +#### Error Cases |
| 165 | + |
| 166 | +* Insufficient gas has been provided. |
| 167 | +* Invalid input length (not compliant to described input) |
| 168 | +* Invalid field element encoding (≥ q) |
| 169 | +* Invalid norm bound |
| 170 | +* Invalid hint check |
| 171 | +* Signature verification failure |
| 172 | + |
| 173 | +### ML-DSA-ETH precompiled contract |
| 174 | + |
| 175 | +The precompiled contract VERIFY_MLDSA_ETH is proposed with the following input and outputs, which are big-endian values: |
| 176 | + |
| 177 | +* **Input data** |
| 178 | + * 32 bytes for the message |
| 179 | + * 2420 bytes for ML-DSA-ETH signature |
| 180 | + * 20512 bytes for the ML-DSA-ETH expanded public key |
| 181 | +* **Output data**: |
| 182 | + * If the algorithm process succeeds, it returns 1 in 32 bytes format. |
| 183 | + * If the algorithm process fails, it returns 0 in 32 bytes format. |
| 184 | + |
| 185 | +#### Error Cases |
| 186 | + |
| 187 | +* Insufficient gas has been provided. |
| 188 | +* Invalid input length (not compliant to described input) |
| 189 | +* Invalid field element encoding (≥ q) |
| 190 | +* Invalid norm bound |
| 191 | +* Invalid hint check |
| 192 | +* Signature verification failure |
| 193 | + |
| 194 | +### Precompiled contract gas usage |
| 195 | + |
| 196 | +The cost of the **VERIFY_MLDSA** and **VERIFY_MLDSA_ETH** functions is dominated by the call to the NTTs, and the required hash calls for sampling in the ball (and for μ and the final check). |
| 197 | +It represents in average 5 calls to the hash function. Taking linearly the cost of keccak256, and avoiding the context switching it represents 4500 gas. |
| 198 | + |
| 199 | +## Rationale |
| 200 | + |
| 201 | +The ML-DSA scheme was selected as a NIST-standardized post-quantum cryptographic algorithm due to its strong security guarantees and efficiency. |
| 202 | + |
| 203 | +ML-DSA is a signature algorithm build from lattice-based cryptography. Specifically, its hardness relies on the Short Integer Solution (SIS) problem and the Learning With Errors (LWE) problem, which is believed to be hard for both classical and quantum computers. |
| 204 | + |
| 205 | +ML-DSA (based on CRYSTALS-Dilithium) offers a strong balance between security, efficiency, and practicality compared to classical ECC and other post-quantum schemes. Its signature and key sizes remain reasonably small, making it practical for real-world deployments such as Ethereum blockchain. As the main winner of the NIST PQC standardization process, it benefits from a broad community consensus on its security, which is not yet the case for many alternatives. Moreover, its lattice-based structure allows flexible parameter tuning, making it easier to adapt for specialized contexts like zero-knowledge proofs. Schemes like Falcon (FN-DSA, another signature scheme built on top of lattice-based assumptions) have a more rigid parameterization, making zero-knowledge circuits larger, and thus proof computation more expensive. |
| 206 | + |
| 207 | +Given the increasing urgency of transitioning to quantum-resistant cryptographic primitives or even having them ready in the event that research into quantum computers speeds up. |
| 208 | + |
| 209 | +## Backwards Compatibility |
| 210 | + |
| 211 | +In compliance with EIP-7932, the necessary parameters and structure for its integration are provided. `ALG_TYPE = 0xD1` uniquely identifies ML-DSA transactions, set MAX_SIZE = 2420 bytes to accommodate the fixed-length signature_info container, and recommend a `GAS_PENALTY` of approximately `3000` gas subject to benchmarking. The verification function follows the EIP-7932 model, parsing the signature_info, recovering the corresponding Dilithium public key, verifying the signature against the transaction payload hash, and deriving the signer's Ethereum address as the last 20 bytes of keccak256(pubkey). This definition ensures that ML-DSA can be cleanly adopted within the `AlgorithmicTransaction` container specified by EIP-7932. |
| 212 | +```python |
| 213 | +signature_info = Container[ |
| 214 | + # 0xD1 for ML-DSA (NIST-compliant version), |
| 215 | + # 0xD2 for ML-DSA-ETH (EVM-friendly version), |
| 216 | + version: uint8 |
| 217 | + # ML-DSA signature |
| 218 | + signature: ByteVector[2420] |
| 219 | + # keccak256(pubkey)[12:] |
| 220 | + pubkey_hash: ByteVector[20] |
| 221 | +] |
| 222 | +``` |
| 223 | + |
| 224 | +In the format of EIP-7932: |
| 225 | + |
| 226 | +* For the NIST-compliant version: |
| 227 | +```python |
| 228 | +verify(signature_info: bytes, payload_hash: Hash32) -> ExecutionAddress: |
| 229 | + assert len(signature_info) == 699 |
| 230 | + version = signature_info[0] |
| 231 | + signature = signature_info[1:2421] |
| 232 | + pubkey_hash = signature_info[2421:2441] |
| 233 | + assert version == 0xD1 |
| 234 | + pubkey = lookup_pubkey(pubkey_hash) |
| 235 | + assert VERIFY_MLDSA(pubkey, payload_hash, signature) |
| 236 | + return ExecutionAddress(keccak256(pubkey)[12:]) |
| 237 | +``` |
| 238 | + |
| 239 | +* For the EVM-friendly version: |
| 240 | +```python |
| 241 | +verify(signature_info: bytes, payload_hash: Hash32) -> ExecutionAddress: |
| 242 | + assert len(signature_info) == 699 |
| 243 | + version = signature_info[0] |
| 244 | + signature = signature_info[1:2421] |
| 245 | + pubkey_hash = signature_info[2421:2441] |
| 246 | + assert version == 0xD2 |
| 247 | + pubkey = lookup_pubkey(pubkey_hash) |
| 248 | + assert VERIFY_MLDSA_ETH(pubkey, payload_hash, signature) |
| 249 | + return ExecutionAddress(keccak256(pubkey)[12:]) |
| 250 | +``` |
| 251 | + |
| 252 | +## Test Cases |
| 253 | + |
| 254 | +A set of test vectors for verifying implementations is located in a separate file (to be provided for each opcode). For the NIST compliant version, KATs are reproduced. |
| 255 | + |
| 256 | +## Reference Implementation |
| 257 | + |
| 258 | +An implementation is provided in `assets` of this EIP. For the NIST-compliant version, KAT vectors of the NIST submission are valid. |
| 259 | + |
| 260 | +## Security Considerations |
| 261 | + |
| 262 | +The derivation path to obtain the private key from the seed is (tbd). |
| 263 | + |
| 264 | +## Copyright |
| 265 | + |
| 266 | +Copyright and related rights waived via [CC0](../LICENSE.md). |
0 commit comments