Commit 41af916

Alexei Starovoitov authored and Kernel Patches Daemon committed
bpf: Recognize special arithmetic shift in the verifier
cilium bpf_wiregard.bpf.c when compiled with -O1 fails to load with the following verifier log:

  192: (79) r2 = *(u64 *)(r10 -304) ; R2=pkt(r=40) R10=fp0 fp-304=pkt(r=40)
  ...
  227: (85) call bpf_skb_store_bytes#9 ; R0=scalar()
  228: (bc) w2 = w0 ; R0=scalar() R2=scalar(smin=0,smax=umax=0xffffffff,var_off=(0x0; 0xffffffff))
  229: (c4) w2 s>>= 31 ; R2=scalar(smin=0,smax=umax=0xffffffff,smin32=-1,smax32=0,var_off=(0x0; 0xffffffff))
  230: (54) w2 &= -134 ; R2=scalar(smin=0,smax=umax=umax32=0xffffff7a,smax32=0x7fffff7a,var_off=(0x0; 0xffffff7a))
  ...
  232: (66) if w2 s> 0xffffffff goto pc+125 ; R2=scalar(smin=umin=umin32=0x80000000,smax=umax=umax32=0xffffff7a,smax32=-134,var_off=(0x80000000; 0x7fffff7a))
  ...
  238: (79) r4 = *(u64 *)(r10 -304) ; R4=scalar() R10=fp0 fp-304=scalar()
  239: (56) if w2 != 0xffffff78 goto pc+210 ; R2=0xffffff78 // -136
  ...
  258: (71) r1 = *(u8 *)(r4 +0)
  R4 invalid mem access 'scalar'

The error might confuse most bpf authors, since the fp-304 slot held a 'pkt' pointer at insn 192 and became 'scalar' at insn 238. That happened because bpf_skb_store_bytes() clears all packet pointers, including those spilled to the stack. At first glance it might look like a bug in the source code, since the ctx->data pointer should have been reloaded after the call to bpf_skb_store_bytes(). The relevant part of the cilium source code looks like this:

  // bpf/lib/nodeport.h
  int dsr_set_ipip6()
  {
          if (ctx_adjust_hroom(...))
                  return DROP_INVALID; // -134
          if (ctx_store_bytes(...))
                  return DROP_WRITE_ERROR; // -141
          return 0;
  }

  bool dsr_fail_needs_reply(int code)
  {
          if (code == DROP_FRAG_NEEDED) // -136
                  return true;
          return false;
  }

  tail_nodeport_ipv6_dsr()
  {
          ret = dsr_set_ipip6(...);
          if (!IS_ERR(ret)) {
                  ...
          } else {
                  if (dsr_fail_needs_reply(ret))
                          return dsr_reply_icmp6(...);
          }
  }

The code doesn't have an arithmetic shift by 31 and it reloads ctx->data every time it needs to access it. So it's not a bug in the source code.

The reason is the DAGCombiner::foldSelectCCToShiftAnd() LLVM transformation:

  // If this is a select where the false operand is zero and the compare is a
  // check of the sign bit, see if we can perform the "gzip trick":
  // select_cc setlt X, 0, A, 0 -> and (sra X, size(X)-1), A
  // select_cc setgt X, 0, A, 0 -> and (not (sra X, size(X)-1)), A

The conditional branch in dsr_set_ipip6() and its return values are optimized into BPF_ARSH plus BPF_AND:

  227: (85) call bpf_skb_store_bytes#9
  228: (bc) w2 = w0
  229: (c4) w2 s>>= 31 ; R2=scalar(smin=0,smax=umax=0xffffffff,smin32=-1,smax32=0,var_off=(0x0; 0xffffffff))
  230: (54) w2 &= -134 ; R2=scalar(smin=0,smax=umax=umax32=0xffffff7a,smax32=0x7fffff7a,var_off=(0x0; 0xffffff7a))

After insn 230 the register w2 can only be 0 or -134, but the verifier approximates it, since there is no way to represent two scalars in bpf_reg_state. After the fall-through at insn 232, w2 can only be -134, hence the branch at insn

  239: (56) if w2 != -136 goto pc+210

should always be taken, and the trapping insn 258 should never execute. LLVM generated correct code, but the verifier follows an impossible path and rejects a valid program.

To fix this issue, recognize this special LLVM optimization and fork the verifier state. So after insn

  229: (c4) w2 s>>= 31

the verifier has two states to explore: one with w2 = 0 and another with w2 = 0xffffffff, which makes the verifier accept bpf_wiregard.c.

Note there are 20+ such patterns in bpf_wiregard.o compiled with -O1 and -O2, but they're rarely seen in other production bpf programs, so the push_stack() approach is not a concern.

Reported-by: Hao Sun <[email protected]>
Signed-off-by: Alexei Starovoitov <[email protected]>
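The fold quoted in the commit message can be checked in isolation. The following is a minimal sketch, not code from LLVM or the kernel: the function names select_form() and shift_and_form() are made up for illustration, and the arithmetic shift is spelled with unsigned arithmetic so that the result is fully defined by the C standard (it produces the same 0 or -1 mask that a 32-bit `s>>= 31` produces).

```c
#include <assert.h>
#include <stdint.h>

/* Branchy form, as written in the source: "x < 0 ? a : 0". */
static int32_t select_form(int32_t x, int32_t a)
{
	return x < 0 ? a : 0;
}

/*
 * Branchless form produced by the fold: and (sra X, 31), A.
 * The sign bit is extracted with an unsigned shift and negated, which
 * yields 0 for non-negative x and -1 (all ones) for negative x. That is
 * the value an arithmetic right shift by 31 would produce.
 */
static int32_t shift_and_form(int32_t x, int32_t a)
{
	int32_t mask = -(int32_t)((uint32_t)x >> 31);

	return mask & a;
}
```

With a = -134, both forms return -134 for any negative x and 0 otherwise, which is why w2 can only hold those two values after insn 230.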
1 parent eb66b0c commit 41af916

1 file changed: +32 -0 lines changed

kernel/bpf/verifier.c

Lines changed: 32 additions & 0 deletions
@@ -15470,6 +15470,35 @@ static bool is_safe_to_compute_dst_reg_range(struct bpf_insn *insn,
 	}
 }
 
+static int maybe_fork_scalars(struct bpf_verifier_env *env, struct bpf_insn *insn,
+			      struct bpf_reg_state *dst_reg)
+{
+	struct bpf_verifier_state *branch;
+	struct bpf_reg_state *regs;
+	bool alu32;
+
+	if (dst_reg->smin_value == -1 && dst_reg->smax_value == 0)
+		alu32 = false;
+	else if (dst_reg->s32_min_value == -1 && dst_reg->s32_max_value == 0)
+		alu32 = true;
+	else
+		return 0;
+
+	branch = push_stack(env, env->insn_idx + 1, env->insn_idx, false);
+	if (IS_ERR(branch))
+		return PTR_ERR(branch);
+
+	regs = branch->frame[branch->curframe]->regs;
+	if (alu32) {
+		__mark_reg32_known(&regs[insn->dst_reg], 0);
+		__mark_reg32_known(dst_reg, -1ull);
+	} else {
+		__mark_reg_known(&regs[insn->dst_reg], 0);
+		__mark_reg_known(dst_reg, -1ull);
+	}
+	return 0;
+}
+
 /* WARNING: This function does calculations on 64-bit values, but the actual
  * execution may occur on 32-bit values. Therefore, things like bitshifts
  * need extra checks in the 32-bit case.
@@ -15563,6 +15592,9 @@ static int adjust_scalar_min_max_vals(struct bpf_verifier_env *env,
 			scalar32_min_max_arsh(dst_reg, &src_reg);
 		else
 			scalar_min_max_arsh(dst_reg, &src_reg);
+		ret = maybe_fork_scalars(env, insn, dst_reg);
+		if (ret)
+			return ret;
 		break;
 	default:
 		break;
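The effect of forking can be illustrated with a toy model. This is a sketch under loud assumptions: struct toy_reg and the toy_*() helpers are hypothetical, they track only a signed 32-bit range, and they do not model the verifier's tnum (var_off) tracking, push_stack(), or bpf_reg_state. The AND approximation for non-constant ranges is deliberately coarse; the point is only that two known constants stay exact where a single range does not.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for a register's signed 32-bit bounds. */
struct toy_reg {
	int32_t smin;
	int32_t smax;
};

/*
 * If the range is exactly [-1, 0], split it into two known constants,
 * mirroring the idea of exploring one state with 0 and one with -1.
 * Returns the number of states written to out[] (1 or 2).
 */
static int toy_maybe_fork(struct toy_reg r, struct toy_reg out[2])
{
	if (r.smin == -1 && r.smax == 0) {
		out[0] = (struct toy_reg){ 0, 0 };
		out[1] = (struct toy_reg){ -1, -1 };
		return 2;
	}
	out[0] = r;
	return 1;
}

/*
 * AND with a constant: exact when the register is a known constant,
 * otherwise give up and return the full range (coarser than the real
 * verifier, which is an assumption of this sketch).
 */
static struct toy_reg toy_and_const(struct toy_reg r, int32_t mask)
{
	if (r.smin == r.smax) {
		int32_t v = r.smin & mask;

		return (struct toy_reg){ v, v };
	}
	return (struct toy_reg){ INT32_MIN, INT32_MAX };
}

/* Could the register hold val, according to the tracked range? */
static bool toy_may_equal(struct toy_reg r, int32_t val)
{
	return r.smin <= val && val <= r.smax;
}
```

In this model, applying `& -134` to the unforked range [-1, 0] leaves -136 looking reachable, while the two forked states become exactly 0 and -134, so a comparison against -136 can be ruled out in both.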
