libbpf

mirror of https://github.com/netdata/libbpf.git synced 2026-06-23 23:19:07 +08:00

Author	SHA1	Message	Date
Eduard Zingerman	438adf417d	libbpf: Rewrite btf datasec names starting from '?' Optional struct_ops maps are defined using question mark at the start of the section name, e.g.: SEC("?.struct_ops") struct test_ops optional_map = { ... }; This commit teaches libbpf to detect if kernel allows '?' prefix in datasec names, and if it doesn't then to rewrite such names by replacing '?' with '_', e.g.: DATASEC ?.struct_ops -> DATASEC _.struct_ops Signed-off-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20240306104529.6453-13-eddyz87@gmail.com	2024-03-06 13:58:27 -08:00
Eduard Zingerman	fc8b86bda2	libbpf: Struct_ops in SEC("?.struct_ops") / SEC("?.struct_ops.link") Allow using two new section names for struct_ops maps: - SEC("?.struct_ops") - SEC("?.struct_ops.link") To specify maps that have bpf_map->autocreate == false after open. Signed-off-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20240306104529.6453-12-eddyz87@gmail.com	2024-03-06 13:58:27 -08:00
Eduard Zingerman	d5d0b6e920	libbpf: Replace elf_state->st_ops_* fields with SEC_ST_OPS sec_type The next patch would add two new section names for struct_ops maps. To make working with multiple struct_ops sections more convenient: - remove fields like elf_state->st_ops_{shndx,link_shndx}; - mark section descriptions hosting struct_ops as elf_sec_desc->sec_type == SEC_ST_OPS; After these changes struct_ops sections could be processed uniformly by iterating bpf_object->efile.secs entries. Signed-off-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20240306104529.6453-11-eddyz87@gmail.com	2024-03-06 13:58:27 -08:00
Eduard Zingerman	060c604db8	libbpf: Sync progs autoload with maps autocreate for struct_ops maps Automatically select which struct_ops programs to load depending on which struct_ops maps are selected for automatic creation. E.g. for the BPF code below: SEC("struct_ops/test_1") int BPF_PROG(foo) { ... } SEC("struct_ops/test_2") int BPF_PROG(bar) { ... } SEC(".struct_ops.link") struct test_ops___v1 A = { .foo = (void )foo }; SEC(".struct_ops.link") struct test_ops___v2 B = { .foo = (void )foo, .bar = (void *)bar, }; And the following libbpf API calls: bpf_map__set_autocreate(skel->maps.A, true); bpf_map__set_autocreate(skel->maps.B, false); The autoload would be enabled for program 'foo' and disabled for program 'bar'. During load, for each struct_ops program P, referenced from some struct_ops map M: - set P.autoload = true if M.autocreate is true for some M; - set P.autoload = false if M.autocreate is false for all M; - don't change P.autoload, if P is not referenced from any map. Do this after bpf_object__init_kern_struct_ops_maps() to make sure that shadow vars assignment is done. Signed-off-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20240306104529.6453-9-eddyz87@gmail.com	2024-03-06 13:58:27 -08:00
Eduard Zingerman	cb426140d0	libbpf: Honor autocreate flag for struct_ops maps Skip load steps for struct_ops maps not marked for automatic creation. This should allow to load bpf object in situations like below: SEC("struct_ops/foo") int BPF_PROG(foo) { ... } SEC("struct_ops/bar") int BPF_PROG(bar) { ... } struct test_ops___v1 { int (foo)(void); }; struct test_ops___v2 { int (foo)(void); int (does_not_exist)(void); }; SEC(".struct_ops.link") struct test_ops___v1 map_for_old = { .test_1 = (void )foo }; SEC(".struct_ops.link") struct test_ops___v2 map_for_new = { .test_1 = (void )foo, .does_not_exist = (void )bar }; Suppose program is loaded on old kernel that does not have definition for 'does_not_exist' struct_ops member. After this commit it would be possible to load such object file after the following tweaks: bpf_program__set_autoload(skel->progs.bar, false); bpf_map__set_autocreate(skel->maps.map_for_new, false); Signed-off-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: David Vernet <void@manifault.com> Link: https://lore.kernel.org/bpf/20240306104529.6453-4-eddyz87@gmail.com	2024-03-06 13:58:27 -08:00
Eduard Zingerman	aded62b120	libbpf: Tie struct_ops programs to kernel BTF ids, not to local ids Enforce the following existing limitation on struct_ops programs based on kernel BTF id instead of program-local BTF id: struct_ops BPF prog can be re-used between multiple .struct_ops & .struct_ops.link as long as it's the same struct_ops struct definition and the same function pointer field This allows reusing same BPF program for versioned struct_ops map definitions, e.g.: SEC("struct_ops/test") int BPF_PROG(foo) { ... } struct some_ops___v1 { int (test)(void); }; struct some_ops___v2 { int (test)(void); }; SEC(".struct_ops.link") struct some_ops___v1 a = { .test = foo } SEC(".struct_ops.link") struct some_ops___v2 b = { .test = foo } Signed-off-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20240306104529.6453-3-eddyz87@gmail.com	2024-03-06 13:58:27 -08:00
Eduard Zingerman	f7fd5dbc07	libbpf: Allow version suffixes (___smth) for struct_ops types E.g. allow the following struct_ops definitions: struct bpf_testmod_ops___v1 { int (test)(void); }; struct bpf_testmod_ops___v2 { int (test)(void); }; SEC(".struct_ops.link") struct bpf_testmod_ops___v1 a = { .test = ... } SEC(".struct_ops.link") struct bpf_testmod_ops___v2 b = { .test = ... } Where both bpf_testmod_ops__v1 and bpf_testmod_ops__v2 would be resolved as 'struct bpf_testmod_ops' from kernel BTF. Signed-off-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: David Vernet <void@manifault.com> Acked-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20240306104529.6453-2-eddyz87@gmail.com	2024-03-06 13:58:27 -08:00
Alexei Starovoitov	00b08dceea	bpf: Introduce may_goto instruction Introduce may_goto instruction that from the verifier pov is similar to open coded iterators bpf_for()/bpf_repeat() and bpf_loop() helper, but it doesn't iterate any objects. In assembly 'may_goto' is a nop most of the time until bpf runtime has to terminate the program for whatever reason. In the current implementation may_goto has a hidden counter, but other mechanisms can be used. For programs written in C the later patch introduces 'cond_break' macro that combines 'may_goto' with 'break' statement and has similar semantics: cond_break is a nop until bpf runtime has to break out of this loop. It can be used in any normal "for" or "while" loop, like for (i = zero; i < cnt; cond_break, i++) { The verifier recognizes that may_goto is used in the program, reserves additional 8 bytes of stack, initializes them in subprog prologue, and replaces may_goto instruction with: aux_reg = (u64 )(fp - 40) if aux_reg == 0 goto pc+off aux_reg -= 1 (u64 )(fp - 40) = aux_reg may_goto instruction can be used by LLVM to implement __builtin_memcpy, __builtin_strcmp. may_goto is not a full substitute for bpf_for() macro. bpf_for() doesn't have induction variable that verifiers sees, so 'i' in bpf_for(i, 0, 100) is seen as imprecise and bounded. But when the code is written as: for (i = 0; i < 100; cond_break, i++) the verifier see 'i' as precise constant zero, hence cond_break (aka may_goto) doesn't help to converge the loop. A static or global variable can be used as a workaround: static int zero = 0; for (i = zero; i < 100; cond_break, i++) // works! may_goto works well with arena pointers that don't need to be bounds checked on access. Load/store from arena returns imprecise unbounded scalar and loops with may_goto pass the verifier. Reserve new opcode BPF_JMP \| BPF_JCOND for may_goto insn. JCOND stands for conditional pseudo jump. Since goto_or_nop insn was proposed, it may use the same opcode. may_goto vs goto_or_nop can be distinguished by src_reg: code = BPF_JMP \| BPF_JCOND src_reg = 0 - may_goto src_reg = 1 - goto_or_nop Signed-off-by: Alexei Starovoitov <ast@kernel.org> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Eduard Zingerman <eddyz87@gmail.com> Link: https://lore.kernel.org/bpf/20240306031929.42666-2-alexei.starovoitov@gmail.com	2024-03-06 13:58:27 -08:00
Chen Shen	bf52494e2b	libbpf: Correct debug message in btf__load_vmlinux_btf In the function btf__load_vmlinux_btf, the debug message incorrectly refers to 'path' instead of 'sysfs_btf_path'. Signed-off-by: Chen Shen <peterchenshen@gmail.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Yonghong Song <yonghong.song@linux.dev> Link: https://lore.kernel.org/bpf/20240302062218.3587-1-peterchenshen@gmail.com	2024-03-06 13:58:27 -08:00
Kui-Feng Lee	acfaeffeaa	libbpf: Convert st_ops->data to shadow type. Convert st_ops->data to the shadow type of the struct_ops map. The shadow type of a struct_ops type is a variant of the original struct type providing a way to access/change the values in the maps of the struct_ops type. bpf_map__initial_value() will return st_ops->data for struct_ops types. The skeleton is going to use it as the pointer to the shadow type of the original struct type. One of the main differences between the original struct type and the shadow type is that all function pointers of the shadow type are converted to pointers of struct bpf_program. Users can replace these bpf_program pointers with other BPF programs. The st_ops->progs[] will be updated before updating the value of a map to reflect the changes made by users. Signed-off-by: Kui-Feng Lee <thinker.li@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20240229064523.2091270-3-thinker.li@gmail.com	2024-03-06 13:58:27 -08:00
Kui-Feng Lee	0758d8b0f2	libbpf: Set btf_value_type_id of struct bpf_map for struct_ops. For a struct_ops map, btf_value_type_id is the type ID of it's struct type. This value is required by bpftool to generate skeleton including pointers of shadow types. The code generator gets the type ID from bpf_map__btf_value_type_id() in order to get the type information of the struct type of a map. Signed-off-by: Kui-Feng Lee <thinker.li@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20240229064523.2091270-2-thinker.li@gmail.com	2024-03-06 13:58:27 -08:00
Kees Cook	fa4d00254d	bpf: Replace bpf_lpm_trie_key 0-length array with flexible array Replace deprecated 0-length array in struct bpf_lpm_trie_key with flexible array. Found with GCC 13: ../kernel/bpf/lpm_trie.c:207:51: warning: array subscript i is outside array bounds of 'const __u8[0]' {aka 'const unsigned char[]'} [-Warray-bounds=] 207 \| (__be16 )&key->data[i]); \| ^~~~~~~~~~~~~ ../include/uapi/linux/swab.h:102:54: note: in definition of macro '__swab16' 102 \| #define __swab16(x) (__u16)__builtin_bswap16((__u16)(x)) \| ^ ../include/linux/byteorder/generic.h:97:21: note: in expansion of macro '__be16_to_cpu' 97 \| #define be16_to_cpu __be16_to_cpu \| ^~~~~~~~~~~~~ ../kernel/bpf/lpm_trie.c:206:28: note: in expansion of macro 'be16_to_cpu' 206 \| u16 diff = be16_to_cpu((__be16 )&node->data[i] ^ \| ^~~~~~~~~~~ In file included from ../include/linux/bpf.h:7: ../include/uapi/linux/bpf.h:82:17: note: while referencing 'data' 82 \| __u8 data[0]; /* Arbitrary size / \| ^~~~ And found at run-time under CONFIG_FORTIFY_SOURCE: UBSAN: array-index-out-of-bounds in kernel/bpf/lpm_trie.c:218:49 index 0 is out of range for type '__u8 []' Changing struct bpf_lpm_trie_key is difficult since has been used by userspace. For example, in Cilium: struct egress_gw_policy_key { struct bpf_lpm_trie_key lpm_key; __u32 saddr; __u32 daddr; }; While direct references to the "data" member haven't been found, there are static initializers what include the final member. For example, the "{}" here: struct egress_gw_policy_key in_key = { .lpm_key = { 32 + 24, {} }, .saddr = CLIENT_IP, .daddr = EXTERNAL_SVC_IP & 0Xffffff, }; To avoid the build time and run time warnings seen with a 0-sized trailing array for struct bpf_lpm_trie_key, introduce a new struct that correctly uses a flexible array for the trailing bytes, struct bpf_lpm_trie_key_u8. As part of this, include the "header" portion (which is just the "prefixlen" member), so it can be used by anything building a bpf_lpr_trie_key that has trailing members that aren't a u8 flexible array (like the self-test[1]), which is named struct bpf_lpm_trie_key_hdr. Unfortunately, C++ refuses to parse the __struct_group() helper, so it is not possible to define struct bpf_lpm_trie_key_hdr directly in struct bpf_lpm_trie_key_u8, so we must open-code the union directly. Adjust the kernel code to use struct bpf_lpm_trie_key_u8 through-out, and for the selftest to use struct bpf_lpm_trie_key_hdr. Add a comment to the UAPI header directing folks to the two new options. Reported-by: Mark Rutland <mark.rutland@arm.com> Signed-off-by: Kees Cook <keescook@chromium.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Gustavo A. R. Silva <gustavoars@kernel.org> Closes: https://paste.debian.net/hidden/ca500597/ Link: https://lore.kernel.org/all/202206281009.4332AA33@keescook/ [1] Link: https://lore.kernel.org/bpf/20240222155612.it.533-kees@kernel.org	2024-03-06 13:58:27 -08:00
Aahil Awatramani	f749be80b7	bonding: Add independent control state machine Add support for the independent control state machine per IEEE 802.1AX-2008 5.4.15 in addition to the existing implementation of the coupled control state machine. Introduces two new states, AD_MUX_COLLECTING and AD_MUX_DISTRIBUTING in the LACP MUX state machine for separated handling of an initial Collecting state before the Collecting and Distributing state. This enables a port to be in a state where it can receive incoming packets while not still distributing. This is useful for reducing packet loss when a port begins distributing before its partner is able to collect. Added new functions such as bond_set_slave_tx_disabled_flags and bond_set_slave_rx_enabled_flags to precisely manage the port's collecting and distributing states. Previously, there was no dedicated method to disable TX while keeping RX enabled, which this patch addresses. Note that the regular flow process in the kernel's bonding driver remains unaffected by this patch. The extension requires explicit opt-in by the user (in order to ensure no disruptions for existing setups) via netlink support using the new bonding parameter coupled_control. The default value for coupled_control is set to 1 so as to preserve existing behaviour. Signed-off-by: Aahil Awatramani <aahila@google.com> Reviewed-by: Hangbin Liu <liuhangbin@gmail.com> Link: https://lore.kernel.org/r/20240202175858.1573852-1-aahila@google.com Signed-off-by: Paolo Abeni <pabeni@redhat.com>	2024-03-06 13:58:27 -08:00
Andrii Nakryiko	fb98d4bd25	include: fix BPF_CALL_REL definition Fix our Github-specific definition of BPF_CALL_REL macro. It was missing the code part. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-03-01 15:39:45 -08:00
Kui-Feng Lee	f4e9b606f4	ci: clean up bpf_test_no_cfi.ko for v5.5.0 and v4.9.0. bpf_test_no_cfi.ko is not available for v5.5.0 and v4.9.0. Signed-off-by: Kui-Feng Lee <thinker.li@gmail.com>	2024-02-27 10:14:31 -08:00
Kui-Feng Lee	ff95bd6238	sync: latest libbpf changes from kernel Syncing latest libbpf commits from kernel repository. Baseline bpf-next commit: 92a871ab9fa59a74d013bc04f321026a057618e7 Checkpoint bpf-next commit: 2ab256e93249f5ac1da665861aa0f03fb4208d9c Baseline bpf commit: 577e4432f3ac810049cb7e6b71f4d96ec7c6e894 Checkpoint bpf commit: dced881ead78e4d6add3735d02a9186ba2415630 Arnaldo Carvalho de Melo (1): tools headers UAPI: Sync linux/fcntl.h with the kernel sources Cupertino Miranda (1): libbpf: Add support to GCC in CORE macro definitions Martin Kelly (1): bpf: Clarify batch lookup/lookup_and_delete semantics Matt Bobrowski (1): libbpf: Make remark about zero-initializing bpf_*_info structs include/uapi/linux/bpf.h \| 6 ++++- include/uapi/linux/fcntl.h \| 3 +++ src/bpf.h \| 39 ++++++++++++++++++++++++--------- src/bpf_core_read.h \| 45 ++++++++++++++++++++++++++++++++------ 4 files changed, 75 insertions(+), 18 deletions(-) Signed-off-by: Kui-Feng Lee <thinker.li@gmail.com>	2024-02-27 10:14:31 -08:00
Arnaldo Carvalho de Melo	a894b0cb9b	tools headers UAPI: Sync linux/fcntl.h with the kernel sources To get the changes in: 8a924db2d7b5eb69 ("fs: Pass AT_GETATTR_NOSEC flag to getattr interface function") That don't add anything that is handled by existing hard coded tables or table generation scripts. This silences this perf build warning: Warning: Kernel ABI header differences: diff -u tools/include/uapi/linux/fcntl.h include/uapi/linux/fcntl.h Cc: Adrian Hunter <adrian.hunter@intel.com> Cc: Christian Brauner <brauner@kernel.org> Cc: Ian Rogers <irogers@google.com> Cc: Jiri Olsa <jolsa@kernel.org> Cc: Namhyung Kim <namhyung@kernel.org> Cc: Stefan Berger <stefanb@linux.ibm.com> Link: https://lore.kernel.org/lkml/ZbJv9fGF_k2xXEdr@kernel.org Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>	2024-02-27 10:14:31 -08:00
Martin Kelly	afa81fb1cb	bpf: Clarify batch lookup/lookup_and_delete semantics The batch lookup and lookup_and_delete APIs have two parameters, in_batch and out_batch, to facilitate iterative lookup/lookup_and_deletion operations for supported maps. Except NULL for in_batch at the start of these two batch operations, both parameters need to point to memory equal or larger than the respective map key size, except for various hashmaps (hash, percpu_hash, lru_hash, lru_percpu_hash) where the in_batch/out_batch memory size should be at least 4 bytes. Document these semantics to clarify the API. Signed-off-by: Martin Kelly <martin.kelly@crowdstrike.com> Acked-by: Yonghong Song <yonghong.song@linux.dev> Link: https://lore.kernel.org/r/20240221211838.1241578-1-martin.kelly@crowdstrike.com Signed-off-by: Martin KaFai Lau <martin.lau@kernel.org>	2024-02-27 10:14:31 -08:00
Matt Bobrowski	16e68ab13c	libbpf: Make remark about zero-initializing bpf_*_info structs In some situations, if you fail to zero-initialize the bpf_{prog,map,btf,link}_info structs supplied to the set of LIBBPF helpers bpf_{prog,map,btf,link}_get_info_by_fd(), you can expect the helper to return an error. This can possibly leave people in a situation where they're scratching their heads for an unnnecessary amount of time. Make an explicit remark about the requirement of zero-initializing the supplied bpf_{prog,map,btf,link}_info structs for the respective LIBBPF helpers. Internally, LIBBPF helpers bpf_{prog,map,btf,link}_get_info_by_fd() call into bpf_obj_get_info_by_fd() where the bpf(2) BPF_OBJ_GET_INFO_BY_FD command is used. This specific command is effectively backed by restrictions enforced by the bpf_check_uarg_tail_zero() helper. This function ensures that if the size of the supplied bpf_{prog,map,btf,link}_info structs are larger than what the kernel can handle, trailing bits are zeroed. This can be a problem when compiling against UAPI headers that don't necessarily match the sizes of the same underlying types known to the kernel. Signed-off-by: Matt Bobrowski <mattbobrowski@google.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Jiri Olsa <jolsa@kernel.org> Link: https://lore.kernel.org/bpf/ZcyEb8x4VbhieWsL@google.com	2024-02-27 10:14:31 -08:00
Cupertino Miranda	b19fdbf1be	libbpf: Add support to GCC in CORE macro definitions Due to internal differences between LLVM and GCC the current implementation for the CO-RE macros does not fit GCC parser, as it will optimize those expressions even before those would be accessible by the BPF backend. As examples, the following would be optimized out with the original definitions: - As enums are converted to their integer representation during parsing, the IR would not know how to distinguish an integer constant from an actual enum value. - Types need to be kept as temporary variables, as the existing type casts of the 0 address (as expanded for LLVM), are optimized away by the GCC C parser, never really reaching GCCs IR. Although, the macros appear to add extra complexity, the expanded code is removed from the compilation flow very early in the compilation process, not really affecting the quality of the generated assembly. Signed-off-by: Cupertino Miranda <cupertino.miranda@oracle.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20240213173543.1397708-1-cupertino.miranda@oracle.com	2024-02-27 10:14:31 -08:00
Manu Bretelle	445486dcbf	ci: Pass arch parameter to setup-build-env Since `1bc40aecb3` arch parameter needs to be passed to `setup-build-env` Signed-off-by: Manu Bretelle <chantr4@gmail.com>	2024-02-15 10:44:45 -08:00
Andrii Nakryiko	820bca2cb6	ci: verifier_global_subprogs can't be run on 5.5 We get: libbpf: struct_ops init_kern: struct bpf_dummy_ops is not found in kernel BTF So even though it's irrelevant to the subtests we do want to test, entire test has to be skipped, unfortunately. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-02-06 11:52:00 -08:00
Andrii Nakryiko	8a8feae5f4	sync: latest libbpf changes from kernel Syncing latest libbpf commits from kernel repository. Baseline bpf-next commit: 943b043aeecce9accb6d367af47791c633e95e4d Checkpoint bpf-next commit: 92a871ab9fa59a74d013bc04f321026a057618e7 Baseline bpf commit: 577e4432f3ac810049cb7e6b71f4d96ec7c6e894 Checkpoint bpf commit: 577e4432f3ac810049cb7e6b71f4d96ec7c6e894 Andrii Nakryiko (1): libbpf: fix return value for PERF_EVENT __arg_ctx type fix up check Toke Høiland-Jørgensen (1): libbpf: Use OPTS_SET() macro in bpf_xdp_query() src/libbpf.c \| 6 +++--- src/netlink.c \| 4 ++-- 2 files changed, 5 insertions(+), 5 deletions(-) Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-02-06 11:52:00 -08:00
Toke Høiland-Jørgensen	a20b60f971	libbpf: Use OPTS_SET() macro in bpf_xdp_query() When the feature_flags and xdp_zc_max_segs fields were added to the libbpf bpf_xdp_query_opts, the code writing them did not use the OPTS_SET() macro. This causes libbpf to write to those fields unconditionally, which means that programs compiled against an older version of libbpf (with a smaller size of the bpf_xdp_query_opts struct) will have its stack corrupted by libbpf writing out of bounds. The patch adding the feature_flags field has an early bail out if the feature_flags field is not part of the opts struct (via the OPTS_HAS) macro, but the patch adding xdp_zc_max_segs does not. For consistency, this fix just changes the assignments to both fields to use the OPTS_SET() macro. Fixes: 13ce2daa259a ("xsk: add new netlink attribute dedicated for ZC max frags") Signed-off-by: Toke Høiland-Jørgensen <toke@redhat.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20240206125922.1992815-1-toke@redhat.com	2024-02-06 11:52:00 -08:00
Andrii Nakryiko	b24a6277cc	libbpf: fix return value for PERF_EVENT __arg_ctx type fix up check If PERF_EVENT program has __arg_ctx argument with matching architecture-specific pt_regs/user_pt_regs/user_regs_struct pointer type, libbpf should still perform type rewrite for old kernels, but not emit the warning. Fix copy/paste from kernel code where 0 is meant to signify "no error" condition. For libbpf we need to return "true" to proceed with type rewrite (which for PERF_EVENT program will be a canonical `struct bpf_perf_event_data *` type). Fixes: 9eea8fafe33e ("libbpf: fix __arg_ctx type enforcement for perf_event programs") Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/r/20240206002243.1439450-1-andrii@kernel.org Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2024-02-06 11:52:00 -08:00
Andrii Nakryiko	25fe467af4	ci: allowlist tests validating libbpf's __arg_ctx type rewrite logic Allowlist test_global_funcs/arg_tag_ctx* and a few of verifier_global_subprogs subtests that validate libbpf's logic for rewriting __arg_ctx globl subprog argument types on kernels that don't natively support __arg_ctx. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-02-06 10:17:28 -08:00
Andrii Nakryiko	f11758a780	sync: latest libbpf changes from kernel Syncing latest libbpf commits from kernel repository. Baseline bpf-next commit: ced33f2cfa21a14a292a00e31dc9f85c1bfbda1c Checkpoint bpf-next commit: 943b043aeecce9accb6d367af47791c633e95e4d Baseline bpf commit: 577e4432f3ac810049cb7e6b71f4d96ec7c6e894 Checkpoint bpf commit: 577e4432f3ac810049cb7e6b71f4d96ec7c6e894 Andrii Nakryiko (8): libbpf: integrate __arg_ctx feature detector into kernel_supports() libbpf: fix __arg_ctx type enforcement for perf_event programs libbpf: add __arg_trusted and __arg_nullable tag macros libbpf: add bpf_core_cast() macro libbpf: Call memfd_create() syscall directly libbpf: Add missing LIBBPF_API annotation to libbpf_set_memlock_rlim API libbpf: Add btf__new_split() API that was declared but not implemented libbpf: Add missed btf_ext__raw_data() API Eduard Zingerman (1): libbpf: Remove unnecessary null check in kernel_supports() Ian Rogers (1): libbpf: Add some details for BTF parsing failures src/bpf.h \| 2 +- src/bpf_core_read.h \| 13 ++++++ src/bpf_helpers.h \| 2 + src/btf.c \| 33 ++++++++++++--- src/features.c \| 58 +++++++++++++++++++++++++ src/libbpf.c \| 99 ++++++++++++++----------------------------- src/libbpf.map \| 5 ++- src/libbpf_internal.h \| 2 + src/linker.c \| 2 +- 9 files changed, 140 insertions(+), 76 deletions(-) Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-02-01 15:10:17 -08:00
Andrii Nakryiko	cbb8ba352d	sync: auto-generate latest BPF helpers Latest changes to BPF helper definitions. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-02-01 15:10:17 -08:00
Andrii Nakryiko	95b4beb502	libbpf: Add missed btf_ext__raw_data() API Another API that was declared in libbpf.map but actual implementation was missing. btf_ext__get_raw_data() was intended as a discouraged alias to consistently-named btf_ext__raw_data(), so make this an actuality. Fixes: 20eccf29e297 ("libbpf: hide and discourage inconsistently named getters") Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Eduard Zingerman <eddyz87@gmail.com> Link: https://lore.kernel.org/bpf/20240201172027.604869-5-andrii@kernel.org	2024-02-01 15:10:17 -08:00
Andrii Nakryiko	5b7613e50f	libbpf: Add btf__new_split() API that was declared but not implemented Seems like original commit adding split BTF support intended to add btf__new_split() API, and even declared it in libbpf.map, but never added (trivial) implementation. Fix this. Fixes: ba451366bf44 ("libbpf: Implement basic split BTF support") Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Eduard Zingerman <eddyz87@gmail.com> Link: https://lore.kernel.org/bpf/20240201172027.604869-4-andrii@kernel.org	2024-02-01 15:10:17 -08:00
Andrii Nakryiko	245394fb36	libbpf: Add missing LIBBPF_API annotation to libbpf_set_memlock_rlim API LIBBPF_API annotation seems missing on libbpf_set_memlock_rlim API, so add it to make this API callable from libbpf's shared library version. Fixes: e542f2c4cd16 ("libbpf: Auto-bump RLIMIT_MEMLOCK if kernel needs it for BPF") Fixes: ab9a5a05dc48 ("libbpf: fix up few libbpf.map problems") Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Eduard Zingerman <eddyz87@gmail.com> Link: https://lore.kernel.org/bpf/20240201172027.604869-3-andrii@kernel.org	2024-02-01 15:10:17 -08:00
Andrii Nakryiko	3b19b1bb55	libbpf: Call memfd_create() syscall directly Some versions of Android do not implement memfd_create() wrapper in their libc implementation, leading to build failures ([0]). On the other hand, memfd_create() is available as a syscall on quite old kernels (3.17+, while bpf() syscall itself is available since 3.18+), so it is ok to assume that syscall availability and call into it with syscall() helper to avoid Android-specific workarounds. Validated in libbpf-bootstrap's CI ([1]). [0] https://github.com/libbpf/libbpf-bootstrap/actions/runs/7701003207/job/20986080319#step:5:83 [1] https://github.com/libbpf/libbpf-bootstrap/actions/runs/7715988887/job/21031767212?pr=253 Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Eduard Zingerman <eddyz87@gmail.com> Link: https://lore.kernel.org/bpf/20240201172027.604869-2-andrii@kernel.org	2024-02-01 15:10:17 -08:00
Eduard Zingerman	7529e0c4c7	libbpf: Remove unnecessary null check in kernel_supports() After recent changes, Coverity complained about inconsistent null checks in kernel_supports() function: kernel_supports(const struct bpf_object *obj, ...) [...] // var_compare_op: Comparing obj to null implies that obj might be null if (obj && obj->gen_loader) return true; // var_deref_op: Dereferencing null pointer obj if (obj->token_fd) return feat_supported(obj->feat_cache, feat_id); [...] - The original null check was introduced by commit [0], which introduced a call `kernel_supports(NULL, ...)` in function bump_rlimit_memlock(); - This call was refactored to use `feat_supported(NULL, ...)` in commit [1]. Looking at all places where kernel_supports() is called: - There is either `obj->...` access before the call; - Or `obj` comes from `prog->obj` expression, where `prog` comes from enumeration of programs in `obj`; - Or `obj` comes from `prog->obj`, where `prog` is a parameter to one of the API functions: - bpf_program__attach_kprobe_opts; - bpf_program__attach_kprobe; - bpf_program__attach_ksyscall. Assuming correct API usage, it appears that `obj` can never be null when passed to kernel_supports(). Silence the Coverity warning by removing redundant null check. [0] e542f2c4cd16 ("libbpf: Auto-bump RLIMIT_MEMLOCK if kernel needs it for BPF") [1] d6dd1d49367a ("libbpf: Further decouple feature checking logic from bpf_object") Signed-off-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Link: https://lore.kernel.org/bpf/20240131212615.20112-1-eddyz87@gmail.com	2024-02-01 15:10:17 -08:00
Andrii Nakryiko	688879fb01	libbpf: add bpf_core_cast() macro Add bpf_core_cast() macro that wraps bpf_rdonly_cast() kfunc. It's more ergonomic than kfunc, as it automatically extracts btf_id with bpf_core_type_id_kernel(), and works with type names. It also casts result to (T ) pointer. See the definition of the macro, it's self-explanatory. libbpf declares bpf_rdonly_cast() extern as __weak __ksym and should be safe to not conflict with other possible declarations in user code. But we do have a conflict with current BPF selftests that declare their externs with first argument as `void obj`, while libbpf opts into more permissive `const void *obj`. This causes conflict, so we fix up BPF selftests uses in the same patch. Acked-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/r/20240130212023.183765-2-andrii@kernel.org Signed-off-by: Martin KaFai Lau <martin.lau@kernel.org>	2024-02-01 15:10:17 -08:00
Andrii Nakryiko	0303e25be3	libbpf: add __arg_trusted and __arg_nullable tag macros Add __arg_trusted to annotate global func args that accept trusted PTR_TO_BTF_ID arguments. Also add __arg_nullable to combine with __arg_trusted (and maybe other tags in the future) to force global subprog itself (i.e., callee) to do NULL checks, as opposed to default non-NULL semantics (and thus caller's responsibility to ensure non-NULL values). Acked-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/r/20240130000648.2144827-4-andrii@kernel.org Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2024-02-01 15:10:17 -08:00
Ian Rogers	9b306ac9be	libbpf: Add some details for BTF parsing failures As CONFIG_DEBUG_INFO_BTF is default off the existing "failed to find valid kernel BTF" message makes diagnosing the kernel build issue somewhat cryptic. Add a little more detail with the hope of helping users. Before: ``` libbpf: failed to find valid kernel BTF libbpf: Error loading vmlinux BTF: -3 ``` After not accessible: ``` libbpf: kernel BTF is missing at '/sys/kernel/btf/vmlinux', was CONFIG_DEBUG_INFO_BTF enabled? libbpf: failed to find valid kernel BTF libbpf: Error loading vmlinux BTF: -3 ``` After not readable: ``` libbpf: failed to read kernel BTF from (/sys/kernel/btf/vmlinux): -1 ``` Closes: https://lore.kernel.org/bpf/CAP-5=fU+DN_+Y=Y4gtELUsJxKNDDCOvJzPHvjUVaUoeFAzNnig@mail.gmail.com/ Signed-off-by: Ian Rogers <irogers@google.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20240125231840.1647951-1-irogers@google.com	2024-02-01 15:10:17 -08:00
Andrii Nakryiko	c57fb75864	libbpf: fix __arg_ctx type enforcement for perf_event programs Adjust PERF_EVENT type enforcement around __arg_ctx to match exactly what kernel is doing. Fixes: 76ec90a996e3 ("libbpf: warn on unexpected __arg_ctx type when rewriting BTF") Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/r/20240125205510.3642094-3-andrii@kernel.org Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2024-02-01 15:10:17 -08:00
Andrii Nakryiko	0b412d1918	libbpf: integrate __arg_ctx feature detector into kernel_supports() Now that feature detection code is in bpf-next tree, integrate __arg_ctx kernel-side support into kernel_supports() framework. Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/r/20240125205510.3642094-2-andrii@kernel.org Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2024-02-01 15:10:17 -08:00
Andrii Nakryiko	3b09738928	sync: remove NETDEV_XSK_FLAGS_MASK which is not in bpf/bpf-next anymore This part of code is not present in either bpf or bpf-next trees anymore, so manually remove it. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-01-29 10:48:12 -08:00
Andrii Nakryiko	5139f12ef1	sync: latest libbpf changes from kernel Syncing latest libbpf commits from kernel repository. Baseline bpf-next commit: c8632acf193beac64bbdaebef013368c480bf74f Checkpoint bpf-next commit: ced33f2cfa21a14a292a00e31dc9f85c1bfbda1c Baseline bpf commit: 0a5bd0ffe790511d802e7f40898429a89e2487df Checkpoint bpf commit: 577e4432f3ac810049cb7e6b71f4d96ec7c6e894 Andrii Nakryiko (1): libbpf: Fix faccessat() usage on Android src/libbpf_internal.h \| 14 ++++++++++++++ 1 file changed, 14 insertions(+) Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-01-29 10:48:12 -08:00
Andrii Nakryiko	830e0d017b	libbpf: Fix faccessat() usage on Android Android implementation of libc errors out with -EINVAL in faccessat() if passed AT_EACCESS ([0]), this leads to ridiculous issue with libbpf refusing to load /sys/kernel/btf/vmlinux on Androids ([1]). Fix by detecting Android and redefining AT_EACCESS to 0, it's equivalent on Android. [0] https://android.googlesource.com/platform/bionic/+/refs/heads/android13-release/libc/bionic/faccessat.cpp#50 [1] https://github.com/libbpf/libbpf-bootstrap/issues/250#issuecomment-1911324250 Fixes: 6a4ab8869d0b ("libbpf: Fix the case of running as non-root with capabilities") Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Jiri Olsa <jolsa@kernel.org> Link: https://lore.kernel.org/bpf/20240126220944.2497665-1-andrii@kernel.org	2024-01-29 10:48:12 -08:00
Andrii Nakryiko	fad5d91381	libbpf: make sure linux/kernel.h includes linux/compiler.h This replicates kernel upstream setup and brings READ_ONCE() and WRITE_ONCE() macros anywhere where linux/kernel.h is included, which is assumption libbpf code makes. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-01-26 18:12:29 -05:00
Andrii Nakryiko	8ca30626cc	Makefile: add features.o to Makefile Libbpf got new source code file, features.c, we need to add it to Makefile here on Github version as well. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-01-26 18:12:29 -05:00
Andrii Nakryiko	274d6037f8	libbpf: add BPF_CALL_REL() macro implementation Add BPF_CALL_REL() macro implementation into include/linux/filter.h header, which is now used by libbpf code for feature detection. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-01-26 18:12:29 -05:00
Andrii Nakryiko	0f84f3bef6	ci: regenerate vmlinux.h Update vmlinux.h for old kernel CI workflows. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-01-26 18:12:29 -05:00
Andrii Nakryiko	3dea2db84b	ci: drop custom patches for fixing upstream kernel issues All the issues should be fixed upstream already. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-01-26 18:12:29 -05:00
Andrii Nakryiko	2f81310ec0	sync: latest libbpf changes from kernel Syncing latest libbpf commits from kernel repository. Baseline bpf-next commit: 98e20e5e13d2811898921f999288be7151a11954 Checkpoint bpf-next commit: c8632acf193beac64bbdaebef013368c480bf74f Baseline bpf commit: 7c5e046bdcb2513f9decb3765d8bf92d604279cf Checkpoint bpf commit: 0a5bd0ffe790511d802e7f40898429a89e2487df Andrey Grafin (1): libbpf: Apply map_set_def_max_entries() for inner_maps on creation Andrii Nakryiko (17): libbpf: feature-detect arg:ctx tag support in kernel libbpf: warn on unexpected __arg_ctx type when rewriting BTF libbpf: call dup2() syscall directly bpf: Introduce BPF token object bpf: Add BPF token support to BPF_MAP_CREATE command bpf: Add BPF token support to BPF_BTF_LOAD command bpf: Add BPF token support to BPF_PROG_LOAD command libbpf: Add bpf_token_create() API libbpf: Add BPF token support to bpf_map_create() API libbpf: Add BPF token support to bpf_btf_load() API libbpf: Add BPF token support to bpf_prog_load() API libbpf: Split feature detectors definitions from cached results libbpf: Further decouple feature checking logic from bpf_object libbpf: Move feature detection code into its own file libbpf: Wire up token_fd into feature probing logic libbpf: Wire up BPF token support at BPF object level libbpf: Support BPF token path setting through LIBBPF_BPF_TOKEN_PATH envvar Daniel Borkmann (1): bpf: Sync uapi bpf.h header for the tooling infra Dima Tisnek (1): libbpf: Correct bpf_core_read.h comment wrt bpf_core_relo struct Jiri Olsa (2): bpf: Add cookie to perf_event bpf_link_info records bpf: Store cookies in kprobe_multi bpf_link_info data Kan Liang (2): perf: Add branch stack counters perf/x86/intel: Support branch counters logging Kui-Feng Lee (3): bpf: pass btf object id in bpf_map_info. bpf: pass attached BTF to the bpf_struct_ops subsystem libbpf: Find correct module BTFs for struct_ops maps and progs. Martin KaFai Lau (1): libbpf: Ensure undefined bpf_attr field stays 0 include/uapi/linux/bpf.h \| 79 +++- include/uapi/linux/perf_event.h \| 13 + src/bpf.c \| 42 +- src/bpf.h \| 38 +- src/bpf_core_read.h \| 2 +- src/btf.c \| 10 +- src/elf.c \| 2 - src/features.c \| 503 +++++++++++++++++++++ src/libbpf.c \| 744 ++++++++++++-------------------- src/libbpf.h \| 21 +- src/libbpf.map \| 1 + src/libbpf_internal.h \| 50 ++- src/libbpf_probes.c \| 12 +- src/str_error.h \| 3 + 14 files changed, 1019 insertions(+), 501 deletions(-) create mode 100644 src/features.c Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-01-26 18:12:29 -05:00
Andrii Nakryiko	0e57fade4e	sync: auto-generate latest BPF helpers Latest changes to BPF helper definitions. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2024-01-26 18:12:29 -05:00
Andrii Nakryiko	a36646e2b3	libbpf: Support BPF token path setting through LIBBPF_BPF_TOKEN_PATH envvar To allow external admin authority to override default BPF FS location (/sys/fs/bpf) for implicit BPF token creation, teach libbpf to recognize LIBBPF_BPF_TOKEN_PATH envvar. If it is specified and user application didn't explicitly specify bpf_token_path option, it will be treated exactly like bpf_token_path option, overriding default /sys/fs/bpf location and making BPF token mandatory. Suggested-by: Alexei Starovoitov <ast@kernel.org> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Alexei Starovoitov <ast@kernel.org> Link: https://lore.kernel.org/bpf/20240124022127.2379740-29-andrii@kernel.org	2024-01-26 18:12:29 -05:00
Andrii Nakryiko	e1a43809a9	libbpf: Wire up BPF token support at BPF object level Add BPF token support to BPF object-level functionality. BPF token is supported by BPF object logic either as an explicitly provided BPF token from outside (through BPF FS path), or implicitly (unless prevented through bpf_object_open_opts). Implicit mode is assumed to be the most common one for user namespaced unprivileged workloads. The assumption is that privileged container manager sets up default BPF FS mount point at /sys/fs/bpf with BPF token delegation options (delegate_{cmds,maps,progs,attachs} mount options). BPF object during loading will attempt to create BPF token from /sys/fs/bpf location, and pass it for all relevant operations (currently, map creation, BTF load, and program load). In this implicit mode, if BPF token creation fails due to whatever reason (BPF FS is not mounted, or kernel doesn't support BPF token, etc), this is not considered an error. BPF object loading sequence will proceed with no BPF token. In explicit BPF token mode, user provides explicitly custom BPF FS mount point path. In such case, BPF object will attempt to create BPF token from provided BPF FS location. If BPF token creation fails, that is considered a critical error and BPF object load fails with an error. Libbpf provides a way to disable implicit BPF token creation, if it causes any troubles (BPF token is designed to be completely optional and shouldn't cause any problems even if provided, but in the world of BPF LSM, custom security logic can be installed that might change outcome depending on the presence of BPF token). To disable libbpf's default BPF token creation behavior user should provide either invalid BPF token FD (negative), or empty bpf_token_path option. BPF token presence can influence libbpf's feature probing, so if BPF object has associated BPF token, feature probing is instructed to use BPF object-specific feature detection cache and token FD. Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Alexei Starovoitov <ast@kernel.org> Link: https://lore.kernel.org/bpf/20240124022127.2379740-26-andrii@kernel.org	2024-01-26 18:12:29 -05:00

1 2 3 4 5 ...

2341 Commits