libbpf

mirror of https://github.com/netdata/libbpf.git synced 2026-05-09 00:49:10 +08:00

Author	SHA1	Message	Date
Xin Liu	8d719b0c08	libbpf: Optimized return value in libbpf_strerror when errno is libbpf errno This is a small improvement in libbpf_strerror. When libbpf_strerror is used to obtain the system error description, if the length of the buf is insufficient, libbpf_strerror returns ERANGE and sets errno to ERANGE. However, this processing is not performed when the error code customized by libbpf is obtained. Make some minor improvements here, return -ERANGE and set errno to ERANGE when buf is not enough for custom description. Signed-off-by: Xin Liu <liuxin350@huawei.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Link: https://lore.kernel.org/bpf/20221210082045.233697-1-liuxin350@huawei.com	2022-12-14 22:09:00 -08:00
Kumar Kartikeya Dwivedi	6b90604fa7	bpf: Rework process_dynptr_func Recently, user ringbuf support introduced a PTR_TO_DYNPTR register type for use in callback state, because in case of user ringbuf helpers, there is no dynptr on the stack that is passed into the callback. To reflect such a state, a special register type was created. However, some checks have been bypassed incorrectly during the addition of this feature. First, for arg_type with MEM_UNINIT flag which initialize a dynptr, they must be rejected for such register type. Secondly, in the future, there are plans to add dynptr helpers that operate on the dynptr itself and may change its offset and other properties. In all of these cases, PTR_TO_DYNPTR shouldn't be allowed to be passed to such helpers, however the current code simply returns 0. The rejection for helpers that release the dynptr is already handled. For fixing this, we take a step back and rework existing code in a way that will allow fitting in all classes of helpers and have a coherent model for dealing with the variety of use cases in which dynptr is used. First, for ARG_PTR_TO_DYNPTR, it can either be set alone or together with a DYNPTR_TYPE_* constant that denotes the only type it accepts. Next, helpers which initialize a dynptr use MEM_UNINIT to indicate this fact. To make the distinction clear, use MEM_RDONLY flag to indicate that the helper only operates on the memory pointed to by the dynptr, not the dynptr itself. In C parlance, it would be equivalent to taking the dynptr as a pointer to const argument. When either of these flags are not present, the helper is allowed to mutate both the dynptr itself and also the memory it points to. Currently, the read only status of the memory is not tracked in the dynptr, but it would be trivial to add this support inside dynptr state of the register. 
With these changes and renaming PTR_TO_DYNPTR to CONST_PTR_TO_DYNPTR to better reflect its usage, it can no longer be passed to helpers that initialize a dynptr, i.e. bpf_dynptr_from_mem, bpf_ringbuf_reserve_dynptr. A note to reviewers is that in code that does mark_stack_slots_dynptr, and unmark_stack_slots_dynptr, we implicitly rely on the fact that PTR_TO_STACK reg is the only case that can reach that code path, as one cannot pass CONST_PTR_TO_DYNPTR to helpers that don't set MEM_RDONLY. In both cases such helpers won't be setting that flag. The next patch will add a couple of selftest cases to make sure this doesn't break. Fixes: 205715673844 ("bpf: Add bpf_user_ringbuf_drain() helper") Acked-by: Joanne Koong <joannelkoong@gmail.com> Signed-off-by: Kumar Kartikeya Dwivedi <memxor@gmail.com> Link: https://lore.kernel.org/r/20221207204141.308952-4-memxor@gmail.com Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-12-14 22:09:00 -08:00
Timo Hunziker	74244c5bd7	libbpf: Parse usdt args without offset on x86 (e.g. 8@(%rsp)) Parse USDT arguments like "8@(%rsp)" on x86. These are emitted by SystemTap. The argument syntax is similar to the existing "memory dereference case" but with the offset left out as it's zero (i.e. read the value from the address in the register). We treat it the same as the "memory dereference case", but set the offset to 0. I've tested that this fixes the "unrecognized arg #N spec: 8@(%rsp).." error I've run into when attaching to a probe with such an argument. Attaching and reading the correct argument values works. Something similar might be needed for the other supported architectures. [0] Closes: https://github.com/libbpf/libbpf/issues/559 Signed-off-by: Timo Hunziker <timo.hunziker@gmx.ch> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20221203123746.2160-1-timo.hunziker@eclipso.ch	2022-12-14 22:09:00 -08:00
Eyal Birger	da08611c65	tools: add IFLA_XFRM_COLLECT_METADATA to uapi/linux/if_link.h Needed for XFRM metadata tests. Signed-off-by: Eyal Birger <eyal.birger@gmail.com> Link: https://lore.kernel.org/r/20221203084659.1837829-4-eyal.birger@gmail.com Signed-off-by: Martin KaFai Lau <martin.lau@kernel.org>	2022-12-14 22:09:00 -08:00
Andrii Nakryiko	1e479aec4f	ci: don't run test_maps in libbpf CI It crashes often, it doesn't really test libbpf much. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-12-07 09:28:07 -08:00
Andrii Nakryiko	8846dc7a20	ci: fix Ubuntu version for kernel tests and pahole workflows Having too new build environment in workflows that build selftests on the host, but run them in a separate QEMU image can lead to problems with runtime linker complaining about missing new enough version of glibc and other dependencies. Until we update images, fix used Ubuntu version to ubuntu-20.04 to mitigate. Suggested-by: Manu Bretelle <chantr4@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-12-05 11:52:11 -08:00
Andrii Nakryiko	eb9b5c567d	sync: regenerate vmlinux.h Update checked in vmlinux.h for 5.5 and 4.9 kernels. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-12-02 22:12:29 -08:00
Andrii Nakryiko	be8f15bb93	sync: latest libbpf changes from kernel Syncing latest libbpf commits from kernel repository. Baseline bpf-next commit: 5b1d640800de7fe02d68bf592d9d101de24c87f2 Checkpoint bpf-next commit: 706819495921ddad6b3780140b9d9e9293b6dedc Baseline bpf commit: 47df8a2f78bc34ff170d147d05b121f84e252b85 Checkpoint bpf commit: e931a173a685fe213127ae5aa6b7f2196c1d875d Alexei Starovoitov (1): selftests/bpf: Workaround for llvm nop-4 bug Andrii Nakryiko (2): libbpf: Ignore hashmap__find() result explicitly in btf_dump libbpf: Avoid enum forward-declarations in public API in C++ mode Donald Hunter (1): docs/bpf: Add table of BPF program types to libbpf docs Hou Tao (4): libbpf: Use page size as max_entries when probing ring buffer map libbpf: Handle size overflow for ringbuf mmap libbpf: Handle size overflow for user ringbuf mmap libbpf: Check the validity of size in user_ring_buffer__reserve() Ji Rongfeng (1): bpf: Update bpf_{g,s}etsockopt() documentation docs/index.rst \| 3 + docs/program_types.rst \| 203 +++++++++++++++++++++++++++++++++++++++ include/uapi/linux/bpf.h \| 23 +++-- src/bpf.h \| 7 ++ src/btf_dump.c \| 2 +- src/libbpf.c \| 3 +- src/libbpf_probes.c \| 2 +- src/ringbuf.c \| 26 +++-- 8 files changed, 250 insertions(+), 19 deletions(-) create mode 100644 docs/program_types.rst -- 2.30.2 Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-12-02 22:12:29 -08:00
Andrii Nakryiko	2bf5ed3a48	sync: auto-generate latest BPF helpers Latest changes to BPF helper definitions. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-12-02 22:12:29 -08:00
Andrii Nakryiko	0fbf777e0b	libbpf: Avoid enum forward-declarations in public API in C++ mode C++ enum forward declarations are fundamentally not compatible with pure C enum definitions, and so libbpf's use of `enum bpf_stats_type;` forward declaration in libbpf/bpf.h public API header is causing C++ compilation issues. More details can be found in [0], but it comes down to C++ supporting enum forward declaration only with explicitly specified backing type: enum bpf_stats_type: int; In C (and I believe it's a GCC extension also), such forward declaration is simply: enum bpf_stats_type; Further, in Linux UAPI this enum is defined in pure C way: enum bpf_stats_type { BPF_STATS_RUN_TIME = 0; } And even though in both cases backing type is int, which can be confirmed by looking at DWARF information, for C++ compiler actual enum definition and forward declaration are incompatible. To eliminate this problem, for C++ mode define input argument as int, which makes enum unnecessary in libbpf public header. This solves the issue and as demonstrated by next patch doesn't cause any unwanted compiler warnings, at least with default warnings setting. [0] https://stackoverflow.com/questions/42766839/c11-enum-forward-causes-underlying-type-mismatch [1] Closes: https://github.com/libbpf/libbpf/issues/249 Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Link: https://lore.kernel.org/bpf/20221130200013.2997831-1-andrii@kernel.org	2022-12-02 22:12:29 -08:00
Hou Tao	4d21c979ce	libbpf: Check the validity of size in user_ring_buffer__reserve() The top two bits of size are used as busy and discard flags, so reject the reservation that has any of these special bits in the size. With the addition of validity check, there is also no need to check whether or not total_size is overflowed. Signed-off-by: Hou Tao <houtao1@huawei.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20221116072351.1168938-5-houtao@huaweicloud.com	2022-12-02 22:12:29 -08:00
Hou Tao	11ad834557	libbpf: Handle size overflow for user ringbuf mmap Similar with the overflow problem on ringbuf mmap, in user_ringbuf_map() 2 * max_entries may overflow u32 when mapping writeable region. Fixing it by casting the size of writable mmap region into a __u64 and checking whether or not there will be overflow during mmap. Fixes: b66ccae01f1d ("bpf: Add libbpf logic for user-space ring buffer") Signed-off-by: Hou Tao <houtao1@huawei.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20221116072351.1168938-4-houtao@huaweicloud.com	2022-12-02 22:12:29 -08:00
Hou Tao	f056d1bd54	libbpf: Handle size overflow for ringbuf mmap The maximum size of ringbuf is 2GB on x86-64 host, so 2 * max_entries will overflow u32 when mapping producer page and data pages. Only casting max_entries to size_t is not enough, because for 32-bits application on 64-bits kernel the size of read-only mmap region also could overflow size_t. So fixing it by casting the size of read-only mmap region into a __u64 and checking whether or not there will be overflow during mmap. Fixes: bf99c936f947 ("libbpf: Add BPF ring buffer support") Signed-off-by: Hou Tao <houtao1@huawei.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20221116072351.1168938-3-houtao@huaweicloud.com	2022-12-02 22:12:29 -08:00
Hou Tao	b822a139e3	libbpf: Use page size as max_entries when probing ring buffer map Using page size as max_entries when probing ring buffer map, else the probe may fail on host with 64KB page size (e.g., an ARM64 host). After the fix, the output of "bpftool feature" on above host will be correct. Before : eBPF map_type ringbuf is NOT available eBPF map_type user_ringbuf is NOT available After : eBPF map_type ringbuf is available eBPF map_type user_ringbuf is available Signed-off-by: Hou Tao <houtao1@huawei.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20221116072351.1168938-2-houtao@huaweicloud.com	2022-12-02 22:12:29 -08:00
Ji Rongfeng	a5b4a53781	bpf: Update bpf_{g,s}etsockopt() documentation * append missing optnames to the end * simplify bpf_getsockopt()'s doc Signed-off-by: Ji Rongfeng <SikoJobs@outlook.com> Link: https://lore.kernel.org/r/DU0P192MB15479B86200B1216EC90E162D6099@DU0P192MB1547.EURP192.PROD.OUTLOOK.COM Signed-off-by: Martin KaFai Lau <martin.lau@kernel.org>	2022-12-02 22:12:29 -08:00
Donald Hunter	e84419ff5a	docs/bpf: Add table of BPF program types to libbpf docs Extend the libbpf documentation with a table of program types, attach points and ELF section names. Signed-off-by: Donald Hunter <donald.hunter@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Reviewed-by: Bagas Sanjaya <bagasdotme@gmail.com> Acked-by: David Vernet <void@manifault.com> Link: https://lore.kernel.org/bpf/20221121121734.98329-1-donald.hunter@gmail.com	2022-12-02 22:12:29 -08:00
Alexei Starovoitov	ca515c0dda	selftests/bpf: Workaround for llvm nop-4 bug Currently LLVM fails to recognize .data.* as data section and defaults to .text section. Later BPF backend tries to emit 4-byte NOP instruction which doesn't exist in BPF ISA and aborts. The fix for LLVM is pending: https://reviews.llvm.org/D138477 While waiting for the fix let's work around the linked_list test case by using .bss.* prefix which is properly recognized by LLVM as BSS section. Fix libbpf to support .bss. prefix and adjust tests. Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-12-02 22:12:29 -08:00
Andrii Nakryiko	95959419a7	libbpf: Ignore hashmap__find() result explicitly in btf_dump Coverity is reporting that btf_dump_name_dups() doesn't check return result of hashmap__find() call. This is intentional, so make it explicit with (void) cast. Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Link: https://lore.kernel.org/bpf/20221117192824.4093553-1-andrii@kernel.org	2022-12-02 22:12:29 -08:00
Andrii Nakryiko	3c659715ec	sync: fix sync scripts commit_signature function After recent lint changes, commit_signature() function now gets optional array of paths as multiple arguments, instead of entire array as second argument. So adjust commit_signature() to handle this correctly. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-12-02 21:04:03 -08:00
Andrii Nakryiko	f46b17ef0e	sync: add Signed-off-by for auto-generated sync commits Now that we enforce Signed-off-by on every commit, make sure that auto-generated sync commits also get corrected Signed-off-by tags. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-12-02 20:51:21 -08:00
Evgeny Vereshchagin	1596a09b5d	oss-fuzz: bump elfutils to make it less likely for the libbpf fuzz target to run into elfutils bugs that have been fixed upstream since two new fuzz targets were added there back in April. Signed-off-by: Evgeny Vereshchagin <evvers@ya.ru>	2022-11-18 13:54:40 -08:00
Kui-Feng Lee	5322b8e76c	sync: latest libbpf changes from kernel Syncing latest libbpf commits from kernel repository. Baseline bpf-next commit: b548b17a93fd18357a5a6f535c10c1e68719ad32 Checkpoint bpf-next commit: 5b1d640800de7fe02d68bf592d9d101de24c87f2 Baseline bpf commit: 9cbd48d5fa14e4c65f8580de16686077f7cea02b Checkpoint bpf commit: 47df8a2f78bc34ff170d147d05b121f84e252b85 David Michael (1): libbpf: Fix uninitialized warning in btf_dump_dump_type_data Jiri Olsa (1): libbpf: Use correct return pointer in attach_raw_tp Kang Minchul (3): libbpf: checkpatch: Fixed code alignments in btf.c libbpf: Fixed various checkpatch issues in libbpf.c libbpf: checkpatch: Fixed code alignments in ringbuf.c Kumar Kartikeya Dwivedi (1): bpf: Support bpf_list_head in map values include/uapi/linux/bpf.h \| 10 +++++++++ src/btf.c \| 5 +++-- src/btf_dump.c \| 2 +- src/libbpf.c \| 47 +++++++++++++++++++++++++--------------- src/ringbuf.c \| 4 ++-- 5 files changed, 45 insertions(+), 23 deletions(-) -- 2.30.2	2022-11-18 13:53:39 -08:00
Kui-Feng Lee	15bbaabed8	sync: auto-generate latest BPF helpers Latest changes to BPF helper definitions.	2022-11-18 13:53:39 -08:00
Jiri Olsa	eb77c7210b	libbpf: Use correct return pointer in attach_raw_tp We need to pass '*link' to final libbpf_get_error, because that one holds the return value, not 'link'. Fixes: 4fa5bcfe07f7 ("libbpf: Allow BPF program auto-attach handlers to bail out") Signed-off-by: Jiri Olsa <jolsa@kernel.org> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20221114145257.882322-1-jolsa@kernel.org	2022-11-18 13:53:39 -08:00
Kumar Kartikeya Dwivedi	2557efc8e1	bpf: Support bpf_list_head in map values Add the support on the map side to parse, recognize, verify, and build metadata table for a new special field of the type struct bpf_list_head. To parameterize the bpf_list_head for a certain value type and the list_node member it will accept in that value type, we use BTF declaration tags. The definition of bpf_list_head in a map value will be done as follows: struct foo { struct bpf_list_node node; int data; }; struct map_value { struct bpf_list_head head __contains(foo, node); }; Then, the bpf_list_head only allows adding to the list 'head' using the bpf_list_node 'node' for the type struct foo. The 'contains' annotation is a BTF declaration tag composed of four parts, "contains:name:node" where the name is then used to look up the type in the map BTF, with its kind hardcoded to BTF_KIND_STRUCT during the lookup. The node defines name of the member in this type that has the type struct bpf_list_node, which is actually used for linking into the linked list. For now, 'kind' part is hardcoded as struct. This allows building intrusive linked lists in BPF, using container_of to obtain pointer to entry, while being completely type safe from the perspective of the verifier. The verifier knows exactly the type of the nodes, and knows that list helpers return that type at some fixed offset where the bpf_list_node member used for this list exists. The verifier also uses this information to disallow adding types that are not accepted by a certain list. For now, no elements can be added to such lists. Support for that is coming in future patches, hence draining and freeing items is done with a TODO that will be resolved in a future patch. Note that the bpf_list_head_free function moves the list out to a local variable under the lock and releases it, doing the actual draining of the list items outside the lock. 
While this helps with not holding the lock for too long pessimizing other concurrent list operations, it is also necessary for deadlock prevention: unless every function called in the critical section would be notrace, a fentry/fexit program could attach and call bpf_map_update_elem again on the map, leading to the same lock being acquired if the key matches and lead to a deadlock. While this requires some special effort on part of the BPF programmer to trigger and is highly unlikely to occur in practice, it is always better if we can avoid such a condition. While notrace would prevent this, doing the draining outside the lock has advantages of its own, hence it is used to also fix the deadlock related problem. Signed-off-by: Kumar Kartikeya Dwivedi <memxor@gmail.com> Link: https://lore.kernel.org/r/20221114191547.1694267-5-memxor@gmail.com Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-11-18 13:53:39 -08:00
Kang Minchul	9781b9eced	libbpf: checkpatch: Fixed code alignments in ringbuf.c Fixed some checkpatch issues in ringbuf.c Signed-off-by: Kang Minchul <tegongkang@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Stanislav Fomichev <sdf@google.com> Link: https://lore.kernel.org/bpf/20221113190648.38556-4-tegongkang@gmail.com	2022-11-18 13:53:39 -08:00
Kang Minchul	4c3b53d09c	libbpf: Fixed various checkpatch issues in libbpf.c Fixed following checkpatch issues: WARNING: Block comments use a trailing */ on a separate line + other BPF program's BTF object */ WARNING: Possible repeated word: 'be' + name. This is important to be be able to find corresponding BTF ERROR: switch and case should be at the same indent + switch (ext->kcfg.sz) { + case 1: *(__u8 *)ext_val = value; break; + case 2: *(__u16 *)ext_val = value; break; + case 4: *(__u32 *)ext_val = value; break; + case 8: *(__u64 *)ext_val = value; break; + default: ERROR: trailing statements should be on next line + case 1: *(__u8 *)ext_val = value; break; ERROR: trailing statements should be on next line + case 2: *(__u16 *)ext_val = value; break; ERROR: trailing statements should be on next line + case 4: *(__u32 *)ext_val = value; break; ERROR: trailing statements should be on next line + case 8: *(__u64 *)ext_val = value; break; ERROR: code indent should use tabs where possible + }$ WARNING: please, no spaces at the start of a line + }$ WARNING: Block comments use a trailing */ on a separate line + for faster search */ ERROR: code indent should use tabs where possible +^I^I^I^I^I^I &ext->kcfg.is_signed);$ WARNING: braces {} are not necessary for single statement blocks + if (err) { + return err; + } ERROR: code indent should use tabs where possible +^I^I^I^I sizeof(obj->btf_modules), obj->btf_module_cnt + 1);$ Signed-off-by: Kang Minchul <tegongkang@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Stanislav Fomichev <sdf@google.com> Link: https://lore.kernel.org/bpf/20221113190648.38556-3-tegongkang@gmail.com	2022-11-18 13:53:39 -08:00
Kang Minchul	7b18ff1212	libbpf: checkpatch: Fixed code alignments in btf.c Fixed some checkpatch issues in btf.c Signed-off-by: Kang Minchul <tegongkang@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Stanislav Fomichev <sdf@google.com> Link: https://lore.kernel.org/bpf/20221113190648.38556-2-tegongkang@gmail.com	2022-11-18 13:53:39 -08:00
David Michael	c975797ebe	libbpf: Fix uninitialized warning in btf_dump_dump_type_data GCC 11.3.0 fails to compile btf_dump.c due to the following error, which seems to originate in btf_dump_struct_data where the returned value would be uninitialized if btf_vlen returns zero. btf_dump.c: In function ‘btf_dump_dump_type_data’: btf_dump.c:2363:12: error: ‘err’ may be used uninitialized in this function [-Werror=maybe-uninitialized] 2363 \| if (err < 0) \| ^ Fixes: 920d16af9b42 ("libbpf: BTF dumper support for typed data") Signed-off-by: David Michael <fedora.dm0@gmail.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Stanislav Fomichev <sdf@google.com> Acked-by: Alan Maguire <alan.maguire@oracle.com> Link: https://lore.kernel.org/bpf/87zgcu60hq.fsf@gmail.com	2022-11-18 13:53:39 -08:00
Manu Bretelle	9167308b4a	ci: remove s390x-self-hosted-builder from libbpf/libbpf Those were moved to libbpf/ci: https://github.com/libbpf/ci/tree/master/rootfs/s390x-self-hosted-builder Signed-off-by: Manu Bretelle <chantr4@gmail.com>	2022-11-16 13:58:37 -08:00
Manu Bretelle	7049d3a2ea	ci: Use `s390x` label to schedule workflows on s390x The runners are having their labels uniformized across architecture. z15 is being removed in favor of s390x. Signed-off-by: Manu Bretelle <chantr4@gmail.com>	2022-11-16 13:55:31 -08:00
Andrii Nakryiko	ea931ec6c5	ci: drop LGTM integration LGTM is deprecated, remove it. We have CodeQL now. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-11-16 12:17:40 -08:00
Andrii Nakryiko	3a73d6f865	readme: replace LGTM badge with CodeQL badge LGTM is going to be removed, CodeQL is supposed to be a replacement. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-11-16 12:17:40 -08:00
Andrii Nakryiko	7b0891ac6b	ci: build libbpf with more versions of clang and gcc Add few more versions of clang and gcc used to compile-test libbpf. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-11-16 12:16:17 -08:00
Andrii Nakryiko	c80f12f7f6	ci: fix Debian builds due to pkg-config dependency change Seems like we need pkgconfig dependency instead of pkg-config. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-11-16 11:25:17 -08:00
Andrii Nakryiko	3b6093fd43	sync: start syncing include/uapi/linux/fcntl.h UAPI header Libbpf relies on F_DUPFD_CLOEXEC constant coming from fcntl.h UAPI header, so we need to sync it along other UAPI headers. Also update sync script to keep doing this automatically going forward. Reported-by: Arnaldo Carvalho de Melo <acme@redhat.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-11-16 10:56:59 -08:00
Andrii Nakryiko	8d358ab948	sync: make LIBBPF_PATHS and LIBBPF_VIEW_PATHS into real array variables Use correct Bash syntax to define these two variables as arrays. Drop shellcheck opt-out for unquoted use of array. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-11-14 21:42:37 -08:00
Andrii Nakryiko	971ad8f8d0	sync: fix sync script's use of bash array variables Don't wrap LIBBPF_PATHS[@] and LIBBPF_VIEW_PATHS[@] in quotes when passing it to git commands. Not clear how it worked before, but something recently broke. Either git commands became stricter or something. But either way, we do want to pass each element of LIBBPF_PATHS or LIBBPF_VIEW_PATHS as separate command line arguments, so putting them in quotes doesn't make sense, as that makes them look like a single argument to git. So drop all the quotes around these arrays. The only place where it's still needed is in commit_signature call, as we do want to pass array as single arg ($2) and then internally we unfold it into multiple command line arguments. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-11-12 18:24:12 -08:00
Andrii Nakryiko	2ed27f9e63	ci: update vmlinux.h Update vmlinux.h to get latest enums for some of selftests. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-11-12 18:24:12 -08:00
Andrii Nakryiko	4bdbb7ea28	sync: latest libbpf changes from kernel Syncing latest libbpf commits from kernel repository. Baseline bpf-next commit: 62c69e89e81bfbdb9a87ae3e0599dcc6aacf786b Checkpoint bpf-next commit: b548b17a93fd18357a5a6f535c10c1e68719ad32 Baseline bpf commit: e7b09357453a99e6f9e74c39e9ca1363c22c0b96 Checkpoint bpf commit: 9cbd48d5fa14e4c65f8580de16686077f7cea02b Alan Maguire (1): libbpf: Btf dedup identical struct test needs check for nested structs/arrays Andrii Nakryiko (2): libbpf: clean up and refactor BTF fixup step libbpf: only add BPF_F_MMAPABLE flag for data maps with global vars Anshuman Khandual (4): perf: Add system error and not in transaction branch types perf: Extend branch type classification perf: Capture branch privilege information perf: Add PERF_BR_NEW_ARCH_[N] map for BRBE on arm64 platform Eduard Zingerman (4): libbpf: Resolve enum fwd as full enum64 and vice versa libbpf: Hashmap interface update to allow both long and void* keys/values libbpf: Resolve unambigous forward declarations libbpf: Hashmap.h update to fix build issues using LLVM14 Martin KaFai Lau (1): bpf: Add hwtstamp field for the sockops prog Namhyung Kim (1): perf: Kill __PERF_SAMPLE_CALLCHAIN_EARLY Ravi Bangoria (3): perf/mem: Introduce PERF_MEM_LVLNUM_{EXTN_MEM\|IO} perf/uapi: Define PERF_MEM_SNOOPX_PEER in kernel header file perf/mem: Rename PERF_MEM_LVLNUM_EXTN_MEM to PERF_MEM_LVLNUM_CXL Sandipan Das (1): perf/core: Add speculation info to branch entries Xu Kuohai (1): libbpf: Avoid allocating reg_name with sscanf in parse_usdt_arg() Yonghong Song (2): bpf: Implement cgroup storage available to non-cgroup-attached bpf progs libbpf: Support new cgroup local storage include/uapi/linux/bpf.h \| 51 +++++- include/uapi/linux/perf_event.h \| 57 ++++++- src/btf.c \| 267 ++++++++++++++++++++++---------- src/btf_dump.c \| 15 +- src/hashmap.c \| 18 +-- src/hashmap.h \| 91 +++++++---- src/libbpf.c \| 196 ++++++++++++++--------- src/libbpf_probes.c \| 1 + src/strset.c 
\| 18 +-- src/usdt.c \| 44 +++--- 10 files changed, 511 insertions(+), 247 deletions(-) -- 2.30.2	2022-11-12 18:24:12 -08:00
Andrii Nakryiko	4978cf9cd8	sync: auto-generate latest BPF helpers Latest changes to BPF helper definitions.	2022-11-12 18:24:12 -08:00
Martin KaFai Lau	00fc9f407c	bpf: Add hwtstamp field for the sockops prog The bpf-tc prog has already been able to access the skb_hwtstamps(skb)->hwtstamp. This patch extends the same hwtstamp access to the sockops prog. In sockops, the skb is also available to the bpf prog during the BPF_SOCK_OPS_PARSE_HDR_OPT_CB event. There is a use case that the hwtstamp will be useful to the sockops prog to better measure the one-way-delay when the sender has put the tx timestamp in the tcp header option. Signed-off-by: Martin KaFai Lau <martin.lau@kernel.org> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Yonghong Song <yhs@fb.com> Link: https://lore.kernel.org/bpf/20221107230420.4192307-2-martin.lau@linux.dev	2022-11-12 18:24:12 -08:00
Eduard Zingerman	e1b34c589d	libbpf: Hashmap.h update to fix build issues using LLVM14 A fix for the LLVM compilation error while building bpftool. Replaces the expression: _Static_assert((p) == NULL \|\| ...) by expression: _Static_assert((__builtin_constant_p((p)) ? (p) == NULL : 0) \|\| ...) When "p" is not a constant the former is not considered to be a constant expression by LLVM 14. The error was introduced in the following patch-set: [1]. The error was reported here: [2]. [1] https://lore.kernel.org/bpf/20221109142611.879983-1-eddyz87@gmail.com/ [2] https://lore.kernel.org/all/202211110355.BcGcbZxP-lkp@intel.com/ Reported-by: kernel test robot <lkp@intel.com> Fixes: c302378bc157 ("libbpf: Hashmap interface update to allow both long and void* keys/values") Signed-off-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Stanislav Fomichev <sdf@google.com> Link: https://lore.kernel.org/bpf/20221110223240.1350810-1-eddyz87@gmail.com	2022-11-12 18:24:12 -08:00
Eduard Zingerman	7583310911	libbpf: Resolve unambigous forward declarations Resolve forward declarations that don't take part in type graphs comparisons if declaration name is unambiguous. Example: CU #1: struct foo; // standalone forward declaration struct foo some_global; CU #2: struct foo { int x; }; struct foo another_global; The `struct foo` from CU #1 is not a part of any definition that is compared against another definition while `btf_dedup_struct_types` processes structural types. After `btf_dedup_struct_types` the BTF looks as follows: [1] STRUCT 'foo' size=4 vlen=1 ... [2] INT 'int' size=4 ... [3] PTR '(anon)' type_id=1 [4] FWD 'foo' fwd_kind=struct [5] PTR '(anon)' type_id=4 This commit adds a new pass `btf_dedup_resolve_fwds`, that maps such forward declarations to structs or unions with identical name if the name is not ambiguous. The pass is positioned before `btf_dedup_ref_types` so that types [3] and [5] could be merged as the same type after [1] and [4] are merged. The final result for the example above looks as follows: [1] STRUCT 'foo' size=4 vlen=1 'x' type_id=2 bits_offset=0 [2] INT 'int' size=4 bits_offset=0 nr_bits=32 encoding=SIGNED [3] PTR '(anon)' type_id=1 For defconfig kernel with BTF enabled this removes 63 forward declarations. Examples of removed declarations: `pt_regs`, `in6_addr`. The running time of `btf__dedup` function is increased by about 3%. Signed-off-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Reviewed-by: Alan Maguire <alan.maguire@oracle.com> Acked-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20221109142611.879983-3-eddyz87@gmail.com	2022-11-12 18:24:12 -08:00
Eduard Zingerman	4a65c5d888	libbpf: Hashmap interface update to allow both long and void* keys/values An update for libbpf's hashmap interface from void* -> void* to a polymorphic one, allowing both long and void* keys and values. This simplifies many use cases in libbpf as hashmaps there are mostly integer to integer. Perf copies hashmap implementation from libbpf and has to be updated as well. Changes to libbpf, selftests/bpf and perf are packed as a single commit to avoid compilation issues with any future bisect. Polymorphic interface is achieved by hiding hashmap interface functions behind auxiliary macros that take care of necessary type casts, for example: #define hashmap_cast_ptr(p) \ ({ \ _Static_assert((p) == NULL \|\| sizeof(*(p)) == sizeof(long),\ #p " pointee should be a long-sized integer or a pointer"); \ (long *)(p); \ }) bool hashmap_find(const struct hashmap *map, long key, long *value); #define hashmap__find(map, key, value) \ hashmap_find((map), (long)(key), hashmap_cast_ptr(value)) - hashmap__find macro casts key and value parameters to long and long* respectively - hashmap_cast_ptr ensures that value pointer points to a memory of appropriate size. This hack was suggested by Andrii Nakryiko in [1]. This is a follow up for [2]. [1] https://lore.kernel.org/bpf/CAEf4BzZ8KFneEJxFAaNCCFPGqp20hSpS2aCj76uRk3-qZUH5xg@mail.gmail.com/ [2] https://lore.kernel.org/bpf/af1facf9-7bc8-8a3d-0db4-7b3f333589a2@meta.com/T/#m65b28f1d6d969fcd318b556db6a3ad499a42607d Signed-off-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20221109142611.879983-2-eddyz87@gmail.com	2022-11-12 18:24:12 -08:00
Eduard Zingerman	3a387f5a8f	libbpf: Resolve enum fwd as full enum64 and vice versa Changes de-duplication logic for enums in the following way: - update btf_hash_enum to ignore size and kind fields to get ENUM and ENUM64 types in a same hash bucket; - update btf_compat_enum to consider enum fwd to be compatible with full enum64 (and vice versa); This allows BTF de-duplication in the following case: // CU #1 enum foo; struct s { enum foo a; } x; // CU #2 enum foo { x = 0xfffffffff // big enough to force enum64 }; struct s { enum foo a; } y; De-duplicated BTF prior to this commit: [1] ENUM64 'foo' encoding=UNSIGNED size=8 vlen=1 'x' val=68719476735ULL [2] INT 'long unsigned int' size=8 bits_offset=0 nr_bits=64 encoding=(none) [3] STRUCT 's' size=8 vlen=1 'a' type_id=4 bits_offset=0 [4] PTR '(anon)' type_id=1 [5] PTR '(anon)' type_id=3 [6] STRUCT 's' size=8 vlen=1 'a' type_id=8 bits_offset=0 [7] ENUM 'foo' encoding=UNSIGNED size=4 vlen=0 [8] PTR '(anon)' type_id=7 [9] PTR '(anon)' type_id=6 De-duplicated BTF after this commit: [1] ENUM64 'foo' encoding=UNSIGNED size=8 vlen=1 'x' val=68719476735ULL [2] INT 'long unsigned int' size=8 bits_offset=0 nr_bits=64 encoding=(none) [3] STRUCT 's' size=8 vlen=1 'a' type_id=4 bits_offset=0 [4] PTR '(anon)' type_id=1 [5] PTR '(anon)' type_id=3 Enum forward declarations in C do not provide information about enumeration values range. Thus the `btf_type->size` field is meaningless for forward enum declarations. In fact, GCC does not encode size in DWARF for forward enum declarations (but dwarves sets enumeration size to a default value of `sizeof(int) * 8` when size is not specified see dwarf_loader.c:die__create_new_enumeration). Signed-off-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20221101235413.1824260-1-eddyz87@gmail.com	2022-11-12 18:24:12 -08:00
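The two changes listed in the commit above (a hash that ignores size and kind, and a compatibility check that lets an enum forward declaration match a full enum64) can be sketched as follows. The `toy_*` types and functions are hypothetical simplifications rather than libbpf's `btf_hash_enum` and `btf_compat_enum`; in particular, names are compared as C strings and a forward declaration is assumed to be an enum with zero enumerators.

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

enum toy_enum_kind { TOY_ENUM, TOY_ENUM64 };

/* Hypothetical, simplified enum descriptor. */
struct toy_enum {
	enum toy_enum_kind kind;
	const char *name;
	unsigned int size; /* bytes; meaningless for forward declarations */
	unsigned int vlen; /* number of enumerators; 0 means forward declaration */
};

static bool toy_is_enum_fwd(const struct toy_enum *t)
{
	return t->vlen == 0;
}

/* Hash only the name so ENUM and ENUM64 with one name share a bucket. */
static unsigned long toy_hash_enum(const struct toy_enum *t)
{
	unsigned long h = 5381;

	assert(t != NULL && t->name != NULL);
	for (const char *p = t->name; *p; p++)
		h = h * 33 + (unsigned char)*p;
	return h;
}

/*
 * A forward declaration is compatible with any enum of the same name,
 * whether ENUM or ENUM64. Two full enums must also agree on kind and size.
 */
static bool toy_compat_enum(const struct toy_enum *a, const struct toy_enum *b)
{
	if (strcmp(a->name, b->name) != 0)
		return false;
	if (toy_is_enum_fwd(a) || toy_is_enum_fwd(b))
		return true;
	return a->kind == b->kind && a->size == b->size;
}
```

In the commit's example, the `enum foo` forward declaration from CU #1 (size 4, no enumerators) and the full ENUM64 'foo' from CU #2 would land in the same bucket and be reported as compatible, which is what allows the two copies of `struct s` to be merged.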
Ravi Bangoria	a2eba90326	perf/mem: Rename PERF_MEM_LVLNUM_EXTN_MEM to PERF_MEM_LVLNUM_CXL PERF_MEM_LVLNUM_EXTN_MEM was introduced to cover CXL devices, but its name is a bit ambiguous and also not generic enough to cover cxl.cache and cxl.io devices. Rename it to PERF_MEM_LVLNUM_CXL to be more specific. Signed-off-by: Ravi Bangoria <ravi.bangoria@amd.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Link: https://lkml.kernel.org/r/f6268268-b4e9-9ed6-0453-65792644d953@amd.com	2022-11-12 18:24:12 -08:00
Yonghong Song	7106ebe768	libbpf: Support new cgroup local storage Add support for new cgroup local storage. Acked-by: David Vernet <void@manifault.com> Acked-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Yonghong Song <yhs@fb.com> Link: https://lore.kernel.org/r/20221026042856.673989-1-yhs@fb.com Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-11-12 18:24:12 -08:00
Yonghong Song	3c6d127e50	bpf: Implement cgroup storage available to non-cgroup-attached bpf progs Similar to sk/inode/task storage, implement similar cgroup local storage. There already exists a local storage implementation for cgroup-attached bpf programs. See map type BPF_MAP_TYPE_CGROUP_STORAGE and helper bpf_get_local_storage(). But there are use cases where non-cgroup-attached bpf progs want to access cgroup local storage data. For example, a tc egress prog has access to sk and cgroup. It is possible to use sk local storage to emulate cgroup local storage by storing data in the socket. But this is a waste as there could be lots of sockets belonging to a particular cgroup. Alternatively, a separate map can be created with cgroup id as the key. But this will introduce additional overhead to manipulate the new map. A cgroup local storage, similar to existing sk/inode/task storage, should help for this use case. The life-cycle of storage is managed with the life-cycle of the cgroup struct, i.e. the storage is destroyed along with the owning cgroup with a call to bpf_cgrp_storage_free() when the cgroup itself is deleted. The userspace map operations can be done by using a cgroup fd as a key passed to the lookup, update and delete operations. Typically, the following code is used to get the current cgroup: struct task_struct *task = bpf_get_current_task_btf(); ... task->cgroups->dfl_cgrp ... and in structure task_struct definition: struct task_struct { .... struct css_set __rcu *cgroups; .... } With a sleepable program, accessing task->cgroups is not protected by rcu_read_lock. So the current implementation only supports non-sleepable programs, and supporting sleepable programs will be the next step together with adding rcu_read_lock protection for rcu tagged structures. Since map name BPF_MAP_TYPE_CGROUP_STORAGE has been used for old cgroup local storage support, the new map name BPF_MAP_TYPE_CGRP_STORAGE is used for cgroup storage available to non-cgroup-attached bpf programs. 
The old cgroup storage supports bpf_get_local_storage() helper to get the cgroup data. The new cgroup storage helper bpf_cgrp_storage_get() can provide similar functionality. While old cgroup storage pre-allocates storage memory, the new mechanism can also pre-allocate with a user space bpf_map_update_elem() call to avoid potential run-time memory allocation failure. Therefore, the new cgroup storage can provide all functionality w.r.t. the old one. So in uapi bpf.h, the old BPF_MAP_TYPE_CGROUP_STORAGE is alias to BPF_MAP_TYPE_CGROUP_STORAGE_DEPRECATED to indicate the old cgroup storage can be deprecated since the new one can provide the same functionality. Acked-by: David Vernet <void@manifault.com> Signed-off-by: Yonghong Song <yhs@fb.com> Link: https://lore.kernel.org/r/20221026042850.673791-1-yhs@fb.com Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-11-12 18:24:12 -08:00
Alan Maguire	6ebbbacb5c	libbpf: Btf dedup identical struct test needs check for nested structs/arrays When examining module BTF, it is common to see core kernel structures such as sk_buff, net_device duplicated in the module. After adding debug messaging to BTF it turned out that much of the problem was down to the identical struct test failing during deduplication; sometimes the compiler adds identical structs. However it turns out sometimes that type ids of identical struct members can also differ, even when the containing structs are still identical. To take an example, for struct sk_buff, debug messaging revealed that the identical struct matching was failing for the anon struct "headers"; specifically for the first field: __u8 __pkt_type_offset[0]; /* 128 0 */ Looking at the code in BTF deduplication, we have code that guards against the possibility of identical struct definitions, down to type ids, and identical array definitions. However in this case we have a struct which is being defined twice but does not have identical type ids since each duplicate struct has separate type ids for the above array member. A similar problem (though not observed) could occur for struct-in-struct. The solution is to make the "identical struct" test check members not just for matching ids, but to also check if they in turn are identical structs or arrays. The results of doing this are quite dramatic (for some modules at least); I see the number of type ids drop from around 10000 to just over 1000 in one module for example. For testing use latest pahole or apply [1], otherwise dedups can fail for the reasons described there. Also fix return type of btf_dedup_identical_arrays() as suggested by Andrii to match boolean return type used elsewhere. 
Fixes: efdd3eb8015e ("libbpf: Accommodate DWARF/compiler bug with duplicated structs") Signed-off-by: Alan Maguire <alan.maguire@oracle.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/1666622309-22289-1-git-send-email-alan.maguire@oracle.com [1] https://lore.kernel.org/bpf/1666364523-9648-1-git-send-email-alan.maguire	2022-11-12 18:24:12 -08:00


1909 Commits