libbpf

mirror of https://github.com/netdata/libbpf.git synced 2026-06-14 18:49:08 +08:00

Author	SHA1	Message	Date
Andrii Nakryiko	a202bd7433	sync: auto-generate latest BPF helpers Latest changes to BPF helper definitions.	2022-08-18 15:29:04 -07:00
Andrii Nakryiko	ba81a5b778	libbpf: Clean up deprecated and legacy aliases Remove three missed deprecated APIs that were aliased to new APIs: bpf_object__unload, bpf_prog_attach_xattr and btf__load. Also move legacy API libbpf_find_kernel_btf (aliased to btf__load_vmlinux_btf) into libbpf_legacy.h. Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Hao Luo <haoluo@google.com> Link: https://lore.kernel.org/bpf/20220816001929.369487-4-andrii@kernel.org	2022-08-18 15:29:04 -07:00
Andrii Nakryiko	f7cee4152f	libbpf: Streamline bpf_attr and perf_event_attr initialization Make sure that entire libbpf code base is initializing bpf_attr and perf_event_attr with memset(0). Also for bpf_attr make sure we clear and pass to kernel only relevant parts of bpf_attr. bpf_attr is a huge union of independent sub-command attributes, so there is no need to clear and pass entire union bpf_attr, which over time grows quite a lot and for most commands this growth is completely irrelevant. Few cases where we were relying on compiler initialization of BPF UAPI structs (like bpf_prog_info, bpf_map_info, etc) with `= {};` were switched to memset(0) pattern for future-proofing. Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Hao Luo <haoluo@google.com> Link: https://lore.kernel.org/bpf/20220816001929.369487-3-andrii@kernel.org	2022-08-18 15:29:04 -07:00
Andrii Nakryiko	06c4624c8c	libbpf: Fix potential NULL dereference when parsing ELF Fix if condition filtering empty ELF sections to prevent NULL dereference. Fixes: 47ea7417b074 ("libbpf: Skip empty sections in bpf_object__init_global_data_maps") Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Hao Luo <haoluo@google.com> Link: https://lore.kernel.org/bpf/20220816001929.369487-2-andrii@kernel.org	2022-08-18 15:29:04 -07:00
Hao Luo	c8f4b9c878	libbpf: Allows disabling auto attach Adds libbpf APIs for disabling auto-attach for individual functions. This is motivated by the use case of cgroup iter [1]. Some iter types require their parameters to be non-zero, therefore applying auto-attach on them will fail. With these two new APIs, users who want to use auto-attach and these types of iters can disable auto-attach on the program and perform manual attach. [1] https://lore.kernel.org/bpf/CAEf4BzZ+a2uDo_t6kGBziqdz--m2gh2_EUwkGLDtMd65uwxUjA@mail.gmail.com/ Signed-off-by: Hao Luo <haoluo@google.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20220816234012.910255-1-haoluo@google.com	2022-08-18 15:29:04 -07:00
Hangbin Liu	079bc8536d	libbpf: Making bpf_prog_load() ignore name if kernel doesn't support Similar with commit 10b62d6a38f7 ("libbpf: Add names for auxiliary maps"), let's make bpf_prog_load() also ignore name if kernel doesn't support program name. To achieve this, we need to call sys_bpf_prog_load() directly in probe_kern_prog_name() to avoid circular dependency. sys_bpf_prog_load() also need to be exported in the libbpf_internal.h file. Signed-off-by: Hangbin Liu <liuhangbin@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Quentin Monnet <quentin@isovalent.com> Link: https://lore.kernel.org/bpf/20220813000936.6464-1-liuhangbin@gmail.com	2022-08-18 15:29:04 -07:00
Quentin Monnet	8be13ee80b	bpf: Clear up confusion in bpf_skb_adjust_room()'s documentation Adding or removing room space _below_ layers 2 or 3, as the description mentions, is ambiguous. This was written with a mental image of the packet with layer 2 at the top, layer 3 under it, and so on. But it has led users to believe that it was on lower layers (before the beginning of the L2 and L3 headers respectively). Let's make it more explicit, and specify between which layers the room space is adjusted. Reported-by: Rumen Telbizov <rumen.telbizov@menlosecurity.com> Signed-off-by: Quentin Monnet <quentin@isovalent.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Link: https://lore.kernel.org/bpf/20220812153727.224500-3-quentin@isovalent.com	2022-08-18 15:29:04 -07:00
Hangbin Liu	3db7585378	libbpf: Add names for auxiliary maps The bpftool self-created maps can appear in final map show output due to deferred removal in kernel. These maps don't have a name, which would make users confused about where it comes from. With a libbpf_ prefix name, users could know who created these maps. It also could make some tests (like test_offload.py, which skip base maps without names as a workaround) filter them out. Kernel adds bpf prog/map name support in the same merge commit fadad670a8ab ("Merge branch 'bpf-extend-info'"). So we can also use kernel_supports(NULL, FEAT_PROG_NAME) to check if kernel supports map name. As discussed [1], Let's make bpf_map_create accept non-null name string, and silently ignore the name if kernel doesn't support. [1] https://lore.kernel.org/bpf/CAEf4BzYL1TQwo1231s83pjTdFPk9XWWhfZC5=KzkU-VO0k=0Ug@mail.gmail.com/ Signed-off-by: Hangbin Liu <liuhangbin@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20220811034020.529685-1-liuhangbin@gmail.com	2022-08-18 15:29:04 -07:00
Daniel Müller	69938da6d7	Explicitly specify Qemu image path to use The path to the file system image used by our invocation of Qemu is currently hard coded to /tmp/root.img somewhere in a different repository. With `da44c0b6ee` landed we have the option of specifying it explicitly from here. Let's do just that, so that we can remove the default value from libbpf/ci altogether. Signed-off-by: Daniel Müller <deso@posteo.net>	2022-08-18 14:38:23 -07:00
Daniel Müller	bfdf7653e0	Rename travis-ci/ directory to ci/ We are no longer using Travis. As such, we should move away from a lot of CI functionality located in a folder called travis-ci/. This change renames the travis-ci/ directory to the more generic ci/. To preserve backwards compatibility until all "consumers" have transitioned, we add a symbolic link called travis-ci back. It will be removed in the near term future. Signed-off-by: Daniel Müller <deso@posteo.net>	2022-08-18 09:02:13 -07:00
Daniel Müller	d700dcf162	Print allow and denylists We should include the deny and allow lists used somewhere in the output of our CI runs in order to improve debuggability in general. With this change we print out these lists once assembled. Signed-off-by: Daniel Müller <deso@posteo.net>	2022-08-17 11:41:22 -07:00
Daniel Müller	c03b9f6d0b	Move kernel version check inwards The run_selftests.sh script defines functions for running individual tests. However, not all tests are run in all configurations. E.g., test_progs is not run on 4.9.0 kernels and test_maps is only run when testing on the "latest" kernel version. The checks for these conditions, however, are applied inconsistently: some are in the functions themselves and others on the call site. This change unifies all checks to happen within the test function itself. Signed-off-by: Daniel Müller <deso@posteo.net>	2022-08-17 11:41:22 -07:00
Daniel Müller	66b788c1a4	Factor out test_progs_noalu function This change factors out a new function, test_progs_noalu, in the run_selftests.sh script. Having this function available will make it easier for us to run tests conditionally later on, but it's also a matter of having one function for one binary. Signed-off-by: Daniel Müller <deso@posteo.net>	2022-08-17 11:41:22 -07:00
Daniel Müller	e3c2b8a48d	Re-enable test_maps selftest Back in 2020, we disabled the test_maps selftest with `e05f9be4f4` ("vmtests: temporarily disable test_maps") for reasons not closely elaborated. It appears that by now the test is succeeding again, so let's enable it back. Signed-off-by: Daniel Müller <deso@posteo.net>	2022-08-15 15:50:55 -07:00
Andrii Nakryiko	13a26d78f3	sync: latest libbpf changes from kernel Syncing latest libbpf commits from kernel repository. Baseline bpf-next commit: 71930846b36f8e4e68267f8a3d47e33435c3657a Checkpoint bpf-next commit: 73cf09a36bf7bfb3e5a3ff23755c36d49137c44d Baseline bpf commit: f946964a9f79f8dcb5a6329265281eebfc23aee5 Checkpoint bpf commit: e7c677bdd03d54e9a1bafcaf1faf5c573a506bba Alexei Starovoitov (1): bpf: Disallow bpf programs call prog_run command. Andrii Nakryiko (2): libbpf: Reject legacy 'maps' ELF section libbpf: preserve errno across pr_warn/pr_info/pr_debug Dave Marchevsky (1): bpf: Improve docstring for BPF_F_USER_BUILD_ID flag Florian Fainelli (1): libbpf: Initialize err in probe_map_create Gustavo A. R. Silva (1): treewide: uapi: Replace zero-length arrays with flexible-array members Hengqi Chen (1): libbpf: Do not require executable permission for shared libraries James Hilliard (2): libbpf: Skip empty sections in bpf_object__init_global_data_maps libbpf: Ensure functions with always_inline attribute are inline Jesper Dangaard Brouer (1): bpf: Add BPF-helper for accessing CLOCK_TAI Namhyung Kim (1): perf/core: Add a new read format to get a number of lost samples include/uapi/linux/bpf.h \| 27 +++++++++++++++++++++++++-- include/uapi/linux/perf_event.h \| 7 +++++-- include/uapi/linux/pkt_cls.h \| 4 ++-- src/bpf_tracing.h \| 14 +++++++------- src/libbpf.c \| 25 +++++++++++++++++-------- src/libbpf_probes.c \| 2 +- src/skel_internal.h \| 4 ++-- src/usdt.bpf.h \| 4 ++-- 8 files changed, 61 insertions(+), 26 deletions(-) -- 2.30.2	2022-08-10 14:07:19 -07:00
Andrii Nakryiko	6b92311c3a	sync: auto-generate latest BPF helpers Latest changes to BPF helper definitions.	2022-08-10 14:07:19 -07:00
Alexei Starovoitov	6fdbfb00f1	bpf: Disallow bpf programs call prog_run command. The verifier cannot perform sufficient validation of bpf_attr->test.ctx_in pointer, therefore bpf programs should not be allowed to call BPF_PROG_RUN command from within the program. To fix this issue split bpf_sys_bpf() bpf helper into normal kern_sys_bpf() kernel function that can only be used by the kernel light skeleton directly. Reported-by: YiFei Zhu <zhuyifei@google.com> Fixes: b1d18a7574d0 ("bpf: Extend sys_bpf commands for bpf_syscall programs.") Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-08-10 14:07:19 -07:00
Andrii Nakryiko	45dca19bd2	libbpf: preserve errno across pr_warn/pr_info/pr_debug As suggested in [0], make sure that libbpf_print saves and restored errno and as such guaranteed that no matter what actual print callback user installs, macros like pr_warn/pr_info/pr_debug are completely transparent as far as errno goes. While libbpf code is pretty careful about not clobbering important errno values accidentally with pr_warn(), it's a trivial change to make sure that pr_warn can be used anywhere without a risk of clobbering errno. No functional changes, just future proofing. [0] https://github.com/libbpf/libbpf/pull/536 Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Daniel Müller <deso@posteo.net> Link: https://lore.kernel.org/r/20220810183425.1998735-1-andrii@kernel.org Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-08-10 14:07:19 -07:00
Jesper Dangaard Brouer	2fe1958ec8	bpf: Add BPF-helper for accessing CLOCK_TAI Commit 3dc6ffae2da2 ("timekeeping: Introduce fast accessor to clock tai") introduced a fast and NMI-safe accessor for CLOCK_TAI. Especially in time sensitive networks (TSN), where all nodes are synchronized by Precision Time Protocol (PTP), it's helpful to have the possibility to generate timestamps based on CLOCK_TAI instead of CLOCK_MONOTONIC. With a BPF helper for TAI in place, it becomes very convenient to correlate activity across different machines in the network. Use cases for such a BPF helper include functionalities such as Tx launch time (e.g. ETF and TAPRIO Qdiscs) and timestamping. Note: CLOCK_TAI is nothing new per se, only the NMI-safe variant of it is. Signed-off-by: Jesper Dangaard Brouer <brouer@redhat.com> [Kurt: Wrote changelog and renamed helper] Signed-off-by: Kurt Kanzenbach <kurt@linutronix.de> Link: https://lore.kernel.org/r/20220809060803.5773-2-kurt@linutronix.de Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-08-10 14:07:19 -07:00
Dave Marchevsky	cbd9b7e5d8	bpf: Improve docstring for BPF_F_USER_BUILD_ID flag Most tools which use bpf_get_stack or bpf_get_stackid symbolicate the stack - meaning the stack of addresses in the target process' address space is transformed into meaningful symbol names. The BPF_F_USER_BUILD_ID flag eases this process by finding the build_id of the file-backed vma which the address falls in and translating the address to an offset within the backing file. To be more specific, the offset is a "file offset" from the beginning of the backing file. The symbols in ET_DYN ELF objects have a st_value which is also described as an "offset" - but an offset in the process address space, relative to the base address of the object. It's necessary to translate between the "file offset" and "virtual address offset" during symbolication before they can be directly compared. Failure to do so can lead to confusing bugs, so this patch clarifies language in the documentation in an attempt to keep this from happening. Signed-off-by: Dave Marchevsky <davemarchevsky@fb.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20220808164723.3107500-1-davemarchevsky@fb.com	2022-08-10 14:07:19 -07:00
Hengqi Chen	0cc6bfab39	libbpf: Do not require executable permission for shared libraries Currently, resolve_full_path() requires executable permission for both programs and shared libraries. This causes failures on distos like Debian since the shared libraries are not installed executable and Linux is not requiring shared libraries to have executable permissions. Let's remove executable permission check for shared libraries. Reported-by: Goro Fuji <goro@fastly.com> Signed-off-by: Hengqi Chen <hengqi.chen@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20220806102021.3867130-1-hengqi.chen@gmail.com	2022-08-10 14:07:19 -07:00
Andrii Nakryiko	41c612167e	libbpf: Reject legacy 'maps' ELF section Add explicit error message if BPF object file is still using legacy BPF map definitions in SEC("maps"). Before this change, if BPF object file is still using legacy map definition user will see a bit confusing: libbpf: elf: skipping unrecognized data section(4) maps libbpf: prog 'handler': bad map relo against 'server_map' in section 'maps' Now libbpf will be explicit about rejecting "maps" ELF section: libbpf: elf: legacy map definitions in 'maps' section are not supported by libbpf v1.0+ Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Link: https://lore.kernel.org/bpf/20220803214202.23750-1-andrii@kernel.org	2022-08-10 14:07:19 -07:00
James Hilliard	69d537ba0b	libbpf: Ensure functions with always_inline attribute are inline GCC expects the always_inline attribute to only be set on inline functions, as such we should make all functions with this attribute use the __always_inline macro which makes the function inline and sets the attribute. Fixes errors like: /home/buildroot/bpf-next/tools/testing/selftests/bpf/tools/include/bpf/bpf_tracing.h:439:1: error: ‘always_inline’ function might not be inlinable [-Werror=attributes] 439 \| ____##name(unsigned long long *ctx, ##args) \| ^~~~ Signed-off-by: James Hilliard <james.hilliard1@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Jiri Olsa <jolsa@kernel.org> Link: https://lore.kernel.org/bpf/20220803151403.793024-1-james.hilliard1@gmail.com	2022-08-10 14:07:19 -07:00
Florian Fainelli	bd1e5cff31	libbpf: Initialize err in probe_map_create GCC-11 warns about the possibly unitialized err variable in probe_map_create: libbpf_probes.c: In function 'probe_map_create': libbpf_probes.c:361:38: error: 'err' may be used uninitialized in this function [-Werror=maybe-uninitialized] 361 \| return fd < 0 && err == exp_err ? 1 : 0; \| ~~~~^~~~~~~~~~ Fixes: 878d8def0603 ("libbpf: Rework feature-probing APIs") Signed-off-by: Florian Fainelli <f.fainelli@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Jiri Olsa <jolsa@kernel.org> Link: https://lore.kernel.org/bpf/20220801025109.1206633-1-f.fainelli@gmail.com	2022-08-10 14:07:19 -07:00
James Hilliard	3d484ca473	libbpf: Skip empty sections in bpf_object__init_global_data_maps The GNU assembler generates an empty .bss section. This is a well established behavior in GAS that happens in all supported targets. The LLVM assembler doesn't generate an empty .bss section. bpftool chokes on the empty .bss section. Additionally in bpf_object__elf_collect the sec_desc->data is not initialized when a section is not recognized. In this case, this happens with .comment. So we must check that sec_desc->data is initialized before checking if the size is 0. Signed-off-by: James Hilliard <james.hilliard1@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Jiri Olsa <jolsa@kernel.org> Link: https://lore.kernel.org/bpf/20220731232649.4668-1-james.hilliard1@gmail.com	2022-08-10 14:07:19 -07:00
Gustavo A. R. Silva	c25544735b	treewide: uapi: Replace zero-length arrays with flexible-array members There is a regular need in the kernel to provide a way to declare having a dynamically sized set of trailing elements in a structure. Kernel code should always use “flexible array members”[1] for these cases. The older style of one-element or zero-length arrays should no longer be used[2]. This code was transformed with the help of Coccinelle: (linux-5.19-rc2$ spatch --jobs $(getconf _NPROCESSORS_ONLN) --sp-file script.cocci --include-headers --dir . > output.patch) @@ identifier S, member, array; type T1, T2; @@ struct S { ... T1 member; T2 array[ - 0 ]; }; -fstrict-flex-arrays=3 is coming and we need to land these changes to prevent issues like these in the short future: ../fs/minix/dir.c:337:3: warning: 'strcpy' will always overflow; destination buffer has size 0, but the source string has length 2 (including NUL byte) [-Wfortify-source] strcpy(de3->name, "."); ^ Since these are all [0] to [] changes, the risk to UAPI is nearly zero. If this breaks anything, we can use a union with a new member name. [1] https://en.wikipedia.org/wiki/Flexible_array_member [2] https://www.kernel.org/doc/html/v5.16/process/deprecated.html#zero-length-and-one-element-arrays Link: https://github.com/KSPP/linux/issues/78 Build-tested-by: kernel test robot <lkp@intel.com> Link: https://lore.kernel.org/lkml/62b675ec.wKX6AOZ6cbE71vtF%25lkp@intel.com/ Acked-by: Dan Williams <dan.j.williams@intel.com> # For ndctl.h Signed-off-by: Gustavo A. R. Silva <gustavoars@kernel.org>	2022-08-10 14:07:19 -07:00
Namhyung Kim	179c7940eb	perf/core: Add a new read format to get a number of lost samples Sometimes we want to know an accurate number of samples even if it's lost. Currenlty PERF_RECORD_LOST is generated for a ring-buffer which might be shared with other events. So it's hard to know per-event lost count. Add event->lost_samples field and PERF_FORMAT_LOST to retrieve it from userspace. Original-patch-by: Jiri Olsa <jolsa@redhat.com> Signed-off-by: Namhyung Kim <namhyung@kernel.org> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Link: https://lkml.kernel.org/r/20220616180623.1358843-1-namhyung@kernel.org	2022-08-10 14:07:19 -07:00
Daniel Müller	f6692dc4e8	Remove checked-in configuration Both the bpf and bpf-next tree have suitable BPF selftest configurations available for usage with the latest kernel now upstream. While we do test on 4.9 and 5.5 kernels as well, there we just download prebuilt binaries. The configuration we use for building selftests is always the upstream one. With this change we remove the checked-in configuration, as it is now no longer needed. Signed-off-by: Daniel Müller <deso@posteo.net>	2022-08-10 10:24:28 -07:00
Daniel Müller	693de729d0	Rename blacklists and whitelists Upstream uses denylist and allowlist terminology instead of blacklist and whitelist. It also has established a less deeply nested directory structure. This change renames the blacklist & whitelist files accordingly and moves them one level up out of their containing directory to mirror the layout we have upstream as well as in kernel-patches/vmtest. Signed-off-by: Daniel Müller <deso@posteo.net>	2022-08-10 08:31:17 -07:00
Daniel Müller	0667206913	Use checkout action in version v3 The current version of actions/checkout is v3. That means that v2, which we currently use, has been superseded. Update the version we use accordingly. Signed-off-by: Daniel Müller <deso@posteo.net>	2022-08-09 14:02:50 -07:00
Daniel Müller	a2ebd9ceff	Rely on upstream kernel configuration So far we have relied on the kernel configuration as checked into the this repository. However, a suitable configuration is now included in upstream Linux [0]. With this change we add support for using the configuration from there. [0] https://lore.kernel.org/bpf/165893461358.29339.11641967418379627671.git-patchwork-notify@kernel.org/T/#m2a97b0ea9ef0ddee7a53bbf7919e3f324b233937 Signed-off-by: Daniel Müller <deso@posteo.net>	2022-08-09 09:23:59 -07:00
Daniel Müller	0e43565ad8	ci: Bump LLVM version we use to 16 Development on LLVM 16 has started and version 15 is no longer available in the repository we install it from. Bump the version we use accordingly. Signed-off-by: Daniel Müller <deso@posteo.net>	2022-08-01 13:10:42 -07:00
Andrii Nakryiko	5b795f7b30	ci: blacklist skeleton selftest Selftest relies on new 5.19+ kernel support for big ARRAY maps. Signed-off-by: Andrii Nakryiko <andrii@kernel.org>	2022-07-31 16:45:48 -07:00
Andrii Nakryiko	3fa2c28d2c	sync: latest libbpf changes from kernel Syncing latest libbpf commits from kernel repository. Baseline bpf-next commit: b0d93b44641a83c28014ca38001e85bf6dc8501e Checkpoint bpf-next commit: 71930846b36f8e4e68267f8a3d47e33435c3657a Baseline bpf commit: d28b25a62a47a8c8aa19bd543863aab6717e68c9 Checkpoint bpf commit: f946964a9f79f8dcb5a6329265281eebfc23aee5 Andrii Nakryiko (7): libbpf: add bpf_core_type_matches() helper macro libbpf: Remove unnecessary usdt_rel_ip assignments libbpf: generalize virtual __kconfig externs and use it for USDT libbpf: improve BPF_KPROBE_SYSCALL macro and rename it to BPF_KSYSCALL libbpf: add ksyscall/kretsyscall sections support for syscall kprobes libbpf: fallback to tracefs mount point if debugfs is not mounted libbpf: make RINGBUF map size adjustments more eagerly Anquan Wu (1): libbpf: Fix the name of a reused map Chuang Wang (3): libbpf: Cleanup the legacy kprobe_event on failed add/attach_event() libbpf: Fix wrong variable used in perf_event_uprobe_open_legacy() libbpf: Cleanup the legacy uprobe_event on failed add/attach_event() Dan Carpenter (3): libbpf: fix an snprintf() overflow check libbpf: Fix sign expansion bug in btf_dump_get_enum_value() libbpf: Fix str_has_sfx()'s return value Daniel Müller (4): bpf: Introduce TYPE_MATCH related constants/macros bpf, libbpf: Add type match support bpf: Correctly propagate errors up from bpf_core_composites_match libbpf: Support PPC in arch_specific_syscall_pfx Hangbin Liu (1): Bonding: add per-port priority for failover re-selection Hengqi Chen (1): libbpf: Error out when binary_path is NULL for uprobe and USDT Ilya Leoshkevich (1): libbpf: Extend BPF_KSYSCALL documentation James Hilliard (1): libbpf: Disable SEC pragma macro on GCC Joanne Koong (2): bpf: Add flags arg to bpf_dynptr_read and bpf_dynptr_write APIs bpf: fix bpf_skb_pull_data documentation Joe Burton (1): libbpf: Add bpf_obj_get_opts() Jon Doron (1): libbpf: perfbuf: Add API to get the ring buffer Pu Lehui (1): bpf, docs: Remove deprecated xsk libbpf APIs description Yixun Lan (1): libbpf, riscv: Use a0 for RC register docs/libbpf_naming_convention.rst \| 13 +- include/uapi/linux/bpf.h \| 15 +- include/uapi/linux/if_link.h \| 1 + src/bpf.c \| 9 + src/bpf.h \| 11 + src/bpf_core_read.h \| 11 + src/bpf_helpers.h \| 13 + src/bpf_tracing.h \| 60 +++- src/btf_dump.c \| 2 +- src/gen_loader.c \| 2 +- src/libbpf.c \| 440 ++++++++++++++++++++++-------- src/libbpf.h \| 62 +++++ src/libbpf.map \| 3 + src/libbpf_internal.h \| 8 +- src/relo_core.c \| 286 ++++++++++++++++++- src/relo_core.h \| 4 + src/usdt.bpf.h \| 16 +- src/usdt.c \| 6 +- 18 files changed, 793 insertions(+), 169 deletions(-) -- 2.30.2	2022-07-31 16:45:48 -07:00
Andrii Nakryiko	0fa013e705	sync: auto-generate latest BPF helpers Latest changes to BPF helper definitions.	2022-07-31 16:45:48 -07:00
Joe Burton	d8e2c9d965	libbpf: Add bpf_obj_get_opts() Add an extensible variant of bpf_obj_get() capable of setting the `file_flags` parameter. This parameter is needed to enable unprivileged access to BPF maps. Without a method like this, users must manually make the syscall. Signed-off-by: Joe Burton <jevburton@google.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20220729202727.3311806-1-jevburton.kernel@gmail.com	2022-07-31 16:45:48 -07:00
Daniel Müller	b2d7228d7c	libbpf: Support PPC in arch_specific_syscall_pfx Commit 708ac5bea0ce ("libbpf: add ksyscall/kretsyscall sections support for syscall kprobes") added the arch_specific_syscall_pfx() function, which returns a string representing the architecture in use. As it turns out this function is currently not aware of Power PC, where NULL is returned. That's being flagged by the libbpf CI system, which builds for ppc64le and the compiler sees a NULL pointer being passed in to a %s format string. With this change we add representations for two more architectures, for Power PC and Power PC 64, and also adjust the string format logic to handle NULL pointers gracefully, in an attempt to prevent similar issues with other architectures in the future. Fixes: 708ac5bea0ce ("libbpf: add ksyscall/kretsyscall sections support for syscall kprobes") Signed-off-by: Daniel Müller <deso@posteo.net> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20220728222345.3125975-1-deso@posteo.net	2022-07-31 16:45:48 -07:00
Ilya Leoshkevich	427f2a0c83	libbpf: Extend BPF_KSYSCALL documentation Explicitly list known quirks. Mention that socket-related syscalls can be invoked via socketcall(). Signed-off-by: Ilya Leoshkevich <iii@linux.ibm.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Jiri Olsa <jolsa@kernel.org> Link: https://lore.kernel.org/bpf/20220726134008.256968-2-iii@linux.ibm.com	2022-07-31 16:45:48 -07:00
Dan Carpenter	8663289b51	libbpf: Fix str_has_sfx()'s return value The return from strcmp() is inverted so it wrongly returns true instead of false and vice versa. Fixes: a1c9d61b19cb ("libbpf: Improve library identification for uprobe binary path resolution") Signed-off-by: Dan Carpenter <dan.carpenter@oracle.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Martin KaFai Lau <kafai@fb.com> Cc: Alan Maguire <alan.maguire@oracle.com> Link: https://lore.kernel.org/bpf/YtZ+/dAA195d99ak@kili	2022-07-31 16:45:48 -07:00
Dan Carpenter	77e514d626	libbpf: Fix sign expansion bug in btf_dump_get_enum_value() The code here is supposed to take a signed int and store it in a signed long long. Unfortunately, the way that the type promotion works with this conditional statement is that it takes a signed int, type promotes it to a __u32, and then stores that as a signed long long. The result is never negative. This is from static analysis, but I made a little test program just to test it before I sent the patch: #include <stdio.h> int main(void) { unsigned long long src = -1ULL; signed long long dst1, dst2; int is_signed = 1; dst1 = is_signed ? (int )&src : (unsigned int )0; dst2 = is_signed ? (signed long long)(int )&src : (unsigned int )0; printf("%lld\n", dst1); printf("%lld\n", dst2); return 0; } Fixes: d90ec262b35b ("libbpf: Add enum64 support for btf_dump") Signed-off-by: Dan Carpenter <dan.carpenter@oracle.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Martin KaFai Lau <kafai@fb.com> Link: https://lore.kernel.org/bpf/YtZ+LpgPADm7BeEd@kili	2022-07-31 16:45:48 -07:00
Dan Carpenter	b44b214118	libbpf: fix an snprintf() overflow check The snprintf() function returns the number of bytes it would have copied if there were enough space. So it can return > the sizeof(gen->attach_target). Fixes: 67234743736a ("libbpf: Generate loader program out of BPF ELF file.") Signed-off-by: Dan Carpenter <dan.carpenter@oracle.com> Acked-by: Martin KaFai Lau <kafai@fb.com> Link: https://lore.kernel.org/r/YtZ+oAySqIhFl6/J@kili Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-07-31 16:45:48 -07:00
Andrii Nakryiko	610707057a	libbpf: make RINGBUF map size adjustments more eagerly Make libbpf adjust RINGBUF map size (rounding it up to closest power-of-2 of page_size) more eagerly: during open phase when initializing the map and on explicit calls to bpf_map__set_max_entries(). Such approach allows user to check actual size of BPF ringbuf even before it's created in the kernel, but also it prevents various edge case scenarios where BPF ringbuf size can get out of sync with what it would be in kernel. One of them (reported in [0]) is during an attempt to pin/reuse BPF ringbuf. Move adjust_ringbuf_sz() helper closer to its first actual use. The implementation of the helper is unchanged. Also make detection of whether bpf_object is already loaded more robust by checking obj->loaded explicitly, given that map->fd can be < 0 even if bpf_object is already loaded due to ability to disable map creation with bpf_map__set_autocreate(map, false). [0] Closes: https://github.com/libbpf/libbpf/pull/530 Fixes: 0087a681fa8c ("libbpf: Automatically fix up BPF_MAP_TYPE_RINGBUF size, if necessary") Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Acked-by: Yonghong Song <yhs@fb.com> Link: https://lore.kernel.org/r/20220715230952.2219271-1-andrii@kernel.org Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-07-31 16:45:48 -07:00
Joanne Koong	7e567b8761	bpf: fix bpf_skb_pull_data documentation Fix documentation for bpf_skb_pull_data() helper for when len == 0. Fixes: fa15601ab31e ("bpf: add documentation for eBPF helpers (33-41)") Signed-off-by: Joanne Koong <joannelkoong@gmail.com> Acked-by: Quentin Monnet <quentin@isovalent.com> Acked-by: Martin KaFai Lau <kafai@fb.com> Link: https://lore.kernel.org/r/20220715193800.3940070-1-joannelkoong@gmail.com Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-07-31 16:45:48 -07:00
Andrii Nakryiko	1fe0248c61	libbpf: fallback to tracefs mount point if debugfs is not mounted Teach libbpf to fallback to tracefs mount point (/sys/kernel/tracing) if debugfs (/sys/kernel/debug/tracing) isn't mounted. Acked-by: Yonghong Song <yhs@fb.com> Suggested-by: Connor O'Brien <connoro@google.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/r/20220715185736.898848-1-andrii@kernel.org Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-07-31 16:45:48 -07:00
Andrii Nakryiko	0862e4e54d	libbpf: add ksyscall/kretsyscall sections support for syscall kprobes Add SEC("ksyscall")/SEC("ksyscall/<syscall_name>") and corresponding kretsyscall variants (for return kprobes) to allow users to kprobe syscall functions in kernel. These special sections allow to ignore complexities and differences between kernel versions and host architectures when it comes to syscall wrapper and corresponding __<arch>_sys_<syscall> vs __se_sys_<syscall> differences, depending on whether host kernel has CONFIG_ARCH_HAS_SYSCALL_WRAPPER (though libbpf itself doesn't rely on /proc/config.gz for detecting this, see BPF_KSYSCALL patch for how it's done internally). Combined with the use of BPF_KSYSCALL() macro, this allows to just specify intended syscall name and expected input arguments and leave dealing with all the variations to libbpf. In addition to SEC("ksyscall+") and SEC("kretsyscall+") add bpf_program__attach_ksyscall() API which allows to specify syscall name at runtime and provide associated BPF cookie value. At the moment SEC("ksyscall") and bpf_program__attach_ksyscall() do not handle all the calling convention quirks for mmap(), clone() and compat syscalls. It also only attaches to "native" syscall interfaces. If host system supports compat syscalls or defines 32-bit syscalls in 64-bit kernel, such syscall interfaces won't be attached to by libbpf. These limitations may or may not change in the future. Therefore it is recommended to use SEC("kprobe") for these syscalls or if working with compat and 32-bit interfaces is required. Tested-by: Alan Maguire <alan.maguire@oracle.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/r/20220714070755.3235561-5-andrii@kernel.org Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-07-31 16:45:48 -07:00
Andrii Nakryiko	fd6c9d906a	libbpf: improve BPF_KPROBE_SYSCALL macro and rename it to BPF_KSYSCALL Improve BPF_KPROBE_SYSCALL (and rename it to shorter BPF_KSYSCALL to match libbpf's SEC("ksyscall") section name, added in next patch) to use __kconfig variable to determine how to properly fetch syscall arguments. Instead of relying on hard-coded knowledge of whether kernel's architecture uses syscall wrapper or not (which only reflects the latest kernel versions, but is not necessarily true for older kernels and won't necessarily hold for later kernel versions on some particular host architecture), determine this at runtime by attempting to create perf_event (with fallback to kprobe event creation through tracefs on legacy kernels, just like kprobe attachment code is doing) for kernel function that would correspond to bpf() syscall on a system that has CONFIG_ARCH_HAS_SYSCALL_WRAPPER set (e.g., for x86-64 it would try '__x64_sys_bpf'). If host kernel uses syscall wrapper, syscall kernel function's first argument is a pointer to struct pt_regs that then contains syscall arguments. In such case we need to use bpf_probe_read_kernel() to fetch actual arguments (which we do through BPF_CORE_READ() macro) from inner pt_regs. But if the kernel doesn't use syscall wrapper approach, input arguments can be read from struct pt_regs directly with no probe reading. All this feature detection is done without requiring /proc/config.gz existence and parsing, and BPF-side helper code uses newly added LINUX_HAS_SYSCALL_WRAPPER virtual __kconfig extern to keep in sync with user-side feature detection of libbpf. BPF_KSYSCALL() macro can be used both with SEC("kprobe") programs that define syscall function explicitly (e.g., SEC("kprobe/__x64_sys_bpf")) and SEC("ksyscall") program added in the next patch (which are the same kprobe program with added benefit of libbpf determining correct kernel function name automatically). Kretprobe and kretsyscall (added in next patch) programs don't need BPF_KSYSCALL as they don't provide access to input arguments. Normal BPF_KRETPROBE is completely sufficient and is recommended. Tested-by: Alan Maguire <alan.maguire@oracle.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/r/20220714070755.3235561-4-andrii@kernel.org Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-07-31 16:45:48 -07:00
Andrii Nakryiko	d56d93baff	libbpf: generalize virtual __kconfig externs and use it for USDT Libbpf supports single virtual __kconfig extern currently: LINUX_KERNEL_VERSION. LINUX_KERNEL_VERSION isn't coming from /proc/kconfig.gz and is intead customly filled out by libbpf. This patch generalizes this approach to support more such virtual __kconfig externs. One such extern added in this patch is LINUX_HAS_BPF_COOKIE which is used for BPF-side USDT supporting code in usdt.bpf.h instead of using CO-RE-based enum detection approach for detecting bpf_get_attach_cookie() BPF helper. This allows to remove otherwise not needed CO-RE dependency and keeps user-space and BPF-side parts of libbpf's USDT support strictly in sync in terms of their feature detection. We'll use similar approach for syscall wrapper detection for BPF_KSYSCALL() BPF-side macro in follow up patch. Generally, currently libbpf reserves CONFIG_ prefix for Kconfig values and LINUX_ for virtual libbpf-backed externs. In the future we might extend the set of prefixes that are supported. This can be done without any breaking changes, as currently any __kconfig extern with unrecognized name is rejected. For LINUX_xxx externs we support the normal "weak rule": if libbpf doesn't recognize given LINUX_xxx extern but such extern is marked as __weak, it is not rejected and defaults to zero. This follows CONFIG_xxx handling logic and will allow BPF applications to opportunistically use newer libbpf virtual externs without breaking on older libbpf versions unnecessarily. Tested-by: Alan Maguire <alan.maguire@oracle.com> Reviewed-by: Alan Maguire <alan.maguire@oracle.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/r/20220714070755.3235561-2-andrii@kernel.org Signed-off-by: Alexei Starovoitov <ast@kernel.org>	2022-07-31 16:45:48 -07:00
Jon Doron	1648fa16b5	libbpf: perfbuf: Add API to get the ring buffer Add support for writing a custom event reader, by exposing the ring buffer. With the new API perf_buffer__buffer() you will get access to the raw mmaped()'ed per-cpu underlying memory of the ring buffer. This region contains both the perf buffer data and header (struct perf_event_mmap_page), which manages the ring buffer state (head/tail positions, when accessing the head/tail position it's important to take into consideration SMP). With this type of low level access one can implement different types of consumers here are few simple examples where this API helps with: 1. perf_event_read_simple is allocating using malloc, perhaps you want to handle the wrap-around in some other way. 2. Since perf buf is per-cpu then the order of the events is not guarnteed, for example: Given 3 events where each event has a timestamp t0 < t1 < t2, and the events are spread on more than 1 CPU, then we can end up with the following state in the ring buf: CPU[0] => [t0, t2] CPU[1] => [t1] When you consume the events from CPU[0], you could know there is a t1 missing, (assuming there are no drops, and your event data contains a sequential index). So now one can simply do the following, for CPU[0], you can store the address of t0 and t2 in an array (without moving the tail, so there data is not perished) then move on the CPU[1] and set the address of t1 in the same array. So you end up with something like: void **arr[] = [&t0, &t1, &t2], now you can consume it orderely and move the tails as you process in order. 3. Assuming there are multiple CPUs and we want to start draining the messages from them, then we can "pick" with which one to start with according to the remaining free space in the ring buffer. Signed-off-by: Jon Doron <jond@wiz.io> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20220715181122.149224-1-arilou@gmail.com	2022-07-31 16:45:48 -07:00
Anquan Wu	9b6f4eb157	libbpf: Fix the name of a reused map BPF map name is limited to BPF_OBJ_NAME_LEN. A map name is defined as being longer than BPF_OBJ_NAME_LEN, it will be truncated to BPF_OBJ_NAME_LEN when a userspace program calls libbpf to create the map. A pinned map also generates a path in the /sys. If the previous program wanted to reuse the map， it can not get bpf_map by name, because the name of the map is only partially the same as the name which get from pinned path. The syscall information below show that map name "process_pinned_map" is truncated to "process_pinned_". bpf(BPF_OBJ_GET, {pathname="/sys/fs/bpf/process_pinned_map", bpf_fd=0, file_flags=0}, 144) = -1 ENOENT (No such file or directory) bpf(BPF_MAP_CREATE, {map_type=BPF_MAP_TYPE_HASH, key_size=4, value_size=4,max_entries=1024, map_flags=0, inner_map_fd=0, map_name="process_pinned_",map_ifindex=0, btf_fd=3, btf_key_type_id=6, btf_value_type_id=10,btf_vmlinux_value_type_id=0}, 72) = 4 This patch check that if the name of pinned map are the same as the actual name for the first (BPF_OBJ_NAME_LEN - 1), bpf map still uses the name which is included in bpf object. Fixes: 26736eb9a483 ("tools: libbpf: allow map reuse") Signed-off-by: Anquan Wu <leiqi96@hotmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/OSZP286MB1725CEA1C95C5CB8E7CCC53FB8869@OSZP286MB1725.JPNP286.PROD.OUTLOOK.COM	2022-07-31 16:45:48 -07:00
Hengqi Chen	b3fe4be0b3	libbpf: Error out when binary_path is NULL for uprobe and USDT binary_path is a required non-null parameter for bpf_program__attach_usdt and bpf_program__attach_uprobe_opts. Check it against NULL to prevent coredump on strchr. Signed-off-by: Hengqi Chen <hengqi.chen@gmail.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20220712025745.2703995-1-hengqi.chen@gmail.com	2022-07-31 16:45:48 -07:00

... 6 7 8 9 10 ...

2131 Commits