From: Tomasz Duszynski <tduszynski@marvell.com>
To: <david.marchand@redhat.com>
Cc: <bruce.richardson@intel.com>, <dev@dpdk.org>,
<jerinj@marvell.com>, <mb@smartsharesystems.com>,
<tduszynski@marvell.com>, <thomas@monjalon.net>
Subject: Re: [PATCH v7 7/8] trace: add PMU
Date: Mon, 21 Jul 2025 12:24:57 +0200 [thread overview]
Message-ID: <20250721102457.2399936-1-tduszynski@marvell.com> (raw)
In-Reply-To: <CAJFAV8zDBgpEyg_GL-Pzh-5n5kGjYE6g2JedgZeoTgGiDN97OQ@mail.gmail.com>
> On Fri, Jun 27, 2025 at 5:41 PM Tomasz Duszynski <tduszynski@marvell.com> wrote:
> >
> > In order to profile app, one needs to store significant amount of samples
> > somewhere for an analysis later on.
> > Since trace library supports storing data in a CTF format,
> > lets take advantage of that and add a dedicated PMU tracepoint.
> >
> > Signed-off-by: Tomasz Duszynski <tduszynski@marvell.com>
> > ---
> > MAINTAINERS | 1 +
> > app/test/test_trace_perf.c | 10 ++++
> > doc/guides/prog_guide/profile_app.rst | 5 ++
> > doc/guides/prog_guide/trace_lib.rst | 31 ++++++++++
> > doc/guides/rel_notes/release_25_07.rst | 2 +
> > lib/eal/common/eal_common_trace.c | 5 +-
> > lib/eal/common/eal_common_trace_pmu.c | 38 ++++++++++++
> > lib/eal/common/eal_common_trace_points.c | 20 +++++++
> > lib/eal/common/eal_trace.h | 4 ++
> > lib/eal/common/meson.build | 1 +
> > lib/eal/include/rte_eal_trace.h | 16 +++++
> > lib/eal/include/rte_trace_point.h | 7 +++
> > lib/eal/include/rte_trace_point_register.h | 2 +
> > lib/eal/meson.build | 3 +
> > lib/meson.build | 2 +-
> > lib/pmu/pmu.c | 69 +++++++++++++++++++++-
> > lib/pmu/rte_pmu.h | 24 ++++++++
> > 17 files changed, 236 insertions(+), 4 deletions(-)
> > create mode 100644 lib/eal/common/eal_common_trace_pmu.c
> >
> > diff --git a/MAINTAINERS b/MAINTAINERS
> > index 0e9357f3a3..74cc8fc195 100644
> > --- a/MAINTAINERS
> > +++ b/MAINTAINERS
> > @@ -1850,6 +1850,7 @@ F: doc/guides/prog_guide/eventdev/dispatcher_lib.rst
> > PMU - EXPERIMENTAL
> > M: Tomasz Duszynski <tduszynski@marvell.com>
> > F: lib/pmu/
> > +F: lib/eal/common/eal_common_trace_pmu.c
> > F: app/test/test_pmu.c
> >
> > Job statistics
> > diff --git a/app/test/test_trace_perf.c b/app/test/test_trace_perf.c
> > index 8257cc02be..28f908ce40 100644
> > --- a/app/test/test_trace_perf.c
> > +++ b/app/test/test_trace_perf.c
> > @@ -114,6 +114,10 @@ worker_fn_##func(void *arg) \
> > #define GENERIC_DOUBLE rte_eal_trace_generic_double(3.66666)
> > #define GENERIC_STR rte_eal_trace_generic_str("hello world")
> > #define VOID_FP app_dpdk_test_fp()
> > +#ifdef RTE_LIB_PMU
> > +/* 0 corresponds first event passed via --trace= */
> > +#define READ_PMU rte_pmu_trace_read(0)
> > +#endif
> >
> > WORKER_DEFINE(GENERIC_VOID)
> > WORKER_DEFINE(GENERIC_U64)
> > @@ -122,6 +126,9 @@ WORKER_DEFINE(GENERIC_FLOAT)
> > WORKER_DEFINE(GENERIC_DOUBLE)
> > WORKER_DEFINE(GENERIC_STR)
> > WORKER_DEFINE(VOID_FP)
> > +#ifdef RTE_LIB_PMU
> > +WORKER_DEFINE(READ_PMU)
> > +#endif
> >
> > static void
> > run_test(const char *str, lcore_function_t f, struct test_data *data, size_t sz)
> > @@ -174,6 +181,9 @@ test_trace_perf(void)
> > run_test("double", worker_fn_GENERIC_DOUBLE, data, sz);
> > run_test("string", worker_fn_GENERIC_STR, data, sz);
> > run_test("void_fp", worker_fn_VOID_FP, data, sz);
> > +#ifdef RTE_LIB_PMU
> > + run_test("read_pmu", worker_fn_READ_PMU, data, sz);
> > +#endif
> >
> > rte_free(data);
> > return TEST_SUCCESS;
> > diff --git a/doc/guides/prog_guide/profile_app.rst b/doc/guides/prog_guide/profile_app.rst
> > index 2f47680d5d..362fd20143 100644
> > --- a/doc/guides/prog_guide/profile_app.rst
> > +++ b/doc/guides/prog_guide/profile_app.rst
> > @@ -42,6 +42,11 @@ Current implementation imposes certain limitations:
> > * EAL lcores must not share a CPU.
> > * Each EAL lcore measures the same group of events.
> >
> > +Alternatively tracing library can be used,
> > +which offers dedicated tracepoint ``rte_pmu_trace_read()``.
> > +
> > +Refer to :doc:`../prog_guide/trace_lib` for more details.
> > +
> >
> > Profiling on x86
> > ----------------
> > diff --git a/doc/guides/prog_guide/trace_lib.rst b/doc/guides/prog_guide/trace_lib.rst
> > index d9b17abe90..97158cce37 100644
> > --- a/doc/guides/prog_guide/trace_lib.rst
> > +++ b/doc/guides/prog_guide/trace_lib.rst
> > @@ -46,6 +46,7 @@ DPDK tracing library features
> > trace format and is compatible with ``LTTng``.
> > For detailed information, refer to
> > `Common Trace Format <https://diamon.org/ctf/>`_.
> > +- Support reading PMU events on ARM64 and x86-64 (Intel)
> >
> > How to add a tracepoint?
> > ------------------------
> > @@ -139,6 +140,36 @@ the user must use ``RTE_TRACE_POINT_FP`` instead of ``RTE_TRACE_POINT``.
> > ``RTE_TRACE_POINT_FP`` is compiled out by default and it can be enabled using
> > the ``enable_trace_fp`` option for meson build.
> >
> > +PMU tracepoint
> > +--------------
> > +
> > +Performance Monitoring Unit (PMU) event values can be read from hardware registers
> > +using the predefined ``rte_pmu_read`` tracepoint.
> > +
> > +Tracing is enabled via ``--trace`` EAL option by passing both expression
> > +matching PMU tracepoint name i.e ``lib.eal.pmu.read``
> > +and expression ``e=ev1[,ev2,...]`` matching particular events::
> > +
> > + --trace='.*pmu.read\|e=cpu_cycles,l1d_cache'
> > +
> > +Event names are available under ``/sys/bus/event_source/devices/PMU/events`` directory,
> > +where ``PMU`` is a placeholder for either a ``cpu`` or a directory containing ``cpus``.
> > +
> > +In contrary to other tracepoints this does not need any extra variables
> > +added to source files.
> > +Instead, caller passes index
> > +which follows the order of events specified via ``--trace`` parameter.
> > +In the following example, index ``0`` corresponds to ``cpu_cyclces``,
> > +while index ``1`` corresponds to ``l1d_cache``.
> > +
> > +.. code-block:: c
> > +
> > + rte_pmu_trace_read(0);
> > + rte_pmu_trace_read(1);
> > +
> > +PMU tracing support must be explicitly enabled
> > +using the ``enable_trace_fp`` option for Meson build.
> > +
> > Event record mode
> > -----------------
> >
> > diff --git a/doc/guides/rel_notes/release_25_07.rst b/doc/guides/rel_notes/release_25_07.rst
> > index 19be7740c4..3c55f93853 100644
> > --- a/doc/guides/rel_notes/release_25_07.rst
> > +++ b/doc/guides/rel_notes/release_25_07.rst
> > @@ -71,6 +71,8 @@ New Features
> >
> > Added a Performance Monitoring Unit (PMU) library which allows Linux applications
> > to perform self monitoring activities without depending on external utilities like perf.
> > + After integration with :doc:`../prog_guide/trace_lib`, data gathered from hardware counters
> > + can be stored in CTF format for further analysis.
> >
> > * **Added Mucse rnp net driver.**
> >
> > diff --git a/lib/eal/common/eal_common_trace.c b/lib/eal/common/eal_common_trace.c
> > index be1f78a68d..45e7f9aa56 100644
> > --- a/lib/eal/common/eal_common_trace.c
> > +++ b/lib/eal/common/eal_common_trace.c
> > @@ -75,8 +75,10 @@ eal_trace_init(void)
> > goto free_meta;
> >
> > /* Apply global configurations */
> > - STAILQ_FOREACH(arg, &trace.args, next)
> > + STAILQ_FOREACH(arg, &trace.args, next) {
> > trace_args_apply(arg->val);
> > + trace_pmu_args_apply(arg->val);
> > + }
> >
> > rte_trace_mode_set(trace.mode);
> >
> > @@ -92,6 +94,7 @@ eal_trace_init(void)
> > void
> > eal_trace_fini(void)
> > {
> > + trace_pmu_args_free();
> > trace_mem_free();
> > trace_metadata_destroy();
> > eal_trace_args_free();
> > diff --git a/lib/eal/common/eal_common_trace_pmu.c b/lib/eal/common/eal_common_trace_pmu.c
> > new file mode 100644
> > index 0000000000..3824904481
> > --- /dev/null
> > +++ b/lib/eal/common/eal_common_trace_pmu.c
> > @@ -0,0 +1,38 @@
> > +/* SPDX-License-Identifier: BSD-3-Clause
> > + * Copyright(C) 2025 Marvell International Ltd.
> > + */
> > +
> > +#include <rte_common.h>
> > +
> > +#include "eal_trace.h"
> > +
> > +#ifdef RTE_LIB_PMU
> > +
> > +#include <rte_pmu.h>
> > +
> > +void
> > +trace_pmu_args_apply(const char *arg)
> > +{
> > + static bool once;
> > +
> > + if (!once) {
> > + if (rte_pmu_init())
> > + return;
> > + once = true;
> > + }
> > +
> > + rte_pmu_add_events_by_pattern(arg);
> > +}
> > +
> > +void
> > +trace_pmu_args_free(void)
> > +{
> > + rte_pmu_fini();
> > +}
> > +
> > +#else /* !RTE_LIB_PMU */
> > +
> > +void trace_pmu_args_apply(const char *arg __rte_unused) { return; }
> > +void trace_pmu_args_free(void) { return; }
> > +
> > +#endif /* RTE_LIB_PMU */
> > diff --git a/lib/eal/common/eal_common_trace_points.c b/lib/eal/common/eal_common_trace_points.c
> > index 0903f3c639..ea90279d77 100644
> > --- a/lib/eal/common/eal_common_trace_points.c
> > +++ b/lib/eal/common/eal_common_trace_points.c
> > @@ -119,3 +119,23 @@ RTE_TRACE_POINT_REGISTER(rte_eal_trace_intr_enable,
> > lib.eal.intr.enable)
> > RTE_TRACE_POINT_REGISTER(rte_eal_trace_intr_disable,
> > lib.eal.intr.disable)
> > +
> > +#ifdef RTE_LIB_PMU
> > +RTE_EXPORT_EXPERIMENTAL_SYMBOL(__rte_pmu_trace_read, 25.07)
> > +RTE_TRACE_POINT_REGISTER(rte_pmu_trace_read,
> > + lib.pmu.read)
> > +#endif
> > +#ifdef RTE_EXEC_ENV_IS_WINDOWS
> > +/* gen-version-map.py script generates export symbol maps by scanning source files without
> > + * evaluating conditional compilation. Hence __rte_pmu_trace_read will be included the version map
> > + * even if library is not compiled.
> > + *
> > + * On Windows if msvc linker is used this leads to a hard link error
> > + * (LNK2001: unresolved external symbol) because msvc requires all symbols listed in the .def file
> > + * to be present in the object files.
> > + *
> > + * Other linkers, e.g: gnu ld or mingw ld, are more forgiving. They silently ignore symbols listed
> > + * in the map file if those symbols are not present in the binary.
> > + */
> > +rte_trace_point_t __rte_pmu_trace_read;
> > +#endif
>
> From a quick look, could you export this symbol from the PMU library itself?
Got caught up, but here is my take. It would likely make trace a dependency, but I believe the
dependency should be reversed. Also from my perspective this suggestion feels more like a
refactoring.
So unless I've misunderstood your point, I'd rater keep the current solution as is.
next prev parent reply other threads:[~2025-07-21 10:25 UTC|newest]
Thread overview: 85+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-06-16 6:53 [PATCH 0/6] lib/pmu: cleanups and trace integration Tomasz Duszynski
2025-06-16 6:53 ` [PATCH 1/6] lib/pmu: quiesce rte_pmu_read deprecation warning in chkincs Tomasz Duszynski
2025-06-16 6:53 ` [PATCH 2/6] lib/pmu: export only necessary arch headers Tomasz Duszynski
2025-06-16 6:53 ` [PATCH 3/6] lib/pmu: reimplement per-arch ops as callbacks Tomasz Duszynski
2025-06-16 7:03 ` Thomas Monjalon
2025-06-16 9:54 ` Tomasz Duszynski
2025-06-16 6:53 ` [PATCH 4/6] lib/pmu: use build system defined RTE_LIB_PMU macro Tomasz Duszynski
2025-06-16 7:08 ` Thomas Monjalon
2025-06-16 10:53 ` Tomasz Duszynski
2025-06-16 6:53 ` [PATCH 5/6] test/pmu: enable fast test Tomasz Duszynski
2025-06-16 6:53 ` [PATCH 6/6] trace: add PMU Tomasz Duszynski
2025-06-16 7:13 ` Thomas Monjalon
2025-06-16 9:49 ` Tomasz Duszynski
2025-06-16 10:32 ` Bruce Richardson
2025-06-16 13:18 ` Morten Brørup
2025-06-18 6:56 ` [PATCH v2 0/6] lib/pmu: cleanups and trace integration Tomasz Duszynski
2025-06-18 6:56 ` [PATCH v2 1/6] lib/pmu: quiesce rte_pmu_read deprecation warning in chkincs Tomasz Duszynski
2025-06-18 6:56 ` [PATCH v2 2/6] lib/pmu: export only necessary arch headers Tomasz Duszynski
2025-06-18 6:56 ` [PATCH v2 3/6] lib/pmu: reimplement per-arch ops as callbacks Tomasz Duszynski
2025-06-18 6:56 ` [PATCH v2 4/6] lib/pmu: use build system defined RTE_LIB_PMU macro Tomasz Duszynski
2025-06-18 6:56 ` [PATCH v2 5/6] test/pmu: enable fast test Tomasz Duszynski
2025-06-18 6:56 ` [PATCH v2 6/6] trace: add PMU Tomasz Duszynski
2025-06-18 7:16 ` Morten Brørup
2025-06-18 9:47 ` Thomas Monjalon
2025-06-18 10:28 ` Bruce Richardson
2025-06-18 11:30 ` Morten Brørup
2025-06-18 10:23 ` Tomasz Duszynski
2025-06-18 10:37 ` Morten Brørup
2025-06-20 12:05 ` [PATCH v3 0/7] lib/pmu: cleanups and trace integration Tomasz Duszynski
2025-06-20 12:05 ` [PATCH v3 1/7] lib/pmu: quiesce rte_pmu_read deprecation warning in chkincs Tomasz Duszynski
2025-06-20 12:05 ` [PATCH v3 2/7] lib/pmu: export only necessary arch headers Tomasz Duszynski
2025-06-20 12:05 ` [PATCH v3 3/7] lib/pmu: reimplement per-arch ops as callbacks Tomasz Duszynski
2025-06-20 12:05 ` [PATCH v3 4/7] lib/pmu: use build system defined RTE_LIB_PMU macro Tomasz Duszynski
2025-06-20 12:05 ` [PATCH v3 5/7] test/pmu: enable fast test Tomasz Duszynski
2025-06-20 12:05 ` [PATCH v3 6/7] trace: add PMU Tomasz Duszynski
2025-06-20 12:05 ` [PATCH v3 7/7] lib/pmu: fix out-of-bound access Tomasz Duszynski
2025-06-24 12:29 ` [PATCH v4 0/7] lib/pmu: cleanups and trace integration Tomasz Duszynski
2025-06-24 12:29 ` [PATCH v4 1/7] lib/pmu: export only necessary arch headers Tomasz Duszynski
2025-06-24 12:29 ` [PATCH v4 2/7] lib/pmu: reimplement per-arch ops as callbacks Tomasz Duszynski
2025-06-24 12:29 ` [PATCH v4 3/7] lib/pmu: do not try enabling perf counter access on arm64 Tomasz Duszynski
2025-06-24 12:29 ` [PATCH v4 4/7] lib/pmu: use build system defined RTE_LIB_PMU macro Tomasz Duszynski
2025-06-24 12:29 ` [PATCH v4 5/7] test/pmu: enable fast test Tomasz Duszynski
2025-06-24 12:29 ` [PATCH v4 6/7] trace: add PMU Tomasz Duszynski
2025-06-24 12:29 ` [PATCH v4 7/7] lib/pmu: fix out-of-bound access Tomasz Duszynski
2025-06-25 4:47 ` [PATCH v5 0/8] lib/pmu: cleanups and trace integration Tomasz Duszynski
2025-06-25 4:47 ` [PATCH v5 1/8] lib/pmu: quiesce rte_pmu_read deprecation warning in chkincs Tomasz Duszynski
2025-06-25 4:47 ` [PATCH v5 2/8] lib/pmu: export only necessary arch headers Tomasz Duszynski
2025-06-25 4:47 ` [PATCH v5 3/8] lib/pmu: reimplement per-arch ops as callbacks Tomasz Duszynski
2025-06-25 4:47 ` [PATCH v5 4/8] lib/pmu: do not try enabling perf counter access on arm64 Tomasz Duszynski
2025-06-25 4:47 ` [PATCH v5 5/8] lib/pmu: use build system defined RTE_LIB_PMU macro Tomasz Duszynski
2025-06-25 4:47 ` [PATCH v5 6/8] test/pmu: enable fast test Tomasz Duszynski
2025-06-25 4:47 ` [PATCH v5 7/8] trace: add PMU Tomasz Duszynski
2025-06-25 4:47 ` [PATCH v5 8/8] lib/pmu: fix out-of-bound access Tomasz Duszynski
2025-06-27 10:57 ` [PATCH v6 0/8] lib/pmu: cleanups and trace integration Tomasz Duszynski
2025-06-27 10:57 ` [PATCH v6 1/8] lib/pmu: quiesce rte_pmu_read deprecation warning in chkincs Tomasz Duszynski
2025-06-27 10:57 ` [PATCH v6 2/8] lib/pmu: export only necessary arch headers Tomasz Duszynski
2025-06-27 10:57 ` [PATCH v6 3/8] lib/pmu: reimplement per-arch ops as callbacks Tomasz Duszynski
2025-06-27 10:57 ` [PATCH v6 4/8] lib/pmu: do not try enabling perf counter access on arm64 Tomasz Duszynski
2025-06-27 10:57 ` [PATCH v6 5/8] lib/pmu: use build system defined RTE_LIB_PMU macro Tomasz Duszynski
2025-06-27 10:57 ` [PATCH v6 6/8] test/pmu: enable test Tomasz Duszynski
2025-06-27 10:57 ` [PATCH v6 7/8] trace: add PMU Tomasz Duszynski
2025-06-27 10:57 ` [PATCH v6 8/8] lib/pmu: fix out-of-bound access Tomasz Duszynski
2025-06-27 15:40 ` [PATCH v7 0/8] lib/pmu: cleanups and trace integration Tomasz Duszynski
2025-06-27 15:41 ` [PATCH v7 1/8] lib/pmu: quiesce rte_pmu_read deprecation warning in chkincs Tomasz Duszynski
2025-06-27 15:41 ` [PATCH v7 2/8] lib/pmu: export only necessary arch headers Tomasz Duszynski
2025-06-27 15:41 ` [PATCH v7 3/8] lib/pmu: reimplement per-arch ops as callbacks Tomasz Duszynski
2025-06-27 15:41 ` [PATCH v7 4/8] lib/pmu: do not try enabling perf counter access on arm64 Tomasz Duszynski
2025-06-27 15:41 ` [PATCH v7 5/8] lib/pmu: use build system defined RTE_LIB_PMU macro Tomasz Duszynski
2025-06-27 15:41 ` [PATCH v7 6/8] test/pmu: enable test Tomasz Duszynski
2025-06-27 15:41 ` [PATCH v7 7/8] trace: add PMU Tomasz Duszynski
2025-07-01 13:33 ` David Marchand
2025-07-21 10:24 ` Tomasz Duszynski [this message]
2025-07-21 10:45 ` Thomas Monjalon
2025-07-22 10:10 ` Morten Brørup
2025-07-22 11:06 ` Tomasz Duszynski
2025-06-27 15:41 ` [PATCH v7 8/8] lib/pmu: fix out-of-bound access Tomasz Duszynski
2025-07-22 12:00 ` [PATCH v8 0/8] lib/pmu: cleanups and trace integration Tomasz Duszynski
2025-07-22 12:00 ` [PATCH v8 1/8] lib/pmu: quiesce rte_pmu_read deprecation warning in chkincs Tomasz Duszynski
2025-07-22 12:00 ` [PATCH v8 2/8] lib/pmu: export only necessary arch headers Tomasz Duszynski
2025-07-22 12:00 ` [PATCH v8 3/8] lib/pmu: reimplement per-arch ops as callbacks Tomasz Duszynski
2025-07-22 12:00 ` [PATCH v8 4/8] lib/pmu: do not try enabling perf counter access on arm64 Tomasz Duszynski
2025-07-22 12:00 ` [PATCH v8 5/8] lib/pmu: use build system defined RTE_LIB_PMU macro Tomasz Duszynski
2025-07-22 12:00 ` [PATCH v8 6/8] test/pmu: enable test Tomasz Duszynski
2025-07-22 12:00 ` [PATCH v8 7/8] trace: add PMU Tomasz Duszynski
2025-07-22 12:00 ` [PATCH v8 8/8] lib/pmu: fix out-of-bound access Tomasz Duszynski
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20250721102457.2399936-1-tduszynski@marvell.com \
--to=tduszynski@marvell.com \
--cc=bruce.richardson@intel.com \
--cc=david.marchand@redhat.com \
--cc=dev@dpdk.org \
--cc=jerinj@marvell.com \
--cc=mb@smartsharesystems.com \
--cc=thomas@monjalon.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).