From: Jan Viktorin <viktorin@rehivetech.com>
To: Jerin Jacob <jerin.jacob@caviumnetworks.com>
Cc: dev@dpdk.org
Subject: Re: [dpdk-dev] [PATCH 2/3] arm64: acl: add neon based acl implementation
Date: Mon, 2 Nov 2015 18:31:56 +0100 [thread overview]
Message-ID: <20151102183156.440ca54c@pcviktorin.fit.vutbr.cz> (raw)
In-Reply-To: <20151102161954.GB1869@localhost.localdomain>
On Mon, 2 Nov 2015 21:49:54 +0530
Jerin Jacob <jerin.jacob@caviumnetworks.com> wrote:
> On Mon, Nov 02, 2015 at 04:39:37PM +0100, Jan Viktorin wrote:
> > On Mon, 2 Nov 2015 19:48:40 +0530
> > Jerin Jacob <jerin.jacob@caviumnetworks.com> wrote:
> >
> > > Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
> > > ---
> > > app/test-acl/main.c | 4 +
> > > lib/librte_acl/Makefile | 5 +
> > > lib/librte_acl/acl.h | 4 +
> > > lib/librte_acl/acl_run_neon.c | 46 +++++++
> > > lib/librte_acl/acl_run_neon.h | 290 ++++++++++++++++++++++++++++++++++++++++++
> > > lib/librte_acl/rte_acl.c | 25 ++++
> > > lib/librte_acl/rte_acl.h | 1 +
> > > 7 files changed, 375 insertions(+)
> > > create mode 100644 lib/librte_acl/acl_run_neon.c
> > > create mode 100644 lib/librte_acl/acl_run_neon.h
> > >
> > > diff --git a/app/test-acl/main.c b/app/test-acl/main.c
> > > index 72ce83c..0b0c093 100644
> > > --- a/app/test-acl/main.c
> > > +++ b/app/test-acl/main.c
> > > @@ -101,6 +101,10 @@ static const struct acl_alg acl_alg[] = {
> > > .name = "avx2",
> > > .alg = RTE_ACL_CLASSIFY_AVX2,
> > > },
> > > + {
> > > + .name = "neon",
> > > + .alg = RTE_ACL_CLASSIFY_NEON,
> > > + },
> > > };
> > >
> > > static struct {
> > > diff --git a/lib/librte_acl/Makefile b/lib/librte_acl/Makefile
> > > index 7a1cf8a..27f91d5 100644
> > > --- a/lib/librte_acl/Makefile
> > > +++ b/lib/librte_acl/Makefile
> > > @@ -48,9 +48,14 @@ SRCS-$(CONFIG_RTE_LIBRTE_ACL) += rte_acl.c
> > > SRCS-$(CONFIG_RTE_LIBRTE_ACL) += acl_bld.c
> > > SRCS-$(CONFIG_RTE_LIBRTE_ACL) += acl_gen.c
> > > SRCS-$(CONFIG_RTE_LIBRTE_ACL) += acl_run_scalar.c
> > > +ifeq ($(CONFIG_RTE_ARCH_ARM64),y)
> > > +SRCS-$(CONFIG_RTE_LIBRTE_ACL) += acl_run_neon.c
> >
> > Are the used NEON instrinsics for ACL ARMv8-specific? If so, the file should be named
> > something like acl_run_neonv8.c...
> >
>
> Yes, bit of armv8 specific, looks like vqtbl1q_u8 NEON instrinsics
> defined only in armv8. I could rename to acl_run_neonv8.c but keeping
> as acl_run_neon.c, may in future it can be extend to armv7 also.
> I am open to any decision, let me know your views.
OK, this sounds reasonable. Leave it as it is.
>
> > > +else
> > > SRCS-$(CONFIG_RTE_LIBRTE_ACL) += acl_run_sse.c
> > > +endif
> > >
> > > CFLAGS_acl_run_sse.o += -msse4.1
> > > +CFLAGS_acl_run_neon.o += -flax-vector-conversions -Wno-maybe-uninitialized
> >
> > From man gcc:
> >
> > -flax-vector-conversions
> > Allow implicit conversions between vectors with differing numbers of elements and/or
> > incompatible element types. This option should not be used for new code.
> >
> > I've already pointed to this in the Dave's ARMv8 patchset. They dropped it silently.
> > What is the purpose? Is it necessary?
>
> Yes, the same tr hi value we can representing as unsigned and signed
> based on it DFA or QRANGE .
I don't understand your answer. What is "tr hi"? What means DFA and
QRANGE here?
I just wanted to point to the note: "This option should not be used for
new code."
Jan
>
>
> >
> > Jan
> >
> > >
> > > #
> > > # If the compiler supports AVX2 instructions,
> > > diff --git a/lib/librte_acl/acl.h b/lib/librte_acl/acl.h
> > > index eb4930c..09d6784 100644
> > > --- a/lib/librte_acl/acl.h
> > > +++ b/lib/librte_acl/acl.h
> > > @@ -230,6 +230,10 @@ int
> > > rte_acl_classify_avx2(const struct rte_acl_ctx *ctx, const uint8_t **data,
> > > uint32_t *results, uint32_t num, uint32_t categories);
> > >
> > --snip--
> >
> > --
> > Jan Viktorin E-mail: Viktorin@RehiveTech.com
> > System Architect Web: www.RehiveTech.com
> > RehiveTech
> > Brno, Czech Republic
--
Jan Viktorin E-mail: Viktorin@RehiveTech.com
System Architect Web: www.RehiveTech.com
RehiveTech
Brno, Czech Republic
next prev parent reply other threads:[~2015-11-02 17:34 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-11-02 14:18 [dpdk-dev] [PATCH 0/3] ARM64: NEON ACL implementation Jerin Jacob
2015-11-02 14:18 ` [dpdk-dev] [PATCH 1/3] arm: ret_vector.h improvements Jerin Jacob
2015-11-02 14:18 ` [dpdk-dev] [PATCH 2/3] arm64: acl: add neon based acl implementation Jerin Jacob
2015-11-02 14:18 ` [dpdk-dev] [PATCH 3/3] arm64: defconfig: enabled CONFIG_RTE_LIBRTE_ACL Jerin Jacob
2015-11-02 15:39 ` [dpdk-dev] [PATCH 2/3] arm64: acl: add neon based acl implementation Jan Viktorin
2015-11-02 16:19 ` Jerin Jacob
2015-11-02 17:31 ` Jan Viktorin [this message]
2015-11-02 16:54 ` Ananyev, Konstantin
2015-11-03 4:30 ` Jerin Jacob
2015-11-03 10:23 ` Ananyev, Konstantin
2015-11-03 10:35 ` Jan Viktorin
2015-11-03 13:20 ` Ananyev, Konstantin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20151102183156.440ca54c@pcviktorin.fit.vutbr.cz \
--to=viktorin@rehivetech.com \
--cc=dev@dpdk.org \
--cc=jerin.jacob@caviumnetworks.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).