git.puffer.fish Git - matthieu/frr.git/log
14 months ago zebra: Remove helper function if_nhg_dependents_is_empty
Donald Sharp [Thu, 8 Feb 2024 16:49:08 +0000 (11:49 -0500)]
zebra: Remove helper function if_nhg_dependents_is_empty

Let's just let the function figure this out.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
14 months ago zebra: Installation success should not set NHG as valid
Donald Sharp [Thu, 8 Feb 2024 16:35:31 +0000 (11:35 -0500)]
zebra: Installation success should not set NHG as valid

A nexthop group is first marked valid or invalid and then
installed, not installed and then marked valid.
This change just removes a bit of code that might be covering
up other problems that need to be sorted out.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
14 months ago zebra: Use switch when handling return from dplane for nhgs
Donald Sharp [Thu, 8 Feb 2024 16:30:30 +0000 (11:30 -0500)]
zebra: Use switch when handling return from dplane for nhgs

Convert the dplane results function for nhg's over to
using a switch for the result enum.  Let's specifically
call out the unexpected state and also set the nexthop
group as not installed when installation fails.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
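
The pattern this commit describes can be sketched roughly as follows. This is a minimal illustration and not the zebra code: the enum `op_result`, the struct `nhg_entry`, and the function `handle_nhg_result()` are hypothetical stand-ins for the dplane result enum and the nexthop group entry. The point is that a switch without a default case lets the compiler flag unhandled enum values, and that the unexpected state gets its own explicit branch.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical stand-in for the dplane result enum. */
enum op_result {
	OP_RESULT_QUEUED,
	OP_RESULT_SUCCESS,
	OP_RESULT_FAILURE,
};

/* Hypothetical stand-in for a nexthop group entry. */
struct nhg_entry {
	bool installed;
};

/*
 * Handle a result with a switch and no default case, so the compiler
 * (with -Wswitch) can warn about any enum value that is not handled.
 * Returns false when the result was not expected at this point.
 */
static bool handle_nhg_result(struct nhg_entry *nhe, enum op_result res)
{
	assert(nhe != NULL);

	switch (res) {
	case OP_RESULT_SUCCESS:
		nhe->installed = true;
		return true;
	case OP_RESULT_FAILURE:
		/* A failed install leaves the group marked as not installed. */
		nhe->installed = false;
		return true;
	case OP_RESULT_QUEUED:
		/* Unexpected in a results handler: call it out explicitly. */
		fprintf(stderr, "unexpected QUEUED result\n");
		return false;
	}

	return false;
}
```

Note that the sketch only touches the installed flag; whether the group is valid is decided elsewhere, in line with the neighbouring commits.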
14 months ago zebra: Conslidate zebra_nhg_set_valid|invalid functions
Donald Sharp [Wed, 7 Feb 2024 19:28:37 +0000 (14:28 -0500)]
zebra: Conslidate zebra_nhg_set_valid|invalid functions

Basically the same function two times.  Let's consolidate.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
14 months ago zebra: Fixup documentation in zebra_nhg.h
Donald Sharp [Thu, 8 Feb 2024 00:50:49 +0000 (19:50 -0500)]
zebra: Fixup documentation in zebra_nhg.h

The documentation was wrong.  Let's adjust it.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
14 months ago zebra: dplane_nexthop_add cannot return ZEBRA_DPLANE_REQUEST_SUCCESS
Donald Sharp [Thu, 8 Feb 2024 01:32:24 +0000 (20:32 -0500)]
zebra: dplane_nexthop_add cannot return ZEBRA_DPLANE_REQUEST_SUCCESS

When installing a NHG via dplane_nexthop_add, it can only return
REQUEST_QUEUED or REQUEST_FAILURE.  There is no way SUCCESS can
be returned with the way the dplane works at this point in time.
Remove the code that attempts to set the NHE state appropriately
as it is impossible.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
14 months ago Merge pull request #15286 from idryzhov/fix-mgmt-startup
Christian Hopps [Wed, 7 Feb 2024 21:04:11 +0000 (16:04 -0500)]
Merge pull request #15286 from idryzhov/fix-mgmt-startup

Fix conflict in mgmtd on startup

14 months ago Merge pull request #15317 from louis-6wind/fix-affinity-map-odr
Donatas Abraitis [Wed, 7 Feb 2024 20:42:36 +0000 (22:42 +0200)]
Merge pull request #15317 from louis-6wind/fix-affinity-map-odr

lib: fix affinity map duplicate

14 months ago vtysh: remove resync workaround when exiting to config node
Igor Ryzhov [Fri, 2 Feb 2024 23:15:46 +0000 (01:15 +0200)]
vtysh: remove resync workaround when exiting to config node

When exiting from a level below the config node, like `router rip`,
vtysh executes a resync by sending "end" and "conf term [file-lock]"
commands to all the daemons. As stated in the description comment, it's
done "in case one of the daemons is somewhere else". I don't think this
actually ever happens, but even if it does, it is a bug in a daemon that
needs to be fixed. This resync was okay before the introduction of
mgmtd, but now it unlocks and re-locks the datastores during the
configuration reading process, which can lead to the failure
explained in the previous commit.

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago mgmtd, vtysh: fix possible conflict when reading the config
Igor Ryzhov [Fri, 2 Feb 2024 22:42:58 +0000 (00:42 +0200)]
mgmtd, vtysh: fix possible conflict when reading the config

When FRR starts, after mgmtd is initialized, backend clients connect to
it and request their config. To supply the config, mgmtd creates a
configuration transaction. At the same time, `vtysh -b` tries to read
the startup config and configure mgmtd, which also creates a
configuration transaction. If these two actions happen at the exact same
time, there's a conflict between them, because only a single
configuration transaction is allowed. Because of that, vtysh fails and
the config is completely ignored.

When starting the config reading, vtysh locks the candidate and running
datastores in mgmtd. This commit adds locking of the running datastore
when initializing the backend client. This allows vtysh to retry locking
and read the config only once the lock is acquired, instead of
failing.

This change also prevents the running datastore from being changed during
initialization of backend clients. Such a change could lead to a
desynchronized state between mgmtd and backends.

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago lib: fix affinity map duplicate
Louis Scalbert [Wed, 7 Feb 2024 12:16:26 +0000 (13:16 +0100)]
lib: fix affinity map duplicate

Fix duplicate definition of frr_affinity_map_cli_info in libfrr.so.0 and
libmgmt_be_nb.so.0

> =================================================================
> ==3860488==ERROR: AddressSanitizer: odr-violation (0x7f12c98c4d20):
>   [1] size=296 'frr_affinity_map_cli_info' lib/affinitymap_cli.c:77:35
>   [2] size=296 'frr_affinity_map_cli_info' lib/affinitymap_cli.c:77:35
> These globals were registered at these points:
>   [1]:
>     #0 0x7f12c9a36f40 in __asan_register_globals ../../../../src/libsanitizer/asan/asan_globals.cpp:341
>     #1 0x7f12c9585b7d in _sub_I_00099_1 (/lib/libfrr.so.0+0x185b7d)
>     #2 0x7f12ca437fe1 in call_init elf/dl-init.c:72
>
>   [2]:
>     #0 0x7f12c9a36f40 in __asan_register_globals ../../../../src/libsanitizer/asan/asan_globals.cpp:341
>     #1 0x7f12c93824ed in _sub_I_00099_1 (/lib/libmgmt_be_nb.so.0+0x6f4ed)
>     #2 0x7f12ca437fe1 in call_init elf/dl-init.c:72
>
> ==3860488==HINT: if you don't care about these errors you may set ASAN_OPTIONS=detect_odr_violation=0
> SUMMARY: AddressSanitizer: odr-violation: global 'frr_affinity_map_cli_info' at lib/affinitymap_cli.c:77:35
> ==3860488==ABORTING

Fixes: dc6ff4c0de ("lib: convert affinity-map to mgmtd")
Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>
14 months ago Merge pull request #15314 from opensourcerouting/fix/remove_bgp_evpn_attr_get_df_pref
Donald Sharp [Wed, 7 Feb 2024 12:44:07 +0000 (07:44 -0500)]
Merge pull request #15314 from opensourcerouting/fix/remove_bgp_evpn_attr_get_df_pref

bgpd: A couple random EVPN findings

14 months ago Merge pull request #15305 from louis-6wind/label-dead-code
Donald Sharp [Tue, 6 Feb 2024 19:50:56 +0000 (14:50 -0500)]
Merge pull request #15305 from louis-6wind/label-dead-code

bgpd: remove dead label code in bgp_update

14 months ago Merge pull request #15283 from okda-networks/doc-update
Donald Sharp [Tue, 6 Feb 2024 16:42:02 +0000 (11:42 -0500)]
Merge pull request #15283 from okda-networks/doc-update

doc: sysrepo plugin doc update with test commands

14 months ago Merge pull request #15311 from louis-6wind/fix-show-srv6-sid
Donald Sharp [Tue, 6 Feb 2024 16:40:14 +0000 (11:40 -0500)]
Merge pull request #15311 from louis-6wind/fix-show-srv6-sid

bgpd: fix displaying srv6 sid

14 months ago Merge pull request #15301 from opensourcerouting/fix/fqdn_capability_dynamic
Donald Sharp [Tue, 6 Feb 2024 16:36:03 +0000 (11:36 -0500)]
Merge pull request #15301 from opensourcerouting/fix/fqdn_capability_dynamic

bgpd: Send FQDN capability via dynamic capability if enabled

14 months ago Merge pull request #15308 from donaldsharp/update_source_documentation
Mark Stapp [Tue, 6 Feb 2024 13:24:17 +0000 (08:24 -0500)]
Merge pull request #15308 from donaldsharp/update_source_documentation

doc: Document `update-source <IFNAME>` command better

14 months ago bgpd: remove dead label code in bgp_update
Louis Scalbert [Mon, 5 Feb 2024 14:43:45 +0000 (15:43 +0100)]
bgpd: remove dead label code in bgp_update

No need to init new_attr. It is not used until it is overridden.

> new_attr = *attr;

Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>
14 months ago zebra: Use uint16_t for DF preference
Donatas Abraitis [Tue, 6 Feb 2024 12:03:36 +0000 (14:03 +0200)]
zebra: Use uint16_t for DF preference

It's 16-bits everywhere, not sure why it was here as 32.

Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
14 months ago bgpd: Drop unused function bgp_evpn_attr_get_df_pref()
Donatas Abraitis [Tue, 6 Feb 2024 12:02:51 +0000 (14:02 +0200)]
bgpd: Drop unused function bgp_evpn_attr_get_df_pref()

Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
14 months ago Merge pull request #15287 from donaldsharp/rcu_doc
Donatas Abraitis [Tue, 6 Feb 2024 10:57:32 +0000 (12:57 +0200)]
Merge pull request #15287 from donaldsharp/rcu_doc

doc: Add some documentation around a new pthread call

14 months ago Merge pull request #15309 from donaldsharp/reorg_route_entry
Donatas Abraitis [Tue, 6 Feb 2024 10:52:25 +0000 (12:52 +0200)]
Merge pull request #15309 from donaldsharp/reorg_route_entry

zebra: Reorg `struct route_entry` to have important bits first

14 months ago bgpd: fix displaying srv6 sid
Louis Scalbert [Mon, 5 Feb 2024 15:34:27 +0000 (16:34 +0100)]
bgpd: fix displaying srv6 sid

98efa5bc6b ("bgpd: bgp_path_info_extra memory optimization") has removed
SID info from the extra structure.

Do not test for extra presence.

Fixes: 98efa5bc6b ("bgpd: bgp_path_info_extra memory optimization")
Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>
14 months ago Merge pull request #15294 from okda-networks/sysrepo_set_sess_err
Donald Sharp [Mon, 5 Feb 2024 19:59:45 +0000 (14:59 -0500)]
Merge pull request #15294 from okda-networks/sysrepo_set_sess_err

lib: set change errmsg in sysrepo session

14 months ago doc: Add some documentation around a new pthread call
Donald Sharp [Sat, 3 Feb 2024 13:52:38 +0000 (08:52 -0500)]
doc: Add some documentation around a new pthread call

Not necessarily the correct place for this but there
is no other place and it needs to be called out and I
would rather have some documentation in place.  Long
term I would like to add a bunch of frr_pthread documentation
but at this point in time it's not there.  We can
re-arrange when that happens.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
14 months ago zebra: Reorg `struct route_entry` to have important bits first
Donald Sharp [Mon, 5 Feb 2024 19:15:29 +0000 (14:15 -0500)]
zebra: Reorg `struct route_entry` to have important bits first

The `struct route_entry` had items that were almost never used at
the front of the data structure resulting in items that would be
loaded first into memory that were never used.  Let's reorg a
tiny bit and put all the frequently used items in the first cache
line.  I'm sure people will notice the .000000001 speedup.

new layout:

sharpd@eva /w/h/s/frr1 (reorg_route_entry)> /home/sharpd/pahole/build/pahole --reorganize --show_reorg_steps -C route_entry zebra/.libs/zebra
struct route_entry {
	struct re_list_item        next;                 /*     0     8 */
	struct nhg_hash_entry *    nhe;                  /*     8     8 */
	uint32_t                   nhe_id;               /*    16     4 */
	uint32_t                   nhe_installed_id;     /*    20     4 */
	int                        type;                 /*    24     4 */
	vrf_id_t                   vrf_id;               /*    28     4 */
	uint32_t                   table;                /*    32     4 */
	uint32_t                   metric;               /*    36     4 */
	uint32_t                   mtu;                  /*    40     4 */
	uint32_t                   nexthop_mtu;          /*    44     4 */
	uint32_t                   flags;                /*    48     4 */
	uint32_t                   status;               /*    52     4 */
	uint32_t                   dplane_sequence;      /*    56     4 */
	uint16_t                   instance;             /*    60     2 */
	uint8_t                    distance;             /*    62     1 */

	/* XXX 1 byte hole, try to pack */

	/* --- cacheline 1 boundary (64 bytes) --- */
	route_tag_t                tag;                  /*    64     4 */

	/* XXX 4 bytes hole, try to pack */

	time_t                     uptime;               /*    72     8 */
	struct re_opaque *         opaque;               /*    80     8 */
	struct nexthop_group       fib_ng;               /*    88    32 */
	struct nexthop_group       fib_backup_ng;        /*   120    32 */

	/* size: 152, cachelines: 3, members: 20 */
	/* sum members: 147, holes: 2, sum holes: 5 */
	/* last cacheline: 24 bytes */
};

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
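
The effect of member ordering on holes can be checked with a small experiment. The sketch below is not taken from zebra: `struct entry_scattered`, `struct entry_packed`, and the two helper functions are hypothetical, and only illustrate how grouping members by size shrinks the padding that pahole reports. On a typical LP64 platform such as x86-64 the first layout should come to 40 bytes and the second to 24, but the code only relies on the packed layout having less padding.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical struct: small members interleaved with pointers. */
struct entry_scattered {
	uint8_t distance;
	void *next;
	uint16_t instance;
	void *nhe;
	uint32_t flags;
};

/* Same members: pointers first, then the small fields grouped together. */
struct entry_packed {
	void *next;
	void *nhe;
	uint32_t flags;
	uint16_t instance;
	uint8_t distance;
};

/* Sum of the member sizes, identical for both layouts. */
#define ENTRY_MEMBER_BYTES                                                   \
	(sizeof(uint8_t) + sizeof(uint16_t) + sizeof(uint32_t) +             \
	 2 * sizeof(void *))

/* Padding bytes = total struct size minus the sum of the member sizes. */
static size_t scattered_padding(void)
{
	return sizeof(struct entry_scattered) - ENTRY_MEMBER_BYTES;
}

static size_t packed_padding(void)
{
	return sizeof(struct entry_packed) - ENTRY_MEMBER_BYTES;
}
```

Comparing `packed_padding()` with `scattered_padding()` gives the same kind of information as the "sum holes" line in the pahole output, without needing debug info.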
14 months ago doc: Document `update-source <IFNAME>` command better
Donald Sharp [Mon, 5 Feb 2024 16:55:07 +0000 (11:55 -0500)]
doc: Document `update-source <IFNAME>` command better

There was a question about how BGP chooses the IP address
for the source when the `update-source` command is used.
Upon looking at the code, I was a bit surprised,
so I decided to document this behavior.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
14 months ago Merge pull request #15291 from idryzhov/mgmtd-yang-embed
Donald Sharp [Mon, 5 Feb 2024 12:34:38 +0000 (07:34 -0500)]
Merge pull request #15291 from idryzhov/mgmtd-yang-embed

lib, mgmtd: fix missing embedded modules

14 months ago Merge pull request #15300 from idryzhov/mgmtd-cli-callbacks
Donald Sharp [Mon, 5 Feb 2024 12:33:51 +0000 (07:33 -0500)]
Merge pull request #15300 from idryzhov/mgmtd-cli-callbacks

lib, mgmtd: don't register NB config callbacks in mgmtd

14 months ago Merge pull request #15289 from idryzhov/coverity-fixes
Donald Sharp [Mon, 5 Feb 2024 12:32:58 +0000 (07:32 -0500)]
Merge pull request #15289 from idryzhov/coverity-fixes

Coverity fixes

14 months ago Merge pull request #15293 from idryzhov/mgmt-read-config
Donald Sharp [Mon, 5 Feb 2024 12:32:03 +0000 (07:32 -0500)]
Merge pull request #15293 from idryzhov/mgmt-read-config

mgmtd: disable lib code for config reading

14 months ago Merge pull request #15297 from idryzhov/mgmtd--n
Donald Sharp [Mon, 5 Feb 2024 12:31:11 +0000 (07:31 -0500)]
Merge pull request #15297 from idryzhov/mgmtd--n

mgmtd: fix missing -n flag and help

14 months ago Merge pull request #15295 from idryzhov/filter-autocompletion
Donatas Abraitis [Mon, 5 Feb 2024 10:43:43 +0000 (12:43 +0200)]
Merge pull request #15295 from idryzhov/filter-autocompletion

fixes for access-list/prefix-list autocompletion

14 months ago Merge pull request #15296 from idryzhov/doc-no
Donatas Abraitis [Mon, 5 Feb 2024 10:43:08 +0000 (12:43 +0200)]
Merge pull request #15296 from idryzhov/doc-no

doc: remove [no] from commands

14 months ago bgpd: Send dynamic capability when on/off FQDN capability
Donatas Abraitis [Mon, 5 Feb 2024 08:55:22 +0000 (10:55 +0200)]
bgpd: Send dynamic capability when on/off FQDN capability

Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
14 months ago bgpd: Do not reset the session if turning on/off FQDN capability
Donatas Abraitis [Mon, 5 Feb 2024 08:40:24 +0000 (10:40 +0200)]
bgpd: Do not reset the session if turning on/off FQDN capability

Allow BGP dynamic capabilities to handle this gracefully.

Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
14 months ago bgpd: Send FQDN capability via dynamic capability if enabled
Donatas Abraitis [Mon, 5 Feb 2024 06:22:46 +0000 (08:22 +0200)]
bgpd: Send FQDN capability via dynamic capability if enabled

Since we have a knob to disable sending FQDN capability, it MUST be checked
before sending it using dynamic capabilities.

Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
14 months ago staticd: coverity fixes
Igor Ryzhov [Sat, 3 Feb 2024 16:44:59 +0000 (18:44 +0200)]
staticd: coverity fixes

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago zebra: coverity fixes
Igor Ryzhov [Sat, 3 Feb 2024 16:44:48 +0000 (18:44 +0200)]
zebra: coverity fixes

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago mgmtd: embed frr-zebra-route-map module
Igor Ryzhov [Sat, 3 Feb 2024 20:29:40 +0000 (22:29 +0200)]
mgmtd: embed frr-zebra-route-map module

All used modules should be embedded to ensure that they work on all
operating systems.

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago lib: add ietf-yang-metadata to the list of built-in modules
Igor Ryzhov [Sat, 3 Feb 2024 20:27:49 +0000 (22:27 +0200)]
lib: add ietf-yang-metadata to the list of built-in modules

We don't need to manually load built-in modules. This fixes the
following warning in mgmtd:
```
YANG model "ietf-yang-metadata@*" "*@*"not embedded, trying external file
```

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago mgmtd: disable lib code for config reading
Igor Ryzhov [Sat, 3 Feb 2024 20:43:58 +0000 (22:43 +0200)]
mgmtd: disable lib code for config reading

mgmtd reads config files on its own, it doesn't need libfrr to do that.
The code is already skipped, because mgmtd uses `di->read_in` thread for
config reading and libfrr doesn't reschedule the thread, so this commit
just removes the dead code.

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago *: use af-specific autocompletion for prefix-lists when possible
Igor Ryzhov [Sat, 3 Feb 2024 21:57:56 +0000 (23:57 +0200)]
*: use af-specific autocompletion for prefix-lists when possible

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago lib: fix autocompletion for prefix-lists
Igor Ryzhov [Sat, 3 Feb 2024 21:57:44 +0000 (23:57 +0200)]
lib: fix autocompletion for prefix-lists

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago lib: fix autocompletion for access-lists
Igor Ryzhov [Sat, 3 Feb 2024 21:56:15 +0000 (23:56 +0200)]
lib: fix autocompletion for access-lists

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago doc: remove [no] from commands
Igor Ryzhov [Sat, 3 Feb 2024 23:16:52 +0000 (01:16 +0200)]
doc: remove [no] from commands

The rule is to document only positive versions of commands.

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago lib, mgmtd: don't register NB config callbacks in mgmtd
Igor Ryzhov [Sun, 4 Feb 2024 19:52:47 +0000 (21:52 +0200)]
lib, mgmtd: don't register NB config callbacks in mgmtd

mgmtd is supposed to only register CLI callbacks. If configuration
callbacks are registered, they are getting called on startup when mgmtd
reads config files, and they can use infrastructure that is not
initialized on mgmtd, or allocate some memory that is never freed.

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago mgmtd: fix missing -n flag and help
Igor Ryzhov [Sun, 4 Feb 2024 00:46:48 +0000 (02:46 +0200)]
mgmtd: fix missing -n flag and help

Only --vrfwnetns works right now, because -n was missing from the short
options. Also add the missing help.

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
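
The failure mode is easy to reproduce with plain `getopt_long()`. The sketch below is not the mgmtd option-handling code; the table `longopts` and the helper `parses_netns_flag()` are hypothetical and only show that a long option mapped to `'n'` is accepted, while `-n` itself is rejected unless `n` also appears in the short option string.

```c
#include <assert.h>
#include <getopt.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical option table: the long form maps to the value 'n'. */
static const struct option longopts[] = {
	{ "vrfwnetns", no_argument, NULL, 'n' },
	{ NULL, 0, NULL, 0 },
};

/*
 * Scan argv with the given short option string and report whether the
 * netns flag was recognised, in either its short or its long form.
 */
static bool parses_netns_flag(int argc, char *const argv[],
			      const char *shortopts)
{
	bool seen = false;
	int opt;

	optind = 1; /* restart scanning from the first argument */
	opterr = 0; /* keep getopt quiet about unknown options */

	while ((opt = getopt_long(argc, argv, shortopts, longopts, NULL)) !=
	       -1) {
		if (opt == 'n')
			seen = true;
	}

	return seen;
}
```

With an empty short option string, only `--vrfwnetns` is recognised; passing `"n"` makes `-n` work as well.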
14 months ago Merge pull request #15298 from donaldsharp/rcu_bites_the_bitey_thing
Igor Ryzhov [Sun, 4 Feb 2024 20:08:14 +0000 (22:08 +0200)]
Merge pull request #15298 from donaldsharp/rcu_bites_the_bitey_thing

Rcu bites the bitey thing

14 months ago bgpd: Prevent rpki from hooking multiple times into rcu code
Donald Sharp [Sun, 4 Feb 2024 03:35:31 +0000 (22:35 -0500)]
bgpd: Prevent rpki from hooking multiple times into rcu code

As far as I can tell, the rpki code creates a pthread that
is used to handle the i/o associated with talking to the
remote rpki server.  The problem that we are having is that
the rpki code in FRR wants to behave like FRR code and use
the zlog_XXX functions.  These functions all depend on
the RCU code, which is a bit picky (and rightly so!!!)
about being started up properly and shut down properly.

This commit is fixing the problem of shutdown.  From
playing with the rpki code, I was able to experimentally
determine that the rpki_create_socket callback function
can be called multiple times per pthread.  Additionally
I was able to clearly see multiple *different* pthreads
actually be created.  This leaves the possibility
that each time it is called it might be hooking into the
RCU code, which makes the RCU code unhappy on shutdown.

Let's address the issue by checking to see if this pthread
has already hooked into the RCU code or not.  If so
then don't do this again.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
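
One common way to get "hook in at most once per pthread" is a thread-local flag. The sketch below shows that idea only; it is not necessarily how this commit implements the check. The names `registrations`, `register_thread()`, and `on_create_socket()` are hypothetical, with `register_thread()` standing in for the real per-thread startup call.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical counter: how many times a thread hooked itself in. */
static atomic_int registrations;

/* One flag per thread: has this thread already been hooked in? */
static _Thread_local bool thread_registered;

/* Hypothetical stand-in for the real per-thread startup call. */
static void register_thread(void)
{
	atomic_fetch_add(&registrations, 1);
}

/*
 * Callback that may run several times on the same thread.
 * Returns true only when this call performed the registration.
 */
static bool on_create_socket(void)
{
	if (thread_registered)
		return false;

	register_thread();
	thread_registered = true;
	return true;
}
```

Because the flag is thread-local, a second pthread that runs the callback still registers itself once, while repeated calls on the same pthread become no-ops.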
14 months ago lib: Actually create the rcu and save it before using it
Donald Sharp [Sun, 4 Feb 2024 02:34:02 +0000 (21:34 -0500)]
lib: Actually create the rcu and save it before using it

In a non-controlled startup, the rcu data structures were
not being created until after logging could happen.  This
is bad.  Move it so that the rcu data structures are
created first, before logging (HA!) can happen.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
14 months ago lib: set change errmsg in sysrepo session
ali-aqrabawi [Sat, 3 Feb 2024 20:57:13 +0000 (23:57 +0300)]
lib: set change errmsg in sysrepo session

This will enable sysrepo-based CLI users to know what went wrong
when a config change fails.

Signed-off-by: ali-aqrabawi <aaqrabaw@okdanetworks.com>
14 months ago Merge pull request #15265 from louis-6wind/fix-rpki-logs
Donald Sharp [Sat, 3 Feb 2024 13:40:27 +0000 (08:40 -0500)]
Merge pull request #15265 from louis-6wind/fix-rpki-logs

bgpd,lib: fix logging from rpki_create_socket()

14 months ago Merge pull request #15192 from fdumontet6WIND/capa_nego
Donatas Abraitis [Sat, 3 Feb 2024 10:19:53 +0000 (12:19 +0200)]
Merge pull request #15192 from fdumontet6WIND/capa_nego

bgpd: add [no]neighbor capability fqdn

14 months ago Merge pull request #10151 from pguibert6WIND/ensure_routing_protocols_good_bw
Donald Sharp [Fri, 2 Feb 2024 17:51:35 +0000 (12:51 -0500)]
Merge pull request #10151 from pguibert6WIND/ensure_routing_protocols_good_bw

zebra: avoid having speed set to UINT32_MAX

14 months ago Merge pull request #15216 from donaldsharp/zebra_opaque_mem_leak
Jafar Al-Gharaibeh [Fri, 2 Feb 2024 16:54:20 +0000 (10:54 -0600)]
Merge pull request #15216 from donaldsharp/zebra_opaque_mem_leak

zebra: Fix opaque memory leak in rare situation

14 months ago Merge pull request #15279 from idryzhov/staticd-fixes
Donald Sharp [Fri, 2 Feb 2024 16:51:03 +0000 (11:51 -0500)]
Merge pull request #15279 from idryzhov/staticd-fixes

several fixes for staticd configuration

14 months ago Merge pull request #15278 from idryzhov/fix-no-vrf
Donatas Abraitis [Fri, 2 Feb 2024 13:23:49 +0000 (15:23 +0200)]
Merge pull request #15278 from idryzhov/fix-no-vrf

lib: fix "no vrf" command

14 months ago Merge pull request #15282 from donaldsharp/poll_info
Donatas Abraitis [Fri, 2 Feb 2024 11:09:49 +0000 (13:09 +0200)]
Merge pull request #15282 from donaldsharp/poll_info

lib: Warn operator when fd limit is set too large

14 months ago doc: sysrepo plugin doc update with tested commands
Amjad Daraiseh [Fri, 2 Feb 2024 06:12:15 +0000 (01:12 -0500)]
doc: sysrepo plugin doc update with tested commands

Signed-off-by: Amjad Daraiseh <adaraiseh@okdanetworks.com>
14 months ago doc: add neighbor PEER capability fqdn command
Francois Dumontet [Mon, 22 Jan 2024 13:29:48 +0000 (14:29 +0100)]
doc: add neighbor PEER capability fqdn command

improve bgp doc

Signed-off-by: Francois Dumontet <francois.dumontet@6wind.com>
14 months ago tests: improve topotest bgp_dont_capability_negotiate
Francois Dumontet [Mon, 22 Jan 2024 13:19:32 +0000 (14:19 +0100)]
tests: improve topotest bgp_dont_capability_negotiate

Add some steps to test support for the [no]neighbor capability fqdn
command.

Signed-off-by: Francois Dumontet <francois.dumontet@6wind.com>
14 months ago bgpd: add [no]neighbor capability fqdn command
Francois Dumontet [Mon, 22 Jan 2024 10:53:36 +0000 (11:53 +0100)]
bgpd: add [no]neighbor capability fqdn command

Cisco routers do not handle unsupported capabilities well.
When a Cisco router receives an unsupported capability, it resets the
negotiation without notifying the peer of the unmatched capability as
described in RFC2842.
Cisco suggests the use of
neighbor x.x.x.x capability fqdn
to avoid the use of fqdn in the open message.

This new command removes the use of the fqdn capability in the
open message with the peer "x.x.x.x".

Link: https://www.cisco.com/c/en/us/support/docs/ip/border-gateway-protocol-bgp/116189-problemsolution-technology-00.pdf
Signed-off-by: Francois Dumontet <francois.dumontet@6wind.com>
14 months ago bgpd: fix logging from rpki_create_socket()
Louis Scalbert [Fri, 2 Feb 2024 09:35:10 +0000 (10:35 +0100)]
bgpd: fix logging from rpki_create_socket()

Fix the following crash when logging from rpki_create_socket():

> #0  raise (sig=<optimized out>) at ../sysdeps/unix/sysv/linux/raise.c:50
> #1  0x00007f6e21723798 in core_handler (signo=6, siginfo=0x7f6e1e502ef0, context=0x7f6e1e502dc0) at lib/sigevent.c:248
> #2  <signal handler called>
> #3  __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:50
> #4  0x00007f6e2144e537 in __GI_abort () at abort.c:79
> #5  0x00007f6e2176348e in _zlog_assert_failed (xref=0x7f6e2180c920 <_xref.16>, extra=0x0) at lib/zlog.c:670
> #6  0x00007f6e216b1eda in rcu_read_lock () at lib/frrcu.c:294
> #7  0x00007f6e21762da8 in vzlog_notls (xref=0x0, prio=2, fmt=0x7f6e217afe50 "%s:%d: %s(): assertion (%s) failed", ap=0x7f6e1e504248) at lib/zlog.c:425
> #8  0x00007f6e217632fb in vzlogx (xref=0x0, prio=2, fmt=0x7f6e217afe50 "%s:%d: %s(): assertion (%s) failed", ap=0x7f6e1e504248) at lib/zlog.c:627
> #9  0x00007f6e217621f5 in zlog (prio=2, fmt=0x7f6e217afe50 "%s:%d: %s(): assertion (%s) failed") at lib/zlog.h:73
> #10 0x00007f6e21763596 in _zlog_assert_failed (xref=0x7f6e2180c920 <_xref.16>, extra=0x0) at lib/zlog.c:687
> #11 0x00007f6e216b1eda in rcu_read_lock () at lib/frrcu.c:294
> #12 0x00007f6e21762da8 in vzlog_notls (xref=0x7f6e21a50040 <_xref.68>, prio=4, fmt=0x7f6e21a4999f "getaddrinfo: debug", ap=0x7f6e1e504878) at lib/zlog.c:425
> #13 0x00007f6e217632fb in vzlogx (xref=0x7f6e21a50040 <_xref.68>, prio=4, fmt=0x7f6e21a4999f "getaddrinfo: debug", ap=0x7f6e1e504878) at lib/zlog.c:627
> #14 0x00007f6e21a3f774 in zlog_ref (xref=0x7f6e21a50040 <_xref.68>, fmt=0x7f6e21a4999f "getaddrinfo: debug") at ./lib/zlog.h:84
> #15 0x00007f6e21a451b2 in rpki_create_socket (_cache=0x55729149cc30) at bgpd/bgp_rpki.c:1337
> #16 0x00007f6e2120e7b7 in tr_tcp_open (tr_socket=0x5572914d1520) at rtrlib/rtrlib/transport/tcp/tcp_transport.c:111
> #17 0x00007f6e2120e212 in tr_open (socket=0x5572914b5e00) at rtrlib/rtrlib/transport/transport.c:16
> #18 0x00007f6e2120faa2 in rtr_fsm_start (rtr_socket=0x557290e17180) at rtrlib/rtrlib/rtr/rtr.c:130
> #19 0x00007f6e218b7ea7 in start_thread (arg=<optimized out>) at pthread_create.c:477
> #20 0x00007f6e21527a2f in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:95

rpki_create_socket() is a hook function called from the rtrlib library.
The issue arises because rtrlib initiates its own separate pthread in which
it runs the hook, which does not establish an FRR RCU context. Consequently,
this leads to failures in the logging mechanism that relies on RCU.

Initialize a new FRR pthread context from the rtrlib pthread with a
valid RCU context to allow logging from rpki_create_socket() and
dependent functions.

Link: https://github.com/FRRouting/frr/issues/15260
Fixes: a951752d4a ("bgpd: create cache server socket in vrf")
Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>
14 months ago lib: add ability to log from external pthread
Louis Scalbert [Fri, 2 Feb 2024 09:25:05 +0000 (10:25 +0100)]
lib: add ability to log from external pthread

External libraries can re-enter the FRR code through a hook function. A
crash occurs when logging from this hook function if the library has
initiated a new pthread, as the FRR RCU context is not initialized for
this thread.

Add the frr_pthread_non_controlled_startup() function to initialize a
valid RCU context within an FRR pthread context, originating from an
external pthread.

Signed-off-by: Louis Scalbert <louis.scalbert@6wind.com>
14 months ago lib: Warn operator when fd limit is set too large
Donald Sharp [Fri, 2 Feb 2024 02:18:50 +0000 (21:18 -0500)]
lib: Warn operator when fd limit is set too large

It's unlikely that an operator will ever set a fd
limit of over 100k.  Let's warn the operator that
things are in a bit of a wonky state.

Fixes: #15280
Signed-off-by: Donald Sharp <sharpd@nvidia.com>
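
A check of this kind could look like the sketch below. It is an illustration under stated assumptions, not the libfrr code: the threshold macro `FD_LIMIT_WARN_THRESHOLD` (set to the 100k figure from the commit text), the helper names, and the message wording are all hypothetical. Only `getrlimit()` and `RLIMIT_NOFILE` are standard POSIX.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>
#include <sys/resource.h>

/* Assumed threshold, taken from the "over 100k" figure in the text. */
#define FD_LIMIT_WARN_THRESHOLD 100000UL

/* An unlimited or very large fd limit is treated as suspicious. */
static bool fd_limit_is_suspicious(rlim_t limit)
{
	return limit == RLIM_INFINITY || limit > FD_LIMIT_WARN_THRESHOLD;
}

/*
 * Read the current RLIMIT_NOFILE soft limit and print a warning when it
 * looks too large. Returns true if a warning was printed.
 */
static bool warn_if_fd_limit_too_large(void)
{
	struct rlimit rl;

	if (getrlimit(RLIMIT_NOFILE, &rl) != 0)
		return false;

	if (!fd_limit_is_suspicious(rl.rlim_cur))
		return false;

	fprintf(stderr, "fd limit %llu is unusually large\n",
		(unsigned long long)rl.rlim_cur);
	return true;
}
```

Keeping the threshold test separate from the `getrlimit()` call makes the policy testable without changing the limits of the running process.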
14 months ago Merge pull request #15275 from opensourcerouting/fix/one_more_memory_optimization_att...
Donald Sharp [Fri, 2 Feb 2024 01:46:09 +0000 (20:46 -0500)]
Merge pull request #15275 from opensourcerouting/fix/one_more_memory_optimization_attr_struct

bgpd: One more attr struct memory optimization

14 months ago lib: fix "no vrf" command
Igor Ryzhov [Thu, 1 Feb 2024 22:18:30 +0000 (00:18 +0200)]
lib: fix "no vrf" command

Remove operational data check from CLI command. It never works in mgmtd
and it is not needed in backend daemons because it's done in
`lib_vrf_destroy` callback.

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago staticd: fix NB dependency hack
Igor Ryzhov [Thu, 1 Feb 2024 22:57:59 +0000 (00:57 +0200)]
staticd: fix NB dependency hack

Currently, staticd configuration is tightly coupled with VRF existence.
Because of that, it has to use a hack in NB infrastructure to create a
VRF configuration when at least one static route is configured for this
VRF. This hack is incompatible with mgmtd, because mgmtd doesn't execute
configuration callbacks. Because of that, the configuration may become
out of sync between mgmtd and staticd. There are two main cases:

1. Create static route in a VRF. The VRF data node will be created
   automatically in staticd by the NB hack, but not in mgmtd.
2. Delete VRF which has some static routes configured. The static route
   configuration will be deleted from staticd by the NB hack, but not
   from mgmtd.

To fix the problem, decouple configuration of static routes from VRF
configuration. Now it is possible to configure static routes even if the
VRF doesn't exist yet. Once the VRF is created, staticd applies all the
preconfigured routes.

This change also fixes the problem with static routes being preserved in
the system when staticd "control-plane-protocol" container is deleted
but the VRF is still configured.

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months ago staticd: fix nexthops without interface
Igor Ryzhov [Wed, 31 Jan 2024 12:10:09 +0000 (14:10 +0200)]
staticd: fix nexthops without interface

When interface is not set in "ip route" command, CLI passes "(null)" as
an interface name instead of an empty string. The actual code in turn
uses "nh->ifname[0] != 0" to check if the interface name was set. Fix
the problem by changing the "(null)" string into an empty string when
populating the nexthop structure.

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
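
The normalization described here can be sketched as below. This is not the staticd code: `struct nexthop_cfg`, `IFNAME_MAX`, and the two helpers are hypothetical names for illustration. Only the `"(null)"` placeholder and the `ifname[0] != 0` check come from the commit text.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

#define IFNAME_MAX 16 /* assumed buffer size for this sketch */

/* Hypothetical nexthop structure holding an interface name. */
struct nexthop_cfg {
	char ifname[IFNAME_MAX];
};

/*
 * Store the interface name, mapping a missing name or the "(null)"
 * placeholder to an empty string.
 */
static void nexthop_set_ifname(struct nexthop_cfg *nh, const char *ifname)
{
	assert(nh != NULL);

	if (ifname == NULL || strcmp(ifname, "(null)") == 0) {
		nh->ifname[0] = '\0';
		return;
	}

	strncpy(nh->ifname, ifname, sizeof(nh->ifname) - 1);
	nh->ifname[sizeof(nh->ifname) - 1] = '\0';
}

/* The check quoted in the commit text: is an interface name set? */
static bool nexthop_has_ifname(const struct nexthop_cfg *nh)
{
	return nh->ifname[0] != 0;
}
```

Normalizing at the point where the structure is populated means every later check of the form `nh->ifname[0] != 0` keeps working unchanged.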
14 months ago Revert "staticd: Accept full blackhole typed keywords for ip_route_cmd"
Igor Ryzhov [Tue, 30 Jan 2024 20:51:46 +0000 (22:51 +0200)]
Revert "staticd: Accept full blackhole typed keywords for ip_route_cmd"

This reverts commit 76b2bc97e73874d882d5cf021972cfca84656cef.

This change is wrong for several reasons:
- it is backwards incompatible - previously it was always possible to
  create blackhole/reject routes using shortened versions of the words
  and it suddenly became impossible if there's an interface in the
  system with the same name
- it uses operational data for validation which is prohibited
- it doesn't really solve the problem with inability to create routes
  using interface names like `bla` or `rej`

14 months agoMerge pull request #15276 from mjstapp/port_registry
Donald Sharp [Thu, 1 Feb 2024 21:07:11 +0000 (16:07 -0500)]
Merge pull request #15276 from mjstapp/port_registry

*: create a single registry of daemons' default port values

14 months agoMerge pull request #15272 from LabNConsulting/chopps/mgmtd-notif-test
Donald Sharp [Thu, 1 Feb 2024 19:56:07 +0000 (14:56 -0500)]
Merge pull request #15272 from LabNConsulting/chopps/mgmtd-notif-test

tests: add mgmtd backend notification test

14 months agoMerge pull request #15264 from opensourcerouting/fix/memory_optimization
Donald Sharp [Thu, 1 Feb 2024 19:55:18 +0000 (14:55 -0500)]
Merge pull request #15264 from opensourcerouting/fix/memory_optimization

bgpd: Optimize memory for rd_ip struct

14 months agoMerge pull request #15270 from LabNConsulting/chopps/fix-rpm-pkg
Donald Sharp [Thu, 1 Feb 2024 19:51:58 +0000 (14:51 -0500)]
Merge pull request #15270 from LabNConsulting/chopps/fix-rpm-pkg

redhat: add option to include mgmtd_testc test client

14 months agoMerge pull request #15266 from donaldsharp/packable
Russ White [Thu, 1 Feb 2024 19:23:58 +0000 (14:23 -0500)]
Merge pull request #15266 from donaldsharp/packable

2 memory optimizations

14 months ago*: create a single registry of daemons' default port values
Mark Stapp [Thu, 1 Feb 2024 16:40:02 +0000 (11:40 -0500)]
*: create a single registry of daemons' default port values

Create a single registry of default port values that daemons
are using. Most of these are vty ports, but there are some
others for features like ospfapi and zebra FPM.

Signed-off-by: Mark Stapp <mjs@labn.net>
14 months agobgpd: One more attr struct memory optimization
Donatas Abraitis [Thu, 1 Feb 2024 15:28:42 +0000 (17:28 +0200)]
bgpd: One more attr struct memory optimization

```
struct attr {
struct aspath *            aspath;               /*     0     8 */
struct community *         community;            /*     8     8 */
long unsigned int          refcnt;               /*    16     8 */
_uint64_t                  flag;                 /*    24     8 */
struct in_addr             nexthop;              /*    32     4 */
uint32_t                   med;                  /*    36     4 */
uint32_t                   local_pref;           /*    40     4 */
ifindex_t                  nh_ifindex;           /*    44     4 */
uint8_t                    nh_flags;             /*    48     1 */
uint8_t                    origin;               /*    49     1 */
uint8_t                    es_flags;             /*    50     1 */
uint8_t                    router_flag;          /*    51     1 */
uint8_t                    distance;             /*    52     1 */
uint8_t                    df_alg;               /*    53     1 */
uint16_t                   df_pref;              /*    54     2 */
enum pta_type              pmsi_tnl_type;        /*    56     4 */
uint32_t                   rmap_change_flags;    /*    60     4 */
/* --- cacheline 1 boundary (64 bytes) --- */
struct in6_addr            mp_nexthop_global;    /*    64    16 */
struct in6_addr            mp_nexthop_local;     /*    80    16 */
ifindex_t                  nh_lla_ifindex;       /*    96     4 */
mpls_label_t               label;                /*   100     4 */
struct ecommunity *        ecommunity;           /*   104     8 */
struct ecommunity *        ipv6_ecommunity;      /*   112     8 */
struct lcommunity *        lcommunity;           /*   120     8 */
/* --- cacheline 2 boundary (128 bytes) --- */
struct cluster_list *      cluster1;             /*   128     8 */
struct transit *           transit;              /*   136     8 */
struct in_addr             mp_nexthop_global_in; /*   144     4 */
struct in_addr             aggregator_addr;      /*   148     4 */
struct in_addr             originator_id;        /*   152     4 */
uint32_t                   weight;               /*   156     4 */
as_t                       aggregator_as;        /*   160     4 */
uint8_t                    mp_nexthop_len;       /*   164     1 */
uint8_t                    mp_nexthop_prefer_global; /*   165     1 */
uint8_t                    sticky;               /*   166     1 */
uint8_t                    default_gw;           /*   167     1 */
route_tag_t                tag;                  /*   168     4 */
uint32_t                   label_index;          /*   172     4 */
struct bgp_attr_srv6_vpn * srv6_vpn;             /*   176     8 */
struct bgp_attr_srv6_l3vpn * srv6_l3vpn;         /*   184     8 */
/* --- cacheline 3 boundary (192 bytes) --- */
struct bgp_attr_encap_subtlv * encap_subtlvs;    /*   192     8 */
struct bgp_attr_encap_subtlv * vnc_subtlvs;      /*   200     8 */
struct bgp_route_evpn      evpn_overlay;         /*   208    36 */
uint32_t                   mm_seqnum;            /*   244     4 */
uint32_t                   mm_sync_seqnum;       /*   248     4 */
struct ethaddr             rmac;                 /*   252     6 */
/* --- cacheline 4 boundary (256 bytes) was 2 bytes ago --- */
uint16_t                   encap_tunneltype;     /*   258     2 */
uint32_t                   rmap_table_id;        /*   260     4 */
uint32_t                   link_bw;              /*   264     4 */
esi_t                      esi;                  /*   268    10 */

/* XXX 2 bytes hole, try to pack */

uint32_t                   srte_color;           /*   280     4 */
enum nexthop_types_t       nh_type;              /*   284     4 */
enum blackhole_type        bh_type;              /*   288     4 */
uint32_t                   otc;                  /*   292     4 */
_uint64_t                  aigp_metric;          /*   296     8 */

/* size: 304, cachelines: 5, members: 54 */
/* sum members: 302, holes: 1, sum holes: 2 */
/* last cacheline: 48 bytes */
};   /* saved 8 bytes! */
```

Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
14 months agobgpd: Save memory when using bgp_path_info_extra and vnc
Donald Sharp [Wed, 31 Jan 2024 18:37:25 +0000 (13:37 -0500)]
bgpd: Save memory when using bgp_path_info_extra and vnc

The structure size of bgp_path_info_extra when compiled
with vnc is 184 bytes.  Reduce this size to 72 bytes
when vnc is compiled in but not necessarily turned
on.

With 2 full bgp feeds this saves approximately 100 MB
when compiling with vnc and not using vnc.

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
14 months agolib: Reduce size of hash_bucket by 8 bytes
Donald Sharp [Wed, 31 Jan 2024 16:33:36 +0000 (11:33 -0500)]
lib: Reduce size of hash_bucket by 8 bytes

pahole reports that hash_bucket has two 4-byte holes
in the data structure.  Let's reorganize this a bit
and save 8 bytes per hash_bucket instance.
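
As an illustration of how reordering members removes such holes, here is
a sketch with a hypothetical pair of structures. The member names and
order are assumptions, not copied from lib/hash.h, and the sizes in the
comments hold only on a typical LP64 ABI (8-byte pointers, 4-byte int).

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical "before" layout: each 4-byte member is followed by an
 * 8-byte-aligned pointer, leaving a 4-byte hole after it on LP64. */
struct example_bucket_before {
	int len;          /* 4 bytes, then a 4-byte hole */
	void *next;       /* 8 bytes */
	unsigned int key; /* 4 bytes, then a 4-byte hole */
	void *data;       /* 8 bytes */
};

/* Hypothetical "after" layout: pointers first, then the two 4-byte
 * members packed side by side, so no padding is needed on LP64. */
struct example_bucket_after {
	void *next;
	void *data;
	int len;
	unsigned int key;
};

/* Bytes saved per instance by the reordering on the current ABI. */
static size_t example_saved_bytes(void)
{
	return sizeof(struct example_bucket_before) -
	       sizeof(struct example_bucket_after);
}
```

On ABIs where pointers and int have the same size there are no holes to
begin with, and the two layouts have equal size.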

Signed-off-by: Donald Sharp <sharpd@nvidia.com>
14 months agotests: add mgmtd backend notification test
Christian Hopps [Thu, 1 Feb 2024 09:36:32 +0000 (04:36 -0500)]
tests: add mgmtd backend notification test

Signed-off-by: Christian Hopps <chopps@labn.net>
14 months agoMerge pull request #15268 from LabNConsulting/chopps/quieter-code
Igor Ryzhov [Thu, 1 Feb 2024 08:54:30 +0000 (10:54 +0200)]
Merge pull request #15268 from LabNConsulting/chopps/quieter-code

lowercase and localize debug/err macros

14 months agoredhat: add option to include mgmtd_testc test client
Christian Hopps [Thu, 1 Feb 2024 08:17:44 +0000 (03:17 -0500)]
redhat: add option to include mgmtd_testc test client

Signed-off-by: Christian Hopps <chopps@labn.net>
14 months agomgmtd: convert MGMTD_FE_CLIENT_DBG() to debug_fe_client()
Christian Hopps [Thu, 1 Feb 2024 01:06:14 +0000 (20:06 -0500)]
mgmtd: convert MGMTD_FE_CLIENT_DBG() to debug_fe_client()

Signed-off-by: Christian Hopps <chopps@labn.net>
14 months agomgmtd: convert MGMTD_BE_CLIENT_DBG() to debug_be_client()
Christian Hopps [Thu, 1 Feb 2024 01:01:41 +0000 (20:01 -0500)]
mgmtd: convert MGMTD_BE_CLIENT_DBG() to debug_be_client()

Signed-off-by: Christian Hopps <chopps@labn.net>
14 months agomgmtd: convert MGMT_FE_ADAPTER_DBG() to __dbg()
Christian Hopps [Thu, 1 Feb 2024 00:52:26 +0000 (19:52 -0500)]
mgmtd: convert MGMT_FE_ADAPTER_DBG() to __dbg()

Signed-off-by: Christian Hopps <chopps@labn.net>
14 months agomgmtd: convert MGMT_BE_ADAPTER_DBG() to __dbg()
Christian Hopps [Thu, 1 Feb 2024 00:50:23 +0000 (19:50 -0500)]
mgmtd: convert MGMT_BE_ADAPTER_DBG() to __dbg()

Signed-off-by: Christian Hopps <chopps@labn.net>
14 months agomgmtd: convert MGMT_DS_DBG() to __dbg()
Christian Hopps [Wed, 31 Jan 2024 00:53:38 +0000 (19:53 -0500)]
mgmtd: convert MGMT_DS_DBG() to __dbg()

Signed-off-by: Christian Hopps <chopps@labn.net>
14 months agomgmtd: convert MGMT_TXN_DBG() to __dbg()
Christian Hopps [Wed, 31 Jan 2024 00:50:52 +0000 (19:50 -0500)]
mgmtd: convert MGMT_TXN_DBG() to __dbg()

Signed-off-by: Christian Hopps <chopps@labn.net>
14 months agoMerge pull request #15258 from louis-6wind/fix-adj-in-attr
Donatas Abraitis [Wed, 31 Jan 2024 13:06:40 +0000 (15:06 +0200)]
Merge pull request #15258 from louis-6wind/fix-adj-in-attr

bgpd: fix attr comparison bgp_adj_in_set

14 months agoMerge pull request #15251 from qlyoung/fix-doc-nbcli-numlist
Igor Ryzhov [Wed, 31 Jan 2024 13:06:25 +0000 (15:06 +0200)]
Merge pull request #15251 from qlyoung/fix-doc-nbcli-numlist

doc: fix misformatted list in nbapi docs

14 months agoMerge pull request #15246 from idryzhov/mgmt-get-data-improvements
Christian Hopps [Wed, 31 Jan 2024 11:04:04 +0000 (06:04 -0500)]
Merge pull request #15246 from idryzhov/mgmt-get-data-improvements

mgmt get-data improvements

14 months agobgpd: Optimize memory for rd_ip struct
Donatas Abraitis [Wed, 31 Jan 2024 09:29:30 +0000 (11:29 +0200)]
bgpd: Optimize memory for rd_ip struct

```
struct rd_ip {
uint16_t                   type;                 /*     0     2 */
uint16_t                   val;                  /*     2     2 */
struct in_addr             ip;                   /*     4     4 */

/* size: 8, cachelines: 1, members: 3 */
/* last cacheline: 8 bytes */
};   /* saved 4 bytes! */
```

Signed-off-by: Donatas Abraitis <donatas@opensourcerouting.org>
14 months agotests: add tests for mgmt get-data with-defaults parameter
Igor Ryzhov [Mon, 29 Jan 2024 22:34:21 +0000 (00:34 +0200)]
tests: add tests for mgmt get-data with-defaults parameter

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months agomgmtd: add our own format definitions for frontend messages
Igor Ryzhov [Sun, 28 Jan 2024 23:05:14 +0000 (01:05 +0200)]
mgmtd: add our own format definitions for frontend messages

This allows people not familiar with libyang and FRR internals to use
the mgmtd FE API by looking only at the `mgmt_msg_native.h` header. We
still use the same values to avoid a lot of mapping code, and use static
asserts to ensure that any change doesn't slip by unnoticed.
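
The static-assert pattern can be sketched as below. The
`example_lib_format` enum is a stand-in for the third-party definitions
and the `EXAMPLE_MSG_FORMAT_*` macros stand in for our own; all of these
names and values are hypothetical and are not taken from libyang or
`mgmt_msg_native.h`.

```c
#include <assert.h>

/* Stand-in for an enum owned by a third-party library (hypothetical). */
enum example_lib_format {
	EXAMPLE_LIB_XML = 1,
	EXAMPLE_LIB_JSON = 2,
	EXAMPLE_LIB_BINARY = 3,
};

/* Our own public definitions, usable without the library header. */
#define EXAMPLE_MSG_FORMAT_XML 1
#define EXAMPLE_MSG_FORMAT_JSON 2
#define EXAMPLE_MSG_FORMAT_BINARY 3

/* Compilation fails if either side changes without the other. */
static_assert(EXAMPLE_MSG_FORMAT_XML == EXAMPLE_LIB_XML,
	      "XML format value mismatch");
static_assert(EXAMPLE_MSG_FORMAT_JSON == EXAMPLE_LIB_JSON,
	      "JSON format value mismatch");
static_assert(EXAMPLE_MSG_FORMAT_BINARY == EXAMPLE_LIB_BINARY,
	      "binary format value mismatch");

/* Because the values are identical, conversion is a plain cast and no
 * mapping table is needed. */
static enum example_lib_format example_to_lib_format(unsigned char wire)
{
	return (enum example_lib_format)wire;
}
```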

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months agomgmtd: fix printing an empty data tree
Igor Ryzhov [Sun, 28 Jan 2024 00:35:14 +0000 (02:35 +0200)]
mgmtd: fix printing an empty data tree

We don't need to create an actual tree to print an empty tree, libyang
handles NULL just fine. The actual problem is that `yang_dnode_new`
creates a tree by validating it, and the validation creates all implicit
default nodes. Therefore, when called with "with-default" flags, instead
of getting an empty tree, we get a tree with all top-level defaults set.

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months agomgmtd: add ability to choose datastore to get data from
Igor Ryzhov [Fri, 26 Jan 2024 01:08:55 +0000 (03:08 +0200)]
mgmtd: add ability to choose datastore to get data from

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months agomgmtd: add support for with-defaults parameter to get-data
Igor Ryzhov [Mon, 29 Jan 2024 20:50:26 +0000 (22:50 +0200)]
mgmtd: add support for with-defaults parameter to get-data

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>
14 months agoyang: add more ietf models
Igor Ryzhov [Thu, 25 Jan 2024 21:51:13 +0000 (23:51 +0200)]
yang: add more ietf models

These models are needed to use LYD_PRINT_WD_ALL_TAG flag of libyang.

Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>