Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 09 Nov 2020 22:28:56 +0000
From:      Alexander V. Chernikov <melifaro@ipfw.ru>
To:        John-Mark Gurney <jmg@funkthat.com>
Cc:        freebsd-arch <freebsd-arch@freebsd.org>
Subject:   Re: Versioning support for kernel<>userland sysctl interface
Message-ID:  <428251604959994@mail.yandex.ru>
In-Reply-To: <20201102221330.GS31099@funkthat.com>
References:  <356181604233241@mail.yandex.ru> <20201102221330.GS31099@funkthat.com>

next in thread | previous in thread | raw e-mail | index | archive | help
02.11.2020, 22:13, "John-Mark Gurney" <jmg@funkthat.com>:
> Alexander V. Chernikov wrote this message on Sun, Nov 01, 2020 at 12:47 +0000:
>>  I would like to propose a change [1] that introduces versioning support for the data structures exposed to userland by sysctl interface.
>>
>>  We have dozens of interfaces exposing various statistics and control data by filling in and exporting structures.
>>  net.inet6.icmp6.stats or net.inet6.icmp6.nd6_prlist can be a good examples of such interaction.
>
> We also need to decide the policy on dealing w/ support for these
> data structures going forward... Because if we do the simple, default
> policy of all userland apps can handle all structures, and kernel can
> produce all structures, we now have an unbounded growth of complexity
> and testing...
I totally agree. While backward compatibility is important, it should not impose notable technical debt. I had the following as my mental model:
* the code should be organised to support output for the latest version.
* There should be a separate, isolatable, piece of code that converts from latest to n-1 (which can be chained: from n-1 to n-2 and so on)
* when introducing changes we should garden older versions by COMPAT_X defines.
> I do understand the desire to solve this problem, but IMO, this solution
> is too simple, and dangerous to unbounded growth above.
>
> While I do like it's simplicity, one idea that I've had, while being a
> bit more complex, has the ability to handle modification in a more
> compatible way.
>
> Since we have dtrace, one of the outputs of dtrace is ctf, which allows
> use to convey the type and structure information in a machine parseable
> format. The idea is that each sysctl oid (that supports this) would
> have the ability to fetch the ctf data for that oid. The userland would
> then be able to convert the members to the local members of a similar
> struct. A set of defaults could also be provided, allowing new fields
> to have sane initial values.
>
> As long as the name of a structure member is never reused for a different
> meaning, this will get us most of the way there, in a much cleaner
> method...
>
> I do realize that this isn't the easiest thing, but the tools to do this
> are in the tree, and would solve this problem, IMO, in a way that is a
> lot more maintainable, and long term than the current proposal.
>
> Other solution, use ctf data to produce nvlist generation/consumption
> code for a structure... The data transfered would be larger, but also
> more compatible...
I do like idea on the self-documenting approach. It addresses append-only case nicely, but that's not always the case.
For example, in the initially-discussed icmp6 stats we have 256 64-bit counters representing icmp6 protocol historgram, resulting in 4k frame being allocated on stack for the current kernel implementation. If in the future our icmp6 kernel implementation changes and we won't be able to provide this counters, eventually we would want to remove all these counters from the structure. I'm not sure how can this be addressed without some sort of versioning scheme.

> Overall, using bare structures is an ABI compatibility nightmare that
> should be fixed in a better method.
>
> --
>   John-Mark Gurney Voice: +1 415 225 5579
>
>      "All that I will do, has been done, All that I have, has not."
> _______________________________________________
> freebsd-arch@freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-arch
> To unsubscribe, send any mail to "freebsd-arch-unsubscribe@freebsd.org"



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?428251604959994>