[pve-devel] [PATCH v4 pve-storage 05/16] cephconfig: align our parser more with Ceph's parser
Fabian Grünbichler
f.gruenbichler at proxmox.com
Tue Mar 19 10:38:35 CET 2024
On March 5, 2024 4:07 pm, Max Carrara wrote:
> 1. Comments, irrespective of whether they start with '#' or ';' are
> now treated the same. Otherwise, sections and key-value pairs with
> a trailing comment starting with ';' are still parsed. Consider
> this example:
>
> [some.section] # inline comment after section
> foo = bar ; inline comment after value
>
> The '[some.section]' section in the example above would otherwise
> not be parsed at all, while in the key-value definition 'foo'
> parses as the key, which is correct, but 'bar ; inline comment
> after value' parses as value, which is incorrect according to
> Ceph's grammar [0][1].
>
> 2. Sections may now contain any character, including whitespace, but
> not '\n' or a comment literal '#' or ';'. The case for comment
> literals is handled in 1. above.
>
> 3. Instead of treating '-', '_' and ' ' as the same, only '_' and ' '
> are treated the same, like in Ceph's parser [2].
>
> 4. Although not crucial for Ceph, our parser now also supports empty
> sections. When a section header is successfully parsed, it gets
> added to the configuration hash and the parser continues operating
> on the next line.
>
> [0]: https://git.proxmox.com/?p=ceph.git;a=blob;f=ceph/src/common/ConfUtils.cc;h=2f78fd02bf9e27467275752e6f3bca0c5e3946ce;hb=refs/heads/master#l178
> [1]: https://git.proxmox.com/?p=ceph.git;a=blob;f=ceph/src/common/ConfUtils.cc;h=2f78fd02bf9e27467275752e6f3bca0c5e3946ce;hb=refs/heads/master#l194
> [2]: https://git.proxmox.com/?p=ceph.git;a=blob;f=ceph/src/common/ConfUtils.cc;h=2f78fd02bf9e27467275752e6f3bca0c5e3946ce;hb=refs/heads/master#l294
>
> Signed-off-by: Max Carrara <m.carrara at proxmox.com>
> ---
> Changes v1 --> v2:
> * new
> Changes v2 --> v3:
> * support comment literals (4.)
> Changes v3 --> v4:
> * support empty sections
> * fix and move support for comment literals to separate patch
>
> src/PVE/CephConfig.pm | 21 ++++++++++++++-------
> 1 file changed, 14 insertions(+), 7 deletions(-)
>
> diff --git a/src/PVE/CephConfig.pm b/src/PVE/CephConfig.pm
> index 6b10d46..74a92eb 100644
> --- a/src/PVE/CephConfig.pm
> +++ b/src/PVE/CephConfig.pm
> @@ -10,6 +10,8 @@ cfs_register_file('ceph.conf',
> \&parse_ceph_config,
> \&write_ceph_config);
>
> +# For more details on how Ceph's config parser works, see:
> +# https://git.proxmox.com/?p=ceph.git;a=blob;f=ceph/src/common/ConfUtils.cc;h=2f78fd02bf9e27467275752e6f3bca0c5e3946ce;hb=refs/heads/master
> sub parse_ceph_config {
> my ($filename, $raw) = @_;
>
> @@ -20,14 +22,18 @@ sub parse_ceph_config {
>
> my $section;
>
> - foreach my $line (@lines) {
> - $line =~ s/#.*$//;
> + for my $line (@lines) {
> + $line =~ s/(?<!\\)(#|;).*$//;
nit: [#;] instead of the group works as well, and is a bit more
idiomatic IMHO.
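
for illustration, a minimal sketch comparing the two spellings. the helper names `strip_group` and `strip_class` are hypothetical and only exist for this comparison; the first regex is the one from the patch, the second is the suggested character-class form:

```perl
use strict;
use warnings;

# Hypothetical helpers, only for comparing the two spellings.

# Regex as written in the patch: alternation inside a capture group.
sub strip_group {
    my ($line) = @_;
    $line =~ s/(?<!\\)(#|;).*$//;
    return $line;
}

# Suggested form: a character class, no capture group needed.
sub strip_class {
    my ($line) = @_;
    $line =~ s/(?<!\\)[#;].*$//;
    return $line;
}

for my $line ('foo = bar ; comment', '[some.section] # comment', 'foo = a\;b') {
    printf "%-26s => '%s' / '%s'\n", $line, strip_group($line), strip_class($line);
}
```

both variants should strip the same text; the character class just avoids an unused capture group.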
so this seems to agree with the parser grammar, but it kind of disagrees
with the docs that for example state:
> mon_host
>
> This is a list of IP addresses or hostnames that are separated by commas, whitespace, or semicolons.
https://docs.ceph.com/en/latest/rados/configuration/ceph-conf/#confval-mon_host
well, they don't mention there that you need to escape the `;` for that
to work, else only the first "entry" takes, and the rest is silently
interpreted as comment. if you quote the "list", then it's even a parse
error since the closing quote is interpreted as part of the comment ;)
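
a small sketch of what that means for our side, assuming the comment and whitespace handling from the patch. `strip_comment` is a hypothetical wrapper and the addresses are made up:

```perl
use strict;
use warnings;

# Hypothetical wrapper around the substitutions used in the patch.
sub strip_comment {
    my ($line) = @_;
    $line =~ s/(?<!\\)(#|;).*$//;   # drop everything from an unescaped '#' or ';'
    $line =~ s/^\s+//;              # trim leading whitespace
    $line =~ s/\s+$//;              # trim trailing whitespace
    return $line;
}

# Unescaped ';' separators: only the first entry survives.
print strip_comment('mon_host = 10.0.0.1;10.0.0.2;10.0.0.3'), "\n";

# Quoted list: the closing quote ends up inside the "comment".
print strip_comment('mon_host = "10.0.0.1;10.0.0.2"'), "\n";

# Escaped ';': the line is kept as-is (this helper does not unescape).
print strip_comment('mon_host = 10.0.0.1\;10.0.0.2'), "\n";
```

in the first case only `mon_host = 10.0.0.1` is left, and in the second case the value keeps a dangling opening quote.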
but unfortunately there is one more issue:
\\;
still marks the start of a comment (since it's the backslash that is
being escaped) in ceph's parser (same applies to '#' as well).
what a mess this file format is..
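
one possible way to handle the escaped-backslash case, as a rough sketch and not a tested replacement for the patch: only treat '#' or ';' as a comment start if it is preceded by an even number of backslashes (including zero). `strip_patch` and `strip_paired` are hypothetical names, and this assumes ceph's behaviour is as described above:

```perl
use strict;
use warnings;

# Regex from the patch: any backslash directly before '#' or ';'
# prevents the comment from being recognized.
sub strip_patch {
    my ($line) = @_;
    $line =~ s/(?<!\\)(#|;).*$//;
    return $line;
}

# Sketch: consume complete pairs of backslashes (each pair is an escaped
# backslash) and keep them; a following '#' or ';' then starts a comment.
# The lookbehind makes sure the run of backslashes is matched from its start.
sub strip_paired {
    my ($line) = @_;
    $line =~ s/(?<!\\)((?:\\\\)*)[#;].*$/$1/;
    return $line;
}

my $bs = "\\";                               # a single backslash
my $escaped_bs  = "foo = bar${bs}${bs}; c";  # foo = bar\\; c
my $escaped_sem = "foo = bar${bs}; c";       # foo = bar\; c

print strip_patch($escaped_bs), "\n";    # comment is kept by mistake
print strip_paired($escaped_bs), "\n";   # comment is stripped
print strip_paired($escaped_sem), "\n";  # escaped ';' is left alone
```

this only decides where a comment starts; unescaping the remaining value would still have to happen separately.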
> $line =~ s/^\s+//;
> - $line =~ s/^;.*$//;
> $line =~ s/\s+$//;
> next if !$line;
>
> - $section = $1 if $line =~ m/^\[(\S+)\]$/;
> + if ($line =~ m/^\[(.+)\]$/) {
> + $section = $1;
> + $cfg->{$section} = {} if !exists($cfg->{$section});
> + next;
> + }
> +
> if (!$section) {
> warn "no section - skip: $line\n";
> next;
> @@ -35,11 +41,12 @@ sub parse_ceph_config {
>
> if ($line =~ m/^(.*?\S)\s*=\s*(\S.*)$/) {
> my ($key, $val) = ($1, $2);
> - # ceph treats ' ', '_' and '-' in keys the same, so lets do too
> - $key =~ s/[-\ ]/_/g;
> + # ceph treats ' ' and '_' in keys the same, so lets do too
> + $key =~ s/ /_/g;
> $cfg->{$section}->{$key} = $val;
> - }
>
> + next;
this next serves no purpose? (ah, it does with one of the later patches
that adds code below, so please move this to that patch!)
> + }
> }
>
> return $cfg;
> --
> 2.39.2