[pve-devel] [PATCH pve-manager 2/3] Fix #5708: Add CPU raw counters

Daniel Kral d.kral at proxmox.com
Tue Sep 24 14:25:18 CEST 2024


On 9/17/24 07:50, Sascha Westermann via pve-devel wrote:
> Add a map containing raw values from /proc/stat and "uptime_ticks" which
> can be used in combination with cpuinfo.user_hz to calculate CPU usage
> from two samples. "uptime_ticks" is only defined at the top level, as
> /proc/stat is read once, so that core-specific raw values match this
> value.
> 
> Signed-off-by: Sascha Westermann <sascha.westermann at hl-services.de>
> ---
>  PVE/API2/Nodes.pm | 32 ++++++++++++++++++++++++++++++++
>  1 file changed, 32 insertions(+)
> 
> diff --git a/PVE/API2/Nodes.pm b/PVE/API2/Nodes.pm
> index 9920e977..1943ec56 100644
> --- a/PVE/API2/Nodes.pm
> +++ b/PVE/API2/Nodes.pm
> @@ -5,6 +5,7 @@ use warnings;
>  
>  use Digest::MD5;
>  use Digest::SHA;
> +use IO::File;
>  use Filesys::Df;
>  use HTTP::Status qw(:constants);
>  use JSON;
> @@ -466,6 +467,37 @@ __PACKAGE__->register_method({

note: the same route is also called by the WebGUI, and a subset of the 
returned values is displayed on the "Node > Status" page. From what I 
have seen, the added data size is negligible.

>  	$res->{cpu} = $stat->{cpu};
>  	$res->{wait} = $stat->{wait};
>  
> +	if (my $fh = IO::File->new ("/proc/stat", "r")) {

nit: there shouldn't be a space between the function's name and its 
parameter list [0].

> +	    my ($uptime_ticks) = PVE::ProcFSTools::read_proc_uptime(1);
> +	    while (defined (my $line = <$fh>)) {
> +		if ($line =~ m|^cpu\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)(?:\s+(\d+)\s+(\d+))?|) {
> +		    $res->{cpustat}->{user} = int($1);
> +		    $res->{cpustat}->{nice} = int($2);
> +		    $res->{cpustat}->{system} = int($3);
> +		    $res->{cpustat}->{idle} = int($4);
> +		    $res->{cpustat}->{iowait} = int($5);
> +		    $res->{cpustat}->{irq} = int($6);
> +		    $res->{cpustat}->{softirq} = int($7);
> +		    $res->{cpustat}->{steal} = int($8);
> +		    $res->{cpustat}->{guest} = int($9);
> +		    $res->{cpustat}->{guest_nice} = int($10);
> +		    $res->{cpustat}->{uptime_ticks} = $uptime_ticks;

nit: I think this could be placed rather nicely at 
`$res->{uptime_ticks}`, like `$res->{uptime}`, to make `cpustat` a 
little more consistent with `PVE::ProcFSTools::read_proc_stat()` and

> +		} elsif ($line =~ m|^cpu(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)(?:\s+(\d+)\s+(\d+))?|) {
> +		    $res->{cpustat}->{"cpu" . $1}->{user} = int($2);
> +		    $res->{cpustat}->{"cpu" . $1}->{nice} = int($3);
> +		    $res->{cpustat}->{"cpu" . $1}->{system} = int($4);
> +		    $res->{cpustat}->{"cpu" . $1}->{idle} = int($5);
> +		    $res->{cpustat}->{"cpu" . $1}->{iowait} = int($6);
> +		    $res->{cpustat}->{"cpu" . $1}->{irq} = int($7);
> +		    $res->{cpustat}->{"cpu" . $1}->{softirq} = int($8);
> +		    $res->{cpustat}->{"cpu" . $1}->{steal} = int($9);
> +		    $res->{cpustat}->{"cpu" . $1}->{guest} = int($10);
> +		    $res->{cpustat}->{"cpu" . $1}->{guest_nice} = int($11);
> +		}
> +	    }
> +	    $fh->close;
> +	}

Is there anything holding us back from moving this directly into 
`PVE::ProcFSTools::read_proc_stat()`?

As far as I can tell, the output of `PVE::ProcFSTools::read_proc_stat()` 
is used at these locations:

- the PVE `/nodes/{node}/status` API endpoint of course, which only uses 
the values of `cpu` and `wait` at the moment
- `PMG::API2::Nodes`: also only uses the values of `cpu` and `wait`
- the PMG `/nodes/{node}/status` API endpoint, which also only uses the 
values of `cpu` and `wait`
- `PVE::Service::pvestatd::update_node_status`: retrieves the current 
node status and then passes it on to rrd via `broadcast_rrd` (which 
selectively uses only the values of `cpu` and `wait`) and to external 
metric servers

The first three and a half (speaking of `broadcast_rrd` in the last one) 
look fine to me, but we should take a closer look at how external metric 
servers will handle the added data, especially for existing 
queries/dashboards. There could also be a name collision, as 'cpustat' 
is also used for the data that gets sent to the metric servers.

In my opinion, it would be a worthwhile feature to add the properties 
for external metric servers (either as part of this or a future patch 
series).
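
To make the suggestion about moving the parsing a bit more concrete, 
here is a minimal sketch of what a shared helper could look like. The 
name `parse_cpustat_lines` is hypothetical and not an existing function 
in `PVE::ProcFSTools`; the result layout just mirrors what the patch 
puts into `$res->{cpustat}`.

```perl
use strict;
use warnings;

# Counter names in the order in which they appear on a /proc/stat cpu line.
my @fields = qw(user nice system idle iowait irq softirq steal guest guest_nice);

# Hypothetical helper (not part of PVE::ProcFSTools): parses the "cpu" and
# "cpuN" lines of /proc/stat into a nested hash of raw tick counters.
sub parse_cpustat_lines {
    my (@lines) = @_;

    my $cpustat = {};
    for my $line (@lines) {
        # "cpu" is the aggregate line, "cpu0", "cpu1", ... are per-core lines
        next if $line !~ m/^cpu(\d*)\s+(\d+(?:\s+\d+)*)\s*$/;
        my ($core, $rest) = ($1, $2);
        my @values = split(/\s+/, $rest);

        # aggregate counters go to the top level, per-core ones below "cpuN"
        my $target = $core eq '' ? $cpustat : ($cpustat->{"cpu$core"} //= {});
        for my $i (0 .. $#fields) {
            # trailing columns (e.g. guest, guest_nice) may be absent
            last if !defined($values[$i]);
            $target->{$fields[$i]} = int($values[$i]);
        }
    }
    return $cpustat;
}
```

Keeping the parsing in one loop over the field names would also avoid 
the two nearly identical regex branches in the patch.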

> +
>  	my $meminfo = PVE::ProcFSTools::read_meminfo();
>  	$res->{memory} = {
>  	    free => $meminfo->{memfree},
> -- 
> 2.46.0

It would also be very beneficial if the added properties that are 
returned here were documented in the 'returns' JSONSchema, so that they 
can be easily understood by other users as well (especially the unit of 
those raw values, so that it is clear how they need to be converted).
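
As a rough idea of what such documentation could look like, here is a 
sketch of a schema fragment for the aggregate counters. The property 
names follow the patch; the descriptions, the `optional` flag and the 
helper `tick_counter` are my assumptions for illustration only, and the 
per-core `cpuN` entries are left out.

```perl
use strict;
use warnings;

# Builds the schema entry for one raw counter; all counters share one unit.
sub tick_counter {
    my ($what) = @_;
    return {
        type => 'integer',
        minimum => 0,
        description => "$what, in ticks of 1/user_hz seconds since boot.",
    };
}

# Hypothetical 'returns' fragment: property names follow the patch, the
# descriptions and the 'optional' flag are assumptions for illustration.
my $cpustat_schema = {
    type => 'object',
    optional => 1,
    description => 'Raw CPU time counters read from /proc/stat.',
    properties => {
        user => tick_counter('Time spent in user mode'),
        nice => tick_counter('Time spent in user mode with low priority'),
        system => tick_counter('Time spent in system mode'),
        idle => tick_counter('Time spent in the idle task'),
        iowait => tick_counter('Time spent waiting for I/O to complete'),
        irq => tick_counter('Time spent servicing hardware interrupts'),
        softirq => tick_counter('Time spent servicing softirqs'),
        steal => tick_counter('Time spent in other operating systems when virtualized'),
        guest => tick_counter('Time spent running virtual CPUs for guests'),
        guest_nice => tick_counter('Time spent running niced guests'),
    },
};
```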

---

Otherwise, this works just as intended when querying the API endpoint 
`/nodes/{node}/status` via curl and pvesh.
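
For reference, this is roughly how a client could turn two such 
responses into a usage value. It is only a sketch: `cpu_usage_between` 
is a hypothetical helper, and it derives the elapsed ticks from the 
counters themselves rather than from `uptime_ticks`.

```perl
use strict;
use warnings;

# Hypothetical client-side helper: $prev and $cur are two 'cpustat' maps as
# returned by the patched endpoint, $user_hz is cpuinfo.user_hz (often 100).
sub cpu_usage_between {
    my ($prev, $cur, $user_hz) = @_;

    # guest and guest_nice are already accounted for in user and nice, so
    # they are left out of the total to avoid counting them twice
    my @keys = qw(user nice system idle iowait irq softirq steal);
    my %delta = map { $_ => $cur->{$_} - $prev->{$_} } @keys;

    my $total = 0;
    $total += $delta{$_} for @keys;
    return if $total <= 0; # no ticks elapsed between the two samples

    my $idle = $delta{idle} + $delta{iowait};
    return {
        # fraction of the elapsed ticks that was not spent idle or in iowait
        busy_fraction => ($total - $idle) / $total,
        # a single counter converted from ticks to seconds
        user_seconds => $delta{user} / $user_hz,
    };
}
```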

Reviewed-by: Daniel Kral <d.kral at proxmox.com>
Tested-by: Daniel Kral <d.kral at proxmox.com>

[0] https://pve.proxmox.com/wiki/Perl_Style_Guide#Spacing_and_syntax_usage



