Diffstat (limited to 'man/man1/pmlogreduce.1')
-rw-r--r-- man/man1/pmlogreduce.1 327
1 files changed, 327 insertions, 0 deletions
diff --git a/man/man1/pmlogreduce.1 b/man/man1/pmlogreduce.1
new file mode 100644
index 0000000..4b3be60
--- /dev/null
+++ b/man/man1/pmlogreduce.1
@@ -0,0 +1,327 @@
+'\"macro stdmacro
+.\"
+.\" Copyright (c) 2000 Silicon Graphics, Inc. All Rights Reserved.
+.\"
+.\" This program is free software; you can redistribute it and/or modify it
+.\" under the terms of the GNU General Public License as published by the
+.\" Free Software Foundation; either version 2 of the License, or (at your
+.\" option) any later version.
+.\"
+.\" This program is distributed in the hope that it will be useful, but
+.\" WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY
+.\" or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
+.\" for more details.
+.\"
+.\"
+.TH PMLOGREDUCE 1 "PCP" "Performance Co-Pilot"
+.SH NAME
+\f3pmlogreduce\f1 \- temporal reduction of Performance Co-Pilot archives
+.SH SYNOPSIS
+\f3$PCP_BINADM_DIR/pmlogreduce\f1
+[\f3\-z\f1]
+[\f3\-A\f1 \f2align\f1]
+[\f3\-S\f1 \f2starttime\f1]
+[\f3\-s\f1 \f2samples\f1]
+[\f3\-T\f1 \f2endtime\f1]
+[\f3\-t\f1 \f2interval\f1]
+[\f3\-v\f1 \f2volsamples\f1]
+[\f3\-Z\f1 \f2timezone\f1]
+\f2input\f1 \f2output\f1
+.SH DESCRIPTION
+.B pmlogreduce
+reads one Performance Co-Pilot (PCP) archive
+identified by
+.I input
+(this must be a PCP archive created by
+.BR pmlogger (1),
+.BR pmlogextract (1)
+or
+.BR pmlogreduce (1)),
+and creates a temporally reduced PCP archive in
+.IR output .
+The
+data reduction involves statistical and temporal reduction of samples with
+an output sampling
+interval defined by the
+.B \-t
+option in the
+.I output
+archive (independent of the sampling intervals in the
+.I input
+archive), and is further controlled by
+other command line arguments.
+.PP
+For some metrics, temporal data reduction is not helpful.
+Metrics of type
+.B PM_TYPE_AGGREGATE
+or
+.B PM_TYPE_EVENT
+found in
+.I input
+are skipped with a warning and do not appear in the
+.I output
+archive.
+.SH COMMAND LINE OPTIONS
+The command line options for
+.B pmlogreduce
+are as follows:
+.PP
+.TP 7
+.BI \-A " align"
+Specify a ``natural'' alignment of the output sample times; refer
+to
+.BR PCPIntro (1).
+.PP
+.TP 7
+.BI \-S " starttime"
+Define the start of a time window to restrict the samples retrieved
+from the
+.I input
+archive; refer to
+.BR PCPIntro (1).
+.PP
+.TP 7
+.BI \-s " samples"
+The argument
+.I samples
+defines the number of samples to be written to
+.IR output .
+If
+.I samples
+is 0 or
+.B \-s
+is not specified,
+.B pmlogreduce
+will sample until the end of the PCP archive,
+or the end of the time window as specified by
+.BR \-T ,
+whichever comes first. The
+.B \-s
+option will override the
+.B \-T
+option if its sample limit is reached sooner.
+.PP
+.TP 7
+.BI \-T " endtime"
+Define the termination of a time window to restrict the samples
+retrieved from the
+.I input
+archive; refer to
+.BR PCPIntro (1).
+.PP
+.TP 7
+.BI \-v " volsamples"
+The
+.I output
+archive is potentially a multi-volume data set, and the
+.B \-v
+option causes
+.B pmlogreduce
+to start a new volume after
+.I volsamples
+log records have been written to the
+.I output
+archive.
+.RS 7
+.PP
+Independent of any
+.B \-v
+option, each volume of an archive is limited to no more than
+2^31 bytes, so
+.B pmlogreduce
+will automatically create a new volume for the archive before
+this limit is reached.
+.RE
+.PP
+.TP 7
+.BI \-t " interval"
+Consecutive samples in the
+.I output
+archive will appear with a time delta defined by
+.IR interval ;
+refer to
+.BR PCPIntro (1).
+Note the default value is 600 (seconds, i.e. 10 minutes).
+.PP
+.TP 7
+.BI \-Z " timezone"
+Use
+.I timezone
+when displaying the date and time, or interpreting the
+.B \-S
+and
+.B \-T
+options.
+.I Timezone
+is in the format of the environment variable
+.B TZ
+as described in
+.BR environ (5).
+.PP
+.TP 7
+.B \-z
+Use the local timezone of the host from the
+.I input
+archive when displaying the date and time, or interpreting the
+.B \-S
+and
+.B \-T
+options.
+The default is to initially use the timezone of the local host.
+.SH DATA REDUCTION
+.PP
+The statistical and temporal reduction obeys the following rules:
+.TP 4m
+1.
+Consecutive records from
+.I input
+are read without interpolation, and at most one output record
+is written for each
+.IR interval ,
+summarizing the performance data over that period.
+.TP 4m
+2.
+If the semantics of a metric indicates it is
+.B instantaneous
+or
+.B discrete
+then the
+.I output
+value is computed as the arithmetic mean of the observations (if any)
+over each
+.IR interval .
+.TP 4m
+3.
+If the semantics of a metric indicates it is a
+.B counter
+then the following transformations are applied:
+.RS +4m
+.nr PD 0
+.TP 4m
+a)
+Metrics with 32-bit precision are promoted to 64-bit precision.
+.TP 4m
+b)
+Any counter wrap (overflow) is noted, and appropriate adjustment made
+in the value of the metric over each
+.IR interval .
+This will be correct in the case of a single counter wrap, but will
+silently
+.B underestimate
+in the case where more than one counter wrap occurs between consecutive
+observations in the
+.I input
+archive, and silently
+.B overestimate
+in the case where a counter reset occurs between consecutive
+observations in the
+.I input
+archive; unfortunately these situations cannot be detected, but
+are believed to be rare events for the sort of production monitoring
+environments where
+.B pmlogreduce
+is most likely to be deployed.
+.RE
+.PD
+.TP 4m
+4.
+Any changes in instance domains, and indeed all metadata, are preserved.
+.TP 4m
+5.
+Any ``mark'' records in the
+.I input
+archive (as created by
+.BR pmlogextract (1))
+will be preserved in the
+.I output
+archive, so periods where no data is available are maintained, and data
+interpolation will
+.B not
+occur across these periods when the
+.I output
+archive is subsequently processed with PCP applications.
+.SH FILES
+.PD 0
+For each of the
+.I input
+and
+.I output
+archives, several physical files are used.
+.TP 10
+\f2archive\f3.meta
+metadata (metric descriptions, instance domains, etc.) for the archive log
+.TP
+\f2archive\f3.0
+initial volume of metrics values (subsequent volumes have suffixes
+.BR 1 ,
+.BR 2 ,
+\&...) \- for
+.I input
+these files may have been previously compressed with
+.BR bzip2 (1)
+or
+.BR gzip (1)
+and thus may have an additional
+.B .bz2
+or
+.B .gz
+suffix.
+.TP
+\f2archive\f3.index
+temporal index to support rapid random access to the other files in the
+archive log.
+.PD
+.SH "PCP ENVIRONMENT"
+Environment variables with the prefix
+.B PCP_
+are used to parameterize the file and directory names
+used by PCP.
+On each installation, the file
+.I /etc/pcp.conf
+contains the local values for these variables.
+The
+.B $PCP_CONF
+variable may be used to specify an alternative
+configuration file,
+as described in
+.BR pcp.conf (5).
+.SH SEE ALSO
+.BR PCPIntro (1),
+.BR pmdumplog (1),
+.BR pmlc (1),
+.BR pmlogextract (1),
+.BR pmlogger (1),
+.BR pcp.conf (5)
+and
+.BR pcp.env (5).
+.SH DIAGNOSTICS
+All error conditions detected by
+.B pmlogreduce
+are reported on
+.I stderr
+with a textual (if sometimes terse) explanation.
+.PP
+Should the
+.I input
+archive be corrupted (this can happen
+if the
+.B pmlogger
+instance writing the archive suddenly dies), then
+.B pmlogreduce
+will detect and report the position of the corruption in the file,
+and any subsequent information from the
+.I input
+archive will not be processed.
+.PP
+If any error is detected,
+.B pmlogreduce
+will exit with a non-zero status.
+.SH CAVEATS
+.PP
+The preamble metrics (pmcd.pmlogger.archive, pmcd.pmlogger.host,
+and pmcd.pmlogger.port), which are automatically recorded by
+.B pmlogger
+at the start of the archive, may not be present in the archive output by
+.BR pmlogreduce .
+These metrics are only relevant while the archive is being created,
+and have no significance once recording has finished.
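
The reduction rules in the DATA REDUCTION section can be illustrated with a small sketch. This is not the pmlogreduce implementation: the function names (`mean_per_interval`, `counter_delta`, `unwrap_counter`), the choice to stamp each output record at the end of its interval, and the assumption of an unsigned 32-bit counter are all hypothetical and chosen only to make rules 1 to 3 concrete.

```python
from statistics import fmean

# Assumption: counters are unsigned 32-bit values in the input archive.
WRAP_32 = 2 ** 32


def mean_per_interval(samples, start, interval):
    """Rules 1 and 2: group (time, value) observations into fixed
    intervals and emit at most one record per interval, holding the
    arithmetic mean of the observations that fell inside it.
    Intervals with no observations produce no record.
    Assumption: the record is stamped at the end of its interval."""
    buckets = {}
    for t, v in samples:
        idx = int((t - start) // interval)
        buckets.setdefault(idx, []).append(v)
    return [
        (start + (idx + 1) * interval, fmean(values))
        for idx, values in sorted(buckets.items())
    ]


def counter_delta(prev, curr, modulus=WRAP_32):
    """Rule 3b: increase between two consecutive counter observations,
    assuming at most one wrap occurred between them."""
    if curr >= prev:
        return curr - prev
    # The value went backwards, so treat it as a single wrap.
    return curr + modulus - prev


def unwrap_counter(observations):
    """Rule 3a: accumulate wrap-adjusted deltas into a wider running
    total (Python integers are unbounded, standing in for 64-bit)."""
    total = observations[0]
    out = [total]
    for prev, curr in zip(observations, observations[1:]):
        total += counter_delta(prev, curr)
        out.append(total)
    return out
```

Under these assumptions the limits described in rule 3b fall out directly: `counter_delta` cannot tell a second wrap from none, so two wraps between observations are undercounted by one modulus, and a counter reset (for example from 1000 back to 5) is indistinguishable from a wrap and is overcounted by nearly a full modulus.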