Table of contents: Part:1 OxMetrics Reference; 8 OxMetrics Statistics; 8.1 Actual series and scatter plots; 8.2 Mean, standard deviation and variance; 8.3 Autocorrelation function (ACF) or covariogram; 8.4 Partial autocorrelation function (PACF); 8.5 Correlogram; 8.6 Cross-correlation function (CCF); 8.7 Periodogram; 8.8 Spectral density; 8.9 Histogram, estimated density and distribution; 8.10 Regression lines and smooths; 8.10.1 Kernel smooth; 8.10.2 Spline smooth; 8.11 QQ plot; 8.12 Box plot; 8.13 Exponentially-weighted moving average (EWMA); 8.14 Exponentially-weighted moving correlation; 10 Algebra Language; 10.1 Introduction; 10.2 Executing Algebra code; 10.2.1 Calculator (Alt+c); 10.2.2 Algebra Editor (Alt+a); 10.2.3 Algebra from Results windows (Ctrl+a); 10.2.4 Algebra from a Batch file; 10.3 Syntax of Algebra language; 10.3.1 Variables and variable names; 10.3.2 Comment; 10.3.3 Constants; 10.3.4 Algebra operators; 10.3.4.1 Arithmetic operators; 10.3.4.2 Relational and logical operators; 10.3.4.3 Algebra operator precedence; 10.3.5 Assignment statements; 10.3.6 Conditional assignment statements; 10.3.7 Indexing; 10.3.8 Keywords; 10.4 Algebra implementation; 10.5 Algebra Functions; 10.5.1 Differencing and lag functions; 10.5.2 Implicit functions; 10.5.3 ACF and periodogram functions; 10.5.4 Sorting functions; 10.5.5 Smoothing functions; 10.5.5.1 Hodrick--Prescott filter; 10.5.5.2 Kernel and spline smoothing; 10.5.5.3 Exponentially-weighted moving average and correlation; 10.5.5.4 Date and time functions; 10.6 Algebra function summary; 11 Batch Language; 11.1 Introduction; 11.2 Executing Batch commands; 11.2.1 Batch Editor (Alt+b); 11.2.2 Batch from Results windows (Ctrl+b); 11.2.3 Batch from the File/Open command; 11.2.4 Batch from the Windows Explorer; 11.2.5 Batch from a Batch file; 11.3 Batch files and default folders; 11.4 General Batch command summary; 11.4.1 Comment; 11.4.2 Command types; 11.4.3 Default arguments; 11.5 Batch commands; 11.5.1 algebra {...}; 11.5.2 appenddata("filename", "group"="");; 11.5.3 appresults("filename");; 11.5.4 break;; 11.5.5 chdir("path");; 11.5.6 closedata("databasename");; 11.5.7 closedrawwindow();; 11.5.8 command("command_line");; 11.5.9 database("name", year1, period1, year2, period2, freq);; 11.5.10 draw(area, "y", "mode"="");; 11.5.11 drawf(area, "y", "function", d1=0, d2=0);; 11.5.12 drawptext(area, "text", "x", "y", size=300, rot=0);; 11.5.13 drawtext(area, "text", "x", "y", size=300);; 11.5.14 drawtitle(area, "text", "pos", size=300);; 11.5.15 drawx(area, "y", "x", "mode"="");; 11.5.16 drawz(area, "y", "x", "mode"="");; 11.5.17 exit;; 11.5.18 loadalgebra("filename");; 11.5.19 loadbatch("filename");; 11.5.20 loadcommand("filename");; 11.5.21 loaddata("filename");; 11.5.22 loadgraph("filename");; 11.5.23 module("name");; 11.5.24 package("packagename", "modeltype"="");; 11.5.25 print("text");; 11.5.26 println("text");; 11.5.27 printdate;; 11.5.28 savedata("filename");; 11.5.29 savedrawwindow("filename", "window"="");; 11.5.30 saveresults("filename");; 11.5.31 setdraw("option", i1=0, i2=0, i3=0, i4=0, i5=0);; 11.5.32 setdrawwindow("name");; 11.5.33 show;; 11.5.34 usedata("databasename", i1=0);; 11.6 Examples; 12 OxMetrics Graphics; 12.1 Graphics paper; 12.2 Creating graphs; 12.2.1 Actual series with optional transformations; 12.2.2 Multiple series with optional transformations; 12.2.3 Scatter plots; 12.2.4 Distribution; 12.2.5 Time-series: ACF etc.; 12.2.6 QQ plots; 12.2.7 Two series by a third; 12.2.8 3-dimensional plots; 12.3 Printing graphs; 12.4 Graphics formats; 12.5 Saving and loading graphs; 12.6 Saving Ox code of a graph; 12.7 Editing graphs; 12.7.1 Graph layout; 12.7.2 Area layout; 12.7.3 Variable against time, scatter, or 3D; 12.7.3.1 Error bars; 12.7.3.2 Regression, Scale; 12.7.4 Axes; 12.7.4.1 Settings for non-default labelling; 12.7.4.2 Location and transformation; 12.7.4.3 Style; 12.7.4.4 Label style; 12.7.5 Legend style; 12.7.6 Histogram; 12.7.7 Copy properties to other areas; 12.7.8 Text; 12.7.9 Lines and symbols; 12.7.10 Adding, moving and deleting objects; 12.7.11 Pointing; 12.7.12 Graphics setup; 12.8 Copy and paste; 12.9 Graphs and sample selection; 12.10 Text formatting; 13 OxMetrics Data Management; 13.1 Creating data; 13.2 Database font; 13.3 Database description; 13.4 Printing data; 13.5 Data formats; 13.6 Summary statistics; 13.7 Saving data; 13.8 Navigation and editing; 13.9 Renaming variables; 13.10 Deleting variables; 13.11 Reordering variables; 13.12 Adding variables; 13.13 Extending or reducing the sample period; 13.13.1 Changing the sample period; 13.14 Copy and paste; 13.15 Appending data; 13.16 Daily, weekly and timed data

List of tables: Table:10.1 Algebra operator precedence; Table:10.2 Reserved Algebra keywords; Table:10.3 Algebra functions; Table:11.1 Batch command summary; Table:12.1 Greek symbols; Table:12.2 Arrows; Table:12.3 Mathematics accents; Table:12.4 Foreign and accented characters; Table:12.5 Mathematical symbols; Table:12.6 Log-like symbols; Table:12.7 Miscellaneous symbols; Table:12.8 Fonts and sizes

[2]

Part:1 OxMetrics Reference

Chapter 8 OxMetrics Statistics

OxMetrics provides many convenient graphical options for prior data analysis. For simplicity, we denote the selected variable by x_t, t=1,...,T. This chapter summarizes the underlying formulae, discussing each available graph type in turn.

8.1 Actual series and scatter plots

Actual series gives a graph for each selected variable, showing the variable over time. The All scatter plots option gives scatter plots between all selected variables (omitting redundant scatter plots).

8.2 Mean, standard deviation and variance

These are relevant for the statistics described below. For a series x_t, t=...,T:

sample mean	x= 1/T ∑_t=1^Tx_t,
sample variance	σ̃²_x = 1/T ∑_t=1^T( x_t-x)²,
sample standard deviation	σ̃_x.

(eq:8.1)

Note that an alternative definition of the variance would divide by T-1 instead of T.

[Note: The maximum likelihood estimate for a linear model gives 1/T, while regression on a constant using OLS produces an unbiased estimate of the variance using 1/(T-1).]

sample variance (OLS)	σ̂²_x = 1/T-1 ∑_t=1^T( x_t-x)²,
sample standard error	σ̂_x.

(eq:8.2)

8.3 Autocorrelation function (ACF) or covariogram

Define the sample autocovariances {ĉ_j} of a series x_t, t=...,T:

ĉ_j= 1/T ∑_t=j+1^T( x_t-x) ( x_t-j-x), j=0,...,T-1,

(eq:8.3)

using the full sample mean x= 1/T ∑_t=1^Tx_t. The variance σ̃²_x corresponds to ĉ₀, see (eq:8.1).

The sample autocorrelation function (ACF) is the series {r̂_j} where r̂_j is the sample correlation coefficient between x_t and x_t-j. The length of the correlogram is specified by the user, leading to a figure which shows ( r̂₁,r̂₂,...,r̂_s) plotted against ( 1,2,...,s) where for any j when x is any chosen variable:

r̂_j=ĉ_j / ĉ₀, j=0,...,T-1.

(eq:8.4)

The first autocorrelation, {r̂₀}, is equal to one, and omitted from the graphs.

The asymptotic variance of the autocorrelations is 1/T, so approximate 95% error bars are indicated at ±2T^-1/2 (see e.g. Harvey, 1993, p.42).

If there are missing values in the data series, the correlogram uses all valid observation as follows: the mean is computed for all valid observations; terms where x_t or x_t-j are missing are omitted from the summation in (eq:8.3). Consequently, the same procedure is used in (eq:8.4) and any other derived functions such as the spectrum and periodogram.

8.4 Partial autocorrelation function (PACF)

Given the sample autocorrelation function {r̂_j} the partial autocorrelations are computed using Durbin's method as described in Golub and Van Loan (1989, §4.7.2). This corresponds to recursively solving the Yule--Walker equations. For example, with autocorrelations, r̂₀, r̂₁, r̂₂, ..., the first partial correlation is α̂₀=1 (omitted from the graphs). The second, α̂₁, is the solution from

(

r̂₀

r̂₁

) = (

r̂₀	r̂₁
r̂₁	r̂₀

) (

α̂₀

α₁

et cetera.

8.5 Correlogram

The sample correlogram plots the series {r̂_j^*} where r̂_j^* is the correlation coefficient between x_t and x_t-j. The length of the correlogram is specified by the user, leading to a figure which shows ( r̂₁^*,r̂₂^*,...,r̂_s^*) plotted against ( 1,2,...,s) where for any j when x is any chosen variable:

r̂_j^*=

∑_t=j+1^T( x_t-x₀) ( x_t-j-x_j)

(∑_t=j+1^T( x_t-x₀) ²∑_t=j+1^T( x_t-j-x_j) ²)^½

(eq:8.5)

Here x₀= 1/T-j ∑_t=j+1^Tx_t is the sample mean of x_t, t=j+1,...,T, and x_j= 1/T-j ∑_t=j+1^Tx_t-j is the sample mean of x_t-j, so that r̂_j^* corresponds to a correlation coefficient proper.

Note the difference with the definition of the sample autocorrelations {r̂_j} in (eq:8.4) above. This difference tends to be small, and vanishes asymptotically, provided the series are stationary. But, as argued in Nielsen (2006), the correlogram provides a better discrimination between stationary and non-stationary variables: for an autoregressive value of one (or higher), the correlogram declines more slowly than the ACF.

When the correlogram is selected for plotting, the PACF is based on the correlogram values, rather than the ACF. Handling of missing values is the same as for the ACF.

8.6 Cross-correlation function (CCF)

The sample cross-correlation function (CCF) graphs the correlation between a series x_t and the lags of another series y_t:

r̂^xy_j=

∑_t=j+1^T( x_t-x) ( y_t-j-y)

∑_t=1^T( x_t-x) ∑_t=1^T( y_t-y)

ĉ^xy_j

( ĉ^x₀ ĉ^y₀)^1/2

, j=0,...,T-1,

(eq:8.6)

using the full sample means x and y.

The CCF between x_t and y_t is different from that between y_t and x_t: r̂^xy_j≠r̂^yx_j. Both are plotted in the same graph.

8.7 Periodogram

The periodogram is defined as:

p( ω)	=	1/2π ∑_j=-T+1^T-1 ĉ_{\| j\|}e^-iωj = 1/2π ∑_j=-T+1^T-1 ĉ_{\| j\|} cos ( jω)
	=	ĉ₀/2π ∑_j=-T+1^T-1 r̂_{\| j\|} cos ( jω) ,
		for ω= 0, 2π/T, 4π/T, ..., (int(T/2)2π)/T.

(eq:8.7)

Note that p(0)=0.

When the periodogram is plotted, only frequencies greater than zero and up to π are used. Moreover, the x-axis, with values 0,...,π, is represented as 0,...,1. So, when T=4 the x coordinates are 0.5,1 corresponding to π/2, π. When T=5, the x coordinates are 0.4,0.8 corresponding to 2π/5, 4π/5.

However, when the periodogram is evaluated using the Algebra function periodogram, it is evaluated at all frequencies up to 2π, resulting in T periodogram values. For T=4 these are evaluated at 0, π/2, π, 3π/2. For T=5: 0, 2π/5, 4π/5, 6π/5, 8π/5. The frequencies are returned in the last argument of periodogram.

8.8 Spectral density

The estimated spectral density is a smoothed function of the sample autocorrelations {r̂_j}, defined as in (eq:8.4). The sample spectral density is then defined as:

ŝ( ω) =

2π

∑_j=-(T-1)^T-1K( j) r̂_{| j|} cos ( jω) , 0≤ω≤π,

(eq:8.8)

where | ^.| takes the absolute value, so that, for example, r̂_{| -1|}=r̂₁. The K( ^.) function is called the lag window. OxMetrics uses the Parzen window:

K( j) =	1-6( j/m ) ²+6\| j/m \| ³,	\| j/m \| ≤0.5,
=	2( 1-\| j/m \| ) ³,	0.5≤\| j/m \| ≤1.0,
=	0,	\| j/m \| >1.

(eq:8.9)

We have that K(-j)=K(j), so that the sign of j does not matter ( cos (x)= cos (-x)). The r̂_js are based on fewer observations as j increases. The window function attaches decreasing weights to the autocorrelations, with zero weight for j>m. The parameter m is called the lag truncation parameter. In OxMetrics, this is taken to be the same as the chosen length of the correlogram. For example, selecting s=12 (the with length setting in the dialog) results in m=12. The larger m, the less smooth the spectrum becomes, but the lower the bias. The spectrum is evaluated at 128 points between 0 and π. For more information see Priestley (1981) and Granger and Newbold (1986).

8.9 Histogram, estimated density and distribution

Given a data set {x_t}=( x₁...x_T) which are observations on a random variable X. The range of {x_t} is divided into N intervals of length h with h defined below. Then the proportion of x_t in each interval constitutes the histogram; the sum of the proportions is unity on the scaling in OxMetrics. The density can be estimated as a smoothed function of the histogram using a normal or Gaussian kernel. This can then be summed (`integrated') to obtain the estimated cumulative distribution function (CDF).

Denote the actual density of X at x by f_x( x) . A non-parametric estimate of the density is obtained from the sample by:

f_x( x) ̂= 1/Th ∑_t=1^TK(

x-x_t

) ,

(eq:8.10)

where h is the window width or smoothing parameter, and K( ^.) is a kernel such that:

∫_-∞^∞ K( z) dz=1.

OxMetrics sets:

h=1.06σ̃ _x/T^0.2

as a default, and uses the standard normal density for K( ^.) :

x-x_t

) =

(2π)^½

exp [ - ½(

x-x_t

) ²] .

(eq:8.11)

The default window width is used when the dialog option is set to Default bars. Alternatively, it is possible to directly choose the number of bars, in which case, the more bars, the less smooth the density estimate will be. The histogram uses the whole sample, skipping over missing values.

f_x( x) ̂ is usually calculated for 128 values of x, but since direct evaluation can be somewhat expensive in computer time, a fast Fourier transform is used (we are grateful to Dr. Silverman for permission to use his algorithm). The estimated CDF of X can be derived from f_x( x) ̂; this is shown with a standard normal CDF for comparison. An excellent reference on density function estimation is Silverman (1986).

8.10 Regression lines and smooths

The scatter plots option allows for various types of regression lines to be drawn. A single regression line draws the fitted values from the OLS estimates α̂ and β̂ in

ŷ_t=α̂+β̂x_t, t=1,...T,

assuming that y_t and x_t are the selected variables. The distinction between sequential and recursive regression lines is easily explained using three lines. Divide the sample in three parts: 1,...,T/3, T/3+1,...,2T/3 and 2T/3+1,...,T, then:

sequential

line 1	ŷ_t=α̂ ₁+β̂ ₁x_t,	t=1,...T/3,
line 2	ŷ_t=α̂ ₂+β̂ ₂x_t,	t=T/3+1,...,2T/3,
line 3	ŷ_t=α̂ ₃+β̂ ₃x_t,	t=2T/3+1,...,T,

recursive

line 1	ŷ_t=α̂ ₁+β̂ ₁x_t,	t=1,...T/3,
line 2	ŷ_t=α̂ ₄+β̂ ₄x_t,	t=1,...,2T/3,
line 3	ŷ_t=α̂ +β̂ x_t,	t=1,...,T.

The regression line is fitted over the whole sample, skipping over missing values inside sample.

8.10.1 Kernel smooth

As a first step towards non-parametric regression, we start with a scatter plot of y_t and x_t, with x_t along the horizontal axis. Divide the x-axis in N intervals, and compute the average y-value in each interval. The resulting step function provides a non-parametric regression line: to compute ŷ(x₀) we locate the interval on which x₀ falls, and use the mean of y in that interval as ŷ. An obvious drawback is that, if x₀ is towards the edge of an interval, most of the observations are on the other side. Solve this by centring the interval in x₀. This still gives as much weight to points far away as to points nearby. In line with the smoothing functions (kernels) in the spectrum and density estimates, we use a density function to take a weighted average.

The non-parametric estimate of y is obtained from

ŷ_h( x) =( ∑_t=1^TK(

x-x_t

) ) ^-1( ∑_t=1^TK(

x-x_t

) y_t) ,

(eq:8.12)

where, as for (eq:8.10), h is the window width, bandwidth or smoothing parameter, and K( ^.) is a kernel which integrates to 1. OxMetrics uses the Epanechnikov kernel:

K( u) =	3/4 ( 1-u²) ,	\| u\| ≤1,
=	0,	\| u\| >1,

(eq:8.13)

which is optimal in a certain sense. The optimal bandwidth, equivalent to the default chosen for the non-parametric density estimate is:

h=0.75σ̃ _x/T^0.2.

The function ŷ_h( ^.) is computed at 128 points. As h→0, ŷ_h( ^.) →y: the sample mean is the fitted value for any x. On the other hand, if h→∞ the interval goes to zero, and we fit the nearest corresponding y-value so that each data point is picked up exactly. Note, however, that the ŷ( ^.) function is evaluated at the T data points x_t (which is time in the absence of x). Härdle (1990) is a general reference on the subject of non-parametric regression.

There are three ways of specifying the bandwidth:

Number of parameters
This specifies the equivalent number of parameters (approximately comparable to the number of regressors used in a linear regression). The default is

3/4 [
(T-1)
12
]^1/2 T^-0.2.
Set
Sets the bandwidth directly.
Automatic
Chooses the bandwidth by generalized cross validation (GCV). We find that choosing bandwidth using GCV or cross validation (CV) tends to undersmooth.
Equation (eq:8.14) below indicates how GCV is computed.

The kernel smooth is not computed using a Fourier transform, but directly, so can be slow for large T. The smooth is fitted over the whole sample skipping over missing values inside sample, which are estimated by the fit from the smooth.

One drawback of this smooth is that, for trending lines, it behaves counter intuitively at the edges. Consider, for example, the left edge of the kernel smooth of a variable which is upwardly trending. Since the smooth starts as a moving average of points which are only to the right, and hence mainly higher, the left edge will have a J shape, starting above what would be fitted by eye.

8.10.2 Spline smooth

A spline is another method for smoothing a scatter plot. Consider a plot of y_t, against x_t, and sort the data according to x: a<x_[1]<...<x_[T]<b. In a spline model, the sum of squared deviations from a function g is minimized, subject to a roughness penalty:

min ∑_t=1^T [ y_t - g(x_[t])]² +α∫_a^b [ g^''(x)]² dx.

OxMetrics uses a natural cubic spline, which is cubic because the function g is chosen as a third degree polynomial, and natural because the smooth is a straight line between a and x_[1] and between x_[T] and b. This avoids the end-point problem of the kernel smooth of the previous section. Two good references on splines and nonparametric regression are Green and Silverman (1994) and Hastie and Tibshirani (1994).

The α parameter is the bandwidth: the smaller α, the lower the roughness penalty, and hence the closer the smooth will track the actual data.

The spline is evaluated at all the data points, where missing y values are estimated by the fit from the smooth. The spline procedure handles ties in the x variable. The algorithm used to compute the spline is of order T, and discussed extensively in Green and Silverman (1994, Chs.2,3).

For evenly-spaced data (e.g. scatter plot against time), the algorithm involves a Toeplitz matrix. This illustrates the closeness of a natural cubic spline to the Hodrick--Prescott filter which is popular in macro-economics:

	diagonal	2nd diag	3rd diag	remainder
Hodrick--Prescott	6α+ 1	-4α	α	0
spline	6α+ 2/3	-4α+ 1/6	α	0

The only difference is in this Toeplitz matrix. Since the Hodrick--Prescott filter uses α=1600 (for quarterly data), the smoothers from both methods are virtually identical.

[Note: Some articles describe computation of the Hodrick--Prescott filter as involving inversion of a T×T matrix. This is clearly not the case. It is avoided in the Reinsch algorithm used for the general scatter-plot smooth, and more obviously in the evenly spaced case, where all that is required is the solution of a linear system involving a banded Toeplitz matrix. Alternatively, the Kalman filter may be used, as in Koopman, Harvey, Doornik, and Shephard (2013).]

There are three ways of specifying the bandwidth:

Number of parameters
This specifies the equivalent number of parameters, k_e, (approximately comparable to the number of regressors used in a linear regression). The default is

3/4 [
(T-1)
12
]^1/2 T^-0.2.
Set
Sets the bandwidth directly, with the default of 1600 corresponding to the Hodrick--Prescott filter for quarterly data (see §10.5.5.1 for other frequencies).
Automatic
Chooses the bandwidth by generalized cross validation (GCV). We find that choosing bandwidth using GCV or cross validation (CV) tends to undersmooth. The GCV criterion is computed as:

GCV(α) = T(
RSS
T - 1.25 k_e + 0.5
).
(eq:8.14)

We have adopted GCV instead of CV because a very good fit at one point could dominate the CV criterion.

8.11 QQ plot

Draws a QQ plot. The variable would normally hold critical values which are hypothesized to come from a certain distribution. This function then draws a cross plot of these observed values (sorted), against the theoretical quantiles (the horizontal axis). The 45^o line is drawn for reference (the closer the cross plot to this line, the better the match). The QQ plot line is drawn over the whole sample, skipping over missing values.

The following distributions are supported:

χ²(df₁),
F(df₁, df₂),
N(0, 1),
t(df₁),
Uniform(0,1), resulting in a Quantile plot.

The normal QQ plot includes the pointwise asymptotic 95% standard error bands, as derived in Engler and Nielsen (2009) for residuals of regression models (possibly autoregressive) with an intercept. Here the normal QQ plot is created for the standardized data, but drawn on the original scale. The standard errors are given by the square root of:

T^-1(

Φ(z)(1 - Φ(z))

φ(z)²

- z² / 2 - 1),

where z denotes the coordinates of the reference distribution (the standard normal), Φ is the standard normal cdf, and φ the density.

8.12 Box plot

A box plot shows the distribution of a variable in terms of its quartiles, labelled Q₁, Q₂, Q₃ (the 25%, 50%and 75%quartiles). Define the interquartile range as IQR = 1.5 (Q₃ - Q₁). The box plot consists of the following elements:

a box, with horizontal lines at Q₁, Q₂ (the median) and Q₃;
a vertical line from Q₁ - IQR to Q₃ + IQR (omitted inside the box);
individual observations: all observations outside the (Q₁ - IQR,Q₃ + IQR) range, plus the two observations on either end which just fall inside this range.

The box plot uses the whole sample, skipping over missing values.

8.13 Exponentially-weighted moving average (EWMA)

A simple exponentially-weighted moving average of a series y_t, t=1,...,T, is defined as:

m_t = αy_t + (1 - α)m_t-1, t=2,...,T,

(eq:8.15)

using m₁=y₁. The resulting series z_t is z_t=m_t, except when any y_t+h is missing for h≥1, in which case it is replaced by forecasts z_t+h=m_t.

An extended version introduces a slope coefficient β, and is also called Holt's method:

m_t	=	αy_t + (1 - α)(m_t-1+b_t-1),
b_t	=	β(m_t-m_t-1) + (1 - β)b_t-1.

In this case m₁=y₁, m₂=y₂, b₂=y₂-y₁; when β=0 the EWMA (eq:8.15) is used. The resulting series z_t is z_t=m_t, except when any y_t+h is missing for h≥1, in which case it is replaced by forecasts z_t+h=m_t+hb_t.

Back-substitution in the EWMA without slope coefficient shows that:

m_t = (1-α)^t-1m₁ + α∑_s=0^t-1(1-α)^s y_t-s.

See e.g. Harvey (1993, Ch. 5), or Makridakis, Wheelwright, and Hyndman (1998, Ch. 4).

8.14 Exponentially-weighted moving correlation

The exponentially weighted moving sample correlation of two series y_t and x_t, both having expectation zero, is defined as:

r_t =

p_t(x,y)

[p_t(x,x)p_t(y,y)]^1/2

, t=3,...,T

where

p_t(x,y) = ∑_s=0^t-1λ^s x_t-s y_t-s, t=1,...,T,

and r₁,r₂ are set to missing values. When any y_t+h or x_t+h is missing r_t+h is set to r_t.

Premultiplying the numerator and denominator of r_t by 1-λ, shows that m_t^*=(1-λ)p_t(x,y) is the EWMA of the cross-product with α=1-λ and m₁=0. This does not work for λ=1.

Chapter 10 Algebra Language

10.1 Introduction

OxMetrics is mostly menu-driven for ease of use. To add flexibility, certain functions can be accessed through entering commands. The syntax of these commands, which can be seen as small computer languages, is described in this and the next chapter. In addition, there is the Ox language, which is a more serious computer language, and described in a separate book.

10.2 Executing Algebra code

Algebra is a simple vector language, operating on only one type of object: the variables in the database. This object is manipulated as a whole, although it is possible to limit access to a subsample. The only valid statements are assignments and conditional assignments. Statements have to be terminated with a semicolon.

If an error occurs while executing the algebra code, the processing will be aborted, and control returned to OxMetrics. All successful statements up to that point will have been executed, with corresponding changes to the database. Take this into account when rerunning the corrected code.

There are three ways of running Algebra code, as discussed in the next three sections.

10.2.1 Calculator (Alt+c)

The calculator enables easy manipulation of the variables in the database, and is a convenient way to write algebra expressions.

The aim is to build a valid algebra expression in the expression window (without the assignment part and the terminating semi-colon). All successful transformations are logged in the Results window as Algebra code.

10.2.2 Algebra Editor (Alt+a)

The Tools/Algebra Editor command enables you to transform the database variables by typing mathematical formulae into an editor. The algebra code can be saved and reloaded. The Algebra editor also allows for choosing the database to which the code is applied.

10.2.3 Algebra from Results windows (Ctrl+a)

A text selection containing Algebra code can be run directly from that window using the Edit/Run as Algebra command (or Ctrl+a as short cut). The code is applied to the currently active database. The current database is changed by setting focus to that database, or using any of the drop down boxes (calculator, algebra editor, graphics), or using the usedata() batch command.

10.2.4 Algebra from a Batch file

Algebra code can be specified in a Batch program, see Ch. 11, and specifically §11.5.1. An Algebra file can also be loaded into a batch file (§11.5.18).

10.3 Syntax of Algebra language

10.3.1 Variables and variable names

Names are made up of letters and digits. The first character must be a letter or an underscores (_). Subsequent characters must be a letter, underscore, digit, or one of the four symbols $, #, %, @. A #can be followed by a date, so II#1990(1) and II#2020-12-31 are valid names. If a variable name does not follow these rules, it must be enclosed in double quotes in Algebra expressions.

Valid names are CONS, cons, cons_1, _a_1_b, etc. Examples of invalid names are ΔCONS, 1-CONS, log(X), etc. Invalid names may be used in Algebra when enclosed in double quotes, so

"log(X)" = log(X);

is valid as long as the variable called X exists. Algebra is case-sensitive: CONS and cons refer to different variables. When you create a new variable through an assignment operation, it is immediately added to the database, and initialized to missing values. If necessary the database name will be truncated (longer names are allowed in algebra than in the database).

10.3.2 Comment

Anything between /* and */ is considered comment. Note that this comment cannot be nested. So

aap = CONS + 1; /* comment /* nested comment */ */

leads to a syntax error. Everything following // up to the end of the line is also comment.

10.3.3 Constants

A constant is a number which is used in an expression. Examples are 1, 1.2, .5, -77000, -.5e-10 and 2.1E-12.

10.3.4 Algebra operators

10.3.4.1 Arithmetic operators

The binary arithmetic operators are ^4 , *, /, +, -. The ^4 is the power operator; CONS^4 2 raises the variable CONS to the power two. All computations are double precision floating point arithmetic. The precedence order is: ^4, unary + and unary -, then *, /, and finally +, -. An example of unary minus is: x = -1. Any arithmetic operation involving missing values returns a missing value.

10.3.4.2 Relational and logical operators

The relational operators are <, <=, >, >=, standing for `less', `less or equal', `greater', `greater or equal'. The equality operators are == and !=, `is equal to' and is `not equal to'. These are ranked below the relational operators in precedence. Ranked yet lower are logical AND (&&) and logical OR (||). The unary negation operator ! has the highest precedence. An example of unary negation is:

new = !season();

new will have a 0 where the season function returns a 1, and vice versa. Boolean arithmetic is also done in floating point. The numeric result is 1. for an expression that evaluates to true, and 0. for one that is false. An expression with value 0 is false, whereas a value that is not 0 is true.

Missing values are treated as follows:

a op b	result when a and/or b is a missing value
`<`, `<=`, `>`, `>=`	`FALSE` if a and/or b is missing
`==`	`TRUE` if both a and b are missing
`!=`	`TRUE` if either a or b is missing (but not both)
`&&`, `\|\|`	no special treatment of missing values

10.3.4.3 Algebra operator precedence

Table Table:10.1 lists the operator precedence, with the highest precedence at the top of the table.

Table:10.1 Algebra operator precedence

Symbol	Description	Associativity
`^4`	Power	Left to right
`!`	Logical NOT	Right to left
`-`	Unary minus
`+`	Unary plus
`*`	Multiply	Left to right
`/`	Divide
`+`	Add	Left to right
`-`	Subtract
`<`	Less than	Left to right
`<=`	Less than or equal to
`>`	Greater than
`>=`	Greater than or equal to
`==`	Equal	Left to right
`!=`	Not equal
`&&`	Logical AND	Left to right
`\|\|`	Logical OR	Left to right

10.3.5 Assignment statements

The assignment operator is the = symbol. Assignment statements have to be terminated by a semicolon (;). If the variable to which a result is assigned does not yet exist, it is created and added to the database. Otherwise the existing variable is overwritten. Note that assignment expressions are vector expressions: all observations will be overwritten (it is possible to restrict assignment to a subsample using the insample function). Some valid assignment statements are:

cons2 = 2 * CONS - 1;
cons3 = CONS / 3 - (cons2 + 5) * 1.55;
Seasonal = season();

The last statement constructs a variable called Seasonal, which is 1 in period 1, and 0 otherwise. This variable will be used in further examples in this chapter.

Take care not to confuse the = and == symbols: = assigns, while == compares. Parentheses may be used in expressions in the usual way; in the example above (cons2 + 5) * 1.55 evaluates to cons2 * 1.55 + 5 * 1.55.

10.3.6 Conditional assignment statements

The conditional assignment expression has the following form:

TestExpression ? TrueExpression : FalseExpression

First the TestExpression is evaluated; if it is true (not 0.), then TrueExpression will be evaluated, else FalseExpression. Let us consider an example involving the Seasonal variable created above, which is 1 in the first quarter, and 0 in the other quarters. The statement

new = Seasonal ? CONS : 0;

can be read as follows: the variable new will get the value of CONS when Seasonal is true (that is, when Seasonal is not 0., so in the first quarter of each year), else it will get the value 0. In this case, the same result could have been reached with:

new = Seasonal * CONS;

However, if we had used a seasonal with the value 2 in the first quarter, and the rest zeros, then only the conditional assignment would have given the desired result (since 2 is not false, and hence true).

Note that the `: FalseExpression' part is optional. For example:

new = Seasonal ? CONS;

is a valid conditional assignment statement. Now the variable will have the value of CONS for the first quarter, but the other observations of new are not touched (so if new is created by this expression, it will contain missing values for quarters 2, 3 and 4, making the variable unusable for modelling purposes). Another example:

new = (CONS == MISSING) ? 0 : CONS;

This new variable takes the values of CONS, replacing missing values by zeros.

10.3.7 Indexing

It is possible to index a particular observation in a variable using rectangular brackets. For example:

CONS_1 = CONS[-1];

There are three methods of indexing:

Relative to the current observation.
Each Algebra statement can be seen as running within a loop over all observations. The relative index is offset to the current observation in the loop. So
```
CONS_1 = CONS[-1];
```
can be interpreted as:
```
for (t = first observation; t <= last observation; t = t + 1)
    CONS_1[t] = CONS[t - 1];
```
Consequently, this statement has the same effect as
```
CONS_1[0] = CONS[-1];
```
By absolute date
In this case the index has the form year(period), for example
```
X = INC + CONS[1980(1)];
y[1955(1)] = 12;
```
By absolute index
In this case the index has the form index(0), which references observation index (with the first observation having index 0!). For example
```
X = INC + CONS[0(0)];   // add first CONS value
```

10.3.8 Keywords

The following keywords are reserved by algebra:

Table:10.2 Reserved Algebra keywords

keyword	value
`FALSE`	0.
`TRUE`	`!FALSE`
`FREQUENCY`	data frequency
`MISSING`	the missing value
`PI`	Pi (3.1415..)
`NOBS`	number of observations

10.4 Algebra implementation

Each Algebra statement has an implicit loop over the sample size around it. E.g.

LCONS = log(CONS);
meanLCONS = mean(LCONS);

is equivalent to the following pseudo code:

for (t = first observation; t <= last observation; t = t + 1)
    LCONS[t] = log(CONS[t]);
for (t = first observation; t <= last observation; t = t + 1)
    meanLCONS[t] = mean(LCONS);

So meanLCONS will have the same value for every observation.

By default, the loop is over all the observations of the database. However, the insample function can restrict the loop to a subsample.

Implementing transformations in Ox is different: there, variables are extracted from the database as column vectors and operations work on the vector as a whole. Implementing the same code in Ox:

#include <oxstd.h>
#import <database>
main()
{
    decl db = new Database();
    db.Load("data.oxdata");
    db.Append(log(db.GetVar("CONS")), "LCONS");
    decl mean = meanc(db.GetVar("LCONS"));
    db.Append(constant(mean, db.GetSize(), 1), "meanLCONS");
    db.Info();
    delete db;
}

Note that the mean of a vector is just a single number, so creating a vector which has this value for every observation is implemented with the constant function in Ox (and adding this to the database is not very useful).

10.5 Algebra Functions

A large set of functions is provided by algebra. Most of these take both variables and constants as arguments. A function name must be followed by parentheses, even if it does not take any arguments. See the Algebra function list of Table Table:10.3 for function definitions. Some examples of statements involving functions are:

lcons = log (CONS);      // takes the natural logarithm of CONS
cons_1 = lag(CONS, 1);                   // lag CONS one period
ccons = exp(log(CONS));             // gives back original CONS
dummy = insample(1979, 1, 1983, 4) ? log(CONS) : 0;
              // Dummy will be log(CONS) for the period 1979(1)
              // to 1983(4), and 0 outside it.
trough = (lag(trough,1) == MISSING || CONS < lag(trough,1))
       ? CONS : lag(trough,1);
              // a trough can also be created with the built in
                                   // algebra function trough()
dummy  = (year() == 1979) ? 1 : 0;  // creates a dummy variable

10.5.1 Differencing and lag functions

A variable is lagged using the lag function,

[Note: In most cases it is better to avoid storing lags in the database: most estimation modules handle lags explicitly, allowing for dynamic analysis of the model.]

and differenced using the diff function. For example, for first lag and difference respectively, as well as second lag and difference:

Y_1 = lag(Y, 1);
DY = diff(Y, 1);
Y_2 = lag0(Y, 2);
D2Y = diff(Y, 2);

This code adopts the naming convention which is used by OxMetrics: append an underscore followed by lag length for a lagged variable, and a prefix `D' for differenced variables. Note that lags should not be created in the database: applications create them during model formulation, so that they can be handled appropriately.

When taking a first difference or lag using diff, dlog, or lag, the first observation becomes missing; for a second difference or lag, the first two observations are missing, etc. The lag0 function sets the missing lags to zero.

Neither function can have an expression as the argument for differencing. So you cannot write diff(diff(Y,1),1) but must write:

DY = diff(Y, 1);
DDY = diff(DY, 1);

Note that this DDY is not the same as D2Y above.

Finally, the dlog(.) function takes the first difference of the (natural) logarithm of the variable.

10.5.2 Implicit functions

A few functions are implicitly defined, so that the corresponding variables do not need to be present in the database. For example, if an expression uses CSeasonal as variable on the right-hand side, but the variable does not exist in the database, it is implicitly translated to the function call cseasonal(0), while CSeasonal_1 becomes cseasonal(1). This also implies to Seasonal.

The implicit call is not made if the variable does exist in the database.

A similar implicit definition is used for indicators of the type DI#, II#, SI#, TI#, QI#, AO#, LS#, LT#, MI#. A date field is expected after the #, while the multiplicative indicator has an aditional variable name after the @ symbol. Examples are:

ls#1973(1) = LS#1973(1);
lt#1973(1) = LT#1973(1);
my#1973(1) = MI#1973(1)@LY;

The indicator functions are defined as:

`DI#`y(p)	1 in y(p), -1 in next period, 0 otherwise,
`II#`y(p)	1 in y(p), 0 otherwise (same as `AO#`),
`SI#`y(p)	1 up to y(p), 0 afterwards,
`LS#`y(p)	0 up to y(p), 1 afterwards,
`TI#`y(p)	...,-4,-3,-3,-1,0 up to y(p), 0 afterwards,
`LT#`y(p)	0 up to y(p), 1,2,3,... afterwards,
`QI#`y(p)	`TI#`y(p) squared,
`MI#`y(p)`@`v	v ×`SI#`y(p).

II#1950(1) is the same as the function call II(1950, 1).

10.5.3 ACF and periodogram functions

The sample autocorrelation function can easily be plotted. To compute the exact values of the ACF, use the following algebra code:

acf_lag = MISSING;
acf_dcons = MISSING;
acf(DCONS, acf_dcons, acf_lag);

The first two lines are required to create the output variables for the acf function. The output from acf is the actual ACF values (here stored in ACF_DCONS), and the lag length (in ACF_LAG).

The periodogram function has a similar syntax, returning the sample periodogram in the second argument, and the frequencies at which it was computed in the third.

10.5.4 Sorting functions

The sort functions change the order of observations in the database and must be handled with care. Note that the trend() function can be used to create a variable which corresponds to the original observation index of the data. So if that variable is sorted together with the variable we wish to sort, it can be used to `unsort' the variable.

♦ sort(arg1)

Sorts the variable arg1 in increasing order (arg1 must be a variable, and cannot be an expression). Returns the value of arg1 after sorting. Suppose the residuals of a regression have been saved in the variable called Residual, then a sorted copy is created as follows:

res = Residual;
sort(res);

♦ _sortby(arg1, arg2)

Sorts the variable arg1 in increasing order, and sorts arg2 accordingly. Returns the value of arg1 after sorting. Both arg1 and arg2 must be variables. Suppose the residuals of a regression have been saved in the variable called Residual, then a sorted residual is created as follows:

index = trend();
tmp = _sortby(Residual, index);
            /* Residual is sorted, and index accordingly */
                             /* tmp is a dummy variable, */
                 /* at this stage identical to Residual. */
        /* Now it is easy to locate outliers in Residual */

The following statement will reset the old ordering (do not rerun the index = trend(); statement, because that will overwrite the index and lose the information on the original ordering):

tmp = _sortby(index, Residual);
                         /* Restores index and Residual, */
               /* at this stage index is equal to Trend. */

♦ _sortallby(arg1)

Sorts the variable arg1 in increasing order, and sorts the whole database accordingly. Returns the value of arg1 after sorting. This function will be most useful with cross-section data; for example, to push missing values to the end:

exclude =
   (Var1 == MISSING || Var2 == MISSING || Var3 == MISSING);
   /* exclude is 1 for each observation where any of the
        three variables has a missing value, 0 elsewhere */
tmp = _sortallby(exclude);
       /* the observations without any missing values will
         precede those with missing values, making it easy
                      to exclude them from the regression.
      An index variable can be used to restore the order */

10.5.5 Smoothing functions

The smoothing functions which are available in scatter plots can also be accessed from algebra.

10.5.5.1 Hodrick--Prescott filter

The Hodrick--Prescott is a filter which is popular in macro-economics, and virtually identical to a natural cubic spline (§8.10.2). The syntax is:

var_dest = smooth_hp(var,alpha,var_dest);

where var is the variable to be smoothed, alpha is the bandwith (use 0 for the default of 1600), and var_dest is the destination variable (must be different from var). For example:

hpCONS = smooth_hp(CONS, 0, hpCONS);

which creates hpCONS through the assignment statement, and uses it as the destination for the fitted values from the filter.

If there are missing values in the data series, smooth_hp uses data starting from the first valid observation, and stopping at the first missing value thereafter (this is unlike the smoothing functions of the next section, which will skip over missing values, using the maximum sample). The sample used can be restricted using the insample function.

The bandwidth parameter associated with the Hodrick--Prescott filter is 100 (freq)²:

annual data	100
quarterly data	1600
monthly data	14400

10.5.5.2 Kernel and spline smoothing

The kernel based smoother (using the Epanechnikov kernel, §8.10.1), and the natural cubic spline (§8.10.2) are computed respectively by:

var_dest = smooth_np(var,alpha,var_dest);
var_dest = smooth_sp(var,alpha,var_dest);

where var is the variable to be smoothed, var_dest is the destination variable (must be different from var), and alpha is the bandwith, used as follows:

0	use automatic bandwidth selection based on generalized cross validation;
<0	the absolute value is used for the bandwidth;
>0	specifies the equivalent number of parameters to be used.

These functions will use all available observations (unless restricted by the insample function), and will fill in missing values using the fit from the smoother.

10.5.5.3 Exponentially-weighted moving average and correlation

A simple exponentially-weighted moving average (see §8.13) of a series y can be created using:

    ysmo = ewma(y, 0.5, 0, ysmo);

Here α=0.5. An extended version introduces a slope coefficient β, and is also called Holt's method, e.g. with β=0.1:

    ysmo = ewma(y, 0.5, 0.1, ysmo);

In both cases, in-sample missing values are replaced by forecasts.

The exponentially weighted moving sample correlation (see §8.14) of two series y and x (both assumed to have expectation zero) can be created by:

    xysmo = ewmc0(x, y, 0.5, xysmo);

Here, λ is set to 0.5.

10.5.5.4 Date and time functions

When a database is dated, there are functions to return components of the current date and time for each observation: day, month, week, hours, minutes, seconds.

The makedate and maketime functions translate year, month, day and hours, minutes, seconds to date and time respectively. These can be added together to create a date with a time.

The dayofweek, isdayofmonth and iseaster functions are needed when determining holidays, for example to delete all holidays and weekends from a database. The following code, supplied as holiday_us.alg in the algebra folder, determines all the official US holidays:

hol_us_fix =                   /* official US holidays: fixed dates */
       (month()==1 && day()==1)                /* New Year's Day    */
    || (month()==7 && day()==4)                /* Independence Day  */
    || (month()==11 && day()==11)              /* Veterans Day      */
    || (month()==12 && day()==25);             /* Christmas Day     */
hol_us_fix = hol_us_fix                 /* fixed dates moved to Fri */
    || (month()==12 && day()==31 && dayofweek()==6)
    || (month()==7 && day()==3 && dayofweek()==6)       
    || (month()==11 && day()==10 && dayofweek()==6) 
    || (month()==12 && day()==24 && dayofweek()==6);    
hol_us_fix = hol_us_fix                 /* fixed dates moved to Mon */
    || (month()==1 && day()==2 && dayofweek()==2)
    || (month()==7 && day()==5 && dayofweek()==2)       
    || (month()==11 && day()==12 && dayofweek()==2) 
    || (month()==12 && day()==26 && dayofweek()==2);    
hol_us_flt =                /* official US holidays: floating dates */
       isdayofmonth(1,2,3) /* 3rd Mon in Jan: Martin Luther King day*/
    || isdayofmonth(2,2,3) /* 3rd Mon in Feb: Washington's Birthday */
    || isdayofmonth(5,2,-1)/*last Mon in May: Memorial Day          */
    || isdayofmonth(9,2,1) /* 1st Mon in Sep: Labor Day             */
    || isdayofmonth(10,2,2)/* 2nd Mon in Oct: Columbus Day          */
    || isdayofmonth(11,5,4);/*4th Thu in Nov: Thanksgiving          */

UK holidays also require the iseaster function:

hol_uk =                     /* Bank holidays for England and Wales */
       (month()==1 && day()==1)                            /* Jan 1 */
    || (month()==1 && day()==2 && dayofweek() == 2) /* Jan 1 on Sun */
    || (month()==1 && day()==3 && dayofweek() == 2) /* Jan 1 on Sat */
    || iseaster(-2)                                  /* Good Friday */
    || iseaster(1)                                 /* Easter Monday */
    || isdayofmonth(5,2,1)                /* Early May Bank Holiday */
    || isdayofmonth(5,2,-1)                  /* Spring Bank Holiday */
    || isdayofmonth(8,2,-1)                  /* Summer Bank Holiday */
    || (month()==12 && day()==25)                  /* Christmas day */
    || (month()==12 && day()==26)                     /* Boxing day */
    || (month()==12 && day()==27 && dayofweek() == 3)/* Xmas on Sun */
    || (month()==12 && day()==27 && dayofweek() == 2)/* Xmas on Sat */
    || (month()==12 && day()==28 && dayofweek() == 3);/*Xmas on Sat */

10.6 Algebra function summary

Available functions are listed in Table Table:10.3. Any function operating on missing values returns a missing value. Any function which fails also returns a missing value. Where the argument is var, it must be just a variable name, e.g. one = 1; cum(one); is allowed, but cum(1) is not. Similarly: lag(CONS+1,1) is not allowed, while lag(CONS,1)+1 is.

The indicator functions DI(), ..., QI() for fixed frequency are, e.g. II(2020,3), and for a dated database: II(2020-03-23, 0). The date refers to the point at which the underlying process changes.

Several examples of Algebra code were given above; for convenience these are summarized here:

    LX = log(X);
    DX = diff(X, 1);
    "log(X)" = log(X);
    aap = CONS + 1; /* comment /* nested comment */ */
    new1 = !season();
    cons2 = 2 * CONS - 1;
    cons3 = CONS / 3 - (cons2 + 5) * 1.55;
    Seasonal = season();
    new2 = Seasonal ? CONS : 0;
    new3 = Seasonal * CONS;
    new4 = Seasonal ? CONS;
    new5 = (CONS == MISSING) ? 0 : CONS;
    CONSlag1 = CONS[-1];
    X = INC + CONS[1980(1)];
    y[1955(1)] = 12;
    X = INC + CONS[0(0)];   // add first CONS value

Table:10.3 Algebra functions

Chapter 11 Batch Language

11.1 Introduction

The Batch language gives some control over OxMetrics through a command language. This allows for automating repetitive tasks, or as a quick way to get to a certain point in your analysis. Other programs (such as PcGive, STAMP and X12arima) extend this batch language with their own module-specific commands.

OxMetrics allows you to save the most recent model from the current module as a batch file (if supported by the module). If a model has been created interactively, it can be saved as a batch file for further editing or easy recall in a later session. This is also the most convenient way to create a batch file.

11.2 Executing Batch commands

If an error occurs while executing the Batch commands, the processing will be aborted, and control returned to OxMetrics. All successful statements up to that point will have been executed.

There are five ways of running Batch commands, as discussed in the next four sections.

11.2.1 Batch Editor (Alt+b)

The Tools/Batch Editor command activates the edit window in which you can edit/load/save a set of batch commands. The file extension used for batch files is .FL.

11.2.2 Batch from Results windows (Ctrl+b)

A text selection containing Batch commands can be run directly from that window using the Edit/Run as Batch command (or Ctrl+b as short cut).

11.2.3 Batch from the File/Open command

If you use File/Open, and set the file type to Run Batch file (*.fl), then the batch file is run immediately.

11.2.4 Batch from the Windows Explorer

You can double click on a .FL file in the Windows Explorer to run the file directly. If OxMetrics is not active yet, it will be started automatically.

11.2.5 Batch from a Batch file

A batch file can be called from another batch file, see §11.5.19.

11.3 Batch files and default folders

When a batch file is run, the working folder is set to that of the batch file. This allows data and algebra files which reside in the folder of the batch file to be found, even if the batch file is run from somewhere else. In addition the chdir command allows specifying a new default directory. Finally, paths to data files can be hard coded, but that is to be avoided if possible.

When a batch file is loaded in the batch editor, and then run, there will not be an associated default folder. However, the process of opening the batch file would set the default folder to that of the batch file.

11.4 General Batch command summary

Table Table:11.1 gives an alphabetical list of the OxMetrics batch language statements.

Table:11.1 Batch command summary

algebra {...}

appenddata("filename", "group"="");

appresults("filename");

break;

chdir("path");

closedata("databasename");

closedrawwindow();

command("command_line");

database("name", year1, period1, year2, period2, frequency);

draw(area, "y", "mode"="");

drawf(area, "y", "function", d1=0, d2=0);

drawptext(area, "text", "x", "y", size=300, rot=0);

drawtext(area, "text", "x", "y", size=300);

drawtitle(area, "text", "pos", size=300);

drawx(area, "y", "x", "mode"="");

drawz(area, "y", "x", "mode"="");

exit;

loadalgebra("filename");

loadbatch("filename");

loadcommand("filename");

loaddata("filename");

loadfile("filename");

loadgraph("filename");

module("name");

package("packagename", "modeltype"="");

print("text");

printdate;

println("text");

savedata("filename");

savedrawwindow("filename", "window"="");

saveresults("filename");

setdraw("option", i1=0, i2=0, i3=0, i4=0, i5=0);

setdrawwindow("name");

show;

usedata("databasename", i1=0);

11.4.1 Comment

Anything between /* and */ is considered comment. Note that this comment cannot be nested. Everything following // up to the end of the line is also comment.

11.4.2 Command types

There are two types of batch commands:

function calls (with or without arguments) terminated by a semicolon. An example is the loaddata command.
block commands, which are followed by statements between curly brackets. An example is the algebra command.

11.4.3 Default arguments

Table Table:11.1 shows two instances of default arguments, in the appenddata command, and in the setdraw command. For example, when omitting the last argument as in

    appenddata("data.oxdata");

the command is actually interpreted as:

    appenddata("data.oxdata", "");

11.5 Batch commands

In the following list, function arguments are indicated by words, whereas the areas where statement blocks are expected are indicated by .... Examples follow the list of descriptions. For terms in double quotes, the desired term must be substituted and provided together with the quotes. A command summary is given in Table Table:11.1. Module specific commands are documented with each module (so, for PcGive batch commands consult Volume I of the PcGive books).

11.5.1 `algebra` {...}

Contains the algebra code to execute. The block of algebra should be enclosed in curly braces, for example:

algebra
{                           // Create SAVINGSL in database test
    SAVINGSL = lag(INC,1) - lag(CONS, 1);
    y = log(Y);
}

11.5.2 `appenddata("`filename`"`, `"`group`"=""`);

Appends the data from the named data file to the existing in-memory database. If no database exists yet, the named data file will be opened as in the loaddata() command. If the second argument is omitted, or equal to "" (that is, no group name is specified) the whole file will be appended, otherwise only the named group.

11.5.3 `appresults("`filename`"`);

Append the contents of the Results window to the named file and clear the Results window.

11.5.4 `break;`

Stop the batch file, return to OxMetrics menus. While a batch file is running, there is no way of stopping it other than this batch command.

11.5.5 `chdir("`path`"`);

Sets the current working folder to the specified path. There are three special forms of this command:

chdir("#batch") sets the working folder to that of the batch file;
chdir("#home") sets the working folder to that of the OxMetrics.exe executable file (but if it ends in \bin that will be deleted). If OxMetrics.exe is in its default folder \Program Files\OxMetrics\bin, then this command will set the working folder to \Program Files\OxMetrics.
chdir("") prints the current working folder.

11.5.6 `closedata("`databasename`"`);

Closes the database, which is currently loaded in OxMetrics. The database is closed without saving, so any changes which have been made since the last save are lost.

11.5.7 `closedrawwindow();`

Closes the batch draw window.

11.5.8 `command("`command_line`"`);

Sends the line of text directly to the active module. The module must be able to accept commands (either via batch, or as an input console module).

11.5.9 `database("`name`"`, year1, period1, year2, period2, freq`);`

Creates a database with the specified name and frequency, and sample period year1 (period1) to year2 (period2).

11.5.10 `draw(area, "`y`"`, `"`mode`"="");`

Graphs variable y from the current database in the requested area (the first area has number 0). If the second argument is missing, the graph is a standard time-series plot. For other types of graphs, mode can be one of:

`"bar"`	draw bars
`"diff"`	take first differences of y
`"dlog"`	growth rates (first difference of natural logarithm)
`"index"`	draw index line
`"log"`	take natural logarithms of y
`"logscale"`	take natural logarithms and show on (natural) log scale
`"match"`	match in variables area by means and ranges to first variable
`"shading"`	draw shading
`"startat100"`	scale the variable to start at 100.
`"symbol"`	draw symbols as well as the line
`"twoscale"`	as "match", but showing second variable in area on right-hand axis
`"seasonal"`	draw seasonal sub-plot

11.5.11 `drawf(area, "`y`"`, `"`function`"`, d1`=0`, d2`=0);`

Graphs a function of the variable y from the current database in the requested area (the first area has number 0). The function argument can be one of:

`"acf"`	autocorrelation function with specified lag length
`"boxplot"`	boxplot
`"cdf"`	estimated distribution (QQ plot against normal)
`"chisq"`	QQ plot against Chi²(d1)
`"cumfreq"`	cumulative frequency count (for non-standard bars, specify the bar count)
`"density"`	estimated density (for non-standard bars, specify the bar count)
`"f"`	QQ plot against F(d1,d2)
`"frequency"`	frequency count (for non-standard bars, specify the bar count)
`"histogram"`	histogram (for non-standard bars, specify the bar count)
`"normal"`	QQ plot against normal with same mean and variance
`"pacf"`	partial autocorrelation function with specified lag length
`"periodogram"`	periodogram
`"spectrum"`	spectral density using specified lag length
`"t"`	QQ plot against student-t(d1)
`"uniform"`	Quantile plot (QQ against uniform)

11.5.12 `drawptext(area, "`text`"`, `"`x`"`, `"`y`"`, size`=300`, rot`=0);`

11.5.13 `drawtext(area, "`text`"`, `"`x`"`, `"`y`"`, size`=300);`

11.5.14 `drawtitle(area, "`text`"`, `"`pos`"`, size`=300);`

Adds text to the specified area.

`"drawptext"`	draws at pixel coordinates
`"drawtext"`	draws at world coordinates
`"drawtitle"`	draws a title at position "" (top), "X", "Y", or "Z"

11.5.15 `drawx(area, "`y`"`, `"`x`"`, `"`mode`"="");`

Graphs variable y against x from the current database in the requested area (the first area has number 0). This is a standard scatter plot, but if the second argument equals "", the graph is a standard time-series plot. For other types of graphs, mode can be one of:

`"alt"`	use alternate style: x,y labels along axes
`"project"`	add regression line with projections
`"regression"`	add a regression line
`"smooth"`	add a cubic spline (automatic bandwidth)

11.5.16 `drawz(area, "`y`"`, `"`x`"`, `"`mode`"="");`

Graphs variable y against x by z from the current database in the requested area (the first area has number 0). This is a standard scatter plot, but the second argument may equal "" for a time-series plot. The mode argument can be one of:

`"bar"`	z is error bar
`"band"`	z is error band
`"fan"`	z is error fan
`"hilo"`	high-low
`"symbol"`	z is symbol size
`"value"`	print values of z

For 3-dimensional graphs, the mode argument can be one of:

`"contour"`	contour from scattered data; x,y,z are vectors
`"contourx"`	contour from tabular data; x,y are vectors, z=z(x,y)
`"contoury"`	contour from tabular data; x,y are vectors, z=z(y,x)
`"points"`	sequence of 3D points
`"surface"`	surface from scattered data; x,y,z are vectors
`"surfacex"`	surface from tabular data; x,y are vectors, z=z(x,y)
`"surfacey"`	surface from tabular data; x,y are vectors, z=z(y,x)
`"trisurface"`	triangulated surface from scattered data; x,y,z are vectors

11.5.17 `exit;`

Stops the batch file and exits OxMetrics.

11.5.18 `loadalgebra("`filename`"`);

Loads and executes the algebra code from the named .ALG file. The extension must be provided. See §11.3 on the default folder.

11.5.19 `loadbatch("`filename`"`);

Loads and executes the batch code from the named .FL file. The extension must be provided. See §11.3 on the default folder.

11.5.20 `loadcommand("`filename`"`);

Sends the contents of the named file to the active module. The module must be able to accept commands (either via batch, or as an input console module).

11.5.21 `loaddata("`filename`"`);

Load the data from the named .IN7 file. To load, a full pathname would sometimes have to be specified, e.g. "c:\mydata\data.oxdata". The extension indicates the data type and must be provided. Inside OxMetrics, the database will then be known as data.oxdata, and will be the current default database. See §11.3 on the default folder.

If the database is already opened, the command works as usedata().

11.5.22 `loadgraph("`filename`"`);

Load the graph from the named file. To load, a full pathname would sometimes have to be specified, e.g. "c:\mydata\data.oxdata". The extension indicates the file type and must be provided (but OxMetrics will always require the .gwg file for loading, so when using loadgraph("fig1.eps"), OxMetrics will look for fig1.gwg.

11.5.23 `module("`name`"`);

Starts the specified module. When the module is already active, it becomes the focus for subsequent module-specific batch commands. Otherwise the module is started first.

11.5.24 `package("`packagename`"`, `"`modeltype`"="");`

Used to specify the package and model type for modelling:

package("PcGive", "Single-equation");

11.5.25 `print("`text`");`

11.5.26 `println("`text`");`

Prints the specified text in the Results window. This can be useful to introduce explanatory notes. The println commands adds a new line after printing the text.

11.5.27 `printdate;`

Prints the current date and time in the Results window. The date and time are pasted into the text in ISO format yyyy-mm-dd hh:mm:ss, so putting the year first and using a 24-hour clock.

11.5.28 `savedata("`filename`");`

Save the current database to the named .IN7 file. The .BN7 file will get the same base name. If files with these names already exist, they will be overwritten!

11.5.29 `savedrawwindow("`filename`"`, `"`window`"="");`

If no window is specified, the command saves the current Batch Graphics window to the named file (or gives an error message `Cannot save to' if there is no Batch Graphics window or the file cannot be saved). Otherwise it saves the named graphics window.

11.5.30 `saveresults("`filename`"`);

Save the contents of the results to the named file and clear the Results window. If a file with that name already exists, it will be overwritten!

11.5.31 `setdraw("`option`"`, i1`=0`, i2`=0`, i3`=0`, i4`=0`, i5`=0`);

This command changes the default settings used in graphics. The easiest way to use this command is to design the layout using the interactive dialogs (Graphics Setup and Postscript Setup). Then use the Write as batch command on both pages of the Graphics Setup dialog to record the batch code in the results window. This can then be pasted to the batch editor, and saved to a file. Or edited, and run directly from the results window.

option	changes	option	changes
`"axis"`	axis fonts/ticks	`"legendfontsize"`	legend font size
`"axisline"`	axis options	`"legendhide"`	legend hiding
`"axisformat"`	format of labels	`"legendresize"`	legend resizing
`"box"`	box and grid	`"line"`	colour line settings
`"bw"`	black&white settings	`"linebw"`	b&w line settings
`"color"`	colour settings	`"margin"`	paper margins
`"colormodel"`	PostScript model	`"palette_max"`	palette light colour
`"default"`	reset default	`"palette_min"`	palette dark colour
`"font"`	font	`"papercolor"`	Paper colour

`"grid"`	grid style	`"printpage"`	PostScript paper
`"histogram"`	bar colours	`"symbol"`	symbol settings
`"legend"`	legend style	`"xystyle"`	cross-plot style

Use of the default option is as follows:

`setdraw("default")`	reset all graphics defaults,
`setdraw("default", 0)`	reset all graphics defaults,
`setdraw("default", 1)`	reset layout, leave line types and colours,
`setdraw("default", 2)`	reset line types and colours, leave layout.

The following table lists the integer arguments for each option, with the range of possible values. If no range is given, the argument is a size in pixel coordinates (see Ch. 12). The default width and precision for axisformat is 8 and 6 respectively.

linetype		papertype		colormodel
solid	0	A4	0	black &white
dotted	1	Letter	1	black, white, gray
dashed	2	user defined	2	gray
long dashes			3	color
user defined

	symboltype
0	solid box	5	none	10	diamond
1	open box	6	line	11	solid diamond
2	plus	7	solid circle	12	cross
3	dash	8	triangle
4	circle	9	solid triangle

option	i1	i2	i3	i4	i5
`"axis"`	fontsize	step	tick
`"axisline"`	no X-line:0--1	no Y-line:0--1	:0--1	no small Y:0--1
`"axisformat"`	width	precision	:0--1	:0--1
`"box"`	box:0--1	X-grid:0--1	Y-grid:0--1
`"bw"`	lineno:0--15	red:0--255	green:0--255	blue:0--255
`"color"`	lineno:0--15	red:0--255	green:0--255	blue:0--255
`"colormodel"`	model:0--3
`"default"`	0,1,2
`"font"`	fontno:0--3	fontsize
`"grid"`	color:0--15	type:0--15
`"histogram"`	inside:0--15	outside:0--15
`"legend"`	boxed:0--1	columns
`"legendfontsize"`
`"legendhide"`	hide:0--1
`"legendresize"`	resize:0--1
`"line"`	lineno:0--15	linetype:0--4	width	on	off
`"linebw"`	lineno:0--15	linetype:0--4	width	on	off
`"margin"`	left	top
`"palette_max"`	index:0--7	red:0--255	green:0--255	blue:0--255
`"palette_min"`	index:0--7	red:0--255	green:0--255	blue:0--255
`"papercolor"`	red:0--255	green:0--255	blue:0--255
`"printpage"`	:0--1	papertype:0--2	X-size	Y-size
`"symbol"`	lineno:0--15	symtype:0--4	size
`"xystyle"`	:0--1	3d along axes:0--1

11.5.32 `setdrawwindow("`name`");`

Sets the name of the graphics window in which the graphs appear. The default is "Batch Graphics".

11.5.33 `show;`

Shows the graph created in batch. Note that graphs from draw/drawf/drawx/drawz commands are not displayed until the show command is issued.

11.5.34 `usedata("`databasename`"`, i1`=0`);

Sets the default database to databasename. The database must already be loaded into OxMetrics. Normally, only the one-argument function would be used, e.g, usedata("ukm1.in7"). If the second argument is 1, the database is marked clean: when it is closed there is no prompt for saving it. This can be useful after a section of algebra, which implements basic transformation for a dataset. This can be used as follows:

loaddata("ukm1.in7");
loadalgebra("ukm1.alg");
usedata("ukm1.in7", 1);

11.6 Examples

We finish with two example batch files. The first uses most non-graphics Batch commands, as well as some commands from the PcGive module.

    database("test", 1950, 1, 2000, 4, 4);   // Create the database
                            // Append the tutorial data set to test
                // note that test has a longer sample then data.oxdata
    chdir("C:\Program Files\OxMetrics8"); // default install folder
                             // it is better to use chdir("#home");
    appenddata("data.oxdata");
    loaddata("data.oxdata");  // Load the tutorial data. Now there are
               // two databases, with data.oxdata the default database
    usedata("test");              // make test the default database
    
    algebra
    {                           // Create SAVINGSL in database test
        SAVINGSL = lag(INC,1) - lag(CONS, 1);
    }
    package("PcGive", "Single-equation");
    system
    {
        Y = CONS;                        // The endogenous variable
        Z = Constant, CONS_1, CONS_2;             // the regressors
    }
    estimate("OLS", 1953, 3, 1992, 3, 8);
                 // Estimate the system by OLS over 1953(2)-1992(3)
                                            // withhold 8 forecasts
    println("Saving results to test.out");                                        
    saveresults("test.out");    // Save the contents of the Results
                         // window to test.out and clear the window
    testsummary;                             // Do the test summary
    println("Appending results to test.out");                                        
    appresults("test.out");   // Append the contents of the Results
                        // window to test.out, and clear the window
    
    break;                         // stop the batch run, remaining
                                   // commands will not be executed
    savedata("newtest.in7");// save test to newtest.in7/newtest.bn7
    exit;                              // Exit OxMetrics and PcGive

The final example uses many graphics batch commands, and is supplied as plots.fl with OxMetrics:

    chdir("#home");
    loaddata("data.oxdata");
    setdrawwindow("Time plots");
    draw(0, "CONS");
    draw(0, "INC","match");
    draw(1, "CONS");
    draw(1, "INC","twoscale");
    draw(2, "CONS","startat100");
    draw(2, "INC","startat100");
    draw(3, "CONS","log");
    draw(4, "CONS","logscale");
    draw(5, "CONS","dlog");
    show;
    
    setdrawwindow("Scatter plots");
    drawx(0, "CONS","INC","alt");
    drawx(1, "CONS","INC","regression");
    drawx(2, "CONS","INC","project");
    drawx(3, "CONS","INC","smooth");
    show;
    
    setdrawwindow("Function plots");
    drawf(0, "CONS","acf", 5);
    drawf(0, "CONS","pacf", 5);
    drawf(1, "CONS","periodogram");
    drawf(2, "CONS","spectrum", 5);
    drawf(3, "CONS","density");
    drawf(3, "CONS","histogram");
    drawf(4, "CONS","cdf");
    drawf(5, "CONS","frequency");
    drawf(6, "CONS","cumfreq", 20);
    drawf(7, "CONS","boxplot");
    drawf(8, "CONS","normal");
    show;
    
    algebra
    {
        z = fabs(INC - 890) * 10;
    }
    usedata("data.oxdata", 1);  // mark as clean
    
    setdrawwindow("Z plots");
    drawz(0, "CONS","","z", "bar");
    drawz(1, "CONS","","z", "band");
    drawz(2, "CONS","","INC", "hilo");
    drawz(3, "CONS","INC","z", "value");
    drawz(4, "CONS","INC","z", "symbol");
    show;
    
    database("3d_scatter", 1, 1, 625, 1, 1);
    algebra
    {   x = (ranu() - 0.5) * 5;
        y = (ranu() - 0.5) * 5;
        z = exp(-x^2-y^2);
    }
    setdrawwindow("3D contour plots");
    drawz(0, "x","y","z", "contour");
    show;
    
    setdrawwindow("3D surface plots");
    drawz(0, "x","y","z", "surface");
    drawz(1, "x","y","z", "trisurface");
    show;

Chapter 12 OxMetrics Graphics

12.1 Graphics paper

Graphs in OxMetrics are drawn on a graphics worksheet, consisting of 10000 by 15000 pixels, with (0,0) in the bottom left corner:

These pixels are virtual and different from screen pixels: the paper is always 10000 ×15000, regardless of the screen resolution or the size on screen.

Positions can be specified in pixel coordinates, as for example (p_x,p_y) =(700,3200). More often it is convenient to use real-world coordinates to map the pixel coordinates into real data values. This is done by specifying an area on the graphics worksheet, and attaching real-world coordinates to it. These areas are allowed to overlap, but need not:

The areas are numbered from left to right and top to bottom, counting starts at zero. Suppose we have set up all areas as being from (x,y)=(0.0,0.0) to (x,y)=(1.0,1.0) (again within each area the origin is the lower left corner). Then we can draw a line through area 2 in two ways:

in real coordinates within an area
1. step 1: select area 2;
2. step 2: move to (0.0, 0.0);
3. step 3: draw a line to (1.0,1.0).
using pixel coordinates on the worksheet
1. step 1: move to pixel coordinates (600,600);
2. step 2: draw a line to pixel coordinates (3600, 3600).

where we assume that (600,600) to (3600,3600) are the pixel coordinates chosen for area 2. Drawing in real-world coordinates has the advantage that it corresponds more closely to our data.

12.2 Creating graphs

A graph is created from data in a database using the Graphics toolbar button, or Model/Graphics. These graphs appear in the window labelled OxMetrics Graphics. Subsequent graphs are added to this window. To start from scratch, close the window, or select and delete each area. To keep the existing OxMetrics graphics window and open a new one for subsequent graphs, use File/New Data Plot Window (note that the menu contents change according to the type of window which is active).

Modules such as PcGive will put their graphs in a window with a different name.

12.2.1 Actual series with optional transformations

plots the actual values of all selected variables against time (or the observation index for undated series), together or each in a separate graph. If there are missing values, these show up as gaps in the line.

Create separate Plots
This creates as many graphs as there are series.
Style
- Lines
- Symbols
- Lines and symbols
- Index line: plot first series as index
- Index line and symbols: plot first series as index
- Bars: plot all series as bars
- Shading: use shading where this variable is 1, no shading otherwise
- First as bar: plot only the first as bar chart
Transformation:
- Logarithms: natural logs of the series: log (y_t)
- Growth rates: log (y_t-y_t-1)
- First differences: Δy_t=y_t-y_t-1
- Seasonal growth rates: log (y_t-y_t-s), s=4 for quarterly data, s=12 for monthly data,
- Seasonal differences: Δ_s y_t=y_t-y_t-s
Use log scale: assumes the data are in logs and powers up the axis to reflect the log scale.

12.2.2 Multiple series with optional transformations

Match series by
- None: no matching is done
- Mean &range matched to first series
- Second series on right scale
- Start = 100
Style: as above
Transformation: as above
Use log scale

12.2.3 Scatter plots

The available scatter-plot types are:

Y against X
Y against X, labels along the axes
Scatter plot with regression line
With cubic spline smooth, automatic bandwidth
All scatter plots

The Style, Regression and Smoothing sections allow additional options and combinations.

If there are missing values, these are omitted from the graphs.

12.2.4 Distribution

Estimated density and histogram, optionally with normal reference
The histogram is a simple graph: the range of x_t is divided into intervals, and the number of observations in each interval is counted. The height of each bar records the number of entries in that interval. In OxMetrics, these are divided by the total number of observations to show the relative frequency (use bar to keep the count). OxMetrics sets a default number of intervals dependent on the sample size, but this can be changed by the user by clicking on Smoothing and Use custom bar count to set a different value.
Estimated distribution against normal: a QQ plot
Frequencies and/or cumulative frequencies
QQ plot against Uniform, normal, t, F, or χ² distribution
Box plot

12.2.5 Time-series: ACF etc.

Autocorrelation function
The sample autocorrelation function (ACF), or correlogram, plots the correlations r̂_j between x_t and successive x_t-j, Also see §8.3. The length of the ACF can be set by the user. Since the correlation between x_t and x_t is always unity, it is not drawn in the graphs.
Partial autocorrelation function
The partial autocorrelation coefficients correct the autocorrelation for the effects of previous lags. So the first partial autocorrelation coefficient equals the first normal autocorrelation coefficient.
Cross-correlation function
The sample cross-correlation function (CCF) graphs the correlation between a series and the lags of another series.
Periodogram
The periodogram and the spectrum (or more accurately here: spectral density) graph the series in the frequency domain. The sample periodogram is based on the coefficients of the Fourier decomposition of the sample autocorrelations (that is, the correlations {r̂_j} between x_t and x_t-j).
Spectral density
The sample spectral density is a smoothed (and scaled) function of the periodogram. It is symmetric between -π and π, and so is only graphed for [0,π] ; 1 on the horizontal axis stands for π, 0.5 for 0.5π, etc. Peaks at certain frequencies can indicate regular (cyclical or seasonal) behaviour in the series.
Seasonal sub-plot
In a seasonal sub-plot, the data are displayed by season. For example, for quarterly data, first the quarter 1 data are graphed, then quarter 2, etc. This helps detecting changing seasonal patterns.

12.2.6 QQ plots

QQ (quantile-quantile), or cross-probability, plots:

Quantile plot: against uniform
QQ plot against normal (same mean, variance)
QQ plot with choice of distribution

As discussed in the companion book on PcGive, statisticians view variables as being described by probability distributions. If X is the variable, and x a value it could take, then

P_x( X>x)

is the probability that the value is in fact greater than x. For example, if the variable is the height of a child, when x is one metre, P_x( X>1) is the probability that the child is taller than a metre. The values of x can be any number, but P_x( ^.) cannot be negative or exceed unity. Plotting P_x( X>x) against x generates an ∫-shaped curve (but more stretched out horizontally), which is hard to interpret.

When X has a uniform distribution over ( 0,1) , if P_x( ^.) is plotted against x in a unit square, the result is a straight line. A similar idea applies to all distributions and QQ plots can be selected so some reference distribution is a straight line with the empirical distribution compared to it.

The QQ plot options page on the Graphics dialog allows for a choice of reference distributions. For the t, F and χ² distribution it is necessary to also supply the degrees of freedom arguments. Drawing a variable against a uniform results in a so-called quantile plot.

12.2.7 Two series by a third

Error bars
Error bands
Error fans
High-low
Show Z values
As symbol size: Bubble chart
A bubble chart consists of a scatter plot, where the symbols are circles, and the size of the circle indicates a third dimension (e.g. market share).

12.2.8 3-dimensional plots

Surface from scatter: X,Y,Z are vectors
Triangulation from scatter: X,Y,Z are vectors
Surface from table: Z,Y columns match
Surface from table: Z,X columns match
3D points
2D contour from 3D scatter surface

With 3-dimensional plots there are two ways of presenting the data: as a scatter of points in three-dimensional space, or as a table with the X and Y values along the side, and the Z values in the cells. The tabular format specifies the 3D surface directly, whereas for the scatter it is left to OxMetrics to work out the surface. Although the tabular format leads to better results, the scatter is easier to use, as it requires only three variables.

To create a surface from a scatter, OxMetrics derives a triangulation, and from the triangulation a smooth surface. This method works best when the surface is smooth.

There are a few additional edit actions available for 3-dimensional graphs:

Rotation
The Edit menu has an entry for rotation: left/right (azimuth, keyboard short-cuts are left-arrow, right-arrow), up/down (elevation, keyboard short-cuts are up-arrow, down-arrow) and side ways (twist). This can be done from the menu for the currently selected 3-d area, or for all graphs. The keyboard short-cuts only work when a 3D area is selected.
Palette
The Style entry for the 3D graph on Edit Graphs allows for a choice of palette.
Contour lines
The same page allows for the addition of contour lines to a surface (but not a triangulation).

12.3 Printing graphs

Select File/Print to print to an attached printer and File/Print Preview to first view the results. The following control of headers and footers is available for printing from File/Print Page Setup:

Orientation: portrait or landscape;
Column and scale are ignored, as the graph will fill the page;
Draw a box;
The size of margin to use;
Draw a header for the graph. This is controlled via the Header and Footer boxes. There are five specials:
- &[date] prints the date,
- &[time] prints the time,
- &[page] prints the page number,
- &[pages] prints the total number of pages,
- &[file] prints the file name.
Other text in the edit field is printed directly.

Other print setup options are printer specific.

Use File/Print Preview to view the result before printing. Note that colours are shown in the preview, but will come out in grey levels (unless you have a colour printer of course).

12.4 Graphics formats

Graphs can be saved to disk in various formats:

PDF file (.pdf), which is the format used to produce all the graphs in this book;
Scalable Vector Graphics (.svg), which can be inserted into web pages, as well as Microsoft Word and Excel using Insert/Pictures;
OxMetrics Graphics File (.gwg);
Enhanced metafile (.emf); These can easily be inserted into Microsoft Word using Insert/Pictures.
Encapsulated PostScript (.eps);
PostScript (.ps), this is like EPS, but defaulting to a full page print;
Portable Network Graphics (.png), which is a bitmap format that may be useful for insertion in web pages if SVG cannot be used;
Ox (.ox and .ox.gwg) code to regenerate the graph can be created by right-clicking on a graph in the document listing and selecting Save Ox code. This can be useful if, after manual adjustments, the graph is to be used as a template for other graphs.

The GWG format is particular to OxMetrics; no other program can read it and no printer can handle it. OxMetrics requires the GWG format, because it needs to be able to allow re-editing of the graph when it is reloaded; the other formats do not store sufficient information.

When you save a graph in any format, the GWG file is automatically saved alongside it. Then, when loading a previously saved PDF file (say), OxMetrics can use the GWG file to reload the graph.

12.5 Saving and loading graphs

Use File/Save As or the diskette toolbar button to save the graph to a file. At the bottom of the dialog is a dropdown listbox in which you select the type of file.

When you save a graph in any format, the GWG file is automatically saved alongside it. Then, when loading a previously saved EPS file (say), OxMetrics can use the GWG file to reload the graph.

Choose File/Open, to open a previously saved graphics file type.

12.6 Saving Ox code of a graph

Right click on the name of a graphics window in the work space gives the option to Save Ox code. This first saves the .gwg file, then uses the gwg2ox package to write the Ox code that recreates the graph.

For Figure ?? the code is (with the data cut off for reasons of space):

// File generated by gwg2ox
// gwg file   : D:\Waste\gwtutg2.ox.gwg
// gwg status : OK
// gwg objects: 25
#include <oxstd.oxh>
#include <oxdraw.oxh>
DoDrawing()
{
    SetDrawWindow("gwtutg2.ox.gwg");
    decl data = {                                             // cut
      "y1":<890.4495849609375,886.5430297851562,886.3287963867188  >,
      "y2":<908.2122802734375,900.6785278320312,899.7952880859375  >,
      "x3":<908.2122802734375,900.6785278320312,899.7952880859375  >,
      "y3":<890.4495849609375,886.5430297851562,886.3287963867188  >,
      "y4":<.,-3.90655517578125,-0.2142333984375,-1.444091796875   >,
      "y5":<.,-7.53375244140625,-0.88323974609375,-1.31378173828125>,
      "x6":<.,-7.53375244140625,-0.88323974609375,-1.31378173828125>,
      "y6":<.,-3.90655517578125,-0.2142333984375,-1.444091796875   >,
      "end": <>
    };
    DrawAdjust(ADJ_AREAMATRIX, -1, -1, 1, 640);
    DrawTMatrix(0,data["y1"],"CONS",1953,1,4,3,2);
    DrawAdjust(ADJ_INDEX, 1, 0);
    DrawTMatrix(0,data["y2"],"INC",1953,1,4,0,3);
    DrawAdjust(ADJ_REGRESSION, 1, 1, 4, 4);
    DrawAxisAuto(0,1);
    DrawAxisAuto(0,0);
    DrawLegend(0,20,20,0);
    DrawXMatrix(1,data["y3"],"CONS",data["x3"],"income",1,2);
    DrawAdjust(ADJ_SYMBOL, PL_FILLBOX, 100);
    DrawAdjust(ADJ_REGRESSION, 4, 1, 4, 4);
    DrawAxisAuto(1,1);
    DrawAxisAuto(1,0);
    DrawLegend(1,20,20,0);
    DrawTMatrix(2,data["y4"],"DCONS",1953,1,4,0,2);
    DrawAdjust(ADJ_COLOR, -1, 12);
    DrawTMatrix(2,data["y5"],"DINC",1953,1,4,0,3);
    DrawAxisAuto(2,1);
    DrawAdjust(ADJ_AXISGRID, 1, 14, 14);
    DrawAxisAuto(2,0);
    DrawAdjust(ADJ_AXISGRID, 1, 14, 14);
    DrawLegend(2,457,870,0);
    DrawXMatrix(3,data["y6"],"DCONS",data["x6"],"DINC",1,2);
    DrawAdjust(ADJ_SYMBOL, PL_CIRCLE, 90);
    DrawAdjust(ADJ_REGRESSION, 20, 0, 4, 4);
    DrawAxisAuto(3,1);
    DrawAxisAuto(3,0);
    DrawLegend(3,20,20,0);
    DrawAdjust(ADJ_AREAMATRIX, 2, 2);
    ShowDrawWindow();
}
main()
{
    DoDrawing();
}

12.7 Editing graphs

Each graph consists of a collection of objects, which in most cases can be manipulated, moved or deleted. The main one, the area, was introduced in §12.1.

12.7.1 Graph layout

An area defines a rectangle on the paper which has world coordinates attached to it. The concept of areas enables multiple graphs on the paper, up to 36. Areas are allowed to overlap.

The Graph Layout page of the Edit Graphics dialog allows for making adjustments that are relevant for the whole graph. Activate this through Edit/Edit Graph, or by double clicking in an area. You can:

Change the Areas layout, e.g. for example from 2 ×1 to 1 ×2.
Box all areas in the current graphics window.
Change the aspect ratio, for example to half-height graphs as used occasionally in this book.
The aspect ratio of the `graph paper' defaults to 1.5 (15000 along the x-axis and 10000 along the y-axis). This usually works very well, but occasionally it may need changing, for example to make the graphs square on paper. Half-size graphs can also look good in publications.
Change margins.
Adjust all styles to make all lines in all the areas thicker or thinner, or
to Increase all font sizes.
Set the PostScript mode: usually plots on screen are colour, but required in gray levels for publication purposes.
Display mode can be changed from colour to black &white. Occasionally, this can be useful when pasting to another application for printing. OxMetrics always starts in colour mode.
Paper colour allows the colour to be set for the (normally) white space around the graphics areas, e.g. to a light grey.

12.7.2 Area layout

By default, an area has `automatic' pixel coordinates, which means that, when an area is added, the pre-existing area will automatically shrink to make space. An area can be selected with the mouse, and moved around the paper. This changes the automatic paper positioning into a fixed position. Real world coordinates are also selected automatically, but can be set to fixed values.

The Area Layout page of the Edit Graphics dialog contains the settings that relate to positioning and layout of each individual area. You can:

change world coordinates: select the desired area and select the Set checkbox for the X or Y coordinates, then adjust the settings. Press All to apply this to all areas.
reset pixel coordinates to automatic or set specific pixel coordinates.
change layout, e.g. for example from 2 ×1 to 1 ×2.
Box the current area, or all areas in the current graphics window.

The Edit menu allows you to rotate 3-dimensional graphs. This applies to the currently selected 3D area, or all 3D areas if none is selected:

Rotate left or right (from the keyboard use Alt< or Alt>; this does not require the shift key, so just hold Alt pressed while typing the < or > key). This canges the azimuth.
Rotate up or down (from the keyboard use Alt- or Alt+). This canges the elevation.
Twist left or right.
Reset the rotation and twist back to the original values.

12.7.3 Variable against time, scatter, or 3D

The title of this section is derived from the variables being plotted, such as CONS x (time) x z or INC x (time).

The variable (or vector) is the main graphics object: it consists of a sequence of observations, graphed against time or another variable. A variable cannot be moved, nor can observations be moved on screen. There are essentially four types of vectors:

variable against time;
variable against another variable;
variable against another variable by a third variable. The third variable can indicate symbol size, have its value printed, or specify error bars.
3D graphs: surface, contour or scatter.

The Lines page of the Graphics Property dialog controls how the vector is displayed. The line type joining the observations can be:

Line: linked,
Symbols: not linked, but showing symbols,
Line and symbols,
Index: this draws a line from the point to the zero axis (if part of the area), or the edge of the area (towards zero). A non zero base can be specified in the Base for index/bars edit field.
Index and symbols,
Bars: this draws a bar to the zero axis (if part of the area), or the edge of the area (towards zero), with the bar centred on the current point. A non zero base can be specified in the Base for index/bars edit field.

Style allows a choice of the colour or pattern for the currently selected entry; the type choice is repeated in the drop-down box. The colour with which lines and symbols are drawn is set in the first column; the line width and pattern is set in the second combo box. In both cases an entry is selected from one of the 16 defined line types. These can be changed in Model/Preferences/Graphics Setup

The observations can be marked by a symbol, and the style and size of that symbol can be set in the third column.

For 3D graphs it is possible to change the colour palette, or add contour lines (but not for a triangulation). The global palette definition is set in Model/Graphics Setup

The entries used in the legend can be changed by renaming the labels of the variables in the X, Y, Z label boxes. If there is no name, that component will be omitted from the legend label for the vector: if all labels are empty, the legend entry will be omitted alltogether.

12.7.3.1 Error bars

Error bar options are only shown if the line object has error bars. Then you can switch between bars, bands and fans, choose a color for the bars/bands. A multiplication factor may also be set for error bars and bands. For example, if the data in the Z variable are standard errors, then a factor 2 will show the data plus/minus 2 standard errors (which would be a 95confidence interval if the data come from a normal distribution).

To rename a label for the selected vector, click on Rename, or double-click on the vector in the list box.

12.7.3.2 Regression, Scale

The Regression, Scale page of the Edit Graphics dialog allows adding regression lines to variables, and adjustment of the scaling (relative to the selected variable):

None removes all scaling.
Means matches the means of all variables to that of the selected variable: the shift factor will be set to make the means of the variables equal.
Ranges: the scale factor will be set to give all the variables the same range. Maximum -- minimum will now be the same for every variable. Adjustment is made relative to the selected variable.
Means and Ranges matches means and ranges. Both the shift and scale factors are set.

The scale and shift factor can also be set directly.

The regression options in this page are:

Number of least squares lines.
By default no least squares lines are drawn; you can select up to 40 lines.
Sequential versus Recursive regression lines.
The default is sequential regression: the X-axis is divided in s sections (where s is the number of regression lines). Each section gets its own regression line. Recursive regression lines will lead to each subsequent section added on to the previous.
With or without Projections; projections shows the deviation between the variable and the regression line.
Style sets the line colour and pattern.

12.7.4 Axes

An X and Y axis (and Z for 3D graphs) is added automatically to each graph. Usually the default suffices, but a large number of adjustments can be made in the Axis entries of the Edit Graphics dialog.

12.7.4.1 Settings for non-default labelling

Each axis consists of (values are in world coordinates):

Minimum and Maximum;
First large tick: the value at which the first large tick mark appears;
Intervals for large ticks: the gap between large tick marks;
Intervals for small ticks: the gap between small tick marks;

These values are all set to a default, but can be changed when Default labelling is unchecked.

12.7.4.2 Location and transformation

The location and transformation settings further position and define the axis:

Anchor: determines where the axis is anchored, e.g. for an X-axis this is a Y value (changing the anchor would move the X-axis up or down). Minimum anchors at the left (bottom) of an area, Maximum at the right (top). Use specified anchor allows anchoring anywhere in the graph. For 3D graphs, the X axis is anchored at an Y and Z value, Y at X,Z and Z at X,Y.
Transformation
The Axis type:

Normal
the default axis without transformation;
Log (data is in logs)
when the data is in natural logarithms, use this to show the original values on the axis;
Log10 (data is in log10)
when the data is in logarithms of base 10, use this to show the original values on the axis;
Use specified shift and scale
allows the shift and scale factors to be set;
Dates
when the values on the axis are to be interpreted as date values.

12.7.4.3 Style

The following adjustments can be made to an axis (all adjustments apply to the current axis only):

hide an axis by checking Hide. (Axes can also be deleted, but when an area has no X or Y axis and is redrawn, that axis will be added.)
No base line removes the line, leaving only tick marks.
No small ticks removes the small tick marks.
Line at y=0 is only for X-axes, and uses the same line colour and type as the grid. By default a line showing zero is added when both negative and positive data occur.
Grid attaches a grid, perpendicular to the selected axis. The grid is at the large tick marks. The grid colour and type is set in Graphics Setup, the default is that of line colour and type 14.
The Tick Size. This sets the small tick size, the large tick size is twice the small tick size.
The Font Size sets the size of the axis labels.

12.7.4.4 Label style

Labels other side puts the labels on the other side of the axis.
Rotate labels writes text of labels along the Y-axis (Y axis), or at a right angle (X-axis).
Centre dates puts the labels in between the tick marks. For example for annual data, the label 1960 is centred at 1960 by default. With Centre dates it is set half way between 1960 and 1961. This option is for X axes only. This does not work when the axis is dated.

Note that you may have as many axes as you wish. Axes can be selected on screen with the mouse.

12.7.5 Legend style

Legends are added automatically in an appropriate size, unless the legend font gets too small. In that case (normally with more than sixteen areas), the legend will be omitted. In addition, a legend can be hidden, and subsequently unhidden.

Legends can be selected with the mouse, and moved around. Deleting the legend results in it being hidden. The available options in the Legend style page of the Edit Graphics dialog are:

Hide, legends are never completely deleted.
Auto resize, by default, legends are reduced when the area gets smaller. When the text gets too small, the legend is hidden.
Boxed draws a box around the legends.
Default box size will be unchecked if the box is resized with the mouse. When it is checked, the box size is left to OxMetrics.
Columns sets the number of columns for the labels; the default is two.
Font size determines the size of the label. This is reduced further when Auto resize is set.
Transparent is the default. In that case, legends are drawn below data plots, with the exception of shading and separately drawn symbols. Opaque legends, on the other hand, are drawn on top of everything else.

12.7.6 Histogram

The only available options for histograms is to select a line color/pattern for the outside line and for the area inside the boxes.

12.7.7 Copy properties to other areas

Some aspects of one area can be copied to all other areas in the graph:

Pixel coordinates: width and height or reset the default;
World coordinates: X or Y;
Axes: labelling, scale tyle or label style;
Legend style;
Histogram style.

12.7.8 Text

Text can be added from the Edit menu using Add Text. Text belongs to an area, in which case it is deleted if the area is deleted (or copied when the area is copied). Text entered outside any area will be part of area 0. More than one line of text can be entered at a time. Text can also be rotated, by specifying the angle in degrees. This does not work so well for multiple line text blocks. Text entered immediately above the graph will have the title property, which means that it is moved with the area when the area is moved.

Text uses a LaTeX-style of formatting, see §12.10.

12.7.9 Lines and symbols

Lines can be added from the Edit menu using Draw. Line coordinates may be specified in world coordinates (the default when the line is drawn inside an area), in which case the line is deleted if the area is deleted (or copied when the area is copied). Alternatively, lines may be specified in pixel coordinates. The appearance of a line can be changed to a box or a solid box.

12.7.10 Adding, moving and deleting objects

Objects are added from the Edit menu. Most objects can be selected with the mouse, and moved and deleted subsequently.

12.7.11 Pointing

The status bar shows the graphics coordinates of the the mouse.

12.7.12 Graphics setup

Persistent setup for graphics is set through the Model/Preferences/Graphics Setup dialog.

12.8 Copy and paste

There is a distinction between use of the clipboard inside OxMetrics, and that for external use. There are three copy commands:

Copy is for internal use.
When no area is selected, the whole graphics window is copied. When an area is selected, only that is copied to the clipboard. These clipboard items are not recognized by other programs.
When a OxMetrics graphics format is on the clipboard, it can be added to an existing graph (no area selected) or to an existing area (which must be selected). Copying in the internal format is delayed, which means that the full copying is not actually performed until a paste is requested. This implies that when a graphics window is closed it is not available for pasting anymore.
Copy Bitmap to Clipboard copies the whole graphics window in bitmap format. This can be pasted into other programs that recognize bitmaps.
Copy Metafile to Clipboard copies the whole graphics window in EMF format, from which is a vector format that can be pasted into other programs.

It is not possible to paste from other programs into OxMetrics graphs, with the exception of text.

12.9 Graphs and sample selection

All types of graphs will use all valid observations, just skipping over missing values.

12.10 Text formatting

Text uses a LaTeX-style of formatting, which allows for complex mathematics be used to annotate graphs. OxMetrics implements a subset of the normal LaTeX commands, and mathematics is typeset using the Symbol font.

Mathematics is enclosed in $...$; superscripts and subscripts are only allowed in math mode. A superscript is written as ^{...}, a subscript as _{...}. For example, $\hat \epsilon _{i-1}^{k+1}$ gives ε̂ _i-1^k+1. Tables Table:12.1 to Table:12.6 list the mathematics symbols which are available in OxMetrics.

The entries in Tables Table:12.7 to Table:12.8 do not require math mode. Note that, to write $ \ _ { } % you must enter \$ \\ \_ \{ \} \% respectively (\\ for a backslash is not standard in LaTeX, where it denotes a line break).

Table:12.1 Greek symbols

α	`\alpha`	θ	`\theta`	o	`o`	υ	`\upsilon`
β	`\beta`	θ	`\vartheta`	π	`\pi`	φ	`\phi`
γ	`\gamma`	ι	`\iota`	π	`\varpi`	φ	`\varphi`
δ	`\delta`	κ	`\kappa`	ρ	`\rho`	χ	`\chi`
ε	`\epsilon`	λ	`\lambda`	ρ	`\varrho`	ψ	`\psi`
ε	`\varepsilon`	μ	`\mu`	σ	`\sigma`	ω	`\omega`
ζ	`\zeta`	ν	`\nu`	σ	`\varsigma`
η	`\eta`	ξ	`\xi`	τ	`\tau`
Γ	`\Gamma`	Λ	`\Lambda`	Σ	`\Sigma`	Ψ	`\Psi`
Δ	`\Delta`	Ξ	`\Xi`	Υ	`\Upsilon`	Ω	`\Omega`
Θ	`\Theta`	Π	`\Pi`	Φ	`\Phi`

Table:12.2 Arrows

\ \	`\arrowext`^*	⇐	`\Leftarrow`	⇒	`\Rightarrow`
↓	`\downarrow`	←	`\leftarrow`	→	`\rightarrow`
⇓	`\Downarrow`	⇔	`\Leftrightarrow`	↑	`\uparrow`
		↔	`\leftrightarrow`	⇑	`\Uparrow`
		←	`\leftarrow\arrowext`
^* indicates that the command is not a standard LaTeX command.

Table:12.3 Mathematics accents

á	`\acute a`	ȧ	`\dot a`	à	`\grave a`	ã	`\tilde a`
a	`\bar a`	ä	`\ddot a`	â	`\hat a`

Table:12.4 Foreign and accented characters

å	`\aa`	æ	`\ae`	ø	`\o`	ß	`\ss`
Å	`\AA`	Æ	`\AE`	Ø	`\O`
á	`\'a`	â	`\^a`	ã	`\~a`	ȧ	`\.a`
à	\`a	ä	`\"a`	a̅	`\=a`

Table:12.5 Mathematical symbols

ℵ	`\aleph`	∃	`\exists`	\|	`\mid`	⌋	`\rfloor`
∠	`\angle`	∀	`\forall`	¬	`\neg`	~	`\sim`
≈	`\approx`	≥	`\geq`	≠	`\neq`	♠	`\spadesuit`
*	`\ast`	>	`\gt`^* or >	∋	`\ni`	⊂	`\subset`
•	`\bullet`	♥	`\heartsuit`	⊕	`\oplus`	⊆	`\subseteq`
∩	`\cap`	ℑ	`\Im`	⊗	`\otimes`	∑	`\sum`
^.	`\cdot`	∈	`\in`	∂	`\partial`	⊃	`\supset`
♣	`\clubsuit`	∞	`\infty`	⊥	`\perp`	⊇	`\supseteq`
≅	`\cong`	∫	`\int`	±	`\pm`	√	`\surd`
∪	`\cup`	⟨	`\langle`	'	`\prime`	∴	`\therefore`
°	`\degree`	⌈	`\lceil`	∏	`\prod`	×	`\times`
♦	`\Diamond`	...	`\ldots`	~	`\propto`	∇	`\triangledown`
♦	`\diamondsuit`	≤	`\leq`	⟩	`\rangle`	∨	`\vee`
/	`\div`	⌊	`\lfloor`	⌉	`\rceil`	∧	`\wedge`
∅	`\emptyset`	<	`\lt`^* or <	ℜ	`\Re`	℘	`\wp`
≡	`\equiv`
^* indicates that the command is not a standard LaTeX command.

Table:12.6 Log-like symbols

arccos	`\arccos`	csc	`\csc`	ker	`\ker`	min	`\min`
arcsin	`\arcsin`	deg	`\deg`	lg	`\lg`	Pr	`\Pr`
arctan	`\arctan`	det	`\det`	lim	`\lim`	sec	`\sec`
arg	`\arg`	dim	`\dim`	liminf	`\liminf`	sin	`\sin`
cos	`\cos`	exp	`\exp`	limsup	`\limsup`	sinh	`\sinh`
cosh	`\cosh`	gcd	`\gcd`	ln	`\ln`	sup	`\sup`
cot	`\cot`	hom	`\hom`	log	`\log`	tan	`\tan`
coth	`\coth`	inf	`\inf`	max	`\max`	tanh	`\tanh`

Table:12.7 Miscellaneous symbols

©	`\copyright`	¶	`\P`	§	`\S`
Euro^*	`\euro`	£	`\pounds`	^tm	`\trademark`^*
f	`\florin`^*	O	`\registered`^*
$	`\$`	\	`\\`	_	`\_`
{	`\{`	}	`\}`	%	`\%`
^* indicates that the command is not a standard LaTeX command.

Table:12.8 Fonts and sizes

bold	`\bf`	roman	`\rm`	small	`\small`
bold italic	`\bi`	sans serif	`\sf`	large	`\large`
font 0 to 9	`\f0` -- `\f9`	`typewriter`	`\tt`	normalsize	`\normalsize`
italic	`\it`

Chapter 13 OxMetrics Data Management

13.1 Creating data

File/New allows for the creation of a new database. All variables in a database have the same frequency. A sample period is required, but can be extended or changed later.

13.2 Database font

View/Increase Text Size and View/Decrease Text Size are useful to select a smaller or larger font size for displaying the database on screen.

13.3 Database description

A database can have a description for documentation purposes, and separate from the variable description. Only .in7/.bn7 data files can preserve the description.

13.4 Printing data

Use File/Print when a database has the focus to print the database. File/Print Preview is available to view the result before printing.

Select File/Print to print to an attached printer and File/Print Preview to first view the results. The following control of headers and footers is available for printing from File/Print Page Setup:

Orientation: portrait or landscape;
Number of columns on the printed page;
Scale for printing: 100 is full scale, 90 smaller, etc.;
Draw a box;
The size of margin to use;
Write a header and footer. This is controlled via the Header and Footer boxes. There are five specials:
- &[date] prints the date,
- &[time] prints the time,
- &[page] prints the page number,
- &[pages] prints the total number of pages,
- &[file] prints the file name.
Other text in the edit field is printed directly.

13.5 Data formats

The data format specific to OxMetrics has the .oxdata extension. In addition, OxMetrics can read and write human-readable files, Excel spreadsheet files, comma-separated files, and Stata files.

13.6 Summary statistics

This command prints the name of the database, the sample period and number of variables to the Results window. For each variable in the database the name is printed as well as:

leading sample: -- the first valid sample period without missing values,
#obs: -- number of valid observations,
#miss: -- number of observations with missing values,
minimum: -- minimum value,
mean: -- mean of valid observations, see (eq:8.1),
maximum: -- maximum value,
std.dev: -- standard deviation of valid observations, see (eq:8.1).

13.7 Saving data

Use File/Save As or the diskette toolbar button to save the data to a disk file. At the bottom of the dialog is a dropdown listbox in which you select the type of file.

13.8 Navigation and editing

A database cursor indicates which is the current observation. The cursor can be moved from the keyboard or using the mouse. A block of observations can be selected (highlighted) for:

copying to the clipboard;
as a destination for a block from the clipboard;
to set to a single value: press Enter to specify the value, whence you will be prompted whether the value should be applied to the whole selection.

Press Enter or double click on an observation to edit the value. Single click on a cell with the cursor will give access to the inline editor. Algebra and the Calculator allows manipulation of variables through mathematical expressions.

13.9 Renaming variables

Press Enter or double click on the variable name in the database to change the name or documentation. When in the Calculator or the Algebra Editor, click Ren when a variable is selected in the list box to rename.

13.10 Deleting variables

Click on the variable name in the database to select the variable (or a group of variables), then press the Del key to delete (confirmation of the deletion will be required). When in the Calculator, press the Del buttonwhen a variable is selected in the list box to delete, confirmation will be required again.

13.11 Reordering variables

The variables listbox in the Calculator and Algebra Editor allows variables to be picked up with the mouse and moved down or up in the listbox.

13.12 Adding variables

Double click on an empty field in the database next to an existing variable to create a new variable. Alternatively use Edit/New Variable.

13.13 Extending or reducing the sample period

Double click on an empty field in the database below an existing variable to activate the Extend Database sample dialog. Alternatively use Edit/Extend sample. Enter a negative value to reduce the sample size.

The Change Sample option on the Edit menu allows the following actions:

Add observations at the start;
Add observations at the end;
Delete observations at the start;
Delete observations at the end.

13.13.1 Changing the sample period

The Change Sample option on the Edit menu allows changing the start year and period, as well as the frequency. The number of observations remains unchanged, and no data will be lost.

13.14 Copy and paste

The standard copy and paste facilities are available, also see §13.8.

13.15 Appending data

A database can be appended from disk to an open database, using Append Data File when right clicking on a database in the workspace. This requires the frequencies to match. OxMetrics will make additional adjustments when these are required:

the sample of the open database will be extended to encompass that of the appended database;
if a variable from the appended database already exists in the open database, then the existing variable is updated, except where the appended variable has missing values (for those observations the existing variable is not changed).

13.16 Daily, weekly and timed data

The calendar

[Note: The calendar is Gregorian from 15 October 1582 onwards, and Julian before (so there is no year 0: year -1 precedes year 1; day 0 is on Julian date 1 January -4713). ]

index used by OxMetrics is the Julian day number. For example, Julian day 2453402 corresponds to 2005-01-31. An optional fractional part specifies the fraction of the day: 2453402.75 corresponds to 2005-01-01T18:00. If the day number is zero, it is interpreted as a time only, so 0.75 is just 18:00 (6 PM).

[Note: This is similar to how Excel stores dates and times.]

OxMetrics uses the ISO representation yyyy-mm-dd, and the 24 hour clock for time, hh:mm:ss:.hhh. A date with a time are joined by the letter T, as shown above.

In many fields it is possible to enter a date or time, which is then automatically translated to the internal representation. Years with week number are then also recognised, e.g. 1976-W3 returns the calendar index for the Monday of week 3 in 1976. (Week 1 is the first week that contains the first Thursday; or equivalently, the week that contains 4 January.)

OxMetrics uses two methods for dating a database:

Fixed frequency: Denoted by a start year, start period and frequency. From this, the date of any subsequent observation can be worked out. For example, if the start year is 2005, the start period 1, and the frequency 4 (quarterly), then the sixth observations is for 2006(2), the second quarter. This works well for monthly, quarterly and annual data. For undated data we simply set the start year and period to one, and the frequency as well.
Dated: The fixed frequency method does not work for weekly or daily data: not every year has the same numbers of observations. The solution adopted in OxMetrics is as follows: an integer value represents a date. A database variable can be classified as of type date, in which case the date is displayed, rather than the underlying value.

The following criteria must be satisfied for a database to be dated:

the first column must be of type Date,
the first column holds date values,
the optional fractional part of this indicates time as a fraction of 24 hours; e.g. 2453372.75 is 18:00 on 2005-1-1 (2005-1-1T18:00 in ISO notation),
the first and last observation must be valid, i.e. cannot be missing.

Note that the fixed sample period is still available for a dated database, but this is usually set as for an undated database (start year, start period, frequency all set to one).

Also note that the first variable in a dated database (which defines the calendar dates of the observations) is a variable as any other: it can be renamed, transformed, moved, deleted, etc. But some of these will make the database undated again.

References

Davidson, J. E. H., D. F. Hendry, F. Srba, and J. S. Yeo (1978). Econometric modelling of the aggregate time-series relationship between consumers' expenditure and income in the United Kingdom. Economic Journal 88, 661--692. Reprinted in Hendry, D. F., Econometrics: Alchemy or Science? Oxford: Blackwell Publishers, 1993, and Oxford University Press, 2000; and in Campos, J., Ericsson, N.R. and Hendry, D.F. (eds.), General to Specific Modelling. Edward Elgar, 2005.

Doornik, J. A. (2006). The role of simulation in econometrics. In T. Mills and K. Patterson (Eds.), Palgrave Handbook of Econometrics, pp. 787--811. Basingstoke: Palgrave MacMillan.

Doornik, J. A. (2021). Object-Oriented Matrix Programming using Ox (9th ed.). London: Timberlake Consultants Press.

Doornik, J. A. and D. F. Hendry (2021). Interactive Monte Carlo Experimentation in Econometrics Using PcNaive (5th ed.). London: Timberlake Consultants Press.

Doornik, J. A. and K. Juselius (2018). CATS 3: Cointegration Analysis of Time Series in OxMetrics. London: Timberlake Consultants Press.

Doornik, J. A. and M. Ooms (2005). Outlier detection in GARCH models. Discussion paper 2005-w24, Nuffield College.

Doornik, J. A. and M. Ooms (2008). Multimodality in the GARCH regression model. International Journal of Forecasting 24, 432--448.

Engler, E. and B. Nielsen (2009). The empirical process of autoregressive residuals. Econometrics Journal 12, 367--381.

Findley, D. F., B. C. Monsell, W. R. Bell, W. R. Otto, and B.-C. Chen (1998). New capabilities and methods of the X-12-ARIMA seasonal-adjustment program (with discussion). Journal of Business and Economic Statistics 16, 127--177.

Golub, G. H. and C. F. Van Loan (1989). Matrix Computations. Baltimore: The Johns Hopkins University Press.

Granger, C. W. J. (1966). The typical spectral shape of an economic variable. Econometrica 34, 150--161.

Granger, C. W. J. and P. Newbold (1986). Forecasting Economic Time Series, (2nd ed.). New York: Academic Press.

Green, P. J. and B. W. Silverman (1994). Nonparametric Regression and Generalized Linear Models. A Roughness Penalty Approach. London: Chapman and Hall.

Härdle, W. (1990). Applied Nonparametric Regression. Cambridge: Cambridge University Press.

Harvey, A. C. (1993). Time Series Models, (2nd ed.). Hemel Hempstead: Harvester Wheatsheaf.

Hastie, T. J. and R. J. Tibshirani (1994). Generalized Additive Models. London: Chapman and Hall.

Hendry, D. F. and J. A. Doornik (2021). Empirical Econometric Modelling -- PcGive 16 Volume I. London: Timberlake Consultants Press.

Koopman, S. J., A. C. Harvey, J. A. Doornik, and N. Shephard (2013). Structural Time Series Analysis, Modelling, and Prediction using STAMP (5th ed.). London: Timberlake Consultants Press.

Laurent, S. (2021). G@RCH Professional 9 (7th ed.). London: Timberlake Consultants Press.

Makridakis, S., S. C. Wheelwright, and R. C. Hyndman (1998). Forecasting: Methods and Applications (3rd ed.). New York: John Wiley and Sons.

Nielsen, B. (2006). Correlograms for non-stationary autoregressions. Journal of the Royal Statistical Society B 68, 707--720.

Priestley, M. B. (1981). Spectral Analysis and Time Series. London: Academic Press.

Silverman, B. W. (1986). Density Estimation for Statistics and Data Analysis. London: Chapman and Hall.

Part:1 OxMetrics Reference

Chapter 8 OxMetrics Statistics

8.1 Actual series and scatter plots

8.2 Mean, standard deviation and variance

8.3 Autocorrelation function (ACF) or covariogram

8.4 Partial autocorrelation function (PACF)

8.5 Correlogram

8.6 Cross-correlation function (CCF)

8.7 Periodogram

8.8 Spectral density

8.9 Histogram, estimated density and distribution

8.10 Regression lines and smooths

8.10.1 Kernel smooth

8.10.2 Spline smooth

8.11 QQ plot

8.12 Box plot

8.13 Exponentially-weighted moving average (EWMA)

8.14 Exponentially-weighted moving correlation

Chapter 10 Algebra Language

10.1 Introduction

10.2 Executing Algebra code

10.2.1 Calculator (Alt+c)

10.2.2 Algebra Editor (Alt+a)

10.2.3 Algebra from Results windows (Ctrl+a)

10.2.4 Algebra from a Batch file

10.3 Syntax of Algebra language

10.3.1 Variables and variable names

10.3.2 Comment

10.3.3 Constants

10.3.4 Algebra operators

10.3.4.1 Arithmetic operators

10.3.4.2 Relational and logical operators

10.3.4.3 Algebra operator precedence

Table:10.1 Algebra operator precedence

10.3.5 Assignment statements

10.3.6 Conditional assignment statements

10.3.7 Indexing

10.3.8 Keywords

Table:10.2 Reserved Algebra keywords

10.4 Algebra implementation

10.5 Algebra Functions

10.5.1 Differencing and lag functions

10.5.2 Implicit functions

10.5.3 ACF and periodogram functions

10.5.4 Sorting functions

10.5.5 Smoothing functions

10.5.5.1 Hodrick--Prescott filter

10.5.5.2 Kernel and spline smoothing

10.5.5.3 Exponentially-weighted moving average and correlation

10.5.5.4 Date and time functions

10.6 Algebra function summary

Table:10.3 Algebra functions

Chapter 11 Batch Language

11.1 Introduction

11.2 Executing Batch commands

11.2.1 Batch Editor (Alt+b)

11.2.2 Batch from Results windows (Ctrl+b)

11.2.3 Batch from the File/Open command

11.2.4 Batch from the Windows Explorer

11.2.5 Batch from a Batch file

11.3 Batch files and default folders

11.4 General Batch command summary

Table:11.1 Batch command summary

11.4.1 Comment

11.4.2 Command types

11.4.3 Default arguments

11.5 Batch commands

11.5.1 algebra {...}

11.5.2 appenddata("filename", "group"="");

11.5.3 appresults("filename");

11.5.4 break;

11.5.5 chdir("path");

11.5.6 closedata("databasename");

11.5.7 closedrawwindow();

11.5.8 command("command_line");

11.5.9 database("name", year1, period1, year2, period2, freq);

11.5.10 draw(area, "y", "mode"="");

11.5.11 drawf(area, "y", "function", d1=0, d2=0);

11.5.12 drawptext(area, "text", "x", "y", size=300, rot=0);

11.5.13 drawtext(area, "text", "x", "y", size=300);

11.5.1 `algebra` {...}

11.5.2 `appenddata("`filename`"`, `"`group`"=""`);

11.5.3 `appresults("`filename`"`);

11.5.4 `break;`

11.5.5 `chdir("`path`"`);

11.5.6 `closedata("`databasename`"`);

11.5.7 `closedrawwindow();`

11.5.8 `command("`command_line`"`);

11.5.9 `database("`name`"`, year1, period1, year2, period2, freq`);`

11.5.10 `draw(area, "`y`"`, `"`mode`"="");`

11.5.11 `drawf(area, "`y`"`, `"`function`"`, d1`=0`, d2`=0);`

11.5.12 `drawptext(area, "`text`"`, `"`x`"`, `"`y`"`, size`=300`, rot`=0);`

11.5.13 `drawtext(area, "`text`"`, `"`x`"`, `"`y`"`, size`=300);`

11.5.14 `drawtitle(area, "`text`"`, `"`pos`"`, size`=300);`

11.5.15 `drawx(area, "`y`"`, `"`x`"`, `"`mode`"="");`

11.5.16 `drawz(area, "`y`"`, `"`x`"`, `"`mode`"="");`

11.5.17 `exit;`

11.5.18 `loadalgebra("`filename`"`);

11.5.19 `loadbatch("`filename`"`);

11.5.20 `loadcommand("`filename`"`);

11.5.21 `loaddata("`filename`"`);

11.5.22 `loadgraph("`filename`"`);

11.5.23 `module("`name`"`);

11.5.24 `package("`packagename`"`, `"`modeltype`"="");`

11.5.25 `print("`text`");`

11.5.26 `println("`text`");`

11.5.27 `printdate;`

11.5.28 `savedata("`filename`");`

11.5.29 `savedrawwindow("`filename`"`, `"`window`"="");`

11.5.30 `saveresults("`filename`"`);

11.5.31 `setdraw("`option`"`, i1`=0`, i2`=0`, i3`=0`, i4`=0`, i5`=0`);

11.5.32 `setdrawwindow("`name`");`

11.5.33 `show;`

11.5.34 `usedata("`databasename`"`, i1`=0`);