International
Tables for
Crystallography
Volume B
Reciprocal space
Edited by U. Shumeli

International Tables for Crystallography (2006). Vol. B, ch. 3.1, pp. 348-352   | 1 | 2 |
doi: 10.1107/97809553602060000559

Chapter 3.1. Distances, angles, and their standard uncertainties

D. E. Sandsa*

aDepartment of Chemistry, University of Kentucky, Chemistry–Physics Building, Lexington, Kentucky 40506-0055, USA
Correspondence e-mail: sands@pop.uky.edu

Methods for calculating distances and angles and the standard uncertainties in these quantities using the techniques of tensor analysis are described. A Fortran program showing how this tensor formulation can be adapted to computer languages is also given.

3.1.1. Introduction

| top | pdf |

A crystal structure analysis provides information from which it is possible to compute distances between atoms, angles between interatomic vectors, and the uncertainties in these quantities. In Cartesian coordinate systems, these geometric computations require the Pythagorean theorem and elementary trigonometry. The natural coordinate systems of crystals, though, are determined by symmetry, and only in special cases are the basis vectors (or coordinate axes) of these systems constrained to be of equal lengths or mutually perpendicular.

It is possible, of course, to transform the positional parameters of the atoms to a Cartesian system and perform the subsequent calculations with the transformed coordinates. Along with the coordinates, the transformations must be applied to anisotropic thermal factors, variance–covariance matrices and other important quantities. Moreover, leaving the natural coordinate system of the crystal sacrifices the simplified relationships imposed by translational and point symmetry; for example, if an atom has fractional coordinates [x^{1}], [x^{2}], [x^{3}], an equivalent atom will be at [1 + x^{1}], [x^{2}], [x^{3}], etc.

Fortunately, formulation of the calculations in generalized rectilinear coordinate systems is straightforward, and readily adapted to computer languages (Section 3.1.12[link] illustrates the use of Fortran for such calculations). The techniques for these computations are those of tensor analysis, which provides a compact and elegant notation. While an effort will be made to be self-sufficient in this chapter, some proficiency in vector algebra is assumed, and the reader not familiar with the basics of tensor analysis should refer to Chapter 1.1[link] and Sands (1982a[link]).

3.1.2. Scalar product

| top | pdf |

The scalar product of vectors u and v is defined as [{\bf u} \cdot {\bf v} = uv \cos \varphi, \eqno(3.1.2.1)] where u and v are the lengths of the vectors and [\varphi] is the angle between them. In terms of components, [\eqalignno{ &{\bf u} \cdot {\bf v} = (u^{i}{\bf a}_{i}) \cdot (v\hskip 2pt^{j}{\bf a}_{j}) &(3.1.2.2)\cr &{\bf u} \cdot {\bf v} = u^{i}v\hskip 2pt^{j}{\bf a}_{i} \cdot {\bf a}_{j} &(3.1.2.3)\cr &{\bf u} \cdot {\bf v} = u^{i}v\hskip 2pt^{j}g_{ij}. &(3.1.2.4)}%(3.1.2.4)] In all equations in this chapter, the convention is followed that summation is implied over an index that is repeated once as a subscript and once as a superscript in an expression; thus, the right-hand side of (3.1.2.4)[link] implies the sum of nine terms [u^{1}v^{1}g_{11} + u^{1}v^{2}g_{12} + \ldots + u^{3}v^{3}g_{33}.] The [g_{ij}] in (3.1.2.4)[link] are the components of the metric tensor [see Chapter 1.1[link] and Sands (1982a[link])] [g_{ij} = {\bf a}_{i} \cdot {\bf a}_{j}. \eqno(3.1.2.5)] Subscripts are used for quantities that transform the same way as the basis vectors [{\bf a}_{i}]; such quantities are said to transform covariantly. Superscripts denote quantities that transform the same way as coordinates [x^{i}]; these quantities are said to transform contravariantly (Sands, 1982a[link]).

Equation (3.1.2.4)[link] is in a form convenient for computer evaluation, with indices i and j taking successively all values from 1 to 3. The matrix form of (3.1.2.4)[link] is useful both for symbolic manipulation and for computation, [{\bf u} \cdot {\bf v} = {\bi u^{T}} {\bi gv}, \eqno(3.1.2.6)] where the superscript italic T following a matrix symbol indicates a transpose. Written out in full, (3.1.2.6)[link] is [{\bf u} \cdot {\bf v} = (u^{1} u^{2} u^{3}) \pmatrix{g_{11} &g_{12} &g_{13}\cr g_{21} &g_{22} &g_{23}\cr g_{31} &g_{32} &g_{33}\cr} \pmatrix{v^{1}\cr v^{2}\cr v^{3}\cr}. \eqno(3.1.2.7)] If u is the column vector with components [u^{1}, u^{2}, u^{3}], [ {\bi u}^{T}] is the corresponding row vector shown in (3.1.2.7)[link].

3.1.3. Length of a vector

| top | pdf |

By (3.1.2.1)[link], the scalar product of a vector with itself is [{\bf v} \cdot {\bf v} = (v)^{2}. \eqno(3.1.3.1)] The length of v is, therefore, given by [v = (v^{i} v\hskip 2pt^{j} g_{ij})^{1/2}. \eqno(3.1.3.2)] Computation of lengths in a generalized rectilinear coordinate system is thus simply a matter of evaluating the double summation [v^{i}v\hskip 2pt^{j}g_{ij}] and taking the square root.

3.1.4. Angle between two vectors

| top | pdf |

By (3.1.2.1)[link] and (3.1.2.4)[link], the angle [\varphi] between vectors u and v is given by [\varphi = \cos^{-1} [u^{i} v\hskip 2pt^{j} g_{ij}/(uv)]. \eqno(3.1.4.1)] An even more concise expression of equations such as (3.1.4.1)[link] is possible by making use of the ability of the metric tensor g to convert components from contravariant to covariant (Sands, 1982a[link]). Thus, [v_{i} = g_{ij} v\hskip 2pt^{j},\quad u_{j} = g_{ij} u^{i}, \eqno(3.1.4.2)] and (3.1.2.4)[link] may be written succinctly as [{\bf u} \cdot {\bf v} = u^{i} v_{i} \eqno(3.1.4.3)] or [{\bf u} \cdot {\bf v} = u_{i} v^{i}. \eqno(3.1.4.4)] With this notation, the angle calculation of (3.1.4.1)[link] becomes [\varphi = \cos^{-1} [u^{i} v_{i}/(uv)] = \cos^{-1} [u_{i} v^{i}/(uv)]. \eqno(3.1.4.5)] The summations in (3.1.4.3)[link], (3.1.4.4)[link] and (3.1.4.5)[link] include only three terms, and are thus equivalent in numerical effort to the computation in a Cartesian system, in which the metric tensor is represented by the unit matrix and there is no numerical distinction between covariant components and contravariant components.

Appreciation of the elegance of tensor formulations may be enhanced by noting that corresponding to the metric tensor g with components [g_{ij}] there is a contravariant metric tensor [{\bf g}^{*}] with components [g^{ij} = {\bf a}^{i} \cdot {\bf a}\hskip 2pt^{j}. \eqno(3.1.4.6)] The [{\bf a}^{i}] are contravariant basis vectors, known to crystallographers as reciprocal axes. Expressions parallel to (3.1.4.2)[link] may be written, in which [{\bf g}^{*}] plays the role of converting covariant components to contravariant components. These tensors thus express mathematically the crystallographic notions of crystal space and reciprocal space [see Chapter 1.1[link] and Sands (1982a[link])].

3.1.5. Vector product

| top | pdf |

The scalar product defined in Section 3.1.2[link] is one multiplicative operation of two vectors that may be defined; another is the vector product, which is denoted as [{\bf u} \wedge {\bf v}] (or [{\bf u} \times {\bf v}] or [uv]). The vector product of vectors u and v is defined as a vector of length [uv\sin \varphi], where [\varphi] is the angle between the vectors, and of direction perpendicular to both u and v in the sense that u, v and [{\bf u} \wedge {\bf v}] form a right-handed system; [{\bf u} \wedge {\bf v}] is generated by rotating u into v and advancing in the direction of a right-handed screw. The magnitude of [{\bf u} \wedge {\bf v}], given by [|{\bf u} \wedge {\bf v}| = uv \sin \varphi \eqno(3.1.5.1)] is equal to the area of the parallelogram defined by u and v.

It follows from the definition that [{\bf u} \wedge {\bf v} = -{\bf v} \wedge {\bf u}. \eqno(3.1.5.2)]

3.1.6. Permutation tensors

| top | pdf |

Many relationships involving vector products may be expressed compactly and conveniently in terms of the permutation tensors, defined as [\eqalignno{ \varepsilon_{ijk} &= {\bf a}_{i} \cdot {\bf a}_{j} \wedge {\bf a}_{k} &(3.1.6.1)\cr \varepsilon^{ijk} &= {\bf a}^{i} \cdot {\bf a}\hskip 2pt^{j} \wedge {\bf a}^{k}. &(3.1.6.2)}%(3.1.6.2)] Since [{\bf a}_{i} \cdot {\bf a}_{j} \wedge {\bf a}_{k}] represents the volume of the parallelepiped defined by vectors [{\bf a}_{i}, {\bf a}_{j}, {\bf a}_{k}], it follows that [\varepsilon_{ijk}] vanishes if any two indices are equal to each other. The same argument applies, of course, to [\varepsilon^{ijk}]. That is, [\varepsilon_{ijk} = 0,\quad \varepsilon^{ijk} = 0,\ \hbox{ if } j = i \hbox{ or } k = i \hbox{ or } k = j. \eqno(3.1.6.3)] If the indices are all different, [\varepsilon_{ijk} = PV,\quad \varepsilon^{ijk} = PV^{*} \eqno(3.1.6.4)] for even permutations of ijk (123, 231, or 312), and [\varepsilon_{ijk} = -PV,\quad \varepsilon^{ijk} = -PV^{*} \eqno(3.1.6.5)] for odd permutations (132, 213, or 321). Here, [P = +1] for right-handed axes, [P = -1] for left-handed axes, V is the unit-cell volume, and [V^{*} = 1/V] is the volume of the reciprocal cell defined by the reciprocal basis vectors [{\bf a}^{i}, {\bf a}\hskip 2pt^{j}, {\bf a}^{k}].

A discussion of the properties of the permutation tensors may be found in Sands (1982a[link]). In right-handed Cartesian systems, where [P = 1], and [V = V^{*} = 1], the permutation tensors are equivalent to the permutation symbols denoted by [e_{ijk}].

3.1.7. Components of vector product

| top | pdf |

As is shown in Sands (1982a[link]), the components of the vector product [{\bf u} \wedge {\bf v}] are given by [{\bf u} \wedge {\bf v} = \varepsilon_{ijk} u^{i} v\hskip 2pt^{j} {\bf a}^{k}, \eqno(3.1.7.1)] where again [{\bf a}^{k}] is a reciprocal basis vector (some writers use [{\bf a}^{*}, {\bf b}^{*}, {\bf c}^{*}] to represent the reciprocal axes). A special case of (3.1.7.1)[link] is [{\bf a}_{i} \wedge {\bf a}_{j} = \varepsilon_{ijk} {\bf a}^{k}, \eqno(3.1.7.2)] which may be taken as a defining equation for the reciprocal basis vectors. Similarly, [{\bf a}^{i} \wedge {\bf a}\hskip 2pt^{j} = \varepsilon^{ijk} {\bf a}_{k}, \eqno(3.1.7.3)] which completes the characterization of the dual vector system with basis vectors [{\bf a}_{i}] and [{\bf a}\hskip 2pt^{j}] obeying [{\bf a}_{i} \cdot {\bf a}\hskip 2pt^{j} = \delta_{i}\hskip-1pt^{j}. \eqno(3.1.7.4)] In (3.1.7.4)[link], [\delta_{i}\hskip -1pt^{j}] is the Kronecker delta, which equals 1 if [i = j], 0 if [i \neq j]. The relationships between these quantities are explored at some length in Sands (1982a[link]).

3.1.8. Some vector relationships

| top | pdf |

The results developed above lead to several useful relationships between vectors; for derivations, see Sands (1982a[link]).

3.1.8.1. Triple vector product

| top | pdf |

[\eqalignno{ {\bf u} \wedge ({\bf v} \wedge {\bf w}) &= ({\bf u} \cdot {\bf w}) {\bf v} - ({\bf u} \cdot {\bf v}) {\bf w} &(3.1.8.1)\cr ({\bf u} \wedge {\bf v}) \wedge {\bf w} &= - ({\bf v} \cdot {\bf w}) {\bf u} + ({\bf u} \cdot {\bf w}) {\bf v}. &(3.1.8.2)}%(3.1.8.2)]

3.1.8.2. Scalar product of vector products

| top | pdf |

[({\bf u} \wedge {\bf v}) \cdot ({\bf w} \wedge {\bf z}) = ({\bf u} \cdot {\bf w}) ({\bf v} \cdot {\bf z}) - ({\bf u} \cdot {\bf z}) ({\bf v} \cdot {\bf w}). \eqno(3.1.8.3)] A derivation of this result may be found also in Shmueli (1974[link]).

3.1.8.3. Vector product of vector products

| top | pdf |

[\eqalignno{ ({\bf u} \wedge {\bf v}) \wedge ({\bf w} \wedge {\bf z}) &= ({\bf u} \cdot {\bf w} \wedge {\bf z}) {\bf v} - ({\bf v} \cdot {\bf w} \wedge {\bf z}) {\bf u} &(3.1.8.4) \cr ({\bf u} \wedge {\bf v}) \wedge ({\bf w} \wedge {\bf z}) &= ({\bf u} \cdot {\bf v} \wedge {\bf z}) {\bf w} - ({\bf u} \cdot {\bf v} \wedge {\bf w}) {\bf z.} &(3.1.8.5)}%(3.1.8.5)]

3.1.9. Planes

| top | pdf |

Among several ways of characterizing a plane in a general rectilinear coordinate system is a description in terms of the coordinates of three non-collinear points that lie in the plane. If the points are U, V and W, lying at the ends of vectors u, v and w, the vectors [{\bf u} - {\bf v}], [{\bf v} - {\bf w}] and [{\bf w} - {\bf u}] are in the plane. The vector [{\bf z} = ({\bf u} - {\bf v}) \wedge ({\bf v} - {\bf w}) \eqno(3.1.9.1)] is normal to the plane. Expansion of (3.1.9.1)[link] yields [{\bf z} = ({\bf u} \wedge {\bf v}) + ({\bf v} \wedge {\bf w}) + ({\bf w} \wedge {\bf u}). \eqno(3.1.9.2)] Making use of (3.1.7.1)[link], [{\bf z} = \varepsilon_{ijk} (u\hskip 2pt^{j} v^{k} + v\hskip 2pt^{j} w^{k} + w\hskip 2pt^{j} u^{k}) {\bf a}^{i}. \eqno(3.1.9.3)] If now x is any vector from the origin to the plane, [{\bf x} -{\bf u}] is in the plane, and [({\bf x} - {\bf u}) \cdot {\bf z} = 0. \eqno(3.1.9.4)] From (3.1.9.2)[link], [{\bf u} \cdot {\bf z} = {\bf u} \cdot {\bf v} \wedge {\bf w}. \eqno(3.1.9.5)] Rearrangement of (3.1.9.4)[link] with [{\bf x} \cdot {\bf z}] on the left and [{\bf u} \cdot {\bf z}] on the right, and using (3.1.9.3)[link] for z on the left leads to [\varepsilon_{ijk} x^{i} (u\hskip 2pt^{j} v^{k} + v\hskip 2pt^{j} w^{k} + w\hskip 2pt^{j} u^{k}) = \varepsilon_{ijk} u^{i} v\hskip 2pt^{j} w^{k}. \eqno(3.1.9.6)] If, in particular, the points are on the coordinate axes, their designations are [[u^{1}, 0, 0]], [[0, v^{2}, 0]] and [[0, 0, w^{3}]], and (3.1.9.6)[link] becomes [x^{1}/u^{1} + x^{2}/v^{2} + x^{3}/w^{3} = 1, \eqno(3.1.9.7)] which may be written [x^{i} h_{i} = 1 \eqno(3.1.9.8)] or [{\bf x} \cdot {\bf h} = 1 \eqno(3.1.9.9)] in which the vector h has coordinates [{\bf h} = (1/u^{1}, 1/v^{2}, 1/w^{3}). \eqno(3.1.9.10)] That is, the covariant components of h are given by the reciprocals of the intercepts of the plane on the axes. The vector h is normal to the plane it describes (Sands, 1982a[link]) and the length of h is the reciprocal of the distance d of the plane from the origin; i.e., [h = 1/d. \eqno(3.1.9.11)]

If the indices [h_{i}] are relatively prime integers, the theory of numbers tells us that the Diophantine equation (3.1.9.8)[link] has solutions [x^{i}] that are integers. Points whose contravariant components are integers are lattice points, and such a plane passes through an infinite number of lattice points and is called a lattice plane. Thus, the [h_{i}] for lattice planes are the familiar Miller indices of crystallography.

Calculations involving planes become quite manageable when the normal vector h is introduced. Thus, the distance l from a point P with coordinates [p^{i}] to a plane characterized by h is [l = (1 - {\bf p} \cdot {\bf h})/h, \eqno(3.1.9.12)] where a negative sign indicates that the point is on the opposite side of the plane from the origin.

The dihedral angle [\tau] between planes with normals h and [{\bf h}'] is [\tau = \cos^{-1} [-{\bf h} \cdot {\bf h}'/(hh')]. \eqno(3.1.9.13)] A variation of (3.1.9.13)[link] expresses [\tau] in terms of vector u in the first plane, vector w in the second plane, and vector v, the intersection of the planes, as (Shmueli, 1974[link]) [\tau = \cos^{-1} [({\bf u} \wedge {\bf v}) \cdot ({\bf v} \wedge {\bf w})/|{\bf u} \wedge {\bf v}| |{\bf v} \wedge {\bf w}|]. \eqno(3.1.9.14)]

A similar calculation gives angles of torsion. Let [{\bf t}_{h}] and [{\bf u}_{h}] be, respectively, the projections of vectors t and u onto the plane with normal h. [\eqalignno{ {\bf t}_h &= {\bf t} - ({\bf t} \cdot {\bf h}){\bf h}/h^{2} &(3.1.9.15)\cr {\bf u}_h &= {\bf u} - ({\bf u} \cdot {\bf h}){\bf h}/h^{2}. &(3.1.9.16)}%(3.1.9.16)] The angle between [{\bf t}_{h}] and [{\bf u}_{h}] represents a torsion about h (Sands, 1982b[link]). Another approach to the torsion angle, which gives equivalent results (Shmueli, 1974[link]), is to compute the angle between [{\bf t} \wedge {\bf h}] and [{\bf u} \wedge {\bf h}] using (3.1.8.3)[link].

3.1.10. Variance–covariance matrices

| top | pdf |

Refinement of a crystal structure yields both the parameters that describe the structure and estimates of the uncertainties of those parameters. Refinement by the method of least squares minimizes a weighted sum of squares of residuals. In the matrix notation of Hamilton's classic book (Hamilton, 1964[link]), values of the m parameters to be determined are expressed by the [m \times 1] column vector X given by [{\bi X} = ({\bi A}^{T} {\bi{PA}})^{-1} {\bi A}^{T} {\bi PF}, \eqno(3.1.10.1)] where F is an [n \times 1] matrix representing the observations (structure factors or squares of structure factors), P is an [n \times n] weight matrix that is proportional to the variance–covariance matrix of the observed F, A is an [n \times m] design matrix consisting of the derivatives of each element of F with respect to each of the parameters and [{\bi A}^{T}] is the transpose of A. The variance–covariance matrix of the parameters is then given by [{\bi M} = {\bi V}^{T} {\bi PV} ({\bi A}^{T} {\bi PA})^{-1}/(n - m). \eqno(3.1.10.2)] Here, V is the [n \times 1] matrix of residuals, consisting of the differences between the observed and calculated values of the elements of F. Since [{\bi V}^{T}{\bi PV}/(n - m)] is just a single number, M is proportional to the inverse least-squares matrix [({\bi A}^{T}{\bi PA})^{-1}].

Once the variance–covariance matrix of the parameters is known, the variances and covariances of any quantities derived from these parameters can be computed. The variance of a single function f is given by [\sigma^{2}(\hskip 2ptf) = {\partial f\over \partial x^{i}} {\partial f\over \partial x\hskip 2pt^{j}} \hbox{cov} (x^{i}, x\hskip 2pt^{j}), \eqno(3.1.10.3)] where, as usual, we are using the summation convention and summing over all parameters included in f. A generalization of (3.1.10.3)[link] for two functions is [\hbox{cov} (\hskip 2ptf_{1}, f_{2}) = {\partial f_{1}\over \partial x^{i}} {\partial f_{2}\over \partial x\hskip 2pt^{j}} \hbox{cov} (x^{i}, x\hskip 2pt^{j}). \eqno(3.1.10.4)] [The covariance of two quantities is, of course, just the variance if the two quantities are the same. For an elementary discussion of statistical covariance and correlation, see Sands (1977)[link].] Equation (3.1.10.4)[link] may now be extended to any number of functions (Sands, 1966[link]); the [k \times k] variance–covariance matrix C of k functions of m parameters is given in terms of the [m \times m] variance–covariance matrix of the parameters by [{\bi C} = {\bi DMD}^{T}, \eqno(3.1.10.5)] in which the ijth element of the [k \times m] matrix D is the derivative of function [f_{i}] with respect to parameter j. Element [C_{II}] (no summation implied over I) is the variance of function [f_{I}], and [C_{IJ}] is the covariance of functions [f_{I}] and [f_{J}].

The calculation of C must, of course, include the contributions of all sources of error, so M in (3.1.10.5)[link] should include the variances and covariances of the unit-cell dimensions and of any other relevant parameters with non-negligible uncertainties.

It may be easier, in some cases, to carry out calculations of variances and covariances in steps. For example, the variance–covariance matrix of a set of distances may be computed and then other quantities may be determined as functions of the distances. It is imperative that all non-vanishing covariances be included in every stage of the calculation; only in special cases are the covariances negligible, and often they are large enough to affect the results seriously (Sands, 1977[link]).

These principles may be used to explore the effects of symmetry or of transformations on the variance–covariance matrices of atomic parameters and derived quantities. Using the notation of Sands (1966[link]), with [x_{A}^{i}] and [x_{B}^{i}] the positional parameters i of atoms A and B, respectively, we define [{\bi M}_{AA}, {\bi M}_{AB}, {\bi M}_{BA}] and [{\bi M}_{BB}] as [3 \times 3] matrices with ijth elements [\hbox{cov} (x_{A}^{i}, x_{A}^{\;j})], [\hbox{cov} (x_{A}^{i}, x_{B}^{\;j})], [\hbox{cov} (x_{B}^{i}, x_{A}^{\;j})] and [\hbox{cov} (x_{B}^{i}, x_{B}^{\;j})], respectively. If atom [B'] is generated from atom B by symmetry operator S, such that [\eqalignno{ {\bf x}_{B'} &= {\bi S}{\bf x}_{B} &(3.1.10.6)\cr x_{B'}^{i} &= {S}_{j}^{i} {x}\hskip 2pt^{j}_{B}, &(3.1.10.7)}%(3.1.10.7)] it is shown in Sands (1966[link]) that the variance–covariance matrices involving atom [{B}'] are [\eqalignno{ {\bi M}_{AB'} &= {\bi M}_{AB} {\bi S}^{T} &(3.1.10.8)\cr {\bi M}_{B'A} &= {\bi SM}_{BA} &(3.1.10.9)\cr {\bi M}_{B'B'} &= {\bi SM}_{BB} {\bi S}^{T}. & (3.1.10.10)}] If symmetry operator S is applied to both atoms A and B to generate atoms [{A}'] and [{B}'], the corresponding matrices may be expressed by the matrix equation [\pmatrix{{\bi M}_{A'A'} &{\bi M}_{A'B'}\cr {\bi M}_{B'A'} &{\bi M}_{B'B'}\cr} = \pmatrix{{\bi SM}_{AA} {\bi S}^{T} &{\bi SM}_{AB} {\bi S}^{T}\cr {\bi SM}_{BA} {\bi S}^{T} &{\bi SM}_{BB} {\bi S}^{T}\cr}. \eqno(3.1.10.11)]

If G is a matrix that transforms to a new set of axes, [{\bf a}' = {\bi G} {\bf a}, \eqno(3.1.10.12)] the transformed variance–covariance matrix of the atomic parameters is [{\bi M}' = ({\bi G^{T}})^{-1} {\bi MG}^{-1}. \eqno(3.1.10.13)]

To apply these formulae to calculations of the errors and covariances of interatomic distances and angles, consider the triangle of atoms A, B, C with edges [l_{1} = AB], [l_{2} = BC], [l_{3} = CA], and angles [\alpha_{1}], [\alpha_{2}], [\alpha_{3}] at A, B, C, respectively. If the atoms are not related by symmetry, [\eqalignno{\sigma^{2}(l_{1})& = {\bi l}_{1}^{T} {\bf g} ({\bi M}_{AA} - {\bi M}_{AB} - {\bi M}_{BA} + {\bi M}_{BB}) {\bf g}{\bi l}_{1}/l_{1}^{2} &(3.1.10.14)\cr \hbox{cov} (l_{1}, l_{2})& = {\bi l}_{1}^{T} {\bf g} ({\bi M}_{AB} - {\bi M}_{AC} - {\bi M}_{BB} + {\bi M}_{BC}) {\bf g}{\bi l}_{2}/l_{1} l_{2}.&(3.1.10.15)\cr}] If atom B is generated from atom A by symmetry matrix S, the results, as derived in Sands (1966[link]), are [\eqalignno{ \sigma^{2}(l_{1}) &= {\bi l}_{1}^{T} {\bf g} ({\bi M}_{AA} - {\bi SM}_{AA} - {\bi M}_{AA} {\bi S}^{T} \cr &\quad + {\bi SM}_{AA} {\bi S}^{T}) {\bf g} {\bi l}_{1}/l_{1}^{2} &(3.1.10.16)\cr \sigma^{2}(l_{2}) &= {\bi l}_{2}^{T} {\bf g} ({\bi SM}_{AA} {\bi S}^{T} - {\bi M}_{AC} {\bi S}^{T} \cr &\quad - {\bi SM}_{AC} + {\bi M}_{CC}) {\bf g} {\bi l}_{2}/l_{2}^{2} &(3.1.10.17) \cr \sigma^{2}(l_{3}) &= {\bi l}_{3}^{T} {\bf g} ({\bi M}_{AA} - {\bi M}_{AC} - {\bi M}_{CA} \cr &\quad + {\bi M}_{CC}) {\bf g} {\bi l}_{3}/l_{3}^{2} &(3.1.10.18)\cr \hbox{cov} (l_{1}, l_{2}) &= {\bi l}_{1}^{T} {\bf g} ({\bi M}_{AA} {\bi S}^{T} - {\bi SM}_{AA} {\bi S}^{T} \cr &\quad - {\bi M}_{AC} + {\bi SM}_{AC}) {\bf g} {\bi l}_{2}/l_{1}l_{2} &(3.1.10.19) \cr \hbox{cov} (l_{1}, l_{3}) &= {\bi l}_{1}^{T} {\bf g} (- {\bi M}_{AA} + {\bi SM}_{AA} \cr &\quad + {\bi M}_{AC} - {\bi SM}_{AC}) {\bf g} {\bi l}_{3}/l_{1}l_{3} &(3.1.10.20) \cr \hbox{cov} (l_{2}, l_{3}) &= {\bi l}_{2}^{T} {\bf g} (- {\bi SM}_{AA} + {\bi M}_{CA} \cr &\quad + {\bi SM}_{AC} - {\bi M}_{CC}) {\bf g} {\bi l}_{3}/l_{2}l_{3}. &(3.1.10.21)}%(3.1.10.21)] In equations (3.1.10.14)[link]–(3.1.10.21)[link], [{\bi l}_{i}] is a column vector with components the differences of the coordinates of the atoms connected by the vector. Representative formulae involving the angles [\alpha_{1}], [\alpha_{2}], [\alpha_{3}] are [\eqalignno{ \sigma^{2}(\alpha_{1}) &= [\cos^{2} \alpha_{2}\sigma^{2} (l_{1}) - 2 \cos \alpha_{2} \hbox{ cov} (l_{1}, l_{2}) \cr &\quad + 2 \cos \alpha_{2} \cos \alpha_{3} \hbox{ cov} (l_{1}, l_{3}) + \sigma^{2} (l_{2}) \cr &\quad - 2 \cos \alpha_{3} \hbox{ cov} (l_{2}, l_{3}) \cr &\quad + \cos^{2} \alpha_{3}\sigma^{2} (l_{3})] (l_{2}/l_{1}l_{3} \sin \alpha_{1})^{2} &(3.1.10.22)\cr \hbox{cov} (\alpha_{1}, \alpha_{2}) &= [\cos \alpha_{1} \cos \alpha_{2} \sigma^{2} (l_{1}) \cr &\quad + (\cos \alpha_{2} \cos \alpha_{3} - \cos \alpha_{1}) \hbox{ cov} (l_{1}, l_{2}) \cr &\quad + (\cos \alpha_{1} \cos \alpha_{3} - \cos \alpha_{2}) \hbox{ cov} (l_{1}, l_{3}) \cr &\quad - \cos \alpha_{3} \sigma^{2} (l_{2}) + (1 + \cos^{2} \alpha_{3}) \hbox{ cov} (l_{2}, l_{3}) \cr &\quad - \cos \alpha_{3} \sigma^{2} (l_{3})] / (l_{1}^{2} \sin \alpha_{1} \sin \alpha_{2}) &(3.1.10.23) \cr \hbox{cov} (\alpha_{1}, l_{1}) &= [- \cos \alpha_{2} \sigma^{2} (l_{1}) + \hbox{cov} (l_{1}, l_{2}) \cr &\quad - \cos \alpha_{3} \hbox{ cov} (l_{1}, l_{3})] (l_{2}/l_{1} l_{3} \sin \alpha_{1}) &(3.1.10.24) \cr \hbox{cov} (\alpha_{1}, l_{2}) &= [- \cos \alpha_{2} \hbox{ cov} (l_{1}, l_{2}) + \sigma^{2} (l_{2}) \cr &\quad - \cos \alpha_{3} \hbox{ cov} (l_{2}, l_{3})] (l_{2}/l_{1} l_{3} \sin \alpha_{1}). &(3.1.10.25)}%(3.1.10.25)] If any of the angles approach [] or [180^{\circ}], the denominators in (3.1.10.22)–(3.1.10.25)[link] will become very small, necessitating high-precision arithmetic. Indeterminacies resulting from special relationships between atomic positions may require rederivation of the equations for variances and covariances, to take the relationships into account explicitly and avoid the indeterminacies. A true symmetry condition requiring, for example, a linear bond should cause little problem, and the corresponding variance will be zero. It is the indeterminacies not originating from crystal symmetry that demand caution, in recognizing them and in coping with them correctly.

A general expression for the variance of a dihedral angle, in terms of the variances and covariances of the coordinates of the four atoms, is (Shmueli, 1974[link]) [\sigma^{2} (\tau) = {\displaystyle\sum\limits_{k}} {\displaystyle\sum\limits_{n}} {\partial \tau\over \partial x_{(k)}^{i}} {\partial \tau\over \partial x\hskip 1pt_{(n)}^{\;j}} \hbox{cov} [x_{(k)}^{i}, x\hskip 2pt_{(n)}^{\;j}], \eqno(3.1.10.26)] where, in addition to the usual tensor summation over i and j from 1 to 3, summation must be carried out over the four atoms (i.e., k and n vary from 1 to 4). Special cases of (3.1.10.26)[link], corresponding to various levels of approximation of diagonal matrices and isotropic errors, are given in Shmueli (1974[link]). Formulae in dyadic notation are given in Waser (1973)[link] for the variances and covariances of dihedral angles, of best planes, of torsion angles, and of other molecular parameters.

3.1.11. Mean values

| top | pdf |

The weighted mean of a set of quantities [X_{i}] is [\langle X \rangle = {\textstyle\sum} w_{i} X_{i} / {\textstyle\sum} w_{i}, \eqno(3.1.11.1)] where the weights are typically chosen to minimize the variance of [\langle X \rangle]. The variance may be computed from the variance–covariance matrix M of the [X_{i}] by [\sigma^{2} (\langle X \rangle) = {\bf w}^{T} {\bi M} {\bf w} / ({\textstyle\sum} w_{i})^{2}. \eqno(3.1.11.2)] Minimization of [\sigma^{2} (\langle X \rangle)] leads to weights given by [{\bf w} = {\bi M}^{-1} {\bf v}, \eqno(3.1.11.3)] where the components of vector v are all equal ([v_{i} = v_{j}] for all i and j); since (3.1.11.1)[link] and (3.1.11.2)[link] require only relative weights, we can assign [v_{i} = 1] for all i. Placing these weights in (3.1.11.2)[link] yields [\sigma^{2} (\langle X \rangle) = 1 / {\textstyle\sum} w_{i}. \eqno(3.1.11.4)] For the case of uncorrelated [X_{i}], the weights are inversely proportional to the corresponding variances [w_{i} = 1/\sigma^{2} (X_{i}). \eqno(3.1.11.5)] For the case of two correlated variables, [w_{i} = 1 / [\sigma^{2} (X_{i}) - \hbox{cov} (X_{1}, X_{2})]. \eqno(3.1.11.6)] Derivation and discussion of these equations may be found in Sands (1966[link], 1982b[link]).

The presence of systematic errors in the experimental data often results in these formulae producing estimates of the standard uncertainties of molecular dimensions that are too small; it has been suggested that such error estimates should be multiplied by 1.5 to make them more realistic (Taylor & Kennard, 1983[link]). It is essential also that averages be computed only of similar quantities, and interatomic distances corresponding to different bond orders or different environments may not represent the same physical quantities; that is, there are reasons for the discrepancies, and averaging may obscure important information. Another source of error in molecular geometry parameters determined from crystallographic measurements is thermal motion, and distances should be corrected for such effects before making comparisons (Busing & Levy, 1964[link]; Johnson, 1970[link], 1980[link]).

A discussion of the appropriateness of weighted and unweighted means may be found in Taylor & Kennard (1985[link]), which suggests that the unweighted mean might even be preferable if environmental effects are large.

3.1.12. Computation

| top | pdf |

It has been mentioned that the tensor formulation used in this chapter is particularly amenable to machine computation. As a simple illustration of this point, the following Fortran program will compute the lengths of vectors X and Y and the angle between them.[\eqalign{&\hbox{DIMENSION X(3),Y(3),G(3,3),SUM(3)}\cr &\hbox{READ (5,10)(X(I),I = 1,3)}\cr &\hbox{READ (5,10)(Y(I),I = 1,3)}\cr &\hbox{READ (5,10)((G(I,J),J = 1,3),I = 1,3)}\cr 10\; & \hbox{FORMAT (3F10.5)}\cr &\hbox{DO 20 I = 1,3}\cr 20\; &\hbox{SUM(I)} = 0\cr &\hbox{DO 30 I = 1,3}\cr & \hbox{DO 30 J = 1,3}\cr &\hbox{SUM(1)} = \hbox{SUM(1) + X(I) }\ast\hbox{ X(J) }\ast\hbox{ G(I,J)}\cr & \hbox{SUM(2)} = \hbox{SUM(2) + Y(I) }\ast\hbox{ Y(J) }\ast\hbox{ G(I,J)}\cr &\hbox{SUM(3)} = \hbox{SUM(3) + X(I) }\ast\hbox{ Y(J) }\ast\hbox{ G(I,J)}\cr 30\;&\hbox{CONTINUE}\cr &\hbox{DIST1} = \hbox{SQRT(SUM(1))}\cr &\hbox{DIST2} = \hbox{SQRT(SUM(2))}\cr &\hbox{ANGLE} = \hbox{57.296 }\ast\hbox{ ACOS(SUM(3)/(DIST1 }\ast\hbox{ DIST2))}\cr &\hbox{WRITE (6,10) DIST1,DIST2,ANGLE}\cr &\hbox{END}}]

References

Busing, W. R. & Levy, H. A. (1964). Effect of thermal motion on the estimation of bond lengths. Acta Cryst. 17, 142–146.
Hamilton, W. C. (1964). Statistics in physical science. New York: Ronald Press.
Johnson, C. K. (1970). The effect of thermal motion on interatomic distances and angles. In Crystallographic computing, edited by F. R. Ahmed, pp. 220–226. Copenhagen: Munksgaard.
Johnson, C. K. (1980). Thermal motion analysis. In Computing in crystallography, edited by R. Diamond, S. Ramaseshan & K. Venkatesan, pp. 14.01–14.19. Bangalore: Indian Academy of Sciences.
Sands, D. E. (1966). Transformations of variance–covariance tensors. Acta Cryst. 21, 868–872.
Sands, D. E. (1977). Correlation and covariance. J. Chem. Educ. 54, 90–94.
Sands, D. E. (1982a). Vectors and tensors in crystallography. Reading: Addison Wesley. Reprinted (1995) Dover Publications.
Sands, D. E. (1982b). Molecular geometry. In Computational crystallography, edited by D. Sayre, pp. 421–429. Oxford: Clarendon Press.
Shmueli, U. (1974). On the standard deviation of a dihedral angle. Acta Cryst. A30, 848–849.
Taylor, R. & Kennard, O. (1983). The estimation of average molecular dimensions from crystallographic data. Acta Cryst. B39, 517–525.
Taylor, R. & Kennard, O. (1985). The estimation of average molecular dimensions. 2. Hypothesis testing with weighted and unweighted means. Acta Cryst. A41, 85–89.
Waser, J. (1973). Dyadics and the variances and covariances of molecular parameters, including those of best planes. Acta Cryst. A29, 621–631.








































to end of page
to top of page