InternationalReciprocal spaceTables for Crystallography Volume B Edited by U. Shumeli © International Union of Crystallography 2006 |
International Tables for Crystallography (2006). Vol. B, ch. 3.1, pp. 348-352
https://doi.org/10.1107/97809553602060000559 ## Chapter 3.1. Distances, angles, and their standard uncertainties
Methods for calculating distances and angles and the standard uncertainties in these quantities using the techniques of tensor analysis are described. A Fortran program showing how this tensor formulation can be adapted to computer languages is also given. |

A crystal structure analysis provides information from which it is possible to compute distances between atoms, angles between interatomic vectors, and the uncertainties in these quantities. In Cartesian coordinate systems, these geometric computations require the Pythagorean theorem and elementary trigonometry. The natural coordinate systems of crystals, though, are determined by symmetry, and only in special cases are the basis vectors (or coordinate axes) of these systems constrained to be of equal lengths or mutually perpendicular.

It is possible, of course, to transform the positional parameters of the atoms to a Cartesian system and perform the subsequent calculations with the transformed coordinates. Along with the coordinates, the transformations must be applied to anisotropic thermal factors, variance–covariance matrices and other important quantities. Moreover, leaving the natural coordinate system of the crystal sacrifices the simplified relationships imposed by translational and point symmetry; for example, if an atom has fractional coordinates , , , an equivalent atom will be at , , , *etc.*

Fortunately, formulation of the calculations in generalized rectilinear coordinate systems is straightforward, and readily adapted to computer languages (Section 3.1.12 illustrates the use of Fortran for such calculations). The techniques for these computations are those of tensor analysis, which provides a compact and elegant notation. While an effort will be made to be self-sufficient in this chapter, some proficiency in vector algebra is assumed, and the reader not familiar with the basics of tensor analysis should refer to Chapter 1.1
and Sands (1982*a*).

The scalar product of vectors **u** and **v** is defined as where *u* and *v* are the lengths of the vectors and is the angle between them. In terms of components, In all equations in this chapter, the convention is followed that summation is implied over an index that is repeated once as a subscript and once as a superscript in an expression; thus, the right-hand side of (3.1.2.4) implies the sum of nine terms The in (3.1.2.4) are the components of the metric tensor [see Chapter 1.1
and Sands (1982*a*)] Subscripts are used for quantities that transform the same way as the basis vectors ; such quantities are said to transform covariantly. Superscripts denote quantities that transform the same way as coordinates ; these quantities are said to transform contravariantly (Sands, 1982*a*).

Equation (3.1.2.4) is in a form convenient for computer evaluation, with indices *i* and *j* taking successively all values from 1 to 3. The matrix form of (3.1.2.4) is useful both for symbolic manipulation and for computation, where the superscript italic *T* following a matrix symbol indicates a transpose. Written out in full, (3.1.2.6) is If ** u** is the column vector with components , is the corresponding row vector shown in (3.1.2.7).

By (3.1.2.1), the scalar product of a vector with itself is The length of **v** is, therefore, given by Computation of lengths in a generalized rectilinear coordinate system is thus simply a matter of evaluating the double summation and taking the square root.

By (3.1.2.1) and (3.1.2.4), the angle between vectors **u** and **v** is given by An even more concise expression of equations such as (3.1.4.1) is possible by making use of the ability of the metric tensor ** g** to convert components from contravariant to covariant (Sands, 1982

*a*). Thus, and (3.1.2.4) may be written succinctly as or With this notation, the angle calculation of (3.1.4.1) becomes The summations in (3.1.4.3), (3.1.4.4) and (3.1.4.5) include only three terms, and are thus equivalent in numerical effort to the computation in a Cartesian system, in which the metric tensor is represented by the unit matrix and there is no numerical distinction between covariant components and contravariant components.

Appreciation of the elegance of tensor formulations may be enhanced by noting that corresponding to the metric tensor **g** with components there is a contravariant metric tensor with components The are contravariant basis vectors, known to crystallographers as reciprocal axes. Expressions parallel to (3.1.4.2) may be written, in which plays the role of converting covariant components to contravariant components. These tensors thus express mathematically the crystallographic notions of crystal space and reciprocal space [see Chapter 1.1
and Sands (1982*a*)].

The scalar product defined in Section 3.1.2 is one multiplicative operation of two vectors that may be defined; another is the vector product, which is denoted as (or or [**uv**]). The vector product of vectors **u** and **v** is defined as a vector of length , where is the angle between the vectors, and of direction perpendicular to both **u** and **v** in the sense that **u**, **v** and form a right-handed system; is generated by rotating **u** into **v** and advancing in the direction of a right-handed screw. The magnitude of , given by is equal to the area of the parallelogram defined by **u** and **v**.

Many relationships involving vector products may be expressed compactly and conveniently in terms of the permutation tensors, defined as Since represents the volume of the parallelepiped defined by vectors , it follows that vanishes if any two indices are equal to each other. The same argument applies, of course, to . That is, If the indices are all different, for even permutations of *ijk* (123, 231, or 312), and for odd permutations (132, 213, or 321). Here, for right-handed axes, for left-handed axes, *V* is the unit-cell volume, and is the volume of the reciprocal cell defined by the reciprocal basis vectors .

A discussion of the properties of the permutation tensors may be found in Sands (1982*a*). In right-handed Cartesian systems, where , and , the permutation tensors are equivalent to the permutation symbols denoted by .

As is shown in Sands (1982*a*), the components of the vector product are given by where again is a reciprocal basis vector (some writers use to represent the reciprocal axes). A special case of (3.1.7.1) is which may be taken as a defining equation for the reciprocal basis vectors. Similarly, which completes the characterization of the dual vector system with basis vectors and obeying In (3.1.7.4), is the Kronecker delta, which equals 1 if , 0 if . The relationships between these quantities are explored at some length in Sands (1982*a*).

The results developed above lead to several useful relationships between vectors; for derivations, see Sands (1982*a*).

Among several ways of characterizing a plane in a general rectilinear coordinate system is a description in terms of the coordinates of three non-collinear points that lie in the plane. If the points are *U*, *V* and *W*, lying at the ends of vectors **u**, **v** and **w**, the vectors , and are in the plane. The vector is normal to the plane. Expansion of (3.1.9.1) yields Making use of (3.1.7.1), If now **x** is any vector from the origin to the plane, is in the plane, and From (3.1.9.2), Rearrangement of (3.1.9.4) with on the left and on the right, and using (3.1.9.3) for **z** on the left leads to If, in particular, the points are on the coordinate axes, their designations are , and , and (3.1.9.6) becomes which may be written or in which the vector **h** has coordinates That is, the covariant components of **h** are given by the reciprocals of the intercepts of the plane on the axes. The vector **h** is normal to the plane it describes (Sands, 1982*a*) and the length of **h** is the reciprocal of the distance *d* of the plane from the origin; *i.e.,*

If the indices are relatively prime integers, the theory of numbers tells us that the Diophantine equation (3.1.9.8) has solutions that are integers. Points whose contravariant components are integers are lattice points, and such a plane passes through an infinite number of lattice points and is called a *lattice plane.* Thus, the for lattice planes are the familiar Miller indices of crystallography.

Calculations involving planes become quite manageable when the normal vector **h** is introduced. Thus, the distance *l* from a point *P* with coordinates to a plane characterized by **h** is where a negative sign indicates that the point is on the opposite side of the plane from the origin.

The dihedral angle between planes with normals **h** and is A variation of (3.1.9.13) expresses in terms of vector **u** in the first plane, vector **w** in the second plane, and vector **v**, the intersection of the planes, as (Shmueli, 1974)

A similar calculation gives angles of torsion. Let and be, respectively, the projections of vectors **t** and **u** onto the plane with normal **h**. The angle between and represents a torsion about **h** (Sands, 1982*b*). Another approach to the torsion angle, which gives equivalent results (Shmueli, 1974), is to compute the angle between and using (3.1.8.3).

Refinement of a crystal structure yields both the parameters that describe the structure and estimates of the uncertainties of those parameters. Refinement by the method of least squares minimizes a weighted sum of squares of residuals. In the matrix notation of Hamilton's classic book (Hamilton, 1964), values of the *m* parameters to be determined are expressed by the column vector ** X** given by where

**is an matrix representing the observations (structure factors or squares of structure factors),**

*F***is an weight matrix that is proportional to the variance–covariance matrix of the observed**

*P***,**

*F***is an design matrix consisting of the derivatives of each element of**

*A***with respect to each of the parameters and is the transpose of**

*F***. The variance–covariance matrix of the parameters is then given by Here,**

*A***is the matrix of residuals, consisting of the differences between the observed and calculated values of the elements of**

*V***. Since is just a single number,**

*F***is proportional to the inverse least-squares matrix .**

*M*Once the variance–covariance matrix of the parameters is known, the variances and covariances of any quantities derived from these parameters can be computed. The variance of a single function *f* is given by where, as usual, we are using the summation convention and summing over all parameters included in *f*. A generalization of (3.1.10.3) for two functions is [The covariance of two quantities is, of course, just the variance if the two quantities are the same. For an elementary discussion of statistical covariance and correlation, see Sands (1977).] Equation (3.1.10.4) may now be extended to any number of functions (Sands, 1966); the variance–covariance matrix ** C** of

*k*functions of

*m*parameters is given in terms of the variance–covariance matrix of the parameters by in which the

*ij*th element of the matrix

**is the derivative of function with respect to parameter**

*D**j*. Element (no summation implied over

*I*) is the variance of function , and is the covariance of functions and .

The calculation of ** C** must, of course, include the contributions of all sources of error, so

**in (3.1.10.5) should include the variances and covariances of the unit-cell dimensions and of any other relevant parameters with non-negligible uncertainties.**

*M*It may be easier, in some cases, to carry out calculations of variances and covariances in steps. For example, the variance–covariance matrix of a set of distances may be computed and then other quantities may be determined as functions of the distances. It is imperative that all non-vanishing covariances be included in every stage of the calculation; only in special cases are the covariances negligible, and often they are large enough to affect the results seriously (Sands, 1977).

These principles may be used to explore the effects of symmetry or of transformations on the variance–covariance matrices of atomic parameters and derived quantities. Using the notation of Sands (1966), with and the positional parameters *i* of atoms *A* and *B*, respectively, we define and as matrices with *ij*th elements , , and , respectively. If atom is generated from atom *B* by symmetry operator ** S**, such that it is shown in Sands (1966) that the variance–covariance matrices involving atom are If symmetry operator

**is applied to both atoms**

*S**A*and

*B*to generate atoms and , the corresponding matrices may be expressed by the matrix equation

If ** G** is a matrix that transforms to a new set of axes, the transformed variance–covariance matrix of the atomic parameters is

To apply these formulae to calculations of the errors and covariances of interatomic distances and angles, consider the triangle of atoms *A*, *B*, *C* with edges , , , and angles , , at *A*, *B*, *C*, respectively. If the atoms are not related by symmetry, If atom *B* is generated from atom *A* by symmetry matrix ** S**, the results, as derived in Sands (1966), are In equations (3.1.10.14)–(3.1.10.21), is a column vector with components the differences of the coordinates of the atoms connected by the vector. Representative formulae involving the angles , , are If any of the angles approach or , the denominators in (3.1.10.22)–(3.1.10.25) will become very small, necessitating high-precision arithmetic. Indeterminacies resulting from special relationships between atomic positions may require rederivation of the equations for variances and covariances, to take the relationships into account explicitly and avoid the indeterminacies. A true symmetry condition requiring, for example, a linear bond should cause little problem, and the corresponding variance will be zero. It is the indeterminacies not originating from crystal symmetry that demand caution, in recognizing them and in coping with them correctly.

A general expression for the variance of a dihedral angle, in terms of the variances and covariances of the coordinates of the four atoms, is (Shmueli, 1974) where, in addition to the usual tensor summation over *i* and *j* from 1 to 3, summation must be carried out over the four atoms (*i.e.*, *k* and *n* vary from 1 to 4). Special cases of (3.1.10.26), corresponding to various levels of approximation of diagonal matrices and isotropic errors, are given in Shmueli (1974). Formulae in dyadic notation are given in Waser (1973) for the variances and covariances of dihedral angles, of best planes, of torsion angles, and of other molecular parameters.

The weighted mean of a set of quantities is where the weights are typically chosen to minimize the variance of . The variance may be computed from the variance–covariance matrix ** M** of the by Minimization of leads to weights given by where the components of vector

**v**are all equal ( for all

*i*and

*j*); since (3.1.11.1) and (3.1.11.2) require only relative weights, we can assign for all

*i*. Placing these weights in (3.1.11.2) yields For the case of uncorrelated , the weights are inversely proportional to the corresponding variances For the case of two correlated variables, Derivation and discussion of these equations may be found in Sands (1966, 1982

*b*).

The presence of systematic errors in the experimental data often results in these formulae producing estimates of the standard uncertainties of molecular dimensions that are too small; it has been suggested that such error estimates should be multiplied by 1.5 to make them more realistic (Taylor & Kennard, 1983). It is essential also that averages be computed only of similar quantities, and interatomic distances corresponding to different bond orders or different environments may not represent the same physical quantities; that is, there are reasons for the discrepancies, and averaging may obscure important information. Another source of error in molecular geometry parameters determined from crystallographic measurements is thermal motion, and distances should be corrected for such effects before making comparisons (Busing & Levy, 1964; Johnson, 1970, 1980).

A discussion of the appropriateness of weighted and unweighted means may be found in Taylor & Kennard (1985), which suggests that the unweighted mean might even be preferable if environmental effects are large.

### References

Busing, W. R. & Levy, H. A. (1964).*Effect of thermal motion on the estimation of bond lengths. Acta Cryst.*

**17**, 142–146.

Hamilton, W. C. (1964).

*Statistics in physical science*. New York: Ronald Press.

Johnson, C. K. (1970).

*The effect of thermal motion on interatomic distances and angles*. In

*Crystallographic computing*, edited by F. R. Ahmed, pp. 220–226. Copenhagen: Munksgaard.

Johnson, C. K. (1980).

*Thermal motion analysis*. In

*Computing in crystallography*, edited by R. Diamond, S. Ramaseshan & K. Venkatesan, pp. 14.01–14.19. Bangalore: Indian Academy of Sciences.

Sands, D. E. (1966).

*Transformations of variance–covariance tensors. Acta Cryst.*

**21**, 868–872.

Sands, D. E. (1977).

*Correlation and covariance. J. Chem. Educ.*

**54**, 90–94.

Sands, D. E. (1982

*a*).

*Vectors and tensors in crystallography*. Reading: Addison Wesley. Reprinted (1995) Dover Publications.

Sands, D. E. (1982

*b*).

*Molecular geometry*. In

*Computational crystallography*, edited by D. Sayre, pp. 421–429. Oxford: Clarendon Press.

Shmueli, U. (1974).

*On the standard deviation of a dihedral angle. Acta Cryst.*A

**30**, 848–849.

Taylor, R. & Kennard, O. (1983).

*The estimation of average molecular dimensions from crystallographic data. Acta Cryst.*B

**39**, 517–525.

Taylor, R. & Kennard, O. (1985).

*The estimation of average molecular dimensions. 2. Hypothesis testing with weighted and unweighted means. Acta Cryst.*A

**41**, 85–89.

Waser, J. (1973).

*Dyadics and the variances and covariances of molecular parameters, including those of best planes. Acta Cryst.*A

**29**, 621–631.