Radial Distribution Functions

Quick Start

KB science is intuitive because it is based on the RDF - the graph of how many of molecule i are surrounding molecule j, where, as in this case, j may be the same as i, indicating how much the molecule likes to be around itself. If the RDF (shown as g_ij) is large, that means that molecules like being together. In fact we have to take into account the available volume at a given distance, giving us the desired G_ij values. Here we use a very simple RDF, though it involves a bunch of rather meaningless parameters. It doesn't matter what they mean, what matters is the overall shape which your own intuition will tell you is "this molecule likes being with itself" or not.

Using this simple case will allow you to imagine what would happen with the RDF of real molecules and how that would give a large, positive G_ij (the molecules are happy together) or a negative one (unhappy).

Later in the text we see that G_ij calculations from RDFs are identical to seemingly different calculations found in the Fluctuations link which explains in more detail the angled bracket 〈δxδx〉 way of looking at things.

One tip. If both molecules are large then they can't get too close, so their G_ij values will be lower. This is the "Excluded Volume" effect. It sounds trivial, yet it is one of the more powerful solubility effects you will meet. The joy of KB is that Excluded Volume is so obvious; in other solubility schemes it is very confusing and obscure.

Radial Distribution Functions

g(d)

Radius

Cut-off

In molecular thermodynamics, the Radial Distribution Function, RDF, tells you a lot about what is going on in the solution. If we could only calculate them accurately, most of our solubility problems would disappear. Unfortunately, in general we cannot calculate them and we certainly cannot calculate them accurately. We will see why this second point is important.

The RDF counts how many molecules of any given pair (in this case it is a pure liquid so it is molecule 1 from molecule 1) are at any given distance, and compares it to the average distance over the bulk. Another way of saying the same thing is that the RDF, g_ij at distance r is given by:

g_ij(r) = ρ_ij(r)/ρ_ij⁰

where ρ_ij(r) is the number density at distance r and ρ_ij⁰ is the bulk density.

The standard liquid used to show an RDF is "Lennard-Jonesium", made up of Lennard-Jones particles. A convenient parameterised model for such RDFs comes from Matteoli and Mansoori¹ where the 6 parameters each affect a different aspect of the RDF. RDFs tend to be plotted with respect to the radius of the particle, R. Here the plot is to absolute R, though the default value of R is assumed to be 1. As this is an illustration, the units are not specified.

The link to thermodynamics is via the Kirkwood-Buff integral, G given, for a given radius r, by:

G_ij = 4π∫(g_ij-1)r²dr

where the 4πr² captures the surface area of each "shell" at a given r. The integral is from 0 to infinity, but it is generally hoped that the value stabilises at a more realistic value such as r=10 in the default settings. The logic behind the calculation is discussed below.

It is the 4πr² which causes a lot of problems for thermodynamics. The default values show a g_ij which oscillates briefly then flat-lines uninterestingly up to the cut-off radius of 10. Because of the amplification by 4πr², the G_ij does not level off so quickly and it takes much longer for the integral to stabilise. Worse than that, trivially small changes (to the human eye) in g_ij can lead to large changes in the asymptote value of G_ij. Try, for example, adjusting θ. This controls the exponential function to the left of the first peak. With perfect hard spheres the peak has an abrupt cut-off at the sphere radius, but real molecules have some fuzziness at this point. It is usually hard to see much happen when θ changes, but the G_ij value changes significantly. In other words, calculated thermodynamic values (which come from G_ij) are exquisitely sensitive to small changes in g_ij. So if molecular dynamics are used to calculate the RDF, small changes in the force fields can lead to large changes in the computed thermodynamic values, especially if the integral is ended at some "reasonable" value which (try adjusting the Cut-off value) turns out to be too soon.

Fluctuation Theory

As explained in the Fluctuations app we can calculate G₁₁ via the slightly simplified formula that uses the angled bracket nomenclature for averages, such as 〈δN₁δN₁〉:

`G_(11) ~ ("〈"δN_1δN_1"〉")/("〈"N_1"〉""〈"N_1"〉"`

where

`"〈"δN_1δN_1"〉" = (N_1-"〈"N_1"〉")(N_1-"〈"N_1"〉")`

This looks both mysterious and unhelpful. In fact it's simple and insightful. As you'll see in the Fluctuations app, although it's not at first obvious, the 〈δN₁δN₁〉 means, as shown in the second equation, the product of each value of N₁, the local number of molecules, minus the global average number 〈N₁〉. The calculation involved two passes through the data and can be done with a single pass via the equivalent:

`"〈"δN_1δN_1"〉" = "〈"N_1N_1"〉"-"〈"N_1"〉""〈"N_1"〉"`

So to calculate G₁₁ we need to go through probability space to get the average N₁. But in terms of this app that's just the sum of (g₁₁-1)r, and is basically 0. What about the 〈N₁N₁〉 term, the average of the square of the number of particles. That's the average of 4π(g₁₁-1)r²dr. But that's what the G₁₁ graph is showing! So Fluctuation theory and integrals of RDFs aren't just similar, they are the same thing.

Excluded volume

It is so obvious that the RDF must be zero inside the contact radius of the molecules that we hardly notice it. Yet this "excluded volume" can have significant thermodynamic effects. For example, the impact of large sugar molecules on the folding behaviour of proteins in solution is largely due (via a not-so-obvious chain of logic) to the fact that where there is a sugar molecule there isn't a water molecule.

¹Enrico Matteoli and G. Ali Mansoori, A simple expression for radial distribution functions of pure fluids and mixtures, J. Chem. Phys. 103, 4672 (1995)

Practical Solubility - KB

Radial Distribution Functions

Quick Start

Radial Distribution Functions

Fluctuation Theory

Excluded volume

Solubility Science: Principles in Practice