6  Diffraction and interference

Again, we will briefly look at definitions. Let’s start with interference:

1 Google AI

2 Wikipedia

In this course, we focus on waves that follow a linear wave equation, so that given two solutions to the equation, the sum is also a solution. Hence, given two sources, the resulting wave is that obtained by solving the equation for each source individually and then simply summing the results. The resulting interaction between the two waves is then referred to as interference.

What about diffraction:

3 Google AI

4 Wikipedia

5 In our analysis, infinitely many sources.

But it turns out that diffraction is essentially just interference, but with very many5 sources. You are hopefully familiar with the basic double-slit experiment, simplified so that each slit is considered as a single source. To calculate the interference pattern, we apply the principle of superposition to the wave from the two sources, i.e. we sum the waves. For three slits, we would sum three waves. For one thousand slits, we sum one thousand waves. Now what happens if we moves these slits so that they are positioned next to each other, with no gaps. We continue to sum the waves. However, now the slits have merged to become a single aperture and, according to the definitions above, the resulting wave is the result of diffraction.

6.1 The Huygens-Fresnel principle

The Huygens-Fresnel principle states that

  • Every point of a wavefront serves as a source of secondary spherical wavelets. The resulting wave is calculated by the superposition of these wavelets.

This is illustrated in figure 6.1 where five secondary sources are placed on the wavefront within the aperture.

Blue vertical lines to the left of the diagram represent wavefronts moving to the right. In the middle, a barrier stretches from top to bottom with a gap in the middle. In the gap, five dots represent five secondary sources. To the right of the gap, a number of semicircular wavefronts are plotted for each source, producing a pattern of overlapping wavelets.
Figure 6.1: The Huygens-Fresnel principle, applied across an aperture. To the right of the barrier, wavelets are plotted emanating from the five sources across the aperture. The overall wave is calculated by summing these wavelets and those for all other points across the gap.

6.2 Spherical waves

Given that the Huygens-Fresnel principle involves spherical waves and the following analyses will involve superposing such waves, we should ensure that we know the formula for such waves. Recall the wave equation \[ \nabla^2E = \frac{1}{v^2}\frac{\partial^2E}{\partial t^2}. \] I will claim that \[ E = \frac{f(r - vt)}{r} \] is a solution, where \(r\) is the distance from the origin6. You should check that this solution does indeed satisfy the wave equation. It will be useful to know that, in spherical coordinates \[ \nabla^2E = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial E}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial E}{\partial\theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2E}{\partial\phi^2}. \]

6 Of course, \(r\) can be the distance from any point we choose — spherical waves are not all required to spread out from the origin.

6.3 Fresnel and Fraunhofer diffraction

In what follows, we will consider what happens to light when it encounters an opaque barrier with one or more apertures or slits. The study of the result diffraction patterns is often split into two domains:

Fresnel diffraction: Also called near-field diffraction, Fresnel diffraction concerns the situation when either the light source or the screen (or both) are ‘close’ to the barrier.

Fraunhofer diffraction: Also called far-field diffraction, Fraunhofer diffraction concerns the situation where both light source and screen are ‘far’ from the barrier.

The diffraction pattern changes as the distance from the barrier increases but eventually settles into the Fraunhofer pattern, after which the size of the pattern changes but the shape does not. Fresnel diffraction is typically more difficult to calculate, requiring the analysis of multiple spherical waves, while the large distances associated with Fraunhofer diffraction mean that simplifying assumptions can be made7. We will therefore focus on Fraunhofer diffraction for the reminder of this chapter.

7 We can consider wavefronts to be effectively planar.

Note that Fresnel and Fraunhofer diffraction are not different effects. They merely refer to examining the effect of diffraction, but at different distances from the barrier.

There remains the question of what we mean by ‘close’ or ‘far’ in our definitions. The calculation of Fraunhofer diffraction requires simplifying mathematical assumptions that are not valid too close to the screen. We will see that the domain in which these assumptions make sense depends both on the aperture width and the wavelength. We will find that Fraunhofer diffraction is valid provided both source and screen are at least a distance of \(R\) from the screen, where \(R = a^2/\lambda\) and where \(a\) is the slit width and \(\lambda\) is the wavelength.

6.4 Single-slit diffraction

The setup is as shown in figure 6.2, but with the screeen further from the barrier than shown. Waves approach the barrier from the left. These are plane waves8, monochromatic9 and coherent10. To begin, we will consider the situation in two dimensions only. In this case, the aperture can be considered as a line of sources of spherical wavelets which we will superpose to determine the diffraction pattern.

8 Suggesting a source of light infinitely far away.

9 A single frequency (and wavelength).

10 The same phase.

Blue vertical lines to the left of the figure indicate wave fronts approaching a barrier placed from top to bottom in the middle of the figure. The barrier has a gap, allowing roughly circular wavefronts to emanate from the aperture. A screen is placed to the right of the figure.
Figure 6.2: The setup for the single-slit diffraction experiment. The screen should be located far from the barrier. The roughly circular wavefronts are for illustration - we will determine whether this correct after performing our calculations.

Consider a fixed point, \(P\), on the screen such that a wavelet starting at the aperture centre must travel a distance \(R\) at angle \(\theta\) to the horizontal to reach it. Wavelets starting from other locations within the aperture travel a slightly different distance to reach \(P\), as shown in figure 6.3. The difference in distances will determine whether the wavelets interfere constructively or destructively.

A barrier is show to the left of the diagram. The view is a closeup, so the gap in the barrier looks larger. A ray is shown from the centre point of the aperture to point P on the screen. It subtends an angle of theta to the horizontal and is of distance capital R. A point within the aperture at distance y above the aperture centre is also shown (with this distance marked). A ray from this point to point P is shown, of length lower case r. The width of the aperture is a.
Figure 6.3: Wavelet starting a distance \(y\) from the centre of the aperture travels unknown distance \(r\) to reach point \(P\).

Using the cosine rule11 we find that \[ r = R\sqrt{1 - 2\frac{y}{R}\sin\theta + \frac{y^2}{R^2}}. \] We Taylor expand in \(y/R\) to get \[ r = R - y\sin\theta + \frac{y^2}{2R}\cos^2\theta + \ldots \] We would like to be able to ignore the third term. How small must it be to allow us to disregard it? It must be small compared with the wavelength, otherwise omitting the term will make a significant difference to the way the wavelets interfere at point \(P\). That is, we want \(y^2/R \ll \lambda\). Noting that \(y\) might be as large as \(a/2\) and rearranging this condition, we find that we require \(R \gg a^2/\lambda\), as mentioned earlier.

11 To remind you, this states that if \(a\), \(b\) and \(c\) are the lengths of the three sides of a triangle and \(A\) is the angle opposite side \(a\) then \[ a^2 = b^2 + c^2 - 2bc\cos A. \]

Ignoring this term, we find that the difference in path lengths is \(y\sin\theta\). Interestingly, this is what we would get if we simply assumed that the rays were parallel, as shown in figure 6.4.

The top half of an aperture in a barrier is shown to the left of the diagram. Two parallel rays are illustrated, both making an angle of theta to the horizontal. The bottom ray is labelled with a capital R while the top one has a lower case r. A dashed line is shown, starting at the start of the top ray and perpendicular to it, making a right angled triangle between the rays, with the top angle also being theta and the hypoteneuse - the vertical line within the aperture - being of length y. The bottom edge of the triangle is how far the bottom ray starts behind the top one and is labelled with y sine theta.
Figure 6.4: If outgoing rays are assumed to be parallel, the lower ray starts a distance of \(y\sin\theta\) behind the upper one.

We now sum the contribution of all the wavelets at position \(P\). Since we consider contributions from all the positions within the aperture, this is actually an integral. We get \[ E = A\int_{-a/2}^{a/2}\frac{e^{i\left(k\left(R - y\sin\theta\right) - \omega t\right)}}{R - y\sin\theta}\,dy. \] The \(y\sin\theta\) in the denominator is rather awkward. However, for large \(R\) and small \(y\), its contribution is negligible12 while the change \(y\sin\theta\) makes to the phase of the wave is not. We therefore ignore the \(y\sin\theta\) in the denominator and get \[ E = \frac{A}{R}e^{i\left(kR-\omega t\right)}\int_{a/2}^{a/2}e^{-iky\sin\theta}\,dy. \] We can do this integral. \[ \begin{aligned} \int_{a/2}^{a/2}e^{-iky\sin\theta}\,dy &= \frac{i}{k\sin\theta}\left[e^{-iky\sin\theta}\right]_{-a/2}^{a/2}\\ &= \frac{i}{k\sin\theta}\left(e^{-\frac{1}{2}ika\sin\theta} - e^{\frac{1}{2}ika\sin\theta}\right). \end{aligned} \] If we let \[ \beta = \frac{1}{2}ka\sin\theta \] then this is \[ \begin{aligned} \int_{a/2}^{a/2}e^{-iky\sin\theta}\,dy &= \frac{ai}{2\beta}\left(e^{-i\beta} - e^{i\beta}\right)\\ &= \frac{ai}{2\beta}\left(-2i\sin\beta\right)\\ &= \frac{a\sin\beta}{\beta}. \end{aligned} \] We therefore get \[ E = \frac{Aa}{R}\frac{\sin\beta}{\beta}e^{i\left(kR-\omega t\right)}. \]

12 We note that \[ \frac{1}{R-y\sin\theta} \simeq \frac{1}{R} + \frac{y}{R^2}\sin\theta. \]

13 Being physicists, we say that \(\sin \theta / \theta = 1\). A mathematician might say that this is undefined, being zero divided by zero, but would be happy to agree that the limit, as theta approaches zero, is one.

Now recall that the intensity is proportional to the amplitude squared. If \(I(\theta)\) represents the light intensity at angle \(\theta\), then we find that \(I(0)\) is proportional to13 \((Aa/R)^2\). We then find that, for any \(\theta\) \[ I(\theta) = I(0)\frac{\sin^2\beta}{\beta^2}. \] A plot of this function is shown in figure 6.5.

A graph of intensity against beta. The graph has mirror symmetry in the vertical axis, since it is a graph of an even function. In the middle, at beta equal to zero, there is a wide, tall peak. Lower peaks appear on either side, centred approximately but not exactly at plus or minus 1.5 times pi, 2.5 time pi, etc. Troughs occur at multipled of pi (except for at zero), where the intensity becomes zero.
Figure 6.5: A plot of light intensity against \(\beta\).

We see that minima occur whenever \(\beta\) is a multiple of \(\pi\), except at the central peak at \(\beta = 0\). Other maxima occur at \(\beta\) approximately equal to \(\pm1.43\pi\), \(\pm2.46\pi\), \(\pm3.47\pi\), etc.

Exercise

By differentiating the intensity with respect to \(\beta\), find a simple equation satisfied by the value of \(\beta\) at the maxima of intensity. Confirm that the values for \(\beta\) listed above satisfy this equation.

Noting that \[ \beta = \frac{\pi a}{\lambda}\sin\theta \] we see that when the slit width is large compared to the wavelength then \(\beta\) increases rapidly as \(\theta\) is increased. As a result, the central peak is very narrow14 and light intensity is negligible except when \(\theta\) is close to zero. There is little spreading of the waves, or in other words, little diffraction. In contrast, when the slit width is small compared with the wavelength, \(\beta\) increases very slowly as \(\theta\) increases. As a result, \(\theta\) reaches an angle of \(\pi / 2\) before \(\beta\) leaves the central peak of the illustrated graph15. The light is spread almost equally across all values of \(\theta\). Finally, when the slit width and the wavelength are similar, we see bands of light, i.e. a diffraction pattern, as suggested by the graph.

14 Imagine the graph squashed horizontally.

15 Imagine the graph stretched horizontally.

6.4.1 In three dimensions

Thus far we have considered the slit as an idealised line source16 in two dimensions. However, you may have noticed the inclusion of the \(z\) axis in figure 6.3. How much light is detected at non-zero values of \(z\)? The key here is to notice that if we consider the aperture as simply an idealised line source of light, then there is rotational symmetry around the \(y\)-axis. Hence, for a line source that is long compared with the wavelength, the intensity of light is negligible except in the \(x\)-\(z\) plane, where \(\theta\) is close to zero. This is equivalent to a single light source at the centre, but one which only emits light in the \(x\)-\(z\) plane, i.e. circular waves. If the line source is very short compared with the wavelength, it is equivalent to a single light source that emits light in all directions, i.e. spherical waves.

16 A line of zero thickness, each point of which is emitting monochromatic coherent spherical waves.

We must also revisit the notion of the aperture as a idealised line source. Figure 6.6 shows a slit in a barrier in three dimensions. This slit has both length and width. Start by thinking of the slit a collection of vertical idealised line sources. Since the length of the slit is much larger than the wavelength of the light, the light from each line source does not diffract much in the vertical direction. Each vertical line can be thought of as equivalent to a source of light, positioned at mid-height within the slit, emitting light only in the horizontal plane.

To the left of the diagram, light rays are show entering a vertical slit within a barrier. To the right of the diagram is a screen, showing a line of five horizontal line segments - the diffraction pattern. The central line is slightly longer than the others.
Figure 6.6: Single-slit diffraction in three dimensions. (To make the scale correct, either shrink the slit or scale up the screen.)

This now means that we have a short horizontal row of line sources. The width of the slit is not large compared with the wavelength and so light diffracts horizontally. The result is the diffraction pattern shown on the screen.

6.5 Double-slit diffraction

Moving to two slits is now straightforward. Consider two slits, both of width \(a\) as before, with centres separated by \(b\), as shown in figure 6.7.

A barrier with two vertical slits, separated by distance b. Each slit is of width a.
Figure 6.7: Two slits.

Each slit is displaced by \(b/2\) from the centre. Recall that a centrally placed slit gives us \[ E = \frac{Aa}{R}\frac{\sin\beta}{\beta}e^{i\left(kR-\omega t\right)}. \] Using the same logic as for the single slit case, a slit displaced by \(b/2\) produces \[ E = \frac{Aa}{R}\frac{\sin\beta}{\beta}e^{i\left(kR-\omega t \pm \alpha\right)} \] where \[ \alpha = \frac{1}{2}kb\sin\theta. \] Summing the waves from the two sources produces \[ \begin{aligned} E &= \frac{Aa}{R}\frac{\sin\beta}{\beta}e^{i\left(kR-\omega t - \alpha\right)} + \frac{Aa}{R}\frac{\sin\beta}{\beta}e^{i\left(kR-\omega t + \alpha\right)}\\ &= \frac{Aa}{R}\frac{\sin\beta}{\beta}e^{i\left(kR-\omega t\right)}\left(e^{i\alpha} + e^{-i\alpha}\right)\\ &= \frac{2Aa}{R}\frac{\sin\beta}{\beta}e^{i\left(kR-\omega t\right)}\cos\alpha. \end{aligned} \] The intensity is now given by17 \[ I(\theta) = I(0)\left(\frac{\sin^2\beta}{\beta^2}\right)\cos^2\alpha. \]

17 The intensity at \(\theta = 0\) is now four times what it was for a single slit. (I.e. this is a different \(I(0)\).)

Setting \(b = 3a\), this produces the graph shown in figure 6.8.

Figure 6.8: A plot of light intensity against \(\beta\), for the two slit experiment with \(b = 3a\).

The black curve here ignores the factor of \(\cos^2\alpha\) and is the same shape as that from the single-slit experiment. The cosine part of the formula oscillates more rapidly18. Multiplying the two parts produces the blue curve.

18 Since \(\alpha = 3\beta\) in this case.

Notice that where we would expect the third peak from the centre to appear, at \(\beta = \pm\pi\), the peak has been squashed by the black curve. If we make the slits narrower but maintain the slit separation, then we stretch the black curve horizontally and obtain a more familiar interference pattern with bands of similar intensity, as shown in figure 6.9.

Figure 6.9: A plot of light intensity against \(\beta\), for the two slit experiment with \(b = 3a\).