Diffraction Grating
In this derivation:
Summing the complex amplitude contributed by each slit:
Summing the series:
This is an amplitude, to find the intensity multiply by the complex conjugate:
Maxima of this function are when:
, where m is an integer.
is the intensity diffracted by a single slit:
where:
This provides the envelope of efficiency.
As
the principal maxima become sharper.
Blaze is a tilt on the grroves of a reflection grating to concentrate the
light at a higher angle, and therefore in orders M
0. Thus light is
concentrated in the dispersed orders.
For a blazed grating we get an efficiency curve of the form
,
where:
where
Now call the angle of diffraction d instead of
. Angle of incidence is still i
Blaze angle
.
Spectrograph Design
A reflection spectrograph consists of:
In this derivation:
The Disperser Equation for a grating is expressed as
.
In this case:
Magnification is given by:
The first term is just the ratio of focal lengths as normal, the second term is an anamorphic magnification, caused by the fact that the beam coming off the grating has a different diameter to the beam incident upon it.
Dispersion is given by the linear rate of change of wavelength with distance on the detector, which is the rate of change of wavelength with diffracted angle, times the focal length of the camera:
Resolving power
, and
And substituting in the Magnification:
But differentiating g with respect to each variable:
Also, from simple geometry:
and:
The properties of the spectrograph are then:
Note that the magnification given is appropriate for the dispersion direction only, the anamorphic factor does not work along the slit (which is why it is an anamorphic factor). The slit direction magnification is:
So the resolving power is increased if
is large and all of
,
and
are small. It is harder to design a high resolution
spectrograph the larger the telescope aperture.
Blaze angle
, and the camera-collimator angle
. From the disperser equation
.
Using
:
If
,
,
, and
are all known, this is a quadratic
equation in
which can be solved. As a quadratic equation it has
two solutions. these correspond to two different configurations of the
spectrograph, of which is called blaze-to-collimator which is
illustrated at the top of this section, and one which is called
blaze-to-camera which is illustrated below.
In the blaze-to-collimator configuation the beam expands as it comes off the
grating,
, and there is anamorphic demagnification of the
slit image, so the slit can be wider for the same resolving power. The penalty
is that the beam is larger, and so the camera optics need to be larger and
consequently more expensive. In the blaze-to-camera configuration
, and there is anamorphic magnification.
Going back to the formula for resolving power:
But:
Therefore:
So to get good resolving power we need large gratings, large blaze angle,
small camera angle (in practice this is usually of order 30
),
small entrance aperture size, and its easier with smaller telescopes. Of
course at high resolving power you need the large telescopes in order to get
enough signal for the science goals, so we need to build the big spectrographs
for the large telescopes.
Grating Fabrication
There are three principal ways of making reflection gratings:
Blazed Transmission Gratings and Grisms
Transpission gratings are used only for low resolution applications. It is hard to make effectively blazed transmission gratings. More common is a grism, where a transmission grating is replicated on the back of a prism, the prism is usually specified so that the deviation for the central wavelength of interest is equal and opposite to that of the grating. This means that the instrument can be straight. Grism spectrographs still operate at comparatively low dispersion.
Prismatic Dispersion
Prisms disperse light because the refractive index is different for different wavelengths. The refractive index as a function of wavelength is given by an empirical formula due to Hartmann:
Where
,
and
are properties of the refracting material. The between
the direction of the final ray and the initial ray is given by:
Dispersion is a much stronger function of wavelength that with a grating, so the relationship between wavelength and position on the detector will be more non-linear. Deviation is highest at shorter wavelengths, as opposed to a grating where deviation is highest at longer wavelengths. Prisms are made from glasses at optical wavelengths, and out of more exotic materials like Selenium Fluoride for infra-red use. Prisms are rarely used in spectrographs by themselves, but are much more commonly used as cross dispersers for order separation in conjunction with echelle gratings.
Echelle Gratings
An echelle grating is an extreme example of a blazed grating. The blaze angle
is set very high (to of order 65
),
with this blaze angle the camera angle
must be small to
avoid groove shadowing.
Because echelle gratings cannot be made with small groove spacing
, they
are used in high order
.
can be inthe range 5-10 for a low dispersion
echelle (echellette) spectrograph, but for the highest resolution stellar
echelle spectrographs orders of a few hundred can be used. This leads to order
overlap because the position on the detector depends only upon
;
for instance light at wavelength 550nm in order 100 overlaps with light at
wavelength 555.5nm in order 99.
In a classical echelle this order overlap problem is solved by cross dispersion, in which a low dispersion is provided in an orthogonal direction to the main dispersion provided by the echelle grating. This separates the orders which then appear side by side on the detector. This technique is suitable only for point sources (e.g. stars), otherwise the orders still overlap. The cross dispersing element must have a different functional form of the dispersion from the main disperser (the echelle), otherwise order coincidence still occurs. Usually a prism or series of prisms are used.
In the diagram above the layout of the echelle spectrograph built by University College London for the Anglo-Australian telescope is shown. Three prisms provide the cross dispersion, and the direction of the main dispersion provided by the echelle grating is out of the plane of the paper. This provides a series of orders, each covering a short wavelength range, separated on the detector. This is shown in the diagram below.
An alternative for the future to using cross dispersion is to use energy sensitive detectors, such as an array of Superconducting Tunnel Junction detectors or of Transition Edge Sensor Quantum Calorimeters, to separate the orders. If this can be done (currently these arrays have too few pixels to be useful) then the echelle spectrograph can also work on extended objects.
Bragg Diffraction
Diffraction of X-rays was used as a tool to analyse the structure of crystals in the early 20th century, but a crystal with a regular lattice spacing can be used as a diffraction element at X-ray wavelengths.
In this form of diffraction the phase difference is provided by the spacing
between the crystal planes, reflection from successive planes produces a phase
difference
, where
is the spacing between the crystal planes. We can sum the amplitudes from
successive planes as with the analysis of the diffraction grating, and find
that constructive intereference occurs at the Bragg angle
, given by:
A crystal like this with the crystal planes parallel to the crystal surface acts as a monochromater, i.e. only one wavelength of radiation is reflected from it. At X-ray wavelengths crystal spectrometers are constructed by either scanning the angle of the crystal, or alternatively using a bent crystal, where different parts of the surface reflect X-rays of different wavelengths at different angles to a position sensitive detector (e.g. a CCD).
Volume Phase Holographic gratings
A Volume Phase Holographic (VPH) grating works by a mechanism similar to Bragg diffraction, but these are used at optical and near infra-red wavelengths. The planes in this case are modulations of the refractive index in the form of fringe planes, parallel and equally spaced, running through the grating material, terminating at the surfaces of the volume. They are manufactured using holographic techniques, using material whose refractive index can be changed by a process similar to the photographic process, and consist of a layer of gelatin in which the fringes are developed, sanwiched between glass plates which provide mechanical support and protection from dust, water etc.
If the fringes are parallel to the grating surface, then as in the case of the crystal at X-ray wavelengths, the grating acts as a reflecting monochromater which can be scanned in angle.
If the fringes are nearly but not quite parallel to the grating surface, the grating acts as a reflection grating.
If the fringes are perpendicular or nearly so to the grating surface, the grating acts as a transmission grating. Using this technology it is possible to make transmission gratings of much higher fringe frequency that the ruling frequency possible with conventional gratings.
Notation for this section:
Refractive index is modulated according to:
We will consider the case of fringes perpendicular to the grating surface,
= 90
.
Bragg condition is:
Where
is the Bragg angle in the grating medium (gelatin). The
Bragg condition is met when
is integer and
and
are
such that the angles of incidence and diffraction are symmetric within the
grating medium about the fringe planes.
Near the Bragg condition there is a Bragg envelope, where light at different wavelengths is diffracted at different angles, but at less efficiency than the light diffracted at the Bragg angle. This is analagous to the blaze envelope of a conventional grating.
The disperser equation of the configuration in panel A of the plot above is:
So:
Unlike with a blazed grating, the angle can be rotated to an angle at which
the Bragg condition is satisfied for a different wavelength
or different order
. With a VPH transmission grating or ant transmission
grating the camera must be rotated as well.
The Diffraction efficiency of a VPH grating depends upon the thickness
,
the intensity of the modulations in the refractive index
, the
wavelength, and the Bragg angle. For
in first order:
And the passband over which the grating is effective is given approximately by:
here is the bandwidth, not the resolution.
In practice the amplitude of the refractive index modulations and the thickness must be tuned to produce a grating with the desired properties.
Fabry-Perot Etalons
A Fabry-Perot etalon works by constructive interference of light from multiple reflections between two exactly parallel surfaces, this is usually a gap between two precisely manufactured plates of some material (glass for an optical etalon, sometimes sapphire for an infra-red etalon) the inner surfaces of which are coated with a reflective surface. For the purposes of this discussion we call the material that the plates are made out of glass and the material in the gap air. Other materials are possible.
Notation for this section:
We begin by considering Stokes' treatment of reflection at a surface. The principal of reversibility states that if you reverse the direction of all rays at a surface, the amplitudes will remain the same.
![]() |
![]() |
By the principal of reversibility:
,
.
solving these equations:
;
.
Summing the complex amplitude from each reflected ray:
Summing the series:
To get the intensity multiply this by its complex conjugate:
Using
:
The phase difference is given by:
and the reflected intensity
.
High reflectivity
gives sharper fringes, and the peaks occur when
.
The reflective finesse of an etalon
is
used as a measure of its theoretical performance. It gives the ratio of the
free spectral range to the spectral purity.
The dispersion of an etalon is given by
, or at small
,
.
Resolving Power of an Etalon
We define the resolving power in a way analagous to the Rayleigh criterion for resolving images.
To get to the half intensity point we require the wavelength to change by
or the angle to change by
.
At maximum,
.
At the 50% point the denominator above must be 2, so:
Using the trigonometric addition formulae:
For small
;
.
Using the trigonometric addition formulae again:
is small, so
; and
. Also
.
, and
, (we use -1 for convenience).
But according to the Rayleigh criterion you have to go to twice this
to resolve the line.
In terms of finesse:
To increase the finesse, and hence the resolving power, of the etalon we require a high reflectivity, and a large gap d.
In practice the profile of an etalon does not achieve the theoretical sharpness, it is degraded by other contributions to the finesse:
The effective finesse is given by:
Fabry-Perots are used in high order m to give high resolving power, the
free spectral range
is therefore small. They
are used in astrophysical applications where the light is concentrated in
a small number of emission lines, such as HII regions and Planetary Nebulae,
where the light is concentrated in H
and in [OIII] 500.7 nm
respectively. Orders are sorted with narrow band filters, allowing through
only the light in that particular line. The etalon is scanned either by varying
the gap
, or by varying the refractive index
by varying the pressure,
and a three dimensional
data array, in which two dimensions are spatial dimensions on the sky, and
the third is wavelength, is built up. Because the etalon has a dispersion,
an individual etalon setting does not give a unique wavelength, the wavelength
map must be built up by scanning the etalon and then applying a phase
correction as a function of the distance of the position on the sky from the
field centre.
Imaging Fabry-Perot consists of:
Field Size limitation of a Fabry-Perot
The dispersion of a Fabry-Perot is
. This increases with the angle
away from the optical axis,
. Eventually this becomes so large that
the change of wavelength over one detector pixel, or over the spatial
resolution element set by the seeing
, is large enough to limit
the wavelength resolution.
If the pixel size on the detector is
, then the change across a
pixel is:
If the spatial pixel size is matched to the seeing then:
Subject to
which is a practical limit set by
the difficulty of manufacturing faster focal ratio optics.
The incident field angle
at which the wavelength change across
one resolution element is equal to the intrinsic wavelength resolution
is:
However the angle incident upon the etalon is related to the field angle on the sky by:
So:
The field diameter
is twice this.
Michelson or Fourier Transform Spectrometer
The Michelson or Fourier transform spectrograph was popular for infra-red applications at a time when infra-red astronomy only had single pixel detectors. Imaging Michelson spectrometers are a possibility, but in general have been supplanted by imaging Fabry-Perot instruments which are easier to construct. The Michelson spectrograph relies on the same principle as the Michelson-Morley experiment. Light from the source is split into two beams by a half-silvered mirror, one is reflected off a fixed mirror and one off a moving mirror. The beams interfere, and by making measurements of the signal at many discrete positions of the moving mirror, the spectrum can be reconstructed. Michelson spectrographs are capable of very high spectral resolution observations of very bright sources.
For a monochromatic source of wavelength
, and at a particular
position of the movable mirror, there will be a path difference
between the two interfering beams. The Intensity measured at the detector
will be:
K is a constant. For a source with a general spectral distribution:
The first term is a DC offset which we can neglect. This leaves:
Which is the cosine transform of the spectrum. We change variables now to
frequency
:
Defining
:
Let
:
And the spectrum may be obtained from the data by the inverse transform:
In practice the intensity is sampled at discrete values of x, and the discrete transform is used.