POLARIMETRY

This is continuation from the previous tutorial - Interferometers

1. POLARIMETERS

Polarimeters are optical instruments used for determining the polarization properties of light beams and samples. Polarimetry, the science of measuring polarization, is most simply characterized as radiometry with polarization elements.

To perform accurate polarimetry, all the issues necessary for careful and accurate radiometry must be considered, together with many additional polarization issues. In this tutorial, our emphasis is strictly on those additional polarization issues which must be mastered to accurately determine polarization properties from polarimetric measurements.

Typical applications of polarimeters include the following: remote sensing of the earth and astronomical bodies, calibration of polarization elements, measuring the thickness and refractive indices of thin films (ellipsometry), spectroscopic studies of materials, and alignment of polarization-critical optical systems.

We can broadly subdivide polarimeters into the several categories as discussed in succeeding sections.

2. LIGHT - MEASURING AND SAMPLE - MEASURING POLARIMETERS

Light-measuring polarimeters determine the polarization state of a beam of light or determine some of its polarization characteristics. These determinations may include the following: the direction of oscillation of the electric field vector for a linearly polarized beam, the helicity of a circularly polarized beam, or the elliptical parameters of an elliptically polarized beam, as well as the degree of polarization and other characteristics.

A light-measuring polarimeter utilizes a set of polarization elements placed in a beam of light in front of a radiometer; or, to paraphrase, the light from the sample is analyzed by a series of polarization state analyzers, and a set of measurements is acquired.

The polarization characteristics of the sample are determined from these measurements by a data reduction procedure. Measurement, calibration, and data reduction algorithms are treated under ‘‘Light-measuring Polarimeters.’’

3. SAMPLE - MEASURING POLARIMETERS

Sample-measuring polarimeters determine the relationship between the polarization states of incident and exiting beams for a sample. The term exiting beam is general and includes beams which are transmitted, reflected, diffracted, or scattered.

The term sample is also an inclusive term used in a broad sense to describe a general light-matter interaction or sequence of such interactions and applies to practically anything.

Measurements are acquired using a series of polarization elements located between a source and sample and the exiting beams are analyzed with a separate set of polarization elements between the sample and radiometer. Samples of great interest include surfaces, thin films on surfaces, polarization elements, optical elements, optical systems, natural scenes, biological samples, and industrial samples.

Accurate polarimetric measurements can be made only if the polarization generator and/or polarization analyzer are fully calibrated. To perform accurate polarimetry, the polarization elements do not need to be ideal.

If the Mueller matrices of the polarization components are known , the systematic errors due to nonideal polarization elements can be removed during the data reduction (see ‘‘Polarimetric Measurement Equation and Polarimetric Data Reduction Equation’’).

4. COMPLETE AND INCOMPLETE POLARIMETERS

A light-measuring polarimeter is ‘‘complete’’ if it measures a Stokes vector or if a Stokes vector can be determined from its measurements. An ‘‘incomplete’’ light-measuring polarimeter cannot be used to determine a Stokes vector.

For example, a polarimeter which employs a rotating polarizer in front of a detector does not determine the circular polarization content of a beam, and is incomplete.

Similarly, a sample-measuring polarimeter is complete if it is capable of measuring the full Mueller matrix, and incomplete otherwise. Complete polarimeters are often referred to as Stokes polarimeters or Mueller polarimeters.

5. POLARIZATION GENERATORS AND ANALYZERS

A polarization generator consists of a source, optical elements, and polarization elements to produce a beam of known polarization state. A polarization generator is specified by the Stokes vector S of the exiting beam. A polarization analyzer is a configuration of polarization elements, optical elements, and a detector which performs a flux measurement of a particular polarization component in an incident beam.

A polarization analyzer is characterized by a Stokes-like analyzer vector A which specifies the incident polarization state which is analyzed, the state which produces the maximal response at the detector. Sample-measuring polarimeters require polarization generators and polarization analyzers, while light-measuring polarimeters only require polarization analyzers.

Frequently the terms ‘‘polarization generator’’ and ‘‘polarization analyzer’’ refer just to the polarization elements in the generator and analyzer.

In this usage, it is important to distinguish between elliptical (and circular) generators and elliptical analyzers for a given state because they generally have different polarization characteristics and Mueller matrices (see ‘‘Elliptical and Circular Polarizers and Analyzers’’).

6. CLASSES OF LIGHT - MEASURING POLARIMETERS

Polarimeters operate by acquiring measurements with a set of polarization analyzers (and a set of polarization generators for sample-measuring instruments). The following sections classify polarimeters by the four broad methods by which these multiple measurements are acquired.

7. TIME - SEQUENTIAL MEASUREMENTS

In a time-sequential polarimeter, the measurements are taken sequentially in time. Between measurements, the polarization analyzer and/or polarization generator is changed.

Time-sequential polarimeters frequently employ rotating polarization elements or filter wheels containing a set of analyzers. A time-sequential polarimeter generally employs a single source and detector.

8. POLARIZATION MODULATION

Polarimeters employing polarization modulation comprise a subset of time-sequential polarimeters. Here, the polarization analyzer contains a polarization modulator, a rapidly changing polarization element.

The output of the analyzer is a rapidly fluctuating irradiance on which polarization information is encoded. Polarization parameters can then be determined by ac and vector voltmeters, by lock-in amplifiers, or by frequency-domain digital signal processing techniques.

For example, a rapidly spinning polarizer produces a modulated output which allows the flux and the degree of linear polarization to be read with a dc voltmeter and an ac voltmeter. The most common high-speed polarization modulators in general use are the electro-optical modulator, the magneto-optical modulator, and the photoelastic modulator.

9. DIVISION OF APERTURE

Polarimeters based on division of aperture employ multiple polarization analyzers operating side-by-side. The aperture of the polarimeter is subdivided, with each beam going into a separate polarization analyzer and detector. The detectors are usually synchronized to acquire measurements simultaneously.

This is similar in principle to the polarizing glasses used in 3-\(\text{D}\) movie systems, where a \(45^\circ\) polarizer is used for one eye and a \(135^\circ\) for the other, permitting two polarization measurements simultaneously in two eyes.

10. DIVISION OF AMPLITUDE

Division-of-amplitude polarimeters utilize beam splitters to divide beams and direct them toward multiple analyzers and detectors. A division-of-amplitude polarimeter can acquire its measurements simultaneously. Many division-of-amplitude polarimeters use polarizing beam splitters to simultaneously divide and analyze the polarization state of the beam.

The four-detector photopolarimeter uses a sequence of detectors at nonormal incidence to measure Stokes vectors (Azzam, 1985; Azzam, Elminyawi, and El-Saba, 1988).

11. DEFINITIONS

Analyzer—an element whose intensity transmission is proportional to the content of a specific polarization state in the incident beam. Analyzers are placed before the detector in polarimeters. The transmitted polarization state emerging from an analyzer is not necessarily the same as the state which is being analyzed.

Birefringence—a material property, the retardance associated with propagation through an anisotropic medium. For each propagation direction within a birefringent medium, there are two modes of propagation with different refractive indices \(n_1\) and \(n_2\). The birefringence \(\Delta n\) is \(\Delta n=|n_1-n_2|\).

Depolarization—a process which couples polarized light into unpolarized light. Depolarization is intrinsically associated with scattering and with diattenuation and retardance which vary in space, time, and/or wavelength.

Diattenuation—the property of an optical element or system whereby the intensity transmittance of the exiting beam depends on the polarization state of the incident beam.

The intensity transmittance is a maximum \(\text{P}_\text{max}\) for one incident state, and a minimum \(\text{P}_\text{min}\) for the orthogonal state. The diattenuation is defined as \((\text{P}_\text{max}-\text{P}_\text{min})/(\text{P}_\text{max}+\text{P}_\text{min})\).

Diattenuator—any homogeneous polarization element which displays significant diattenuation and minimal retardance. Polarizers have a diattenuation close to one, but nearly all optical interfaces are weak diattenuators.

Examples of diattenuators include the following: polarizers and dichroic materials, as well as metal and dielectric interfaces with reflection and transmission differences described by Fresnel equations; thin films (homogeneous and isotropic); and diffraction gratings.

Dichroism—the material property of displaying diattenuation during propagation. For each direction of propagation, dichroic media have two modes of propagation with different absorption coefficients. Examples of dichroic materials include sheet polarizers and dichroic crystals such as tourmaline.

Eigenpolarization—a polarization state transmitted unaltered by a polarization element except for a change of amplitude and phase. Every polarization element has two eigenpolarizations.

Any incident light not in an eigenpolarization state is transmitted in a polarization state different from the incident state. Eigenpolarizations are the eigenvectors of the corresponding Mueller or Jones matrix.

Ellipsometry—a polarimetric technique which uses the change in the state of polarization of light upon reflection for the characterization of surfaces, interfaces, and thin films (after Azzam, 1993).

Homogeneous polarization element —an element whose eigenpolarizations are orthogonal. Then, the eigenpolarizations are the states of maximum and minimum transmittance and also of maximum and minimum optical path length.

A homogeneous element is classified as linear, circular, or elliptical depending on the form of the eigenpolarizations.

Inhomogeneous polarization element —an element whose eigenpolarizations are not orthogonal. Such an element will display different polarization characteristics for forward and backward propagating beams.

The eigenpolarizations are generally not the states of maximum and minimum transmittance. Often inhomogeneous elements cannot be simply classified as linear, circular, or elliptical.

Ideal polarizer—a polarizer with an intensity transmittance of one for its principal state and an intensity transmittance of zero for the orthogonal state.

Linear polarizer—a device which, when placed in an incident unpolarized beam, produces a beam of light whose electric field vector is oscillating primarily in one plane, with only a small component in the perpendicular plane (after Bennett, 1993).

Nonpolarizing element—an element which does not change the polarization state for arbitrary states. The polarization state of the output light is equal to the polarization state of the incident light for all possible input polarization states.

Partially polarized light—light containing an unpolarized component; cannot be extinguished by an ideal polarizer.

Polarimeter—an optical instrument for the determination of the polarization state of a light beam, or the polarization-altering properties of a sample.

Polarimetry—the science of measuring the polarization state of a light beam and the diattenuating, retarding, and depolarizing properties of materials.

Polarization—any process which alters the polarization state of a beam of light, including diattenuation, retardance, depolarization, and scattering.

Polarization coupling—any conversion of light from one polarization state into another state.

Polarized light—light in a fixed, elliptically (including linearly or circularly) polarized state. It can be extinguished by an ideal polarizer. For polychromatic light, the polarization ellipses associated with each spectral component have identical ellipticity, orientation, and helicity.

Polarizer—a strongly diattenuating optical element designed to transmit light in a specified polarization state independent of the incident polarization state. The transmission of one of the eigenpolarizations is very nearly zero.

Polarization element—any optical element which alters the polarization state of light. This includes polarizers, retarders, mirrors, thin films, and nearly all optical elements.

Pure diattenuator—a diattenuator with zero retardance and no depolarization.

Pure retarder—a retarder with zero diattenuation and no depolarization.

Retardance—a polarization-dependent phase change associated with a polarization element or system. The phase (optical path length) of the output beam depends upon the polarization state of the input beam. The transmitted phase is a maximum for one eigenpolarization, and a minimum for the other eigenpolarization. Other states show polarization coupling and an intermediate phase.

Retardation plate—a retarder constructed from a plane parallel plate or plates of linearly birefringent material.

Retarder—a polarization element designed to produce a specified phase difference between the exiting beams for two orthogonal incident polarization states (the eigenpolarizations of the element).

For example, a quarter-wave linear retarder has as its eigenpolarizations two orthogonal linearly polarized states which are transmitted in their incident polarization states but with a \(90^\circ\) (quarter-wavelength) relative phase difference introduced.

Spectropolarimetry—the spectroscopic study of the polarization properties of materials. Spectropolarimetry is a generalization of conventional optical spectroscopy. Where conventional spectroscopy endeavors to measure the reflectance or transmission of a sample as a function of wavelength, spectropolarimetry also determines the diattenuating, retarding, and depolarizing properties of the sample. Complete characterization of these properties is accomplished by measuring the Mueller matrix of the sample as a function of wavelength.

Waveplate —a retarder.

12. STOKES VECTORS AND MUELLER MATRICES

Several calculi have been developed for analyzing polarization, including those based on the Jones matrix, coherency matrix, Mueller matrix, and other matrices (Shurcliff, 1962; Gerrard and Burch, 1975; Theocaris and Gdoutos, 1979; Azzam and Bashara, 1987; Coulson, 1988; Egan, 1992). Of these methods, the Mueller calculus is most generally suited for describing irradiance-measuring instruments, including most polarimeters, radiometers, and spectrometers, and is used exclusively in this paper.

In the Mueller calculus, the Stokes vector \(\boldsymbol{S}\) is used to describe the polarization state of a light beam , and the Mueller matrix \(\boldsymbol{M}\) to describe the polarization-altering characteristics of a sample. This sample may be a surface, a polarization element, an optical system, or some other light/matter interaction which produces a reflected, refracted, diffracted, or scattered light beam. All vectors and matrices are represented by bold characters. Normalized vectors have ‘‘hats’’ (i. e., \(\boldsymbol{\hat{A}})\).

13. PHENOMENOLOGICAL DEFINITION OF THE STOKES VECTOR

The Stokes vector is defined relative to the following six flux measurements \(\text{P}\) performed with ideal polarizers in front of a radiometer (Shurcliff, 1962):

\(\text{P}_H\) horizontal linear polarizer \((0^\circ)\)

\(\text{P}_V\) vertical linear polarizer \((90^\circ)\)

\(\text{P}_{45}\) \(45^\circ\) linear polarizer

\(\text{P}_{135}\) \(135^\circ\) linear polarizer

\(\text{P}_R\) right circular polarizer

\(\text{P}_L\) left circular polarizer

Normally, these measurements are irradiance measurements \((W/m^2)\) although other flux measurements might be used. The Stokes vector is defined as

\[\tag{1}s=\left[\begin{align}S_0\\S_1\\S_2\\S_3\end{align}\right]=\left[\begin{array}&P_H+P_V\\P_H-PV\\P_{45}-P_{135}\\P_R-P_L\end{array}\right]\]

where \(S_0,S_1,S_2\) and \(S_3\) are the Stokes vector elements. The Stokes vector does not need to be measured by these six ideal measurements; what is required is that other methods reproduce the Stokes vector defined in this manner. Ideal polarizers are not required.

Further, the Stokes vector is a function of wavelength, position on the object, and the light’s direction of emission or scatter. Thus, a Stokes vector measurement is an average over area, solid angle, and wavelength, as is any radiometric measurement. Each Stokes vector element has units of watts per meter squared.

The Stokes vector is defined relative to a local \(x-y\) coordinate system defined in the plane perpendicular to the propagation vector. The coordinate system is right-handed; the cross product \(\hat{x}\times\hat{y}\) of the basis vectors points in the direction of propagation of the beam.

14. POLARIZATION PROPERTIES OF LIGHT BEAMS

From the Stokes vector , the following polarization parameters are determined (Azzam and Bashara, 1977 and 1987; Kliger, Lewis, and Randall, 1990; Collett, 1992):

\[\tag{2}\text{Flux}\qquad\qquad\qquad P=s_0\]

\[\tag{3}\text{Degree}\;\text{of}\;\text{polarization}\qquad\qquad\qquad\boldsymbol{DOP}=\frac{\sqrt{S^2_1+S^2_2+S^2_3}}{S_0}\]

\[\tag{4}\text{Degree}\;\text{of}\;\text{linear}\;\text{polarization}\quad\qquad\boldsymbol{DOLP}=\frac{\sqrt{S^2_1+S^2_2}}{S_0}\]

\[\tag{5}\text{Degree}\;\text{of}\;\text{circular}\;\text{polarization}\qquad\boldsymbol{DOCP}=\frac{S_3}{S_0}\]

The Stokes vector for a partially polarized beam \(\boldsymbol{\text{(DOP}}<1)\) can be considered as a superposition of a completely polarized Stokes vector S P and an unpolarized Stokes vector \(\boldsymbol{S}_P\) which are uniquely related to \(\boldsymbol{S}\) as follows (Collett, 1992):

\[\tag{6}\boldsymbol{S}=\boldsymbol{S}_P+\boldsymbol{S}_U=\left[\begin{align}S_0\\S_1\\S_2\\S_3\end{align}\right]s_0\boldsymbol{DOP}\left[\begin{array}&1\\s_1/(s_0\boldsymbol{DOP})\\s_2/(s_0\boldsymbol{DOP})\\s_3/(s_0\boldsymbol{DOP})\end{array}\right]+(1-\boldsymbol{DOP})s_0\left[\begin{array}&1\\0\\0\\0\end{array}\right]\]

The polarized portion of the beam represents a net polarization ellipse traced by the electric field vector as a function of time. The ellipse has a magnitude of the semimajor axis \(a\), semiminor axis \(b\), orientation of the major axis \(\eta\) (azimuth of the ellipse) measured counterclockwise from the \(x\) axis, and eccentricity (or ellipticity).

\[\tag{7}\text{Ellipticity}\qquad\qquad\qquad e=\frac{b}{a}=\frac{S_3}{S_0+\sqrt{S^2_1+S^2_2}}\]

\[\tag{8}\text{Orientation}\;\text{of}\;\text{major}\;\text{axis},\text{azimuth}\qquad\qquad \eta=\frac{1}{2}\text{arctan}\left[\frac{S_2}{S_1}\right]\]

\[\tag{9}\text{Eccentricity}\qquad\qquad\qquad\epsilon=\sqrt{1-e^2}\]

The ellipticity is the ratio of the minor to the major axis of the corresponding electric field polarization ellipse, and varies from 0 for linearly polarized light to 1 for circularly polarized light.

The polarization ellipse is alternatively described by its eccentricity, which is zero for circularly polarized light, increases as the ellipse becomes thinner (more cigar-shaped), and becomes one for linearly polarized light.

15. MUELLER MATRICES

The Mueller matrix \(\boldsymbol{M}\) for a polarization-altering device is defined as the matrix which transforms an incident Stokes vector S into the exiting (reflected , transmitted , or scattered) Stokes vector \(\boldsymbol{S}'\),

\[\tag{10}\boldsymbol{S}'=\left[\begin{align}S'_0\\S'_1\\S'_2\\S'_3\end{align}\right]=\boldsymbol{\text{MS}}=\left[\begin{array}&m_{00}\;m_{01}\;m_{02}\;m_{03}\\m_{10}\;m_{11}\;m_{12}\;m_{13}\\m_{20}\;m_{21}\;m_{22}\;m_{23}\\m_{30}\;m_{31}\;m_{32}\;m_{33}\end{array}\right]\left[\begin{array}&S_0\\S_1\\S_2\\S_3\end{array}\right]\]

The Mueller matrix is a four-by-four matrix with real valued elements. The Mueller matrix \(M(k,\lambda)\) for a device is always a function of the direction of propagation \(k\) and wavelength \(\lambda\).

The Mueller matrix is an appropriate formalism for characterizing polarization measurements because it contains within its elements all of the polarization properties: diattenuation, retardance, depolarization, and their form, either linear, circular, or elliptical.

When the Mueller matrix is known, then the exiting polarization state is known for an arbitrary incident polarization state. Table 1 is a compilation of Mueller matrices for common polarization elements, together with the corresponding transmitted Stokes vector.

Other tables of Mueller matrices may be found in the following references: Shurcliff (1962), Gerrard and Burch (1975), Azzam and Bashara (1977), Theocaris and Gdoutos (1979), and Collett (1992). See detailed discussion of the polarization properties as related to the Mueller matrix elements later in this tutorial.

The Mueller matrix \(\boldsymbol{\text{M}}\) associated with a beam path through a sequence (cascade) of polarization elements \(q=1,2,\cdots, Q\) is the right-to-left product of the individual matrices \(\boldsymbol{\text{M}}_q\),

\[\tag{11}\boldsymbol{\text{M}}=\boldsymbol{\text{M}}_Q\boldsymbol{\text{M}}_{Q_1}\cdots\boldsymbol{\text{M}}_q\cdots\boldsymbol{\text{M}}_2\boldsymbol{\text{M}}_1=\sum^1_{q=Q-1}\boldsymbol{\text{M}}_q\]

When a polarization element with Mueller matrix M is rotated about the beam of light by an angle \(\theta\) such that the angle of incidence is unchanged (for example, for a normal-incidence beam, rotating the element about the normal) , the resulting Mueller matrix \(\boldsymbol{\text{M}}(\theta)\) is

\[\tag{12}\boldsymbol{M}(\theta)=\boldsymbol{R}_M(\theta)\boldsymbol{MR}_M(-\theta)=\left[\begin{array}&1&0&0&0\\0&\cos(2\theta)&-\sin(2\theta)&0\\0&\sin(2\theta)&\cos(2\theta)&0\\0&0&0&1\end{array}\right]\left[\begin{align}m_{00}\;m_{01}\;m_{02}\;m_{03}\\m_{10}\;m_{11}\;m_{12}\;m_{13}\\m_{20}\;m_{21}\;m_{22}\;m_{23}\\m_{30}\;m_{31}\;m_{32}\;m_{33}\end{align}\right]\times\left[\begin{array}&1&0&0&0\\0&\cos(2\theta)&\sin(2\theta)&0\\0&-\sin(2\theta)&\cos(2\theta)&0\\0&0&0&1\end{array}\right]\]

where \(\boldsymbol{R}_M\) is the rotational change of basis matrix for Stokes vectors and Mueller matrices.

TABLE 1 Example Mueller Matrices and Transmitted Stokes Vectors

Here \(\theta>0\) if the \(x\) axis of the device is rotated toward \(45^\circ\). If the polarization element remains fixed but the coordinate system rotates by \(\phi\), the resulting Mueller matrix is \(\boldsymbol{M}(\phi)=\boldsymbol{R}_M(-\phi)\boldsymbol{MR}_m(\phi)\).

16. COORDINATE SYSTEM FOR THE MUELLER MATRIX

Consider a Mueller polarimeter consisting of a polarization generator which illuminates a sample, and a polarization analyzer which collects the light exiting the sample in a particular direction.

We wish to characterize the polarization modification properties of the sample for a particular incident and exiting beam through the Mueller matrix.

The incident polarization states are specified by Stokes vectors defined relative to an \(\{\hat{x},\hat{y}\}\) coordinate system orthogonal to the propagation direction of the incident light. Similarly, the exiting lights’ Stokes vector is defined relative to an \(\{\hat{x}',\hat{y}'\}\) coordinate system orthogonal to its propagation direction.

For transmission measurements where the beam exits undeviated , the orientations of \(\{\hat{x},\hat{y}\}\) and \(\{\hat{x}',\hat{y}'\}\) will naturally be chosen to be aligned, \((\hat{x}=\hat{x}',\hat{y}=\hat{y}')\).

The global orientation of \(\{\hat{x},\hat{y}\}\) is arbitrary , and the measured Mueller matrix varies systematically if \(\{\hat{x},\hat{y}\}\) and \(\{\hat{x}',\hat{y}'\}\) are rotated together.

When the exiting beam emerges in a different direction from the incident beam, orientations must be specified for both sets of coordinates.

For measurements of reflection from a surface, a logical choice sets \(\{\hat{x},\hat{y}\}\) and \(\{\hat{x}',\hat{y}'\}\) to the \(\{\hat{s},\hat{p}\}\)orientations for the two beams.

Other Mueller matrix measurement configurations may have other obvious arrangements for the coordinates . All choices , however , are arbitrary , and lead to dif ferent Mueller matrices.

Let a Mueller matrix \(\boldsymbol{M}\) be defined relative to a particular \(\{\hat{x},\hat{y}\}\) and \(\{\hat{x}',\hat{y}'\}\).

Let another Mueller matrix \(\boldsymbol{M}(\theta_1,\theta_2)\) for the same measurement conditions have its \(\hat{x}\) axis rotated by \(\theta_1\) and \(x'\) axis rotated by \(\theta_2\) , where \(\theta>0\) indicates a counterclockwise rotation looking into the beam \((\hat{x}\) into \(\hat{y})\). These Mueller matrices are related by the equation

\[\tag{13}\boldsymbol{M}(\theta_1,\theta_2)=\left[\begin{array}&1&0&0&0\\0&\cos 2\theta_2&-\sin2\theta_2&0\\0&\sin2\theta_2&\cos2\theta_2&0\\0&0&0&1\end{array}\right]\left[\begin{array}&m_{00}&m_{01}&m_{02}&m_{03}\\m_{10}&m_{11}&m_{12}&m_{13}\\m_{20}&m_{21}&m_{22}&m_{23}\\m_{30}&m_{31}&m_{32}&m_{33}\end{array}\right]\times\left[\begin{array}&1&0&0&0\\0&\cos2\theta_1&\sin2\theta_1&0\\0&-\sin2\theta_1&\cos2\theta_1&0\\0&0&0&1\end{array}\right]\]

When \(\theta_1=\theta_2\) the coordinates rotate together, the eigenvalues are preserved, the circular polarization properties are preserved, and the linear properties are shifted in orientation.

When \(\theta_1\neq\theta_2\), the matrix properties are qualitatively dif ferent ; the eigenvalues of the matrix change. If the eigenpolarizatons of \(\boldsymbol{M}\) were orthogonal, they may not remain orthogonal.

After we perform data reduction on the matrix, the basic polarization properties couple in a complex fashion. For example, linear diattenuation in \(\boldsymbol{M}\) yields a circular retardance component in \(\boldsymbol{M}\)\((\theta_1,\theta_2)\), and a linear retardance component yields a circular diattenuation component.

The conclusion is that the selection of the coordinate systems for the incident and exiting beams is not important for determining exiting polarization states, but is crucial for identifying polarization characteristics of the sample.

17. ELLIPTICAL AND CIRCULAR POLARIZERS AND ANALYZERS

There are few good and convenient circularly or elliptically polarizing mechanisms, whereas linear polarizers are simple, inexpensive, and of high quality. Therefore, most circular and elliptical polarizers incorporate linear polarizers to perform the polarizing, and retarders to convert polarization states.

For such compound devices, the distinction between a polarizer and an analyzer becomes significant. This is perhaps best illustrated by three examples: (1) a left circular polarizer (which is also a horizontal linear analyzer) constructed from a horizontal linear polarizer \(\text{LP}(0^\circ)\) followed by a quarter-wave linear retarder with the fast axis oriented at \(135^\circ\), \(\text{QWLR}(135^\circ\)) Eq. 14 , (2) a left circular analyzer (which is also a horizontal linear polarizer) constructed from a \(\text{QWLR}(45^\circ\)) followed by an \(\text{LP}(0^\circ)\) Eq. 15, and, (3) a left circular analyzer and polarizer constructed from a \(\text{QWLR}(135^\circ\)), then an \(\text{LP}(0^\circ\)), followed by a \(\text{QWLR}(45^\circ\)) Eq. 16. The Mueller matrix equations and exiting polarization states for arbitrary incident states are as follows:

\[\tag{14}\text{QWLR}(135^\circ)\text{LP}(0^\circ)S=\frac{1}{2}\left[\begin{array}&1&1&0&0\\0&0&0&0\\0&0&0&0\\-1&-1&0&0\end{array}\right]\left[\begin{array}&S_0\\S_1\\S_2\\S_3\end{array}\right]=\frac{1}{2}\left[\begin{align}S_0&+S_1\\&0\\&0\\-S_0&-S_1\end{align}\right]\]

\[\tag{15}\text{LP}(0^\circ)\text{QWLR}(45^\circ)S=\frac{1}{2}\left[\begin{array}&1&0&0&-1\\1&0&0&-1\\0&0&0&0\\0&0&0&0\end{array}\right]\left[\begin{array}&S_0\\S_1\\S_2\\S_3\end{array}\right]=\frac{1}{2}\left[\begin{align}S_0&-S_3\\S_0&-S_3\\&0\\&0\end{align}\right]\]

\[\tag{16}\text{QWLR}(135^\circ)\text{LP}(0^\circ)\text{QWLR}(45^\circ)S=\frac{1}{2}\left[\begin{array}&1&0&0&-1\\0&0&0&0\\0&0&0&0\\-1&0&0&1\end{array}\right]\left[\begin{array}&S_0\\S_1\\S_2\\S_3\end{array}\right]=\frac{1}{2}\left[\begin{align}S_0&-S_3\\0\\0\\-S_0&+S_3\end{align}\right]\]

The device in Eq. (14) transmits only left circularly polarized light, because the zeroth and third elements have equal magnitude and opposite sign, making it a left circular polarizer.

However, the transmitted flux \((S_0+S_1)/2\) is the flux of horizontal linearly polarized light in the incident beam, making it a horizontal linear analyzer. Similarly, the transmitted flux from the example in Eq. (15), \((S_0-S_3)/2\), is the flux of left circularly polarized light in the incident beam, making this combination a left circular analyzer.

The final polarizer makes the device in Eq. (15) a horizontal linear polarizer, although this is not the standard Mueller matrix for horizontal linear polarizers found in tables. Thus an analyzer for a state does not necessarily transmit the state; its transmitted flux is proportional to the amount of the analyzed state in the incident beam . Examples in Eqs. (14) and (15) are referred to as inhomogeneous polarization elements because the eigenpolarizations are not orthogonal, and the characteristics of the device are different for propagation in opposite directions.

The device in Eq. (16) is both a left circular polarizer and a left circular analyzer; it has the same characteristics for propagation in opposite directions, and is referred to as a homogeneous left circular polarizer.

18. LIGHT - MEASURING POLARIMETERS

This section presents a general formulation of the measurement and data reduction procedure for a polarimeter intended to measure the state of polarization of a light beam. Similar developments are found in Theil (1976), Azzam (1990), and Stenflo (1991). A survey of light-measuring polarimeter configurations is found in Ellipsometry tutorial, ‘‘Ellipsometry’’ (Azzam, 1994).

Stokes vectors and related polarization parameters for a beam are determined by measuring the flux transmitted through a set of polarization analyzers. Each analyzer determines the flux of one polarization component in the incident beam. Since a polarization analyzer does not contain ideal polarization elements, the analyzer must be calibrated, and the calibration data used in the data reduction. This section describes data reduction algorithms for determining Stokes vectors which assume arbitrary analyzers; the algorithms allow for general calibration data to be used.

Each analyzer is used to measure one polarization component of the incident light. The measured values are related to the incident Stokes vector and the analyzers by the polarimetric measurement equation. A set of linear equations, the data reduction equations, is then solved to determine the Stokes parameters for the beam.

Henceforth, the ‘‘polarization analyzer’’ is considered as the polarization elements used for analyzing the polarization state together with any and all optical elements (lenses, mirrors, etc. ), and the detector contained in the polarimeter.

The polarization effects from all elements are included in the measurement and data reduction procedures for the polarimeter. A polarization analyzer is characterized by an analyzer vector containing four elements and is defined in a manner analogous to a Stokes vector.

Let \(P_H\) be flux measurement taken by the detector (the current or voltage generated) when one unit of horizontally polarized light is incident. Similarly \(P_V\), \(P_{45}\), \(P _{135}\), \(P_R\), and \(P_L\) are the detector’s flux measurements for the corresponding incident polarized beams with unit flux. Then the analyzer vector \(A\) is

\[\tag{17}\boldsymbol{A}=\left[\begin{array}&a_0\\a_1\\a_2\\a_3\end{array}\right]=\left[\begin{array}&P_H+P_V\\P_H-P_V\\P_{45}-P_{135}\\P_{R}-P_L\end{array}\right]\]

Note that \(P_H+P_V=P_{45}+P_{135}=P_R+P_L\). The response P of the polarization analyzer to an arbitrary polarization state \(\boldsymbol{S}\) is the dot product

\[\tag{18}\boldsymbol{P}=\boldsymbol{A}\cdot\boldsymbol{S}=a_0s_0+a_1s_1+a_2s_2+a_3s_3\]

A Stokes vector measurement consists of series of measurements taken with a set of polarization analyzers . Let the total number of analyzers be \(Q\), with each analyzer \(A_q\) specified by index \(q=0,1,\cdots,Q-1\). We assume the incident Stokes vector is the same for all polarization analyzers and strive to ensure this in our experimental setup.

The \(qth\) measurement generates an output \(P_q=\boldsymbol{A}_q\cdot\boldsymbol{S}\). \(boldsymbol{A}\) polarimetric measurement matrix \(\boldsymbol{W}\) is defined as a four-by-\(Q\) matrix with the \(qth\) row containing the analyzer vector \(\boldsymbol{A}_q\),

\[\tag{19}\boldsymbol{W}=\left[\begin{array}&a_{0,0}&a_{0,1}&a_{0,2}&a_{0,3}\\a_{1,0}&a_{1,1}&a_{1,2}&a_{1,3}\\\vdots\\a_{Q-1,0}&a_{Q-1,1}&a_{Q-1,2}&a_{A-1,3}\end{array}\right]\]

The \(\boldsymbol{Q}\) measured flux values are arranged in a measurement vector \(\boldsymbol{P}=[P_0,P_1\cdots,P_{Q-1}]^T\cdot\boldsymbol{P}\) is related to \(\boldsymbol{S}\) by the polarimetric measurement equation

\[\tag{20}\boldsymbol{P}=\left[\begin{array}&P_0\\P_1\\\vdots\\P_{Q-1}\end{array}\right]=\boldsymbol{WS}=\left[\begin{array}&a_{0,0}&a_{0,1}&a_{0,2}&a_{0,3}\\a_{1,0}&a_{1,1}&a_{1,2}&a_{1,3}\\\vdots\\a_{Q-1,0}&a_{Q-1,1}&a_{Q-1,2}&a_{Q-1,3}\end{array}\right]\left[\begin{array}S_0\\S_1\\S_2\\S_3\end{array}\right]\]

If \(\boldsymbol{W}\) is accurately known, then this equation can be inverted to solve for the incident Stokes vector. During calibration of the polarimeter \(\boldsymbol{W}\), the principal objective is the determination of the matrix \(\boldsymbol{W}\) or equivalent information regarding the states which the polarimeter analyzes at each of its analyzer settings.

However, systematic errors, differences between the calibrated and actual \(\boldsymbol{W}\), will always be present.

To calculate the incident Stokes vector from the data, the inverse of \(\boldsymbol{W}\) is determined and applied to the measured data. The measured value for the incident Stokes vector is designated \(\boldsymbol{S}_m\) to distinguish it from the actual \(\boldsymbol{S}\). In principle, \(\boldsymbol{S}_m\) is related to the data by the polarimetric data reduction matrix \(\boldsymbol{W}^{-1}\),

\[\tag{21}\boldsymbol{S}_m=\boldsymbol{W}^{-1}\boldsymbol{P}\]

Three considerations in the solution of this equation are the existence, rank, and uniqueness of the matrix inverse \(\boldsymbol{W}^{-1}\).

The simplest case occurs when four measurements are performed. If \(\boldsymbol{Q}=4\) linearly independent measurements are made, \(\boldsymbol{W}\) is of rank four, and the polarimetric measurement matrix \(\boldsymbol{W}\) is nonsingular. Then \(\boldsymbol{W}^{-1}\) exists and is unique. Data reduction is performed by Eq. 20 and the polarimeter measures all four elements of the incident Stokes vector.

The second case occurs when \(Q>4\) With more than four measurements, \(\boldsymbol{W}\) is not square, \(\boldsymbol{W}^{-1}\) is not unique, and \(\boldsymbol{S}_m\) is overdetermined by the measurements.

In the absence of noise in the measurements , the different \(\boldsymbol{W}^{-1}\) would all yield the same value for \(\boldsymbol{S}_m\). Because noise is always present, the optimum \(\boldsymbol{W}^{-1}\) is desired. The least squares estimate for \(\boldsymbol{S}_m\) utilizes the psuedoinverse \(\boldsymbol{W}^{-1}_P\) of \(\boldsymbol{W}\), \(\boldsymbol{W}^{-1}_P=(\boldsymbol{W}^T\boldsymbol{W})^{-1}\boldsymbol{W}^T\). The best estimate of \(\boldsymbol{S}\) in the presence of random noise is

\[\tag{22}\boldsymbol{S}_m=(\boldsymbol{W}^T\boldsymbol{W})^{-1}\boldsymbol{W}^T\boldsymbol{P}\]

The third case occurs when \(\boldsymbol{W}\) is of rank three or less. The optimal matrix inverse is the pseudoinverse. However, only three or less of the Stokes vector elements can be determined from the data. The polarimeter is referred to as ‘‘incomplete.’’ Figure 11 in ‘‘Ellipsometry,’’ tutorial summarizes polarization element configurations for Stokes vector measurements listing the vector elements not determined by the incomplete configurations.

19. SAMPLE - MEASURING POLARIMETERS FOR MEASURING MUELLER MATRIX ELEMENTS

The polarization characteristics of a sample are characterized by its Mueller matrix. This section describes the particulars of measuring Mueller matrix elements. The section following contains a general formulation of Mueller matrix determination. Since the Mueller matrix is a function of wavelength, angle of incidence, and location on the sample, these are assumed fixed. Figure 1 is a block diagram of a sample-measuring polarimeter. The polarization state generator \(\text{(PSG)}\) prepares the polarization states which are incident on a sample. A beam of light exiting the sample is analyzed by the polarization state analyzer \(\text{(PSA)}\) and detected by a detector.

The objective is to determine several elements of a sample Mueller matrix \(\boldsymbol{M}\) through a sequence \(q=0,1,\cdots,Q-1\) of polarimetric measurements.

The polarization generator prepares a set of polarization states with a sequence of Stokes vectors \(S_q\). The Stokes vectors exiting the sample are \(\boldsymbol{MS}_q\). These exiting states are analyzed by the \(qth\) polarization state analyzer \(\boldsymbol{A}_q\), yielding the measured flux \(P_q=\boldsymbol{A}^T_q\boldsymbol{MS}_q\).

Each measured flux is assumed to be a linear function of the sample’s Mueller matrix elements. From a set of polarimetric measurements, we develop a set of linear equations which can be solved for certain of the Mueller matrix elements.

**FIGURE 1.** A sample-measuring polarimeter consists of a source , polarization state generator \(\text{(PSG)}\), the sample, a polarization state analyzer \(\text{(PSA)}\), and the detector. ( After Chenault, 1992.)

For example, consider a measurement performed with horizontal linear polarizers for both the generator and analyzer. The measured flux depends on the Mueller matrix elements \(m_{00}\), \(m_{01}\), \(m_{10}\), and \(m_{11}\) as follows:

\[\tag{23}P=\boldsymbol{A}^T\boldsymbol{MS}=\frac{1}{2}[1\quad 1\quad0\quad0]\left[\begin{array}&m_{00}&m_{01}&m_{02}&m_{03}\\m_{10}&m_{11}&m_{12}&m_{13}\\m_{20}&m_{21}&m_{22}&m_{23}\\m_{30}&m_{31}&m_{32}&m_{33}\end{array}\right]\frac{1}{2}\left[\begin{array}&1\\1\\0\\0\end{array}\right]=\frac{m_{00}+m_{01}+m_{10}+m_{11}}{4}\]

As another example , consider measuring the Mueller matrix elements \(m_{00}\), \(m_{01}\), \(m_{10}\), and \(m_{11}\) using four measurements with ideal horizontal \(\text{(H)}\) and vertical \(\text{(V)}\) linear polarizers for the polarization state generators and analyzers.

The four measurements \(P_0, P_1, P_2,\) and \(P_3\) are taken with (generator/analyzer) settings of \((H/H)\), \((V/H)\), \((H/V)\), and \((V/V)\). The combination of Mueller matrix elements measured for each of the four permutations of these polarizers are as follows:

\[\tag{24}\begin{array}&P_0=(m_{00}+m_{01}+m_{10}+m_{11})/4,&P_1=(m_{00}+m_{01}-m_{10}-m_{11})/4\\P_2=(m_{00}-m_{01}+m_{10}-m_{11})/4,&P_3=(m_{00}-m_{01}-m_{10}+m_{11})/4\end{array}\]

These four equations are solved for the Mueller matrix elements as a function of the measured intensities, yielding

\[\tag{25}\left[\begin{array}&m_{00}\\m_{01}\\m_{10}\\m_{11}\end{array}\right]=\left[\begin{array}&P_0+P_1+P_2+P_3\\P_0+P_1-P_2-P_3\\P_0-P_1+P_2-P_3\\P_0-P_1-P_2+P_3\end{array}\right]\]

Other Mueller matrix elements are determined using other combinations of generator and analyzer states. For example, the four matrix elements at the corners of a rectangle in the Mueller matrix \(\{m_{00},m_{0i},m_{j0},m_{ji}\}\) can be determined from four measurements using a \(\pm i\)-generator and \(\pm j\)-analyzer.

For example, a right and left circularly polarizing generator and \(45^\circ\) and \(135^\circ\) polarizing analyzer will determine \(\{m_{00},m_{02},m_{30},m_{32}\}\).

In practice, the data reduction equations are far more complex than the above examples because many more measurements are involved, and especially because the polarization elements are not ideal.

The next section contains a method to sytematize the calculation of data reduction equations based on calibration data for the generator and analyzer.

20. POLARIMETRIC MEASUREMENT EQUATION AND POLARIMETRIC DATA REDUCTION EQUATION

This section develops equations which relate the measurements in a Mueller matrix polarimeter to the generator and analyzer states. The algorithm can use either ideal or calibrated values for the Stokes vectors of the polarization generator and analyzer. The data reduction equations then have the form of a straightforward matrix-vector multiplication on a data vector.

This method is an extension of the matrix data reduction methods presented under ‘‘Light-Measuring Polarimeters’’ on Stokes vector measurement. This method corrects for systematic errors in the generator and analyzer, provided these are characterized in the calibration. A well-calibrated generator and analyzer are essential for accurate Mueller matrix measurements.

A Mueller matrix polarimeter takes \(\text{Q}\) measurements identified by the index \(q=0,1,\cdots,Q-1\). For the \(qth\) measurement, the generator produces a beam with Stokes vector \(S_q\).

The beam exiting the sample is analyzed by the polarization analyzer with an analyzer vector \(A_q\). The measured flux \(P_q\) is related to the sample Mueller matrix by

\[\tag{26}P_q=\boldsymbol{A}^T_q\boldsymbol{MS}_q=[a_{q,0}\quad a_{q,1}\quad a_{q,2}\quad a_{q,3}]\left[\begin{array}&m_{00}&m_{01}&m_{02}&m_{03}\\m_{10}&m_{11}&m_{12}&m_{13}\\m_{20}&m_{21}&m_{22}&m_{23}\\m_{30}&m_{31}&m_{32}&m_{33}\end{array}\right]\left[\begin{array}&S_{q,0}\\S_{q,1}\\S_{q,2}\\S_{q,3}\end{array}\right]=\sum^3_{j=0}\sum^3_{k=0}a_{q,j}m_{j,k}S_{q,k}\]

This equation is now rewritten as a vector-vector dot product (Azzam, 1978; Goldstein, 1992). First, the Mueller matrix is flattened into a \(16\times1\) Mueller vector \(\overrightarrow{\boldsymbol{M}}=[m_{00}\quad m_{01}\quad m_{02}\quad m_{03}\quad m_{10}\quad\cdots\quad M_{33}]^T\). A \(16\times1\) polarimetric measurement vector \(\boldsymbol{W}_q\) for the \(qth\) measurement is defined as follows

\[\tag{27}\begin{array}\boldsymbol{W}_q&=[w_{q,00}\quad w_{q,01}\quad w_{q,02}\quad w_{q,03}\quad\cdots\quad w_{q,33}]^T\\&=[a_{q,0}S_{q,0}\quad a_{q,0}S_{q,1}\quad a_{q,0}S_{q,2}\quad a_{q,0}S_{q,3}\quad a_{q,1}S_{q,0}\quad \cdots\quad a_{q,3}S_{q,3}]^T\end{array}\]

where \(w_{q,jk}=a_{q,j}S_{q,k}\).The q th measured flux from Eq. (25) is rewritten as the dot product

\[\tag{28}P_q=\boldsymbol{W}_q\cdot\overrightarrow{\boldsymbol{M}}=\left[\begin{array}&a_{q,0}S_{q,0}\\a_{q,0}S_{q,1}\\a_{q,0}S_{q,2}\\a_{q,0}S_{q,3}\\a_{q,1}S_{q,0}\\a_{q,1}S_{q,1}\\\vdots\\a_{q,3}S_{q,3}\end{array}\right]\left[\begin{array}m_{0,0}\\m_{0,1}\\m_{0,2}\\m_{0,3}\\m_{1,0}\\m_{1,1}\\\vdots\\m_{3,3}\end{array}\right]\]

The full sequence of measurements is described by the polarimetric measurement matrix \(\boldsymbol{W}\), defined as the \(Q\times16\) matrix where the \(qth\) row is \(\boldsymbol{W}_q\). The polarimetric measurement equation relates the measurement vector \(\boldsymbol{P}\) to the sample Mueller vector by a matrix-vector multiplication,

\[\tag{29}\boldsymbol{P}=\boldsymbol{W\overrightarrow M}=\left[\begin{array}P_0\\P_1\\\vdots\\P_{Q-1}\end{array}\right]\left[\begin{array}&w_{0,00}&w_{0,01}&\cdots&w_{0,33}\\w_{1,00}&w_{1,01}&\cdots&w_{1,33}\\\vdots\\w_{Q-1,00}&w_{Q-1,01}&\cdots&w_{Q-1,33}\end{array}\right]\left[\begin{array}&m_{00}\\m_{01}\\\vdots\\m_{33}\end{array}\right]\]

If \(\boldsymbol{W}\) contains sixteen linearly independent columns, all sixteen elements of the Mueller matrix can be determined. Then, if \(Q=16\), the matrix inverse is unique and the Mueller matrix elements are determined from the polarimetric data reduction equation \(\overrightarrow{M}=\boldsymbol{W}^{-1}\boldsymbol{P}\).

More often, \(Q>16\), and \(\overrightarrow{M}\) is overdetermined. The optimal (least-squares) polarimetric data reduction equation for \(\overrightarrow{M}\) uses the pseudoinverse \(\boldsymbol{W}^{-1}_{P}\) of \(\boldsymbol{W}\), Eq. (21) where \(\boldsymbol{W}^{-1}_{P}\) is a polarimetric data reduction matrix for the polarimeter. The polarimetric data reduction equation is then

\[\tag{30}\overrightarrow{\boldsymbol{M}}=(\boldsymbol{W}^T\boldsymbol{W})^{-1}\boldsymbol{W}^{T}\boldsymbol{P}=\boldsymbol{W}^{-1}_{P}\boldsymbol{P}\]

where \(\boldsymbol{W}^{-1}_P\) operates on a set of measurements to estimate the Mueller matrix of the sample.

The advantages of this polarimetric measurement equation and polarimetric data reduction equation procedure are as follows. First, this procedure does not assume that the set of states of polarization state generator and analyzer have any particular form.

For example, the polarization elements in the generator and analyzer do not need to be rotated in uniform angular increments, but can comprise an arbitrary sequence. Second, the polarization elements are not assumed to be ideal polarization elements or have any particular imperfections.

If the Stokes vectors associated with the polarization generator and analyzer are determined through a calibration procedure, the effects of nonideal polarization elements are corrected in the data reduction. Third, the procedure readily treats overdetermined measurement sequences (more than sixteen measurements for the full Mueller matrix), providing a least-squares solution.

Finally, a matrix-vector form of data reduction is readily implemented and understood.

Following we will describe configurations of sample-measuring polarimeter with example data reduction matrices.

21. DUAL ROTATING RETARDER POLARIMETER

The dual rotating retarder Mueller matrix polarimeter is one of the most common Mueller polarimeters. Figure 2 shows the configuration: light from the source passes first through a fixed linear polarizer, then through a rotating linear retarder, the sample, a rotating linear retarder, and finally through a fixed linear polarizer.

In the most common configuration, first described by Azzam (1978), the polarizers are parallel, and the retarders are rotated in angular increments of five-to-one. This five-to-one ratio encodes all 16 Mueller matrix elements onto the amplitudes and phases of 12 frequencies in the detected signal. The detected signal is Fourier analyzed, and the Mueller matrix elements are calculated from the Fourier coefficients.

This polarimeter design has an important advantage: the polarizers do not move. The polarizer in the generator accepts only one polarization state from the source optics, making the measurement immune to instrumental polarization from the source optics.

If the polarizer did rotate, and if the beam incident on it were elliptically polarized, a systematic modulation of intensity would be introduced which would require compensation.

Similarly, the polarizer in the analyzer does not rotate; only one polarization state is transmitted through the analyzing optics and onto the detector. Any diattenuation in the analyzing optics and any polarization sensitivity in the detector will not affect the measurements.

The data reduction matrix is presented here for a polarimeter with ideal linear retarders with arbitrary retardances \(\delta_1\) in the generator and \(\delta_2\) in the analyzer. Optimal values for the retardances are near \(\lambda/4\) or \(\lambda/3\), depending which characteristics of the

**FIGURE 2.** The dual rotating retarder polarimeter consists of a source, a fixed linear polarizer, a retarder which rotates in steps, the sample, a second retarder which rotates in steps, a fixed linear polarizer, and the detector. This polarimeter measures the full Mueller matrix. It accepts only one polarization state from the source, and transmits only one polarization state to the detector. ( After Chenault, 1992. )

Mueller matrix are chosen for a figure of merit. If \(\delta_1=\delta_2=\pi\) rad, the last row and column of the sample Mueller matrix are not measured. \(\text{Q}\) measurements are taken, described by index \(q=0,1,\cdots,Q-1\).

The angular orientations of the two retarders for measurement \(q\) are \(\theta_{q,1}=q180^\circ/Q\), and \(\theta_{q,2}=5q180^\circ/Q\). The angular increment between settings of the generator retarder is \(\Delta\theta=180^\circ/Q\).

The data reduction matrix is a \(16\times Q\) matrix which multiplies a \(Q\times1\) data vector \(\boldsymbol{P}\), yielding the sample Mueller vector \(\overrightarrow{\boldsymbol{M}}\).

Table 2 lists the equations for the elements in each row \(q\) of the data reduction matrix assuming ideal polarization elements (Chenault, Pezzaniti, and Chipman, 1992).

Several data reduction methods have been published to account for additional imperfections in the polarization elements, leading to considerably more elaborate expressions than those presented here.

Hauge (1978) developed an algorithm to compensate for the linear diattenuation and linear retardance of the retarders. Goldstein and Chipman (1990) treat five errors, the retardances of the two retarders, and orientation errors of the two retarders and one of the polarizers, in a small angle approximation good for small errors. Chenault, Pezzaniti, and Chipman (1992) extended this method to larger errors.

22. INCOMPLETE SAMPLE - MEASURING POLARIMETERS

Incomplete sample-measuring polarimeters do not measure the full Mueller matrix of a sample and thus provide incomplete information regarding the polarization properties of a sample. Often the full Mueller matrix is not needed.

For example, many birefringent samples have considerable linear birefringence and minuscule amounts of the other forms of polarization. The magnitude of the birefringence can be measured, assuming all the other polarization effects are small, using much simpler configurations than a Mueller matrix polarimeter, such as the circular polariscope (Theocaris and Gdoutos, 1979).

Similarly , homogeneous and isotropic interfaces, such as dielectrics, metals, and thin films, should only display linear diattenuation and linear retardance aligned with the \(s-p\) planes.

These interfaces do not need characterization of their circular diattenuation and circular retardance. Many categories of ellipsometer will characterize such samples without providing the full Mueller matrix (Azzam and Bashara, 1977, 1987; Azzam, 1993).

23. DUAL ROTATING POLARIZER POLARIMETER

This section describes the dual rotating polarizer polarimeter, a common polarimetric configuration capable of measuring nine Mueller matrix elements (Collins and Kim, 1990). Figure 3 shows the arrangement of polarization elements in the polarimeter. Light from the source passes through a linear polarizer whose orientation \(\theta_1\) is adjustable. This

**FIGURE 3.** The dual rotating polarizer polarimeter consists of a source , a linear polarizer which rotated in steps, the sample, a second linear polarizer with a stepped angular orientation, and the detector. ( After Chenault, 1992. )

TABLE 2. Elements of the Polarimetric Data Reduction Matrix for the Dual Rotating Retarder Polarimeter

linearly polarized light interacts with the sample and is analyzed by a second linear polarizer whose orientation \(\theta_2\) is also adjustable. This polarimeter is incomplete because measurement of the last column of the Mueller matrix requires elliptical states from the polarization generator.

Similarly, elliptical analyzers are required in the polarization analyzer to measure the bottom row of the Mueller matrix.

The polarimetric data reduction matrix which follows is for a particular 16-measurement sequence.

The most common defects of polarizers have been taken into consideration : less than ideal diattenuation, and transmission of less than unity. The polarizers are characterized by \(\text{T}_\text{max}\), the maximum intensity transmittance for a single polarizer, and \(\text{T}_\text{min}\), the minimum intensity transmittance, which are associated with orthogonal linear states.

Let \(a=(16(\text{T}_\text{max}+\text{T}_\text{min})^2)^{-1}\), \(b=(8(\text{T}^2_\text{max}-\text{T}^2_\text{min}))^{-1}\) and \(c=(4(\text{T}_\text{max}-\text{T}_\text{min})^2)^{-1}\).

Sixteen measurements are acquired with the generator polarizer angle \(\theta_{q,1}\) and the analyzer polarizer angle \(\theta_{q,2}\) oriented as follows: \(\theta_{q,1}=(0^\circ,0^\circ,0^\circ,0^\circ,45^\circ,45^\circ,45^\circ,45^\circ,90^\circ,90^\circ,90^\circ,90^\circ,135^\circ,135^\circ,135^\circ,135^\circ)\) \(\theta_{q,2}=(0^\circ,45^\circ,90^\circ,135^\circ,0^\circ,45^\circ,90^\circ,135^\circ,0^\circ,45^\circ,90^\circ,135^\circ,0^\circ,45^\circ,90^\circ,135^\circ)\). Since only nine Mueller matrix elements are measured, a nine-element Mueller vector is used:

\[\tag{31}\overrightarrow{M}=[m_{00}\quad m_{01}\quad m_{02}\quad m_{10}\quad m_{11}\quad m_{12}\quad m_{20}\quad m_{21}\quad m_{22}]^T\]

The data reduction matrix \(\boldsymbol{W}^{-1}_{P}\) which operates on the 16-element measurement vector \(\boldsymbol{P}\) yielding \(\overrightarrow{\boldsymbol{M}}\) is

\[\tag{32}\boldsymbol{W}^{-1}_P=\left[\begin{array}&a&a&a&a&a&a&a&a&a&a&a&a&a&a&a&a\\b&b&b&b&0&0&0&0&-b&-b&-b&-b&0&0&0&0\\0&0&0&0&b&b&b&b&0&0&0&0&-b&-b&-b&-b\\b&0&-b&0&b&0&-b&0&b&0&-b&0&b&0&-b&0\\c&0&-c&0&0&0&0&0&-c&0&c&0&0&0&0&0\\0&0&0&0&c&0&-c&0&0&0&0&0&-c&0&c&0\\0&b&0&-b&0&b&0&-b&0&b&0&-b&0&b&0&-b\\0&c&0&-c&0&0&0&0&0&-c&0&c&0&0&0&0\\0&0&0&0&0&c&0&-c&0&0&0&0&0&-c&0&c\end{array}\right]\]

The source is assumed to be unpolarized in this equation. Similarly, the detector is assumed to be polarization-insensitive. When this is not the case, the data reduction matrix is readily generalized to incorporate these and other systematic effects following the method shown under ‘‘Polarimetric Measurement Equation and Polarimetric Data Reduction Equation.’’

24. NONIDEAL POLARIZATION ELEMENTS

For use in polarimetry, polarization elements require a level of characterization beyond what is normally provided by vendors. For retarder, usually only the linear retardance is specified. For polarizers, usually only the two principal transmittances or the extinction ratio is given. For polarization elements used in critical applications such as polarimetry, this level of characterization is inadequate.

In this section, defects of polarization elements are described, and the Mueller calculus is recommended as the most appropriate measure of performance.

25. POLARIZATION PROPERTIES OF POLARIZATION ELEMENTS

For ideal polarization elements , the polarization properties are readily defined. For real polarization elements, the precise description of the polarization properties is more complex. This tutorial ‘‘Polarizers’’ contains an extensive description of the various forms of polarizers and retarders and their characteristics (Bennett 1993).

Polarization elements such as polarizers, retarders, and depolarizers have three general polarization properties: diattenuation, retardance, and depolarization, and a typical element displays some amount of all three. Diattenuation arises when the intensity transmittance of an element is a function of the incident polarization state (Chipman, 1989a). The diattenuation \(\boldsymbol{D}\) of a device is defined in terms of the maximum \(\text{T}_{\text{max}}\) and minimum \(\text{T}_{\text{min}}\) intensity transmittances,

\[\tag{33}\boldsymbol{D}=\frac{T_{\text{max}}-T_{\text{min}}}{\text{T}_\text{max}+\text{T}_\text{min}}\]

for an ideal polarizer, \(\boldsymbol{D}=1\). When \(\boldsymbol{D}=0\), all incident polarization states are transmitted with equal loss, although the polarization states in general change upon transmission.

The quality of a polarizer is often expressed in terms of the related quantity, the extinction ratio \(\text{E}\),

\[\tag{34}\text{E}=\frac{\text{T}_\text{max}}{\text{T}_\text{min}}=\frac{1+\text{D}}{1-\text{D}}\]

Retardance is the phase change a device introduces between its eigenpolarizations (eigenstates). For a birefringent retarder with refractive indices \(n_1\) and \(n_2\), and thickness \(t\), the retardance \(\delta\) expressed in radians is

\[\tag{35}\delta=\frac{2\pi(n_1-n_2)t}{\lambda}\]

Depolarization describes the coupling by a device of incident polarized light into depolarized light in the exiting beam. For example, depolarization occurs when light transmits through milk or scatters from clouds. Multimode optical fibers generally depolarize the light.

Depolarization is intrinsically associated with scattering and a loss of coherence in the polarization state. A small amount of depolarization is probably associated with the scattered light from all optical components.

A depolarization coefficient \(e\) can be defined as the fraction of unpolarized power in the exiting beam when polarized light is incident. \(e\) is generally a function of the incident polarization state.

26. COMMON DEFECTS OF POLARIZATION ELEMENT

Here we list some common defects found in real polarization elements.

1. Polarizers have nonideal diattenuation since \(\text{T}_\text{max}<0\), and \(\text{T}_\text{min}>0\). (Bennett, 1993; King and Talim, 1971).

2. Retarders have the incorrect retardance. Thus, there will be some deviation from a quarter-wave or a half-wave of retardance, for example, because of fabrication errors or a change in wavelength.

3. Retarders usually have some diattenuation because of differences in absorption coefficients (dichroism) and due to dif ferent transmission and reflection coefficients at the interfaces. For example, birefringent retarders have diattenuation due to the difference of the Fresnel coefficients at normal incidence for the two eigenpolarizations since \(n_1\neq n_2\). This can be reduced by antireflection coatings.

4. Polarizers usually have some retardance; there is a dif ference in optical path length between the transmitted (principal) eigenpolarization and the small amount of the extinguished (secondary) eigenpolarization. For example, sheet polarizers and wire-grid polarizers show substantial retardance when the secondary state is not completely extinguished.

5. The polarization properties vary with angle of incidence ; for example, Glan- Thompson polarizers polarize over only a \(4^\circ\) field of view (Bennett, 1994). Birefringent retarders commonly show a quadratic variation of retardance with angle of incidence which increases along one axis and decreases along the orthogonal axis (Title, 1979; Hale and Day, 1988). For polarizing beam-splitter cubes, the axis of linear polarization rotates for incident light out of its normal plane (the plane defined by the face normals and the beam-splitting interface normal).

6. The polarization properties vary with wavelength; for example, for simple retarders made from a single birefringent plate, the retardance varies approximately linearly with wavelength.

7. For polarizers, the accepted state and the transmitted state can be different. Consider a polarizing device formed from a linear polarizer oriented at \(0^\circ\) followed by a linear polarizer oriented at \(2^\circ\). Incident light linearly polarized at \(0^\circ\) has the highest transmittance for all possible polarization states and is the accepted state. The corresponding exiting beam is linearly polarized at \(2^\circ\), which is the only state exiting the device.

In this example, the transmitted state is also an eigenpolarization. This ‘‘rotation’’ between the accepted and transmitted states of a polarizer frequently occurs, for example, when the crystal axes are misaligned in a birefringent polarizing prism assembly such as a Glan-Thompson polarizer.

8. A nominally ‘‘linear’’ element may be slightly elliptical (have elliptical eigenpolarizations). For example, a quartz linear retarder with the crystal axis misaligned becomes an elliptical retarder. Similarly a circular element may be slightly elliptical. For example, an (inhomogeneous) circular polarizer formed from a linear polarizer followed by a quarter-wave linear retarder at \(45^\circ\) [see Eq. (14)] becomes an elliptical polarizer as the retarder’s fast axis is rotated.

9. The eigenpolarizations of the polarization element may not be orthogonal; i.e., a polarizer may transmit linearly polarized light at \(0^\circ\) without change of polarization while extinguishing linearly polarized light oriented at \(88^\circ\).

Such a polarization element is referred to as inhomogeneous (Shurcliff, 1962; Lu and Chipman, 1992). Sequences of polarization elements, such as optical isolator assemblies, often are inhomogeneous. The circular polarizer in Eq. 14 is inhomogeneous.

10. A polarization element may depolarize , coupling polarized light into unpolarized light. A polarizer or retarder with a small amount of depolarization, when illuminated by a completely polarized beam, will have a small amount of unpolarized light in the transmitted beam.

Such a transmitted beam can no longer be extinguished by an ideal polarizer. Depolarization results from fabrication errors such as surface roughness, bulk scattering, random strains and dislocations, and thin-film microstructure.

11. Multiply reflected beams and other ‘‘secondary’’ beams may be present with undesired polarization properties. For example, the multiply reflected beams from a birefringent plate have various values for their retardance. Antireflection coatings will reduce this effect in one waveband, but may increase these problems with multiple reflections in other wavebands.

The preceding list of polarization element defects is by no means comprehensive. It should serve as a warning to those with demanding applications for polarization elements. In particular, the performance of polarizing beam-splitting cubes have been found to be quite different from the ideal (Pezzaniti and Chipman, 1991).

27. THE MUELLER MATRIX FOR POLARIZATION COMPONENT CHARACTERIZATION

The Mueller matrix provides the full characterization of a polarization element (Shurcliff, 1962; Azzam and Bashara, 1977). From the Mueller matrix, all of the performance defects listed previously and more are specified.

Thus, when one is using polarization elements in critical applications such as polarimetry, it is highly desirable that the Mueller matrix of the elements be known. This is analogous to having the interferogram of a lens to ensure that it is of suitable quality for incorporation into a critical imaging system.

The optics community has been very slow to adopt Mueller matrices for the testing of optical components and optical systems, delaying a broad understanding of how real polarization elements actually perform.

An impediment to the widespread acceptance of Mueller matrices for polarization element qualification has been that the polarization properties associated with a Mueller matrix (the diattenuation, retardance, and depolarization) are not easily ‘‘extracted’’ from the Mueller matrix.

Thus, while the operational definition of the Mueller matrix, Eq. (10), is straightforward, determining the diattenuation, retardance, and depolarization from an experimentally determined Mueller matrix is a complex process (Gil and Bernabeau, 1987). This is described later in this chapter.

The following matrix element pairs indicate the presence of the various forms of diattenuation and retardance:

\[\tag{36}\left[\begin{array}&0&a&b&c\\a&0&-d&-e\\b&d&0&-f\\c&e&f&0\end{array}\right]\]

Each pair of elements is related to the following properties:

a linear diattenuation oriented at \(0^\circ\) or \(90^\circ\)

b linear diattenuation oriented at \(45^\circ\) or \(135^\circ\)

c circular diattenuation

d linear retardance oriented at \(0^\circ\) or \(90^\circ\)

e linear retardance oriented at \(45^\circ\) or \(135^\circ\)

f circular retardance

For small amounts of these properties, the Mueller matrix elements indicated are linear in the diattenuation or retardance. Other degrees of freedom in the Mueller matrix, antisymmetry in \(a,b,\) or \(c\) or symmetry in \(d,e,\) or \(f\), indicate the presence of depolarization and inhomogeneity.

28. APPLICATIONS OF POLARIMETRY

Polarimetry has found application in nearly all areas of science and technology with several tens of thousands of papers detailing various applications. The following summarizes a few of the principal applications and introduces some of the books, reference works, and review papers which provide gateways to the various applications.

Ellipsometry

Ellipsometry is the application of polarimetry for determining the optical properties of surfaces and interfaces. Example applications are refractive indices and thin-film thickness determination, and investigations of processes at surfaces such as contamination and corrosion. In this tutorial ‘Ellipsometry,’’ by Azzam treats the fundamentals.

A more extensive treatment is found in the textbook by Azzam and Bashara (1979 and 1986) which presents the mathematical fundamentals of polarization, determination of the properties of thin films, polarimetric instrumentation , and a myriad of applications.

Azzam (1991) is a recent collection of historical papers. Calculation of the polarization properties of thin films is given a detailed presentation by Dobrowolski (1994) in future tutorial, and also in the text by Macleod (1986).

Spectropolarimetry for Chemical Applications

Spectropolarimeters are spectrometers which incorporate polarimeters for the purpose of measuring polarization properties as a function of wavelength. Whereas spectrometers measure transmission or reflectance as a function of wavelength, a spectropolarimeter also may measure dichroism (diattenuation), linear birefringence (linear retardance), optical activity (circular retardance), or depolarization, all as spectra.

In physical chemistry, spectra of the linear dichroism and the linear retardance of a molecule permit the determination of the orientation of the electric dipole moment in three dimensions.

Similarly, circular dichroism and optical activity provide information on the orbital magnetic moment. Schellman and Jensen (1987) and Johnson (1987) provide comprehensive surveys of the spectropolarimetry of oriented molecules and interpretation of the data in terms of molecular structure.

The volumes by Michl and Thulstrup (1986), Samori and Thulstrup (eds. ) (1988) and by Kliger, Lewis, and Randall (1990) cover the basics of polarimetry with an emphasis on spectroscopy with polarized light and interpretation of the resulting data. Texts and reviews on optical activity include the following: Jirgensons (1973), Mason (ed. ) (1978), Mason (1982), Thulstrup (1982), Barron (1986), and Laktakia (1990). Chenault (1992) contains a survey of spectropolarimetric instrumentation.

Remote Sensing

Polarimetry has become an important technique in remote sensing, since it augments the limited information available from spectrometric techniques. Polarization in the scattered light from the earth has many subtle characteristics.

The sunlight which illuminates the earth is essentially unpolarized, but the scattered light has a surprisingly large degree of polarization, which is mostly linear polarization (Egan, 1985; Konnen, 1985; Coulson, 1988; Coulson, 1989; Egan, 1992).

Visible light scattered from forest canopy, cropland, meadows, and similar features frequently has a degree of polarization of 20 percent or greater in the visible (Curran, 1982; Duggin, 1989). Light reflecting from mudflats and water often has a degree of polarization of 50 percent or higher, particularly for light incident near Brewster’s angle.

Light scattered from clouds is nearly unpolarized (Konnen, 1985; Coulson, 1988). The magnitude of the degree of linear polarization depends on many variables, including the angle of incidence, the angle of scatter , the wavelength, and the weather.

The polarization from a site varies from day to day even if the angles of incidence and scatter remain the same; these variations are caused just by changes in the earth’s vegetation, cloud cover, humidity, rain, and standing water. Polarization is complex to interpret but it conveys a great deal of useful information.

Astronomical Polarimetry

The polarization of light from astronomical bodies conveys considerable information regarding their physical state—information that generally cannot be acquired by any other means. Gehrels (1974) compiles information regarding the polarization of plants, stars, and other astronomical objects.

Polarimetry is the principle technique for determining solar magnetic fields. Solar vector magnetographs are imaging polarimeters combined with narrowband tunable spectral filters which measure Zeeman splitting in magnetically active ions in the solar atmosphere, from which the magnetic fields can be determined. November (1991) is a recent survey of instrumentation and ongoing measurement programs for solar magnetic field study.

Polarization Light Scattering

Polarization light scattering is the application of polarimetry to scattered light (Van de Hulst, 1957; Stover, 1990). The scattering characteristics of a sample are generally described by its bidirectional reflectance distribution function, \(\text{BRDF}\) \((a,\beta,\gamma,\delta,\lambda)\) which is the ratio of the scattered flux in a particular direction \((\gamma,\delta)\) to the flux of an incident beam from direction \((a,\beta)\).

The \(\text{BRDF}\) function contains no polarization information, but is the \(\text{m}_{00}\) element of the Mueller matrix relating the incident and scattered beams. The \(\text{BRDF}\) can be generalized to a Mueller bidirectional reflectance distribution function, or \(\text{MBRDF}\) \((a,\beta,\gamma,\delta,\lambda)\), which is the Mueller matrix relating arbitrary incident and scattered beams.

Scattered light is often a sensitive indicator to surface conditions; a small amount of surface roughness may reduce the specular power by less than a percent while increasing the scattered power by orders of magnitude.

The retardance, diattenuation, and depolarization of the scattered light similarly provide sensitive indicators of light-scattering conditions, such as uniformity of refractive index, orientation of surface defects, texture, strain and birefringence at an interface, subsurface damage, coating microstructure, and the degree of multiple scattering.

Optical and Polarization Metrology

Polarimetry is useful in optical metrology for measuring the instrumental polarization of optical systems and for characterizing optical and polarization components. Optical systems, both common and exotic, modify the polarization state of light due to the reflections, refractions, and other interactions with optical materials.

Each ray path through the optical system can be characterized by its polarization matrix (Chipman, 1989a; Chipman, 1989b). Polarization ray tracing is the technique of calculating the polarization matrices for ray paths from the optical and coating prescriptions (Waluschka, 1989; Bruegge, 1989; Wolf f and Kurlander, 1990). Diffraction image formation of polarization-aberrated beams is then handled by vector extensions to diffraction theory (Kuboda and Inoue, 1959; Urbanczyk, 1984; Urbanczyk, 1986; McGuire and Chipman, 1990; McGuire and Chipman, 199; Mansuripur, 1991). Polarimeters, particularly imaging polarimeters, are used to measure the Mueller matrices of ray paths through optical systems determining the polarization aberrations.

These polarization aberrations frequently have the same functional forms as the geometrical aberrations, since they arise from similar geometrical considerations (Chipman, 1987; McGuire and Chipman, 1987, 1989, 1990a, 1990b, 1991; Hansen, 1988; Chipman and Chipman, 1989). Several conferences have surveyed these areas (Chipman, 1988; Chipman, 1989c; Goldstein and Chipman, 1992).

Radar Polarimetry

Polarimetric measurements are a standard and highly evolved technique in radar with broad application (Poelman and Guy, 1985; Holm, 1987; van Zyl and Zebker, 1990). Although radar is outside the scope of this handbook, several references to the radar literature are included since the optical community can greatly benefit from advances in radar polarimetry.

The text by Mott (1992) develops the polarization properties of antennas and the techniques of radar polarimetry. Fundamental analyses of the Mueller matrix have been performed by Huynen (1965) and Kennaugh (1951), both of which have found broad application in the interpretation of radar polarization signatures.

Morris and Chipman (eds.) (1990) and Boerner and Mott (eds.) (1992) are proceedings from meetings specifically intended to provide an exchange between the optical polarimetry and radar polarimetry communities.

29. INTERPRETATION OF MUELLER MATRICES

The Mueller matrix is defined as a matrix which transforms incident Stokes vectors into exiting Stokes vectors with each element seen as a coupling between corresponding Stokes vector elements.

Despite this simple and elegant definition, the polarization properties associated with the Mueller matrix—the diattenuation, retardance, and depolarization— are not readily apparent from the matrix for two reasons.

First, the Stokes vector has an unusual coordinate system in which the different elements do not represent orthogonal polarization components. Instead, positive and negative values on each component separately represent orthogonal polarization components.

Second, the phenomenon of depolarization greatly complicates the matrix properties. It is not possible to analyze real arbitrary Mueller matrices measured by polarimeters without considering three tricky topics: physical realizability, depolarization, and inhomogeneity.

30. DIATTENUATION AND POLARIZATION SENSITIVITY

The intensity transmittance \(\text{T}\) for a given matrix \(\boldsymbol{M}\) and incident polarization state \(\boldsymbol{S}\) is defined as the ratio of exiting flux \(s'_0\) to incident flux \(s_0\),

\[\tag{37}\text{T}(\boldsymbol{\text{MS}})=\frac{s'_0}{s_0}=\frac{m_{00}s_0+m_{01}s_1++m_{02}s_2+m_{03}s_3}{s_0}\]

The intensity transmittance averaged over all incident polarization states is \(\text{T}_\text{avg}=m_{00}\). The maximum \(\text{T}_\text{max}\) and minimum \(\text{T}_\text{min}\) intensity transmittances are

\[\tag{38}\text{T}_\text{max}=m_{00}+\sqrt{m^2_{01}+m^2_{02}+m^2_{03},}\qquad\text{T}_\text{min}=m_{00}-\sqrt{m^2_{01}+m^2_{02}+m^2_{03}}\]

and are associated with the unnormalized incident states

\[\tag{39}S_\text{max}=\left[\begin{array}&\sqrt{m^2_{01}+m^2_{03}+m^2_{03}}\\m_{01}\\m_{02}\\m_{03}\end{array}\right]\qquad S_\text{min}=\left[\begin{array}&\sqrt{m^2_{01}+m^2_{03}+m^2_{03}}\\-m_{01}\\-m_{02}\\-m_{03}\end{array}\right]\]

The incident Stokes vectors of maximum \(\boldsymbol{S}_\text{max}\) and minimum \(\boldsymbol{S}_\text{min}\) intensity transmittance are always orthogonal. The term diattenuation refers to the two attenuations associated with these two orthogonal states. The diattenuation \(\boldsymbol{D(M)}\) of a Mueller matrix is a measure of the variation of intensity transmittance with incident polarization state,

\[\tag{40}\boldsymbol{D(M)}=\frac{\text{T}_\text{max}-\text{T}_\text{min}}{\text{T}_\text{max}+\text{T}_\text{min}}=\frac{\sqrt{m^2_{01}+m^2_{02}+m^2_{03}}}{m_{00}}\]

When \(\boldsymbol{D}=1\), the device is an ideal analyzer; it completely blocks one polarization component of the incident light, and only one Stokes vector exits the device. If this device is also nondepolarizing, then \(\boldsymbol{M}\) represents a polarizer.

When \(\boldsymbol{D}=0\), all incident states have the same intensity transmittance: the device may be nonpolarizing, depolarizing, or a pure retarder. Diattenuation is also referred to as polarization sensitivity. Linear polarization sensitivity or linear diattenuation \(\boldsymbol{LD(M)}\) characterizes the variation of intensity transmittance with incident linear polarization states:

\[\tag{41}\boldsymbol{LD}\text(M)=\frac{m^2_{01}+m^2_{02}}{m_{00}}\]

Linear polarization sensitivity is frequently specified as a performance parameter in remote sensing systems designed to measure incident power independently of any linearly polarized component present in scattered earth-light (Maymon and Chipman,1991). Note that \(\boldsymbol{LD}(M)=1\) specifies that \(\boldsymbol{M}\) is a linear analyzer; \(\boldsymbol{M}\) is not necessarily a linear polarizer, but may represent a linear polarizer followed by some other polarization element. Diattenuation in (fiber optical) components and systems is often characterized by the polarization dependent loss, given in decibels:

\[\tag{42}\boldsymbol{PDL}(M)=10\;\text{Log}_{10}\frac{\text{T}_\text{max}}{\text{T}_\text{min}}\]

31. POLARIZANCE

The polarizance \(\text{P}\boldsymbol{(M)}\) is the degree of polarization of the transmitted light when unpolarized light is incident (Bird and Shurcliff, 1959; Shurcliff, 1962).

\[\tag{43}\text{P}\boldsymbol{(M)}=\frac{\sqrt{m^2_{10}+m^2_{20}+m^2_{30}}}{m_{00}}\]

The Stokes vector of the exiting light \(\boldsymbol{S}_\text{P}\) is specified by the first column of \(\boldsymbol{M}\),

\[\tag{44}\boldsymbol{S}_\text{p}\boldsymbol{(M)}=[m_{00}\quad m_{10}\quad m_{20}\quad m_{30}]^T\]

and is not generally equal to \(\boldsymbol{S}_\text{max}\) when inhomogeneity or depolarization is present.

32. PHYSICALLY REALIZABLE MUELLER MATRICES

Mueller matrices form a subset of the four-by-four real matrices. A four-by-four real matrix is not a physically realizable Mueller matrix if it can operate on an incident Stokes vector to produce a vector with degree of polarization greater than one (\(s^2_0<s^2_1+s^2_2+s^2_3)\), which represents a physically unrealizable polarization state.

Similarly, a Mueller matrix cannot output a state with negative flux.

Conditions for physical realizability have been studied extensively in the literature, and many necessary conditions have been published (Hovenier, van de Hulst, and van der Mee, 1986; Barakat, 1987; Cloude, 1989; Girgel, 1991; Xing, 1992; Kumar and Simon, 1992; van der Mee and Hovenier, 1992; Kostinski, Givens, and Kwiatkowski). A set of sufficient conditions for physical realizability is not known to this author.

The following four necessary conditions for physical realizability are among the more general of those published:

1. \(\text{Tr}(\boldsymbol{MM}^T)\leq 4m^2_{00}\)

2. \(m_{00}\geq|m_{ij}|\)

3. \(m^2_{00}\geq b^2\)

4. \((m_{00}-b)^2\geq\displaystyle\sum^3_{j=1}(m_{0,j}-\sum^3_{k=1}m_{j,k}a_k)\)

where \(b=\sqrt{m^2_{01}+m^2_{02}+m^2_{03},}\;a_j=m_{0,j}/b\), and \(\text{Tr}\) indicates the trace of a matrix.

Another condition for physical realizability is that the matrix can be expressed as a sum of nondepolarizing Mueller matrices. The Mueller matrix for a passive device \(\text{T}_\text{max}\leq 1\), a device without gain, must satisfy the relation \(\text{T}_\text{max}=m_{00}+\sqrt{m^2_{01}+m^2_{02}+m^2_{03}}\leq 1\).

In the 16-dimensional space of Mueller matrices , the matrices for ideal polarizers, ideal retarders, and other nondepolarizing elements lie on the boundary between the physically realizable Mueller matrices and the unrealizable matrices.

Thus, a small amount of noise in the measurement of a Mueller matrix for a polarizer or retarder may yield a marginally unrealizable matrix.

33. DEPOLARIZATION

Depolarization is the coupling of polarized into unpolarized light. If an incident state is polarized and the exiting state has a degree of polarization less than one, then the sample has depolarization. Consider three Mueller matrices of the following forms:

\[\tag{45}\boldsymbol{ID}=\left[\begin{array}&1&0&0&0\\0&0&0&0\\0&0&0&0\\0&0&0&0\end{array}\right]\quad\boldsymbol{PD}=\left[\begin{array}&1&0&0&0\\0&a&0&0\\0&0&a&0\\0&0&0&a\end{array}\right]\quad\boldsymbol{VD}=\left[\begin{array}&1&0&0&0\\0&a&0&0\\0&0&b&0\\0&0&0&c\end{array}\right]\]

Matrix \(\boldsymbol{ID}\) is the ideal depolarizer; only unpolarized light exits the depolarizer. Matrix \(\boldsymbol{PD}\) is the partial depolarizer; all fully polarized incident states exit with their incident polarization ellipse, but with a degree of polarization \(\boldsymbol{DOP(PD)}=a\).

Matrix \(\boldsymbol{PD}\) represents a variable partial depolarizer; the degree of polarization of the exiting light is a function of the incident state. Physically, depolarization is closely related to scattering and usually has its origin in retardance or diattenuation which is rapidly varying in time, space, or wavelength.

The amount of depolarization is a function of the incident state, and is defined for polarized incident states as \(1-\boldsymbol{DOP}\{\boldsymbol{MS}\}\). To describe the depolarization characteristics of a Mueller matrix, two figures of merit are useful.

The first is the Euclidian distance of the normalized Mueller matrix \(\boldsymbol{M}/m_{00}\)from the ideal depolarizer:

\[\tag{46}\left\|\frac{\boldsymbol{M}}{m_{00}}-\boldsymbol{ID}\right\|=\frac{\sqrt{\left(\displaystyle\sum_{i,j}m^2_{i,j}\right)-m^2_{00}}}{m_{00}}\]

This quantity varies from zero for the ideal depolarizer to \(\sqrt{3}\) for nondepolarizing Mueller matrices, including all pure diattenuators, pure retarders, and any sequences composed from them. Another useful measure is the depolarization of the matrix \(\boldsymbol{Dep}\boldsymbol{(M)}\):

\[\tag{47}\boldsymbol{Dep}\boldsymbol{(M)}=1-\frac{\sqrt{\left(\displaystyle\sum_{i,j}m^2_{i,j}\right)-m^2_{00}}}{\sqrt{3}m_{00}}\]

This index measures how close a Mueller matrix is to the set of nondepolarizing Mueller matrices, and is related to the average depolarization of the exiting light.

If \(\boldsymbol{Dep}\boldsymbol{(M)}=0\) and the matrix is physically realizable, then for incident polarized states, the exiting light is polarized. \(\boldsymbol{Dep}\boldsymbol{(M)}\) is closely related to the depolarization index of Gil and Bernabeu, (1985, 1986).

If a polarized state becomes partially polarized and then is polarized again while interacting with a sequence of elements, depolarization is still present in the matrix, despite the fact the output beam is polarized. Consider a depolarizer followed by a horizontal linear polarizer:

\[\tag{48}\frac{1}{2}\left[\begin{array}&1&1&0&0\\1&1&0&0\\0&0&0&0\\0&0&0&0\end{array}\right]\left[\begin{array}&1&0&0&0\\0&0&0&0\\0&0&0&0\\0&0&0&0\end{array}\right]=\left[\begin{array}&1&0&0&0\\1&0&0&0\\0&0&0&0\\0&0&0&0\end{array}\right]\]

The exiting beam is horizontally polarized. All incident states, however, have equal intensity transmission due to the depolarizer. For this example, \(\boldsymbol{Dep}=1-1/\sqrt{3}\)

34. NONDEPOLARIZING MUELLER MATRICES AND JONES MATRICES

A sample which does not display depolarization is nondepolarizing. A nondepolarizing Mueller matrix satisfies the condition

\[\tag{49}Tr\boldsymbol{(MM^T)}=4m_{00}\]

An incident beam with degree of polarization of one will exit with a degree of polarization of one. Many other necessary conditions for nondepolarization may be found in the literature (Abhyankar and Fymat, 1969; Barakat, 1981; Fry and Kattawar, 1981; Simon, 1982; Gil and Bernabeu, 1985; Cloude, 1986).

Jones matrices form an alternative and very useful representation of sample polarization, particularly because Jones matrices have simpler properties and are more easily manipulated and interpreted. It is desirable to be able to transform between these two matrix representations.

The complication in mapping Mueller matrices onto Jones matrices and vice versa is that Mueller matrices cannot represent absolute phase and Jones matrices cannot represent depolarization. Thus, only nondepolarizing Mueller matrices have corresponding Jones matrices.

All Jones matrices have a corresponding Mueller matrix, but because the absolute phase is not represented, the mapping is many Jones matrices to one Mueller matrix. A Jones matrix \(\boldsymbol{J}\) is transformed into a Mueller matrix by the relation

\[\tag{50}\boldsymbol{M}=\boldsymbol{U}(\boldsymbol{J}\otimes\boldsymbol{J}^*)\boldsymbol{U}^{-1}\]

in which \(\otimes\) represents the tensor product and \(\boldsymbol{U}\) is the Jones/Mueller transformation matrix (Simon,1982; Kim, Mandel, and Wolf, 1987)

\[\tag{51}U=\frac{1}{\sqrt{2}}\left[\begin{array}&1&0&0&1\\1&0&0&-1\\0&1&1&0\\0&i&-i&0\end{array}\right]=(U^{-1})^\dagger\]

where the Hermitian adjoint is represented by \(\dagger\). All Jones matrices of the form \(\boldsymbol{J}'=e^{j\phi}\boldsymbol{J}\) transform to the same Mueller matrix. Nondepolarizing Mueller matrices are transformed into Jones matrices using the following relations:

\[\tag{52}\boldsymbol{J}=\left[\begin{array}&\text j_\text{xx}&\text j_\text{xy}\\\text j_\text{yx}&\text j_\text {yy}\end{array}\right]=\left[\begin{array}&\rho_\text{xx}e^{i\phi_\text{xx}}&\rho_\text{xy}e^{i\phi_\text{xy}}\\\rho_\text{yx}e^{i\phi_\text{yx}}&\rho_\text{yy}e^{i\phi_\text{yy}}\end{array}\right]\]

where the amplitudes are

\[\tag{53}\begin{array}&\rho_\text{xx}=\frac{1}{\sqrt{2}}\sqrt{m_{00}+m_{01}+m_{10}+m_{11}}&\rho_\text{xy}=\frac{1}{\sqrt{2}}\sqrt{m_{00}-m_{01}+m_{10}-m_{11}}\\\rho_\text{yx}=\frac{1}{\sqrt{2}}\sqrt{m_{00}+m_{01}-m_{10}-m_{11}}&\rho_\text{xy}=\frac{1}{\sqrt{2}}\sqrt{m_{00}-m_{01}-m_{10}+m_{11}}\end{array}\]

and the relative phases are

\[\tag{54}\begin{array}&\phi_\text{xy}-\phi_\text{xx}=\text{arctan}\left(\frac{-m_{03}-m_{13}}{m_{02}+m_{12}}\right)&\phi_\text{yx}-\phi_\text{xx}=\text{arctan}\left(\frac{m_{30}+m_{31}}{m_{20}+m_{21}}\right)\\\phi_\text{yy}-\phi_\text{xx}=\text{arctan}\left(\frac{m_{32}-m_{23}}{m_{22}+m_{33}}\right)\end{array}\]

The phase \(\phi_\text{xx}\) is not determined; it represents the absolute phase relative to which the other phases are determined. If \(j_\text{xx}=0\), then both the numerator and denominator of the arctan are zero and the phase equations fail. The equations can then be recast in closely related forms to use the phase of another Jones matrix element as the reference for the ‘‘absolute phase.’’

35. HOMOGENEOUS AND INHOMOGENEOUS POLARIZATION ELEMENTS

This refers specifically to nondepolarizing Mueller matrices.

A nondepolarizing Mueller matrix is defined as homogeneous if the two eigenpolarizations are orthogonal, and inhomogeneous otherwise.

A nondepolarizing Mueller matrix can be factored into a cascade of a diattenuator Mueller matrix \(\boldsymbol{M}_D\) followed by a retarder Mueller matrix \(\boldsymbol{M}_R\) or into a cascade of the same retarder followed by a diattenuator \(\boldsymbol{M}'_D\),

\[\tag{55}\boldsymbol{M}=\boldsymbol{M}_R\boldsymbol{M}_D=\boldsymbol{M}'_D\boldsymbol{M}_R\]

**FIGURE 4.** The principal Stokes vectors associated with an inhomogeneous polarization element mapped on the Poincarè sphere. The incident Stokes vectors of maximum \(\boldsymbol{S}_\text{max}\) and minimum \(\boldsymbol{S}_\text{min}\) intensity transmittance are diametrically opposite on the Poincare spherè (indicating orthogonal polarization states) while the eigenpolarizations \(\boldsymbol{S}_q\) and \(\boldsymbol{S}_r\) are separated by the angle \(\chi\).

where the diattenuation of \(\boldsymbol{M}_D\) and \(\boldsymbol{M}'_D\) are equal. We define the diattenuation of \(\boldsymbol{M}\) as the diattenuation of \(\boldsymbol{M}_D\), and the retardance of \(\boldsymbol{M}\) as the retardance of \(\boldsymbol{M}_R\).

For a homogeneous device, \(\boldsymbol{M}_D=\boldsymbol{M}'_D\) and the eiegenvectors of \(\boldsymbol{M}_R\) and \(\boldsymbol{M}_D\) are equal. Thus the retardance and diattenuation of a homogeneous Mueller matrix are ‘‘aligned,’’ giving it substantially simpler properties than the inhomogeneous Mueller matrices.

A necessary condition for a homogeneous Mueller matrix is \(m_{01}=m_{10}\), \(m_{02}=m_{20}\), \(m_{03}=m_{30}\).

Then, \(\boldsymbol{P\{M\}}=\boldsymbol{D\{M\}}\).

The inhomogeneity of a Mueller matrix is characterized by an inhomogeneity index \(\boldsymbol{I(M)}\) which characterizes the orthogonality of the eigenpolarizations; \(\boldsymbol{I(M)}\) varies from zero for orthogonal eigenpolarizations to one for degenerate (equal) eigenpolarizations.

Let \(\hat{\boldsymbol{S}}_1\) and \(\hat{\boldsymbol{S}}_2\) be normalized polarized Stokes vector eigenpolarizations of a Mueller matrix; then

\[\tag{56}\boldsymbol{I(M)}=\frac{\sqrt{\hat{\boldsymbol{S}_1}\cdot\hat{\boldsymbol{S}_2}}}{2}=\cos(\chi/2)\]

where \(\chi\) is the angle between the eigenpolarizations on the Poincare sphere measured from the center of the sphere as illustrated in Fig. 4.