Computational Photography, an AI-powered Slopendium — 09 Optics, lenses, and aberration correction

▸9.1 Aberrations and optical challenges⧉

`fig-thin-lens-vs-real` · side-by-side "convenient lie vs reality": ideal thin lens (single plane, all parallel rays → one focus) vs a real thick multi-surface lens where outer rays focus short of the paraxial focus (spherical aberration → blur, no single point) ✅

`fig-doublet-triplet-gauss` · cross-sections of three classic designs as stacked glass elements — achromatic doublet (2), Cooke triplet (3, + − +), double-Gauss (6, ≈symmetric about the stop); each labels element count and the aperture stop ✅

`fig-double-gauss` · a labelled double-Gauss cross-section: 6 elements ≈symmetric about the central aperture stop, parallel ray bundle traced to the focal plane; notes near-mirror symmetry cancels odd aberrations (coma, distortion, lateral colour) ✅

`fig-telephoto-vs-retrofocus` · two two-group schematics — telephoto (+ then −, principal plane H′ pushed in front → physical length < f) vs retrofocus/inverted-telephoto (− then +, long back-focal distance to clear the SLR mirror); marks f vs physical length, H′, F′ ✅

`fig-zoom-mechanism` · a zoom at two focal lengths (short-f/wide vs long-f/tele): moving variator + compensator groups slide along the axis to change f while the focal plane (sensor) stays fixed; motion arrows between states ✅

`fig-principal-planes-pupils` · a thick-lens cardinal-points diagram: principal planes H, H′, nodal points N, N′, entrance & exit pupils (the stop imaged by front/rear groups), focal points F, F′; f measured from H′ via the H→H′ equivalent-thin-lens construction ✅

equations

thick-lens / system focal length via the two principal planes (FUNDAMENTALS' $1/f=1/u+1/v$ holds when $u,v$ are measured from $H, H'$)

f-number from the **entrance-pupil** diameter $N = f/D_{\text{EP}}$ (not the front element)

T-stop $T = N/\sqrt{\tau}$ (transmittance $\tau<1$)

effective vs back focal length.

▸9.1.1 Taxonomy of challenges⧉

• a single lens fixes **one** degree of freedom (its power/focal length) but you need to satisfy **many** constraints simultaneously: bring *every* field point to focus, at *every* wavelength, across the *whole* aperture, onto a *flat* sensor.

• the thin lens **dropped** everything past the linear $\sin\theta\approx\theta$; restore the next term $\sin\theta\approx\theta-\theta^3/6$ (**third-order / Seidel optics**) and rays from one point **no longer meet at one point**. The departures are the aberrations — same approximation, dropped one term later (FUNDAMENTALS margin note). [`fig-seidel-aberrations`]

• two families: **monochromatic** (geometry of the surfaces, present even in one color — the five Seidel) and **chromatic** (because $n(\lambda)$ — dispersion). Each is *described* here; each is *corrected* in *Aberrations correction*.

#### spherical aberration

• rays through the **edge** of the aperture focus **nearer** than paraxial rays → a soft glowing core; worst wide open. (Deliberately left in for *soft-focus*, *Special optics*.) [`fig-spherical-aberration`]

#### coma

• off-axis a point images as a **comet/teardrop** smear (asymmetric) — the name *is* the comet shape.

#### astigmatism

• off-axis, tangential and sagittal line-features focus at **different depths** → you can't focus radial and tangential detail at once.

#### field curvature

• (Petzval): the sharp image lies on a **curved surface**, not the flat sensor → corners soft when the center is sharp (why "flat-field" objectives are prized).

#### chromatic aberration

• glass index varies with wavelength (**dispersion**, measured by the **Abbe number** $V_d$), so colors refract differently. [`fig-chromatic-aberration`]

• **axial (longitudinal)**: R/G/B focus at **different depths** → color halos; worst at fast apertures.

• **lateral (transverse)**: R/G/B imaged at **different magnifications** → colored fringes that **grow toward the corners** (and the everyday **purple fringing** at high-contrast edges, partly CA, partly sensor/flare). Lateral CA is essentially a **per-channel radial scaling**; axial CA is a depth-dependent blur — the two are a continuum but call for **different fixes** (*Aberrations correction*).

#### radial distortion

• magnification varies with field → straight lines bow (**barrel** / **pincushion**). Uniquely, distortion **moves no light energy** (points stay sharp, just mis-placed), so it is the **easiest to correct in software** — a **radial** remapping. [`fig-distortion-barrel-pincushion`]

• the standard parametric models used for lens-profile correction and camera calibration map undistorted radius $r_u$ and distorted radius $r_d$ (both measured from the distortion center):

• **Brown–Conrady** (radial, forward): $r_d = r_u\,(1 + k_1 r_u^2 + k_2 r_u^4 + k_3 r_u^6)$, with optional **tangential** (decentering) terms $p_1, p_2$ for off-axis-mounted elements; $k_1<0$ gives barrel, $k_1>0$ pincushion.

• **division model** (Fitzgibbon): $r_u = r_d / (1 + \lambda_1 r_d^2 + \lambda_2 r_d^4 + \dots)$ — often a single $\lambda_1$ captures even strong (wide-angle) distortion and inverts more cleanly, which is why it is popular for calibration.

#### wave effects and diffraction

• light is a **wave**; a wave through an aperture **diffracts** (spreads), so even a perfect lens images a point as the **Airy** disc + rings, angular radius $\theta \approx \lambda/D$ — a **floor** (the diffraction limit), not a defect. Bigger aperture → less diffraction.

• **the pattern *is* the aperture's Fourier transform** (squared magnitude = the diffraction-limited PSF): round hole → Airy disc; a wider hole has a narrower transform = a smaller spot (the $\lambda/D$ floor restated). Explains **sunstars** = FT of the polygonal blade opening (each blade-edge → a perpendicular streak; $n$ blades → $n$ spikes if even, $2n$ if odd). The lens computes a Fourier transform for free → ties to [[03 Basic image processing and ISP]] (Fourier) and the **MTF**.

• stopping down cures geometric aberrations but eventually raises the diffraction floor → every lens has a **sweet spot** ~2–3 stops down; past it, uniform softening.

#### vignetting

#### flare and coating

▸9.2 Aberrations correction⧉

`fig-achromat-doublet` · an achromatic doublet: positive crown cemented to negative flint; singlet panel (red & blue split = chromatic aberration) vs doublet panel (red = blue at a common focus); labels crown/flint, low/high dispersion ✅

⬜ figure not yet created

the dispersion cancels)

fig-lens-profile-correction · computational lens correction: a real window-grid facade degraded (barrel + vignette + lateral CA) → a measured profile's three radial maps (warp / gain / per-channel rescale) → corrected; note: geometric defects only, not blur (Aberrations correction) ✅

`fig-psf-deconvolution` · non-blind deconvolution by the measured PSF: a real crop convolved with a comatic PSF (inset) + noise → Wiener deconvolution (sharper) → under-regularized (noise amplified); image = scene ∗ PSF (Aberrations correction) ✅

equations

**achromat condition** $\phi_1/V_1 + \phi_2/V_2 = 0$ (powers weighted by dispersion cancel the chromatic term — Abbe number $V_d=(n_d-1)/(n_F-n_C)$ from the challenges chapter)

image = scene $*$ **PSF** $\to$ correction = **deconvolution** by the *measured, spatially-varying* PSF (cross-ref Wiener $\hat X = \overline K Y/(|K|^2+\mathrm{SNR}^{-1})$).

▸9.2.1 The two families of cure⧉

• every aberration is *described once* in *Aberrations and optical challenges*; here each is *corrected* — **in glass** (shape and combine elements, pick glasses, stop down) or **in software** (remap, rescale, deconvolve). Each design feature cancels a specific aberration, so the cures are organized by the defect they target.

▸9.2.2 Correction in glass⧉

• **achromat** (crown + flint doublet): pair a low-dispersion positive element with a high-dispersion negative one so the **chromatic terms cancel** at two wavelengths — the **achromat condition** $\phi_1/V_1+\phi_2/V_2=0$. [`fig-achromat-doublet`]

• **apochromat / superachromat**: correct **three (or more) wavelengths**, using **ED / low-dispersion / fluorite** glasses — for long telephotos where residual color is visible.

• **aspheric elements**: a non-spherical surface gives the designer an extra knob to kill **spherical aberration** with **fewer elements** (key to compact, fast, and **cell-phone** lenses). Sidebar: **spherical surfaces are easy to grind** (any two ground together tend to a sphere), so aspheres came late and rely on **precision molding** — which is exactly why cheap **molded plastic/glass** aspheres gave phone cameras their edge.

• **stopping down** reduces the aberrations that scale with aperture (spherical, coma, lateral color, vignetting) — the universal field fix — until **diffraction** sets the **sweet spot** (FUNDAMENTALS diffraction limit; usually ~2–3 stops down).

▸9.2.3 Computational correction⧉

• **lens-profile correction** (in the ISP): from a **measured profile** per lens+focal length+aperture, software undoes **geometric distortion** (radial warp via the Brown–Conrady / division models from *Aberrations and optical challenges* — fit $k_1,k_2,\dots$ or $\lambda_1$ per lens; detailed below in *Radial distortion correction*), **vignetting** (a radial gain map — both optical *and* the FUNDAMENTALS $\cos^4$), and **lateral CA** (per-channel radial rescale). Cheap, robust, ubiquitous — the everyday face of optics-plus-computation. [`fig-lens-profile-correction`]

• **deconvolution by the measured PSF**: an aberrated lens has a known, **spatially-varying PSF**; measuring it lets you **deconvolve** to recover sharpness — the same machinery as [[Slopendium/08 Single-image computational photography|deblurring]] (Wiener / regularized inverse), here with a **calibrated** rather than unknown kernel, so it is non-blind and per-field-position. [`fig-psf-deconvolution`]

• limits: deconvolution **amplifies noise** at frequencies the lens killed (FUNDAMENTALS/deblurring L) — it sharpens what was *attenuated*, not what was *destroyed*; and a **spatially-varying** PSF means per-region kernels.

• **forward-ref**: instead of *fighting* the lens's accidental blur, **engineer** a PSF that inverts cleanly and even encodes depth — **coded aperture / wavefront coding** (Dowski & Cathey 1995) and end-to-end optics (Advanced).

▸9.2.4 Radial distortion correction⧉

• **the most software-friendly aberration** (it is a *lens* defect, not a projection one — moved here from the warping part, where keystoning lives): radial distortion **moves no light energy** (points stay sharp, only mis-placed), so it comes off almost perfectly as a **1-D remap in radius**, with **no loss of sharpness** — the cleanest cure in the catalogue. Because it is so correctable it is increasingly *designed in*: phone/compact wide-angles ship heavy **barrel** distortion on purpose (smaller, faster glass) and trust the ISP to straighten it. [`fig-radial-distortion-correction`]

• **the symptom**: world-straight lines image as **curves** — **barrel** (bow **outward**, magnification falls with radius — wide-angle), **pincushion** (bow **inward** — telephoto), or a **mustache/wavy** mix; radially symmetric about the principal point, **worst at the edges**; the extreme is the **fisheye** (so strong it is the intended projection, not a defect).

• **the model — per-radius remap**: a 1-D function of radius applied along each ray (points move only **in/out**, never sideways) — the same forward **Brown–Conrady** $r_d = r_u(1+k_1 r_u^2+k_2 r_u^4+\dots)$ ($k_1<0$ barrel, $k_1>0$ pincushion) and **division** $r_u = r_d/(1+\lambda_1 r_d^2+\dots)$ models stated in *Aberrations and optical challenges*; correcting the image **inverts** that map per radius.

• **where the parameters come from**: a **lens profile** (a per-(lens, focal length, aperture) table shipped by **Lightroom/ACR, PTLens, DxO**, keyed off **EXIF** — the same profile that undoes vignetting and lateral CA in one pass) or **plumb-line calibration** (choose the $k_i$/$\lambda$ that make imaged straight lines straight — minimize their residual curvature).

• **applying it — inverse-warp + resample**: loop over output pixels, push each back through the radial map to its source radius, and resample the input (the transport engine of [[Warping and resampling]]); the stretched/compressed edge regions carry the usual **prefilter-before-downsample** care (**L16**), and out-of-frame corners follow the boundary/crop policy. The warping part now points *back here* for this correction.

---

▸9.3 Lens optimization⧉

`fig-merit-function-loop` · lens design as optimization, a closed loop: parameters → ray-trace (fields × wavelengths × pupil zones) → merit function Φ (weighted spot/wavefront + constraints) → damped-least-squares step → repeat; the book's forward-model+loss+optimizer pattern (Lens optimization) ✅

⬜ figure not yet created

before/after optimization) fig-wb-before-after

`fig-lens-design-tradeoff` · lens-design Pareto cartoon as a 6-axis radar (sharpness, speed, size, weight, cost, distortion): fast pro prime vs compact phone lens overlaid, plus a dashed polygon showing where software correction shifts the phone lens (Lens optimization) ✅

`fig-rms-spot-vs-field` · RMS spot radius (µm) vs field (center→corner) for 3 wavelengths, rising toward the edge, with the Airy diffraction limit as a shaded dashed floor (Lens optimization) ✅

• the move: add elements → add **surfaces, glasses, and air gaps** → add free parameters to *trade off* aberrations against each other. The art is doing it with as few elements as the quality target allows (cost, weight, flare).

equations

ray-trace forward model — apply Snell's law $n_1\sin\theta_1=n_2\sin\theta_2$ surface-by-surface (no paraxial approximation)

merit function $\Phi = \sum_{\text{fields}}\sum_{\lambda}\sum_{\text{pupil rays}} w_i\,\lVert (x_i,y_i) - (\bar x,\bar y)\rVert^2 \

+\

(\text{constraints on } f, \text{length, edge thickness}\dots)$ — a weighted sum of squared transverse ray errors plus penalties

damped-least-squares (Levenberg–Marquardt-style) update on the parameter vector.

▸9.3.1 The high-level idea: design as optimization⧉

• restate the impossibility: beyond first order there is **no formula** that places every ray; you **search** the parameter space instead.

• **parameters (the unknowns)**: each surface's **radius of curvature**, the **glass** of each element (index *and* dispersion — a discrete catalog choice), element **thicknesses** and **air-gap spacings**, **aspheric** coefficients, and the **stop** size/position.

• **objective (the merit function)**: a single number summing **residual aberration** over a grid of **field angles**, a set of **wavelengths** (e.g. C, d, F lines), and a sampling of **aperture/pupil** zones — typically RMS spot size or wavefront error — plus **hard constraints** (target $f$, total length, positive edge thickness, vignetting limits).

• "**the big loss integral**": conceptually, integrate the blur over *all the conditions the lens must satisfy at once* (every field point, color, aperture position) and drive it down — the discrete merit function is that integral, sampled.

▸9.3.2 The forward model: ray-tracing and spot diagrams⧉

• evaluate a candidate design by **exact ray-tracing**: launch many rays from each field point through a grid of pupil points, refract them by **Snell's law at every surface** (the full non-linear law, not the paraxial one), and see where they land.

• a **spot diagram** plots those landing points at the image plane for one field point and color: a tight cluster = sharp; a smeared blob = aberrated. Different colors land in different clusters (chromatic). This *is* the lens's PSF, sampled by rays. [`fig-spot-diagram`]

• the optimizer shrinks these spots across all fields/colors simultaneously. [`fig-merit-function-loop`, `fig-rms-spot-vs-field`]

▸9.3.3 From hand calculation to software⧉

• **the old-fashioned way**: designers traced rays **by hand** (and with logarithm tables, then mechanical/early electronic calculators), iterating curvatures laboriously — a few rays a day. The classic doublets/triplets were found this way.

• **today**: optical-design software (**Zemax OpticStudio**, **CODE V**, OSLO) ray-traces thousands of rays per evaluation and runs **damped-least-squares** optimization over the parameter vector, plus **global** search to escape local minima and **tolerancing** to keep the design manufacturable.

• **how people actually use it** (workflow, not a black box): start from a **starting point** (the program's library, a scaled existing lens, or a competitor's **patent** prescription) — never a blank sheet; set up the *problem* — field points, wavelengths + weights, f-number, then mark which parameters are **variables** and which are pinned by **solves** (auto-recompute one number to hold focal length / back focus); pick the **default** RMS-spot/wavefront merit + hand-added **operands** (cap distortion, edge thickness, length, glass cost); run **local** damped-least-squares (seconds), read **spot diagrams / ray fans / MTF**, nudge weights, re-run — a tight **interactive loop**, optimizer does arithmetic, human supplies judgment; when stuck, unleash **global** search (Zemax "Hammer", CODE V global synthesis) for hours/overnight; then **tolerance** + export to mechanical CAD / stray-light. The craft moved *up a level*: from tracing rays by hand to **steering** an optimizer.

• it is the same machinery as the rest of the book — forward model + merit/loss + iterative optimizer — applied to glass.

▸9.3.4 Tradeoffs: the design is always a compromise⧉

• the merit function lives under hard **constraints**, and the axes fight: **speed** (low f-number) vs. **aberration control**; **size/weight** vs. **correction**; **cost** (number of elements, exotic glass, aspheres) vs. **sharpness**; **zoom range** vs. **everything**. You buy one by spending another. [`fig-lens-design-tradeoff`]

• this is exactly why **computation** is attractive: let the optics be *good enough and cheap*, and **correct the rest in software** (lens-profile correction, deconvolution — *Aberrations correction*), shifting the operating point on the Pareto front.

• **forward-ref — end-to-end optimization**: make the *whole pipeline* (lens + sensor + reconstruction network) differentiable and optimize the **merit of the final image**, not of the optics alone — the optics may then look "wrong" by classical metrics yet give better pictures (Advanced part: differentiable / computational optics, coded apertures, wavefront coding).

▸9.3.5 Tolerancing: from the nominal design to a manufacturable one⧉

⬜ figure not yet created

a tolerance budget — performance (MTF) vs each perturbation

⬜ figure not yet created

a **Monte-Carlo** as-built MTF distribution (nominal vs the manufacturing spread, with a yield cutoff) [schem/plot]

• **the error sources**: per-surface **radius** error, **thickness / airspace** error, glass **index & dispersion** (melt-to-melt) variation, **wedge**, surface **irregularity** (departure from the test sphere), and assembly **decenter / tilt / roll** of each element

• **sensitivity vs statistics**: a **sensitivity analysis** ranks which perturbations hurt most (the tight tolerances that drive cost); a **Monte-Carlo** draws many random as-built lenses and predicts the **performance distribution** and the manufacturing **yield**

• **compensators**: leave a few adjustments free at assembly — refocus, set one **airspace**, or shim/center an element — to *recover* performance, so tolerances elsewhere can loosen (cheaper)

• **the punchline**: nominal performance is the easy part; **as-built** performance and **yield** decide cost and feasibility. Loosen tolerances and lean on **computational correction** (lens-profile correction, deconvolution — *Aberrations correction*) to claw back the residual — another reason cheap optics + computation wins.

---

▸9.4 A short bestiary of classic designs⧉

• **achromatic doublet**: a positive crown + negative flint, usually cemented; the *minimum* system that corrects color (two wavelengths) and is also used to tame spherical aberration. (Full treatment in *Aberrations correction*.) [`fig-doublet-triplet-gauss`]

• **Cooke triplet**: three elements (+ − +) — the smallest design with enough freedom to correct **all** the primary (Seidel) aberrations *and* color; the ancestor of most simple lenses. [`fig-doublet-triplet-gauss`]

• **double-Gauss (Planar)**: roughly **symmetric about the aperture stop**, 6–7 elements; symmetry automatically cancels the *odd* aberrations (coma, distortion, lateral color), which is why it dominates fast normal lenses ($f$/2 and faster). [`fig-double-gauss`]

• **telephoto vs. retrofocus** — two ways to decouple physical length from focal length: [`fig-telephoto-vs-retrofocus`]

• **telephoto** (positive group then negative): the rear negative group pushes the rear principal plane *forward*, so a long-focal-length lens is **physically shorter** than its $f$.

• **retrofocus / inverted telephoto** (negative then positive): pushes the rear principal plane *back*, giving a long **back-focal distance** — needed so a wide-angle lens clears the SLR mirror box (and historically why mirrorless can go wider/smaller).

• **zoom**: a variable-focal-length system — groups slide so $f$ changes (the *variator*) while a *compensator* group keeps the image plane fixed; many elements, many compromises, the hardest thing to optimize. [`fig-zoom-mechanism`]

▸9.4.1 The lens as a system: cardinal points, pupils, f-number, T-stop⧉

• **principal planes** $H, H'$: the two planes from which object/image distances and focal length are measured; with them, the thin-lens equation $1/f=1/u+1/v$ holds for the whole thick system. **Nodal points** (where a ray crosses the axis undeviated) coincide with the principal points in air — the pivot for "no-parallax" panorama heads. [`fig-principal-planes-pupils`]

• **entrance & exit pupils**: the images of the aperture stop seen from object / image space. The **entrance pupil** is the opening that actually gathers light, so the true f-number is $N = f/D_{\text{EP}}$ — *not* $f$ over the front-element diameter. Pupil position governs perspective of the bokeh and the **no-parallax point**.

• **f-number** (recap from FUNDAMENTALS) sets light-gathering and DoF; here it is pinned to the entrance pupil. **Vignetting** clips the pupil off-axis (next section's cat's-eye bokeh and the optical part of corner darkening).

• **T-stop**: the f-number assumes 100% transmission; real glass and coatings lose some, so cinema lenses are marked in **T-stops**, $T = N/\sqrt{\tau}$ with transmittance $\tau$, giving the *actual* exposure. Ties to *Glare suppression* (coatings raise $\tau$) and FUNDAMENTALS exposure.

• **element count today**: modern zooms and fast primes run **10–20+ elements** in several groups (with aspheres and special glasses) — a direct consequence of stacking corrections; phone "lenses" are 5–8 tiny molded-plastic/glass elements. The count is the visible cost of cancelling everything in glass instead of software.

---

▸9.5 Special optics⧉

`fig-scheimpflug-tilt` · the Scheimpflug principle: tilting the lens makes the lens plane, image plane & plane of focus meet in a common line → the focus plane tilts into a wedge of sharpness; contrasted with the parallel (untilted) case ✅

`fig-fisheye-projection` · projection mappings — image radius r vs incidence angle θ for rectilinear (r=f·tanθ, blows up near 90°), equidistant (r=f·θ) and equisolid (r=2f·sin(θ/2)); why fisheye reaches ~180° where rectilinear can't ✅

`fig-seidel-aberrations` · the five monochromatic (Seidel) aberrations at a glance — spherical, coma, astigmatism, field curvature, distortion — each a tiny ray/spot icon + a one-line "what it looks like" note ✅

`fig-catadioptric-mirror-lens` · a mirror (catadioptric) lens: light folds via primary + secondary mirror (Cassegrain); central obstruction → ring-shaped (doughnut) bokeh — folded path + doughnut-highlight inset ✅

`fig-wide-angle-perspective-distortion` · wide-angle perspective distortion: off-axis spheres image as radially-stretched ellipses (flat-sensor geometry), and faces near a wide frame's edge are widened — not a lens flaw; forward-ref to correction (Pinhole image formation) ✅

`fig-anamorphic-squeeze` · anamorphic optics: 2× horizontal squeeze at capture → de-squeeze in post; a circle → vertical oval on sensor → restored; oval bokeh + horizontal flares noted ✅

`fig-fresnel-zones` · a Fresnel lens: collapse a thick convex lens into concentric ring "zones" that keep the local surface slope but drop the bulk; cross-section of a normal lens vs its Fresnel equivalent ✅

`fig-grin-vs-metalens` · three ways to bend light: shaped bulk lens (refraction), a GRIN rod (curved ray through a varying-index medium, flat faces), and a metalens (flat surface, sub-wavelength nanostructures setting a phase profile); "keep the phase, drop the thickness" ✅

`fig-tilt-shift-perspective` · the *shift* movement (not tilt): tilting the camera up keystones a building (converging verticals) vs. keeping the sensor vertical/parallel and raising the lens within a large image circle, so verticals stay parallel — architectural perspective control ✅

`fig-telescope-types` · refractor (doublet objective → focus) vs. reflectors — Newtonian (concave primary + flat 45° secondary → side focus) and Cassegrain (primary + convex secondary → focus through a hole behind the primary); glass vs. mirrors to a focus ✅

`fig-microscope-objective` · high-NA objective at a short working distance: finite (fixed tube length, objective forms the image directly) vs. infinity-corrected (subject at front focus → parallel "infinity space" → separate tube lens forms the image) ✅

`fig-soft-focus` · a soft-focus portrait lens: under-corrected spherical aberration (marginal rays cross nearer than paraxial → halo at the paraxial plane) and the resulting PSF — a sharp core inside a broad glow vs. a well-corrected lens; a *wanted* aberration ✅

`fig-apodization-pupil` · an apodizing filter: pupil transmission profile (hard-edged plain aperture vs. radially graded) → the defocus disk each makes (flat disk with a hard ring vs. a smoothly fading disk, no ring) — shaping the bokeh PSF in hardware ✅

equations

**Scheimpflug condition** (lens plane, image plane, focus plane meet in one line) + the **Hinge rule** for where the focal wedge pivots

fisheye projection laws $r = f\theta$ (equidistant), $r = 2f\sin(\theta/2)$ (equisolid-angle) vs. rectilinear $r = f\tan\theta$

anamorphic squeeze factor (typ. 2×)

Fresnel = local surface slope preserved, bulk removed.

▸9.5.1 Tilt-shift and the Scheimpflug principle⧉

• **shift**: translate the lens parallel to the sensor to control **perspective** (keep verticals parallel — architecture) and to **stitch** without parallax; the image circle must be large enough to cover the shifted frame. [`fig-tilt-shift-perspective`]

• **tilt**: rotate the lens so the **plane of focus tilts** away from parallel-to-sensor. By the **Scheimpflug condition**, the lens plane, image plane, and plane of sharp focus all meet along one line — so you can lay the focal plane along a tabletop (everything sharp) or slice a thin wedge across a scene (the "miniature-faking" look). [`fig-scheimpflug-tilt`]

• ties to *Depth of field*: tilt reshapes the in-focus region from a slab into a **wedge**; it does not add DoF, it **reorients** it.

▸9.5.2 Fisheye and non-rectilinear projection⧉

• a normal lens targets **rectilinear** projection ($r=f\tan\theta$) — straight lines stay straight — which **blows up** past ~100° field. A **fisheye** instead adopts a projection that *bends* straight lines to fit a huge field (often **180°+**) onto the sensor: **equidistant** $r=f\theta$ or **equisolid-angle** $r=2f\sin(\theta/2)$. [`fig-fisheye-projection`]

• the barrel "distortion" is **the intended mapping**, not an aberration — and it is **invertible in software** to rectilinear (at the cost of stretched corners), the cleanest example of optics-and-computation splitting the work.

▸9.5.3 Mirrors: catadioptric and reflecting systems⧉

• **catadioptric** = mirrors **and** lenses. Folding the path with mirrors makes a **compact long telephoto** (mirror lens) and is naturally **achromatic** (reflection has no dispersion). The **central obstruction** gives a **ring-shaped pupil**, so out-of-focus highlights render as **doughnuts** — the signature mirror-lens bokeh (cross-ref *Depth of field*). [`fig-catadioptric-mirror-lens`]

• **telescopes**: reflectors (Newtonian, Cassegrain) and catadioptrics (Schmidt–Cassegrain, Maksutov) scale the same idea to large apertures; (space) telescopes also motivate the **glare/baffling** chapter. [`fig-telescope-types`]

▸9.5.4 Periscope / folded-lens design (smartphone telephoto)⧉

• **the constraint**: a phone body is only a few millimetres thick, so a lens whose axis runs perpendicular to the screen cannot be physically long — capping the focal length and thus the telephoto reach.

• **the fix**: a **prism or mirror** placed just behind the entrance window **bends the optical axis ~90°**, so the lens groups lie *sideways* along the phone's width or length, where there is room for a long focal length; the sensor sits parallel to the phone's back. Because the path is "folded" the design is called a **periscope** or **folded-optics** lens. [`fig-catadioptric-mirror-lens` (cross-ref, same folding idea)]

• **what it buys**: a long effective focal length — ~5×–10× optical zoom / ~100–250 mm equivalent — in a thin body, without the protruding bump a straight long-lens design would require.

• **mechanics**: moving internal groups give continuous or stepped zoom; **OIS** is commonly implemented by *tilting or shifting the prism* (or a group) rather than floating a lens element — a compact, low-power actuator.

• **tetraprism (Apple, iPhone 15 Pro Max)**: folds the path **multiple times** with a tetrahedral prism, lengthening the effective optical path further within the same device width.

• **history**: popularized by the **Huawei P30 Pro (2019)**, then adopted by Samsung (Galaxy S20 Ultra and successors) and Apple; the reason phones acquired genuine long-reach telephoto. Cross-ref: the **telephoto design** in [[A short bestiary of classic designs]] compresses *physical length* relative to focal length by placing principal planes; the periscope instead *reorients* the whole long lens so its length runs parallel to the phone face — two independent strategies for compact telephoto, usable together.

• **trade-offs**: the small front window and folded path limit the **maximum aperture** (less light than a rear-camera main module); each periscope module covers a fixed or narrow zoom range; prism alignment and cost add manufacturing complexity.

▸9.5.5 Anamorphic optics⧉

• a cylindrical-element system **squeezes the image horizontally** (classically 2×) so a wide aspect ratio fits a narrower frame, then a matching de-squeeze restores it on projection. [`fig-anamorphic-squeeze`]

• side effects that became a *look*: **oval bokeh** and **horizontal (blue) flare streaks** — an aesthetic now imitated computationally.

▸9.5.6 Macro, microscope objectives, and telescopes⧉

• **macro**: images at **life-size or larger** ($m\ge 1$), where the thin-lens conjugates run the *other* way (image distance large), DoF is razor-thin (cross-ref *Depth of field* — **focus stacking** lives there; [Stanford focal-stack paper](https://graphics.stanford.edu/papers/focalstack/focalstack.pdf)), and special **flat-field** correction matters.

• **microscope objectives** and **telescope objectives** are the same compound-lens optimization pushed to extreme magnification / aperture, with their own corrections (apochromatic, flat-field). The high-NA microscope objective images a tiny subject at a short **working distance**, in either a **finite** (fixed tube length) or modern **infinity-corrected** (parallel "infinity space" + a separate tube lens) configuration. [`fig-microscope-objective`]

▸9.5.7 Soft-focus and apodization⧉

• **soft-focus** portrait lenses deliberately leave **under-corrected spherical aberration**, overlaying a sharp core with a glowing halo — a *wanted* aberration. [`fig-soft-focus`]

• **apodization**: a radially-graded (smooth-edged) pupil that **softens the bokeh transition** (smooth-edged defocus disks, no hard ring) — shaping the PSF on purpose, a hardware cousin of computational bokeh. [`fig-apodization-pupil`]

▸9.5.8 From shaped glass to thin structures: Fresnel, diffractive, GRIN, metalenses⧉

`fig-diffraction-grating-sim` · live 2-D FDTD wave sim of a **diffraction grating** (many slits): plane wave splits into orders along d·sinθ=mλ, orders sweep with wavelength (longer λ → larger deviation); intensity comb peaks at the orders (OPTICS → Diffractive elements) ✅

• *(throughline to the Advanced part: "keep the phase / slope, drop the thickness.")*

#### Fresnel lenses

• a **Fresnel lens** keeps only the part of a lens that actually bends light — its **surface slope** — and throws away the bulk glass: slice the lens into concentric annular zones and **collapse each to one thin sheet**, so the smooth profile becomes a set of **sawtooth rings** (a ring of tiny prisms) with the same local refraction as the original curve.

• **why**: dramatically **thinner, lighter, cheaper** than an equivalent thick lens — at the cost of **image quality** (scattering and diffraction at the ring discontinuities, the visible concentric-ring artifacts), so it is used where thinness / weight / cost beat sharpness.

• **history & uses**: invented by **Augustin Fresnel** for **lighthouses** (a huge aperture without an unliftable blob of glass); today in **condenser / projection optics**, **screen & page magnifiers**, **solar concentrators**, **vehicle brake lights**, thin illumination optics, and the **Fresnel/PF telephoto** elements in modern camera lenses.

• **the throughline**: it is the first step from "shape the glass" toward **encoding the lens as a thin structure** → diffractive optics, kinoforms, and ultimately **flat optics / metalenses** (Advanced); the same "keep the phase, drop the thickness" idea recurs in **wavefront coding**.

#### Diffractive elements

• bend light by **diffraction** off fine surface structure rather than bulk refraction; their dispersion has the **opposite sign** to glass, so a diffractive element can **correct chromatic aberration** in a hybrid lens (Canon DO, Nikon PF). Thin, light, but flare-prone at the rings. [`fig-grin-vs-metalens`]

• the canonical diffractive element is the **diffraction grating** — a periodic array of slits/grooves of period $d$. Wavelets from adjacent periods reinforce only where their path difference is a whole number of wavelengths, the **grating equation $d\sin\theta_m = m\lambda$**, so incident light splits into discrete **orders** $m=0,\pm1,\pm2,\dots$ at angles $\theta_m$. Because $\theta_m$ **grows with $\lambda$**, a grating **disperses** white light into a spectrum — the heart of the **spectrometer** — dispersing in the *opposite* sense to a prism (prisms bend blue most; gratings bend red/long-λ most), and the same physics behind the rainbow sheen of a **CD/DVD**. 🖱️ **Interactive (web edition):** a live 2-D **FDTD wave simulation** — a plane wave hits a multi-slit grating and breaks into orders along the $d\sin\theta=m\lambda$ directions; **sweep the wavelength and watch the orders fan wider** (longer λ → larger deviation), or change the period. Absorbing (sponge + Mur) borders, so nothing reflects off the window edges. Companion to the single-slit `fig-diffraction-wave-sim` (FUNDAMENTALS). [`fig-diffraction-grating-sim`; interactive, FDTD]

#### Mirrors (cross-link)

• see *Mirrors: catadioptric and reflecting systems* above (reflection is the other way to avoid dispersion).

#### GRIN (gradient-index)

• the index **varies continuously inside** the glass, so a ray bends along a *curved* path even through flat surfaces — focusing with a homogeneous-looking slab. Used in fiber coupling, endoscopes, photocopier rod arrays, and the eye's own lens. [`fig-grin-vs-metalens`]

#### Metalenses (forward-ref)

• a flat **sub-wavelength nanostructure metasurface** imposes the lens's phase profile with essentially **zero thickness** — the endpoint of the Fresnel→diffractive→flat-optics arc. Full treatment in the **Advanced** part; placed here as the conceptual destination.

---

▸9.6 Focus, autofocus⧉

`fig-focus-conjugate-move` · focusing geometry: as subject distance u changes the image distance v must change (1/f=1/u+1/v); the lens slides to keep the image plane on the fixed sensor — near vs far focus (lens extends vs retracts) ✅

`fig-contrast-af-curve` · contrast-detect AF: high-frequency contrast (sharpness) vs focus position, peaked at best focus; hill-climb arrows that must overshoot past the peak then step back (no direction cue) ✅

⬜ figure not yet created

hunting to find it)

`fig-pdaf-principle` · phase-detect AF: a separator splits the beam to two line sensors; the signed phase (separation) of the two profiles gives direction + amount of defocus ("stereo in the lens"); in-focus vs front/back-focus ✅

fig-dual-pixel-af · on-sensor / dual-pixel AF: a photodiode split into L/R halves under one microlens; the L vs R images give per-pixel disparity → defocus; split-pixel cross-section + the two readout images ✅

`fig-depth-from-defocus` · depth from defocus: the blur-disk diameter grows with distance from the focus plane; two shots at different focus/aperture → estimate depth from the relative blur ✅

equations

focus error → sensor shift via $1/f=1/u+1/v$ (differentiate to get sensor move per diopter)

contrast metric (sum of squared gradients / high-pass energy) maximized

phase-detect: **disparity between the two pupil half-images is proportional to defocus** (it is stereo with a baseline = the pupil split — ties to FUNDAMENTALS $Z=fB/d$)

defocus blur radius $\propto$ distance from focus and aperture (from *Depth of field*).

▸9.6.1 Focus mechanics⧉

• focusing moves the **lens (or a focusing group, or the sensor)** to set the image distance $v$ so the subject's conjugate falls on the sensor; an object at infinity sits at $v=f$, nearer objects push $v$ back (FUNDAMENTALS). [`fig-focus-conjugate-move`]

• **manual focus aids**: ground-glass/split-prism, the **rangefinder** (a coincidence/disparity device — cross-ref FUNDAMENTALS stereo), and modern **focus peaking** / magnified live-view.

• **focus-by-wire**: in many AF lenses the focus ring is *not* mechanically linked — it sends an electronic signal to a motor (stepper / ultrasonic **USM** / linear voice-coil), enabling fast, quiet, programmable, and lens-data-aware focusing.

• **focus breathing**: as you refocus, a lens's **focal length / field of view subtly changes** — the image appears to **zoom slightly** as the focus pulls (because the focusing group's motion also shifts the effective $f$). An artifact in two regimes: in **video** it shows as a visible "breathing" during **focus pulls** (distracting in cinematography), and in **focus stacking / depth-from-defocus** it means consecutive frames **don't register** (different magnification → must be re-aligned, cross-ref *Depth of field* §focus stacking and part 08's focal-stack chapter, which calls it out). Modern **cine lenses** are designed to be breathing-free, and some cameras/software **correct** it digitally.

▸9.6.2 Contrast-detection AF⧉

• premise: a sharp image has **more high-frequency energy** than a blurred one. Maximize a **contrast metric** (sum of squared gradients / high-pass energy) over the focus position — gradient *ascent* on the sharpness hill. [`fig-contrast-af-curve`]

• strength: needs **no special hardware**, uses the imaging sensor itself, very accurate at the peak.

• weakness: the metric gives **no direction** — the camera must **hunt** (step, observe, maybe reverse), so it is slower and visibly racks back and forth; struggles in low contrast / low light.

▸9.6.3 Phase-detection AF (the split-pupil / stereo trick)⧉

• compare light from **two halves of the exit pupil**: in a dedicated AF sensor a **separator (secondary) lens** re-images the two pupil halves onto **two line sensors**. The **lateral offset** between the two patterns is a **signed** measurement of defocus — its **sign gives the direction**, its **magnitude gives how far** — so the lens can be driven to focus in **one move**, no hunting. [`fig-pdaf-principle`]

• this is literally **stereo inside the lens**: the baseline is the pupil split, and the offset is a disparity proportional to defocus (cf. FUNDAMENTALS $Z=fB/d$).

• classic **SLR** implementation: a partly-transparent main mirror + sub-mirror diverts light down to a dedicated PDAF module — fast, but a *separate* optical path needing calibration (front/back-focus).

• **sidebar — why the reflex mirror is disappearing** (synthesis of the author's 2018 *"DSLR will probably die"* essay, see [[Blog Index]]): the mirror's old benefits (optical TTL view, dedicated fast PDAF) shrink as EVFs improve and on-sensor PDAF closes the gap; its costs do not — added mechanism + per-body/lens calibration, larger body, the **retrofocus tax** on wide/fast lenses (cross-ref bestiary), and it **occludes the sensor while focusing** so no pre-shot scene analysis (face/eye); for video the mirror stays up, forfeiting all advantages. Reflex → mirrorless; AF migrates onto the main sensor.

▸9.6.4 On-sensor PDAF and dual-pixel AF⧉

• **masked / phase-detect pixels**: scatter pixels across the imaging sensor whose left or right half is masked, recovering the two pupil views *on the main sensor* — no separate module (enables mirrorless/live-view PDAF). [`fig-dual-pixel-af`]

• **dual-pixel AF**: split **every** photosite into **two half-photodiodes under one microlens** (reusing the FUNDAMENTALS microlens stack) — each pixel is its own phase-detect pair, giving dense, fast AF *and* a low-baseline disparity map (small depth signal usable for portrait-mode and refocus).

▸9.6.5 Depth from focus / defocus, and learned subject AF⧉

• **the two are different strategies**: **depth from focus** *sweeps* the focus through its range and picks, per pixel, the **setting that maximizes sharpness** — many shots, no blur model needed (the focus position *is* the depth). **Depth from defocus** instead estimates depth from the **amount of blur** in just **one or two** shots, given a **known blur (PSF) model** of how defocus scales with depth and aperture — far fewer frames, but it leans on the model.

• **depth from focus**: sweep focus, find for each pixel the position of **maximum sharpness** → depth (also the basis of **focus stacking**, *Depth of field* — the focal-stack sharpness map is this same computation, [Stanford focal-stack paper](https://graphics.stanford.edu/papers/focalstack/focalstack.pdf)).

• **depth from defocus**: from a few images at different focus/aperture, infer per-pixel **blur radius** → depth, with far fewer shots (Pentland 1987; Subbarao). The defocus PSF is the bridge to deconvolution (*Aberrations correction*). [`fig-depth-from-defocus`]

• **depth-from-defocus *as autofocus* — Panasonic DFD (Depth From Defocus)**: rather than build a depth map, use defocus to **drive focus**. With a stored model of how a given lens's blur (PSF) changes with focus, the camera compares **two frames at slightly different focus** and infers the **direction *and* amount** of defocus — a PDAF-like, near-single-step cue from an ordinary **contrast-detect** sensor with **no phase pixels**. It rides on CDAF (confirms at the sharpness peak) and is strong for video; its weakness is the reliance on a **per-lens blur profile** (and on the subject holding still between the two frames).

• **subject / eye AF (learned)**: a detector finds **faces, eyes, animals, vehicles**, and tracks them, then drives PDAF to the chosen point — AF's *decision* layer is now a recognition problem (cross-ref the learned-detection chapter). Other rangefinding aids historically/peripherally: active **IR / ultrasonic** (early Polaroid SONAR) and **LiDAR/ToF** assist in phones.

▸9.6.6 Focusing in astrophotography⧉

• **why autofocus fails**: stars are **dim point sources at infinity** with almost no contrast for CDAF and too little light for PDAF — so astro focus is essentially **manual** and demands *more* precision than daytime (a star is the ultimate test of focus; the circle of confusion must be tiny)

• **you can't just rack to infinity**: most lenses **focus past infinity** (for thermal expansion / AF overshoot), so the "∞" hard stop is *not* in focus — you must focus empirically each session, and **refocus as the temperature drops** (the barrel contracts)

• **the practical methods**: focus on a **bright star** with **magnified live view**; or use a **Bahtinov mask** — a grid placed over the aperture whose **diffraction spikes** form an X with a central spike that is centred *only at exact focus* (a precise, eyeballable null); software **FWHM / HFD** metrics on a star automate the same idea, and **motorized focusers with autofocus routines** (minimize star size vs focuser position — contrast-AF, done deliberately and slowly) are standard on deep-sky rigs

• cross-ref: the daytime AF families above (why they don't transfer); *Depth of field* (tiny CoC); FUNDAMENTALS Focus (the practical operator's view)

---

▸9.7 Depth of field⧉

`fig-dof-double-cone` · object-space DoF as a double cone in front of the lens, waist on the focus plane (the blur for one scene point) 🟨

`fig-dof-coc` · depth of field via the circle of confusion: the double cone behind the lens; points whose blur disk stays within the CoC limit c at the sensor are "acceptably sharp" → near/far DoF limits; CoC + hyperfocal H=f²/(N·c) labeled ✅

⬜ figure not yet created

pointer to FUNDAMENTALS for the full geometry)

`fig-bokeh-shapes` · bokeh: out-of-focus highlights take the aperture's shape — circular (many blades / wide open) vs polygonal (few blades, stopped down); synthetic defocus-disk fields + aperture insets ✅

`fig-cats-eye-vignetting` · cat's-eye bokeh: off-axis defocus disks clipped into lemon / cat's-eye shapes by optical (mechanical) vignetting toward the frame edge; round at centre → clipped at edge, tilted toward centre ✅

`fig-focus-stacking` · optics-chapter illustrative figure (07-07 Figure 4): a stepped-focus stack → sharpness selection → all-in-focus composite. Synthetic per-slice defocus on one photo (`sourced/corn-cobs.jpg`, © Frédo Durand) — license-safe. The full real-data treatment lives in part-08 `fig-focalstack-*` ✅

equations

**all imported** from FUNDAMENTALS (circle of confusion, hyperfocal $H=f^2/(Nc)$, near/far limits, DoF scaling). New here: aperture-shape ⇒ defocus-disk shape (the disk is the **image of the aperture stop**)

cat's-eye area loss from off-axis pupil clipping.

▸9.7.1 Recap: the geometry of focus (pointer, not re-derivation)⧉

• briefly restate the circle of confusion, the near/far limits, **hyperfocal** $H=f^2/(Nc)$, and the three knobs (smaller aperture, shorter $f$, farther focus all deepen DoF) — then **point to FUNDAMENTALS** for the full similar-triangles derivation and the framing-invariance result. [`fig-dof-coc`]

• restate the load-bearing caveat: **defocus is not a 2-D convolution** (it is depth- and occlusion-dependent) — the reason naive portrait-mode fakes fail and depth-aware/light-field methods are needed (forward-ref Advanced).

▸9.7.2 The bokeh look: shape and structure of the blur⧉

• the out-of-focus highlight is the **image of the aperture stop**, so its **shape = the aperture's shape**: a round iris → round disks, a few straight blades → **polygons** (pentagons, heptagons). [`fig-bokeh-shapes`]

• **disk structure** sets "good" vs. "bad" bokeh: a **bright-edged** disk (from over-corrected spherical aberration) looks **busy/nervous**; a **uniform or soft-edged** disk (under-corrected, or apodized — *Special optics*) looks **creamy**. The *whole* point of a portrait lens is the disk profile, not just resolution.

• **cat's-eye / lemon bokeh**: toward the frame edge the pupil is **clipped by the lens barrel and front/rear elements** (optical vignetting), so off-axis defocus disks are **cut to lemon shapes**, swirling toward the corners; stopping down restores round disks (and reduces corner vignetting). [`fig-cats-eye-vignetting`]

• ties to *Compound lenses* (pupil/vignetting) and *Special optics* (mirror-lens doughnuts, anamorphic ovals, apodizers).

▸9.7.3 Extending DoF: focus stacking⧉

• **focus stacking**: capture a **stack** at stepped focus distances, then composite the **sharpest pixels** from each into one **all-in-focus** image — the standard fix where DoF is hopeless (**macro**, product, landscape near-to-far) ([Stanford focal-stack paper](https://graphics.stanford.edu/papers/focalstack/focalstack.pdf)). [`fig-focus-stacking`]

• requires per-pixel **sharpness selection** + alignment (focus *breathing* changes magnification between shots → must register); a multi-image-fusion problem (cross-ref the fusion/stacking machinery).

• **the sharpness map is depth-from-defocus**: estimating, per pixel, *which slice of the stack is sharpest* is exactly **depth from focus/defocus** (cross-ref *Focus, autofocus*) — the focus position of maximum sharpness corresponds to that pixel's **depth**, so a focal stack yields an all-in-focus image **and** a depth map for free.

• contrast with *increasing* DoF by stopping down — which costs light and eventually **diffraction** softens everything (FUNDAMENTALS): stacking beats the diffraction wall.

▸9.7.4 Controlling and faking DoF⧉

• **shallow on purpose**: open up / step closer / longer lens to *magnify* the subject — recall DoF is set by **magnification**, not focal length per se (FUNDAMENTALS framing-invariance).

• **fake / computational DoF (portrait mode)**: estimate a **depth map** (dual-pixel disparity from *Focus*, stereo, or learned mono-depth) → apply **depth- and occlusion-aware** blur with a chosen disk shape. Honest about why it is hard (hair, edges, transparency) — full treatment forward-referenced to the Advanced computational-camera part; here we only set up *why* a 2-D blur is wrong and *what* a depth map buys.

• **reminder — defocus is NOT a single 2-D convolution**: the blur **kernel size varies with depth** (a far point blurs more than a near one), and **occlusion boundaries break shift-invariance** (a sharp foreground edge bleeds onto a blurred background, not symmetrically) — so no one uniform kernel applies. This is exactly why portrait-mode background blur needs a **depth map** rather than a flat Gaussian (cross-ref the *depth-of-field geometry* recap above and FUNDAMENTALS [[Manuscript/Draft — 02-02 Image formation and linear perspective|Image formation §Depth of field]]).

• **sidebar — coherent imaging (Barbastathis): out-of-depth is *black*, not blurry**: everything above is **incoherent** imaging, where an out-of-focus object still contributes light (a blur disk). George Barbastathis (MIT) notes that **coherent** imaging built around a *prepared* depth can do something qualitatively different — off-depth objects contribute **no light at all**, so they come out **black, not blurry** (incoherent imaging adds positive *intensities*, so every depth contributes; coherent imaging adds *complex amplitudes*, which can interfere destructively and cancel). Reframes DoF as a hard **on/off depth gate** rather than a graceful blur.

---

▸9.8 Fake (synthetic) depth of field⧉

---

▸9.9 Glare suppression⧉

`fig-lens-flare-ghosting` · stray light: a bright source, a ray reflecting off two internal lens surfaces → a "ghost" landing displaced on the sensor, plus a veiling-glare wash; ghost vs flare labelled ✅

`fig-dr-clip-vs-noise` · one exposure's two walls — clipping at full-well above, the noise floor below — shading the narrow band of scene contrast that fits between them 🟨

`fig-ar-coating-interference` · anti-reflection coating: a quarter-wave film cancels the front-surface reflection by destructive interference (top-surface reflection vs film–glass reflection, extra path λ/2 → out of phase); two emergent wavelets summing to ~0 + design rules $n_c=\sqrt{n_1 n_2}$, $d=\lambda/4n_c$ ✅

`fig-lens-hood-baffles` · lens hood + internal blackened baffles: an off-axis source (sun outside the frame) is blocked by the hood; stray light that sneaks in is trapped/absorbed by baffles; wanted image-forming rays (within the angle of view) pass through to the sensor ✅

`fig-veiling-glare-psf` · a PSF with a bright core + a broad low-level skirt (veiling glare) on a log-radial profile; second panel shows the integrated veil lifting blacks / reducing contrast ✅

equations

Fresnel reflectance at an interface $R=\left(\frac{n_1-n_2}{n_1+n_2}\right)^2$ (~4%/surface for air–glass → why an uncoated multi-element lens loses huge contrast)

**quarter-wave AR condition** — coating index $n_c=\sqrt{n_1 n_2}$ and thickness $\lambda/4$ for destructive interference

veiling glare modeled as the image $*$ a **broad glare PSF** → deconvolution.

▸9.9.1 Where stray light comes from: flare, ghosting, veiling glare⧉

• at **every** air–glass surface a few percent of light **reflects** (Fresnel, ~4% uncoated); a stray ray can reflect **twice** and reach the sensor as a **ghost** — a discrete, often aperture-shaped bright blob, usually mirrored across center from a strong source. [`fig-lens-flare-ghosting`]

• many weak scatters and reflections sum into **veiling glare** — a low-frequency **wash that lifts the blacks** and crushes contrast (worst with a bright source just outside the frame, or shooting into the sun/backlight).

• **why compound lenses make it worse**: $N$ surfaces give $O(N^2)$ inter-reflection pairs — a 15-element lens has dozens of surfaces, so without coatings it would be hopeless (cross-ref *Compound lenses* element count, and the T-stop transmittance loss).

• **flare as an aesthetic (cinematography)**: lens flare was long considered a **flaw / a no-no** in cinematography (the whole point of coatings and hoods is to *kill* it), but **modern films deliberately court it** for a dramatic / realistic, "you're really there" effect — **J.J. Abrams' anamorphic streaks** are the cliché. It also **conveys very high contrast / the presence of a bright source**: flare reads as "that light is *intense*," a cue the eye trusts, so a little flare can sell a bright window or the sun better than a clean frame. (An aesthetic now also **faked computationally** — cross-ref anamorphic flare, *Special optics*.)

• **not the same as a sunstar**: lens **flare/ghosts** are **inter-surface reflections**; the **star / sunstar** pattern radiating from a bright point is a **diffraction spike** — diffraction at the **aperture blades** (the FT of the blade polygon, cross-ref *Aberrations*/wave effects), present even in a single perfectly-coated element. Different physics, often confused.

▸9.9.2 Hardware suppression⧉

• **anti-reflection coatings**: a thin transparent film of index $n_c=\sqrt{n_1 n_2}$ and thickness **λ/4** makes the reflections off its two faces **interfere destructively**, cancelling the surface reflection over a band; **multi-layer** stacks broaden it to the whole visible range (the colored sheen of modern coatings). This dropped per-surface reflection from ~4% toward ~0.1–0.5%, which is what *made* many-element lenses practical. **Cross-ref the FUNDAMENTALS lens-coating history.** [`fig-ar-coating-interference`]

• **sidebar — the coating, Zeiss, and the spoils of war**: Smakula's 1935 Zeiss-Jena patent (DE 685767), classified as a military secret (the red "T"/Transparenz mark, ancestor of T\*); independently re-invented in the open by Strong 1936 / Blodgett 1938; then the 1945 Allied seizure of German patents (FIAT microfilming, US Alien Property vesting, 1946 London Agreement) and the Jena↔Oberkochen split of Zeiss scattered it. Lesson: it spread because it was discovered in the open more than once, not because one patent fell.

• **everyday encounters with the same trick**: these are the **same coatings on eyeglasses** (anti-reflective lenses — fewer ghost reflections, less glare for the wearer and for people looking at you) and the **anti-glare / AR coatings on displays and picture-frame glass** (museum / "non-reflective" glass) — the quarter-wave **interference** trick is all around you, not just inside camera lenses.

• **lens hood**: blocks off-axis light (a bright source *outside* the frame) from striking the front element at all — the cheapest, most effective anti-flare tool. [`fig-lens-hood-baffles`]

• **internal baffles, blackened edges, light traps, and anti-reflective barrel paint**: stop grazing rays from skipping down the inside of the barrel; critical in **(space) telescopes**, where stray sunlight must be baffled to image faint targets (cross-ref *Special optics* telescopes). [`fig-lens-hood-baffles`]

▸9.9.3 Computational deflare and glare deconvolution⧉

• model veiling glare as the true image **convolved with a broad glare PSF** (plus discrete ghosts); **measure** that glare spread function and **deconvolve / subtract** it to **recover contrast** — same PSF-deconvolution toolbox as *Aberrations correction*, applied to stray light. [`fig-veiling-glare-psf`]

• **dehaze-style** contrast recovery (cross-ref [[Slopendium/08 Single-image computational photography|dehazing]]) and **lens-specific flare/ghost removal** (learned, since ghost patterns are lens-specific). Important for **HDR** capture, where veiling glare from bright sources otherwise corrupts the dark-end measurements (Talvala 2007). Forward-ref Advanced for learned deflare.

---

▸9.10 Optical stabilization⧉

`fig-handshake-motion-blur` · hand-shake motion blur: a small angular shake $\Delta\theta$ over the exposure smears a point into a streak of length $\approx f\,\Delta\theta$ on the sensor (lens as pivot, focal length $f$); the $t<1/f$ shutter rule of thumb ✅

`fig-ois-vs-ibis` · two stabilization strategies side-by-side: lens-shift OIS (floating lens group moves) vs sensor-shift IBIS (sensor moves on a magnetic stage), moving element highlighted, same gyro/IMU input ✅

`fig-gyro-feedback-loop` · the stabilization control loop as a block diagram: gyro/IMU senses angular rate → controller → actuator moves the lens group/sensor → position sensor feeds residual error back (sense→control→actuate→measure, ~kHz) ✅

`fig-digital-vs-optical-stab` · optical (in lens/sensor, before capture, full resolution & FOV) vs digital/electronic IS (crop-and-warp in software after capture): the EIS safety-margin crop + warp costing resolution/field of view vs the optical in-lens/sensor correction ✅

equations

image-plane shift for a camera rotation $\Delta\theta$ is $\approx f\,\Delta\theta$ (so **longer $f$ ⇒ more blur ⇒ more correction needed**)

stabilization benefit quoted in **stops** = $\log_2$ of the shutter-time ratio it allows

the actuator must move the element/sensor to make the **net image displacement ≈ 0** over the exposure.

▸9.10.1 The problem: hand-shake and the blur budget⧉

• hand tremor **rotates** the camera by small angles during the exposure; the image shifts by $\approx f\,\Delta\theta$, so the **smear grows with both shutter time and focal length** (the old "1/focal-length" minimum-shutter rule). [`fig-handshake-motion-blur`]

• restate the **blur budget** from FUNDAMENTALS: you want a short shutter to freeze shake, but that costs light → wider aperture (less DoF) or higher ISO (more noise). Stabilization **relaxes** that budget by letting you use a *longer* shutter without smear.

• **key scope limit (state up front)**: stabilization fights **camera** motion only. A **moving subject** still blurs — no amount of IS helps; that is **motion deblurring** (forward-ref Advanced/Motion).

▸9.10.2 Optical stabilization: lens-shift vs. sensor-shift (IBIS)⧉

• **sensing**: **MEMS gyroscopes** (and accelerometers) measure the camera's angular velocity in real time; a controller computes the counter-motion and drives an actuator in a **feedback loop** to null the image displacement. [`fig-gyro-feedback-loop`]

• **lens-shift OIS**: a **floating lens group** moves **laterally** (voice-coil actuators) to **steer the image back** onto its original spot. Pros: optimized per lens, the viewfinder/AF sees the stabilized image; corrects shift well. (Canon IS / Nikon VR.) [`fig-ois-vs-ibis`]

• **sensor-shift / IBIS**: the **sensor** rides a magnetic stage and **translates (and rotates/roll)** to follow the image. Pros: works with **any** lens (even un-stabilized/adapted), can correct **roll** (a 5th axis) and pitch/yaw translation; combines with lens OIS ("dual IS"). [`fig-ois-vs-ibis`]

• **how many stops**: modern systems buy **~3–7 stops** of slower shutter; the limit is residual low-frequency drift, actuator travel, and (for IBIS) keeping the moved sensor inside the lens's image circle.

▸9.10.3 Digital / electronic stabilization and the computational alternatives⧉

• **electronic / digital stabilization**: **crop** a margin and **warp each frame** to cancel inter-frame motion (video) — no moving parts, but **costs resolution/field of view** and works *between* frames, not *within* a single exposure (so it doesn't reduce per-frame motion blur the way optical IS does). Phones combine gyro data + optical flow for very stable video. [`fig-digital-vs-optical-stab`]

• **rolling-shutter** interaction: on CMOS, fast motion + electronic stabilization must also undo **rolling-shutter** skew (cross-ref Motion/Video).

• **the computational-photography flip**: instead of one long stabilized exposure, take a **burst of short, blur-free frames** and **align + merge** them — short shutters freeze shake, alignment removes the offset, averaging recovers SNR ($1/\sqrt{N}$). This **handheld burst** approach (cross-ref multi-frame denoising / super-res) is how phones get sharp low-light shots without bulky IS — the software counterpart to optical stabilization (forward-ref Advanced).

▸9.11 The eye as an optical instrument: vision and its correction⧉

`fig-eye-as-camera` · the eye as a camera: cornea + crystalline lens = objective, iris/pupil = aperture, retina = sensor (fovea), accommodation = autofocus — labelled cross-section ✅

`fig-refractive-errors` · the four refractive errors — myopia / hyperopia / astigmatism / presbyopia — where the image focuses vs the retina and the correcting (negative / positive / cylindrical / add) lens ✅

⬜ figure not yet created

bifocal vs progressive vs AF-glasses focal zones [schem] fig-bifocal-progressive-af

▸9.11.1 The eye as a camera⧉

• **cornea + crystalline lens** form the objective (most of the power is the **cornea**); the **iris/pupil** is the aperture (~f/2.1 dilated to ~f/8 constricted); the **retina** is the sensor (the **fovea** is the sharp centre); **accommodation** — the ciliary muscle reshaping the soft lens — is the eye's **autofocus** (cross-ref *Focus*)

• like any lens the eye has **aberrations** and a diffraction-limited **PSF**; visual acuity ("20/20") is its resolution spec — the optical chain of this whole part, in wetware

▸9.11.2 Refractive errors: the eye out of focus⧉

• **myopia (near-sighted)**: eye too long / too powerful → image focuses **in front of** the retina; distant objects blur → corrected with a **negative (diverging)** lens. (Now near-epidemic, linked to near-work and lack of outdoor light.)

• **hyperopia (far-sighted)**: eye too short → focus falls **behind** the retina → corrected with a **positive (converging)** lens

• **astigmatism**: the cornea/lens is not rotationally symmetric (**toric**) → different power in different meridians, orientation-dependent blur → corrected with a **cylindrical** lens — literally this part's *astigmatism*, in the eye

• **presbyopia**: with age the lens stiffens and **accommodation fails** → can no longer focus up close (reading distance); near-universal after ~45 — the driver of the next section

• aside — **the pinhole effect**: squinting (or **pinhole glasses**) sharpens by stopping the pupil down (more DoF), at the cost of light (FUNDAMENTALS — DoF)

▸9.11.3 Correcting vision⧉

• **spectacles** (the original computation-free fix), **contact lenses** (sit on the cornea → wide field, no prism/distortion), **intraocular lenses (IOLs)** (replace the lens, e.g. cataract surgery), and **refractive surgery** (**LASIK / PRK** reshape the cornea — surgically editing the prescription)

• eyewear is **applied lens design**: anti-reflection **coatings**, **high-index** glass for thinner lenses, **aspheric** surfaces to cut aberration, **photochromic** (auto-darkening) and **polarized** sunglasses (cut glare — the same physics as the camera polarizer)

• **the pinhole trick — a smaller aperture sharpens any eye.** Squinting, looking through a pinhole, or **pinhole (stenopeic) glasses** improve vision for *any* refractive error with no prescription: shrinking the entrance aperture keeps only the near-axial rays, so the retinal **blur circle collapses** — the eye's version of *stopping a lens down for more depth of focus*. The catches are a camera's: the image gets **dim**, and a tiny enough hole is **diffraction-limited**.

• **conversely, a wide-open pupil makes things worse.** In dim light (or under dilation) the pupil opens to admit marginal rays — and exactly as a wide camera aperture gives a **shallow depth of focus**, it also exposes the eye's own **aberrations** (spherical aberration, "night myopia," halos/starbursts around lights). Night acuity drops for the same reason a lens is softest wide open and sharpest stopped down; stopping the eye down (bright light, or a pinhole) fixes both the defocus *and* the aberrations at once.

• **pinhole as sunglasses.** Because it also blocks most of the light, a pinhole or narrow slit doubles as a glare-cutting **dark aperture** — sharpening *and* dimming together. Inuit **snow goggles** (a slit cut in bone/antler/wood) are exactly this, used for millennia against snow glare.

• **side bar — a short history of correcting human vision.** Magnifying **reading stones** (plano-convex glass laid on text) appear by the 11th century; the first wearable **spectacles** in Italy around **1286** (convex lenses for presbyopia and hyperopia), with concave lenses for myopia following in the 15th–16th centuries. **Bifocals** are credited to Benjamin **Franklin** (1784). Practical **contact lenses** date to the late 19th century (glass scleral lenses, Fick 1888), turning soft/hydrogel in the 1960s–70s. **Intraocular lenses** begin with Harold **Ridley** (1949). **Refractive surgery** — radial keratotomy, then **LASIK / PRK** — reshapes the cornea from the 1980s–90s. The thread: each step moves the correction **closer to the eye** — on the nose, on the cornea, inside the eye, then carved into the cornea itself.

▸9.11.4 Presbyopia and the bifocal problem: from bifocals to AF glasses⧉

• the problem: presbyopia needs **different power for near vs far**, but a simple lens has **one** focal length

• **bifocals / trifocals** (Franklin): two or three zones with a visible line (near segment low) — abrupt jump and an "image-jump" artifact

• **progressive (varifocal) lenses**: a **continuous gradient** of power top→bottom, no line — at the cost of **peripheral distortion** and a narrow usable corridor

• **adjustable / autofocus (AF) glasses** — the *active* alternative: **focus-tunable lenses** whose power changes on demand, so one pair spans the whole range:

• **fluidic / Alvarez lenses** — manually pumped or slid to set power (adjustable-focus eyewear for the developing world — Josh Silver)

• **liquid-crystal / electronic lenses** — a voltage changes the LC retardance → an electrically **switchable or tunable** add-power (a programmable **phase** element — the same family as the **LC-SLMs** of the Advanced part); paired with an **eye-tracker / rangefinder** they *autofocus* to wherever you look (Deep Optics / "32°N")

• the throughline: **AF glasses are accommodation re-implemented in hardware** — the eye's lost autofocus restored by a tunable lens, the wearable cousin of camera autofocus and of the phase modulators in the Advanced part