Problem Set
NBPhO 2025
8. Phase Spiral 9 pts
Infer the Milky Way's vertical gravitational potential and dark matter density from the phase-space winding spiral of nearby stars (after Antoja et al. 2018, Guo et al. 2024).
Part (i) — 1.0 / 1.0 pts
| Criterion | Points | Result |
|---|---|---|
| Idea of using Gauss’ law | 0.3 | ✓ explicit “Apply Gauss’s law to a ‘pillbox’ of cross-section extending from to “ |
| Formula relating the mass inside with gravitational flux | 0.3 | ✓ stated up front |
| Application of Gauss’ law on a cuboid | 0.2 | ✓ end caps give via reflection symmetry, |
| Final result | 0.2 | ✓ |
Part (ii) — 0.5 / 0.5 pts
| Criterion | Points | Result |
|---|---|---|
| Noticing that the movement is that of a harmonic oscillator | 0.3 | ✓ “is that of a SHO with “ |
| Expression for the oscillation period | 0.2 | ✓ |
Part (iii) — 2.4 / 2.5 pts
| Criterion | Points | Result |
|---|---|---|
| Making use of the total energy at intersection points being known | 0.7 | ✓ ” directly, no needed” at crossings |
| Interpolating the values at from neighbouring crossovers (explicitly mentioned) | 0.8 | ✓ linear interpolation in the crossing index, , with explicit justification via the matching-parabola picture |
| Tabulating the potential — six points | 0.7 | ✓ six pairs from to |
| vs correctly plotted | 0.3 | ~ 0.2 / 0.3 — an ASCII schematic with axes and labelled points; conveys the trend (parabolic at small , more linear at large ), but the data points sit roughly one gridline above their tabulated values, so the rendering is qualitative rather than quantitatively faithful |
Discrepancies
The six tabulated pairs differ systematically from the official’s: Claude reads with , while the official reads with . The two sets parametrise approximately the same curve , but Claude’s innermost crossing is read closer to the origin than the official’s, propagating into the part-(iv) and part-(vi) numerics below. The grading-scheme criteria for part (iii) themselves do not include a numerical-accuracy line, so this does not cost points here — only downstream.
Part (iv) — 0.9 / 1.0 pts
| Criterion | Points | Result |
|---|---|---|
| Connecting with by assuming constant mass density | 0.8 | ✓ inverted at the innermost reliable point |
| If the first data point is not used | −0.2 | not applied — Claude does use his innermost reliable crossing (0.18 kpc); the dropped 0.10 kpc crossing has no inner neighbour and so is correctly excluded as unreliable |
| Final expression for | 0.1 | ✓ |
| Numerical value within 10% | 0.1 | ✗ Claude vs official — 22% off, beyond the 10% band, driven by Claude reading at 0.18 kpc rather than 0.27 kpc |
Discrepancies
: Claude vs official . Both lie inside the canonical Oort-limit range –, but the relative gap is 22%, outside the grading scheme’s 10% tolerance.
Part (v) — 2.0 / 2.0 pts
| Criterion | Points | Result |
|---|---|---|
| Obtaining an expression for the total mass contained within | 0.7 | ✓ Claude integrates once to get for , identifying as the field set by all visible matter inside — the Poisson-integral analogue of |
| Taking the difference between the total mass within and for calculating the dark matter content | 0.9 | ✓ Claude uses the two outer points at and to set up a 2×2 linear system in and , eliminating to extract alone — equivalent in spirit to the official’s mass-difference, but executed without invoking the constant-density placeholder |
| Final expression for based on and | 0.3 | ✓ |
| Numerical value within 10% | 0.1 | ✓ vs official — exact match |
Part (vi) — 1.8 / 2.0 pts
| Criterion | Points | Result |
|---|---|---|
| Idea of using differences in the winding rate between two points on the spiral | 1.0 | ✓ phase advance , with each axis crossing along the spiral advancing by (so between consecutive crossings) |
| Expression for angular frequency in terms of by assuming a harmonic oscillator | 0.5 | ✓ matching-parabola period ⇒ , identical to the official’s expression |
| Picking two points and connecting the age of the spiral, and the winding amount | 0.3 | ✓ inner and outer separated by 5 half-turns (); five further inner/outer pairs tabulated as cross-checks |
| Numerical value within 10% of the solution value | 0.2 | ✗ Claude (preferred pair gives 370 Myr, table cluster 330–540 Myr) vs official — about 35% off the official’s keyed value, beyond the 10% band |
Discrepancies
: Claude vs official . The whole gap is traceable to the part-(iii) graph reads — Claude’s smaller and values give a larger (0.0427 vs 0.0261 Myr) over the same 5 half-turns, which compresses by the same factor. Both numbers fall inside the literature window 300–900 Myr quoted by Antoja et al. (2018); the modern best estimate from Guo et al. (2024) is closer to 500 Myr, almost exactly between the two.
Overall score: 8.6 / 9.0 pts
Three docks, all on numerical accuracy or plot quality.
Numerical answers: (exact match), (matches), six values systematically below the official’s by – in at corresponding rank, (vs official 0.090, 22% off), (exact match), (vs official 620, 35% off).
Commentary
Where this solution goes beyond the grading scheme. The “Overview” front-loads the three structural ideas — 1D Gauss for a slab, amplitude-dependent period from a matching parabola, phase-mixing as a clock — and quotes the small parameter that makes the whole method work. Part (i) follows the boxed acceleration with an “Educational remark” that pre-empts the factor-of--vs- trap (matter on both sides of the test point) and re-derives the same answer from . Part (ii) carries the abstract formula into a number ( for the canonical Oort density), pinning the disc’s vertical “epicyclic” timescale. Part (iii) checks the data internally by computing at every tabulated point and confirming it decreases monotonically with — a one-line falsification test for whether really falls off with height, drawn from the spiral readings alone. Part (iv) frames the answer in the historical “Oort limit” context, contrasting visible () versus dynamical () densities. Part (v) is the most generous — three sanity checks chase the answer (visible surface density inside ; dark fraction ; conversion to , the standard direct-detection number) — and an “Educational remark” justifies the constant- approximation by separation of scales (halo scale-height tens of kpc, disc thickness ). Part (vi) does not just cite one inner/outer pair: a six-pair table exposes the spread, and the discussion attributes it to the matching-parabola formula systematically under-estimating the true period for non-parabolic (by up to in the constant-force limit), with bigger underestimates for outer orbits — exactly the pitfall the problem flags. Antoja et al. (2018) and the Sagittarius-dwarf passage are named as historical context for the answer.
Where the official solution is sharper. Three places. (1) Part (v): the official packages the answer as one closed-form expression, , into which two numbers can be plugged directly. Claude solves a 2×2 linear system in and (more rigorous in that it does not need the official’s “placeholder ” caveat) but the algebra is bulkier and the closed form is hidden. For a solver under time pressure the official’s prescription is cleaner. (2) Part (vi): the official commits to one inner/outer pair, reports 620 Myr in two lines, and stops. Claude’s six-pair table, while pedagogically richer, makes it harder to read off “the answer” — the recommended at the bottom is a compromise across pairs spanning 330–540 Myr, and the spread itself is what costs the 10%-band point. (3) The graph reads: Claude’s innermost crossing at sits inside the official’s , and his Φ values run 35–50% below the official’s at corresponding rank. Without seeing the printed figure pixel-by-pixel one cannot adjudicate, but the official is the keyed answer and Claude’s downstream numerics — and — both fall outside the 10% acceptance band as a direct consequence. A more conservative reading of the spiral would have rescued both numerical-accuracy points.