Materials: Chapter 2 of Onno Pols’ lecture notes, Chapter 3 of Kippenhahn’s book, Sec. 1.5 of “Tapestry of Modern Astrophysics” by S. N. Shore, pages 134-139 Clayton’s book.
In the previous lecture, we have discussed that it is appropriate to use thermodynamics to describe the local properties of the stellar gas. We had already established that (self-)gravity determines the pressure gradient in the star through the hydrostatic equilibrium equation (which is just the particular form that momentum conservation takes in a stellar context). Therefore, in this lecture we will explore the relation there may be between gravitational physics and thermodynamics which will allow us to get some crucial facts about stars.
The virial theorem relates the kinetic and potential energy of system of point masses interacting through long-range forces, and thus allows one to connect thermodynamics and gravitational physics. It is a statistical statement on mutually interacting particles (e.g., because of their mutual gravity!), and thus it is very general. The etymology of the name comes from “vir” which is latin for “forces”.
Let’s start with a very general derivation (see for example Goldstein’s classical mechanics book or Shore’s Tapesty of Modern Astrophysics Sec. 1.5). For a collection of particles with (constant) mass $mi$, Netwon’s second law states that the change in their momentum is equal to the total force they are subject to:
Note that here $\mathbf{r}i$ and $\mathbf{F}i$ are vector quantities. We take the first-order spatial moment of this distribution, which means we take the dot product of each term by the position of the particles and sum over all the particles. Why? The reason is to make something like the work done by forces (and thus the potential energy) appear in the second term:
The r.h.s. looks like a sum of work done by the total foce on each
particle: $∑i \mathbf{F}i ⋅ \mathbf{r}i ≡ W$. This is the so
called “virial”, which contains only long-range forces between the
particles (collisions cancel out in the sum because of Netwon’s 3rd
law). This by conservation of energy has to be equal to the net
potential energy available
The l.h.s. can be re-written using the chain rule for the second time-derivative:
And now we have on the right hand side (assuming the mass of the particles to be constant) twice the second time derivative of the moment of inertia $I = ∑i mi ri2$ and twice the kinetic energy $E_\mathrm{kin} = ∑i mi vi2/2$ (because of course $vi = \dot{ri}$). So putting the r.h.s. of this last equation in the first spatial moment of Newton’s second law (Eq. \ref{eq:first_moment_second_law}) we obtain:
This is the general form of the Virial theorem derived by Clausius. In general, in the virial theorem there is a second derivative of the moment of inertia on the r.h.s., which is often forgotten. We did not specify almost anything about the system (see Goldstein for a momentum-based derivation where the assumption of constant masses mi of the particles implicit in the way we wrote Newton’s second law is not even needed).
The virial theorem effectively says that for a bound system of self-interacting particles, there is a limited amount of potential energy that can be extracted, because part of the potential will need to go into the kinetic terms and the acceleration of the moment of inertia.
So far, this general form doesn’t give us much intuition on its meaning. We have a relation between the kinetic energy of a collection of particles, its potential energy, and the second time-derivative of its moment of inertia. But in the frame of the center of mass of such collection, the kinetic energy of the particles is a measure of their thermal energy, and this is non-trivial: why would the thermal energy know about the potential energy! This statement is however very general and its implication for stellar structure and evolution are profound.
As a demonstration for the generality of the virial theorem, we can apply it to a collection of particle with only 2 particles subject only to the gravitational force, that is a binary system of two point-mass stars on a Keplerian orbit!
One can write the total energy of the binary as:
where in the last term we are using Kepler’s laws. The gravitational potential energy is $E_\mathrm{pot} = -GM1M2/a$, thus making it explicit we obtain: $Ekin = |Epot|/2$. Since Keplerian orbits in Newtonian gravity are stable (energy losses through gravitational waves make this statement not physically true, but this is only relevant for close binaries of compact objects), this implies that the second (and first) time-derivative of the moment of inertia of the orbit is zero, and thus this is the virial theorem! Keplerian binary orbits are “virialized”.
N.B.: Here we considered the case of just 2 particles, but nothing
stops us to consider
Let’s now re-derive the Virial theorem for a star. Let’s assume a
continuous description of the stellar gas, hydrostatic equilibrium and
spherical symmetry, and write explicitly the forces acting on each
shell
By analogy with what is done in the general derivation above, we want
to make the gravitational potential appear on the r.h.s., which can be
done by multiplying by $4π r3$ and integrating in
(so that $E_\mathrm{pot}=|∫0M -Gm/r dm| >0$). The l.h.s. becomes integrating per parts
where the first term is zero:
Note that if we had kept a non-zero
Let’s interpret the r.h.s., which in the general derivation would be
part of the $∑i \mathbf{F}⋅ \mathbf{r}i$ term. By dimensional
analysis we know it has to have the dimension of an energy. In fact,
still by dimensional analysis we can infer that [P/ρ] = [E]/[L3] /
[M]/[L3] = [E]/[M] has the units of a specific energy (i.e., energy
per unit mass), and thus
To specify Φ, let’s consider the second law of thermodynamics per unit
mass: the heat exchanged by a gas (
We want to derive a relation between
Now by differentiating
where $Γ\equivΓ1$ is the first adiabatic index.
Finally, substituting in the first-order moment of the hydrostatic equilibrium Eq. \ref{eq:vir} equation we have:
Assuming that
For a monoatomic gas where each particle has 3 degrees of freedom
(motion in 3 direction, no internal degrees of freedom for
point-particles),
Now lets define the total energy of the gas:
Again, for a monoatomic gas with
From the form of
One can encounter cases in stellar evolution where
-
Q: for a star made only of photons, what is
$Γ$ ? Hint: think of the $P≡ P(u)$ relation!
Let’s consider a star made of a perfect mono-atomic gas with
The contraction increases the gravitational potential energy
$E_\mathrm{pot} \propto GM2/R$ since
For an ideal gas, the internal energy is related to the mean temperature by
where $N = ∫ ρ/(μ mu)dV = M/(μ mu)$ is the number of particles, with
where $C = \{∫0M Gmdm/r\}/\{GM2/R\}$ is a constant of
From this equation, several important facts follow for any self-gravitating star in hydrostatic equilibrium:
- the mean temperature of a star depends only on its mass
$M$ and radius$R$ (and chemical composition through μ); -
as a star contracts (
$R$ decreases at constant mass$M$ ), the temperature must rise! This, as we will see, governs the evolution of stars. - A self-gravitating collection of particle with finite temperature
must radiate away energy, thus it will lose energy. This energy
loss, since $E_\mathrm{tot} = |E_\mathrm{pot}|/2 = - CGM2/2R <0$ implies that
$R$ must decrease. But then,$\langle T \rangle$ must increase! This is the “gravothermal” collapse of a cloud/star. As thermal energy is lost to radiation at the surface, the (average) temperature raises! This is why stars can be thought of objects with negative heat capacity: it heats up as it loses energy! This is a property typical of self-gravitating systems only (stars and gravitationally bound stellar clusters) and because of the virial theorem it does not violate energy conservation. - The gravothermal collapse must go on until either:
- an internal energy source, compensating for the surface energy loss kicks in, or
- the ideal gas approximation does not hold anymore.
As we will see, both 1. and 2. occur in nature: 1. is the typical option for stars using nuclear fusion as internal energy source to delay the collapse, and 2. is what occurs for white dwarfs, where quantum mechanical effects stop the collapse.
Thus, gravity determines not only the structure (through the hydrostatic equilibrium equation) but also the evolution of a star. It dictates that as the star loses energy, it must heat up. Because of this, at some point nuclear fusion can occur (as we will discuss later): stars don’t shine because they burn, stars burn because they shine. Without the loss of (internal/thermal) energy by radiation they would not contract, without contracting they would not reach temperatures high enough to do nuclear fusion. The nuclear fusion is a consequence of the fact that stars shine, and not the cause. Stars shine as any object with finite temperature must do.
Internal energy sources such as nuclear fusion ultimately only delay the gravothermal collapse of the stars until either the ideal gas approximation does not hold (for white dwarfs) or even nuclear fusion cannot stop the collapse, and gravity wins, leading to a supernova explosion and/or the formation of a black hole.
This is by definition the timescale it takes a star to radiate away all its internal energy at a constant rate in absence of any other energy sources. Note that this is a global timescale for the whole star!
Let’s call L the “luminosity of the star”, that is the rate at which is loses energy from radiating away photons at its surface. Note that L has the units of power [L] = [E]/[t].
Then by definition:
where we have used the virial theorem and set
Clearly $τKH \gg τff
- Q: A star may lose energy also by means other than photons, especially neutrinos. If such energy losses are important, how can we modify Eq. \ref{eq:tau_KH}?
Using the virial theorem:
- Estimate the average temperature of the Sun and compare it with its
surface temperature. Is the Sun in global thermal equilibrium? (you
can assume C∼1 in the notation used in class/in my notes, or use the
mass and radius profile from the
MESA-web
model you already computed to calculate a more precise value for C) - Find an order of magnitude (∼) relation between the average sound speed in the star and the escape velocity (assume the star is made of ideal gas of temperature \langle T \rangle).
- Demonstrate that if a binary loses instantaneously an amount of mass greater than half the total mass of the binary, Δ M ≥ (M1+M2)/2, then the orbit is unbound. This can happen when a supernova goes off in a binary (cf. Blaauw 1961)!
- determine a condition for the minimum mass of a gas cloud to collapse as a function of its temperature and density (Hint: collapse ⇔ d2 I/dt2<0)
Using the model of the 1Mo star computed earlier with MESA-web
:
- use one profile file to plot Γ1≡\Gamma1(m) for a Sun-like star of roughly the same age as the Sun. Label the age of the star for the corresponding profile.
Let’s use MESA-web
to revisit the late 19th century/early 20th century
debate on the age of the Earth/Sun. (Astro)physicists calculated the
Kelvin-Helmholtz timescale and assumed that this was the age of the
Sun (from Eq. \ref{eq:KH_timescale_scaling}). Geologists and
biologists instead argued for a much longer age.
Let’s assume we know the age of the Earth to be 4.5×109 years (this is
what the geologist and biologists argued!), but let’s assume, like
physicists had to before knowing nuclear physics, that there is no
internal energy source in the Sun. In MESA-web
, there is a Burning
Modifiers
option where you can disable energy release and chemical
evolution. Make a model of a 1Mo star without energy release
and/or chemical evolution, until 4.5×109 years, and plot an HR
diagram. Plot also the 1Mo star you ran previously (which should
have included the nuclear energy release and chemical evolution).
N.B.: we can make a computer code do whatever we want! Never take computer simulations as ground truth, they are at best only as good as the input!
Write up a short paragraph answering the following:
- Which agrees better with the observation of the real Sun at the Earth’s age?
- What is its radius and how does it compare to the measured Ro?
(hint: you can plot the lines at constant radius on the HRD, or read
the outermost radius from the last
profile*.data
file you saved) - What is the average density of the model without energy generation? (an order of magnitude estimate is sufficient for the purpose).