Quote
It seems probable that most of the grand underlying principles have been firmly established and that further advances are to be sought chiefly in the rigorous application of these principles to all the phenomena which come under our notice.
— Albert Michelson, 1894
lol
Anyways, in the first quarter of the 20th century people realized that the laws of physics known up to that point didn’t do a great job explaining a wide range of phenomena involving electrons, atoms, and light. After a great deal of effort, a new theory (and a new law of motion) emerged in 1924:— quantum mechanics. It is now part of the basic framework for understanding atomic, nuclear, and subnuclear physics. As well as condensed matter (or “solid-state”) physics. Although, I admit to having no idea what that last one is. The laws of motion (Galileo, Newton, etc.) that came before it are called classical mechanics.
Classical mechanics are now understood as an approximation of quantum mechanics, but since much of the structure of quantum theory is inherited from classical theory, it is still absolutely worth going over!
The Principle of Least Action
Take a baseball and throw it straight up in the air. After a few seconds the baseball come back down. Because how high the baseball is depends on the time since it’s been thrown, we can denote it as a function of time: . This is the notation my textbook uses which kinda pisses me off because I’d like to use . For height. Actually let’s just do that. Denote the height of the baseball, as a function of time, as ; this is the trajectory of the baseball. If we plot as a function of (the usual caveat of ignoring air resistance and pretending we’re in a universal gravitational field blah blah blah applies here) then any trajectory has the form of a parabola and there are an infinite number of possible trajectories. Whichever one it follows depends on the (momentum) of the baseball as it leaves your hand.
Buuuut if we say that the baseball returns to your hand at exactly seconds after it leaves your hand, there is ONE trajectory the ball can follow. Now, while there is a simple way to figure out this trajectory for a baseball moving in a uniform gravitational field, we want a method which can be applied to a particle moving in any potential field . (Potential field means “field” of “potential energy” btw) Let us begin with Newton’s law , which is actually a second order differential equation
Show Work
If position is , velocity () is how fast changes; . Acceleration is the derivative of velocity so that’s how we get that. Force is a little more interesting. is potential energy, that is how much stored energy the system has because of where it is. The reason is because being higher up translates to more potential energy. (This is also why a stretched spring where is large, has more energy.) is a function of because potential energy depends on where you are, not how fast you are moving. If we think of as a hill with a ball sitting on it, we can intuit that the ball would roll downhill. The “steepness” of the hill is it’s derivative; . Interestingly, force is defined as , here the minus sign ensures the force points downhill, i.e., toward lower potential energy.
Just a concrete example here if , then the slope is and the force is , and we can confirm this because we know the force of gravity is . Just keep this in mind because thinking like this lets us ignore forces entirely and just derive everything from potential and kinetic energy via the Lagrangian . But that comes later… :)
Let’s reëxpress this second-order equation as a pair of first order equations, where is the mass and is the momentum of the baseball:
Show Work
If momentum is defined as = where , .
So from the definition of we get . That’s our first equation. Next, if we start with again, we can differentiate and take the derivative w.r.t time to get
And then plug in newtons law to get
Yielding
We want to find a solution to these equations such that and , where and are the initial height of your hand when the baseball leaves it, and the final height of your hand when you catch the baseball, respectively.
The textbook talks about using a computer by splitting the problem into intervals but quite frankly, I couldn’t care less so I’m going to skip that. This is all to say there’s this thing called the Principle of Least Action. It says that the action is stationary at any trajectory (wait! what do the braces mean?) satisfying the conditions of motion , at every time .
So to solve the trajectory of the baseball, we just program a computer to find the set of points (ohh its braces cos its a set of points 😌😌😌 so its like ) which minimizes the quantity:
is “action,” defined as where where is kinetic energy and is potential energy. But when we discretize it, our integral becomes a sum, . Dividing by is essentially asking “how much changes if I tweak ” and squaring it is just a mathematical trick because sometimes it could be negative and we want to avoid that and if we use , its “sharper” near zero. An ML background might give insight in why we want to avoid a “sharp” gradient. The minimum is obtained at , where is stationary. just measures how “badly” the whole path “violates” the principle of least action. Anyways, this set of points, joined by straight line segments, gives us the approximate trajectory of the baseball.
Problem: Dyre’s Dilemma
In discussing the motion of the baseball, we have been ignoring a lot of details about baseballs, such as the composition of the interior, the pattern of the stitching, and the brand-name printed on the surface. Instead, the baseball has been treated as though it were essentially a structureless point of mass . It is necessary to make idealizations like this in physics; the real world is otherwise too complicated to describe. But sometimes an idealization misses something crucial. See if you can find what goes wrong in the following argument, which tries to prove that a rolling wheel (or, for that matter, a rolling baseball) can never come to rest through friction with the ground.
“Proof”: […] [T]he forward momentum of a wheel in the positive -direction can only be eliminated by a force applied in the opposite direction. But the only place this force could be applied by friction is the point where the wheel touches the ground. And a force in the negative -direction, applied at this point, will have the effect of making the wheel spin faster! Therefore, the wheel will never come to rest due to friction. QED.
Is this reasoning correct? Can you solve Dyre’s Dilemma?
Yea basically what Dyre’s dilemma does wrong is to treat friction as only affecting rotation via torque, while ignoring that ALSO reduces forward motion. Fundamentally, the idealized model assumes no energy dissipation, so there is no mechanism for the wheel to lose kinetic energy and come to rest. In reality, deformation and internal friction convert mechanical energy into heat, which is why objects actually stop.
Euler-Lagrange and Hamilton’s Equations
Note
Man, what would be really nice to have (for this section in particular, and perhaps future sections) is a plugin that turns math equations into manim animations that can be played. This would be both an Obsidian plugin, and have some thing that lets me view them if I were to export to HTML…
Basically, the Euler-Lagrange equations are the second-order form of the equations of motion, while Hamilton’s equations are the first-order form. In either form, the equations of motion are a consequence of the Principle of Least Action. We shall now re-write those equations in a very general way, which can be applied to any mechanical system, including those which are more complicated than a baseball.
I’m switching using to as the coordinate. I feel like was intuitive because we were talking about height specifically, but feels more general. It’s worth nothing that everything that follows generalizes further, like you can swap for and you can describe any mechanical system with any number of degrees of freedom.
We begin by writing
Explainer
is the entire path, meaning the list of positions . Because indexing starts at 0 and ends at , the total number of points is . However, the sum runs from to because each term corresponds to an interval between two consecutive points, using and . There are only such intervals, even though there are points. The quantity is the timestep , and the velocity at each step is approximated by . The action is therefore computed by summing the Lagrangian evaluated on each small segment of the path, which is a discrete approximation of the continuous integral .
Think of it like this… let be a list containing the path . Then gives the number of intervals. Let . Define velocity as . Define the Lagrangian as . Then initialize , and for each from to , compute and update . This loop runs exactly times, once per interval, which is why the sum stops at .
where
and where
is known as the Lagrangian function. Then the principle of least action requires that, for each ,
and, since
this becomes
Recalling that , this last equation can be written
Anyways, this is the Euler-Lagrange equation for the baseball. It becomes simpler when we take the limit (the “continuum” limit). In that limit, we have:
where the Lagrangian function for the baseball is
and the Euler-Lagrange equation, in the continuum limit, becomes
For the Lagrangian of the baseball, the relevant partial derivatives are
which, when substituted into (1.13), give
This is just the Newton’s second law in second-order form (1.1) again,
We now want to rewrite the Euler-Lagrange equation in first-order form. Of course, we already know the answer, which is (1.2), but let us “forget” this answer for a moment, in order to introduce a very general method. The reason the Euler-Lagrange equation is second-order in the time derivatives is that is first-order in the time derivative.
Why rewrite it as a pair of first-order equations?
First, a pair of first-order equations is easier to integrate numerically than one second-order equation. But also, quantum mechanics does not describe a system’s position and velocity; it uses its position and momentum. The state space of a classical system is , not , and this space (called phase space) turns out be the right framework for the quantum theory we’re leading to.
We are now going to declare war on s. We like the s. — My Prof.
That is to say is velocity, which is a derivative — it depends on where the particle is at two nearby moments in time. The momentum is a property of the state right now. Trading for as the variable of the the point of the Hamiltonian.
So let us define the momentum corresponding to the coordinate to be
This gives as a function of and , but, alternatively, we can solve for as a function of and , i.e.
Next, we introduce the Hamiltonian function
Since is a function of and , is also a function of and .
The reason for introducing the Hamiltonian is that its first derivatives with respect to and have a remarkable property; namely, on a trajectory satisfying the Euler-Lagrange equations, the and derivatives of are proportional to the time-derivatives of and . To see this, first differentiate the Hamiltonian with respect to ,
where we have applied (1.16). Next, differentiating with respect to ,
Using (1.13) (and this is where the equations of motion enter), we find
Thus, with the help of the Hamiltonian function, we have rewritten the single 2nd order Euler-Lagrange equation (1.13) as a pair of 1st order differential equations
which are known as Hamilton’s Equations.
Note: When has no explicit time dependence —that is to say the laws governing the system don’t change with time—it turns out that . That is to say the Hamiltonian is conserved. This is “Noether’s theorem” for time-translation symmetry. I mention this because the point is just that being the total energy and being conserved are the same fact stated two different ways.
For a baseball, the Lagrangian is given by (1.12), and therefore the momentum is
This is inverted to give
and the Hamiltonian is
Note that the Hamiltonian for the baseball is simply the kinetic energy plus the potential energy; i.e. the Hamiltonian is an expression for the total energy of the baseball.
Why is when ?
I asked my Professor this and he straight up said “There are many ways the universe could be constructed that could be satisfying to you, a descendant of primates on earth […] But out here in the real world, where the rubber meets the road […] It’s like they tell kindergarteners; you get what you get and you don’t throw a fit.”
But it’s actually not arbitrary, rather it’s forced by the definition (1.18). If we substitute the baseball directly:
For a particle, and , so . Therefore:
The sign flip on comes from the definition; the works out because of the specific relation for kinetic energy.
Substituting into (1.22), one finds
which is simply the first-order form of Newton’s Law (1.2).
Classical Mechanics in a Nutshell
All the machinery of the Least Action Principle, the Lagrangian Function, and Hamilton’s equations, is overkill in the case of a baseball. In that case, we knew the equation of motion from the beginning. But for more involved dynamical systems, involving, say, wheels, springs, levers, and pendulums, all coupled together in some complicated way, the equations of motion are often far from obvious, and what is needed is some systematic way to derive them.
For any mechanical system, the generalized coordinates are a set of variables needed to describe the configuration of the system at a given time. These could be a set of cartesian coordinates of a number of different particles, or the angular displacement of a pendulum, or the displacement of a spring from equilibrium, or all of the above. The dynamics of the system, in terms of these coordinates, is given by a Lagrangian function , which depends on the generalized coordinates and their first time-derivatives . Normally, in non-relativistic mechanics, we first specify
The Lagrangian
One then defines
The Action
From the Least Action Principle, following a method similar to the one we used for the baseball, we derive
The Euler-Lagrange Equations
These are the 2nd-order equations of motion. To go to 1st-order form, first define
The Generalized Momenta
which can be inverted to give the time-derivatives of the generalized coordinates in terms of the generalized coordinates and momenta
Viewing as a function of and , one then defines
The Hamiltonian
Usually the Hamiltonian has the form
Finally, the equations of motion in 1st-order form are given by
Hamilton’s Equations
Problem Set
Example: The Plane Pendulum
Our pendulum is a mass at the end of a weightless rigid rod of length , which pivots in a plane around the point P. The “generalized coordinate”, which specifies the position of the pendulum at any given time, is the angle .
We get our Lagrangian to be
where is the gravitational potential at the height of point P, which the pendulum reaches at . Since is arbitrary, we will just set it to .
Next we get the Action to be . The next step is to derive the Euler-Lagrange Equations since we have and , therefore is the Euler-Lagrange form of the equations of motion.
Then we need to get The Generalized Momentum, which is . Then to get the Hamiltonian, we insert into to get
Thus, Hamilton’s Equations are
which are easily seen to be equivalent to the Euler-Lagrange equations.
Problem
Two pointlike particles moving in three dimensions have masses and respectively, and interact via a potential . Find Hamilton’s equations of motion for the particles.
We know that . is , and is . Thus
Now let’s derive our Euler-Lagrange Equations. Since , for and ,
Wait I’m a moron we totally didn’t need that. Anyways, since
We’ll, need to use
Whereby
So, let’s do this. The conjugate momenta are
So
inverting, . The Hamiltonian is
Hamilton’s equations then give
Note that , so . This is newton’s third law.
Problem
Suppose, instead of a rigid rod, the mass of the plane pendulum is connected to point P by a weightless spring. The potential energy of the spring is , where is the length of the spring, and is its length when not displaced by an external force. Choosing and as the generalized coordinates, find Hamilton’s equations.
TODO: this problem....
The Classical State
Prediction is a rather important thing in physics because the only reliable test of a scientific theory is the ability to predict the future. Stated “rather abstractly,” the process of prediction works as follows: By a slight disturbance (viewing a thing implies that photons are bouncing off of that thing, after all this is what it means for an object to be “illuminated”) known as a measurement, an object is assigned a mathematical representation which we will call its physical state. The laws of motion are mathematical rules by which, given a physical state at a particular time, one can deduce the physical state of some object at a later time.
The later physical state is the prediction, which can be checked by a subsequent measurement of the object. From the discussion so far, its easy to see that what is meant in classical physics and by the “physical state” of a system is simply its set of generalized coordinates and the generalized momenta . These are supposed to be obtained, at some time , by the measurement process. Given the physical state at some time , the state at is obtained by the rule.
In this way, the physical state at any later time can be obtained (in principle) to an arbitrary degree of accuracy, by making the time-step sufficiently small (or else, if possible, by solving the equations of motion exactly).
Note that the coordinates are alone are not enough to specify the physical state, because they are not sufficient to predict the future. Information about the momenta is also required. Blah blah blah the space of all possible is known as phase space. For a single particle moving in (three-dimensions), there are three components of position and three components of momentum, so the “physical state” is specified by 6 numbers , which can be viewed as a point in phase space. Likewise, the physical state of a system of particles consists of coordinates for each particle ( coordinates in all), and components of momentum for each particle ( momentum components in all), so the state is given by a set of numbers, which can be viewed as a single point in , that is -dimensional space.
Classical mechanics fails to predict correctly the behavior of both light and matter at the atomic level and is replaced with quantum mechanics. But classical and quantum mechanics lowkey have a lot in common. They both assign physical states to objects, and these physical states evolve according to 1st-order diff-eqs. The difference between them lies in the contrast between a physical state as understood by classical mechanics, the “classical state”, and its quantum counterpart, the “quantum state”. This difference will be explored in the next new chapters…