How to start an online math group?

What is an intuitive explanation for why group velocity is [math]\frac{\mathrm{d} \omega}{\mathrm{d}k}[/math] and phase velocity is [math]\frac{\omega}{k}[/math]?

  • EDIT: The only explanation I'm familiar with for phase velocity is that waves take on the form [math]cos(k x - \omega t)[/math] and a point of constant phase, say, for simplicity, zero, must satisfy [math]k x - \omega t = 0[/math], or equivalently, [math]v = \frac{\mathrm{d} x}{\mathrm{d} t} = \frac{\omega}{k}[/math]. I can't remember getting a satisfying explanation for group velocity in any physics classes.

  • Answer:

    The question comes up in the case of waves that propagate through a dispersive medium, that is, a medium in which the speed of wave propagation depends on the frequency of the wave. The function [math]\omega (k)[/math] is called the dispersion relation. The phase velocity is by definition the speed at which the phase of a plane wave, having a unique single frequency omega, and unique wavelength k, propagates through the medium. Thus, that the phase velocity is [math]\omega(k)/k[/math] is fairly clear - you just write down a plane wave, and evaluate the motion in space of any point of fixed phase [math](k x - \omega t) = const[/math]. That implies the result. Just as you noted in the details. I struggled a bit with giving an intuitive explanation for the group velocity. But if the wave equation in the medium is linear, then the superposition of any two solutions, say any two plane waves, is also a solution of the wave equation. But the two different frequency components of this superposition will, depending on the dispersion relation [math]\omega(k)[/math], move with different phase velocities. So if we make a superposition up out of two plane waves, having equal amplitudes, we can ask how the wave will look as it propagates in space. If the two wavenumbers k1 and k2 are very close to one another, then for any fixed time, the waveform will look like a plane wave with wave number equal to the average of the two wavenumbers. But it will also have an envelope, a very slow variation in amplitude, which will be sinusoidal, but which will have a wavelength defined by the difference in wavenumbers. It's this slow variation in the envelope of the superposition of the two waves which defines the groups - and we want to calculate how fast the groups propagate. Now if you introduce time into the picture, the waveform will begin to shift, both the group envelopes and the underlying higher wavenumber oscillation within the groups will move. The whole waveform is just the superposition of two sinusoidal waves which move at two very slightly different speeds, by our assumptions. This can be written as the product of a cosine wave that oscillates with half the difference frequency, and has a wavenumber which is equal to half the difference wavenumber, and a sine wave that oscillates with the average frequency and has the average wavenumber. So the phase velocity of the wave controlling the slow variation in amplitude, which is what we call the group velocity, in this special case, is just: [math]v_g = \frac{\omega(k_1) - \omega(k_2)}{k_1-k_2}[/math] But assuming [math]k_1=k_2 + dk[/math] we can expand to first order in the differential dk, and this gives the required result for the group velocity. Now we can go on and discuss more complicated things such as general wavepackets, second order effects, anomalous dispersion, non-linearities and so on - but this discussion contains the central idea, I think. And that's about as intuitive as I can make it! I made an error in my reasoning the first time through this, so let me just write out the superposition of the two plane waves explicitly, because this makes the result very, very clear. [math]y(x,t) = a \sin(k_1 x - \omega_1 t) + a \sin(k_2 x - \omega_2 t)[/math] [math]y(x,t) = 2 a \cos( \frac{k_1-k_2}{2} x- \frac{\omega_1-\omega_2}{2} t)  \sin (\frac{k_1+k_2}{2} x - \frac{\omega_1 + \omega_2}{2} t)[/math] using a well known identity for the product of a sine function and a cosine function. Now we can see that there is one wave, the cosine wave, which oscillates with half the difference frequency, and one wave the sine wave, which oscillates with the average frequency. The relationships between the wave numbers are similar. The velocity of the cosine wave moves with the group velocity and describes the amplitude pattern of the over all wave. But the group velocity is just, by definition, the phase velocity of the cosine wave, which is: [math]v_g = \frac{\omega_1 -\omega_2}{k_1-k_2}[/math] So in the limit of [math]k_1\rightarrow k_2[/math] we get the desired result for the group velocity.

David Kahana at Quora Visit the source

Was this solution helpful to you?

Other answers

That's pretty easy to answer. If you assume the wave is monochromatic, then if you want to travel with the wave so that the phase stays constant, you must travel at a speed where the argument of the cos function is constant. [math]cos(\omega t - kx) = const\rightarrow \omega t - kx = constvelocity = \frac{dx}{dt}[/math] [math]kx = \omega t - constx = \frac{(\omega t - const)}{k}\frac{dx}{dt} = \frac{w}{k}[/math] For group velocity, it is a little tougher.  But if you assume that instead of being monochromatic, the wave contains multiple frequencies that travel at different speeds, then you can compose the wave as a sum of a bunch of plane waves: [math]A(x,t) = \[ \int_{-\infty}^{\infty} A(k) e^{ i (kx - \omega(k)t }\,dk\][/math] where [math]\omega[/math] is a function of [math]k[/math]. If we assume that the wave is narrowband (that is, the frequencies aren't spread out that much), then we can approximate the function w(k) with its linearization: [math]w(k) ≈ \omega_0 + (k-k_0)\frac{d \omega}{dk}[/math] Moving this out, we get [math]A(x,t) = \[ \int_{-\infty}^{\infty} A(k) e^{ i (kx - \omega_0 t + k_0 \frac{d \omega}{dk} t - k \frac{d \omega}{dk} t }\,dk\][/math] extracting the terms that are not functions of k, you get [math]A(x,t) = \exp{it(\frac{d \omega}{dk} k_0 - w_0)} \[ \int_{-\infty}^{\infty} A(k) e^{ i (k(x - \frac{d \omega}{dk} t }\,dk\][/math] From there, you can see that the outside multiple gives it a wavy component, and it has a magnitude of 1. From there, you can take the magnitude of the whole function, which will give you just the amplitude -- or the height of the wave, which height is what you want to follow everywhere. Inside the integral, to maintain a constant height, you must keep the argument of [math]\exp{i k(x - \frac{d \omega}{dk} t)}[/math] constant.  To do that, keep [math]x - \frac{d \omega}{dk} t = constx = const + \frac{d \omega}{dk} t[/math] [math]\frac{dx}{dt} = \frac{d \omega}{dk}[/math] And there is your derivation right there. (If anyone wants to edit this and put it in LaTex to make it look better, feel free to do so, I just don't consider it worth the effort at the moment).

Jacob VanWagoner

This is going to be a somewhat nonstandard answer that is the result of me trying to find a neat and intuitively satisfying explanation for what phase and group velocities are in the most general context possible. So, apologies if this turns out to fly in the face of what people working on optics typically deal with. If there are blatant factual inaccuracies in what follows, please correct me. First of all, the notion of velocity as defined for particles is naturally associated with real space. This is not surprising at all as particles are after all objects most naturally defined on real space. By contrast, waves are objects that are more naturally defined on the Fourier space, so it makes sense to say that the notion corresponding to velocity in this case should be naturally associated with the Fourier space as opposed to real space. Now, velocity itself is not a well-defined notion here; what we will instead look at is the inverse velocity, which I shall denote by [math]\boldsymbol\upsilon[/math] (I don't know if it's discernible, but that's supposed to be an upsilon as opposed to a [math]\boldsymbol v[/math] which LaTeX renders as somewhat thinner). This shouldn't be too alien; after all lattice spacings in the reciprocal lattice of a crystal (which is nothing but its Fourier space) are measured in units of inverse length. Now, let's talk about a wave which may be defined in any number of dimensions (say [math]N[/math]). We can express this in general as a complex valued function [math]\psi(t,\boldsymbol x)[/math]. This can be written in terms of real-valued functions [math]A(t,\boldsymbol x)[/math] and [math]\phi(t,\boldsymbol x)[/math] as [math]\psi = A e^{i\phi}[/math]. Of course, [math]A[/math] is the amplitude while [math]\phi[/math] is the phase. We think of the wave as being formed by two families of surfaces, one composed of surfaces of constant [math]A[/math] and the other of surfaces of constant [math]\phi[/math], billowing and propagating through space with time. We may thus assign two inverse velocity fields [math]\boldsymbol \upsilon_g(t,\boldsymbol x)[/math] and [math]\boldsymbol\upsilon_p(t,\boldsymbol x)[/math] which basically measure how slowly these surfaces are moving through space. More precisely, we have [math]\boldsymbol\upsilon_g \partial_t A + \nabla_{\boldsymbol x} A=0,[/math] [math]\boldsymbol\upsilon_p \partial_t \phi + \nabla_{\boldsymbol x} \phi=0.[/math] As the subscripts may indicate, I'm eventually going to be identifying these inverse velocities as the 'reciprocals' of the group and phase velocities respectively. Of course, to be able to distill two specific vectors out of this, we need to first stipulate that the inverse velocity fields are uniform vector fields. So, setting [math]\boldsymbol\upsilon_g[/math] and [math]\boldsymbol\upsilon_p[/math] as constant in the two differential equations above yields the following general solution: [math]A = A(t-\boldsymbol\upsilon_g\cdot \boldsymbol x),[/math] [math]\phi = \phi(t-\boldsymbol\upsilon_p\cdot \boldsymbol x).[/math] From now onwards we shall thus treat [math]A[/math] and [math]\phi[/math] as single variable functions, so that [math]A'[/math] and [math]\phi'[/math] denote [math]A'(t-\boldsymbol\upsilon_g\cdot \boldsymbol x)[/math] and [math]\phi'(t-\boldsymbol\upsilon_p\cdot \boldsymbol x)[/math] respectively. Also, we'll use the shorthands [math]\phi_0=\phi(0)[/math] and [math]\phi'_0=\phi'(0)[/math]. We know from quantum mechanics that given that the functions are square integrable or alternatively, that we integrate over some finite region and subtract off boundary terms, the expectation value for [math]\omega[/math] and [math]\boldsymbol k[/math] are given by [math]\langle \omega\rangle= -i\int\mathrm d^N\boldsymbol x\,\psi^\star \partial_t\psi,[/math] [math]\langle \boldsymbol k\rangle= i\int\mathrm d^N\boldsymbol x\,\psi^\star \nabla_{\boldsymbol x}\psi.[/math] You may now readily verify that the inverse phase velocity is given by [math]\boldsymbol \upsilon_p = \langle \boldsymbol k\rangle/\langle \omega\rangle[/math]. To obtain an expression for [math]\boldsymbol \upsilon_g[/math] in terms of [math]\omega[/math] and [math]\boldsymbol k[/math], we'll have to make the assumption that [math]\phi=\phi_0+\phi'_0(t-\boldsymbol\upsilon_p\cdot \boldsymbol x).[/math] This essentially has to do with the fact that we require the wave to be of narrow bandwidth. We also note that [math]A[/math] may be Fourier decomposed as [math]A = \int\mathrm d\omega\, \hat A(\omega) \exp[i\omega(t-\boldsymbol\upsilon_g\cdot \boldsymbol x)].[/math] Plugging these expressions for [math]A[/math] and [math]\phi[/math] back into [math]\psi = A e^{i\phi}[/math] gives us [math]\psi = e^{i\phi_0}\int\mathrm d\omega\, \hat A(\omega) \exp[i(\omega + \phi'_0)t[/math] [math]- \,i(\omega\boldsymbol \upsilon_g + \phi'_0\boldsymbol\upsilon_p)\cdot \boldsymbol x].[/math] Furthermore, carrying out a change of variables [math]\omega \mapsto \omega-\phi'_0[/math] yields [math]\psi = e^{i\phi_0}\int  \mathrm d\omega\, \hat A(\omega-\phi'_0) \exp[i\omega t-i\boldsymbol k\cdot \boldsymbol x],[/math] where [math]\boldsymbol k = \omega\boldsymbol  \upsilon_g + \phi'_0(\boldsymbol\upsilon_p-\boldsymbol\upsilon_g)[/math]. We thus obtain a sort of inverse dispersion relation that immediately gives us the result [math]\boldsymbol \upsilon_g = d\boldsymbol k/d\omega[/math]. It may be readily seen that when [math]N=1[/math], the inverse phase and group velocities are actually reciprocals of the phase and group velocity respectively, just as expected. A well-defined notion of taking the reciprocal of a vector doesn't carry over into higher dimensions (unless you specify an entire basis of vectors), but for what it's worth, [math]\boldsymbol \upsilon_p\cdot \boldsymbol v_p=\boldsymbol \upsilon_g\cdot \boldsymbol v_g=1[/math].

Arpan Saha

I don't really think you can do a lot better than the derivation: https://en.wikipedia.org/wiki/Group_velocity#Derivation

Mark Barton

Related Q & A:

Just Added Q & A:

Find solution

For every problem there is a solution! Proved by Solucija.

  • Got an issue and looking for advice?

  • Ask Solucija to search every corner of the Web for help.

  • Get workable solutions and helpful tips in a moment.

Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.