Misplaced Pages

Special relativity: Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editNext edit →Content deleted Content addedVisualWikitext
Revision as of 07:06, 20 November 2006 editJRSpriggs (talk | contribs)Autopatrolled, Extended confirmed users, Pending changes reviewers18,799 edits revert. A few of these change may be OK. But most are destructive.← Previous edit Revision as of 16:03, 20 November 2006 edit undo207.200.116.139 (talk) Metric and tranformations of coordinates: The Lorentz transformation matrix is a matrix not a tensor and you don't transform an index(a number), but the elements, ultimately the tensorNext edit →
(6 intermediate revisions by the same user not shown)
Line 107: Line 107:
{{main|Velocity-addition formula}} {{main|Velocity-addition formula}}


If the observer in <math>S\!</math> sees an object moving along the <math>x\!</math> axis at velocity <math>w\!</math>, then the observer in the <math>S'\!</math> system, a frame of reference moving at velocity <math>v\!</math> in the <math>x\!</math> direction with respect to <math>S\!</math>, will see the object moving with velocity <math>w'\!</math> where If the observer for <math>S\!</math> sees an object moving along the <math>x\!</math> axis at velocity <math>u_{x}\!</math>, then the observer in the <math>S'\!</math> system, a frame of reference moving at velocity <math>v\!</math> in the <math>x\!</math> direction with respect to <math>S\!</math>, will see the object moving with velocity <math>u_{x}'\!</math> where


:<math>w'=\frac{w-v}{1-wv/c^2}.</math> :<math>u_{x}'=\frac{u_{x}-v}{1-u_{x}v/c^2}.</math>
and
:<math>u_{x}=\frac{u_{x}'+v}{1+u_{x}'v/c^2}.</math>


This equation can be derived from the space and time transformations above. Notice that if the object is moving at the speed of light in the <math>S\!</math> system (i.e. <math>w=c\!</math>), then it will also be moving at the speed of light in the <math>S'\!</math> system. Also, if both <math>w\!</math> and <math>v\!</math> are small with respect to the speed of light, we will recover the intuitive Galilean transformation of velocities: <math>w' \approx w-v\!.</math> This equation can be derived from the space and time transformations above. Notice that if the object is moving at the speed of light in the <math>S\!</math> system (i.e. <math>u_{x}=c\!</math>), then it will also be moving at the speed of light in the <math>S'\!</math> system. Also, if both <math>u_{x}\!</math> and <math>v\!</math> are small with respect to the speed of light, we will recover the intuitive Galilean transformation of velocities: <math>u_{x} \approx u_{x}'+v\!.</math>


== Mass, momentum, and energy == == Mass, momentum, and energy ==
Line 203: Line 205:
SR uses a 'flat' 4-dimensional ], which is an example of a ]. This space, however, is very similar to the standard 3 dimensional ], and fortunately by that fact, very easy to work with. SR uses a 'flat' 4-dimensional ], which is an example of a ]. This space, however, is very similar to the standard 3 dimensional ], and fortunately by that fact, very easy to work with.


The ] of distance(''ds'') in ] 3D space is defined as: The ] of distance <math>d\sigma</math> in ] 3D space is defined as:


:<math> ds^2 = dx_1^2 + dx_2^2 + dx_3^2 </math> :<math> d\sigma ^2 = (dx^1)^2 + (dx^2)^2 + (dx^3)^2 </math>


where <math>(dx_1,dx_2,dx_3)</math> are the differentials of the three spatial dimensions. In the geometry of special relativity, a fourth dimension, time, is added, with units of ], so that the equation for the differential of distance becomes: where <math>(dx^1 ,dx^2 ,dx^3 )</math> are the differentials of the three spatial dimensions. In the geometry of special relativity, a fourth dimension, time, is added multiplied by c, <math>x^0 = ct</math> so that the equation for the differential of distance becomes:
:<math> ds^2 = (dx^0)^2 - d\sigma ^2 </math>

:<math> ds^2 = dx_1^2 + dx_2^2 + dx_3^2 - c^2 dt^2 </math> :<math> ds^2 = (dx^0)^2 - (dx^1)^2 - (dx^2)^2 - (dx^3)^2 </math>

If we wished to make the time coordinate look like the space coordinates, we could treat time as ]: ''x<sub>4</sub> = ict'' . In this case the above equation becomes symmetric:

:<math> ds^2 = dx_1^2 + dx_2^2 + dx_3^2 + dx_4^2 </math>


This suggests what is in fact a profound theoretical insight as it shows that special relativity is simply a ] of our ], very similar to rotational symmetry of ]. Just as Euclidean space uses a ], so space-time uses a Minkowski metric. According to Misner (1971 §2.3), ultimately the deeper understanding of both special and general relativity will come from the study of the Minkowski metric (described below) rather than a "disguised" Euclidean metric using ''ict'' as the time coordinate. This suggests what is in fact a profound theoretical insight as it shows that special relativity is simply a ] of our ], very similar to rotational symmetry of ]. Just as Euclidean space uses a ], so space-time uses a Minkowski metric. According to Misner (1971 §2.3), ultimately the deeper understanding of both special and general relativity will come from the study of the Minkowski metric (described below) rather than a "disguised" Euclidean metric using ''ict'' as the time coordinate.
Line 219: Line 217:
If we reduce the spatial dimensions to 2, so that we can represent the physics in a 3-D space If we reduce the spatial dimensions to 2, so that we can represent the physics in a 3-D space


:<math> ds^2 = dx_1^2 + dx_2^2 - c^2 dt^2 </math> :<math> ds^2 = dct^2 - (dx^1)^2 - (dx^2)^2 </math>


We see that the ] ]s lie along a dual-cone: We see that the ] ]s lie along a dual-cone(c=1 in the image):


] ]
Line 227: Line 225:
defined by the equation defined by the equation


:<math> ds^2 = 0 = dx_1^2 + dx_2^2 - c^2 dt^2 </math> :<math> ds^2 = 0 = dct^2 - (dx^1)^2 - (dx^2)^2 </math>


or or


:<math> dx_1^2 + dx_2^2 = c^2 dt^2 </math> :<math> (dx^1)^2 + (dx^2)^2 = dct^2 </math>


Which is the equation of a circle with ''r=c*dt''. Which is the equation of a circle with ''r=c*dt''.
Line 239: Line 237:
] ]


:<math> ds^2 = 0 = dx_1^2 + dx_2^2 + dx_3^2 - c^2 dt^2 </math> :<math> ds^2 = 0 = dct^2 - (dx^1)^2 - (dx^2)^2 - (dx^3)^2</math>


:<math> dx_1^2 + dx_2^2 + dx_3^2 = c^2 dt^2 </math> :<math> (dx^1)^2 + (dx^2)^2 + (dx^3)^2 = dct^2 </math>


This null dual-cone represents the "line of sight" of a point in space. That is, when we look at the ]s and say "The light from that star which I am receiving is X years old", we are looking down this line of sight: a null geodesic. We are looking at an event <math>d = \sqrt{x_1^2+x_2^2+x_3^2} </math> meters away and ''d/c'' seconds in the past. For this reason the null dual cone is also known as the 'light cone'. (The point in the lower left of the picture below represents the star, the origin represents the observer, and the line represents the null geodesic "line of sight".) This null dual-cone represents the "line of sight" of a point in space. That is, when we look at the ]s and say "The light from that star which I am receiving is X years old", we are looking down this line of sight: a null geodesic. We are looking at an event <math>d = \sqrt{(x^1)^2+(x^2)^2+(x^3)^2} </math> meters away and ''d/c'' seconds in the past. For this reason the null dual cone is also known as the 'light cone'. (The point in the lower left of the picture below represents the star, the origin represents the observer, and the line represents the null geodesic "line of sight".)


] ]
Line 252: Line 250:


== Physics in spacetime == == Physics in spacetime ==
Here, we see how to write the equations of special relativity in a manifestly invariant form. The position of an event in spacetime is given by a ] four vector whose components are: Here, we see how to write the equations of special relativity in a manifestly covariant form. The position of an event in spacetime is is not a four vector because simple translations change it in a different way than the cordinate differential dx<SUP><FONT FACE=SYMBOL>m</FONT></SUP> however it can be given the same component format as a rank one contravariant four-vector:


:<math>x^\nu=\left(t, x, y, z\right)</math> :<math>x^\nu=\left(ct, x, y, z\right)</math>


That is, <math>x^0 = t</math> and <math>x^1 = x</math> and <math>x^2 = y</math> and <math>x^3 = z</math>. Superscripts are contravariant indices in this section rather than exponents except when they indicate a square. Subscripts are ] indices which also range from zero to three as with the spacetime gradient of a field φ: That is, <math>x^0 = ct</math> and <math>x^1 = x</math> and <math>x^2 = y</math> and <math>x^3 = z</math>. Superscripts are contravariant indices in this section rather than exponents except when they indicate a square. Subscripts are ] indices which also range from zero to three as with the spacetime gradient of a field φ:


:<math>\partial_0 \phi = \frac{\partial \phi}{\partial t}, \quad \partial_1 \phi = \frac{\partial \phi}{\partial x}, \quad \partial_2 \phi = \frac{\partial \phi}{\partial y}, \quad \partial_3 \phi = \frac{\partial \phi}{\partial z}.</math> :<math>\partial_0 \phi = \frac{\partial \phi}{\partial t}, \quad \partial_1 \phi = \frac{\partial \phi}{\partial x}, \quad \partial_2 \phi = \frac{\partial \phi}{\partial y}, \quad \partial_3 \phi = \frac{\partial \phi}{\partial z}.</math>
Line 264: Line 262:


:<math>\eta_{\alpha\beta} = \begin{pmatrix} :<math>\eta_{\alpha\beta} = \begin{pmatrix}
-c^2 & 0 & 0 & 0\\ 1 & 0 & 0 & 0\\
0 & 1 & 0 & 0\\ 0 & -1 & 0 & 0\\
0 & 0 & 1 & 0\\ 0 & 0 & -1 & 0\\
0 & 0 & 0 & 1 0 & 0 & 0 & -1
\end{pmatrix}</math> \end{pmatrix}</math>


Line 273: Line 271:


:<math>\eta^{\alpha\beta} = \begin{pmatrix} :<math>\eta^{\alpha\beta} = \begin{pmatrix}
-1/c^2 & 0 & 0 & 0\\ 1 & 0 & 0 & 0\\
0 & 1 & 0 & 0\\ 0 & -1 & 0 & 0\\
0 & 0 & 1 & 0\\ 0 & 0 & -1 & 0\\
0 & 0 & 0 & 1 0 & 0 & 0 & -1
\end{pmatrix}</math> \end{pmatrix}</math>


Then we recognise that co-ordinate transformations between inertial reference frames are given by the ] tensor Λ. For the special case of motion along the x-axis, we have: Then we recognise that co-ordinate transformations between inertial reference frames are given by the ] matrix Λ. For the special case of motion along the x-axis, we have:


:<math>\Lambda^{\mu'}{}_\nu = \begin{pmatrix} :<math>\Lambda '^{\mu}{}_\nu = \begin{pmatrix}
\gamma & -\beta\gamma/c & 0 & 0\\ \gamma & -\beta\gamma & 0 & 0\\
-\beta\gamma c & \gamma & 0 & 0\\ -\beta\gamma & \gamma & 0 & 0\\
0 & 0 & 1 & 0\\ 0 & 0 & 1 & 0\\
0 & 0 & 0 & 1 0 & 0 & 0 & 1
\end{pmatrix}</math> \end{pmatrix}</math>


which is simply the matrix of a boost (like a rotation) between the ''x'' and ''t'' coordinates. Where μ' indicates the row and ν indicates the column. Also, β and γ are defined as: which is simply the matrix of a boost (like a rotation) between the ''x'' and ''t'' coordinates. Where μ indicates the row and ν indicates the column numbers. Also, β and γ are defined as:


:<math>\beta = \frac{v}{c},\ \gamma = \frac{1}{\sqrt{1-\beta^2}}.</math> :<math>\beta = \frac{v}{c},\ \gamma = \frac{1}{\sqrt{1-\beta^2}}.</math>
Line 294: Line 292:
More generally, a transformation from one inertial frame (ignoring translations for simplicity) to another must satisfy: More generally, a transformation from one inertial frame (ignoring translations for simplicity) to another must satisfy:


:<math>\eta_{\alpha\beta} = \eta_{\mu'\nu'} \Lambda^{\mu'}{}_\alpha \Lambda^{\nu'}{}_\beta \!</math> :<math>\eta_{\alpha\beta} = \eta '_{\mu \nu} \Lambda '^{\mu}{}_\alpha \Lambda '^{\nu}{}_\beta \!</math>


where there is an implied summation of <math>\mu' \!</math> and <math>\nu' \!</math> from 0 to 3 on the right-hand side in accordance with the ]. The ] is the most general group of transformations which preserves the ] and this is the physical symmetry underlying special relativity. where there is an implied summation of <math>\mu \!</math> and <math>\nu \!</math> from 0 to 3 on the right-hand side in accordance with the ]. The ] is the most general group of transformations which preserves the ] and this is the physical symmetry underlying special relativity.


All proper physical quantities are given by tensors. So to transform from one frame to another, we use the well known ] All proper physical quantities are given by tensors. So to transform from one frame to another, we use the well known ]


:<math>T^{\left}_{\left} = :<math>\mathbf{T'^{\left}_{\left} =
\Lambda^{i_1'}{}_{i_1}\Lambda^{i_2'}{}_{i_2}...\Lambda^{i_p'}{}_{i_p} \Lambda '^{i_1}{}_{i_1}\Lambda '^{i_2}{}_{i_2}...\Lambda '^{i_p}{}_{i_p}
\Lambda_{j_1'}{}^{j_1}\Lambda_{j_2'}{}^{j_2}...\Lambda_{j_q'}{}^{j_q} \Lambda '_{j_1}{}^{j_1}\Lambda '_{j_2}{}^{j_2}...\Lambda '_{j_q}{}^{j_q}
T^{\left}_{\left}</math> T^{\left}_{\left}}</math>


Where <math>\Lambda_{j_k'}{}^{j_k} \!</math> is the reciprocal matrix of <math>\Lambda^{j_k'}{}_{j_k} \!</math>. Where <math>\Lambda '_{j_k}{}^{j_k} \!</math> is the reciprocal matrix of <math>\Lambda^{j_k}{}_{j_k} \!</math>.


To see how this is useful, we transform the position of an event from an unprimed co-ordinate system ''S'' to a primed system ''S''', we calculate If origins coincide when the origin clocls are synched to zero, we can transform the position of an event from an unprimed co-ordinate system ''S'' to a primed system ''S''', using the transformation matrix


:<math> :<math>
\begin{pmatrix} \begin{pmatrix}
t'\\ x'\\ y'\\ z' ct'\\ x'\\ y'\\ z'
\end{pmatrix} = x^{\mu'}=\Lambda^{\mu'}{}_\nu x^\nu= \end{pmatrix} = x'^{\mu'}=\Lambda '^{\mu}{}_\nu x^\nu=
\begin{pmatrix} \begin{pmatrix}
\gamma & -\beta\gamma/c & 0 & 0\\ \gamma & -\beta\gamma & 0 & 0\\
-\beta\gamma c & \gamma & 0 & 0\\ -\beta\gamma & \gamma & 0 & 0\\
0 & 0 & 1 & 0\\ 0 & 0 & 1 & 0\\
0 & 0 & 0 & 1 0 & 0 & 0 & 1
\end{pmatrix} \end{pmatrix}
\begin{pmatrix} \begin{pmatrix}
t\\ x\\ y\\ z ct\\ x\\ y\\ z
\end{pmatrix} = \end{pmatrix} =
\begin{pmatrix} \begin{pmatrix}
\gamma t- \gamma\beta x/c\\ \gamma ct- \gamma\beta x\\
\gamma x - \beta \gamma ct \\ y\\ z \gamma x - \beta \gamma ct \\ y\\ z
\end{pmatrix} \end{pmatrix}
Line 332: Line 330:
The squared length of the differential of the position four-vector <math>dx^\mu \!</math> constructed using The squared length of the differential of the position four-vector <math>dx^\mu \!</math> constructed using


:<math>\mathbf{dx}^2 = \eta_{\mu\nu}dx^\mu dx^\nu = -(c \cdot dt)^2+(dx)^2+(dy)^2+(dz)^2\,</math> :<math>\mathbf{ds^2 = \eta_{\mu\nu}dx^\mu dx^\nu = (dct)^2-(dx)^2-(dy)^2-(dz)^2\,}</math>


is an invariant. Being invariant means that it takes the same value in all inertial frames, because it is a scalar (0 rank tensor), and so no Λ appears in its trivial transformation. Notice that when the ] <math>\mathbf{dx}^2</math> is negative that <math>d\tau=\sqrt{-\mathbf{dx}^2} / c</math> is the differential of ], while when <math>\mathbf{dx}^2</math> is positive, <math>\sqrt{\mathbf{dx}^2}</math> is differential of the ]. is an invariant. Being invariant means that it takes the same value in all inertial frames, because it is a (0 rank tensor), an invariant scalar, and so no Λ appears in its trivial transformation. Notice that when the ] <math>ds^2</math> is positive that <math>d\tau=ds</math> is the differential of ], while when <math>ds^2</math> is negative, <math>ds</math> is differential of the ].


The primary value of expressing the equations of physics in a tensor form is that they are then manifestly invariant under the Poincaré group, so that we do not have to do a special and tedious calculation to check that fact. Also in constructing such equations we often find that equations previously thought to be unrelated are, in fact, closely connected being part of the same tensor equation. The primary value of expressing the equations of physics in a tensor form is that they are then manifestly invariant under the Poincaré group, so that we do not have to do a special and tedious calculation to check that fact. Also in constructing such equations we often find that equations previously thought to be unrelated are, in fact, closely connected being part of the same tensor equation.
Line 357: Line 355:


:<math>p_\nu = m \cdot \eta_{\nu\mu} U^\mu = \begin{pmatrix} :<math>p_\nu = m \cdot \eta_{\nu\mu} U^\mu = \begin{pmatrix}
-E \\ p_x\\ p_y\\ p_z\end{pmatrix}.</math> E/c \\ -p_x\\ -p_y\\ -p_z\end{pmatrix}.</math>


where ''m'' is the ]. where ''m'' is the ].
Line 363: Line 361:
The invariant magnitude of the ] is: The invariant magnitude of the ] is:


:<math>\mathbf{p}^2 = \eta^{\mu\nu}p_\mu p_\nu = -(E/c)^2 + p^2 .</math> :<math>\mathbf{p}^2 = \eta^{\mu\nu}p_\mu p_\nu = (E/c)^2 - p^2 .</math>


We can work out what this invariant is by first arguing that, since it is a scalar, it doesn't matter which reference frame we calculate it, and then by transforming to a frame where the total momentum is zero. We can work out what this invariant is by first arguing that, since it is a scalar, it doesn't matter which reference frame we calculate it, and then by transforming to a frame where the total momentum is zero.


:<math>\mathbf{p}^2 = - (E_{rest}/c)^2 = - (m \cdot c)^2 .</math> :<math>\mathbf{p}^2 = (E_{rest}/c)^2 = (m \cdot c)^2 .</math>


We see that the rest energy is an independent invariant. A rest energy can be calculated even for particles and systems in motion, by translating to a frame in which momentum is zero. We see that the rest energy is an independent invariant. A rest energy can be calculated even for particles and systems in motion, by translating to a frame in which momentum is zero.

Revision as of 16:03, 20 November 2006

For a more accessible and less technical introduction to this topic, see Introduction to special relativity.

The special theory of relativity was proposed in 1905 by Albert Einstein in his article "On the Electrodynamics of Moving Bodies". Some three centuries earlier, Galileo's principle of relativity had stated that all uniform motion was relative, and that there was no absolute and well-defined state of rest; a person on the deck of a ship may be at rest in his opinion, but someone observing from the shore would say that he was moving. Einstein's theory combines Galilean relativity with the postulate that all observers will always measure the speed of light to be the same no matter what their state of uniform linear motion is.

This theory has a variety of surprising consequences that seem to violate common sense, but that have been verified experimentally. Special relativity overthrows Newtonian notions of absolute space and time by stating that distance and time depend on the observer, and that time and space are perceived differently, depending on the observer. It yields the equivalence of matter and energy, as expressed in the famous equation E=mc, where c is the speed of light. Special relativity agrees with Newtonian mechanics in their common realm of applicability, in experiments in which all velocities are small compared to the speed of light.

The theory was called "special" because it applies the principle of relativity only to inertial frames. Einstein developed general relativity to apply the principle generally, that is, to any frame, and that theory includes the effects of gravity. Special relativity doesn't account for gravity, but it can deal with accelerations.

Although special relativity makes relative some quantities, such as time, that we would have imagined to be absolute based on everyday experience, it also makes absolute some others that we would have thought were relative. In particular, it states that the speed of light is the same for all observers, even if they are in motion relative to one another. Special relativity reveals that c is not just the velocity of a certain phenomenon - light - but rather a fundamental feature of the way space and time are tied together. In particular, special relativity states that it is impossible for any material object to travel as fast as light.

For history and motivation, see the article: history of special relativity

Postulates

Main article: Postulates of special relativity

  1. First postulate - Special principle of relativity - The laws of physics are the same in all inertial frames of reference. In other words, there are no privileged inertial frames of reference.
  2. Second postulate - Invariance of c - The speed of light in a vacuum is a universal constant (c) which is independent of the motion of the light source.

The power of Einstein's argument stems from the manner in which he derived startling and seemingly implausible results from two simple assumptions that were founded on analysis of observations. An observer attempting to measure the speed of light's propagation will get the same answer no matter how the observer or the system's components are moving.

Lack of an absolute reference frame

The principle of relativity, which states that there is no stationary reference frame, dates back to Galileo, and was incorporated into Newtonian Physics. However, in the late 19 century, the existence of electromagnetic waves led some physicists to suggest that the universe was filled with a substance known as "aether", which would act as the medium through which the vibrations traveled. The aether was thought to constitute an absolute reference frame against which speeds could be measured. In other words, the aether was the only fixed or motionless thing in the universe. Aether supposedly had some wonderful properties: it was sufficiently elastic that it could support electromagnetic waves, and those waves could interact with matter, yet it offered no resistance to bodies passing through it. The results of various experiments, including the Michelson-Morley experiment, indicated that the Earth was always 'stationary' relative to the aether — something that was difficult to explain, since the Earth is in orbit around the Sun. Einstein's elegant solution was to discard the notion of an aether and an absolute state of rest. Special relativity is formulated so as to not assume that any particular frame of reference is special; rather, in relativity, a system appears to observe the same laws of physics independent of an observer's velocity with respect to it. In particular, the speed of light in a vacuum is always measured to be c, even when measured by multiple systems that are moving at different (but constant) velocities.

Consequences

Main article: Consequences of special relativity

Einstein has said that all of the consequences of special relativity can be found from examination of the Lorentz transformations.

These transformations, and hence special relativity, lead to different physical predictions than Newtonian mechanics when relative velocities become comparable to the speed of light. The speed of light is so much larger than anything humans encounter that some of the effects predicted by relativity are initially counter-intuitive:

  • Time dilation — the time lapse between two events is not invariant from one observer to another, but is dependent on the relative speeds of the observers' reference frames (e.g., the twin paradox which concerns a twin who flies off in a spaceship travelling near the speed of light and returns to discover that his twin has aged much more).
  • Relativity of simultaneity — two events happening in two different locations that occur simultaneously to one observer, may occur at different times to another observer (lack of absolute simultaneity).
  • Lorentz contraction — the dimensions (e.g., length) of an object as measured by one observer may be smaller than the results of measurements of the same object made by another observer (e.g., the ladder paradox involves a long ladder travelling near the speed of light and being contained within a smaller garage).
  • Composition of velocities — velocities (and speeds) do not simply 'add', for example if a rocket is moving at ⅔ the speed of light relative to an observer, and the rocket fires a missile at ⅔ of the speed of light relative to the rocket, the missile does not exceed the speed of light relative to the observer. (In this example, the observer would see the missile travel with a speed of 12/13 the speed of light.)
  • Inertia and momentum — as an object's velocity gets close to the speed of light, it becomes more and more difficult to accelerate it.
  • Equivalence of mass and energy, E=mc — mass and energy can be converted to one another, and play equivalent roles (e.g., the gravitational force on a falling apple is partly due to the kinetic energies of the subatomic particles it is made of).

Reference frames, coordinates and the Lorentz transformation

Full article: Lorentz transformations
Diagram 1. Changing views of spacetime along the world line of a rapidly accelerating observer.

In this animation, the vertical direction indicates time and the horizontal direction indicates distance, the dashed line is the spacetime trajectory ("world line") of the observer. The lower quarter of the diagram shows the events that are visible to the observer, and the upper quarter shows the light cone- those that will be able to see the observer. The small dots are arbitrary events in spacetime.

The slope of the world line (deviation from being vertical) gives the relative velocity to the observer. Note how the view of spacetime changes when the observer accelerates.

Relativity theory depends on "reference frames". A reference frame is a point in space at rest, or in uniform motion, from which a position can be measured along 3 spatial axes. In addition, a reference frame has a clock moving with the reference frame, allowing the measurement of the time of events.

An event is an occurrence that can be assigned a single unique time and location in space relative to a reference frame: it is a "point" in space-time. Since the speed of light is constant in relativity in each and every reference frame, pulses of light can be used to unambiguously measure distances and refer back the times that events occurred to the clock, even though light takes time to reach the clock after the event has transpired.

For example, the explosion of a firecracker may be considered to be an "event". We can completely specify an event by its four space-time coordinates: The time of occurrence and its 3-dimensional spatial location from a reference point. Let's call this reference frame S.

In relativity theory we often want to calculate the position of a point from a different reference point.

Suppose we have a second reference frame S', whose spatial axes and clock exactly coincide with that of S at time zero, but it is moving at a constant velocity v {\displaystyle v\,} with respect to S along the x {\displaystyle x\,} axis.

Since there is no absolute reference frame in relativity theory, a concept of 'moving' doesn't strictly exist, as everything is always moving with respect to some other reference frame. Instead, any two frames that move at the same speed in the same direction are said to be comoving. Therefore S and S' are not comoving.

Let's define the event to have space-time coordinates ( t , x , y , z ) {\displaystyle (t,x,y,z)\,} in system S and ( t , x , y , z ) {\displaystyle (t',x',y',z')\,} in S'. Then the Lorentz transformation specifies that these coordinates are related in the following way:

t = γ ( t v x c 2 ) {\displaystyle t'=\gamma \left(t-{\frac {vx}{c^{2}}}\right)}
x = γ ( x v t ) {\displaystyle x'=\gamma (x-vt)\,}
y = y {\displaystyle y'=y\,}
z = z {\displaystyle z'=z\,}

where γ = 1 1 v 2 / c 2 {\displaystyle \gamma ={\frac {1}{\sqrt {1-v^{2}/c^{2}}}}} is called the Lorentz factor and c {\displaystyle c\,} is the speed of light in a vacuum.

The y {\displaystyle y\,} and z {\displaystyle z\,} coordinates are unaffected, but the x {\displaystyle x\,} and t {\displaystyle t\,} axes are mixed up by the transformation. In a way this transformation can be understood as a hyperbolic rotation.

A quantity invariant under Lorentz transformations is known as a Lorentz scalar.

Simultaneity

From the first equation of the Lorentz transformation in terms of coordinate differences

Δ t = γ ( Δ t v Δ x c 2 ) {\displaystyle \Delta t'=\gamma \left(\Delta t-{\frac {v\Delta x}{c^{2}}}\right)}

it is clear that two events that are simultaneous in frame S (satisfying Δ t = 0 {\displaystyle \Delta t=0\,} ), are not necessarily simultaneous in another inertial frame S' (satisfying Δ t = 0 {\displaystyle \Delta t'=0\,} ). Only if these events are colocal in frame S (satisfying Δ x = 0 {\displaystyle \Delta x=0\,} ), will they be simultaneous in another frame S'.

Time dilation and length contraction

Writing the Lorentz Transformation and its inverse in terms of coordinate differences we get

Δ t = γ ( Δ t v Δ x c 2 ) {\displaystyle \Delta t'=\gamma \left(\Delta t-{\frac {v\Delta x}{c^{2}}}\right)}
Δ x = γ ( Δ x v Δ t ) {\displaystyle \Delta x'=\gamma (\Delta x-v\Delta t)\,}

and

Δ t = γ ( Δ t + v Δ x c 2 ) {\displaystyle \Delta t=\gamma \left(\Delta t'+{\frac {v\Delta x'}{c^{2}}}\right)}
Δ x = γ ( Δ x + v Δ t ) {\displaystyle \Delta x=\gamma (\Delta x'+v\Delta t')\,}

Suppose we have a clock at rest in the unprimed system S. Two consecutive ticks of this clock are then characterized by Δ x = 0 {\displaystyle \Delta x=0} . If we want to know the relation between the times between these ticks as measured in both systems, we can use the first equation and find

Δ t = γ Δ t {\displaystyle \Delta t'=\gamma \Delta t\,}

This shows that the time Δ t {\displaystyle \Delta t'} between the two ticks as seen in the 'moving' frame S' is larger than the time Δ t {\displaystyle \Delta t} between these ticks as measured in the rest frame of the clock. This phenomenon is called time dilation.

Similarly, suppose we have a measuring rod at rest in the unprimed system. In this system, the length of this rod is written as Δ x {\displaystyle \Delta x} . If we want to find the length of this rod as measured in the 'moving' system S', we must make sure to measure the distances x {\displaystyle x'} to the end points of the rod simultaneously in the primed frame S'. In other words, the measurement is characterized by Δ t = 0 {\displaystyle \Delta t'=0} , which we can combine with the fourth equation to find the relation between the lengths Δ x {\displaystyle \Delta x} and Δ x {\displaystyle \Delta x'} :

Δ x = Δ x γ {\displaystyle \Delta x'={\frac {\Delta x}{\gamma }}}

This shows that the length Δ x {\displaystyle \Delta x'} of the rod as measured in the 'moving' frame S' is shorter than the length Δ x {\displaystyle \Delta x} in its own rest frame. This phenomenon is called length contraction or Lorentz contraction.

These effects are not merely appearances; they are explicitly related to our way of measuring time intervals between 'colocal' events and distances between simultaneous events.

See also the twin paradox.

Causality and prohibition of motion faster than light

Diagram 2. Light cone

In diagram 2 the interval AB is 'time-like'; i.e., there is a frame of reference in which event A and event B occur at the same location in space, separated only by occurring at different times. If A precedes B in that frame, then A precedes B in all frames. It is hypothetically possible for matter (or information) to travel from A to B, so there can be a causal relationship (with A the cause and B the effect).

The interval AC in the diagram is 'space-like'; i.e., there is a frame of reference in which event A and event C occur simultaneously, separated only in space. However there are also frames in which A precedes C (as shown) and frames in which C precedes A. If it was possible for a cause-and-effect relationship to exist between events A and C, then logical paradoxes would result. For example, if A was the cause, and C the effect, then there would be frames of reference in which the effect preceded the cause. Another way of looking at it is that if there were a technology that allowed faster-than-light motion, it would also function as a time machine. Therefore, one of the consequences of special relativity is that (assuming causality is to be preserved as a logical principle), no information or material object can travel faster than light. On the other hand, the logical situation is not as clear in the case of general relativity, so it is an open question whether or not there is some fundamental principle that preserves causality (and therefore prevents motion faster than light) in general relativity.

Even without considerations of causality, there are other strong reasons why faster-than-light travel is forbidden by special relativity. For example, if a constant force is applied to an object for a limitless amount of time, then integrating F=dp/dt gives a momentum that grows without bound, but this is simply because p = m γ v {\displaystyle p=m\gamma v} approaches infinity as v approaches c. To an observer who is not accelerating, it appears as though the object's inertia is increasing, so as to produce a smaller acceleration in response to the same force. This behavior is in fact observed in particle accelerators.

Composition of velocities

Main article: Velocity-addition formula

If the observer for S {\displaystyle S\!} sees an object moving along the x {\displaystyle x\!} axis at velocity u x {\displaystyle u_{x}\!} , then the observer in the S {\displaystyle S'\!} system, a frame of reference moving at velocity v {\displaystyle v\!} in the x {\displaystyle x\!} direction with respect to S {\displaystyle S\!} , will see the object moving with velocity u x {\displaystyle u_{x}'\!} where

u x = u x v 1 u x v / c 2 . {\displaystyle u_{x}'={\frac {u_{x}-v}{1-u_{x}v/c^{2}}}.}

and

u x = u x + v 1 + u x v / c 2 . {\displaystyle u_{x}={\frac {u_{x}'+v}{1+u_{x}'v/c^{2}}}.}

This equation can be derived from the space and time transformations above. Notice that if the object is moving at the speed of light in the S {\displaystyle S\!} system (i.e. u x = c {\displaystyle u_{x}=c\!} ), then it will also be moving at the speed of light in the S {\displaystyle S'\!} system. Also, if both u x {\displaystyle u_{x}\!} and v {\displaystyle v\!} are small with respect to the speed of light, we will recover the intuitive Galilean transformation of velocities: u x u x + v . {\displaystyle u_{x}\approx u_{x}'+v\!.}

Mass, momentum, and energy

Main article: Conservation of mass in special relativity Main article: Conservation of energy § Modern physics

In addition to modifying notions of space and time, special relativity forces one to reconsider the concepts of mass, momentum, and energy, all of which are important constructs in Newtonian mechanics. Special relativity shows, in fact, that these concepts are all different aspects of the same physical quantity in much the same way that it shows space and time to be interrelated.

There are a couple of (equivalent) ways to define momentum and energy in SR. One method uses conservation laws. If these laws are to remain valid in SR they must be true in every possible reference frame. However, if one does some simple thought experiments using the Newtonian definitions of momentum and energy one sees that these quantities are not conserved in SR. One can rescue the idea of conservation by making some small modifications to the definitions to account for relativistic velocities. It is these new definitions which are taken as the correct ones for momentum and energy in SR.

Given an object of invariant mass m traveling at velocity v the energy and momentum are given (and even defined) by

E = γ m c 2 {\displaystyle E=\gamma mc^{2}\,\!}
p = γ m v {\displaystyle {\vec {p}}=\gamma m{\vec {v}}\,\!}

where γ (the Lorentz factor) is given by

γ = 1 1 β 2 {\displaystyle \gamma ={\frac {1}{\sqrt {1-\beta ^{2}}}}}

where β {\displaystyle \beta } is the velocity as a ratio of the speed of light. The term γ occurs frequently in relativity, and comes from the Lorentz transformation equations.

Relativistic energy and momentum can be related through the formula

E 2 ( p c ) 2 = ( m c 2 ) 2 {\displaystyle E^{2}-(pc)^{2}=(mc^{2})^{2}\,\!}

which is referred to as the relativistic energy-momentum equation.

For velocities much smaller than those of light, γ can be approximated using a Taylor series expansion and one finds that

E m c 2 + 1 2 m v 2 {\displaystyle E\approx mc^{2}+{\begin{matrix}{\frac {1}{2}}\end{matrix}}mv^{2}\,\!}
p m v {\displaystyle {\vec {p}}\approx m{\vec {v}}\,\!}

Barring the first term in the energy expression (discussed below), these formulas agree exactly with the standard definitions of Newtonian kinetic energy and momentum. This is as it should be, for special relativity must agree with Newtonian mechanics at low velocities.

Looking at the above formulas for energy, one sees that when an object is at rest (v = 0 and γ = 1) there is a non-zero energy remaining:

E r e s t = m c 2 {\displaystyle E_{rest}=mc^{2}\,\!}

This energy is referred to as rest energy. The rest energy does not cause any conflict with the Newtonian theory because it is a constant and, as far as kinetic energy is concerned, it is only differences in energy which are meaningful.

Taking this formula at face value, we see that in relativity, mass is simply another form of energy. In 1927 Einstein remarked about special relativity:

Under this theory mass is not an unalterable magnitude, but a magnitude dependent on (and, indeed, identical with) the amount of energy.

This formula becomes important when one measures the masses of different atomic nuclei. By looking at the difference in masses, one can predict which nuclei have extra stored energy that can be released by nuclear reactions, providing important information which was useful in the development of the nuclear bomb. The implications of this formula on 20th-century life have made it one of the most famous equations in all of science.

Relativistic mass

Introductory physics courses and some older textbooks on special relativity sometimes define a relativistic mass which increases as the velocity of a body increases. According to the geometric interpretation of special relativity, this is often deprecated and the term 'mass' is reserved to mean invariant mass and is thus independent of the inertial frame, i.e., invariant.

Using the relativistic mass definition, the mass of an object may vary depending on the observer's inertial frame in the same way that other properties such as its length may do so. Defining such a quantity may sometimes be useful in that doing so simplifies a calculation by restricting it to a specific frame. For example, consider a body with an invariant mass m moving at some velocity relative to an observer's reference system. That observer defines the relativistic mass of that body as:

M = γ m {\displaystyle M=\gamma m\!}

"Relativistic mass" should not be confused with the "longitudinal" and "transverse mass" definitions that were used around 1900 and that were based on an inconsistent application of the laws of Newton: those used f=ma for a variable mass, while relativistic mass corresponds to Newton's dynamic mass in which p=Mv and f=dp/dt.

Note also that the body does not actually become more massive in its proper frame, since the relativistic mass is only different for an observer in a different frame. The only mass that is frame independent is the invariant mass. When using the relativistic mass, the used reference frame should be specified if it isn't already obvious or implied. It also goes almost without saying that the increase in relativistic mass does not come from an increased number of atoms in the object. Instead, the relativistic mass of each atom and subatomic particle has increased.

Physics textbooks sometimes use the relativistic mass as it allows the students to utilize their knowledge of Newtonian physics to gain some intuitive grasp of relativity in their frame of choice (usually their own!). "Relativistic mass" is also consistent with the concepts "time dilation" and "length contraction".

Force

The classical definition of ordinary force f is given by Newton's Second Law in its original form:

f = d p / d t {\displaystyle {\vec {f}}=d{\vec {p}}/dt}

and this is valid in relativity.

Many modern textbooks rewrite Newton's Second Law as

f = M a {\displaystyle {\vec {f}}=M{\vec {a}}}

This form is not valid in relativity or in other situations where the relativistic mass M is varying.

This formula can be replaced in the relativistic case by

f = γ m a + γ 3 m v a c 2 v {\displaystyle {\vec {f}}=\gamma m{\vec {a}}+\gamma ^{3}m{\frac {{\vec {v}}\cdot {\vec {a}}}{c^{2}}}{\vec {v}}}

As seen from the equation, ordinary force and acceleration vectors are not necessarily parallel in relativity.

However the four-vector expression relating four-force F μ {\displaystyle F^{\mu }\,} to invariant mass m and four-acceleration A μ {\displaystyle A^{\mu }\,} restors the same equation form

F μ = m A μ {\displaystyle F^{\mu }=mA^{\mu }\,}

The geometry of space-time

SR uses a 'flat' 4-dimensional Minkowski space, which is an example of a space-time. This space, however, is very similar to the standard 3 dimensional Euclidean space, and fortunately by that fact, very easy to work with.

The differential of distance d σ {\displaystyle d\sigma } in cartesian 3D space is defined as:

d σ 2 = ( d x 1 ) 2 + ( d x 2 ) 2 + ( d x 3 ) 2 {\displaystyle d\sigma ^{2}=(dx^{1})^{2}+(dx^{2})^{2}+(dx^{3})^{2}}

where ( d x 1 , d x 2 , d x 3 ) {\displaystyle (dx^{1},dx^{2},dx^{3})} are the differentials of the three spatial dimensions. In the geometry of special relativity, a fourth dimension, time, is added multiplied by c, x 0 = c t {\displaystyle x^{0}=ct} so that the equation for the differential of distance becomes:

d s 2 = ( d x 0 ) 2 d σ 2 {\displaystyle ds^{2}=(dx^{0})^{2}-d\sigma ^{2}}
d s 2 = ( d x 0 ) 2 ( d x 1 ) 2 ( d x 2 ) 2 ( d x 3 ) 2 {\displaystyle ds^{2}=(dx^{0})^{2}-(dx^{1})^{2}-(dx^{2})^{2}-(dx^{3})^{2}}

This suggests what is in fact a profound theoretical insight as it shows that special relativity is simply a rotational symmetry of our space-time, very similar to rotational symmetry of Euclidean space. Just as Euclidean space uses a Euclidean metric, so space-time uses a Minkowski metric. According to Misner (1971 §2.3), ultimately the deeper understanding of both special and general relativity will come from the study of the Minkowski metric (described below) rather than a "disguised" Euclidean metric using ict as the time coordinate.

If we reduce the spatial dimensions to 2, so that we can represent the physics in a 3-D space

d s 2 = d c t 2 ( d x 1 ) 2 ( d x 2 ) 2 {\displaystyle ds^{2}=dct^{2}-(dx^{1})^{2}-(dx^{2})^{2}}

We see that the null geodesics lie along a dual-cone(c=1 in the image):

defined by the equation

d s 2 = 0 = d c t 2 ( d x 1 ) 2 ( d x 2 ) 2 {\displaystyle ds^{2}=0=dct^{2}-(dx^{1})^{2}-(dx^{2})^{2}}

or

( d x 1 ) 2 + ( d x 2 ) 2 = d c t 2 {\displaystyle (dx^{1})^{2}+(dx^{2})^{2}=dct^{2}}

Which is the equation of a circle with r=c*dt. If we extend this to three spatial dimensions, the null geodesics are the 4-dimensional cone:

d s 2 = 0 = d c t 2 ( d x 1 ) 2 ( d x 2 ) 2 ( d x 3 ) 2 {\displaystyle ds^{2}=0=dct^{2}-(dx^{1})^{2}-(dx^{2})^{2}-(dx^{3})^{2}}
( d x 1 ) 2 + ( d x 2 ) 2 + ( d x 3 ) 2 = d c t 2 {\displaystyle (dx^{1})^{2}+(dx^{2})^{2}+(dx^{3})^{2}=dct^{2}}

This null dual-cone represents the "line of sight" of a point in space. That is, when we look at the stars and say "The light from that star which I am receiving is X years old", we are looking down this line of sight: a null geodesic. We are looking at an event d = ( x 1 ) 2 + ( x 2 ) 2 + ( x 3 ) 2 {\displaystyle d={\sqrt {(x^{1})^{2}+(x^{2})^{2}+(x^{3})^{2}}}} meters away and d/c seconds in the past. For this reason the null dual cone is also known as the 'light cone'. (The point in the lower left of the picture below represents the star, the origin represents the observer, and the line represents the null geodesic "line of sight".)

The cone in the -t region is the information that the point is 'receiving', while the cone in the +t section is the information that the point is 'sending'.

The geometry of Minkowski space can be depicted using Minkowski diagrams, which are also useful in understanding many of the thought-experiments in special relativity.

Physics in spacetime

Here, we see how to write the equations of special relativity in a manifestly covariant form. The position of an event in spacetime is is not a four vector because simple translations change it in a different way than the cordinate differential dx however it can be given the same component format as a rank one contravariant four-vector:

x ν = ( c t , x , y , z ) {\displaystyle x^{\nu }=\left(ct,x,y,z\right)}

That is, x 0 = c t {\displaystyle x^{0}=ct} and x 1 = x {\displaystyle x^{1}=x} and x 2 = y {\displaystyle x^{2}=y} and x 3 = z {\displaystyle x^{3}=z} . Superscripts are contravariant indices in this section rather than exponents except when they indicate a square. Subscripts are covariant indices which also range from zero to three as with the spacetime gradient of a field φ:

0 ϕ = ϕ t , 1 ϕ = ϕ x , 2 ϕ = ϕ y , 3 ϕ = ϕ z . {\displaystyle \partial _{0}\phi ={\frac {\partial \phi }{\partial t}},\quad \partial _{1}\phi ={\frac {\partial \phi }{\partial x}},\quad \partial _{2}\phi ={\frac {\partial \phi }{\partial y}},\quad \partial _{3}\phi ={\frac {\partial \phi }{\partial z}}.}

Metric and tranformations of coordinates

Having recognised the four-dimensional nature of spacetime, we are driven to employ the Minkowski metric, η, given in components (valid in any inertial reference frame) as:

η α β = ( 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1 ) {\displaystyle \eta _{\alpha \beta }={\begin{pmatrix}1&0&0&0\\0&-1&0&0\\0&0&-1&0\\0&0&0&-1\end{pmatrix}}}

Its reciprocal is:

η α β = ( 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1 ) {\displaystyle \eta ^{\alpha \beta }={\begin{pmatrix}1&0&0&0\\0&-1&0&0\\0&0&-1&0\\0&0&0&-1\end{pmatrix}}}

Then we recognise that co-ordinate transformations between inertial reference frames are given by the Lorentz transformation matrix Λ. For the special case of motion along the x-axis, we have:

Λ μ ν = ( γ β γ 0 0 β γ γ 0 0 0 0 1 0 0 0 0 1 ) {\displaystyle \Lambda '^{\mu }{}_{\nu }={\begin{pmatrix}\gamma &-\beta \gamma &0&0\\-\beta \gamma &\gamma &0&0\\0&0&1&0\\0&0&0&1\end{pmatrix}}}

which is simply the matrix of a boost (like a rotation) between the x and t coordinates. Where μ indicates the row and ν indicates the column numbers. Also, β and γ are defined as:

β = v c ,   γ = 1 1 β 2 . {\displaystyle \beta ={\frac {v}{c}},\ \gamma ={\frac {1}{\sqrt {1-\beta ^{2}}}}.}

More generally, a transformation from one inertial frame (ignoring translations for simplicity) to another must satisfy:

η α β = η μ ν Λ μ α Λ ν β {\displaystyle \eta _{\alpha \beta }=\eta '_{\mu \nu }\Lambda '^{\mu }{}_{\alpha }\Lambda '^{\nu }{}_{\beta }\!}

where there is an implied summation of μ {\displaystyle \mu \!} and ν {\displaystyle \nu \!} from 0 to 3 on the right-hand side in accordance with the Einstein summation convention. The Poincaré group is the most general group of transformations which preserves the Minkowski metric and this is the physical symmetry underlying special relativity.

All proper physical quantities are given by tensors. So to transform from one frame to another, we use the well known tensor transformation law

Failed to parse (Conversion error. Server ("https://wikimedia.org/api/rest_") reported: "Cannot get mml. TeX parse error: Double exponent: use braces to clarify"): {\displaystyle \mathbf {T'_{\left}^{\left}=\Lambda '^{i_{1}}{}_{i_{1}}\Lambda '^{i_{2}}{}_{i_{2}}...\Lambda '^{i_{p}}{}_{i_{p}}\Lambda '_{j_{1}}{}^{j_{1}}\Lambda '_{j_{2}}{}^{j_{2}}...\Lambda '_{j_{q}}{}^{j_{q}}T_{\left}^{\left}} }

Where Λ j k j k {\displaystyle \Lambda '_{j_{k}}{}^{j_{k}}\!} is the reciprocal matrix of Λ j k j k {\displaystyle \Lambda ^{j_{k}}{}_{j_{k}}\!} .

If origins coincide when the origin clocls are synched to zero, we can transform the position of an event from an unprimed co-ordinate system S to a primed system S', using the transformation matrix

( c t x y z ) = x μ = Λ μ ν x ν = ( γ β γ 0 0 β γ γ 0 0 0 0 1 0 0 0 0 1 ) ( c t x y z ) = ( γ c t γ β x γ x β γ c t y z ) {\displaystyle {\begin{pmatrix}ct'\\x'\\y'\\z'\end{pmatrix}}=x'^{\mu '}=\Lambda '^{\mu }{}_{\nu }x^{\nu }={\begin{pmatrix}\gamma &-\beta \gamma &0&0\\-\beta \gamma &\gamma &0&0\\0&0&1&0\\0&0&0&1\end{pmatrix}}{\begin{pmatrix}ct\\x\\y\\z\end{pmatrix}}={\begin{pmatrix}\gamma ct-\gamma \beta x\\\gamma x-\beta \gamma ct\\y\\z\end{pmatrix}}}

which is the Lorentz transformation given above. All tensors transform by the same rule.

The squared length of the differential of the position four-vector d x μ {\displaystyle dx^{\mu }\!} constructed using

d s 2 = η μ ν d x μ d x ν = ( d c t ) 2 ( d x ) 2 ( d y ) 2 ( d z ) 2 {\displaystyle \mathbf {ds^{2}=\eta _{\mu \nu }dx^{\mu }dx^{\nu }=(dct)^{2}-(dx)^{2}-(dy)^{2}-(dz)^{2}\,} }

is an invariant. Being invariant means that it takes the same value in all inertial frames, because it is a (0 rank tensor), an invariant scalar, and so no Λ appears in its trivial transformation. Notice that when the line element d s 2 {\displaystyle ds^{2}} is positive that d τ = d s {\displaystyle d\tau =ds} is the differential of proper time, while when d s 2 {\displaystyle ds^{2}} is negative, d s {\displaystyle ds} is differential of the proper distance.

The primary value of expressing the equations of physics in a tensor form is that they are then manifestly invariant under the Poincaré group, so that we do not have to do a special and tedious calculation to check that fact. Also in constructing such equations we often find that equations previously thought to be unrelated are, in fact, closely connected being part of the same tensor equation.

Velocity and acceleration in 4D

Recognising other physical quantities as tensors also simplifies their transformation laws. First note that the velocity four-vector U is given by

U μ = d x μ d τ = ( γ γ v x γ v y γ v z ) {\displaystyle U^{\mu }={\frac {dx^{\mu }}{d\tau }}={\begin{pmatrix}\gamma \\\gamma v_{x}\\\gamma v_{y}\\\gamma v_{z}\end{pmatrix}}}

Recognising this, we can turn the awkward looking law about composition of velocities into a simple statement about transforming the velocity four-vector of one particle from one frame to another. U also has an invariant form:

U 2 = η ν μ U ν U μ = c 2 . {\displaystyle {\mathbf {U} }^{2}=\eta _{\nu \mu }U^{\nu }U^{\mu }=-c^{2}.}

So all velocity four-vectors have a magnitude of c. This is an expression of the fact that there is no such thing as being at coordinate rest in relativity: at the least, you are always moving forward through time. The acceleration 4-vector is given by A μ = d U μ / d τ {\displaystyle A^{\mu }=d{\mathbf {U} ^{\mu }}/d\tau } . Given this, differentiating the above equation by τ produces

2 η μ ν A μ U ν = 0. {\displaystyle 2\eta _{\mu \nu }A^{\mu }U^{\nu }=0.\!}

So in relativity, the acceleration four-vector and the velocity 4-vector are orthogonal.

Momentum in 4D

The momentum and energy combine into a covariant 4-vector:

p ν = m η ν μ U μ = ( E / c p x p y p z ) . {\displaystyle p_{\nu }=m\cdot \eta _{\nu \mu }U^{\mu }={\begin{pmatrix}E/c\\-p_{x}\\-p_{y}\\-p_{z}\end{pmatrix}}.}

where m is the invariant mass.

The invariant magnitude of the momentum 4-vector is:

p 2 = η μ ν p μ p ν = ( E / c ) 2 p 2 . {\displaystyle \mathbf {p} ^{2}=\eta ^{\mu \nu }p_{\mu }p_{\nu }=(E/c)^{2}-p^{2}.}

We can work out what this invariant is by first arguing that, since it is a scalar, it doesn't matter which reference frame we calculate it, and then by transforming to a frame where the total momentum is zero.

p 2 = ( E r e s t / c ) 2 = ( m c ) 2 . {\displaystyle \mathbf {p} ^{2}=(E_{rest}/c)^{2}=(m\cdot c)^{2}.}

We see that the rest energy is an independent invariant. A rest energy can be calculated even for particles and systems in motion, by translating to a frame in which momentum is zero.

The rest energy is related to the mass according to the celebrated equation discussed above:

E r e s t = m c 2 {\displaystyle E_{rest}=mc^{2}\,}

Note that the mass of systems measured in their center of momentum frame (where total momentum is zero) is given by the total energy of the system in this frame. It may not be equal to the sum of individual system masses measured in other frames.

Force in 4D

To use Newton's third law of motion, both forces must be defined as the rate of change of momentum with respect to the same time coordinate. That is, it requires the 3D force defined above. Unfortunately, there is no tensor in 4D which contains the components of the 3D force vector among its components.

If a particle is not traveling at c, one can transform the 3D force from the particle's co-moving reference frame into the observer's reference frame. This yields a 4-vector called the four-force. It is the rate of change of the above energy momentum four-vector with respect to proper time. The covariant version of the four-force is:

F ν = d p ν d τ = ( d E / d τ d p x / d τ d p y / d τ d p z / d τ ) {\displaystyle F_{\nu }={\frac {dp_{\nu }}{d\tau }}={\begin{pmatrix}-{dE}/{d\tau }\\{dp_{x}}/{d\tau }\\{dp_{y}}/{d\tau }\\{dp_{z}}/{d\tau }\end{pmatrix}}}

where τ {\displaystyle \tau \,} is the proper time.

In the rest frame of the object, the time component of the four force is zero unless the "invariant mass" of the object is changing in which case it is the negative of that rate of change times c. In general, though, the components of the four force are not equal to the components of the three-force, because the three force is defined by the rate of change of momentum with respect to coordinate time, i.e. d p d t {\displaystyle {\frac {dp}{dt}}} while the four force is defined by the rate of change of momentum with respect to proper time, i.e. d p d τ {\displaystyle {\frac {dp}{d\tau }}} .

In a continuous medium, the 3D density of force combines with the density of power to form a covariant 4-vector. The spatial part is the result of dividing the force on a small cell (in 3-space) by the volume of that cell. The time component is the negative of the power transferred to that cell divided by the volume of the cell. This will be used below in the section on electromagnetism.

Relativity and unifying electromagnetism

The Lorentz transformation of the electric field of a moving charge into a non-moving observer's reference frame results in the appearance of a mathematical term commonly called the magnetic field. Conversely, the magnetic field generated by a moving charge disappears and becomes a purely electrostatic field in a comoving frame of reference. Maxwell's equations are thus simply an empirical fit to special relativistic effects in a classical model of the Universe. As electric and magnetic fields are reference frame dependent and thus intertwined, one speaks of electromagnetic fields. Special relativity provides the transformation rules for how an electromagnetic field in one inertial frame appears in another inertial frame.

Electromagnetism in 4D

Main article: Formulation of Maxwell's equations in special relativity

Maxwell's equations in the 3D form are already consistent with the physical content of special relativity. But we must rewrite them to make them manifestly invariant.

The charge density ρ {\displaystyle \rho \!} and current density [ J x , J y , J z ] {\displaystyle \!} are unified into the current-charge 4-vector:

J μ = ( ρ J x J y J z ) {\displaystyle J^{\mu }={\begin{pmatrix}\rho \\J_{x}\\J_{y}\\J_{z}\end{pmatrix}}}

The law of charge conservation becomes:

μ J μ = 0. {\displaystyle \partial _{\mu }J^{\mu }=0.\!}

The electric field [ E x , E y , E z ] {\displaystyle \!} and the magnetic induction [ B x , B y , B z ] {\displaystyle \!} are now unified into the (rank 2 antisymmetric covariant) electromagnetic field tensor:

F μ ν = ( 0 E x E y E z E x 0 B z B y E y B z 0 B x E z B y B x 0 ) {\displaystyle F_{\mu \nu }={\begin{pmatrix}0&-E_{x}&-E_{y}&-E_{z}\\E_{x}&0&B_{z}&-B_{y}\\E_{y}&-B_{z}&0&B_{x}\\E_{z}&B_{y}&-B_{x}&0\end{pmatrix}}}

The density of the Lorentz force f μ {\displaystyle f_{\mu }\!} exerted on matter by the electromagnetic field becomes:

f μ = F μ ν J ν . {\displaystyle f_{\mu }=F_{\mu \nu }J^{\nu }.\!}

Faraday's law of induction and Gauss's law for magnetism combine to form:

λ F μ ν + μ F ν λ + ν F λ μ = 0. {\displaystyle \partial _{\lambda }F_{\mu \nu }+\partial _{\mu }F_{\nu \lambda }+\partial _{\nu }F_{\lambda \mu }=0.\!}

Although there appear to be 64 equations here, it actually reduces to just four independent equations. Using the antisymmetry of the electromagnetic field one can either reduce to an identity (0=0) or render redundant all the equations except for those with λ,μ,ν = either 1,2,3 or 2,3,0 or 3,0,1 or 0,1,2.

The electric displacement [ D x , D y , D z ] {\displaystyle \!} and the magnetic field [ H x , H y , H z ] {\displaystyle \!} are now unified into the (rank 2 antisymmetric contravariant) electromagnetic displacement tensor:

D μ ν = ( 0 D x D y D z D x 0 H z H y D y H z 0 H x D z H y H x 0 ) {\displaystyle {\mathcal {D}}^{\mu \nu }={\begin{pmatrix}0&D_{x}&D_{y}&D_{z}\\-D_{x}&0&H_{z}&-H_{y}\\-D_{y}&-H_{z}&0&H_{x}\\-D_{z}&H_{y}&-H_{x}&0\end{pmatrix}}}

Ampere's law and Gauss's law combine to form:

ν D μ ν = J μ . {\displaystyle \partial _{\nu }{\mathcal {D}}^{\mu \nu }=J^{\mu }.\!}

In a vacuum, the constitutive equations are:

μ 0 D μ ν = η μ α η ν β F α β . {\displaystyle \mu _{0}{\mathcal {D}}^{\mu \nu }=\eta ^{\mu \alpha }\eta ^{\nu \beta }F_{\alpha \beta }.}

Antisymmetry reduces these 16 equations to just six independent equations.

The energy density of the electromagnetic field combines with Poynting vector and the Maxwell stress tensor to form the 4D stress-energy tensor. It is the flux (density) of the momentum 4-vector and as a rank 2 mixed tensor it is:

T α π = F α β D π β 1 4 δ α π F μ ν D μ ν {\displaystyle T_{\alpha }^{\pi }=F_{\alpha \beta }{\mathcal {D}}^{\pi \beta }-{\frac {1}{4}}\delta _{\alpha }^{\pi }F_{\mu \nu }{\mathcal {D}}^{\mu \nu }}

where δ α π {\displaystyle \delta _{\alpha }^{\pi }} is the Kronecker delta. When upper index is lowered with η, it becomes symmetric and is part of the source of the gravitational field.

The conservation of linear momentum and energy by the electromagnetic field is expressed by:

f μ + ν T μ ν = 0 {\displaystyle f_{\mu }+\partial _{\nu }T_{\mu }^{\nu }=0\!}

where f μ {\displaystyle f_{\mu }\!} is again the density of the Lorentz force. This equation can be deduced from the equations above (with considerable effort).

Status

Main article: Status of special relativity

Special relativity is accurate only when gravitational potential is much less than c; in a strong gravitational field one must use general relativity (which becomes special relativity at the limit of weak field). At very small scales, such as at the Planck length and below quantum effects must be taken into consideration resulting in quantum gravity. However, at macroscopic scales and in the absence of strong gravitational fields, special relativity is experimentally tested to extremely high degree of accuracy (10) and thus accepted by the physics community. Experimental results which appear to contradict it are not reproducible and are thus widely believed to be due to experimental errors.

Because of the freedom one has to select how one defines units of length and time in physics, it is possible to make one of the two postulates of relativity a tautological consequence of the definitions, but one cannot do this for both postulates simultaneously, as when combined they have consequences which are independent of one's choice of definition of length and time.

Special relativity is mathematically self-consistent, and it is an organic part of all modern physical theories, most notably quantum field theory, string theory, and general relativity (in the limiting case of negligible gravitational fields).

Newtonian mechanics mathematically follows from special relativity at small velocities (compared to the speed of light) - thus Newtonian mechanics can be considered as a special relativity of slow moving bodies. See Status of special relativity for a more detailed discussion.

A few key experiments can be mentioned that led to special relativity:

  • The Trouton-Noble experiment showed that the torque on a capacitor is independent on position and inertial reference frame — such experiments led to the first postulate
  • The famous Michelson-Morley experiment gave further support to the postulate that detecting an absolute reference velocity was not achievable. It should be stated here that, contrary to many alternative claims, it said little about the invariance of the speed of light with respect to the source and observers velocity, as both source and observer were travelling together at the same velocity at all times.

A number of experiments have been conducted to test special relativity against rival theories. These include:

In addition, particle accelerators run almost every day somewhere in the world, and routinely accelerate and measure the properties of particles moving at near lightspeed. Many effects seen in particle accelerators are highly consistent with relativity theory and are deeply inconsistent with the earlier Newtonian mechanics.

See also

People: Arthur Eddington | Albert Einstein | Hendrik Lorentz | Hermann Minkowski | Bernhard Riemann | Henri Poincaré | Alexander MacFarlane | Harry Bateman | Robert S. Shankland | Walter Ritz
Relativity: Theory of relativity | principle of relativity | general relativity | frame of reference | inertial frame of reference | Lorentz transformations | Bondi k-calculus | Einstein synchronisation | Rietdijk-Putnam Argument
Physics: Newtonian Mechanics | spacetime | speed of light | simultaneity | physical cosmology | Doppler effect | relativistic Euler equations | Aether drag hypothesis | Moving magnet and conductor problem
Math: Minkowski space | four-vector | world line | light cone | Lorentz group | Poincaré group | geometry | tensors | split-complex number
Philosophy: actualism | convensionalism | formalism

External links

Explanations of special relativity

Visualization

Other

References

  1. Edwin F. Taylor and John Archibald Wheeler (1992). Spacetime Physics: Introduction to Special Relativity. W. H. Freeman. ISBN 0-7167-2327-1.
  2. E. J. Post (1962). Formal Structure of Electromagnetics: General Covariance and Electromagnetics. Dover Publications Inc. ISBN 0-486-65427-3.
  3. The number of works is vast, see as example:
    Sidney Coleman, Sheldon L. Glashow, Cosmic Ray and Neutrino Tests of Special Relativity, Phys.Lett. B405 (1997) 249-252, online
    An overview can be found on this page of John Baez

Textbooks

  • Einstein, Albert. "Relativity: The Special and the General Theory".
  • Silberstein, Ludwik (1914) The Theory of Relativity.
  • Tipler, Paul; Llewellyn, Ralph (2002). Modern Physics (4th ed.). W. H. Freeman Company. ISBN 0-7167-4345-0
  • Schutz, Bernard F. A First Course in General Relativity, Cambridge University Press. ISBN 0-521-27703-5
  • Taylor, Edwin, and Wheeler, John (1992). Spacetime Physics (2nd ed.). W.H. Freeman and Company. ISBN 0-7167-2327-1
  • Einstein, Albert (1996). The Meaning of Relativity. Fine Communications. ISBN 1-56731-136-9
  • Geroch, Robert (1981). General Relativity From A to B. University of Chicago Press. ISBN 0-226-28864-1
  • Logunov, Anatoly A. (2005) Henri Poincaré and the Relativity Theory (transl. from Russian by G. Pontocorvo and V. O. Soleviev, edited by V. A. Petrov) Nauka, Moscow .
  • Misner, Charles W. (1971). Gravitation. San Francisco: W. H. Freeman & Co. ISBN 0-7167-0334-3. {{cite book}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  • Post, E.J., Formal Structure of Electromagnetics: General Covariance and Electromagnetics, Dover Publications Inc. Mineola NY, 1962 reprinted 1997.

Journal articles

  • On the Electrodynamics of Moving Bodies, A. Einstein, Annalen der Physik, 17:891, June 30, 1905 (in English translation)
  • Wolf, Peter and Gerard, Petit. "Satellite test of Special Relativity using the Global Positioning System", Physics Review A 56 (6), 4405-4409 (1997).
  • Will, Clifford M. "Clock synchronization and isotropy of the one-way speed of light", Physics Review D 45, 403-411 (1992).
  • Rizzi G. et al, "Synchronization Gauges and the Principles of Special Relativity", Found.Phys. 34 (2005) 1835-1887
  • Alvager et al., "Test of the Second Postulate of Special Relativity in the GeV region", Physics Letters 12, 260 (1964).
Major branches of physics
Divisions
Approaches
Classical
Modern
Interdisciplinary
Related

Template:Link FA Template:Link FA Template:Link FA Template:Link FA Template:Link FA

Category: