ABOUT EHRENFEST PARADOX AND SAGNAC EXPERIMENT

A STRICTLY SPECIAL-RELATIVISTIC DISCUSSION OF EHRENFEST PARADOX AND SAGNAC EXPERIMENT SUGGESTS ANOTHER POSSIBLE EXPERIMENTAL FALSIFICATION OF SPECIAL RELATIVITY

Author’s address:

Umberto Bartocci

Dipartimento di Matematica

Università - 06100 Perugia,

Italy

(bartocci@dipmat.unipg.it - http://www.dipmat.unipg.it/~bartocci)

September, 1999; Revised, July 2000

Abstract - This paper suggests to utilize Sagnac effect in order to introduce a possible definition of the "proper length" L’ of a rotating circumference C’, and the phenomenon of relativistic aberration in order to define the "proper diameter" D’ of C’, in such a way that L’/D’ = p , and Eherenfest paradox disappears. At last, an "experiment" is proposed: is it possible to test light’s relativistic aberration in a terrestrial laboratory?

Résumé - On propose ici d’utiliser l’effect de Sagnac pour mesurer la "longueur propre" L’ d’une circonférence roulante, et l’aberration relativistique pour mesurer le "diamètre propre" D’ de C’. L’on obtient enfin que L’/D’ = p , et le paradoxe d’Ehrenfest va donc disparaître. Enfin, on propose un’expérience: est-il possible de vérifier en laboratoire l’aberration relativistique de la lumière?

Key words - Special Relativity, Rotating Platform, Accelerated Observers, Sagnac Effect, Light’s Aberration.

1 - A correct statement of the paradox

It was already in 1909 that Paul Ehrenfest⁽¹⁾ pointed out the difficulties that Special Relativity (SR) has in the definition of "rigid body", by means of his famous question concerning the length’s variation of a rotating circumference. Since then, countless attempts of "explanation" have been given, and to the whole argument of the "rotating platform" some attention is dedicated until today^(2),(3), notwithstanding it has been apparently set-up by deeper analyses of SR, and rigorous foundations of the concept of "relativistic rigid body"^(4),(5). Many of these explanations introduce General Relativity, or deal with the concrete physical structure of the spinning disk, then introducing terms as: "elastic dilations", "radial stresses", "bendings of the disk’s surface", etc.. These arguments appear to the present author rather unsatisfactory, since they fail to give a "solution" of the question in the same conceptual frame in which it was expressed, namely in a purely geometrical-kinematical set-up¹.

First of all, let us state Ehrenfest’s dilemma, in the same "rough terms" which are currently used in its popularizations^(7),(8).

In an inertial reference frame (IRF) W in the 3-dimensional Minkowski space-time M, whose coordinates are (x,y,t)², let us introduce a circular platform P’ of radius R, rotating in W at some angular speed w different from zero, but whose centre A º (0,0,t) is at rest in W . Then, think of an (accelerated) "observer" A’ placed in the rim C’ of P’, whose speed with respect to (from now on: wrt) W will be v = w R. We can suppose for instance that A’ is defined by the motion equations:

x = Rcos(w t), y = Rsin(w t)

(all physical and mathematical quantities introduced above are defined wrt the given IRF, which is indeed a "privileged" IRF wrt P’).

Well, if L’ is the length of C’ as seen by A’ (in this paradox, L’ will play the role of the "true" length of C’, or of the "proper" length of C’), what is the relation between the length L of C’ in W , and L’?

In order to answer to this question, one generally suggests to "regularly" divide C’ in n parts C’_i , i=0,...,n-1 , each one of "proper length" L’_i , in such a way that L’ = S (L’_i) . Then, in a given instant t* (once again, wrt W !), if n is large enough, one could think that each one of these C’_i could be approximatively considered as "inertial", at least in a "small" neighborhood of t*. Let us call W _i(t*) the IRFs in which these C’_i can be considered at rest, in that instant t* . Then, because of the well known relativistic length contraction, in the passage from W _i(t*) to W , each length L_i of C’_i wrt W should be equal to the corresponding proper length L’_i of C’_i in W _i(t*), multiplied by the shrinking factor Ö (1-v²). Thus one would get: L_i = L’_iÖ(1-v²) , and this would imply, since L = S (L_i) :

L = S (L_i) = S [L’_iÖ(1-v²)] = L’Ö (1-v²) (1) .

Furthermore, if we introduce the proper radius R’ of C’, or of P’ (once again, the radius of the platform as seen by A’), we should have no length contraction at all for R’, since the radius’s motion is transversal to the motion of all W _i(t*) (as usual, wrt W ), and then:

R’ = R (2).

From (1) and (2), Ehrenfest paradox follows immediately, since one should expect:

L = 2p R , L’ = 2p R’ Þ L = L’ (applying (2)) (3),

which is incompatible with (1).

The first solution of the paradox consists in rejecting the second identity in (3). One suggested to write instead: L’ = 2p ’R’ , where p ’ is the convenient value of "p " in the "geometry of the rotating platform". It would then follow, from (1), that:

L = 2p R = L’Ö (1-v²) = 2p ’R’Ö (1-v²) Þ p = p ’Ö (1-v²) .

In conclusion, the value of p in the rotating platform’s geometry would seem to be different - and greater - from the "ordinary" one. This was interpreted by asserting that the geometry of P’ should be considered, in some sense, a non-euclidean geometry³.

Let us start our purely "geometrical" discussion of this riddle showing how the common argument which implies (1) could be rejected. As a matter of fact, the previous deduction of this identity can be criticized at least at a double level of understanding.

The first one, is that those "proper lengths" L’_i of C’_i , namely the lengths of C’_i as seen by A’ - whatever this expression could exactly mean! - do not necessarily coincide with the lengths L _i of the C’_i in the IFRs W _i(t*). These C’_i are indeed not at rest wrt any IFR containing another one of them, but are moving at different speeds. For this reason, leaving for the moment apart problems of synchronization⁴, if C’₀ is that part of C’ corresponding to A’ itself, and W ₀(t*) the associated IFR, all proper lengths L _i , i = 1,...,n-1 , will be seen by W ₀(t*) contracted by a shrinking factor Ö (1-v_i²) of the same kind as before, where v_i is now the relative speed between W _i(t*) and W ₀(t*) (we shall come back to this argument in the next section). That is to say, the "rough" argument which led to (1) would have been correct only in the case of the famous Einstein’s train, in which all passengers are at rest, one wrt to the other, in the same IFR. In the case of a rotating platform, on the contrary, this is no longer true, at least from the point of view of W ₀(t*), and here it comes the second level of understanding of the paradox.

The real problem one has to face, does concern the exact meaning of expressions like: "as seen by some observer", "measured in the platform system", and so on.

As a matter of fact, in order to define the "proper length" L’, which is the key of the whole paradox, first of all one should discuss how length measures can be introduced in SR in a general coordinate system, "adapted" to some observer field X⁵.

Because of the particular nature of "space" and "time" in relativity, general coordinates (X,Y,T) give spatial length measures of "objects", and "trajectories", when the ds² admits a splitting of the kind: ds² = ds ² + gdT² , where ds ² is a positive definite quadratic form in (X,Y) (we can say that in this case the coordinate system is orthogonal)⁶. Moreover, one knows that it is impossible to find a coordinate time T which coincides with the proper time of all observers of X, unless X is an inertial field of observers (geodesic observers), and (X,Y,T) are the familiar Lorentz coordinates. That is to say, if one leaves the usual Lorentz coordinates in M, and introduces general ones (even orthogonal in the sense above specified), then the "time" in these systems cannot be always measured by "clocks"⁷.

This proves that, if one wants to define the proper length we are investigating, first of all he must give up this requirement. But this is not the only trouble that one meets in trying to introduce a "suitable" coordinate system associated to the platform⁸ (that is to say, a system in which all, or "most", observers in the platform P’, or just in C’, are at rest). In order to carry on a rigorous analysis, one has to introduce first the observer field U, defined as the field of the 3-velocities of all observers:

x = r cos(w t+q ), y = r sin(w t+q ) , -p £ q < p , (4)

for any r : R-e < r < R+e , and for some "small" e ⁹. Then, one must check whether is it possible, or not (perhaps even introducing some "restriction" of U, either depending on r , on q , or on t), to find an orthogonal coordinate system adapted to this field U¹⁰. Well, each one of these 3-velocities, let us call it u, has indeed an infinitesimal rest-space, defined as the 2-dimensional linear subspace of M Lorentz-orthogonal to u, and one could think, to begin with, to define L’ as the length of the section of this rest-space, in some given instant, with the cylinder: x² + y² = R² , which is the surface corresponding to all observers on C’¹¹.

Making computations in our IFR W (that is to say, using the coordinates (x,y,t)), in the instant t = t*, the rest-space corresponding to the observer A’ is defined by the equation:

-vsin(w t*)(x-Rcos(w t*)) + vcos(w t*)(y-Rsin(w t*)) - (t-t*) = 0 (5)

and the section we are looking for is the ellipse of parametric equations:

l = Rcos(u) , m = Rsin(u),

in the 2-dimensional linear sub-space of parametric equations:

x = l , y = m , t = -vsin(w t*)l + vcos(w t*)m + t* .

These equations give for the quadratic form ds ² the expression:

ds ² = g₁₁dl ² + 2g₁₂dl dm + g₂₂dm ² = (1-v²sin²(w t*))dl ² +

+ 2v²sin(w t*)cos(w t*)dl dm + (1-v²cos²(w t*))dm ² .

The required length L’ is then given by the formula:

L’ = R_[0,2p
]Ö[(1-v²sin²(w t*))sin²(u)+

-2v²sin(w t*)cos(w t*)sin(u)cos(u)+(1-v²cos²(w t*))cos²(u)]du =

= R_[0,2p
]Ö(1-v²cos²(u-w t*))du = R_[0,2p
]Ö(1-v²cos²(u))du .

This identity shows at least that L’ should indeed rather be smaller than greater than L = 2p R, which is in better qualitative agreement with our previous considerations. For instance, for v << 1 (remember that, in the actual notations, v = v/c), one gets the following approximation, up to second order in v:

L’ » R_[0,2p
][(1-v²cos²(u)/2]du = R(2p -p v²/2) = 2p R(1-v²/4) ,

which is quite different from the analogous one coming from (1):

L’ » 2p R/(1-v²/2) » 2p R(1+v²/2) (6) .

Apart that, the previous value of L’ cannot be truly endowed with any "physical meaning", because, due to the curvature of A’, the rest-spaces (5) (when t* varies), are not parallel in M, and of course this is "bad", since we want that these spaces would correspond to "simultaneity spaces": T = constant, for some coordinate time T. This implies that one should at least exclude the intersection of these spaces from any possible domain of the coordinate system we are looking for (at least in the case of flat rest-spaces), and this would not have painful consequences for our purposes only if these intersections, which are lines, would not meet our cylinder, but a straightforward computation shows that this is not the case!

The question becomes then to see whether these rest-spaces could be integrated in order to give a global rest-subvariety, possibly not a flat one, but a straightforward computation shows that the vector field U (or any one of its "restrictions") is not irrotational, and this is unfortunately a necessary and sufficient condition for the required integrability¹². In other words, it is impossible to find a coordinate system, with the required properties, which would "include" the whole, or even part, of C’.

At this point one could think of playing other "tricks", for instance to give up the demand of including in the coordinate system we are looking for "too many" rotating observers on P, and just to concentrate on C’, trying to find a suitable "extension" of the vector field above (first restricted to the circumference C’, that is to say, for r = R). Even in this case, one finds difficulties¹³, as one would meet even trying to introduce any "proper" measure of the radius of C’ (of P’), but let us pass over this point¹⁴. The truth is that it is not possible to univocally introduce in SR any "coordinate system associated to the platform"¹⁵, and that direct¹⁶ rigorous definitions of both L’ and R’ are impossible.

Summing up, one could possibly reject (which is in some sense a "solution"!) Ehrenfest’s argument, saying that L’ and R’ cannot be suitably (univocally) defined in SR. But instead of doing that, we think rather instructive, instead, to look for a possible alternative definition of these quantities. We shall show that it seems plausible to introduce these definitions in such a way that the relation between L and L’ should be the "inverse" of (1) (and (6)), namely:

L’ = LÖ (1-v²) » L(1-v²/2) = 2p R(1-v²/2) (7)

and that the relation between R and R’ could be, instead of (2):

R’ = RÖ (1-v²) (8).

The first shrinking will be interpreted as a consequence of time dilation, rather than of length contraction, and the second one of relativistic light’s aberration. In conclusion, there would not be an Ehrenfest paradox anymore, since, with the suggested definitions:

L’/R’ = L/R = 2p !

2 - A possible connection with Sagnac experiment

The definition we are looking for, will be obtained by introducing another widely discussed argument, the Sagnac experiment⁽¹⁰⁾, which even nowadays somebody believes, but erroneously, a confutation of SR⁽¹¹⁾.

As a matter of fact, even if it is never part of a suitable coordinate system in which it is at rest, A could obtain a measure of L’, just by means of its own unique clock, in the following way. Suppose that A’ sends, at some instant t ₀ of its proper time, two light’s beams along C’, in the two opposite directions. The two beams will cover all the length of C’, and then will come back to A’, of course not simultaneously, both wrt A’ and wrt W . From W ’s point of view, the computation is quite easy: the forward beam will come back to A’ after a time interval: D t_I = 2p R/(1-v), the backward beam after a time interval: D t_B = 2p R/(1+v).

The ratio k between these two time intervals, k = D t_I/D t_B, is the so-called Sagnac effect, and it is actually a quantity greater that 1, depending on the speed v of A’ wrt W (or, which is the same, wrt the centre A of the platform).

From the point of view of the proper time t of A’¹⁷, the two corresponding proper time intervals, D t _I and D t _B, will be equal to:

D t _I = D t_IÖ(1-v²) , D t _B = D t_BÖ(1-v²) (9) .

These identities imply that: the Sagnac effect k is the same, either as seen by A, or as seen by A’.

As a consequence, A’ can indeed realize that the platform is rotating, with no contradiction at all with the I Postulate of SR¹⁸, and can even measure its "absolute" rotational speed (namely, wrt the centre A):

v = (k-1)/(k+1) (10).

Well, from (9), and from the above values for D t_I and D t_B, we have:

D t _I = [2p RÖ (1-v²)]/(1-v) , D t _B = [2p RÖ (1-v²)]/(1+v) (11) .

From these identities, we could at last conclude that¹⁹:

- 2p RÖ (1-v²) could be defined as the "proper length" of C’ (the length of C’ as seen by A’);

- according to this definition, the average light’s speeds, again wrt A’, forward and backward, are respectively equal to the "classically expected values": c-v and c+v²⁰.

Remark 1 - It is perhaps curious to point out that precisely this same reasoning gives also a physical argument in favour of choice (1)! Use the so called "radar method", and define the distance of something X which is far from any given observer Y as the half of the proper time that light spends in order to go from Y to X, and then to come back (remember that in our notations is c = 1). If we use this definition in the case of the rotating circular platform, we find that the proper time interval that a light’s beam spends going from A’ to A’ itself, along the circumference in some direction, plus the time which is spent going back, from A’ to A’ again, but in the other direction, is equal to: [2p RÖ (1-v²)]/(1-v) + [2p RÖ (1-v²)]/(1+v) =

= [4p RÖ (1-v²)]/(1-v²) = 4p R/Ö (1-v²)]

and the half of this value is exactly (1).

It is rather important to remark that there are even two more independent arguments which can justify the previous definition. The first one goes as follows. Think of a circumference C "strictly contiguous" to C’ (same centre A, same radius R), but which does not rotate (in W ) (C is quite a different observer field than C’!). Suppose to choose an "observer" B Î C (with a slight abuse of notation!), which has the role of indicating to A’ when it has made a whole rotation (when B comes back to A’, as A’ sees it!). From the point of view of the clock of B, the event: coming back of A’, will happen at time intervals equal to 2p R/v . From the point of view of the clock of A’, instead, by time dilation, the corresponding time intervals will be equal to [2p RÖ (1-v²)]/v, which is perfectly compatible with our suggested definition (do not forget that A’ is indeed able to evaluate its W -speed v by means of the Sagnac effect k)²¹.

Remark 2 - The previous argument shows that the shrinking (7) could be interpreted indeed as nothing else but as the "ordinary" length contraction of the proper length of C as seen by the (really moving) oberver A’. That is to say, if A’ meets some trouble in measuring C’, he has less difficulties in measuring C!

It is very instructive to show that the approximation given always in formula (7), up to second order in v: L’ » 2p R(1-v²/2), can be deduced also with a quite different purely geometrical-kinematical procedure. The idea is to start from those single "local lengths" L _i introduced in section 1, and then to modify them according to our previous considerations, before proposing their sum as a possible value of the circumferences’s length "as seen by an observer on it"²².

To this purpose, let us introduce the following n observers on C:

x = Rcos(w t+2p i/n), y = Rsin(w t+2p i/n) , i = 0,1,...,n-1 ,

and for each one of these, in a given W -instant t = t*, the IFRs that we have called W _i(t*). Then, the IFR W ₀(t*) will be the one associated to A’, and in order to evaluate L’ as seen by A’, as we have already said, we must add to the value 2p R/[nÖ (1-v²)], which is the length in W ₀(t*) of that part of circumference "near" A’, the other n-1 analogous values 2p R/[nÖ (1-v²)], multiplied by the shrinking factor Ö (1-v_i²), where v_i is the relative speed between W _i(t*) and W ₀(t*):

L’ = lim { S [2p RÖ (1-v_i²)]/[nÖ (1-v²)]} (12)

where the sum is meant from i = 0 to i = n-1, and the limit must be taken for n ® ¥ .

In order to compute v_i , we must find the coordinate transformations F_i connecting W ₀(t*) and W _i(t*), F_i : (x°,y°,t°) ® (xⁱ,yⁱ,tⁱ) . If one introduces the transformations:

G_i : Xⁱ = -xsin(2p i/n)+ycos(2p i/n), Yⁱ = xcos(2p i/n)-ysin(2p i/n) , t = t

H_i : xⁱ = (Xⁱ-vt)/Ö (1-v²), yⁱ=Yⁱ, tⁱ=(t-vXⁱ)/Ö (1-v²) ,

then F_i is the product of the following ones (we can obviously confine ourselves to the case t* = 0):

(H₀)^-1 : (x°,y°,t°) ® (X⁰,Y⁰,t) , (G₀)^-1 : (X°,Y°,t) ® (x,y,t) ,

G_i : (x,y,t) ® (Xⁱ,Yⁱ,t) , H_i : (Xⁱ,Yⁱ,t) ® (xⁱ,yⁱ,tⁱ) ;

F_i = H_i G_i (G₀)^-1 (H₀)^-1 :

x^iÖ(1-v²) = (Y°-R)sin(2p i/n) + X°cos(2p i/n) - vt =

= (y°-R)sin(2p i/n) + (x°+vt°)cos(2p i/n)/Ö (1-v²) - v(t°+vx°)/Ö (1-v²)

yⁱ = (Y°-R)cos(2p i/n) - X°sin(2p i/n) + R =

= (y°-R)cos(2p i/n) + (x°+vt°)sin(2p i/n)/Ö (1-v²) + R

t^iÖ(1-v²) = t-v((Y°-R)sin(2p i/n) + X°cos(2p i/n)) =

= (t°+vx°)/Ö (1-v²) - v((y°-R)sin(2p i/n)+(x°+vt°)cos(2p i/n))/Ö (1-v²)).

From these equations one can get the origin’s motion, wrt the time t°, and then the precise value of v_i . The corresponding rigorous expression is rather complicated, but one can at least compute its approximation, up to second order in v. One gets:

v_i » v(cos(4p i/n)-cos(2p i/n), sin(4p i/n)-sin(2p i/n)) (13),

and (12) becomes:

L’ = lim { S [2p RÖ (1-v_i²)]/[nÖ (1-v²)]} »

» lim { S [2p RÖ (1-2v²+2v²cos(2p i/n))]/[nÖ (1-v²)]} »

» lim { S [2p R(1-v²+v²cos(2p i/n))]/[n(1-v²/2)]} »

» lim { S [2p R(1-v²+v²cos(2p i/n))(1+v²/2)]/n}

» lim { S [2p R(1+v²/2-v²+v²cos(2p i/n))]/n} =

= 2p R(1-v²/2) + lim { Rv^2S[2p cos(2p i/n))]/n} ,

which implies exactly (7), as S [2p cos(2p i/n)]/n is an integral sum of the differential form cos(x)dx !

This could be the end of our discussion, since A’ could ascribe the apparent shrinking (7) of C’ (or of C) to its own time dilation, and then alter correspondingly its "longitudinal" length measures in such a way that no changes would be required at all (wrt to the corresponding measures in W ). All the same, in the next section we shall discuss the question concerning the "transversal" length measures, once again as seen by A’. That is to say, we shall discuss also the validity of (2), which has been until now untouched!

3 - A possible connection with relativistic aberration

We shall now examine another interesting feature of Ehrenfest paradox. One should really accept the identity R’ = R ? Namely, that the real diameter D = 2R of C’ (or of P’, or of C), as measured in W (or by A), is equal to the apparent diameter of C’, the one seen by A’?²³

According to our previous comments, certain quantities cannot be "really" measured by A’, and one should always perform a careful mathematical analysis in order to show whether or not some values could be precisely (and univocally) defined. Anyway, we can introduce in our "game" the following "physical" considerations. Since the W -diameter 2R of C is the distance between a "point" B in C and its antipodal point B*, 2R = d(B,B*), one could ask: which should be thought as the "antipodal point" of B, as seen by A’? Would it be the same B* we have introduced before, or another point B°? Then, we shall compute the "transversal distance" between B and B° (just the ordinary distance in W , d(B,B°)), and we propose to call it the apparent diameter of C (we shall denote it by D’).

Well, in order to answer to this question, we can suppose that, in some instant t*, for instance when A’ is exactly in front of the previous B, A’ sends a mono-directional light’s beam (a photon) "towards the centre of the platform", namely in the orthogonal direction (in W ²⁴) to the tangent of C. Then, we could look for the point B° in C which would "receive" this photon. It is clear that, according to SR, the photon sent by A’ does not pass through the centre of P, and does not "hit" C in the point B*. The reason for that is easily explained by means of relativistic light’s aberration.

The starting point for understanding this phenomenon is to carefully distinguish between speed (scalar velocity) and velocity (vectorial velocity), which in some language is not possible²⁵. With this specification, SR’s II Postulate prescribes just that the light’s speed is independent on the source’s motion (in any inertial frame), but not the light’s velocity, which in fact can depend on the source’s velocity.

As a simple example, in a very common set-up, let us take a photon travelling backward along the y-axis (the photon is supposed to start at the time t = 0 from some indefinite distance L > 0):

f : x = 0, y = L-t, z = 0 (velocity (0,-1,0), speed 1).

If you imagine the "usual" observer travelling along x-axis with some uniform velocity (v,0,0), endowed with a Lorentz coordinate system (x’,y’,z’), then you can use a Lorentz transformation in order to connect coordinates (x,y,z) and coordinates (x’,y’,z’):

x = (x’+vt’)/Ö (1-v²), y = y’, z = z’, t = (t’+vx’)/Ö (1-v²),

and then the motion of f becomes, in these new coordinates:

x = 0 ® (x’+vt’) = 0 ® x’ = -vt’,

y = L-t ® y’ = L-(t’+vx’)/Ö (1-v²) ® y’ = L-t’Ö (1-v²),

z = 0 ® z’ = 0.

These equations show that, in the new coordinates, the photon’s velocity is (-v,-Ö (1-v²),0) (of course, the photon’s speed is always 1, since: v²+(1-v²) = 1!), which clearly does depend on the velocity of the source (in the system (x’,y’,z’), this velocity is equal to (-v,0,0)).

This is the reason for relativistic aberration, since the light coming from the source will be received by the "moving observer" shifted under an angle q such that tg(q ) = v/Ö (1-v²) » v (up to second order terms in v)²⁶, and that is all.

Coming back to our case - and we repeat it, according to SR - the photon sent by A’ will go into a straight line only with respect to the virtual observer which would go on, in a state of uniform motion, with the same velocity of A’ in the very moment of the photon’s emission. Thus, the photon will be "aberrated" in W , wrt the geometrical diameter d of C starting from the point B, with an angle q such that, as a simple geometrical argument shows (see figure 1):

cos(q ) = Ö (1-v²) (14) .

As a matter of fact, if we call D t the W -time employed by the photon in the part of its travel until it "cuts" the diameter orthogonal to d , in the point A°, we obviously have (D t)² = (vD t)² + R², and then cos(q ) = R/D t = Ö (1-v²), as asserted.

It is very easy to compute now which is the point B° on C that our photon would hit. Since the triangle BB°B* is a right triangle in B°, we will get at last:

D’ = d(B,B°) = apparent diameter of C =

= 2Rcos(q ) = 2RÖ (1-v²) (15) .

From this identity, by means of (7), one would get, as announced:

L’/D’ = p ’ = L/2R = p , which could be interpreted as another possible "solution" of Ehrenfest paradox!

4 - A possible experimental falsification of Special Relativity

The above discussed relativistic difference between velocity and speed of light, and the corresponding correct understanding of the II SR Postulate, could perhaps be the conceptual ground for some attempt to compare SR predictions with analogous aether-theoretic expectations²⁷. As a matter of fact, one could suppose that it would be natural, in an aether-frame, to have total independence, namely vectorial independence, of light’s velocity on the velocity of the source (and not only independence of the scalar values).

One could think for instance to use the same circular platform P’ of the previous discussion. First suppose P’ at rest, and place a mono-directional (the most possible point-like!) photon source S in the rim C of the platform, near to a "fixed" point B, directed towards the centre A of P’. A "fixed" light’s detector S* is placed near the antipodal point B* of B, and can detect the arrival of the photons emitted by S. Then, we can make the platform rotate, and arrange things in such a way that, in "stationary" conditions, S does emit a photon only when it is in front of B. At last, one could check whether S* will continue to detect these photons, as an aether theory would foresee, or not. That is to say, whether light is really "dragged" by the velocity of the source, as SR would predict, or not.

One could even think to take both detector S* and source S fixed in the laboratory, and to use instead as moving source an (almost possible point-like, as before!) mirror M placed in the rim C’ of P’. S emits photons towards a point A’, such that the line SA’ contains the centre A of P’, and these photons are reflected only when M passes through this line (that is to say, with a W -period equal to 2p R/v = 2p /w ). The backward photons can be detected by a fixed screen-detector S*. In order to avoid S and S* to be too much close one to the other, one could place a semi-transparent mirror M’ at some distance L₁ from A (between S and A, sees figure 2), orthogonal to the photon beam, in such a way that the emitted photons pass through M’, and could then place S* at some distance L₂ from A, in order to detect the backward photons reflected by the "other" face of M’. One can even think to increase the researched effect, by repeated reflections, making use as detector S* of another "mirror" (in figure 2, A’ is the point in which the light’s beam hits S* when there is not the interference of the mirror M, A’’ is the point in which the light’s beam hits S* after one reflection by M and M’, and so on).

It is very easy to give quantitative evaluations for this Gedanken-Experiment. For a given radius R, a given angular speed w , the distance d x between the impact point A’ of the forward non-aberrated photons on S*, and the impact point A’’ of possibly aberrated photons, after only one reflection, would be equal, in force of (14), to:

(16) d x = (L₁+R)tg(q ) + (L₁+L₂)tg(q ) = (2L₁+L₂+R)tg(q ) =

= [(2L₁+L₂+R)Ö (1-cos²(q ))]/cos(q ) = (2L₁+L₂+R)v/Ö (1-v²) »

» (2L₁+L₂+R)v .

This identity shows that the described test would be called to measure at least a first order effect. For 1 metre radius, a distance L₁ = L₂ equal to 10 metres, an angular speed w corresponding to 100 Herz, according to SR one should have a "shifting effect", after just one reflection, of about 63 microns, which is perhaps a shifting not so small to start with, and then to be finally detected after many reflections.

It would be perhaps even better, from practical purposes, to replace the rotating platform with a "very sly" vertical spinning rotor - see figure 3 - in order to get always an angle of 90° between the forward beam, and the reflecting mirror M. In such a way, the proposed "experiment" would rather be similar to the famous Fizeau cog-wheel experiment.

Summing up, the light emitted by S is periodically reflected by M. Then, after a new reflection by M’, the beam hits the mirror-detector S*, from which it is once again reflected to M’, and so on. If relativity is correct, one should be able to appreciate an increasing displacement of the trace of the reflected photons, compared with the original trace, which corresponds to photons which have not been reflected. If an aether theory is correct, one should not observe any displacement. The qualitative side of the proposed experiment could perhaps be one of its most attractive features²⁸.

Endnotes

^{1 And more than that, in strictly special-relativistic terms, since gravitation has obviously nothing to do with this question. For a discussion of the impossibility of a purely kinematical solution see for instance G. Cavalleri⁽⁶⁾.
^{2 We shall make use of geometrical unities, namely, we shall assume that the fundamental relativistic constant c (light’s speed in vacuo) is equal to 1.
^{3 Sometimes, one finds the paradox expressed in the form L’ = 2p
R, and then the "absurd" consequence would be now that a non-euclidean geometry holds in W
!
^{4 But one could say that there exists an external synchronization in P, induced by the privileged frame W
(see also later on).
^{5 For a quite rigorous introduction to this whole argument, see for instance B. O’Neill⁽⁹⁾, pp. 358-360.
^{6 g = g₃₃ , in standard notations, is a priori any (negative) function of the coordinates (X,Y,T), and since ds² = 0 defines a photon’s path, then Ö
(-g) is the light’s speed in the given coordinate system (as a consequence, this speed can assume any value, even bigger than c = 1; see B. O’Neill⁽⁹⁾, pp. 181-183). If the coordinate system is not orthogonal, then space and time are not separated enough, even from a relative point of view, in order to introduce the fundamental concepts of Physics.
^{7 This is of course true under the relativistic assumption that only proper time is the time measured by clocks.
^{8 Even just a "local" one, namely defined only on an open subset of M.
^{9 In order to have smooth functions, one has to exclude in any case the value r
= 0.
^{10 From the point of view of SR, the "set of curves" (4) is the rotating platform, and this interpretation implies that all "physical" considerations about "materials" could be considered off the point.
^{11 This cylinder is the rotating circumference in the Minkowski space-time M, in the same sense of endnote 10, since all curves (4), for r
= R, "generate" this surface. As a consequence, one could say that the "rotation" does not change the circumference (or the platform) as absolute objects in space-time, the only difference is in the set of worldlines (observers) that one can introduce in order to generate the same surface.
^{12 B. O’Neill⁽⁹⁾, Prop. 30, p. 358. We remember that, if xⁱ are the Lorentz coordinates we are dealing with, and U_i the covariant components of the field U (with the usual symbolism), then U is said to be irrotational if the 2-form curl(U) = d(U_idxⁱ) = dU_iÙdxⁱ vanishes on each pair of vector fields V, W orthogonal to U: curl(U)(V,W) = 0. Vector fields such that curl(U) globally vanishes, are just the geodesic and irrotational ones, and this vanishing is obviously impossible in our case (with the further consequence that one could never introduce for our purposes, as we have already said, a proper time synchronizable coordinate system - or even a simple synchronizable one, whose associated observer field then should be at least irrotational).
^{13 If one tries to make other circumferences of radius r
rotate with the same speed v as the circumference of radius R, then he would not get an irrotational field. If one extends the field on C’ by simple "parallel translation" in M, then he would get at last an irrotational field, but this cannot obviously include the "whole" of C’.
^{14 As a matter of fact, one could notice that the "geometrical distance" between the event (Rcos(w
t*),Rsin(w
t*),t*) and the intersection of the rest-space (5) with the world-line (0,0,t) (this event is indeed (0,0,t*)), is exactly equal to R, for each instant t*.
^{15 This is just a "classical concept", and the root of all relativistic paradoxes is indeed the attempt to force such concepts into relativistic schemes!
^{16 Namely, wrt to just one coordinate system.
^{17 Which is defined by means of: dt
= Ö
(-ds²) = (Ö
(1-v²))dt .
^{18 Even if this should be obvious, let us remark that in SR only inertial (geodesic) observers cannot detect any possible "absolute motion", and that in our case j
is indeed non-inertial.
^{19 Of course, we do not assert that this possible definition overcomes the previous objections: indeed, it has not been given by means of a coordinate system associated to C’, or even simply to A’.
^{20 In this case of the rotating circular platform, there is truly a great difference with the well-known relativistic uniform motion case (which one can easily study by means of Lorentz coordinates and transformations): the relativistic average light’s velocities forward and backward could be defined in such a way to coincide with the "classically expected" values. A result which could appear "strange", and even more strange if one points out that the instantaneous light’s speeds, as locally measured by any observer on C (by any coordinate system which can be "associated" to the given observer), by means of its proper time, always must give the same result, namely the constant c = 1. As a matter of fact, one should not forget indeed that the only one invariant in SR, namely in Minkowski space-time, is the ds², which allows for instance to simply define (measure) proper times. The transfer from proper times to speeds, understood as the ratio: space/time, requires the definition of lengths, and for that one needs, as we have already said, the introduction of particular orthogonal coordinate systems. In the case of a non-inertial coordinate system, the coordinate time T will not coincide in general with the proper time of the observers associated to the coordinate system. This implies that average measures of the light’s speed in the given coordinate system, made wrt a single clock, could possibly not coincide with c (see also endnote 6). Moreover, let us remark again that in our case we have not introduced at all any coordinate system associated either to P’ or to C’, or to the observer A’!
^{21 This could be considered a particular case of the famous "twin paradox", but one should observe indeed that only the non-inertial observer A’ undergoes time dilation, wrt B, and not vice versa!
^{22 But along this line of thought, one should be careful now in defining the radius of C’, since the centre of P’ would not be at rest for instance in W
₀(t*).
^{23 With the present use of the terms "apparent" and "real", we would wish to remark that all "proper quantities" in our context should be, contrarily to the common custom, the ones measured by the only one privileged frame which appears in this discussion, namely W
, and not by some rotating observer on P’!
^{24 Here we mean of course in the 2-dimensional geometrical sense. As a matter of fact, if we introduce the "tangent" IFR that we have called W
₀(T), this direction coincides with the direction of the y⁰-axis in W
₀(T).
^{25 For instance, in Italian we have just one word: "velocità", like in German one has only: "geschwindigkeit".
^{26 And when v is "very small", one can directly approximate q
»
v, as usual.
^{27 In: U. Bartocci, M. Mamone Capria⁽²⁷⁾ one suggests instead an experiment aimed to test the validity of the I SR Postulate (namely, of the Principle of Relativity) for electromagnetic interactions.
^{28 Needless to say, one should discuss many more practical details of this "experiment", such as the necessity to provide for thermic stability of the apparatus, and so on. From a theoretical point of view, instead, one should make some hypothesis about the possible "state" of a terrestrial laboratory wrt the "aether". The author believes that such a laboratory is very likely "in absolute rest" (Stokes theory), and so that it could be indeed a "privileged" (local) reference frame, but this discussion would obviously go quite far the limited present purpose of this paper...

Bibliographical references

(1) Phys. Z., 10, 918 (1909).
(2) A. Tartaglia, Foundations of Physics Letters, 12, 17 (1999).
(3) F. Selleri, "On a Physical and Mathematical Discontinuity in Relativity Theory", Open Questions in Relativistic Physics, F. Selleri Ed., Apeiron, Montreal (1999).
(4) W. Pauli, "Relativitätstheorie", Encyklopädie der mathematischen Wissenschaften, 5, 19 Leipzig (1921); Theory of Relativity, Dover, New York (1958), Ch. 5, Sec. 45.
(5) R. Sachs, W. Hu, General Relativity for Mathematicians, Springer-Verlag, New York & Berlin (1977).
(6) G. Cavalleri, Il Nuovo Cimento, X, 53B (1968).
(7) A. Einstein and L. Infeld, The Evolution of Physics, New York (1938).
(8) A. Einstein, The Meaning of Relativity, Princeton University Press (1950).
(9) B. O’Neill, Semi-riemannian geometry - With Applications to Relativity, Academic Press, New York-London (1983).
(10) M.G. Sagnac, Compt. Rend., 157, 708 (1913).
(11) R. Monti, Phys. Ess., 9, 2 (1996).
(12) U. Bartocci, M. Mamone Capria, Found. of Phys., 21, 7 (1991).

Figures Captions

Figure 1: light’s aberration from the spinning source A’, as seen by the fixed circumference C. q
is the angle between the two half-lines BB* and BB°.

Figure 2: the proposed experiment, using the rotating circular platform of centre A, seen from the top.

S = photon source
S* = mirror-screen detector
M’ = semi-transparent mirror
P’ = rotating platform
M = reflecting mirror
L₁ = distance between M’ and A
L₂ = distance between A and S*

Figure 3: the proposed experiment, using the spinning rotor, seen from a side.

S and S* as above
M and M’ as above
W = rotating wheel}}}}}}}}}}}}}}}}}}}}}}}}}}}}