In this post we look at the following problem:
Given a point P in a given angle of size formed by two lines, what is the shortest length d of a segment MN passing through P with M and N on each line?
I found out recently that such a line is called Philo’s line and the problem of finding the shortest segment is more tricky than one first suspects. It is named after Philo of Byzantium who lived in the 3rd century BC. Had I known it was as tricky to begin with I may not have spent as long investigating it! It was certainly worthwhile though and I encountered a fun mix of calculus, geometry and algebra (polynomials) on the way which I show here.
We can specify the point P in more that one way – for example by its distances to the two sides (a,b), or by the lengths OE, OF (e,f) as shown in the following figure.
The two pairs are related by the equations
We shall work with a and b with the knowledge that if we need to use e and f, they are simply related to a and b by linear transformations. Note that it is possible for e or f to be negative if .
In the symmetric case of , we can say that for any angle , P will be on the angle bisector of the angle and Philo’s line will be perpendicular to OP by symmetry. In this case, we find by simple trigonometry and so by the cosine rule,
(We can also write the simpler expression but above form is handier for future reference!)
In the case when and , one may think that Philo’s line is that which makes MO = ON, but this is not the case. Take the example of a = 8, b = 1:
The equivalent formulation is this: what is the shortest line segment through the point (8,1) lying in the first quadrant? This is equivalent to the “ladder around a corner” problem that I discussed in an earlier post. The shortest line segment through (8,1) is also the longest ladder that can fit in a corridor with perpendicular corner bound by the axes and with inside corner at (8,1).
As found in that post, the optimal line has intercepts and The resulting squared distance is
Shown here is this line in the case (a,b) = (8,1) and how it differs from the symmetric line keeping OM = MN. We see that Philo’s line (in red) has length while the symmetric line has the much greater length .
This is an instance where our intuition may defy us. In general it is very difficult to identify Philo’s line unless we have an alternative way of characterising it. That is, we need a way of constructing it, or to find other properties that the line satisfies. The above expressions for the intercepts cannot be constructed with compass and straight edge for general a and b since they involve cube roots. In fact Philo was motivated by this problem while working on the classical problem of doubling the cube (equivalently, constructing the cube root of 2) [1, 2]. It was only shown in the 19th century that such a construction with compass and straight edge is impossible so the problem eluded mathematicians for more than two millenia! Even Newton worked on the problem of constructing Philo’s line by compass and straight edge .
We will now turn to the general case of an arbitrary angle and use calculus to find the minimum distance in terms of the roots of a cubic polynomial. On the way we will see a few interesting equivalent characterisations of the line. The special case of was solved in an earlier post using Hölder’s inequality without the need for calculus.
This post will prove the following.
Given two lines intersecting in an angle and a point P with distances a and b to the two lines, the squared length of Philo’s line, the shortest line segment through P and joining the two lines, is given by
where is the positive root satisfying the cubic equation
Before proving this, let us check to see what happens in the special cases we saw earlier. In the case a=b, the cubic (4) becomes
The quadratic factor has only complex roots, and so we have the unique real solution . This leads to
matching with (1).
In the other special case , the cubic becomes
giving us as the unique real solution. Substituting this into (3) we obtain (2):
Hence both earlier results are verified. Isn’t it interesting how the cubic yields such different looking solutions? 🙂
Finding equivalent characterisations
I will firstly show my initial approach leading to a couple of known characterisations of the problem. Then I will present a second approach leading to the above result.
Let be the angle between the shortest line and the line ON (). Let be the angle between the shortest line and the line OM. Observe that .
The distance to be minimised is where .
Setting to 0 gives
where . Neither of the boundary conditions or will lead to the solution (for a, b > 0), so the minimum d will occur when (5) is satisfied.
Let D be the foot of the altitude from O to MN (refer to next figure below). Then we also have
Combining (5), (6) and (7) it follows that P must satisfy ND/MD = MP/NP, from which MD = NP. That is, Philo’s line is chosen so that P is the isotomic conjugate of the foot of the altitude from O to MN. This is possibly the most commonly known characterisation of Philo’s line as presented in .
A second characterisation is that the intercepts M and N of Philo’s line with the two lines of the given angle are equidistant to the point Q which is the midpoint of OP. Equivalently, Q is on the perpendicular bisector of Philo’s line.
The following figure shows these two characterisations.
I also played with other equivalent formulations of the condition (5) and came up with the following figures in an attempt to find and given and .
1) A maps to B and B maps to A under inversion in each circle illustrated below, where is the angle between the tangents shown.
2) The following dual figure can be constructed:
3) In this construction, unlike the previous two, one can at least draw part of the figure with the known information. Start with a triangle OAB with lengths a, b and included angle and construct rays from O perpendicular to OA and OB. Then we wish to construct points C and D on these rays so that the red lines shown form 90-degree angles.
Proof of result using Lagrange multipliers
The approach that leads one to the cubic (4) is by solving the following optimisation problem through Lagrange multipliers. Let OM = m, ON = n. Then by the cosine rule, . The condition that P is on MN is equivalent to the condition that the areas of triangles OMP and ONP add to OMN, or (equivalently, ). Hence we state our optimisation problem as
To solve this, we form the Lagrangian and set its partial derivatives and to 0:
This gives us
At this point we make the substitution . This is motivated by the simpler cases considered earlier: geometrically x and y are as shown below and they have more manageable forms in those simpler cases.
We also find that the constraint or becomes
Ordinarily one might be satisfied with such a condition, but further simplification is possible!
Cool! I never would have thought that is equivalent to . 🙂
The equations give and so (8) under this subsitution for m and n becomes
Substituting into this equation gives us , which is equivalent to the following quartic equation (for ).
This quartic has a factor (found with help from Wolfram|Alpha) and cancelling this factor from both sides gives us our cubic (4):
In terms of x, the length of Philo’s line is given by
Some simplification here is possible by using the quartic form (9), which gives us . Subsituting this into (10) gives our desired form (3):
Finally we show why there is only one positive root of the cubic (4). There is an easier approach than evaluating the cubic’s discriminant. Let . Since , there is either one positive root or three (Imagine a graph of the positive cubic – by the intermediate value theorem it crosses the x axis at some positive x value. Here we are counting repeated roots more than once). Suppose there are three positive roots . We now seek a contradiction.
The relationship leads to and . Multiplying these two equations together gives
But this contradicts the left side being at least 3 (in fact it is at least 9). We conclude that the three roots cannot all be positive (even including multiplicity), and so there is only one positive root.
Finally here are a few more cases where a nice solution is found from specific values of and :
- leads to (drawing the figure gives a 13-14-15 triangle)
- leads to
- leads to (drawing the figure gives a 10-17-21 triangle)
- leads to
For more about Philo’s line see the references below. In particular, a Euclidean (non-calculus) proof of why Philo’s line is characterised by P being the isotomic conjugate of the foot of the altitude from O is given in p198 of . According to one of the links from , Newton found a characterisation in the more general case when the lines OM and ON are curves and the shortest line segment is required to be tangent to a given curve – it involves the concurrency of three normals to the curves.
 Project Gutenberg’s First Six Books of the Elements of Euclid, by John Casey. Available at ftp://ftp.pg.psnc.pl/pub/2/1/0/7/21076/21076-pdf.pdf