Skip to main content

an analysis of Mosers worm

·11229 words·53 mins
Jules Johnson
Author
Jules Johnson
This article is based off a previous writeup on the subject. To read that article, click here.

Introduction
#

Moser’s Worm is an important unsolved problem in geometry with a very amusing name. The name comes from an odd hypothetical. Suppose you have an inch-long pet worm, who’s a very restless sleeper. When the worm goes to bed, there’s no way of knowing how it might be twisted or curved. Nonetheless, you want to cover your worm with a little blanket. What’s the smallest blanket you could use, regardless of how the worm is positioned?

Although many unsolved problems in mathematics are known for their incomprehensibility, the standard statement of Moser’s worm only requires requires one definition:

A set of points (in euclidean 2D space) AA accommodates a set BB when there is a set BB’ such that BAB’\subseteq A, and BB’ is congruent to BB.

Equivalently, a set of points AA accommodates a set BB when there is a set AA’ such that BAB\subseteq A’, and AA’ is congruent to AA.

For these notes, AA accommodates BB will sometimes be notated with ABA \sqsupseteq B. If BB does not accommodate AA, then we write ABA \sqsupset B

From here on out, I might be a little bit looser. I’ll be certain to eliminate ambiguity whenever necesary, but the point of these notes is mostly to ensure that these concepts are written down somewhere they’re easy to locate later on, without necessarily being up to the hyper-rigorous standards of mathematical publication. This is my website, after all. Without further ado, here’s the statement of the problem:

Question 1
What is the smallest convex shape that can accommodate any curve of length 1?

It’s a shockingly simple question for something that has remained unsolved for so long! Although my background is in mathematics, and I studied differential geometry in my undergrad, this question in particular is one I don’t know a lot about. It’s a bit refreshing to work on something unrelated to what you normally do, so I’m looking forward to it! I should mention that I’m deliberately not looking into the already existing body of work on this problem. This is partly for my own satisfaction of discovering things for myself, and partly to avoid bias. Although I don’t suppose that I could fully crack an infamous problem outside my own specialty, it can still be very invigorating for a field to get a fresh pair of eyes and a fresh perspective.

There’s also the second benefit of writing an article on a subject from an outsider’s perspective, which is that it should be very approachable for the reader. Because I don’t know any of the jargon, techniques, or theorems in this field, I can’t possibly confuse a reader by using them. That said, I will use some basic calculus at a couple points.

Type 1 and Type 2 Curves
#

My goal here is to see what curves we don’t need to worry about. There’s two prongs to this:

  1. Show that curve AA is always accommodated when curve BB is accommodated.
  2. Show that when curves AA and BB are accommodated, curve CC is also accommodated.

To that end, we give these definitions:

Given a shape aa, the convex hull of aa will be noted [a][a].

The length of a curve XX will be noted X\ell X. The section of an open curve XX between two points aa and bb will be noted with XabX_a^b, so that the length of an open curve between between two points aa and bb will be noted with Xab\ell X_a^b.

For any set of points p1,p2p_1,p_2\dots, the polygonal chain that connects each of them in order will be denoted p1,p2\langle p_1, p_2\dots\rangle.

This entire problem exists in euclidean space. If dd is the euclidean distance metric, and aa and bb are points, then a,b\ell\langle a,b\rangle will be used as a synonym for d(a,b)d(a,b)

A unit-length curve XX is taut if there does not exist another unit-length curve XX’ such that [X][X][X’] \sqsupset [X].

Immediately there are some lemmas we can form regarding taut curves without any further context. For example, this is the lemma that inspired the choice of the word “taut”:

Lemma 1
If a curve experiences any non-zero curvature at a point that is not on the border of its convex hull, it is not taut.

proof:
Let XX be a unit-length curve, and let xx be a contiguous section of XX that does not touch the border of [X][X], except at two endpoints, aa and bb. Then let XX’ be identical to XX, except that xx is replaced with a straight line from aa to bb. The length of XX’ is necessarily less than 1, although XX’ has the same convex hull. Finally, let X’’X’’ be the unit-length curve that includes all of XX’, but with an added length at the end. X’’X’’ must have a convex hull that includes everything in [X][X] and more.

\square

Hopefully, this drawing explains why I chose the word "taut" to describe this property. The curve on the left is "taut", in the sense that it could be stretched to coincide with the border of its convex hull, and that extra length could be used to create a larger hull. The light grey area is the same between the two shapes, but the dark grey area is added by using length more efficiently

Here’s another brief and straightforward observation about taut curves.

No self intersecting curves are taut.

Let’s start our effort to characterize taut curves with yet another definition

A Type 1 curve is a curve for which all points on the curve lie on the boundary of its convex hull.

A Type 2 curve is a curve for which there is one single segment that crosses the inside of the convex hull.

In general, a Type n curve is one with nn distinct, discontinuous segments along the boundary of its convex hull, and n1n-1 distinct segments passing through the interior of the convex hull

An example of a Type 1 and Type 2 curve.

Type 1 and Type 2 curves can also be thought of in terms of path direction. A path along a Type 1 curve moves entirely either clockwise or counterclockwise across the edge of its convex hull. A path along a Type 2 curve starts moving either clockwise or counterclockwise along the border, but then moves in the opposite direction for a length.

Here’s a brief lemma about Type nn curves that will be useful in several results coming up

Lemma 2
Let XX be a taut curve, and let aa and bb be points on XX such that XabX_a^b consists only of internal points of [X][X]. XabX_a^b has zero curvature.

proof:
Suppose, for the sake of contradiction, that XabX_a^b does not have zero curvature. Then Xab>a,b\ell X_a^b>\ell\langle a,b\rangle. Let XX’ be formed by replacing XabX_a^b in XX with a,b\langle a,b\rangle. It is clear that X<X\ell X’ < X, and so XX is accommodated by XX’, and is not taut.

\square

I’ve only talked about Type 1 and Type 2 curves so far. Why hasn’t any special attention been given to curves of other types? As it turns out, these curves are never taut.

Theorem 1
There are no taut Type nn curves, for n3n \geq 3.

proof:
To force a contradiction, let XX be such a curve. Let aa, bb, cc, dd, ee, ff, and gg be points on XX, appearing in that order, with the added requirements that:

  1. aa, bb, cc, dd, ee, ff, and gg are all on the boundary of [X][X].
  2. aa and gg are the endpoints of XX.
  3. dd is the point furthest from a,g\langle a,g \rangle, on the opposite side from bb and ff
  4. b,c\langle b,c \rangle is internal to the convex hull of XX.
  5. e,f\langle e,f \rangle is internal to the convex hull of XX.

Here’s a drawing to explain.

For ease of upcoming calculations, let a,g\langle a,g\rangle lie on the xx axis, with aa existing at 0,00,0.

Let ϕ\phi be the minimal distance from dd to a,g\langle a,g \rangle. It immediately follows that:

Xbf2ϕ \frac{\ell X_b^f}{2}\leq\phi

In this case, define aa’ and gg’ to be these two points:

a=(0,Xbf2) a’=\left( 0, -\frac{\ell X_b^f}{2}\right)

g=(a,g,Xbf2) g’=\left( \ell\langle a,g\rangle, -\frac{\ell X_b^f}{2}\right)

Then we can define XX’ to be the curve

X=a,aXabb,fXfgg,g X’ = \langle a’,a\rangle \cup X_a^b \cup \langle b,f\rangle \cup X_f^g \cup \langle g,g’\rangle

This is given in the drawing below:

Because aa’ and gg’ are both below dd, the entirety of XX must be inside XX’.

\square

Let’s limit a bit further which curves can be Type 1.

Lemma 3
Let XX be a Type 1 taut curve with endpoints aa and bb. Every point on XX lies on a line perpendicular to a,b\langle a, b\rangle.

proof:
Let XX be a curve as described above, and let cc be the point on XX that lies furthest from any line perpendicular to a,b\langle a, b\rangle. without loss of generality, assume cc is closer to aa than it is to bb.

Let α\alpha be the line that includes the line segment a,b\langle a, b\rangle, and let cc’ be the point on α\alpha closest to cc. Let XX’ be a curve identical to XX, except that the section between cc and aa is replaced with a straight line between cc and cc’.

This situation is illustrated below:

It is clear that

c,c<Cca \ell\langle c,c’\rangle < \ell C_c^a

and therefore

X<X \ell X’<\ell X

Therefore, XX’ has a length less than XX, and so is a unit curve. The convex hull of XX’ can accommodate XX, but is larger. By definition, XX is not taut.

\square

Finally, these past few lemmas and a few other ideas are compiled in a theorem that helps to characterize Type 1 curves:

Theorem 2
A curve XX with endpoints aa and bb is not accommodated by a distinct Type 1 curve if and only if:

  1. All points on XX are on its convex hull.
  2. Every point on XX lies on a line perpendicular to a point on a,b\langle a,b \rangle.
  3. There does not exist a set of points {c,d,e,f}\lbrace c, d, e, f\rbrace such that:
    a. d,e\langle d,e\rangle is tangent to XX.
    b. ff is one of the end points of XX.
    c. ee is a point on XX that is not an end point.
    d. c,d\langle c,d\rangle is perpendicular to f,c\langle f,c\rangle and d,e\langle d,e\rangle.
    e. a,b+c,f+e,dXfe\ell \langle a,b\rangle + \ell \langle c,f\rangle + \ell\langle e,d\rangle \leq \ell X_f^e.

proof:
The fist of these requirements is implied by the definition of a Type 1 curve, and the second is implied by Lemma 3. Therefore, this proof focuses on the third requirement

First, suppose that a set of points as described exists. Without loss of generality, let f=af=a. As before, I’ll include a diagram to clarify what the situation described in the lemma statement is.

Let X=c,a,bXbee,dX’ = \langle c,a,b\rangle \cup X_b^e \cup \langle e,d\rangle. As evident from the diagram, the convex hull of XX’ accommodates the convex hull of XX. Next, consider the inequality assumed earlier:

a,b+c,a+e,dXaec,a,b+e,dXaec,a,b+Xbe+e,dXae+XbeXX\begin{align*} \ell \langle a,b\rangle + \ell \langle c,a\rangle + \ell\langle e,d\rangle &\leq \ell X_a^e \\ \ell \langle c,a,b\rangle + \ell\langle e,d\rangle &\leq \ell X_a^e \\ \ell \langle c,a,b\rangle + \ell X_b^e + \ell\langle e,d\rangle &\leq \ell X_a^e + \ell X_b^e \\ \ell X’ &\leq \ell X \end{align*}

Therefore, [X][X’] both accommodates [X][X] and includes other area as well, but XX’ is shorter than XX. By definition, XX is not taught.

An “if and only if” proof requires that the statement is proven in both directions, so Theorem 2 will now be proved in the opposite direction. Suppose that such a set of points as c,d,e,f{c, d, e, f} does not exist, but that X1X_1 is a Type 1 curve such that [X1]X[X_1] \sqsupset X. Let cc and dd be the endpoints of X1X_1. An example of what this might look like is shown below:

Next, let α\alpha be the line passing through cc and dd. let X2X_2 be the shortest curve with endpoints on α\alpha that accommodates XX. X2X_2 must consist of two lines parallel to α\alpha, as well as a section of the boundary of [X][X], which may or may not include a,b\langle a,b\rangle. Let cc’ and dd’ be the endpoints of X2X_2, and let ee and ff be the points where X2X_2 meets the convex hull of XX, as shown below:

Finally, the length of X2X_2 can be reduced even further. Let d’’d’’ and c’’c’’ be points on d,e\langle d’,e\rangle and c,e\langle c’,e \rangle that are both a distance ϕ\phi away from dd’ and cc’, with ϕ\phi having the largest possible value without d’’,c’’\langle d’’,c’’\rangle intersecting the interior of [X][X].

Let X3X_3 be X2X_2, without the lines from dd’ to d’’d’’ and from cc’ to c’’c’’. If ff and ee are both endpoints of XX, then X3X_3 must contain the entirety of XX. This is impossible, as X=1\ell X = 1, so either ee or ff is not an endpoint of XX.

Instead, assume (for the sake of contradiction), that neither ff nor ee are endpoints of XX. This would require that at least one of the two points is not on a line parallel to a,b\langle a,b \rangle.

Therefore, {c’’,f,e,d’’}\lbrace c’’, f, e, d’’\rbrace satisfies the requirements of the lemma.

\square

The reason I set out this lemma is because I want to demonstrate that there is a limit on the “height” of Type 1 curves. If a,b\langle a,b \rangle is very short, and XX has a point very distant from a,b\langle a,b \rangle, it would be easy to find a way to maximize Xae\ell X_a^e and minimize a,c\ell \langle a,c \rangle

This theorem serves to characterize type 1 curves fairly precisely. One of the broader goal of this project is to characterize Type 2 curves equally as well.

Lemma 4
Let XX be a taught Type 2 curve, and let aa and dd be the end points of XX. Let bb and cc be the endpoints of the section of XX which are internal to the convex hull of XX. It it required that:

b,ca,c \ell \langle b,c\rangle \leq \ell\langle a,c\rangle b,cb,d \ell \langle b,c\rangle \leq \ell\langle b,d\rangle

proof:
The locations of aa though dd can be clarified by this diagram:

Suppose for the sake of contradiction that a.c>b,c\ell\langle a.c\rangle > \ell\langle b,c\rangle. Define XX’ such that

X=Xbaa,cXcd X’ = X_b^a \cup \langle a,c\rangle \cup X_c^d

Because a,c>b,c\ell\langle a,c\rangle > \ell\langle b,c\rangle, X<X=1\ell X’ < \ell X = 1. Then let X’’X’’ be identical to XX’, except that an extra bit is added to increase the area of the convex hull while keeping X’’1\ell X’’\leq 1. By definition, XX is not taut.

By symmetry, XX is also not taut if b.d>b,c\ell \langle b.d\rangle > \ell \langle b,c\rangle.

\square

The non-internal sections of a Type 2 curve behave something like Type 1 curves themselves. For example, consider the following Lemma:

Lemma 5
Let XX be a taut Type 2 curve with endpoints aa and dd, and let bb and cc be the endpoints of the section of XX that is internal to [X][X]. Let L1L_1 be the line passing through aa and cc, and let L2L_2 be the line perpendicular to b,d\langle b,d\rangle that passes through dd. Then any point pp on XcdX_c^d must lie in the triangle bound by L1L_1, L2L_2, and c,d\langle c,d\rangle.

proof:
First, for clarity, a diagram of these lines is shown:

First, if pp is a point on XcdX_c^d that is on the “wrong” side of L1L_1, then cc is an internal point of [X][X]. This is not true by assumption.

The point pp must be on the exterior of [X][X], by assumption, and so cannot be on the wrong side of c,d\langle c,d\rangle.

Finally, suppose for the sake of contradiction that pp lies on the wrong side of L2L_2. Without loss of generality, let pp specifically be the point furthest from L2L_2. Then let pp’ be the point on the line containing bb and dd that is closest to pp. If X=Xapp,pX’ = X_a^p \bigcup \langle p, p’\rangle, then XX’ is shorter than XX. Because p,p\langle p,p’\rangle is perpendicular to p,d\langle p’,d\rangle, pp’ must also be further from L2L_2 than any other point in XX. Therefore, XX’ accommodates XX, giving a contradiction.

\square

This lemma comes with a few charming corollaries:

Let XX be a Type 2 curve with endpoints aa and dd, and let bb and cc be the endpoints of the section of XX that is internal to [X][X]. Let L2L_2 be the line perpendicular to b,d\langle b,d\rangle that passes through dd. If pp and qq are points on XcdX_c^d such that Xqd<Xpd\ell X_q^d < \ell X_p^d, then qq is not further from L2L_2 than pp is.
Let XX be a Type 2 curve with endpoints aa and dd, and let bb and cc be the endpoints of the section of XX that is internal to [X][X]. The angles formed by c,b,d\langle c,b,d\rangle and d,a,c\langle d,a,c\rangle are acute.

Finally, we can make a little theorem effectively characterizing taut Type 2 curves.

Theorem 3
Let XX be a Type 2 curve, and let aa and dd be the end points of XX. Let bb and cc be the endpoints of the section of XX which are internal to the convex hull of XX. Let aa’ be the point that is co-linear to aa and cc, such that a,a,b\langle a, a’, b \rangle is a right triangle. Similarly, let dd’ be the point that is co-linear to bb and dd, such that c,d,d\langle c, d’, d \rangle is a right triangle. This is shown in the illustration below.

If XX is taught, then:

  1. b,ca,c\ell \langle b,c\rangle \leq \ell\langle a,c\rangle
  2. b,cb,d\ell \langle b,c\rangle \leq \ell\langle b,d\rangle
  3. Every point on XX between cc and dd lies in the triangle bounded by L1L_1, L2L_2, and c,d\langle c,d\rangle.
  4. Every point on XX between aa and bb lies in the triangle bounded by L3L_3, L4L_4, and a,b\langle a,b\rangle.
  5. If pp and qq are points on XX between cc and dd, with qq closer to dd, then qq is closer to L2L_2.
  6. If pp and qq are points on XX between bb and aa, with qq closer to aa, then qq is closer to L4L_4.

Defining Several Important Shapes
#

Now that the framework for which curves are worth considering has been well established, let’s build up a vocabulary of specific curves to consider. First, what exactly do I mean by “curve”? In a context where curves can be translated and rotated, it makes sense to define a curve more broadly than usual.

When I refer to a curve, I actually mean the congruence class to a specific curve. The term “curve” refers only to a particular shape, and not a position.

When I wish to refer to a curve in a particular position, I’ll mention an instance of a curve. If an instance of a curve is translated, rotated, or flipped, it becomes a different instance of that curve.

As a useful convention, curves will be named in capital letters, while their instances are named in lowercase letters.

Which curves are we going to give names to? In some sense, the simplest unit-length curve is the single line segment. I don’t want to use the letter L or I for this purpose, so I’ll call this GG, in reference to the French spelling of the word line, or ligne.

The curve GG will refer to the straight line segment of length 1. Equivalently, GG is congruent to (0,0),(1,0)\left\langle\left(0,0\right),\left(1,0\right)\right\rangle

There’s a bit of an opposite to GG. While GG has a maximum distance from end to end (or diameter), it has no width whatsoever. A correlating question would be to search for the maximum radius of a circle accomodated by a single curve’s convex hull.

The maximum radius of a circle accomodated by a single cure’s convex hull is 12+π\frac{1}{2+\pi}.
proof:
Consider the circle cc of radius rr defined with (yr)2+x2=(r)2(y-r)^2+x^2=(r)^2. Suppose mm is a curve with endpoints aa and bb that accomodates cc with minimal length. In order to minimize the length of mm, a,b\langle a,b \rangle must be tangent to cc. Without loss of generality, let aa and bb lie on the xx axis. The locations of aa and bb that minimize the length of mm are directly beneath the ends of cc, at (±r,0)(\pm r,0)

To minimize the size of mm, assume that mm is “taut” against the circle cc, as shown in the drawing below:

The total length of mm is therefore 2r+πr2r+\pi r. If mm is a unit length curve, then 1=2r+πr=r(2+π)12+π=r\begin{align*} 1 =2r+\pi r &=r(2+\pi)\\ \frac{1}{2+\pi} &= r \end{align*}

\square

Now that that’s done, we can use this as the basis of the definition of our second useful curve!

The curve MM will refer to the curve that accomodates the circle with radius 12+π\frac{1}{2+\pi}. This can be defined as the curve congruent to the set:

x=±12+πfor y{,0}x2+y2=1(2+π)2for y{0,r}}\begin{rcases} x=\pm\frac{1}{2+\pi} &\text{for } y\in\lbrace -\infty,0 \rbrace \\ x^2+y^2=\frac{1}{(2+\pi)^2} &\text{for } y\in\lbrace 0,r\rbrace \end{rcases}

That’s the largest possible circle that can be accomodated with a unit length curve, but what about the largest possible square? We might as well define it as simply as possible:

A curve SS will refer to the unit length curve with three orthogonal sides of length 13\frac{1}{3}. This is conruent to the curve defined by

(0,0),(13,0),(13,13),(0,13)\left\langle (0,0), \left(\frac{1}{3},0\right), \left(\frac{1}{3},\frac{1}{3}\right), \left(0,\frac{1}{3}\right)\right\rangle

Of course, SS was chosen as a name because that is the first letter of the word “square”. This shape is shown below:

What other simple curves are important enough to be given brief names? There’s a fair argument to be made that the simplest curves would be those of constant curvature, also known as arcs.

For any radius rr, Let ArA_r be the unit length arc of a circle with radius rr. Equivalently, ArA_r is congruent to

y=r2x2rsin(πr12r)y=\sqrt{r^{2}-x^{2}}-r\sin\left(\frac{\pi r-1}{2r}\right)

for y>0y>0

It’s worth showing that the shape given in this definition does in fact have a length of 1. The two places where y=0y=0 are given with:

0=r2x2rsin(πr12r)rsin(πr12r)=r2x2r2sin2(πr12r)=r2x2r2r2sin2(πr12r)=x2r2(1sin2(πr12r))=x2r2cos2(πr12r)=x2±rcos(πr12r)=x\begin{align*} 0 &= \sqrt{r^{2}-x^{2}}-r\sin\left(\frac{\pi r-1}{2r}\right)\\ r\sin\left(\frac{\pi r-1}{2r}\right) &= \sqrt{r^{2}-x^{2}}\\ r^2\sin^2\left(\frac{\pi r-1}{2r}\right) &= r^2-x^2\\ r^2 - r^2\sin^2\left(\frac{\pi r-1}{2r}\right) &= x^2\\ r^2\left(1-\sin^2\left(\frac{\pi r-1}{2r}\right)\right) &= x^2\\ r^2\cos^2\left(\frac{\pi r-1}{2r}\right) &= x^2\\ \pm r\cos\left(\frac{\pi r-1}{2r}\right) &= x \end{align*}

Note also that y=xr2x2y’=\frac{x}{\sqrt{r^{2}-x^{2}}}. Using the standard formula for length of a curve gives:

rcos(πr12r)rcos(πr12r)1+y2dx=rcos(πr12r)rcos(πr12r)1+x2r2x2dx=rcos(πr12r)rcos(πr12r)r2r2x2dx=rcos(πr12r)rcos(πr12r)11x2r2dx=rcos1(xr)rcos(πr12r)rcos(πr12r)=rcos1(rcos(πr12r)r)+rcos1(rcos(πr12r)r)=rcos1(cos(πr12r))+rcos1(cos(πr12r))=r(πr12r)+r(cos1(cos(πr12r))+π)=r(πr12r)+r((πr12r)+π)=πr12πr12+πr=πr+1+πr=1\begin{align*} \int_{-r\cos\left(\frac{\pi r-1}{2r}\right)}^{r\cos\left(\frac{\pi r-1}{2r}\right)}\sqrt{1+y’^2}dx &= \int_{-r\cos\left(\frac{\pi r-1}{2r}\right)}^{r\cos\left(\frac{\pi r-1}{2r}\right)}\sqrt{1+\frac{x^2}{r^2-x^2}}dx\\ &= \int_{-r\cos\left(\frac{\pi r-1}{2r}\right)}^{r\cos\left(\frac{\pi r-1}{2r}\right)}\sqrt{\frac{r^2}{r^2-x^2}}dx\\ &= \int_{-r\cos\left(\frac{\pi r-1}{2r}\right)}^{r\cos\left(\frac{\pi r-1}{2r}\right)}\frac{1}{\sqrt{1-\frac{x^2}{r^2}}}dx\\ &= \left.-r\cos^{-1}\left(\frac{x}{r}\right)\right|_{-r\cos\left(\frac{\pi r-1}{2r}\right)}^{r\cos\left(\frac{\pi r-1}{2r}\right)}\\ &= -r\cos^{-1}\left(\frac{r\cos\left(\frac{\pi r-1}{2r}\right)}{r}\right) + r\cos^{-1}\left(\frac{-r\cos\left(\frac{\pi r-1}{2r}\right)}{r}\right)\\ &= -r\cos^{-1}\left(\cos\left(\frac{\pi r-1}{2r}\right)\right) + r\cos^{-1}\left(-\cos\left(\frac{\pi r-1}{2r}\right)\right)\\ &= -r\left(\frac{\pi r-1}{2r}\right) + r\left(-\cos^{-1}\left(\cos\left(\frac{\pi r-1}{2r}\right)\right)+\pi\right)\\ &= -r\left(\frac{\pi r-1}{2r}\right) + r\left(-\left(\frac{\pi r-1}{2r}\right)+\pi\right)\\ &= -\frac{\pi r-1}{2} -\frac{\pi r-1}{2}+\pi r\\ &= -\pi r+1 +\pi r = 1 \end{align*}

One could also argue that the simplest shape to consider is two line segments meeting at a single vertex, also known as a 2-chain (no relation), with both line segments of equal length. The convex hulls of these shapes are simple triangles. From this perspective, it would only make sense to define

For any angle θ\theta, Let VθV_\theta be the union of two line segments of length 12\frac{1}{2} meeting at an angle of θ\theta. Equivalently, VθV_\theta is congruent to

(12cosθ2,0),(0,12sinθ2),(12cosθ2,0)\left\langle \left(-\frac{1}{2}\cos{\frac{\theta}{2}},0\right), \left(0,\frac{1}{2}\sin{\frac{\theta}{2}}\right), \left(\frac{1}{2}\cos{\frac{\theta}{2}},0\right) \right\rangle

For convenience, define E=Vπ6E=V_{\frac{\pi}{6}}.

As you might have guessed, the name VV was chosen due to its resemblance to an angle. The name EE was chosen because of the similarity to an equilateral triangle. As before, it’s worthwhile to show that the length of this curve is 1. The distance between (0,12sinθ2)\left(0,\frac{1}{2}\sin{\frac{\theta}{2}}\right) and (12cosθ2,0)\left(\frac{1}{2}\cos{\frac{\theta}{2}},0\right) is given by:

(12sinθ2)2+(12cosθ2)2=14sin2θ2+14cos2θ2=12 \sqrt{\left(\frac{1}{2}\sin{\frac{\theta}{2}}\right)^2 + \left(\frac{1}{2}\cos{\frac{\theta}{2}}\right)^2} = \sqrt{\frac{1}{4}\sin^2{\frac{\theta}{2}} + \frac{1}{4}\cos^2{\frac{\theta}{2}}} = \frac{1}{2}

All of these have been type 1 curves! In a sense, the simplest type 2 curve is one that can be formed with just four points.

Let ZZ be 3-chain consisting of three line segments of length 13\frac{1}{3} sitting at alternating right angles to one another. More precicely, ZZ is congruent to

(0,0),(13,0),(13,13),(23,13)\left\langle (0,0), \left(\frac{1}{3},0\right), \left(\frac{1}{3},\frac{1}{3}\right), \left(\frac{2}{3},\frac{1}{3}\right)\right\rangle

Of course, this is also shown below:

A hopeful reader might wonder if this curve might in fact be accomodated by a type 1 curve. ZZ certainly doesn’t look like it could be accomodated by a type 1 curve, by any of the lemmas encountered earlier.

To see if it’s possible to accomodate this with a type 1 curve, let’s start by placing the ZZ curve on it’s side, so that the starting and ending points are on the yy-axis. I will refer to this instance of ZZ as zz, and label its points with:

z1=(0,0)z_1 = \left(0,0\right) z2=(2515,515)z_2 = \left(\frac{2\sqrt{5}}{15},-\frac{\sqrt{5}}{15}\right) z3=(3515,515)z_3 = \left(\frac{3\sqrt{5}}{15},\frac{\sqrt{5}}{15}\right) z4=(53,0)z_4 = \left(\frac{\sqrt{5}}{3},0\right)

I’m loath to stretch the length of this paper out any further with information the reader could easily find for themselves. That said, it is important to show that these points really do form an instance of ZZ. First, note that the distances between each point really does come out to ⅓.

z1,z2=(25150)2+(5150)2=20152+5152=515=13\ell\langle z_1,z_2\rangle = \sqrt{\left(\frac{2\sqrt{5}}{15} -0\right)^2 + \left(-\frac{\sqrt{5}}{15} -0\right)^2} = \sqrt{\frac{20}{15^2} + \frac{5}{15^2}} = \frac{5}{15}=\frac{1}{3}

z2,z3=(35152515)2+(515+515)2=5152+20152=515=13\ell\langle z_2,z_3\rangle = \sqrt{\left(\frac{3\sqrt{5}}{15} -\frac{2\sqrt{5}}{15}\right)^2 + \left(\frac{\sqrt{5}}{15} +\frac{\sqrt{5}}{15} \right)^2} = \sqrt{\frac{5}{15^2} + \frac{20}{15^2}} = \frac{5}{15}=\frac{1}{3}

z3,z4=(55153515)2+(0515)2=20152+5152=515=13\ell\langle z_3,z_4\rangle = \sqrt{\left(\frac{5\sqrt{5}}{15} -\frac{3\sqrt{5}}{15}\right)^2 + \left(\frac{0-\sqrt{5}}{15}\right)^2} = \sqrt{\frac{20}{15^2} + \frac{5}{15^2}} = \frac{5}{15}=\frac{1}{3}

Additionally, the slope between z1z_1 and z2z_2 is 515025150=12\frac{-\frac{\sqrt{5}}{15}-0}{\frac{2\sqrt{5}}{15}-0} = \frac{-1}{2}. The slope between z2z_2 and z3z_3 is 515+51535152515=2\frac{\frac{\sqrt{5}}{15}+\frac{\sqrt{5}}{15}}{\frac{3\sqrt{5}}{15}-\frac{2\sqrt{5}}{15}} = 2. Finally, the slope between z4z_4 and z3z_3 is 0515533515=12\frac{0-\frac{\sqrt{5}}{15}}{\frac{\sqrt{5}}{3}-\frac{3\sqrt{5}}{15}} = \frac{-1}{2}. This shows that each edge is orthogonal to the previous one, confirming that this is an instance of ZZ.

I want to define a type 1 curve that gets very close to accomodating this. For a start, I’ll have this curve include z1z_1, z2z_2, and z3z_3. This curve is shown in purple:

The length of z1,z3,z4\langle z_1, z_3, z_4\rangle is given by:

z1,z3+z3,z4=(5150)2+(35150)2+13=5152+45152+515=52+515=2+13 \ell\langle z_1, z_3\rangle + \ell\langle z_3, z_4\rangle = \sqrt{\left(\frac{\sqrt{5}}{15}-0\right)^2+\left(\frac{3\sqrt{5}}{15}-0\right)^2} + \frac{1}{3} = \sqrt{\frac{5}{15^2}+ \frac{45}{15^2}} + \frac{5}{15} = \frac{5\sqrt{2}+5}{15}=\frac{\sqrt{2}+1}{3}

In order to get as close as possible to accomodating z2z_2, we may place a single point at a distance of 223\frac{2-\sqrt{2}}{3} from z1z_1. The optimal point pp would be the one where z1,p\langle z_1,p \rangle and p,z4\langle p, z_4 \rangle are orthogonal. (trust me)

If p=(px,py)p = \left(p_x, p_y\right), then these two requirements may be expressed as:

(px0)2+(py0)2=223\sqrt{\left(p_x-0\right)^2+\left(p_y-0\right)^2} = \frac{2-\sqrt{2}}{3}

py0px0=53px0py\frac{p_y-0}{p_x-0}=-\frac{\frac{\sqrt{5}}{3}-p_x}{0-p_y}

If we take the first of these two equations and square both sides, this becomes:

px2+py2=6429p_x^2+p_y^2 = \frac{6-4\sqrt{2}}{9}

Meanwhile, we multiply both sides of the other equation by pxpyp_xp_y to get

py2=53pxpx2p_y^2=\frac{\sqrt{5}}{3}p_x-p_x^2

Now we can plug this into the other equation for py2p_y^2 and solve to get

53px=6429\frac{\sqrt{5}}{3}p_x = \frac{6-4\sqrt{2}}{9} px=64235p_x = \frac{6-4\sqrt{2}}{3\sqrt{5}}

Square this to get

px2=6848245p_x^2 = \frac{68-48\sqrt{2}}{45}

Plug this and 53px\frac{\sqrt{5}}{3}p_x into our equation for py2p_y^2 to get

py2=64296848245py2=30202456848245py2=38+28245py=±38+28235\begin{align*} p_y^2 &= \frac{6-4\sqrt{2}}{9}-\frac{68-48\sqrt{2}}{45}\\ p_y^2 &= \frac{30-20\sqrt{2}}{45}-\frac{68-48\sqrt{2}}{45}\\ p_y^2 &= \frac{-38+28\sqrt{2}}{45}\\ p_y &= \pm\frac{\sqrt{-38+28\sqrt{2}}}{3\sqrt{5}} \end{align*}

Of course, this is pretty ugly, but it is accurate. We’ll use the negative version of pyp_y to get p=(64235,38+28235)p=\left(\frac{6-4\sqrt{2}}{3\sqrt{5}},\frac{\sqrt{-38+28\sqrt{2}}}{3\sqrt{5}}\right), as shown:

Let ZZ’ be the 3-chain congruent to

(64235,38+28235),(0,0),(13,13),(23,13)\left\langle \left(\frac{6-4\sqrt{2}}{3\sqrt{5}},\frac{\sqrt{-38+28\sqrt{2}}}{3\sqrt{5}}\right), (0,0), \left(\frac{1}{3},\frac{1}{3}\right), \left(\frac{2}{3},\frac{1}{3}\right)\right\rangle

Convex Hulls of Multiple Curves
#

My plan is to spend this section establishing some useful lemmas about the convex hulls of multiple curves, as a bit of set dressing for the sections to come. The nature of this problem requires that we seek to minimize the size of a convex hull, so I’ll give a definition relating to this:

Given a set of curves {A,B,}=Ω\lbrace A,B,\dots\rbrace =\Omega, let {a,b,}=ω\lbrace a,b,\dots\rbrace=\omega be a set of containing exactly one instance of each curve in ω\omega. define [ω][\omega] to be the single convex hull of the union of ω\omega. Define [Ω][\Omega] to be the set of all possible instances of ω\omega.

Let [ω][\omega] be a convex hull in [Ω][\Omega]. We say a[ω][\omega] is minimal if it has the smallest possible area of all elements of [Ω][\Omega]. The set of minimal convex hulls in [Ω][\Omega] will be denoted with [Ω][\Omega]^-.

For convenience, we order [Ω][\Omega] by area, so that it is a weakly ordered set. We use ω1ω2\omega_1\lesssim \omega_2 to indicate that ω1\omega_1 is smaller than or equal in size to ω2\omega_2. By this ordering, [Ω][\Omega]^- is the set of minimal elements of [C][C].

Here’s the first lemma for this section:

Lemma 6 Let Ω={A,B,C,}\Omega=\lbrace A, B, C, \dots\rbrace be a set of curves, and let ω={a,b,c,}\omega=\lbrace a, b, c, \dots\rbrace be a set of their instances, such that [ω][Ω][\omega] \in [\Omega]^-. If there is an instance of AA inside [ω][\omega] that does not touch its boundary, then [ω][Ω/A][\omega]\in[\Omega/A]^-.

proof:
Let aa’ be the instance of AA that is internal to XX but does not touch its boundary, let ω={a,b,c,}\omega’=\lbrace a’, b, c, \dots\rbrace, and let XX’ be the convex hull of the union of ω\omega’.

It can be immediately seen by definition that ωΩ\omega’\in\Omega, and therefore that X[Ω]X’\in [\Omega]. Because XX is a minimal element of [Ω][\Omega], XXX\lesssim X’. XX’ contains no points outside the boundary of XX, and so XXX\nless X’.

Therefore, XX’ has the same area as XX. This implies that X=XX=X’.

\square

In order to produce the next lemma. we give a special definition:

Let AA and BB be two shapes, and let aa and bb be instances of those shapes. We say that aa and bb are fully intersecting if there exists no linear translation of aa, called aa’, such that [ba][b\cup a’] is a strict subset of [ba][b\cup a]
It should be noted that “fully intersecting” and minimal are not the same thing. Given two shapes AA and BB, and a pair of instances aa and bb, it is possible that aa and bb may be fully intersecting, even if [a,b]∉[A,B][a,b]\not\in[A,B]^-. On the other hand, it is true that if aa and bb are not fully intersecting, then [a,b]∉[A,B][a,b]\not\in[A,B]^-.

This definition comes with an almost immediate lemma:

Lemma 7
Let Ω\Omega be a set of curves, and let ω\omega be a set of their instances. If AA and BB are elements of Ω\Omega, then [ω][\omega] accomodates some pair of instances aa and bb, where aa and bb are fully intersecting.

proof:

Let a,bωa,b\in\omega. First, we know that [ω][a,b][\omega]\sqsupseteq[a,b]. We also know that if aa and bb are not fully intersecting, they at least have a fully intersecting pair of linear transformations, aa’ and bb’, such that [a,b][a,b][a,b]\sqsupseteq[a’,b’]. By the transitive property, [ω][a,b][\omega]\sqsupseteq[a’,b’]

\square

We can extend this lemma just a little bit further, finally reaching a theorem about the nature of minimal convex hulls.

Theorem 4
Let Ω\Omega be a set of curves. If ω\omega is a set of instances, such that [ω][Ω][\omega]\in[\Omega]^-, then every pair of elements in ω\omega are fully intersecting.

proof:

Let ω\omega be such that [ω][Ω][\omega]\in[\Omega]^-. In order to force a contradiction, assume that there exists a,bωa,b\in\omega such that aa and bb are not fully intersecting. Because of Lemma 6, we may assume that both aa and bb touch the boundary of [ω][\omega]. In fact, they both extend the boundary of [ω][\omega], meaning that removing them would result in a smaller overall convex hull.

Let’s assume (WLOG) that aa’ is a linear transformation of aa, such that a[a,b]a’\sqsubset[a,b]. Let ω\omega’ be formed by replacing aa in ω\omega with aa’. Note that, by the definition of fully intersecting, [a,b][ab][a,b]\sqsupseteq[a’b], and therefore, [ω][ω][\omega]\sqsupseteq[\omega’]. This means that if there is any point on the boundary of [ω][\omega] that is not present in [ω][\omega’], then [ω][ω][\omega]\sqsupset[\omega’]. But this cannot be the case if [ω][Ω][\omega]\in[\Omega]^-, so it must be that [ω]=[ω][\omega]=[\omega’].

We now know that there are some points present in [a,b][a,b] that are absent in [a,b][a’,b], but that none of these points are on the boundary of [ω][\omega]. As mentioned, aa lies on the boundary of [ω][\omega]. If aa’ is also on the boundary of [ω][\omega], that means that aa must not have actually expanded the boundary of [ω][\omega]. This also violates the spirit of Lemma 6, giving a contradiction.

\square

For this lemma to be maximally useful, it’s a good idea to characterize better what makes two curves fully intersecting.

Theorem 4½
Let cc and dd be closed convex shapes, neither of which is a subset of the other, and neither of which has an area of 0. Suppose also that the boundaries of cc and dd intersect at fewer than three points.

If [c,d]/(cd)[c,d]/\left(c\cup d\right) is discontinuous, cc and dd are not fully intersecting.

proof:

To use more down-to-earth terminology, if [c,d]/(cd)[c,d]/\left(c\cup d\right) is discontinuous, this means that [c,d][c,d] adds two non-adjacent regions to (cd)\left(c\cup d\right). An example of this is shown below. The dark grey regions are the interiors of cc and dd, while the light grey regions are the areas added by [c,d][c,d]

Let c1,d1\left\langle c_1,d_1\right\rangle and c2,d2\left\langle c_2,d_2\right\rangle be the two lines on the border of [c,d][c,d] that are not part of cc or dd, where c1c_1 and c2c_2 lie on cc, and d1d_1 and d2d_2 lie on dd. Assume, without loss of generality, that c1,c2d1,d2\ell\left\langle c_1,c_2\right\rangle \leq \ell\left\langle d_1,d_2\right\rangle, so that the two lines are either parallel, or slanting to be closer near cc. This is shown below:

For what comes next, consider rotating [c,d][c,d] such that the slope of c1,d1\left\langle c_1,d_1\right\rangle is non-negative, and the slope of c2,d2\left\langle c_2,d_2\right\rangle is non-positive, as in the previous diagram. For every point pp in cc, there is a positive distance between pp and the point on the border of dd and [c,d][c,d] that shares pp’s yy-coordinate. If there wasn’t, then pp would lie on the the border of dd, violating one of our assumptions. Let xx be the minimal possible such distance, and let cc’ be the result of shifting cc a distance of xx towards dd.

We have now demonstrated that [c,d][c,d][c’,d]\subset[c,d], and by definition, cc and dd are not fully intersecting.

\square

Corrolary 4½.1

Let gg be an instance of GG, and let cc be a closed convex shape that does not accommodate GG. If either endpoint of gg is internal to cc, then gg and cc are not fully intersecting.

proof:

Let g1g_1 be the endpoint of gg that is internal to cc, meaning that it is in cc, but not on its boundary. Let g2g_2 be the endpoint of gg that is external to cc. There exists a closed ball with positive radius rr around g1g_1 entirely in cc. Let g2g_2’ be the point on gg at distance rr from g2g_2, and let g1g_1’ be the point 1 unit from g2g_2’ such that g1,g2\left\langle g_1’,g_2’\right\rangle passes through g1g_1. furthermore, let g=g1,g2g’=\left\langle g_1’,g_2’\right\rangle. We may observe that g1g_1’ an g2g_2’ are both in [c,g][c,g], but g2g_2 is not in [c,g][c,g’]. By definition, [c,g][c,g][c,g’]\subset[c,g], and so gg and cc are not fully intersecting.

\square

For the final result of this this section, it’ll be helpful to note one final definition. To be clear, this isn’t a definition of my own, but one that is commonly used.

Given a shape AA, the maximal distance between any two points in AA is called AA’s diameter.

In mathematical equations, the diameter of AA will be notated with diam(A)\text{diam}(A)

The theorem we’re attempting to develop with this is:

Theorem 4¾

For any convex shapes CC and DD, the diameter of any element of [C,D][C,D]^- is max(diam(C),diam(D))\text{max}\left(\text{diam}\left(C\right), \text{diam}\left(D\right)\right).

Showing this is quite hard! It seems true, but that’s hardly proof. I’m spinning off this discussion into its own mini-article, so it doesn’t become a massive part of this one.

Moserville
·558 words·3 mins

Examples of Minimal Covers on Finite Sets
#

In a previous section, I limited the space of curves worth considering by defining taughtness. I showed that some curves are accomodated by other curves, and therefore not worth considering. In the next section, I’ll try to limit the space of curves even further by finding some curves that are accomodated by sets of other curves.

A curve XX is proxy accommodated by a set of curves Ξ\Xi if XX is accommodated by every shape in [Ξ][\Xi]^-.

My initial goal was to hopefully find a finite set of taut curves that proxy accommodate all other taut curves, and then to find the smallest set that contains all curves in that finite set. This goal seems reasonable at first, as it is much easier to find [Ξ][\Xi]^- when Ξ\Xi is a finite set of curves as opposed to an infinite one. There are some severe problems with this as an objective, but there’s still much to be learned by exploring it!

In this section, to illustrate that it is in fact possible to find the minimal size of covering shapes for certain finite numbers of curves, we’ll find a few examples. These next two theorems are 5% insight and 95% bookkeeping, so feel free to skip their proofs if you’re in a hurry.

Theorem 5
All elements of [S,G][S,G]^- are congruent to a quadrilateral defined with:

(13,0),(a,13),(0,13),(b,0),(13,0)\left\langle \left(\frac{1}{3},0\right), \left(a,\frac{1}{3}\right), \left(0,\frac{1}{3}\right), \left(b,0\right), \left(\frac{1}{3},0\right)\right\rangle

Where b<0b<0, a>13a>\frac{1}{3}, and ab=223a-b = \frac{2\sqrt{2}}{3}. Their area is approximately 0.2680.268.

proof:
First, assume without loss of generality that the instance of SS is located on the points (0,0)(0,0), (13,0)\left(\frac{1}{3},0\right), (13,13)\left(\frac{1}{3},\frac{1}{3}\right), and (0,13)\left(0,\frac{1}{3}\right), as in the definition’s example. Call this instance ss. Let the intance of GG have endpoints AA and BB, so that A,B=1\ell\langle A,B\rangle = 1. Define this instance of the line so that A,B=g\langle A,B\rangle = g. Note that the line between AA and BB must pass through the square in order for these shapes to be fully intersecting.

Of course, [s,g][s,g] is the convex hull of these points. A potential instance of [s,g][s,g] is shown in the illustration below:

[s,g][s,g] can be deconstructed into the central square and multiple triangles:

Let A=(ax,ay)A=(a_x,a_y), and let B=(bx,by)B=(b_x,b_y). Additionally, let f(x)f(x) be the distance of xx to the interval [0,13]\left[0,\frac{1}{3}\right], so that

f(x)={xif x00if x(0,13)x13if x13f(x) = \begin{cases} -x &\text{if } x\leq 0 \\ 0 &\text{if } x\in \left(0,\frac{1}{3}\right)\\ x-\frac{1}{3} &\text{if } x \geq \frac{1}{3} \end{cases}

To give an example of how this could be useful, consider the triangle formed by axa_x. If AA is the lower point on the figure above, this would be the green triangle. Using the standard formula for area of a triangle, we get 1213f(ax)\frac{1}{2}\cdot\frac{1}{3}\cdot f(a_x)

Then, the full area of [s,g][s,g] would be given by:

19+if(i)6\frac{1}{9}+\sum_{i} \frac{f(i)}{6}

Where ii is an element of {ax,ay,bx,by}\left\lbrace a_x,a_y,b_x,b_y\right\rbrace . The presence of 19\frac{1}{9} is due to the area of ss. There is also the limitation from the length of gg which implies that:

1=A,B1=(axbx)2+(ayby)21=(axbx)2+(ayby)2\begin{align*} 1 &= \ell \langle A,B\rangle\\ 1 &= \sqrt{\left(a_x-b_x\right)^2+\left(a_y-b_y\right)^2}\\ 1 &= \left(a_x-b_x\right)^2+\left(a_y-b_y\right)^2 \end{align*}

For the time being, let’s hold AA constant, and consider the optimal placement of BB, assuming without loss of generality that bx0b_x \geq 0 and that by13b_y\geq \frac{1}{3}. Solving the previous equation for byb_y gives

1=(ayby)2+(axbx)21(ayby)2=(axbx)21(ayby)2=axbxax1(ayby)2=bx\begin{align*} 1 &= \left(a_y-b_y\right)^2+\left(a_x-b_x\right)^2\\ 1- \left(a_y-b_y\right)^2 &= \left(a_x-b_x\right)^2\\ \sqrt{1- \left(a_y-b_y\right)^2} &= a_x-b_x\\ a_x - \sqrt{1- \left(a_y-b_y\right)^2} &= b_x \end{align*}

Therefore, the maximum value of bxb_x is:

bxax1(ay13)2b_x \leq a_x - \sqrt{1- \left(a_y-\frac{1}{3}\right)^2}

By symmetry, we also know that by=ay1(axbx)2b_y= a_y - \sqrt{1- \left(a_x-b_x\right)^2}. Therefore, the total area due to BB can be expressed as:

AB=f(bx)+f(by)=f(bx)+by=f(bx)+ay1(axbx)2A_B = f(b_x) + f(b_y) = f(b_x) + b_y = f(b_x) + a_y - \sqrt{1- \left(a_x-b_x\right)^2}

This function is non-differentiable or has boundaries when bx=0b_x=0, when bx=13b_x=\frac{1}{3}, and when bx=ax1(ay13)2b_x= a_x - \sqrt{1- \left(a_y-\frac{1}{3}\right)^2}. If x[0,13]x\in\left[0,\frac{1}{3}\right], then

AB=ay1(axbx)2A_B = a_y - \sqrt{1- \left(a_x-b_x\right)^2}

The derivative of this is

dABdbx=2ax2bx21(axbx)2=axbx1(axbx)2\frac{dA_B}{db_x} = \frac{2a_x-2b_x}{2\sqrt{1- \left(a_x-b_x\right)^2}} = \frac{a_x-b_x}{\sqrt{1- \left(a_x-b_x\right)^2}}

This is equal to 0 only when bx=axb_x = a_x. On the other hand, if x>13x > \frac{1}{3}, then

AB=bx13+ay1(axbx)2A_B = b_x-\frac{1}{3} + a_y - \sqrt{1- \left(a_x-b_x\right)^2}

The derivative of this is quite similar to the derivative from before:

dABdbx=1+axbx1(axbx)2\frac{dA_B}{db_x} = 1 + \frac{a_x-b_x}{\sqrt{1- \left(a_x-b_x\right)^2}}

For this derivative to be equal to 0, this comes to:

0=1+axbx1(axbx)21=bxax1(axbx)21(axbx)2=bxax1(axbx)2=(bxax)21=0\begin{align*} 0 &= 1 + \frac{a_x-b_x}{\sqrt{1- \left(a_x-b_x\right)^2}}\\ 1 &= \frac{b_x-a_x}{\sqrt{1- \left(a_x-b_x\right)^2}}\\ \sqrt{1- \left(a_x-b_x\right)^2} &= b_x-a_x\\ 1- \left(a_x-b_x\right)^2 &= \left(b_x-a_x\right)^2\\ 1 &= 0 \end{align*}

This is obviously not possible, meaning that the critical points of this area function are when bx=0b_x=0, when bx=13b_x=\frac{1}{3}, when by=13b_y=\frac{1}{3}, or when 0<bx=ax<00 < b_x=a_x < 0.

If bx,ax[0,13]b_x,a_x \in \left[0,\frac{1}{3}\right], then the formula for area is given by ay1(axbx)2a_y - \sqrt{1- \left(a_x-b_x\right)^2}. It is clear from inspection that this function is larger when ax=bxa_x=b_x as opposed to otherwise. Because of this, we conclude that BB lies on either x=0x=0, x=13x=\frac{1}{3}, y=0y=0, or y=13y=\frac{1}{3}. By symmetry, AA also lies on one of these lines. This leads to one of two possibilities:

In one possibility, assume that both AA and BB lie on x=0x=0 and y=0y=0. Without loss of generality, let bx=0b_x = 0 while ax=13a_x = \frac{1}{3}. Then the distance between \(A) and \(B\) gives the restriction that:

1=(axbx)2+(ayby)21=(13)2+(ayby)21=19+(ayby)289=(ayby)2223=ayby\begin{align*} 1 &= \sqrt{\left(a_x-b_x\right)^2+\left(a_y-b_y\right)^2}\\ 1 &= \sqrt{\left(\frac{1}{3}\right)^2+\left(a_y-b_y\right)^2}\\ 1 &= \frac{1}{9}+\left(a_y-b_y\right)^2\\ \frac{8}{9} &= \left(a_y-b_y\right)^2\\ \frac{2\sqrt{2}}{3} &= |a_y-b_y| \end{align*}

Assuming that the square lies between these two points, the area of the full shape is then given by:

A=19+16223=2+22180.268A= \frac{1}{9}+\frac{1}{6}\cdot\frac{2\sqrt{2}}{3}=\cdot\frac{2+2\sqrt{2}}{18}\approx 0.268

In the other possibility, AA lies on a vertical line, while BB lies on a horizontal line, or vice versa. Without loss of generality, let ax=0a_x=0, and let by=0b_y = 0.Then the distance between \(A) and \(B\) gives the restriction that:

1=(axbx)2+(ayby)21=bx2+ay21=bx2+ay21ay2=bx21ay2=bx\begin{align*} 1 &= \sqrt{\left(a_x-b_x\right)^2+\left(a_y-b_y\right)^2}\\ 1 &= \sqrt{b_x^2+a_y^2}\\ 1 &= b_x^2+a_y^2\\ 1 - a_y^2 &= b_x^2\\ \sqrt{1 - a_y^2} &= b_x \end{align*}

And the total area of the shape is given by:

A=19+16bx+16ay=19+1ay26+ay6A= \frac{1}{9}+\frac{1}{6}b_x + \frac{1}{6}a_y=\frac{1}{9}+\frac{\sqrt{1 - a_y^2}}{6} + \frac{a_y}{6}

Once again, we find the minimum of this using first-term calculus. The derivative of this area is:

dAday=ay61ay2+16\frac{dA}{da_y}= \frac{ -a_y}{6\sqrt{1 - a_y^2}}+\frac{1}{6}

This derivative evaluates to 0 when

0=ay61ay2+16ay61ay2=16ay=1ay2ay2=1ay22ay2=1ay=12\begin{align*} 0 &= \frac{ -a_y}{6\sqrt{1 - a_y^2}}+\frac{1}{6}\\ \frac{a_y}{6\sqrt{1 - a_y^2}} &= \frac{1}{6}\\ a_y &= \sqrt{1 - a_y^2}\\ a_y^2 &= 1 - a_y^2\\ 2a_y^2 &= 1\\ a_y &= \frac{1}{\sqrt{2}} \end{align*}

If ay=12a_y=\frac{1}{\sqrt{2}}, then bx=12b_x=\frac{1}{\sqrt{2}} as well. This solution is invalid, however, as the instance of LL in this case would not pass through the instance of SS. Rather than showing this mathematically, I’ll just show the results of a graphing utility.

The non-differentiable points in the second possibility are identical to the non-differentiable points in the first possibility, proving the theorem.

\square

Admittedly, the statement of that theorem might have been highly intuitive to you. You might find it much less intuitive to consider elements of [G,Vπ6]\left[G,V_{\frac{\pi}{6}}\right]^-.

Theorem 6
All elements of [E,G][E,G]^- are congruent to a triangle with base length 1 and height 34\frac{\sqrt{3}}{4}, and have an area of 38\frac{\sqrt{3}}{8}.

proof:

To start, let’s place an instance of EE in the cartesian plane, and we’ll call this instance ee (Euler’s number does not appear anywhere in this article). I’ll place this triangle with two points on the xx-axis, and one point on the yy-axis. Then the points in this triangle are:

e1=(14,0)e_1=\left(\frac{-1}{4},0\right) e2=(14)e_2=\left(\frac{1}{4}\right) e3=(0,34)e_3=\left(0,\frac{\sqrt{3}}{4}\right)

Note that it isn’t really important which two sides of this triangle make up ee, as we’ll be considering the convex hull, so all three sides are included in this graph. I will illustrate this triangle below:

Note that I have labeled the sections of this triangle with numbers 1 though 6. This is because the instance of GG must pass through ee by Theorem 4. Note that GG cannot have endpoints in adjacent sections if it passes through ee. Note also that GG cannot have both its endpoints in odd-numbered sections. Therefore, define the endpoints of GG to be a=(ax,ay)a=\left(a_x,a_y\right) and b=(bx,by)b=\left(b_x,b_y\right). Without loss of generality, assume aa is in section 4, and assume bb is in section 1 or 6.

What is the size of [e,a,b]\left[e,\langle a,b\rangle\right]? First, Let’s find the area of [e,a]\left[e,a\right]. This consists of the area inside ee, as well as the area in the triangle defined by the points e1e_1, e2e_2, and aa.

The area of the grey region inside ee is given by 121234\frac{1}{2}\cdot\frac{1}{2}\cdot\frac{\sqrt{3}}{4}. The area of the blue region created by aa is given by 1212ay\frac{1}{2}\cdot\frac{1}{2}\cdot a_y. Therefore, the total area of these two regions is:

316+ay4\frac{\sqrt{3}}{16}+\frac{a_y}{4}

Let’s start by assuming that bb is located in region 6. In this case, the inclusion of bb creates an additional triangle defined by the points e3e_3, e2e_2, and bb, as shown below:

The area of this green region is determined by the distance from bb to the line from e3e_3 to e2e_2. There’s a useful formula that gives the area of a triangle given the locations of its three points:

Area=bx(340)+0(0by)+14(by34)2=bx34+14(by34)2=bx3+by348=bx38+by8332\begin{align*} Area &=\left|{\frac{b_x\left(\frac{\sqrt{3}}{4}-0\right)+0\left(0-b_y\right)+\frac{1}{4}\left(b_y-\frac{\sqrt{3}}{4}\right)}{2}}\right|\\ &= \left|{\frac{\frac{b_x\sqrt{3}}{4}+\frac{1}{4}\left(b_y-\frac{\sqrt{3}}{4}\right)}{2}}\right|\\ &= \left|{\frac{b_x\sqrt{3}+b_y-\frac{\sqrt{3}}{4}}{8}}\right|\\ &= \frac{b_x\sqrt{3}}{8}+\frac{b_y}{8}-\frac{\sqrt{3}}{32} \end{align*}

Recall that bb and aa are exactly one unit apart. This gives the restriction that:

1=(byay)2+(bxax)21(bxax)2=(byay)21(bxax)2=byay1(bxax)2+ay=by\begin{align*} 1 &= \left(b_y-a_y\right)^2 +\left(b_x-a_x\right)^2\\ 1-\left(b_x-a_x\right)^2 &= \left(b_y-a_y\right)^2 \\ \sqrt{1-\left(b_x-a_x\right)^2} &= b_y-a_y \\ \sqrt{1-\left(b_x-a_x\right)^2}+a_y &= b_y\\ \end{align*}

Substituting this into the equation of the area of the green region would give

Area=bx38+1(bxax)2+ay8332Area = \frac{b_x\sqrt{3}}{8}+\frac{\sqrt{1-\left(b_x-a_x\right)^2}+a_y}{8}-\frac{\sqrt{3}}{32}

To find the minimum possible value of this area, we take the derivative with respect to bxb_x and set that derivative equal to 0.

0=38+2(bxax)821(bxax)20=38bxax81(bxax)20=3bxax1(bxax)23=bxax1(bxax)233(bxax)2=bxax33(bxax)2=(bxax)23=4(bxax)2±32=bxax\begin{align*} 0&= \frac{\sqrt{3}}{8}+\frac{-2\left(b_x-a_x\right)}{8\cdot 2\sqrt{1-\left(b_x-a_x\right)^2}}\\ 0&= \frac{\sqrt{3}}{8}-\frac{b_x-a_x}{8\sqrt{1-\left(b_x-a_x\right)^2}}\\ 0&= \sqrt{3}-\frac{b_x-a_x}{\sqrt{1-\left(b_x-a_x\right)^2}}\\ \sqrt{3} &= \frac{b_x-a_x}{\sqrt{1-\left(b_x-a_x\right)^2}}\\ \sqrt{3-3\left(b_x-a_x\right)^2} &= b_x-a_x\\ 3-3\left(b_x-a_x\right)^2 &= \left(b_x-a_x\right)^2\\ 3 &= 4\left(b_x-a_x\right)^2\\ \pm\frac{\sqrt{3}}{2} &= b_x-a_x\\ \end{align*}

Again, the distance between bb and aa must be exactly one unit. This means that:

1=(byay)2+(±32)21=(byay)2+3414=(byay)212=byay\begin{align*} 1 &= \left(b_y-a_y\right)^2 +\left(\pm\frac{\sqrt{3}}{2}\right)^2\\ 1 &= \left(b_y-a_y\right)^2 + \frac{3}{4}\\ \frac{1}{4} &= \left(b_y-a_y\right)^2\\ \frac{1}{2} &= b_y-a_y\\ \end{align*}

Because bx>0b_x>0, and byb_y is so low above aya_y, we may assume that b=(ax+32,ay+12)b= \left(a_x+\frac{\sqrt{3}}{2},a_y+\frac{1}{2}\right) . This is signifigant, because we may observe that the slope from aa to bb is 13\frac{1}{\sqrt{3}}. This is perpendicular to the slope from Z3Z_3 to Z2Z_2, meaning that this location of bb maximizes the area of the green triangle.

The other posibility that would create a critical point for the area of the green triangle is that bb lies on either the xx-axis, or on the line y=3x+34y=\sqrt{3}x+\frac{\sqrt{3}}{4}, as these are the bounds of bb’s location. It is not possible for bb to lie on the xx-axis, as this would result in the line GG lying outside ee. Therefore, the minimum area is achieved when bb lies on y=3x+34y=\sqrt{3}x+\frac{\sqrt{3}}{4}.

The other posibility is that bb lies in section 1. If this is the case, we may ignore the area of ee, as the overall area can now be defined by a triangle between bb, e1e_1, and e2e_2.

In this case, the total area of [e,a,b][e,\langle a,b\rangle] is 14ay+14by\frac{1}{4}a_y+\frac{1}{4}b_y. Holding the position of aa constant, byb_y is minimized by maximizing (bxax)\left|\left(b_x-a_x\right)\right|. This means that bb must lie on the boundary of Section 1.

We have concluded that bb must lie on the boundary of section 1, regardless of what other assumptions have been made. Let’s assume, without loss of generality, that bb lies on the line y=3x+34y=\sqrt{3}x+\frac{\sqrt{3}}{4}, while aa is in section 4. From here, we’ll hold the location of bb constant and find the optimal location of aa. As before, to minimize the area of the blue triangle formed by aa, we must minimize aya_y, which means maximizing (bxax)\left|\left(b_x-a_x\right)\right|. We conclude that aa must lie on the boundary of section 1. If aa lies on the line passing through e3e_3 and e2e_2, then a,b\langle a,b\rangle does not intersect ee.

We have now shown that [e,a,b][e, \langle a,b\rangle] is congruent to a triangle with base length 1 and height 34\frac{\sqrt{3}}{4}. The area of such a triangle is 34\frac{\sqrt{3}}{4}.

\square

Some Difficulties in Finding Minimal Covers
#

There are two ideas that I imediately had when considering how to find minimal covers. Neither of them are effective, but it’s informative to see how these strategies can fail. The first would look like this:

Theorem 6 and a half
Let AA and BB be curves, and let aa and bb be instances of those curves. Additionally, let aa’ be such that [a,b][A,B][a’,b] \in [A,B]^-, while [a,b]∉[A,B][a,b] \not\in [A,B]^-. There does not nececarily exist a linear transformation T:aaT:a\rightarrow a’ such that the area of [Tt(a),b][T^t(a),b] is monotonically decreasing on t[0,1]t\in[0,1].

Another way of phrasing this is that there might not be a way to smoothly slide a shape into position to minimize the size of the complex hull without first increasing the size of the complex hull. A third way of phrasing it is that there may be local minima in the size of the convex hull of two shapes that is not an absolute minimum.

proof

Let bb be a rectangle defined by the points 0,00,0, 0,140,\frac{1}{4}, 12,14\frac{1}{2},\frac{1}{4}, and 12,0\frac{1}{2},0. Consider the convex hull of this rectangle and a unit length line AA. It was shown in Theorem 5 that the optimal locations for this line are such that its endpoints lie on opposite parallel extensions of the sides of bb. This means that the endpoints of aa either lie on x=0x=0 and x=12x=\frac{1}{2}, or they lie on y=0y=0 and y=14y=\frac{1}{4}. Without loss of generality, assume one endpoint of aa lies at the origin. Because the other endpoint of aa must lie a unit away from the origin, the two possible locations are (12,32)\left(\frac{1}{2},\frac{\sqrt{3}}{2}\right) and (154,14)\left(\frac{\sqrt{15}}{4},\frac{1}{4}\right), as shown below:

The area of the box on its own is 1214=18\frac{1}{2}\cdot\frac{1}{4} = \frac{1}{8}. The total area of [a,b][a,b] is:

18+1212(3214)=216+142314=23+1160.279 if an endpoint lies at (12,32)18+1214(15412)=432+181524=15+2320.184 if an endpoint lies at (154,14)\begin{align*} \frac{1}{8} + \frac{1}{2}\cdot\frac{1}{2}\cdot\left(\frac{\sqrt{3}}{2} - \frac{1}{4}\right) = \frac{2}{16} + \frac{1}{4}\cdot\frac{2\sqrt{3}-1}{4} = \frac{2\sqrt{3}+1}{16}\approx 0.279 & \text{ if an endpoint lies at } \left(\frac{1}{2},\frac{\sqrt{3}}{2}\right) \\ \frac{1}{8} + \frac{1}{2}\cdot\frac{1}{4}\cdot\left(\frac{\sqrt{15}}{4} - \frac{1}{2}\right) = \frac{4}{32} + \frac{1}{8}\cdot\frac{\sqrt{15}-2}{4} = \frac{\sqrt{15}+2}{32}\approx 0.184 & \text{ if an endpoint lies at } \left(\frac{\sqrt{15}}{4},\frac{1}{4}\right) \\ \end{align*}

Suppose aa is located with an endpoint at (12,32)\left(\frac{1}{2},\frac{\sqrt{3}}{2}\right). A transformation that brings aa to (154,14)\left(\frac{\sqrt{15}}{4},\frac{1}{4}\right) would have to first bring it through the area above y=14y=\frac{1}{4} and to the right of x=12x=\frac{1}{2}, which would increase the size of [a,b][a,b].

\square

What a shame! I was hoping that it would be possible to smoothly shift shapes to find their best convex hulls. An even bigger issue came when I realized this:

Theorem 6 and three quarters
Let AA, BB, and CC be curves. It is not necesarily the case that there exists an element of [A,B,C]\left[A, B, C\right]^- accomodates an element of [A,B]\left[A, B\right]^-.

In other words, it doesn’t work to find the minimal shell of two shapes and then find the minimal shell of that with a third shape. You have to consider all three shapes at the same time.

proof

Consider the set of curves consisting of the unit length line segment, the equilateral triangle with sides of length 12\frac{1}{2}, and the square with sides of length 13\frac{1}{3}, or G,E,S{G, E, S}. To cause a contradiction, let [s,g][s,g] be an instance of [S,G][S,G]^- and let [g,e][g’,e] be an instance of [G,E][G, E]^-, so that

[s,g,g,e][S,G,E][s,g,g’,e]\in[S, G, E]^-

Immediately, we have that [s,g,e][s,g,e] is not larger than [s,g,g,e][s,g,g’,e]. Because [s,g,g,e][S,G,E][s,g,g’,e]\in[S, G, E]^-, we have that [s,g,e][s,g,e] cannot be smaller than [s,g,g,e][s,g,g’,e], and so they must be the same size. By the same reasoning, [s,g,e][s,g’,e] must be the same size as [s,g,g,e][s,g,g’,e] as well. This means that gg or gg’ can be removed from [s,g,g,e][s,g,g’,e] without shrinking it, but not both. Therefore, gg and gg’ must share at least one endpoint on the boundary of [s,g,g,e][s,g,g’,e]. Without loss of generality, let this endpoint be the origin.

First, let’s assume that g=gg = g’. Without loss of generality, we can let gg lie on the xx-axis, with ee lying above the xx axis. We know that ee has a height of 34\frac{\sqrt{3}}{4}, and a base length of 12\frac{1}{2}, meaning that [g,e][g,e] consists of a triangle between the points (0,0)(0,0), (1,0)(1,0), and (a,34)(a,\frac{\sqrt{3}}{4}), with a[14,34]a\in[\frac{1}{4},\frac{3}{4}].

Without loss of generality, we’ll also assume that the location of ss is such that the lines passing through the endpoints of gg have positive slope. Then the sides of are defined by the equations:

y=24xy=\frac{\sqrt{2}}{4}x y=24(1)y=\frac{\sqrt{2}}{4}(-1) y=22x+by=-2\sqrt{2}x+b y=22x+1+by=-2\sqrt{2}x+1+b

Where b[0,221]b\in[0,2\sqrt{2}-1]. This is shown below:

The bottom point of this square is located at the intersection of y=22x+by=-2\sqrt{2}x+b and y=122(x1)y=\frac{1}{2\sqrt{2}}\left(x-1\right). This point is located at:

(22b+19,b229)\left(\frac{2\sqrt{2}b+1}{9}, \frac{b-2\sqrt{2}}{9}\right)

This point forms a triangle with the instance of gg on the x axis. That triangle has a size of

b2218\frac{b-2\sqrt{2}}{18}

There is also a triangle formed by the top left point of ss, whenever this point is not inside [g,e][g,e]. This point is located at

(8(1+b)9,1+b9)\left(\frac{\sqrt{8}\left(1+b\right)}{9},\frac{1+b}{9}\right)

This triangle only exists when the top right point of ss is above the top left line of [e,g][e,g]. This line is determined by

y=34a4(x1)y=\frac{\sqrt{3}}{4a-4}\left(x-1\right)

Therefore, if this triangle exists, we may assume 1+b934a4(22(1+b)91)(1+b)(4a4)(22(1+b)9)3\begin{align*} \frac{1+b}{9} &\geq \frac{\sqrt{3}}{4a-4}\left(\frac{2\sqrt{2}\left(1+b\right)}{9}-1\right)\\ \left(1+b\right)\left(4a-4\right) &\geq\left(2\sqrt{2}\left(1+b\right)-9\right)\sqrt{3}\\ \end{align*}

If this is satisfied, there is a triangle with the point of gg located at (1,0)\left(1,0\right), and the point of ee located at (a,34)\left(a, \frac{\sqrt{3}}{4}\right) with a[14,34]a\in\left[\frac{1}{4},\frac{3}{4}\right]. Using the shoelace formula, the area of this triangle is given by

12(122(1+b)9)(340)(1a)(1+b90)1729326(1+b)+4a4+4ab4b\begin{align*} &\frac{1}{2}\left|\left(1-\frac{2\sqrt{2}\left(1+b\right)}{9}\right)\left(\frac{\sqrt{3}}{4}-0\right)-\left(1-a\right)\left(\frac{1+b}{9}-0\right)\right|\\ &\frac{1}{72}\left|9\sqrt{3}-2\sqrt{6}\left(1+b\right)+4a-4+4ab-4b\right|\\ \end{align*}

Because we are assuming that this triangle exists, this may be rewritten as

172((22(1+b)9)(3)(4a4)(1+b))\frac{1}{72}\left(\left(2\sqrt{2}\left(1+b\right)-9\right)\left(\sqrt{3}\right)-\left(4a-4\right)\left(1+b\right)\right)

Therefore, it is ideal to maximize aa, so let a=34a=\frac{3}{4}. The equation for area then simplifies to

172((26+1)(1+b)93)\frac{1}{72}\left(\left(2\sqrt{6}+1\right)\left(1+b\right)-9\sqrt{3}\right)

This is a linear equation in which area grows with bb proportionally to \(\frac{2\sqrt{6}+1}{72}\aprox 0.0819\). This is quicker growth than the growth of the bottom triangle, which grows proportianally to \(\frac{1}{18}\aprox 0.055\). We may conclude that bb must be such that

(1+b)(4a4)=(22(1+b)9)3\left(1+b\right)\left(4a-4\right) =\left(2\sqrt{2}\left(1+b\right)-9\right)\sqrt{3}

(1+b)(4a4)=(22(1+b)9)3(1+b)=9326(1+b)b=9326+11\begin{align*} \left(1+b\right)\left(4a-4\right) &=\left(2\sqrt{2}\left(1+b\right)-9\right)\sqrt{3}\\ \left(1+b\right) &= 9\sqrt{3}-2\sqrt{6}\left(1+b\right)\\ b &=\frac{9\sqrt{3}}{2\sqrt{6}+1}-1\\ \end{align*}

This is shown below:

It is evident that ss can be translated linearly along the line y=34a4(x1)y=\frac{\sqrt{3}}{4a-4}\left(x-1\right) to decrease the overall size of [e,g,s][e,g,s], meaning that [e,g,s]∉[E,G,S][e,g,s]\not\in[E,G,S]^-.

So suppose instead that ggg\neq g’, and that

\square

In a certain sense, this is absolutely disasterous.

Proxy Accommodated Curves
#

As mentioned, one of the big hopes in this project is that we should be able to ignore certain curves because they are accomodated by the unions of other curves. For example, let’s look at what else might be covered by one of the elements of [E,G][E,G]^-.

Theorem 7
The triangle consisting of points at (12,0)\left(\frac{-1}{2},0\right), (12,0)\left(\frac{1}{2},0\right), and (0,34)\left(0,\frac{\sqrt{3}}{4}\right) acommodates VθV_\theta for any value of θ\theta, and VθV_\theta for any value of θ\theta.
proof

Let tt be the triangle described in the theorem statement.

For any angle θ\theta, define vθv_\theta to contain the points v1=(0,12cosθ2)v_1=\left(0,\frac{1}{2}\cos{\frac{\theta}{2}}\right), v2=(12sinθ2,0)v_2=\left(\frac{1}{2}\sin{\frac{\theta}{2}},0\right), and v3=(12sinθ2,0)v_3=\left(-\frac{1}{2}\sin{\frac{\theta}{2}},0\right). Note first that the distance between v1v_1 and v2v_2 is:

(12cosθ2)2+(12sinθ2)2=12cos2θ2+sin2θ2=1\sqrt{\left(\frac{1}{2}\cos{\frac{\theta}{2}}\right)^2+\left(-\frac{1}{2}\sin{\frac{\theta}{2}}\right)^2}=\frac{1}{2}\sqrt{\cos^2{\frac{\theta}{2}}+\sin^2{\frac{\theta}{2}}}=1

By symmetry, the distance from The angle formed by this shape is 12\frac{1}{2}. The angle formed at v1v_1 is θ\theta, confirming that this is an instance of VθV_\theta. Note that 12sinθ2<12\frac{1}{2}\sin{\frac{\theta}{2}} < \frac{1}{2} for all values of θ\theta, meaning that v2v_2 lies inside tt. By symmetry, v2v_2 also lies inside tt.

v1v_1 lies inside tt whenever 12cosθ2<34\frac{1}{2}\cos{\frac{\theta}{2}}<\frac{\sqrt{3}}{4}. This is true when θ<pi6\theta<\frac{pi}{6}. Since VθV_\theta is not taut for θ<π6\theta < \frac{\pi}{6}, we won’t consider these anyway.

For any radius rr, let ara_r be the curve defined with y=r2x2rsin(πr12r)y=\sqrt{r^{2}-x^{2}}-r\sin\left(\frac{\pi r-1}{2r}\right), where y>0y>0. The line that defines the right half of tt is y=32x+34y=-\frac{\sqrt{3}}{2}x+\frac{\sqrt{3}}{4}. These two lines intersect when

r2x2rsin(πr12r)=32x+34r2x2rsin(πr12r)=34(12x)r2x2=34(12x)+rsin(πr12r)r2x2=316(12x)2+32(12x)rsin(πr12r)+r2sin2(πr12r)r2x2=31638x+34x2+3r2(12x)sin(πr12r)+r2sin2(πr12r)r2=31638x+74x2+3r2(12x)sin(πr12r)+r21cos(πr1r)20=3834x+72x2+3r(12x)sin(πr12r)r2r2cos(πr1r)\begin{align*} \sqrt{r^{2}-x^{2}}-r\sin\left(\frac{\pi r-1}{2r}\right) &= -\frac{\sqrt{3}}{2}x+\frac{\sqrt{3}}{4}\\ \sqrt{r^{2}-x^{2}}-r\sin\left(\frac{\pi r-1}{2r}\right) &= \frac{\sqrt{3}}{4}(1-2x)\\ \sqrt{r^{2}-x^{2}} &= \frac{\sqrt{3}}{4}(1-2x)+r\sin\left(\frac{\pi r-1}{2r}\right)\\ r^{2}-x^{2} &= \frac{3}{16}(1-2x)^2+\frac{\sqrt{3}}{2}(1-2x)r\sin\left(\frac{\pi r-1}{2r}\right)+r^2\sin^2\left(\frac{\pi r-1}{2r}\right)\\ r^{2}-x^{2} &= \frac{3}{16}-\frac{3}{8}x+\frac{3}{4}x^2+\frac{\sqrt{3}r}{2}(1-2x)\sin\left(\frac{\pi r-1}{2r}\right)+r^2\sin^2\left(\frac{\pi r-1}{2r}\right)\\ r^2 &= \frac{3}{16}-\frac{3}{8}x+\frac{7}{4}x^2+\frac{\sqrt{3}r}{2}(1-2x)\sin\left(\frac{\pi r-1}{2r}\right)+r^2\frac{1-\cos\left(\frac{\pi r-1}{r}\right)}{2}\\ 0 &= \frac{3}{8}-\frac{3}{4}x+\frac{7}{2}x^2+\sqrt{3}r(1-2x)\sin\left(\frac{\pi r-1}{2r}\right)-r^2-r^2\cos\left(\frac{\pi r-1}{r}\right)\\ \end{align*}

\square

Eliminating Type 2 Curves (hopefully)
#

So far, the previous sections have each had to do with results I have already proven. At the time that I write this, the subject of this section is not yet proven. Therefore, what follows is a hypothesis, rather than a lemma, theorem, or statement.

Question 2
For any Type 2 curve XX, does there exists a set of Type 1 curves that proxy accommodates XX?

This would be a lovely result to find, although it’s easier stated than proven. Let’s start with a simpler hypothesis. Maybe we can use that as a stepping stone.

Question 2.1
Is the curve ZZ proxy accommodated by a set of Type 1 curves?

And in particular, I would sharpen this hypothesis even further:

Question 2.2
Is the curve ZZ proxy accommodated by {L,M,Z}\lbrace L, M, Z’ \rbrace?

To start exploring this, I’ll place an instance of ZZ’ within the cartesian plane at the same location as in its definition, called zz’. As a reminder, the points used here are labeled as follows:

P=(px,py)=(64235,38+28235)P = \left(p_x,p_y\right)=\left(\frac{6-4\sqrt{2}}{3\sqrt{5}},\frac{\sqrt{-38+28\sqrt{2}}}{3\sqrt{5}}\right) Z1=(0,0)Z_1=\left(0,0\right) Z3=(3515,515)Z_3=\left(\frac{3\sqrt{5}}{15},\frac{\sqrt{5}}{15}\right) Z4=(53,0)Z_4=\left(\frac{\sqrt{5}}{3},0\right)

For convenience, we also recall that

Z3=(2515,515)Z_3=\left(\frac{2\sqrt{5}}{15},\frac{-\sqrt{5}}{15}\right)

It is possible, for the purposes of exploring Question 2.2, to expend the effective area of zz’

Lemma 8
Let ω\omega be a

If this shape is part of a union that includes certain other points, it would be possible for ZZ to be included, answering Question 2.2. I’m going to indicate in blue the points which would result in accomodating ZZ. First, it should be clear that including (2515,515)\left(\frac{2\sqrt{5}}{15},\frac{-\sqrt{5}}{15}\right)would allow for ZZ. In addition, there’s an infinite stretch of territory that would include that point by necessity.

It is important to give the precise equations for these lines, as they’ll be built upon moving forward. One boundary of the blue region is the line from PP to Z2Z_2. This is defined with:

y115py+515px25(xpx)+pyy_1\leq\frac{15p_y+\sqrt{5}}{15p_x-2\sqrt{5}}\left(x-p_x\right)+p_y

The other boundary of the blue region is the line from Z2Z_2 to Z4Z_4

y213(x2515)515y_2\leq\frac{1}{3}\left(x-\frac{2\sqrt{5}}{15}\right)-\frac{\sqrt{5}}{15}

Recall that MM includes a circle of radius 1π+2\frac{1}{\pi+2}. Because of this, the center of that circle cannot be located anywhere less than 1π+2\frac{1}{\pi+2} units away from this blue region. So, what is the line consisting of points 1π+2\frac{1}{\pi+2} away from y1y_1? The vector with a slope that is the opposite recipricoal of the slope of y1y_1 is:

v1=[15py+5,15px+25]v_1=\left[15p_y+\sqrt{5},-15p_x+2\sqrt{5}\right]

The magnitude of this vector is:

v1=(15py+5)2+(15px+25)2=225py2+305py+5+225px2605px+5=225py2+305py+225px2605px+10\begin{align*} |v_1| &= \sqrt{\left(15p_y+\sqrt{5}\right)^2+\left(-15p_x+2\sqrt{5}\right)^2}\\ &= \sqrt{225p_y^2+30\sqrt{5}p_y+5+225p_x^2-60\sqrt{5}p_x+5}\\ &= \sqrt{225p_y^2+30\sqrt{5}p_y+225p_x^2-60\sqrt{5}p_x+10} \end{align*}

The line that lies 1π+2\frac{1}{\pi+2} units away from y1y_1 is therefore:

y15py+515px25(xpx15py+5(π+2)v1)+py15px25(π+2)v1y\leq\frac{15p_y+\sqrt{5}}{15p_x-2\sqrt{5}}\left(x-p_x-\frac{15p_y+\sqrt{5}}{(\pi+2)|v_1|}\right)+p_y-\frac{15p_x-2\sqrt{5}}{(\pi+2)|v_1|}

Similarly, if we define

v2=[1,3]v_2=\left[-1,3\right]

Then the line that lies 1π+2\frac{1}{\pi+2} units away from y1y_1 is:

y13(x2515+1(π+2)10)515+3(π+2)10y\leq\frac{1}{3}\left(x-\frac{2\sqrt{5}}{15}+\frac{1}{(\pi+2)\sqrt{10}}\right) -\frac{\sqrt{5}}{15}+ \frac{3}{(\pi+2)\sqrt{10}}

There will also be a circle of radius 1π+2\frac{1}{\pi+2} around Z2Z_2. I’ll illustrate this “forbidden zone” in black. Let f1f_1 be the name of the boundary of this black region.

Next, I’ll notice that Wherever we place this instance of MM, it must be fully intersecting with zz’. If Z4Z_4 is in the interior of MM, then MM is not fully intersecting with zz’. Similarly, if both Z1Z_1 and PP are in the interior of MM, MM is not fully intersecting with zz’. I’ll illustrate these regions in black as well.

What is the line with the greatest slope passing through both Z1Z_1 and MM? There’s a whole host of ways to solve this question, but I’ve found that this is the quickest one. Let’s define p,qp,q to be center of the circle inside MM, so that our diagram must include all points in the circle defined by

(yq)2+(xp)2=1(2+π)2(y-q)^2+(x-p)^2=\frac{1}{\left(2+\pi\right)^2}

Any line passing through Z1Z_1 has the form y=mxy=mx where mm is the slope. At what locations does this pass through the circle? Substituting this into the previous equation gives:

(mxq)2+(xp)2=1(2+π)2m2x22qmx+q2+x22px+p2=1(2+π)2(m2+1)x2+(2qm2p)x+q2+p21(2+π)2=0\begin{align*} (mx-q)^2+(x-p)^2 &=\frac{1}{\left(2+\pi\right)^2}\\ m^2x^2-2qmx+q^2+x^2-2px+p^2 &=\frac{1}{\left(2+\pi\right)^2}\\ (m^2+1)x^2+(-2qm-2p)x+q^2+p^2 -\frac{1}{\left(2+\pi\right)^2}&=0 \end{align*}

The line is tangent to the circle when this equation has only one solution. Using the discriminant gives:

(2qm2p)24(m2+1)(q2+p21(2+π)2)=04q2m2+8qpm+4p24(m2+1)(1(2+π)2q2p2)=0q2m2+2qpm+p2(m2+1)(q2+p21(2+π)2)=0(q2+1(2+π)2q2p2)m2+2qpm+p2+(1(2+π)2q2p2)=0(1(2+π)2p2)m2+2qpm+1(2+π)2q2=0\begin{align*} (-2qm-2p)^2-4(m^2+1)\left(q^2+p^2 -\frac{1}{\left(2+\pi\right)^2}\right) &= 0\\ 4q^2m^2+8qpm+4p^2-4(m^2+1)\left(\frac{1}{\left(2+\pi\right)^2}-q^2-p^2\right) &= 0\\ q^{2}m^{2}+2qpm+p^{2}-(m^2+1)\left(q^2+p^2-\frac{1}{\left(2+\pi\right)^{2}}\right) &= 0\\ \left(q^2+\frac{1}{\left(2+\pi\right)^2}-q^2-p^2\right)m^2+2qpm+p^2+\left(\frac{1}{\left(2+\pi\right)^2}-q^2-p^2\right) &= 0\\ \left(\frac{1}{\left(2+\pi\right)^2}-p^2\right)m^2+2qpm+\frac{1}{\left(2+\pi\right)^2}-q^2 &= 0 \end{align*}

Therefore, the possible slopes are given by

m=2qp±(2qp)24(1(2+π)2p2)(1(2+π)2q2)2(1(2+π)2p2)m=qp±q2p2(1(2+π)2p2)(1(2+π)2q2)1(2+π)2p2m=qp±q2p2(1(2+π)4p2+q2(2+π)2+p2q2)1(2+π)2p2m=qp±q2p21(2+π)4+p2+q2(2+π)2p2q21(2+π)2p2m=qp±p2+q2(2+π)21(2+π)41(2+π)2p2m=qp±12+πp2+q21(2+π)21(2+π)2p2m=qp±12+πp2+q21(2+π)2p21(2+π)2\begin{align*} m &=\frac{-2qp\pm\sqrt{\left(2qp\right)^2-4\left(\frac{1}{\left(2+\pi\right)^2}-p^2\right)\left(\frac{1}{\left(2+\pi\right)^2}-q^2\right)}}{2\left(\frac{1}{\left(2+\pi\right)^2}-p^2\right)}\\ m &=\frac{-qp\pm\sqrt{q^2p^2-\left(\frac{1}{\left(2+\pi\right)^2}-p^2\right)\left(\frac{1}{\left(2+\pi\right)^2}-q^2\right)}}{\frac{1}{\left(2+\pi\right)^2}-p^2}\\ m &=\frac{-qp\pm\sqrt{q^2p^2-\left(\frac{1}{\left(2+\pi\right)^4}-\frac{p^2+q^2}{\left(2+\pi\right)^2}+p^2q^2\right)}}{\frac{1}{\left(2+\pi\right)^2}-p^2}\\ m &=\frac{-qp\pm\sqrt{q^2p^2-\frac{1}{\left(2+\pi\right)^4}+\frac{p^2+q^2}{\left(2+\pi\right)^2}-p^2q^2}}{\frac{1}{\left(2+\pi\right)^2}-p^2}\\ m &=\frac{-qp\pm\sqrt{\frac{p^2+q^2}{\left(2+\pi\right)^2}-\frac{1}{\left(2+\pi\right)^4}}}{\frac{1}{\left(2+\pi\right)^2}-p^2}\\ m &=\frac{-qp\pm\frac{1}{2+\pi}\sqrt{p^2+q^2-\frac{1}{\left(2+\pi\right)^2}}}{\frac{1}{\left(2+\pi\right)^2}-p^2}\\ m &=\frac{qp\pm\frac{1}{2+\pi}\sqrt{p^2+q^2-\frac{1}{\left(2+\pi\right)^2}}}{p^2-\frac{1}{\left(2+\pi\right)^2}}\\ \end{align*}

Because of this, we can use the following equation to give an extra side to zz’

y=qp+12+πp2+q21(2+π)2p21(2+π)2xy=\frac{qp+\frac{1}{2+\pi}\sqrt{p^2+q^2-\frac{1}{\left(2+\pi\right)^2}}}{p^2-\frac{1}{\left(2+\pi\right)^{2}}}x

This is shown here:

By the same token, we can draw a line from Z3Z_3 to a point on MM that it is tangent to. We can take our previous equation, and replace pp with p55p-\frac{\sqrt{5}}{5}, while replacing qq with q515q-\frac{\sqrt{5}}{15}. This equation would be:

y515=(q515)(p55)12+π(p55)2+(q515)21(2+π)2(p55)21(2+π)2(x55)y-\frac{\sqrt{5}}{15}=\frac{\left(q-\frac{\sqrt{5}}{15}\right)\left(p-\frac{\sqrt{5}}{5}\right)-\frac{1}{2+\pi}\sqrt{\left(p-\frac{\sqrt{5}}{5}\right)^2+\left(q-\frac{\sqrt{5}}{15}\right)^2-\frac{1}{\left(2+\pi\right)^2}}}{\left(p-\frac{\sqrt{5}}{5}\right)^2-\frac{1}{\left(2+\pi\right)^2}}\left(x-\frac{\sqrt{5}}{5}\right)

This is shown here:

What is the lowest possible slope of the line tangent to MM passing through Z1Z_1? We may place the center of MM at the point where f1f_1 intersects the circle around Z4Z_4. This point is about (0.57391,0.0918346)\left(0.57391,0.0918346\right). Plugging these points in for pp and qq gives a particular slope for that line. We will define this with m1m_1. Writing out the precise value of m1m_1 would time a very long time, so sufice it to say that m10.5461546m_1\approx 0.5461546.

Similarly, I would define m2m_2 as the minimal slope of the line tangent to MM passing through Z3Z_3. Placing the center of MM at the point where f1f_1 intersects the circle around PP would mean setting p0.115614p\approx 0.115614, while q0.004593q\approx -0.004593. If MM is placed at this point, then m20.127902662317m_2\approx-0.127902662317.

Now, graph the two lines

y=m1xy=m_1x y515=m2(x55)y-\frac{\sqrt{5}}{15}=m_2\left(x-\frac{\sqrt{5}}{5}\right)

These two lines intersect at a point that is approximately (0.306014,0.167131)(0.306014,0.167131). Regardless of where MM is located, this point must be included in the convex hull of MM and zz’. Additionally, I’ve been rounding down, meaning that this point really is included, not just one close to it. This extended version of zz’ is shown below:

Building a taxonomy of type 2 curves
#

I’ve been thinking a lot about ZZ curves and their variants. I would like to build a framework for thinking about simple type 2 curves. The basic ZZ curve consists of four points:

Z1=(0,0)Z_1=\left(0,0\right) Z2=(13,0)Z_2=\left(\frac{1}{3},0\right) Z3=(13,13)Z_3=\left(\frac{1}{3},\frac{1}{3}\right) Z4=(23,13)Z_4=\left(\frac{2}{3},\frac{1}{3}\right)

How might these points be different for other curves? Let’s consider only the unit length 3-chains with parallel end segments. If the basic ZZ curve is given a variable height α\alpha, its points may be written as:

Z1=(0,0)Z_1=\left(0,0\right) Z2=(1α2,0)Z_2=\left(\frac{1-\alpha}{2},0\right) Z3=(1α2,α)Z_3=\left(\frac{1-\alpha}{2},\alpha\right) Z4=(1α,α)Z_4=\left(1-\alpha,\alpha\right)

It is also possible that the first segment may be lengthened by a distance of β\beta. Due to the ability to rotate the shape so that the first segment is longer, we may assume β>0\beta > 0. Then the points of this curve are:

Z1=(0,0)Z_1=\left(0,0\right) Z2=(1α2+β,0)Z_2=\left(\frac{1-\alpha}{2}+\beta,0\right) Z3=(1α2+β,α)Z_3=\left(\frac{1-\alpha}{2}+\beta,\alpha\right) Z4=(1α,α)Z_4=\left(1-\alpha,\alpha\right)

Finally, the second segment of this curve does not have to be parallel to the other segments. Let’s shift the third point of this curve to the left or right by a distance of γ\gamma, while letting the fourth point exist where it has to be to maintain a length of 1. Therefore, the length of the second segment is:

α2+γ2\sqrt{\alpha^2+\gamma^2}

Finally, the locations of the points are:

Z1=(0,0)Z_1=\left(0,0\right) Z2=(1α2+γ22+β,0)Z_2=\left(\frac{1-\sqrt{\alpha^2+\gamma^2}}{2}+\beta,0\right) Z3=(1α2+γ22+β+γ,α)Z_3=\left(\frac{1-\sqrt{\alpha^2+\gamma^2}}{2}+\beta+\gamma,\alpha\right) Z4=(1α2+γ2+γ,α)Z_4=\left(1-\sqrt{\alpha^2+\gamma^2}+\gamma,\alpha\right)

Then, define Z(α,β,γ)Z(\alpha,\beta,\gamma) to be the curve defined with these points.

I’m going to go on a brief tangent here, and it’ll be clear soon why this is important. Consider the possible elements of [L,M]\left[L,M\right] with MM and LL intersecting. The curve MM has a minimum radus of 1π+2\frac{1}{\pi+2}, but it has a maximum radius of 2π+2\frac{\sqrt{2}}{\pi+2}. Let’s locate LL on [0,0],[0,1]\left\langle[0,0],[0,1]\right\rangle. Without loss of generality, let’s assume MM is centered above the xx-axis, and so must have a center with a yy-coordinate less than 2π+2\frac{\sqrt{2}}{\pi+2}. Additionally, there must not be a point more than 1 unit away from either endpoint of LL. The possible locations for the center point of MM is shown here in black:

The convex hull of these two shapes is shown here in red:

This is a nifty little framework here. Let’s consider elements of Z(α,0,0)Z(\alpha,0,0). These are curves defined with:

Z1=(0,0)Z_1=\left(0,0\right) Z2=(1α2,0)Z_2=\left(\frac{1-\alpha}{2},0\right) Z3=(1α2,α)Z_3=\left(\frac{1-\alpha}{2},\alpha\right) Z4=(1α,α)Z_4=\left(1-\alpha,\alpha\right)

An example of these points is shown:

Noteably, this particular element of [L,M]\left[L,M\right] accomodates Z(α,0,0)Z(\alpha,0,0) as shown! What values of α\alpha can be accommodated by any element of [L,M]\left[L,M\right]?

We’ll let (a,b)\left(a,b\right) be the center of our instance of MM. For a given value of α\alpha, which values of aa and bb accomodate Z4Z_4? Assuming ZZ is located as in the example, Z4Z_4 lies on the line y=x+1y=-x+1, while Z3Z_3 lies on the line y=2x+1y=-2x+1. The line defining the top left boundary of the red region is given by:

y=ba+u1vxy=\frac{ba+u_{1}}{v}x

Where v=(a2r2)v=\left(a^{2}-r^{2}\right) and u1=rb2+vu_{1}=r\sqrt{b^{2}+v}. Let’s assume that the slope of this line is at a minimum, meaning that a=1ra=1-r and b=0b=0. In that case, the line defining the top left boundary is:

y=rvvx=rvx=r12rxy=\frac{r\sqrt{v}}{v}x = \frac{r}{\sqrt{v}}x= \frac{r}{\sqrt{1-2r}}x

This means that Z3Z_3 is only just accomodated when these lines intersect at

2x+1=r12rx1=(2+r12r)x1=(212r12r+r12r)x1=(r+212r12r)xx=12rr+212r\begin{align*} -2x+1 &= \frac{r}{\sqrt{1-2r}}x\\ 1 &= \left(2+\frac{r}{\sqrt{1-2r}}\right)x\\ 1 &= \left(\frac{2{\sqrt{1-2r}}}{\sqrt{1-2r}}+\frac{r}{\sqrt{1-2r}}\right)x\\ 1 &= \left(\frac{r+2{\sqrt{1-2r}}}{\sqrt{1-2r}}\right)x\\ x &= \frac{\sqrt{1-2r}}{r+2{\sqrt{1-2r}}}\\ \end{align*}

At this point, the yy-coordinate is given by:

212rr+212r+1=212rr+212r+r+212rr+212r=212r+r+212rr+212r=rr+212r\begin{align*} \frac{-2\sqrt{1-2r}}{r+2{\sqrt{1-2r}}}+1 &= \frac{-2\sqrt{1-2r}}{r+2{\sqrt{1-2r}}}+\frac{r+2{\sqrt{1-2r}}}{r+2{\sqrt{1-2r}}}\\ &= \frac{-2\sqrt{1-2r}+r+2{\sqrt{1-2r}}}{r+2{\sqrt{1-2r}}}\\ &= \frac{r}{r+2{\sqrt{1-2r}}}\\ \end{align*}

Therefore, Z3Z_3 is accomodated whenever α<rr+212r\alpha < \frac{r}{r+2{\sqrt{1-2r}}}. When is Z4Z_4 accomodated? The top left boundary of the red zone is given by

y=bab+u2p(x1)y=\frac{b-ab+u_2}{-p}\left(x-1\right)

Where p=v2a+1p=v-2a+1 and u2=rb2+pu_2=r\sqrt{b^2+p}. Because Z4Z_4 is located on y=x+1y=-x+1, Z4Z_4 is accomodated whenever:

x+1<bab+u2p(x1)-x+1<\frac{b-ab+u_2}{-p}\left(x-1\right)

Rather than go through all the effort of solving this equation, I’ll simply point out that the lowest possible location for (a,b)\left(a,b\right) to be while accomodating Z4Z_4 is the one where MM is tangent to y=x+1y=-x+1. This equation is:

y=x+12ry=-x+1-\sqrt{2}r

The reduced area for (a,b)(a,b) is shown below:

Next, let’s consider rotating ZZ so that Z2Z_2 is located on the xx-axis and Z4Z_4 is located at (1,0)(1,0). The distance between Z2Z_2 and Z4Z_4 is given by:

α2+(1α2)2=α2+12α+α24=12α+5α24=12α+5α22\begin{align*} \sqrt{\alpha^2+\left(\frac{1-\alpha}{2}\right)^2} &= \sqrt{\alpha^2+\frac{1-2\alpha+\alpha^2}{4}}\\ &= \sqrt{\frac{1-2\alpha+5\alpha^2}{4}}\\ &= \frac{\sqrt{1-2\alpha+5\alpha^2}}{2}\\ \end{align*}

This places Z2Z_2 at (112α+5α22,0)\left(1-\frac{\sqrt{1-2\alpha+5\alpha^2}}{2},0\right). To locate Z3Z_3, we’ll find the slopes from Z3Z_3 to Z2Z_2 and Z4Z_4, and set them perpendicular to each other:

1xy=yx1+12α+5α22(1x)(x1+12α+5α22)=y2\begin{align*} \frac{1-x}{y}&= \frac{y}{x-1+\frac{\sqrt{1-2\alpha+5\alpha^{2}}}{2}}\\ (1-x)\left(x-1+\frac{\sqrt{1-2\alpha+5\alpha^{2}}}{2}\right)&= y^2\\ \end{align*}

We also have that it must be a distance of 1α2\frac{1-\alpha}{2} from Z4Z_4. This gives the limitation that

y2+(x1)2=(1α2)2y2=(1α2)2(x1)2\begin{align*} y^2 +(x-1)^2 &= \left(\frac{1-\alpha}{2}\right)^2\\ y^2 &= \left(\frac{1-\alpha}{2}\right)^2-(x-1)^2\\ \end{align*}

Next, we may combine these and solve for xx.

(1x)(x1+12α+5α22)=(1α2)2(x1)2(x1)((x1)+12α+5α22)=(1α2)2(x1)2(1x)(12α+5α22)=(1α2)2(1x)(12α+5α2)=(1α)22(1x)=(1α)2212α+5α21x=(1α)2212α+5α2x=1(1α)2212α+5α2\begin{align*} (1-x)\left(x-1+\frac{\sqrt{1-2\alpha+5\alpha^{2}}}{2}\right)&= \left(\frac{1-\alpha}{2}\right)^2-(x-1)^2\\ -(x-1)\left((x-1)+\frac{\sqrt{1-2\alpha+5\alpha^{2}}}{2}\right)&= \left(\frac{1-\alpha}{2}\right)^2-(x-1)^2\\ (1-x)\left(\frac{\sqrt{1-2\alpha+5\alpha^{2}}}{2}\right)&= \left(\frac{1-\alpha}{2}\right)^2\\ (1-x)\left(\sqrt{1-2\alpha+5\alpha^{2}}\right)&= \frac{\left(1-\alpha\right)^2}{2}\\ (1-x)&= \frac{\left(1-\alpha\right)^2}{2\sqrt{1-2\alpha+5\alpha^{2}}}\\ 1-x&= \frac{\left(1-\alpha\right)^2}{2\sqrt{1-2\alpha+5\alpha^{2}}}\\ x&= 1-\frac{\left(1-\alpha\right)^2}{2\sqrt{1-2\alpha+5\alpha^{2}}}\\ \end{align*}

Plugging this in lets us find yy as well:

y2=(1α2)2((1α)2212α+5α2)2y2=(1α)24(1α)44(12α+5α2)y2=(1α)24(1(1α)2(12α+5α2))y2=(1α)24(12α+5α212α+5α212α+α212α+5α2)y2=(1α)24(4α212α+5α2)y2=α2(1α)212α+5α2y=α(1α)12α+5α2\begin{align*} y^2 &= \left(\frac{1-\alpha}{2}\right)^2-\left(\frac{\left(1-\alpha\right)^2}{2\sqrt{1-2\alpha+5\alpha^{2}}}\right)^2\\ y^2 &= \frac{\left(1-\alpha\right)^2}{4}-\frac{\left(1-\alpha\right)^4}{4\left(1-2\alpha+5\alpha^2\right)}\\ y^2 &= \frac{\left(1-\alpha\right)^2}{4}\left(1-\frac{\left(1-\alpha\right)^2}{\left(1-2\alpha+5\alpha^2\right)}\right)\\ y^2 &= \frac{\left(1-\alpha\right)^2}{4}\left(\frac{1-2\alpha+5\alpha^2}{1-2\alpha+5\alpha^2}-\frac{1-2\alpha + \alpha^2}{1-2\alpha+5\alpha^2}\right)\\ y^2 &= \frac{\left(1-\alpha\right)^2}{4}\left(\frac{4\alpha^2}{1-2\alpha+5\alpha^2}\right)\\ y^2 &= \frac{\alpha^2\left(1-\alpha\right)^2}{1-2\alpha+5\alpha^2}\\ y &= \frac{\alpha(1-\alpha)}{\sqrt{1-2\alpha+5\alpha^2}}\\ \end{align*}

Therefore, Z3=(1(1α)2212α+5α2,α(1α)12α+5α2)Z_3=\left(1-\frac{\left(1-\alpha\right)^2}{2\sqrt{1-2\alpha+5\alpha^{2}}},\frac{\alpha(1-\alpha)}{\sqrt{1-2\alpha+5\alpha^2}}\right)

Finally, finding Z1Z_1 is fairly straightforward:

Z1=(1(1α)2212α+5α212α+5α22,α(1α)12α+5α2)Z1=(112α+α2212α+5α212α+5α2212α+5α2,α(1α)12α+5α2)Z1=(1+2+4α6α2212α+5α2,α(1α)12α+5α2)Z1=(112α3α212α+5α2,α(1α)12α+5α2)\begin{align*} Z_1 &= \left(1-\frac{\left(1-\alpha\right)^2}{2\sqrt{1-2\alpha+5\alpha^{2}}}-\frac{\sqrt{1-2\alpha+5\alpha^{2}}}{2},\frac{\alpha(1-\alpha)}{\sqrt{1-2\alpha+5\alpha^2}}\right)\\ Z_1 &= \left(1-\frac{1-2\alpha+\alpha^2}{2\sqrt{1-2\alpha+5\alpha^{2}}}-\frac{1-2\alpha+5\alpha^{2}}{2\sqrt{1-2\alpha+5\alpha^{2}}},\frac{\alpha(1-\alpha)}{\sqrt{1-2\alpha+5\alpha^2}}\right)\\ Z_1 &= \left(1+\frac{-2+4\alpha-6\alpha^2}{2\sqrt{1-2\alpha+5\alpha^{2}}},\frac{\alpha(1-\alpha)}{\sqrt{1-2\alpha+5\alpha^2}}\right)\\ Z_1 &= \left(1-\frac{1-2\alpha-3\alpha^2}{\sqrt{1-2\alpha+5\alpha^{2}}},\frac{\alpha(1-\alpha)}{\sqrt{1-2\alpha+5\alpha^2}}\right)\\ \end{align*}

Finally, Here’s that rotated version displayed:

The line containing Z1Z_1 is given by:

y=α(1α)12α+5α21+2α3α2xy=\frac{\alpha\left(1-\alpha\right)}{\sqrt{1-2\alpha+5\alpha^{2}}-1+2\alpha-3\alpha^{2}}x

Once again, the parallel line that is located a distance of rr under the previous one is:

y+rm2+1=m(xmrm2+1)y+\frac{r}{\sqrt{m^2+1}}=m\left(x-\frac{mr}{\sqrt{m^2+1}}\right)

where mm is the slope from the previous equation. Because the center of MM must be located underneath this, the total possible area for (a,b)(a,b) is reduced even further.

Next, let’s rotate ZZ to have both end points on the xx axis. The distance between Z1Z_1 and Z4Z_4 is given by:

α2+(1α)2=α2+12α+α2=2α22α+1\sqrt{\alpha^2+(1-\alpha)^2}=\sqrt{\alpha^2+1-2\alpha+\alpha^2}=\sqrt{2\alpha^2-2\alpha+1}

As before, the distance from Z1Z_1 to Z2Z_2 is 1α2\frac{1-\alpha}{2}, while the distance from Z4Z_4 to Z2Z_2 is 12α+5α22\frac{\sqrt{1-2\alpha+5\alpha^2}}{2}. Therefore, if Z1Z_1 is located at the origin, then the xx and yy-coordinates of Z2Z_2 are restricted by:

y2+x2=(1α2)2y^2+x^2=\left(\frac{1-\alpha}{2}\right)^2 y2+(x2α22α+1)2=(12α+5α22)2y^2+\left(x-\sqrt{2\alpha^2-2\alpha+1}\right)^2=\left(\frac{\sqrt{1-2\alpha+5\alpha^2}}{2}\right)^2 y<0y<0

Setting y2y^2 equal to each other and solving for xx gives:

(1α2)2x2=(12α+5α22)2(x2α22α+1)2(1α)24x2=12α+5α24(x22x2α22α+1+2α22α+1)(1α)24=12α+5α24+2x2α22α+12α2+2α10=4α24+2x2α22α+12α2+2α10=2x2α22α+1α2+2α1(α1)2=2x2α22α+1x=(α1)222α22α+1\begin{align*} \left(\frac{1-\alpha}{2}\right)^{2}-x^{2} &= \left(\frac{\sqrt{1-2\alpha+5\alpha^{2}}}{2}\right)^{2}-\left(x-\sqrt{2\alpha^{2}-2\alpha+1}\right)^{2}\\ \frac{\left(1-\alpha\right)^{2}}{4}-x^{2} &= \frac{1-2\alpha+5\alpha^{2}}{4}-\left(x^{2}-2x\sqrt{2\alpha^{2}-2\alpha+1}+2\alpha^{2}-2\alpha+1\right)\\ \frac{\left(1-\alpha\right)^{2}}{4} &= \frac{1-2\alpha+5\alpha^{2}}{4}+2x\sqrt{2\alpha^{2}-2\alpha+1}-2\alpha^{2}+2\alpha-1\\ 0 &= \frac{4\alpha^{2}}{4}+2x\sqrt{2\alpha^{2}-2\alpha+1}-2\alpha^{2}+2\alpha-1\\ 0 &= 2x\sqrt{2\alpha^{2}-2\alpha+1}-\alpha^{2}+2\alpha-1\\ \left(\alpha-1\right)^{2} &= 2x\sqrt{2\alpha^{2}-2\alpha+1}\\ x &= \frac{\left(\alpha-1\right)^{2}}{2\sqrt{2\alpha^{2}-2\alpha+1}}\\ \end{align*}

Then, solving for yy gives:

y2=(α1)24(α1)44(2α22α+1)y2=(α1)2((2α22α+1)4(2α22α+1)(α1)24(2α22α+1))y2=(α1)2((2α22α+1)(α1)24(2α22α+1))y2=(α1)2(α24(2α22α+1))y=α(α1)22α22α+1\begin{align*} y^{2} &= \frac{\left(\alpha-1\right)^{2}}{4}-\frac{\left(\alpha-1\right)^{4}}{4\left(2\alpha^{2}-2\alpha+1\right)}\\ y^{2} &= \left(\alpha-1\right)^{2}\left(\frac{\left(2\alpha^{2}-2\alpha+1\right)}{4\left(2\alpha^{2}-2\alpha+1\right)}-\frac{\left(\alpha-1\right)^{2}}{4\left(2\alpha^{2}-2\alpha+1\right)}\right)\\ y^{2} &= \left(\alpha-1\right)^{2}\left(\frac{\left(2\alpha^{2}-2\alpha+1\right)-\left(\alpha-1\right)^{2}}{4\left(2\alpha^{2}-2\alpha+1\right)}\right)\\ y^{2} &= \left(\alpha-1\right)^{2}\left(\frac{\alpha^{2}}{4\left(2\alpha^{2}-2\alpha+1\right)}\right)\\ y &= \frac{\alpha\left(\alpha-1\right)}{2\sqrt{2\alpha^{2}-2\alpha+1}}\\ \end{align*}

Finally, the points within ZZ are given by:

Z1=(0,0)Z2=((α1)222α22α+1,α(α1)22α22α+1)Z3=(3α22α+122α22α+1,α(1α)22α22α+1)Z4=(2α22α+1,0)\begin{align*} Z_1 &= \left(0,0\right)\\ Z_2 &= \left(\frac{\left(\alpha-1\right)^{2}}{2\sqrt{2\alpha^{2}-2\alpha+1}},\frac{\alpha\left(\alpha-1\right)}{2\sqrt{2\alpha^{2}-2\alpha+1}}\right)\\ Z_3 &= \left(\frac{3\alpha^{2}-2\alpha+1}{2\sqrt{2\alpha^{2}-2\alpha+1}},\frac{\alpha\left(1-\alpha\right)}{2\sqrt{2\alpha^{2}-2\alpha+1}}\right)\\ Z_4 &= \left(\sqrt{2\alpha^{2}-2\alpha+1},0\right)\\ \end{align*}

Up next, I want to shift this instance of ZZ upwards as much as possible. As mentioned, the slope of the upper left edge of the red zone is:

ba+u1v\frac{ba+u_{1}}{v}

Let’s say we want to shift ZZ up by y1y_1 and to the right by x1x_1. If it’s up as high as possible, we have that

y1=ba+u1vx1y_1=\frac{ba+u_{1}}{v} x_1

The location of Z4Z_4 is now (2α22α+1+x1,y1)\left(\sqrt{2\alpha^{2}-2\alpha+1} + x_1, y_1\right)

And this must lie on or below the red zone’s upper right boundary of

y=bab+u2p(x1)y = \frac{b-ab+u_{2}}{-p}\left(x-1\right)

plugging in the location of Z4Z_4 gives a requirement of

y1=bab+u2p(2α22α+1+x11)y_1 = \frac{b-ab+u_{2}}{-p}\left(\sqrt{2\alpha^{2}-2\alpha+1} + x_1-1\right)

And plugging in the requirement from a few lines ago gives:

ab+u1vx1=bab+u2p(2α22α+1+x11)(ab+u1v+bab+u2p)x1=bab+u2p(2α22α+1+1)(abp+u1pvp+bvabv+u2vvp)x1=bab+u2p(12α22α+1)(abp+u1p+bvabv+u2vv)x1=(bab+u2)(12α22α+1)x1=v(q+u2)(12α22α+1)abp+u1p+qv+u2v\begin{align*} \frac{ab+u_{1}}{v} x_1 &= \frac{b-ab+u_{2}}{-p}\left(\sqrt{2\alpha^{2}-2\alpha+1} + x_1-1\right)\\ \left(\frac{ab+u_{1}}{v}+\frac{b-ab+u_{2}}{p}\right) x_1 &= \frac{b-ab+u_{2}}{-p}\left(\sqrt{2\alpha^{2}-2\alpha+1} + -1\right)\\ \left(\frac{abp+u_1p}{vp}+\frac{bv-abv+u_2v}{vp}\right) x_1 &= \frac{b-ab+u_2}{p}\left(1-\sqrt{2\alpha^2-2\alpha+1}\right)\\ \left(\frac{abp+u_1p+bv-abv+u_2v}{v}\right) x_1 &= \left(b-ab+u_2\right)\left(1-\sqrt{2\alpha^2-2\alpha+1}\right)\\ x_1 &= \frac{v\left(q+u_2\right)\left(1-\sqrt{2\alpha^2-2\alpha+1}\right)}{abp+u_1p+qv+u_2v}\\ \end{align*}

And then we also get that:

$$y_1 &= \frac{(ba+u_1)\left(q+u_2\right)\left(1-\sqrt{2\alpha^2-2\alpha+1}\right)}{abp+u_1p+qv+u_2v}$$

This is shown below:

When does MM contain Z2Z_2? This is the only point that could possibly be excluded. Well, the bottom half of MM is defined by:

y=r2(xa)2+by=-\sqrt{r^2-(x-a)^2}+b

The location of Z2Z_2 is given by:

((α1)222α22α+1+x1,α(α1)22α22α+1+y1)\left(\frac{\left(\alpha-1\right)^{2}}{2\sqrt{2\alpha^{2}-2\alpha+1}}+x_1,\frac{\alpha\left(\alpha-1\right)}{2\sqrt{2\alpha^{2}-2\alpha+1}}+y_1\right)

Therefore, Z2Z_2 is inside MM whenever:

α(α1)22α22α+1+y1>r2((α1)222α22α+1+x1a)2+b\frac{\alpha\left(\alpha-1\right)}{2\sqrt{2\alpha^{2}-2\alpha+1}}+y_1>-\sqrt{r^2-\left(\frac{\left(\alpha-1\right)^{2}}{2\sqrt{2\alpha^{2}-2\alpha+1}}+x_1-a\right)^2}+b

Trying to simplify this mess would be a disaster. Instead, I’ll just