# Euclidean distance


Using the Pythagorean theorem to compute two-dimensional Euclidean distance

In mathematics, the Euclidean distance between two points in Euclidean space is a number, the length of a line segment between the two points. It can be calculated from the Cartesian coordinates of the points using the Pythagorean theorem, and is occasionally called the Pythagorean distance. These names come from the ancient Greek mathematicians Euclid and Pythagoras, but Euclid did not represent distances as numbers, and the connection from the Pythagorean theorem to distance calculation was not made until the 17th century.

The distance between two objects that are not points is usually defined to be the smallest distance between any two points from the two objects. Formulas are known for computing distances between different types of objects, such as the distance from a point to a line. In advanced mathematics, the concept of distance has been generalized to abstract metric spaces, and distances other than the Euclidean distance have been studied. The square of the Euclidean distance is not a metric, but is convenient for many applications in statistics and optimization.

## Distance formulas

### One dimension

The distance between any two points on the real line is the absolute value of the numerical difference of their coordinates. Thus if ${\displaystyle p}$ and ${\displaystyle q}$ are two points on the real line, then the distance between them is given by:[1]

${\displaystyle d(p,q)=|p-q|.}$

A more complicated formula, giving the same value, but generalizing more readily to higher dimensions, is:[1]

${\displaystyle d(p,q)={\sqrt {(p-q)^{2}}}.}$

In this formula, squaring and then taking the square root leaves any positive number unchanged, but replaces any negative number by its absolute value.[1]

### Two dimensions

In the Euclidean plane, let point ${\displaystyle p}$ have Cartesian coordinates ${\displaystyle (p_{1},p_{2})}$ and let point ${\displaystyle q}$ have coordinates ${\displaystyle (q_{1},q_{2})}$. Then the distance between ${\displaystyle p}$ and ${\displaystyle q}$ is given by:[2]

${\displaystyle d(p,q)={\sqrt {(p_{1}-q_{1})^{2}+(p_{2}-q_{2})^{2}}}.}$

This can be seen by applying the Pythagorean theorem to a right triangle with horizontal and vertical sides, having the line segment from ${\displaystyle p}$ to ${\displaystyle q}$ as its hypotenuse. The two squared formulas inside the square root give the areas of squares on the horizontal and vertical sides, and the outer square root converts the area of the square on the hypotenuse into the length of the hypotenuse.
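The two-dimensional formula can be sketched in Python as follows. This is a minimal illustration; the function name `euclidean_distance_2d` and the sample points are chosen here and do not come from the cited sources.

```python
import math

def euclidean_distance_2d(p, q):
    """Distance between points p = (p1, p2) and q = (q1, q2) in the plane."""
    dx = p[0] - q[0]  # signed length of the horizontal side of the right triangle
    dy = p[1] - q[1]  # signed length of the vertical side of the right triangle
    # Sum the areas of the two squares, then take the root to get the hypotenuse.
    return math.sqrt(dx * dx + dy * dy)

# The sides have lengths 3 and 4, so the hypotenuse has length 5.
print(euclidean_distance_2d((1, 2), (4, 6)))  # prints 5.0
```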

It is also possible to compute the distance for points given by polar coordinates. If the polar coordinates of ${\displaystyle p}$ are ${\displaystyle (r,\theta )}$ and the polar coordinates of ${\displaystyle q}$ are ${\displaystyle (s,\psi )}$, then their distance is[2]

${\displaystyle d(p,q)={\sqrt {r^{2}+s^{2}-2rs\cos(\theta -\psi )}}.}$
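As a rough check of the polar formula, the sketch below compares it with the result of converting both points to Cartesian coordinates first. The helper names are hypothetical, angles are assumed to be in radians, and `math.dist` requires Python 3.8 or later.

```python
import math

def polar_distance(r, theta, s, psi):
    """Distance between points with polar coordinates (r, theta) and (s, psi)."""
    squared = r * r + s * s - 2 * r * s * math.cos(theta - psi)
    # max() guards against a tiny negative value caused by rounding.
    return math.sqrt(max(0.0, squared))

def polar_to_cartesian(r, theta):
    """Convert polar coordinates to Cartesian coordinates (x, y)."""
    return (r * math.cos(theta), r * math.sin(theta))

p = (2.0, math.pi / 6)  # (r, theta)
q = (3.0, math.pi / 2)  # (s, psi)

direct = polar_distance(*p, *q)
via_cartesian = math.dist(polar_to_cartesian(*p), polar_to_cartesian(*q))
print(math.isclose(direct, via_cartesian))  # prints True
```

For these sample points the squared distance is 4 + 9 − 12 cos(π/3) = 7, so both computations give approximately √7.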

When ${\displaystyle p}$ and ${\displaystyle q}$ are expressed as complex numbers in the complex plane, the same formula as for one-dimensional points expressed as real numbers can be used, with the absolute value sign now denoting the modulus of a complex number:[3]

${\displaystyle d(p,q)=|p-q|.}$
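In Python, for instance, the built-in `abs` returns the modulus of a complex number, so the formula can be applied directly. The sample points below are illustrative only.

```python
p = complex(1, 2)  # the point (1, 2) as a complex number
q = complex(4, 6)  # the point (4, 6) as a complex number

# |p - q| is the modulus of the difference, which equals the planar distance.
print(abs(p - q))  # prints 5.0
```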

### Higher dimensions

Deriving the ${\displaystyle n}$-dimensional Euclidean distance formula by repeatedly applying the Pythagorean theorem

In general, for points given by Cartesian coordinates in ${\displaystyle n}$-dimensional Euclidean space, the distance is[4]

${\displaystyle d(p,q)={\sqrt {(p_{1}-q_{1})^{2}+(p_{2}-q_{2})^{2}+\cdots +(p_{i}-q_{i})^{2}+\cdots +(p_{n}-q_{n})^{2}}}.}$
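A direct transcription of the general formula might look like the sketch below. The function name `euclidean_distance` is chosen for illustration; Python's standard library offers `math.dist` (Python 3.8 or later) for the same computation, which is used here only as a cross-check.

```python
import math

def euclidean_distance(p, q):
    """Euclidean distance between two points with the same number of coordinates."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of coordinates")
    # Sum the squared coordinate differences, then take the square root.
    return math.sqrt(sum((pi - qi) ** 2 for pi, qi in zip(p, q)))

p = (1, 2, 3, 4)
q = (3, 4, 5, 6)
print(euclidean_distance(p, q))  # sqrt(4 + 4 + 4 + 4): prints 4.0
print(math.isclose(euclidean_distance(p, q), math.dist(p, q)))  # prints True
```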

### Other objects than points

For pairs of objects that are not both points, the distance can most simply be defined as the smallest distance between any two points from the two objects, although more complicated generalizations from points to sets, such as the Hausdorff distance, are also commonly used.[5] Formulas are known for computing distances between different types of objects, such as the distance from a point to a line.

## Squared Euclidean distance

In many applications, and in particular when comparing distances, it may be more convenient to omit the final square root in the calculation of Euclidean distances. The value resulting from this omission is the square of the Euclidean distance, and is called the squared Euclidean distance.[6] As an equation:

${\displaystyle d^{2}(p,q)=(p_{1}-q_{1})^{2}+(p_{2}-q_{2})^{2}+\cdots +(p_{i}-q_{i})^{2}+\cdots +(p_{n}-q_{n})^{2}.}$
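The use of squared distances for comparison can be sketched as a nearest-point search that never takes a square root. The function names and sample data below are hypothetical and serve only to illustrate the idea.

```python
def squared_distance(p, q):
    """Squared Euclidean distance: the distance formula without the final square root."""
    return sum((pi - qi) ** 2 for pi, qi in zip(p, q))

def nearest(query, candidates):
    """Return the candidate closest to query, comparing squared distances only."""
    return min(candidates, key=lambda c: squared_distance(query, c))

points = [(5, 5), (1, 2), (-3, 0)]
# Squared distances from the origin are 50, 5, and 9 respectively.
print(nearest((0, 0), points))  # prints (1, 2)
```

Because squaring preserves the order of non-negative numbers, the point selected is the same one that a comparison of true distances would select.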

Beyond its application to distance comparison, squared Euclidean distance is of central importance in statistics, where it is used in the method of least squares, a standard method of fitting statistical estimates to data by minimizing the average of the squared distances between observed and estimated values.[7] The addition of squared distances to each other, as is done in least squares fitting, corresponds to an operation on (unsquared) distances called Pythagorean addition.[8] In cluster analysis, squared distances can be used to strengthen the effect of longer distances.[6]
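The correspondence between adding squared distances and Pythagorean addition can be illustrated with Python's `math.hypot`, which computes the Pythagorean sum of its arguments. The values 3 and 4 are arbitrary sample distances.

```python
import math

a, b = 3.0, 4.0  # two (unsquared) distances

print(a * a + b * b)     # sum of the squared distances: prints 25.0
print(math.hypot(a, b))  # Pythagorean sum of the distances: prints 5.0
```

Squaring the Pythagorean sum recovers the sum of the squares.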

Squared Euclidean distance is not a metric, as it does not satisfy the triangle inequality.[9] However, it is a smooth, strictly convex function of the two points, unlike the distance, which is non-smooth for equal points and convex but not strictly convex. The squared distance is thus preferred in optimization theory, since it allows convex analysis to be used. Since squaring is a monotonic function of non-negative values, minimizing squared distance is equivalent to minimizing the Euclidean distance, so the optimization problem is equivalent in terms of either, but easier to solve using squared distance.[10]
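A small worked example, with points chosen here purely for illustration, shows how the triangle inequality fails for squared distances. Take three points on the real line, ${\displaystyle p=0}$, ${\displaystyle m=1}$, and ${\displaystyle q=2}$. Then

${\displaystyle d^{2}(p,q)=4>2=d^{2}(p,m)+d^{2}(m,q),}$

whereas the unsquared distances satisfy ${\displaystyle d(p,q)=2\leq 1+1=d(p,m)+d(m,q)}$.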

The collection of all squared distances between pairs of points from a finite set may be stored in a Euclidean distance matrix.[11] In rational trigonometry, squared Euclidean distance is used because (unlike the Euclidean distance itself) the squared distance between points with rational number coordinates is always rational; in this context it is also called "quadrance".[12]

## Generalizations

In more advanced areas of mathematics, Euclidean space and its distance provide a standard example of a metric space; in this setting the distance is called the Euclidean metric. Euclidean distance geometry studies properties of Euclidean geometry in terms of its distances, and properties of sets of distances that can be used to determine whether they come from the Euclidean metric.[13] When viewing Euclidean space as a vector space, its distance is associated with a norm called the Euclidean norm, defined as the distance of each vector from the origin. One of the important properties of this norm, relative to other norms, is that it remains unchanged under arbitrary rotations of space around the origin.[14] By Dvoretzky's theorem, every finite-dimensional normed vector space has a high-dimensional subspace on which the norm is approximately Euclidean; the Euclidean norm is the only norm with this property.[15] It can be extended to infinite-dimensional vector spaces as the L2 norm or L2 distance.[16]

Other common distances on Euclidean spaces and low-dimensional vector spaces include:[17]

• Chebyshev distance, which measures distance assuming only the most significant dimension is relevant.
• Manhattan distance, which measures distance following only axis-aligned directions.
• Minkowski distance, a generalization that unifies Euclidean distance, Manhattan distance, and Chebyshev distance.
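The way the Minkowski distance unifies the other three can be sketched as follows, assuming the usual definition as the `order`-th root of the sum of the `order`-th powers of the absolute coordinate differences, with the Chebyshev distance as the limiting case of infinite order. The function and parameter names are illustrative.

```python
import math

def minkowski_distance(p, q, order):
    """Minkowski distance of the given order (order >= 1, or math.inf)."""
    diffs = [abs(pi - qi) for pi, qi in zip(p, q)]
    if math.isinf(order):
        # Limiting case: only the largest coordinate difference matters.
        return max(diffs)
    return sum(d ** order for d in diffs) ** (1 / order)

p, q = (0, 0), (3, 4)
print(minkowski_distance(p, q, 1))         # Manhattan distance: prints 7.0
print(minkowski_distance(p, q, 2))         # Euclidean distance: prints 5.0
print(minkowski_distance(p, q, math.inf))  # Chebyshev distance: prints 4
```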

For points on surfaces in three dimensions, the Euclidean distance should be distinguished from the geodesic distance, the length of a shortest curve that belongs to the surface. In particular, for measuring great-circle distances on the Earth or other near-spherical surfaces, distances that have been used include the haversine distance, which gives great-circle distances between two points on a sphere from their longitudes and latitudes, and Vincenty's formulae, also known as "Vincent distance", for distance on a spheroid.[18]
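A minimal sketch of the haversine computation is given below. It assumes a perfectly spherical Earth with a mean radius of 6371 km (an assumption made here, not a value from the cited source), and it takes latitudes and longitudes in degrees.

```python
import math

EARTH_RADIUS_KM = 6371.0  # assumed mean radius; the real Earth is not a perfect sphere

def haversine_distance(lat1, lon1, lat2, lon2, radius=EARTH_RADIUS_KM):
    """Great-circle distance between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    # min() guards against rounding pushing a slightly above 1.
    return 2 * radius * math.asin(math.sqrt(min(1.0, a)))

# A quarter of the way around the equator, i.e. a quarter of the circumference.
print(haversine_distance(0, 0, 0, 90))
```

The result is a distance measured along the surface of the sphere, which is longer than the straight-line Euclidean distance between the same two points through its interior.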

## History

Euclidean distance is the distance in Euclidean space; both concepts are named after the ancient Greek mathematician Euclid, whose Elements became a standard textbook in geometry for many centuries.[19] Concepts of length and distance are widespread across cultures, can be dated to the earliest surviving "protoliterate" bureaucratic documents from Sumer in the fourth millennium BC (far before Euclid),[20] and have been hypothesized to develop in children earlier than the related concepts of speed and time.[21] But the notion of a distance, as a number defined from two points, does not actually appear in Euclid's Elements. Instead, Euclid approaches this concept implicitly, through the congruence of line segments, through the comparison of lengths of line segments, and through the concept of proportionality.[22]

The Pythagorean theorem is also ancient, but it only took its central role in the measurement of distances with the invention of Cartesian coordinates by René Descartes in 1637.[23] Because of this connection, Euclidean distance is also sometimes called Pythagorean distance.[24] Although accurate measurements of long distances on the Earth's surface, which are not Euclidean, had been studied in many cultures since ancient times (see history of geodesy), the idea that Euclidean distance might not be the only way of measuring distances between points in mathematical spaces came even later, with the 19th-century formulation of non-Euclidean geometry.[25] The definition of the Euclidean norm and Euclidean distance for geometries of more than three dimensions also first appeared in the 19th century, in the work of Augustin-Louis Cauchy.[26]

## References

1. ^ a b c Smith, Karl (2013), Precalculus: A Functional Approach to Graphing and Problem Solving, Jones & Bartlett Publishers, p. 8, ISBN 9780763751777
2. ^ a b Cohen, David (2004), Precalculus: A Problems-Oriented Approach (6th ed.), Cengage Learning, p. 698, ISBN 9780534402129
3. ^ Andreescu, Titu; Andrica, Dorin (2014), "3.1.1 The Distance Between Two Points", Complex Numbers from A to ... Z (2nd ed.), Birkhäuser, pp. 57–58, ISBN 978-0-8176-8415-0
4. ^ Tabak, John (2014), Geometry: The Language of Space and Form, Facts on File math library, Infobase Publishing, p. 150, ISBN 9780816068760
5. ^ Ó Searcóid, Mícheál (2006), "2.7 Distances from Sets to Sets", Metric Spaces, Springer Undergraduate Mathematics Series, Springer, pp. 29–30, ISBN 9781846286278
6. ^ a b Spencer, Neil H. (2013), "5.4.5 Squared Euclidean Distances", Essentials of Multivariate Data Analysis, CRC Press, p. 95, ISBN 9781466584792
7. ^ Randolph, Karen A.; Myers, Laura L. (2013), Basic Statistics in Multivariate Analysis, Pocket Guide to Social Work Research Methods, Oxford University Press, p. 116, ISBN 9780199764044
8. ^ Moler, Cleve; Morrison, Donald (1983), "Replacing Square Roots by Pythagorean Sums" (PDF), IBM Journal of Research and Development, 27 (6): 577–581, CiteSeerX 10.1.1.90.5651, doi:10.1147/rd.276.0577
9. ^ Mielke, Paul W.; Berry, Kenneth J. (2000), "Euclidean distance based permutation methods in atmospheric science", in Brown, Timothy J.; Mielke, Paul W. Jr. (eds.), Statistical Mining and Data Visualization in Atmospheric Sciences, Springer, pp. 7–27, doi:10.1007/978-1-4757-6581-6_2
10. ^ Kaplan, Wilfred (2011), Maxima and Minima with Applications: Practical Optimization and Duality, Wiley Series in Discrete Mathematics and Optimization, 51, John Wiley & Sons, p. 61, ISBN 9781118031049
11. ^ Alfakih, Abdo Y. (2018), Euclidean Distance Matrices and Their Applications in Rigidity Theory, Springer, p. 51, ISBN 9783319978468
12. ^ Henle, Michael (December 2007), "Review of Divine Proportions by N. J. Wildberger", American Mathematical Monthly, 114 (10): 933–937, JSTOR 27642383
13. ^ Liberti, Leo; Lavor, Carlile (2017), Euclidean Distance Geometry: An Introduction, Springer Undergraduate Texts in Mathematics and Technology, Springer, p. xi, ISBN 9783319607924
14. ^ Kopeikin, Sergei; Efroimsky, Michael; Kaplan, George (2011), Relativistic Celestial Mechanics of the Solar System, John Wiley & Sons, p. 106, ISBN 9783527634576
15. ^ Matoušek, Jiří (2002), Lectures on Discrete Geometry, Graduate Texts in Mathematics, Springer, p. 349, ISBN 978-0-387-95373-1
16. ^ Ciarlet, Philippe G. (2013), Linear and Nonlinear Functional Analysis with Applications, Society for Industrial and Applied Mathematics, p. 173, ISBN 9781611972580
17. ^ Klamroth, Kathrin (2002), "Section 1.1: Norms and Metrics", Single-Facility Location Problems with Barriers, Springer Series in Operations Research, Springer, pp. 4–6, doi:10.1007/0-387-22707-5_1
18. ^ Panigrahi, Narayan (2014), "12.2.4 Haversine Formula and 12.2.5 Vincenty's Formula", Computing in Geographic Information Systems, CRC Press, pp. 212–214, ISBN 9781482223149
19. ^ Zhang, Jin (2007), Visualization for Information Retrieval, Springer, ISBN 9783540751489
20. ^ Høyrup, Jens (2018), "Mesopotamian mathematics" (PDF), in Jones, Alexander; Taub, Liba (eds.), The Cambridge History of Science, Volume 1: Ancient Science, Cambridge University Press, pp. 58–72
21. ^ Acredolo, Curt; Schmid, Jeannine (1981), "The understanding of relative speeds, distances, and durations of movement", Developmental Psychology, 17 (4): 490–493, doi:10.1037/0012-1649.17.4.490
22. ^
23. ^ Maor, Eli (2019), The Pythagorean Theorem: A 4,000-Year History, Princeton University Press, p. 133, ISBN 9780691196886
24. ^ Rankin, William C.; Markley, Robert P.; Evans, Selby H. (March 1970), "Pythagorean distance and the judged similarity of schematic stimuli", Perception & Psychophysics, 7 (2): 103–107, doi:10.3758/bf03210143
25. ^ Milnor, John (1982), "Hyperbolic geometry: the first 150 years", Bulletin of the American Mathematical Society, 6 (1): 9–24, doi:10.1090/S0273-0979-1982-14958-8, MR 0634431
26. ^ Ratcliffe, John G. (2019), Foundations of Hyperbolic Manifolds, Graduate Texts in Mathematics, 149 (3rd ed.), Springer, p. 32, ISBN 9783030315979