Why are there no general equations for polynomials with degrees 5 or above? – A Study with Galois Theory

The field of abstract algebra, which explores the fundamental structures and operations underlying mathematics, has played a pivotal role in reshaping the way mathematicians perceive and solve problems. Among the many branches of abstract algebra, Galois Theory stands as a beacon of mathematical elegance and power. Named after the brilliant French mathematician Évariste Galois (25 October 1811 – 31 May 1832), who tragically met his untimely end at the age of 20, this theory has become an indispensable tool in understanding the symmetries and solvability of polynomial equations. Galois Theory emerged in the early 19th century, marking a profound shift in algebraic thinking, and has since become a cornerstone of modern mathematics. Putting aside the fact that Galois was 17 when he published this ground-breaking work, the work has continued influence in group theory and its application range from cryptography and coding theory (the study of error-correcting codes). In this article, we will delve deeper into the question of why there are no radical solutions for quintic equations (more generally, polynomials with degree \geq5), and prove such a result utilizing Galois Theory. 

The Abel-Ruffini theorem, named after mathematicians Niels Henrik Abel and Paolo Ruffini, was first published in 1799 and refined in 1824. It was the first of its kind to show that it was impossible to solve quintic polynomials algebraically in the sense that there was no general equation for solving such polynomials (however, solutions can still be found for such equations through methods such as Newton-Raphson iteration). This was a revelation at the time – given the easy solvability of quadratics with the quadratic equation (x=\frac{-b\pm\sqrt{b^2-4ac}}{2a})  and the existence of the cubic and quartic equation, the discovery of a quintic polynomial seemed inevitable accompanied with the development in the complex world. This provided elementary solutions to quadratic, cubic and quartic polynomials – all such polynomials with integer coefficients must have algebraically expressible solutions, however complex it may be. This however does not hold for polynomials with degree 5 or above : the simplest example is x^5+x+1=0 where there are simply no expressions only involving algebraic functions that can express the solution. (In this case, it is actually necessary to use Bring radicals1 – however in real applications this is almost always never used as numerical approximations are sufficient alone).

Babylonian mathematicians as early as 2000 BC have explored how to solve quadratics, albeit not with the algebraic method we are accustomed with. The Babylonians used what is known as the “Babylonian tablet method” to solve quadratic equations. This method involved finding geometric shapes, such as squares and rectangles, whose areas were related to the coefficients of the quadratic equation.

For example, to solve the quadratic equation x^2 + 3x = 10, the Babylonians would construct a square with an area of 10 square units and then add a rectangle with a width of 3 units and an unknown length (representing x) such that the total area of the square and rectangle equaled the area of the larger square.

Through a series of geometric manipulations and measurements, the Babylonians were able to find approximate solutions to quadratic equations. While their methods may seem primitive by modern standards, they were remarkably effective and laid the foundation for later developments in algebraic techniques for solving quadratic equations. Building on this framework, by around the 9th century CE the quadratic formula was well established. However, the concept of a complex root was not defined so only real solutions were ever considered, with those with such solutions deemed unsolvable.

After the ancient Greeks had developed methods for solving quadratic equations primarily in geometric contexts, mathematicians in the Islamic world, such as Al-Khwarizmi and Omar Khayyam, made significant contributions to algebra during the Middle Ages. They further advanced the understanding of quadratic and cubic equations, although their approaches were often geometric or algebraic rather than purely algebraic. Later, in the Renaissance era, European mathematicians such as Viète and Descartes began to develop algebraic notation and methods that laid the groundwork for more systematic approaches to solving polynomial equations. These developments culminated in the work of Gerolamo Cardano, who, in the mid-16th century, published his method for solving cubic equations, which became known as Cardano’s formula. This marked a significant milestone in the history of algebra, paving the way for further advancements in the study of polynomial equations and laying the foundation for the emergence of modern algebra.

Cardano’s formula is a method for finding the roots of a cubic equation. The formula was first published by the Italian mathematician Gerolamo Cardano in his book “Ars Magna” in 1545. It is a fundamental result in algebra and was one of the key developments in the study of polynomial equations.

Cardano’s formula is a method for finding the solutions to a cubic equation, which is an equation of the form ax^3 + bx^2 + cx + d = 0, where a, b, c, d \in \mathbb{R}. The formula provides a way to find the roots (solutions) of such cubic equations.

Cardano’s formula was first published by the Italian mathematician Gerolamo Cardano in his book “Ars Magna” in 1545.

Cardano’s Equation

This formula allows one to find the roots of any cubic equation, including those that do not have rational solutions. More interestingly, this introduced the concept of an imaginary number inside a square root resulting in a real number – this was never heard of at the time however this is not our main focus for this topic – for more details please see How it all began with the complex numbers by North Dakota State University.

Soon after, in the 16th century, the quartic formula was derived by mathematicians such as Lodovico Ferrari and Rafael Bombelli, following earlier work on solving cubic equations. The solution for quartic equations involves a combination of algebraic operations, including solving a depressed quartic (a quartic equation without the x^3 term) and then using techniques such as substitution, completing the square, and solving quadratic equations to find the roots.

However, the quartic formula is considerably more complex than the quadratic and cubic formulas, and its practical utility is limited by the complexity of its solutions. In many cases, quartic equations are solved using numerical methods or approximation techniques rather than by applying the quartic formula directly.

So with all of this, we can start to observe why a generalized quintic equation might be unfeasible. With the formula becoming more and more complicated, it’s no wonder that at the quintic level it is impossible to produce such an equation – more mathematically, there are no expressions involving only algebraic functions that can express the solution.

To prove this here, we will utilize group theory, more specifically the sets relating the roots of the equation and the coefficients of the polynomial. We define the solutions of an n^\text{th}-ordered polynomial as elements of a set {s_1, s_2,\ldots, s_n}, where said polynomial is defined as f(x)=x^n+{a_1}x^{n-1}+\ldots+a_{n-1}x+a_n, a\in\mathbb{R} (This can be generalized for all a\in\mathbb{C} however this is rarely required thus we will not consider this), with leading coefficient 1. Now, according to the fundamental theorem of algebra, there are at most n distinct solutions for a arbitrary polynomial of n^\text{th} order, thus we can express the polynomial in the form (x-s_1)(x-s_2)\dots(x-s_n). Now, given this information, we are to attempt to construct a function which takes the input of \{a_1, a_2, \ldots, a_n\} and returns \{s_1, s_2, \ldots, s_n\}, and prove that this is impossible for n\geq5. (Note that however the inverse is possible – it is trivial that for any set of solutions we can indeed construct the respective polynomial as seen in the next paragraph.)

We first note that switching the order of individual solutions does not affect the outcome of the polynomial as multiplication is commutative, i.e. (x-1)(x-2)=(x-2)(x-1)=x^2-3x+2. However, the inverse is not true as e.g. x^2-3x+2\neq{x^2+2x-3}, which gives us a hint at why our statement of constructing a polynomial given its set of solutions is trivial but not the inverse. In fact, given this we can prove our first statement that any general solutions for polynomials with order > 1 must involve at least 1 multi-valued function, i.e. a root of k^\text{th} order where k \in\mathbb{R}. We achieve this through contradiction – this is trivial and is left as an exercise for the readers.

Now with this we are able to rephrase our question – in fact our aim now is to prove that any general solution for a n^\text{th}-ordered polynomial where n\geq5 must necessarily involve more than what is expressible by such functions which makes it undefinable – more specifically the ability to generate all solutions with all of them commutative. This will prove crucial as we consider solution sets of different sizes as we are able to invoke Galois Theory for such sets.

For the set of the respective solutions, we note that we have two operations that we can carry out. The first of which are transpositions, which are carried out by exchanging the position of two solutions. Note we can have solutions exchanging positions with itself, which can be treated as an identity transformation. This is shown below :

The second operation we can carry out is a cycle, which exchanges the position of 3 solutions in a cycle as shown below.

We can first look at the solutions for a quadratic, i.e. the set \{s_1, s_2\}. By plotting such solutions on the complex plane, we have the following distinct operations that we could carry out on the set of solutions:

We denote this commutator as \omega. To attain our original solution we require \omega^{-1}, the inverse operation to be carried out again. As seen here, \omega\omega^{-1} returns the original result. This is in fact the group S_2.

Based on this result, we can look at the set of solutions of third and fourth order polynomials which follow from the same result. In particular, they represent what is called the S_3 and S_4 symmetric groups, which are solvable groups, i.e. a group that can be constructed from abelian groups using extensions. As all such operators are commutative as is indicated by their abelian nature, any operations when reversed in order will result in the same result, i.e. \alpha\beta\alpha^{-1}\beta^{-1}=E where E is the identity transformation. This implies that we can in fact construct a general formula for the roots of the equation for cubic and quartic polynomials as the results are commutative – implying that for any coefficients of the polynomial only one set of solutions are produced through our function.

The composition series of a group gives us a way to break down the group into simpler, normal subgroups. In the case of the symmetric group S_5, its composition series is {E, A_5, S_5}, where E is the identity element and A_5 is the group of even permutations of a finite set. The Jordan‒Hölder theorem states that every composition series is equivalent, thus we shall only consider the above series.

The significance of this composition series lies in the factor groups obtained at each step. The factor group S_5/A_5 is isomorphic to C_2, the cyclic group of order 2. This means that when we “mod out” A_5 from S_5, we’re left with a group that behaves like the integers modulo 2 under addition. The remaining factor group A_5/E is just A_5 itself.

Now, why does A_5 being a non-abelian group matter in the context of solvability? The fact that A_5 is non-abelian implies that the symmetric group S_5 is not solvable (this is easily proved from a counterexample, as (1, 2, 3)(1, 2, 3, 4, 5)\neq(1, 2, 3, 4, 5)(1, 2, 3) where both are elements of A_5). This is shown as a fundamental result from group theory, where a group is solvable if it has a composition series where each factor group is abelian. Since S_5 contains a non-abelian subgroup (A_5), it cannot be broken down into abelian factor groups, thus making it unsolvable.

Galois Theory tells us that for a polynomial equation to be solvable by radicals (i.e., its roots expressible in terms of radicals and arithmetic operations), its associated group of symmetries must be solvable. Since S_5 is not solvable, there’s no general formula for the roots of a quintic polynomial using radicals and arithmetic operations alone. This is the essence of why there’s no quintic formula analogous to the quadratic, cubic, and quartic formulas.

To conclude : the mathematics behind such a theorem is incredibly complicated yet elementary – involving a variety of different approaches from group theory stemming from an seemingly algebra-like question. I would recommend reading the below references for more information on the topic, and also the following book by Alekseev for a more comprehensive understanding of the work by Abel. Also as a side note : I will be updating this website monthly following a new schedule, so expect a new piece out every 15th at 11:00 a.m. GMT!

References :
Abel–Ruffini’s Theorem: Complex but Not Complicated! – Paul Ramond https://mathcenter.oxford.emory.edu/site/math108/abel/abels_impossibility_proof.pdf
Why you can’t solve quintic equations (Galois theory approach) – Mathmaniac
https://www.youtube.com/watch?v=zCU9tZ2VkWc
Why There’s ‘No’ Quintic Formula (proof without Galois theory) – not all wrong
https://www.youtube.com/watch?v=BSHv9Elk1MU

  1. https://en.wikipedia.org/wiki/Bring_radical ↩︎

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *