Polynomial: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>Sébastien Moulin
(New article : writing the basics (not finished))
 
mNo edit summary
 
(45 intermediate revisions by 12 users not shown)
Line 1: Line 1:
In [[algebra]], a '''polynomial''' is, roughly speaking, an formal expression obtained from constant numbers and one or several unspecified numbers called "variables", denoted by letters like <math>x</math>, <math>y</math>, etc., by making a finite number of additions, subtractions and multiplications. For instance, <math>x^2-2x+1</math> and <math>\frac{1}{2}x^3+x-\sqrt{2}</math> are polynomials of one variable, whereas <math>x^2+y^2</math> is a polynomial of two variables. Expressions like <math>\frac{x-1}{x^2+2}</math> or <math>\sqrt{x^2+1}</math> are ''not'' polynomials ; the first one is a [[rational fraction]], and the second one is an [[irrational]] expression, due to the [[square root]] symbol. Let us remark however that any operation may be allowed between constant numbers (including fractions or taking square root), as the <math>\frac{1}{2}x^3+x-\sqrt{2}</math> example suggests.
{{subpages}}


It may be convenient to think of a polynomial as a function of its variables, that is, <math>x\mapsto x^2-2x+1</math> or <math>(x,y)\mapsto x^2+y^2</math>. Such a function is called a [[polynomial function]]. But in reality, both concepts are different, the unspecified variables being purely ''formal'' entities when one thinks of an abstract polynomial, whereas they are meant to be replaced by ''any number'' when one thinks of a function. The distinction is important in [[abstract algebra]], because what we have called "constant numbers" is more generally replaced by any [[ring (mathematics)|ring]], and for some rings the two concepts cannot be identified. There is not such a problem with polynomials over rings of usual numbers like [[integer]]s, [[rational number|rational]], [[real number|real]] or [[complex number|complex]] numbers. Still it is important to understand that calculations with polynomials can be conceived in an only formal way, without giving any special ontological status to the variables. To make the distinction clear, it is common in algebra to denote the abstract variables with capital letters (<math>X</math>, <math>Y</math>, etc.), while variables of functions are still denoted with lowercase letters. We will use this convention in what follows.
In [[algebra]], a '''polynomial''' is, roughly speaking, a formal [[expression (mathematics)|expression]] obtained from [[constant]]s and one or several [[variable]]s by making a finite number of additions, subtractions and multiplications. For instance, <math>x^2-2x+1</math> is a polynomial involving one variable, ''x'' (often called a polynomial ''in one variable''), whereas <math>x^2+y^2</math> is a polynomial in two variables, <math>x</math> and <math>y</math>.  


== Polynomials of one variable ==
Polynomials are an essential element of most applications of mathematics to a systematic study of physical problems and phenomena.  Polynomials are the most basic objects that can be used to represent situations where a certain quantity is unknown, like the time it will take for a ball thrown in the air to fall back to the ground.  They are also the most basic objects that can be used to represent a quantity that varies.  For instance, the [[reaction rate|rate]] at which a [[chemical reaction]] proceeds often varies as the [[concentration]]s of the [[reactant]]s change.  This variation is often specified by a polynomial whose variables are the concentrations. 


In this section we deal with the simplest case, that is, polynomials of only one variable, denoted <math>X</math>. The "constant numbers" are the element of any [[commutative]] [[ring (mathematics)|ring]] <math>R</math>. The reader who is not accustomed to abstract algebra may replace <math>R</math> by a familiar set of numbers, like the set <math>\mathbb{R}</math> of the [[real number]]s, as it is the case in the <math>\frac{1}{2}X^3+X-\sqrt{2}</math> example above, and still can grasp most of what follows.
The study of many phenomena requires more sophisticated objects than polynomials.   Sometimes such analysis is so complicated that it becomes essential to approximate certain objects with polynomials.  Approximation of sophisticated mathematical objects with polynomials is an important application of [[calculus]], and is an essential tool in [[numerical analysis]].


=== Definition ===
The polynomials encountered first in school and occurring most commonly in applications have [[real number]] coefficients.  However, there are applications where other types of coefficients occur.  For instance, the study of very small phenomena using [[quantum mechanics]] requires the use of [[complex numbers]].  [[Error-correcting code]]s, used to avoid loss or corruption of data, involve coefficients governed by [[modular arithmetic|arithmetic modulo 2]].  Polynomials with complex number coefficients or coefficients modulo 2  behave similarly to polynomials with real number coefficients in many ways.  The similarities and differences are discussed on the [[polynomial/Advanced|advanced version]] of this page.  Polynomials with even more general types of coefficients are important in advanced mathematics, and are discussed on the [[polynomial ring]] page.  For the rest of the present article, all polynomials considered will have real number coefficients.


Let us consider some expressions like <math>X^2-2X+1</math>, <math>\frac{1}{2}X^3+X-\sqrt{2}</math>, or <math>2X^5-3X^2+1</math>. We can write all of them as follows:
<center><math>X^2-2X+1=1+(-2)X+1X^2+0X^3+0X^4+\cdots,</math></center>
<center><math>\frac{1}{2}X^3+X-\sqrt{2}=-\sqrt{2}+1X+0X^2+\frac{1}{2}X^3+0X^4+\cdots,</math></center>
<center><math>2X^5-3X^2+1=1+0X+(-3)X^2+0X^3+0X^4+2X^5+0X^6+\cdots.</math></center>
This suggests that a polynomial can be entirely defined by giving a sequence of numbers, which are called its ''coefficients'', all of them being zero from some rank. For instance the three polynomials above can be written respectively <math>(1,-2,1,0,0,\cdots)</math>, <math>\left(-\sqrt{2},1,0,\frac{1}{2},0,\cdots\right)</math>, and <math>(1,0,-3,0,0,2,0,\cdots)</math>, the dots meaning the sequence continues with an infinity of zeros. This leads to the definition below.


'''Definition.''' A ''polynomial'' <math>P</math>, over the ring <math>R</math> is a sequence <math>P=\left(a_0,a_1,a_2,\cdots,a_n,\cdots\right)</math> of elements of <math>R</math>, called the ''coefficients'' of <math>P</math>, this sequence containing only a finite number of nonzero terms. The rank of the last nonzero term is called the ''degree'' of the polynomial.
== Non-Examples ==


Hence, the degrees of the three polynomials given above are respectively 2, 3 and 5. By convention, the degree of <math>(0,0,\cdots)</math> is set to <math>-\infty</math>.
Expressions like <math>\frac{x-1}{x^2+2}</math> or <math>\sqrt{x^2+1}</math> are ''not'' polynomials ; the first one is a [[rational function]], because of the fraction bar (representing [[division]]), and the second one is an irrational expression, because of the [[square root]] symbol. Such operations might be expressed within the constant numbers, as in the example <math>\frac{1}{2}x^3+x-\sqrt{2}</math>, but this is only because <math>\frac{1}{2}</math> and <math>\sqrt{2}</math> are elements of the set (e.g. real numbers) that are being used as coefficients of the polynomials.


This definition may surprise the reader, because in reality, one thinks of a polynomial as an expression of the form <math>a_0+a_1X+a_2X^2+\cdots+a_nX^n</math> rather than <math>\left(a_0,a_1,a_2,\cdots ,a_n,\cdots\right)</math>. We will progressively show how to return to this usual way of writing a polynomial. First, we identify any element <math>a_0</math> of the ring to the polynomial <math>\left(a_0,0,0,\cdots\right)</math>. For instance, we write only <math>7</math> instead of the cumbersome <math>\left(7,0,0,\cdots\right)</math>, (or in the familiar fashion <math>7+0X+0X^2+\cdots</math>).
Expressions involving [[exponential function]]s, like <math>2^x + 1</math>, are often mistaken for polynomials because of the similarity of notation. It is important to realize that the [[exponent]]s appearing in a polynomial are always constants; when an expression has a variable in an exponent, such as the exponent ''x'' on the base ''2'' in the above expression, the expression is not a polynomial.


Secondly, we merely denote by <math>X</math> the polynomial
== Monomials and simplification ==
<center><math>X=\left(0,1,0,0,\cdots\right)</math>.</center>
This is natural, as in the familiar fashion this sequence corresponds to <math>0+1X+0X^2+0X^3+\cdots</math> It remains to give a sense to <math>X^2</math>, <math>X^3</math>, etc. This will be made in the next two subsections.


=== Calculation rules ===
The simplest polynomials are the [[monomials]] having only one [[term (mathematics)|term]].  A monomial is either a [[constant]], or a product of constants and positive whole number [[exponent|power]]s of variables.  The constant is called the '''coefficient''' of the monomial.  A coefficient equal to ''1'' is typically dropped from the notation, so that <math>x^2</math> represents the same monomial as <math>1x^2</math>.  Coefficients are usually written in as simplified a form as possible, but even then can sometimes look rather complicated, such as the coefficient in <math> (\pi + \sqrt{3} + 1)x</math>.  When a coefficient involves a sum, parentheses must be inserted around the coefficient.


We now define addition and multiplication of polynomials, beginning with addition, which is easy.
Monomials are usually immediately simplified so that each variable is written once.  For instance, <math>x^2 y x</math> would be simplified by combing both powers of the variable ''x'', producing <math>x^3 y</math>.  When a monomial involves several variables, the order in which the variables are written is unimportant.  By convention, the variables are typically listed in alphabetical order.  For instance, <math>x^3 y</math> and <math>y x^3</math> are considered to be the same monomial, but the latter would usually be rewritten as the former.


==== Addition ====
Polynomials are constructed by adding together a finite number of monomials.  For instance, ''1'', <math>2x^4</math>, and <math>-3x^2yz^3</math> are monomials, and their sum, <math>1 + 2x^4 - 3x^2yz^3</math> is a polynomial which is not a monomial. 


With the traditional notation, if we have <math>P=2X^5-3X^2+1</math> and <math>Q=-X^5+4X^4+2X^2-1</math>, we want to have <math>P+Q=(2-1)X^5+4X^4+(-3+2)X^2+1-1=X^5+4X^4-X^2</math>, that is, one wants to add coefficients separately for each degree. This leads to the formal definition below.
There are two main procedures used in simplifying a polynomial.  First, one can remove [[grouping (mathematics)|groupings]] by [[distributive property|distributing]], which produces a sum of monomials.  For instance, <math>2x (x+2) + x(2x+2)</math> simplifies to <math>2x^2 + 4x + 2x^2 + 2x</math>. Second, one can combine [[like terms]] to reduce the number of monomials in the expression.  For instance, the above polynomial can be further simplified to <math>4x^2 + 6x</math> by combining the two terms involving  <math>x^2</math> and the two terms involving ''x''.


'''Definition.''' Given two polynomials <math>P=\left(a_0,a_1,a_2,\cdots\right)</math> and <math>Q=\left(b_0,b_1,b_2,\cdots\right)</math>, the sum <math>P+Q</math> is defined by <math>P+Q=\left(a_0+b_0,a_1+b_1,a_2+b_2,\cdots\right)</math>.
== Polynomials in one variable ==


==== Multiplication ====
In this section we deal with the simplest case, that is, polynomials involving only one variable, denoted <math>x</math>.


Multiplication is harder to define. Let us begin with an example using traditional notation. For <math>P=X^2+X-2</math> and <math>Q=2X^2-3X+1</math>, we want to have
===Degree and the Standard Form===
<center><math>PQ=X^2\left(2X^2-3X+1\right)+X\left(2X^2-3X+1\right)-2\left(2X^2-3X+1\right)</math>;</center>
<center><math>PQ=2X^4+(-3+2)X^3+(1-3-2\cdot 2)X^2+(1-2\cdot (-3))X-2</math>;</center>
<center><math>PQ=2X^4-X^3-6X^2+7X-2</math>.</center>
One can observe that the coefficient of say, <math>X^2</math>, is obtained by adding <math>1\cdot 1</math>, <math>1\cdot (-3)</math> and <math>-2\cdot 2</math>, that is, by adding all the <math>a_ib_j</math> so that <math>i+j=2</math>, where the <math>a_i</math> denote the coefficients of <math>P</math> and the <math>b_j</math> those of <math>Q</math>. Those mechanics lead to give the definition below.


'''Definition.''' Given two polynomials <math>P=\left(a_0,a_1,a_2,\cdots\right)</math> and <math>Q=\left(b_0,b_1,b_2,\cdots\right)</math>, the product <math>PQ</math> is defined by <math>PQ=\left(c_0,c_1,c_2,\cdots\right)</math>, where for every index <math>k</math>, the coefficient <math>c_k</math> is given by <math>c_k=\sum_{i+j=k}a_ib_j</math>.


The reader which is upset by those cumbersome notations should just retain that this definition allows to multiply polynomials (considered as mere sequences of coefficients) as one is used to do in elementary algebra (using the traditional notation, as in the example). The only striking fact is that in our construction, <math>X</math> does not represent a number, but a pure abstract entity for which we have defined some rules of calculation.
After [[simplify (algebra)|simplification]], a polynomial can be written as a finite sum of monomials.    It is sometimes useful to explicitly write a power of a variable in each monomial, even the constants. To accomplish this, you can write <math>x^0</math> after the constant, so that ''2'' and <math>2x^0</math> are considered the same.


==== The algebra <math>R[X]</math> ====
The power of the variable appearing in a monomial is the '''degree''' of the monomial.  By the above convention, a constant ''c'' is the same as <math>c x^0</math> and has degree equal to ''0''.  The degree of a polynomial is the ''largest'' of the degrees of the monomials appearing in the polynomial.  The only exception is the constant polynomial 0, which typically is not assigned a degree (for reasons made clear below).  As an example, ''2'' has degree 0, <math>x^2 + x</math> has degree ''2'', and <math>-3x^4+x^2+x^5</math> has degree ''5''.


With the definition above, one can verify that the product of the polynomial <math>X=\left(0,1,0,0,\cdots\right)</math> par itself, that is <math>X^2</math>, is the sequence <math>X^2=\left(0,0,1,0,0,\cdots\right)</math>. More generally, for each [[natural number]] <math>n</math>, one can verify that the <math>n</math>-th power of <math>X</math> is given by
The degree is an important identifier when working with polynomials.  For instance, many procedures for [[factor]]ing or solving [[polynomial equation]]s require identifying the degree of the polynomial first.  In the last example above, we had to scan through the polynomial from the left all the way through the right to determine that the degree is ''5''.  To facilitate identifying the degree of a polynomial, as well as manipulations of polynomials, they are usually written in '''standard form'''.  The standard form of a polynomial is obtained by simplifying as described above and then writing the monomials so that the exponents decrease from left to right. The degree of a polynomial in standard form is the degree of the first monomial appearing.  The term of highest degree is the '''leading term''' and its coefficient is the '''leading coefficient'''.  A polynomial with leading coefficient equal to 1 is '''monic'''. We can put the last example above in standard form by rearranging the monomials to obtain <math>x^5-3x^4+x^2</math>.  It is of course just as easy to work with polynomials where the monomials are written so that the degrees increase from left to right.
<math>X^n=\left(0,\cdots,0,1,0,0,\cdots\right)</math>, where the <math>1</math> is the coefficient of index <math>n</math> and all other coefficients are zeros. In particular, we have the usual convention <math>X^0=\left(1,0,0,\cdots\right)</math>, which we identified to the constant <math>1</math>.


Now, any polynomial <math>P=\left(a_0,a_1,a_2,\cdots,a_n,0,0,\cdots\right)</math> is ''exactly'' equal to <math>a_0+a_1X+a_2X^2+\cdots+a_nX^n</math>, where the addition and the powers (which are mere repetitions of multiplications) are defined as in the preceding subsection. Our whole construction legitimates the traditional notation, and from now on, we will only use the later, with which calculations use natural rules of elementary algebra. It is however important to remember that the "variable" <math>X</math> did not denote some number in our construction, but a particular sequence of coefficients. We have succeeded in defining polynomials in a purely formal manner.


==== Operations and degree: the algebra <math>R_n[X]</math> ====


=== Polynomial function ===


== Arithmetics ==
== Polynomials in several variables ==


== Polynomials of several variables ==
==Polynomial equations ==


== Applications of polynomials ==
== Polynomial functions ==
 
== Applications of polynomials ==[[Category:Suggestion Bot Tag]]

Latest revision as of 16:00, 5 October 2024

This article is developing and not approved.
Main Article
Discussion
Related Articles  [?]
Bibliography  [?]
External Links  [?]
Citable Version  [?]
Advanced [?]
 
This editable Main Article is under development and subject to a disclaimer.

In algebra, a polynomial is, roughly speaking, a formal expression obtained from constants and one or several variables by making a finite number of additions, subtractions and multiplications. For instance, is a polynomial involving one variable, x (often called a polynomial in one variable), whereas is a polynomial in two variables, and .

Polynomials are an essential element of most applications of mathematics to a systematic study of physical problems and phenomena. Polynomials are the most basic objects that can be used to represent situations where a certain quantity is unknown, like the time it will take for a ball thrown in the air to fall back to the ground. They are also the most basic objects that can be used to represent a quantity that varies. For instance, the rate at which a chemical reaction proceeds often varies as the concentrations of the reactants change. This variation is often specified by a polynomial whose variables are the concentrations.

The study of many phenomena requires more sophisticated objects than polynomials. Sometimes such analysis is so complicated that it becomes essential to approximate certain objects with polynomials. Approximation of sophisticated mathematical objects with polynomials is an important application of calculus, and is an essential tool in numerical analysis.

The polynomials encountered first in school and occurring most commonly in applications have real number coefficients. However, there are applications where other types of coefficients occur. For instance, the study of very small phenomena using quantum mechanics requires the use of complex numbers. Error-correcting codes, used to avoid loss or corruption of data, involve coefficients governed by arithmetic modulo 2. Polynomials with complex number coefficients or coefficients modulo 2 behave similarly to polynomials with real number coefficients in many ways. The similarities and differences are discussed on the advanced version of this page. Polynomials with even more general types of coefficients are important in advanced mathematics, and are discussed on the polynomial ring page. For the rest of the present article, all polynomials considered will have real number coefficients.


Non-Examples

Expressions like or are not polynomials ; the first one is a rational function, because of the fraction bar (representing division), and the second one is an irrational expression, because of the square root symbol. Such operations might be expressed within the constant numbers, as in the example , but this is only because and are elements of the set (e.g. real numbers) that are being used as coefficients of the polynomials.

Expressions involving exponential functions, like , are often mistaken for polynomials because of the similarity of notation. It is important to realize that the exponents appearing in a polynomial are always constants; when an expression has a variable in an exponent, such as the exponent x on the base 2 in the above expression, the expression is not a polynomial.

Monomials and simplification

The simplest polynomials are the monomials having only one term. A monomial is either a constant, or a product of constants and positive whole number powers of variables. The constant is called the coefficient of the monomial. A coefficient equal to 1 is typically dropped from the notation, so that represents the same monomial as . Coefficients are usually written in as simplified a form as possible, but even then can sometimes look rather complicated, such as the coefficient in . When a coefficient involves a sum, parentheses must be inserted around the coefficient.

Monomials are usually immediately simplified so that each variable is written once. For instance, would be simplified by combing both powers of the variable x, producing . When a monomial involves several variables, the order in which the variables are written is unimportant. By convention, the variables are typically listed in alphabetical order. For instance, and are considered to be the same monomial, but the latter would usually be rewritten as the former.

Polynomials are constructed by adding together a finite number of monomials. For instance, 1, , and are monomials, and their sum, is a polynomial which is not a monomial.

There are two main procedures used in simplifying a polynomial. First, one can remove groupings by distributing, which produces a sum of monomials. For instance, simplifies to . Second, one can combine like terms to reduce the number of monomials in the expression. For instance, the above polynomial can be further simplified to by combining the two terms involving and the two terms involving x.

Polynomials in one variable

In this section we deal with the simplest case, that is, polynomials involving only one variable, denoted .

Degree and the Standard Form

After simplification, a polynomial can be written as a finite sum of monomials. It is sometimes useful to explicitly write a power of a variable in each monomial, even the constants. To accomplish this, you can write after the constant, so that 2 and are considered the same.

The power of the variable appearing in a monomial is the degree of the monomial. By the above convention, a constant c is the same as and has degree equal to 0. The degree of a polynomial is the largest of the degrees of the monomials appearing in the polynomial. The only exception is the constant polynomial 0, which typically is not assigned a degree (for reasons made clear below). As an example, 2 has degree 0, has degree 2, and has degree 5.

The degree is an important identifier when working with polynomials. For instance, many procedures for factoring or solving polynomial equations require identifying the degree of the polynomial first. In the last example above, we had to scan through the polynomial from the left all the way through the right to determine that the degree is 5. To facilitate identifying the degree of a polynomial, as well as manipulations of polynomials, they are usually written in standard form. The standard form of a polynomial is obtained by simplifying as described above and then writing the monomials so that the exponents decrease from left to right. The degree of a polynomial in standard form is the degree of the first monomial appearing. The term of highest degree is the leading term and its coefficient is the leading coefficient. A polynomial with leading coefficient equal to 1 is monic. We can put the last example above in standard form by rearranging the monomials to obtain . It is of course just as easy to work with polynomials where the monomials are written so that the degrees increase from left to right.



Polynomials in several variables

Polynomial equations

Polynomial functions

== Applications of polynomials ==