Cauchy-Schwarz inequality: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>Aleksander Stos
m (templates)
mNo edit summary
 
(10 intermediate revisions by 4 users not shown)
Line 1: Line 1:
{{subpages}}
{{subpages}}
In [[mathematics]], the '''Cauchy-Schwarz inequality''' is a fundamental and ubiquitously used inequality that relates the absolute value of the [[inner product]] of two elements of an [[inner product space]] with the magnitude the two said vectors. It is named in the honor of the French mathematician [[Augustin Louis Cauchy]]<ref>{{MacTutorBio|id=Cauchy}}</ref> and German mathematician [[Hermann Amandus Schwarz]]<ref>{{MacTutorBio|id=Schwarz}}</ref>.  
In [[mathematics]], the '''Cauchy-Schwarz inequality''' is a fundamental and ubiquitously used inequality that relates the absolute value of the [[inner product]] of two elements of an [[inner product space]] with the magnitude of the two said vectors. It is named in the honor of the French mathematician [[Augustin-Louis Cauchy]] and German mathematician [[Hermann Amandus Schwarz]]<ref>{{MacTutorBio|id=Schwarz}}</ref>.  


==Statement of the Cauchy-Schwarz inequality==
==The inequality for real numbers==
Let ''V'' be a [[complex number|complex]] [[inner product space]] with inner product <math>\langle \cdot,\cdot \rangle</math>. Then for any two elements <math>x_1,x_2 \in V</math> it holds that
The simplest form of the inequality, and the first one considered historically, states that
:<math> ( x_1 y_1 + x_2 y_2 + \cdots + x_n y_n )^2 \le ( x_1^2 + x_2^2 + \cdots + x_n^2 ) ( y_1^2 + y_2^2 + \cdots + y_n^2 ) </math>
for all real numbers ''x''<sub>1</sub>, &hellip;, ''x''<sub>''n''</sub>, ''y''<sub>1</sub>, &hellip;, ''y''<sub>''n''</sub> (where ''n'' is a arbitrary positive integer). Furthermore, the inequality is in fact an equality
:<math> ( x_1 y_1 + x_2 y_2 + \cdots + x_n y_n )^2 = ( x_1^2 + x_2^2 + \cdots + x_n^2 ) ( y_1^2 + y_2^2 + \cdots + y_n^2 ) </math>
if and only if there is a number ''C'' such that <math>x_i = Cy_i</math> for all ''i''.


::<math>|\langle x_1,x_2 \rangle|\leq \|x_1\|\|x_2\|,\quad (1)</math>
==The inequality for inner product spaces==
Let ''V'' be a [[complex number|complex]] [[inner product space]] with inner product <math>\langle \cdot,\cdot \rangle</math>. Then for any two elements <math>x,y \in V</math> it holds that


where <math>\|y\|=\langle y,y \rangle^{1/2} </math> for all <math>y \in V</math>. Inequality (1) is the Cauchy-Schwarz inequality.
:<math>|\langle x,y \rangle|\leq \|x\|\|y\|,\quad (1)</math>
 
where <math>\|a\|=\langle a,a \rangle^{1/2} </math> for all <math>a \in V</math>. Furthermore, the equality in (1) holds if and only if the vectors <math>x</math> and <math>y</math> are [[linear independence|linearly dependent]] (in this case proportional one to the other).
 
If ''V'' is the [[Euclidean space]] '''R'''<sup>''n''</sup>, whose inner product is defined by
:<math> \langle x,y \rangle = \sum_{i=1}^n x_i y_i, </math>
then (1) yields the inequality for real numbers mentioned in the previous section.
 
Another important example is where ''V'' is the [[Lp space|space L<sup>2</sup>([''a'', ''b''])]]. In this case, the Cauchy-Schwarz inequality states that
:<math> \left( \int_a^b f(x) g(x) \,\mathrm{d}x \right)^2 \le \int_a^b \big( f(x) \big)^2 \,\mathrm{d}x \cdot \int_a^b \big( g(x) \big)^2 \,\mathrm{d}x </math>
for all real functions ''f'' and ''g'' in <math>\scriptstyle L^2([a,b])</math>.


==Proof of the inequality==
==Proof of the inequality==
A standard yet clever idea for a proof of the Cauchy-Schwarz inequality is to exploit the fact that the inner product induces a [[quadratic form]] on ''V''. Let <math>x,y</math> be some fixed pair of vectors in ''V'' and let <math>\phi(x,y)</math> be the argument of the complex number <math>\langle x,y\rangle</math>. Now, consider the expression <math>f(t)=\langle x+t e^{i\phi(x,y)} y, x+te^{i\phi(x,y)} y\rangle</math> for any real number ''t'' and notice that, by the properties of a complex inner product, ''f'' is a quadratic function of ''t''. Moreover, ''f'' is non-negative definite: <math>f(t)\geq 0 </math> for all ''t''. Expanding the expression for ''f'' gives the following:
A standard yet clever idea for a proof of the Cauchy-Schwarz inequality for inner product spaces is to exploit the fact that the inner product induces a [[quadratic form]] on ''V''. Let <math>x,y</math> be some fixed pair of vectors in ''V'' and let <math>\phi(x,y)</math> be the argument of the complex number <math>\langle x,y\rangle</math>. Now, consider the expression <math>f(t)=\langle x+t e^{i\phi(x,y)} y, x+te^{i\phi(x,y)} y\rangle</math> for any real number ''t'' and notice that, by the properties of a complex inner product, ''f'' is a quadratic function of ''t''. Moreover, ''f'' is non-negative definite: <math>f(t)\geq 0 </math> for all ''t''. Expanding the expression for ''f'' gives the following:


<math>
:<math>
\begin{align} f(t) &=  \langle x+te^{i\phi(x,y)} y,x+te^{i\phi(x,y)} y\rangle \\
\begin{align} f(t) &=  \langle x+te^{i\phi(x,y)} y,x+te^{i\phi(x,y)} y\rangle \\
     &=  \|x\|^2 + t e^{i\phi(x,y)}\langle y,x \rangle + t e^{-i\phi(x,y)}\langle x,y \rangle + t^2\|y\|^2 \\
     &=  \|x\|^2 + t e^{i\phi(x,y)}\langle y,x \rangle + t e^{-i\phi(x,y)}\langle x,y \rangle + t^2\|y\|^2 \\
Line 18: Line 33:
</math>  
</math>  
    
    
Since ''f'' is a non-negative definite quadratic function of ''t'', if follows that the [[discriminant]] of ''f'' is non-positive definite. That is,
Since ''f'' is a non-negative definite quadratic function of ''t'', it follows that the [[discriminant of a polynomial|discriminant]] of ''f'' is non-positive definite. That is,


:: <math> 4|\langle x,y \rangle|^2-4 \|x\|^2\|y\|^2=4(|\langle x,y \rangle|^2- \|x\|^2\|y\|^2) \leq 0, </math>
: <math> 4|\langle x,y \rangle|^2-4 \|x\|^2\|y\|^2=4(|\langle x,y \rangle|^2- \|x\|^2\|y\|^2) \leq 0, </math>


from which (1) follows immediately by the substitution <math>x_1 \rightarrow x</math> and <math>x_2 \rightarrow y</math>.
from which (1) follows immediately.


==References==
==References==
{{reflist|2}}
{{reflist}}[[Category:Suggestion Bot Tag]]

Latest revision as of 16:00, 25 July 2024

This article is developing and not approved.
Main Article
Discussion
Related Articles  [?]
Bibliography  [?]
External Links  [?]
Citable Version  [?]
 
This editable Main Article is under development and subject to a disclaimer.

In mathematics, the Cauchy-Schwarz inequality is a fundamental and ubiquitously used inequality that relates the absolute value of the inner product of two elements of an inner product space with the magnitude of the two said vectors. It is named in the honor of the French mathematician Augustin-Louis Cauchy and German mathematician Hermann Amandus Schwarz[1].

The inequality for real numbers

The simplest form of the inequality, and the first one considered historically, states that

for all real numbers x1, …, xn, y1, …, yn (where n is a arbitrary positive integer). Furthermore, the inequality is in fact an equality

if and only if there is a number C such that for all i.

The inequality for inner product spaces

Let V be a complex inner product space with inner product . Then for any two elements it holds that

where for all . Furthermore, the equality in (1) holds if and only if the vectors and are linearly dependent (in this case proportional one to the other).

If V is the Euclidean space Rn, whose inner product is defined by

then (1) yields the inequality for real numbers mentioned in the previous section.

Another important example is where V is the space L2([a, b]). In this case, the Cauchy-Schwarz inequality states that

for all real functions f and g in .

Proof of the inequality

A standard yet clever idea for a proof of the Cauchy-Schwarz inequality for inner product spaces is to exploit the fact that the inner product induces a quadratic form on V. Let be some fixed pair of vectors in V and let be the argument of the complex number . Now, consider the expression for any real number t and notice that, by the properties of a complex inner product, f is a quadratic function of t. Moreover, f is non-negative definite: for all t. Expanding the expression for f gives the following:

Since f is a non-negative definite quadratic function of t, it follows that the discriminant of f is non-positive definite. That is,

from which (1) follows immediately.

References

  1. Biography at MacTutor History of Mathematics, John J. O'Connor and Edmund F. Robertson, School of Mathematics and Statistics, University of St Andrews, Scotland.