Linear Program Polynomial Interpolation Methods

Posted on -
Linear Program Polynomial Interpolation Methods Rating: 4,6/5 4248 reviews
(Redirected from Vandermonde interpolation)

HZ HZ HZ HZ. Interpolation Curve Fitting f(x) = ax + b f(x) = ax + b for each line for entire range. CGN 3421 - Computer Methods Gurley Numerical Methods Lecture 5 - Curve Fitting Techniques page 87 of 99. Other examples of data sets that we can fit a function to. The interp1d class in the scipy.interpolate is a convenient method to create a function based on fixed data points, which can be evaluated anywhere within the domain defined by the given data using linear interpolation. By using the above data, let us create a interpolate function and draw a new interpolated graph.

In numerical analysis, polynomial interpolation is the interpolation of a given data set by the polynomial of lowest possible degree that passes through the points of the dataset.[1]

  • 4Uniqueness of the interpolating polynomial
  • 7Interpolation error

Applications[edit]

Polynomials can be used to approximate complicated curves, for example, the shapes of letters in typography,[citation needed] given a few points. A relevant application is the evaluation of the natural logarithm and trigonometric functions: pick a few known data points, create a lookup table, and interpolate between those data points. This results in significantly faster computations.[specify] Polynomial interpolation also forms the basis for algorithms in numerical quadrature and numerical ordinary differential equations and Secure Multi Party Computation, Secret Sharing schemes.

Polynomial interpolation is also essential to perform sub-quadratic multiplication and squaring such as Karatsuba multiplication and Toom–Cook multiplication, where an interpolation through points on a polynomial which defines the product yields the product itself. For example, given a = f(x) = a0x0 + a1x1 + .. and b = g(x) = b0x0 + b1x1 + .., the product ab is equivalent to W(x) = f(x)g(x). Finding points along W(x) by substituting x for small values in f(x) and g(x) yields points on the curve. Interpolation based on those points will yield the terms of W(x) and subsequently the product ab. In the case of Karatsuba multiplication this technique is substantially faster than quadratic multiplication, even for modest-sized inputs. This is especially true when implemented in parallel hardware.

Definition[edit]

Given a set of n + 1 data points (xi, yi) where no two xi are the same, one is looking for a polynomial p of degree at most n with the property

p(xi)=yi,i=0,,n.{displaystyle p(x_{i})=y_{i},qquad i=0,ldots ,n.}

The unisolvence theorem states that such a polynomial p exists and is unique, and can be proved by the Vandermonde matrix, as described below.

The theorem states that for n + 1 interpolation nodes (xi), polynomial interpolation defines a linear bijection

Ln:Kn+1Πn{displaystyle L_{n}:mathbb {K} ^{n+1}to Pi _{n}}

where Πn is the vector space of polynomials (defined on any interval containing the nodes) of degree at most n.

Constructing the interpolation polynomial[edit]

The red dots denote the data points (xk, yk), while the blue curve shows the interpolation polynomial.

Suppose that the interpolation polynomial is in the form

p(x)=anxn+an1xn1++a2x2+a1x+a0.(1){displaystyle p(x)=a_{n}x^{n}+a_{n-1}x^{n-1}+cdots +a_{2}x^{2}+a_{1}x+a_{0}.qquad (1)}

The statement that p interpolates the data points means that

p(xi)=yifor all i{0,1,,n}.{displaystyle p(x_{i})=y_{i}qquad {mbox{for all }}iin left{0,1,dots ,nright}.}

If we substitute equation (1) in here, we get a system of linear equations in the coefficients ak. The system in matrix-vector form reads the following multiplication:

[x0nx0n1x0n2x01x1nx1n1x1n2x11xnnxnn1xnn2xn1][anan1a0]=[y0y1yn].{displaystyle {begin{bmatrix}x_{0}^{n}&x_{0}^{n-1}&x_{0}^{n-2}&ldots &x_{0}&1x_{1}^{n}&x_{1}^{n-1}&x_{1}^{n-2}&ldots &x_{1}&1vdots &vdots &vdots &&vdots &vdots x_{n}^{n}&x_{n}^{n-1}&x_{n}^{n-2}&ldots &x_{n}&1end{bmatrix}}{begin{bmatrix}a_{n}a_{n-1}vdots a_{0}end{bmatrix}}={begin{bmatrix}y_{0}y_{1}vdots y_{n}end{bmatrix}}.}

We have to solve this system for ak to construct the interpolant p(x). The matrix on the left is commonly referred to as a Vandermonde matrix.

The condition number of the Vandermonde matrix may be large,[2] causing large errors when computing the coefficients ai if the system of equations is solved using Gaussian elimination.

Several authors have therefore proposed algorithms which exploit the structure of the Vandermonde matrix to compute numerically stable solutions in O(n2) operations instead of the O(n3) required by Gaussian elimination.[3][4][5] These methods rely on constructing first a Newton interpolation of the polynomial and then converting it to the monomial form above.

Alternatively, we may write down the polynomial immediately in terms of Lagrange polynomials:

p(x)=(xx1)(xx2)(xxn)(x0x1)(x0x2)(x0xn)y0+(xx0)(xx2)(xxn)(x1x0)(x1x2)(x1xn)y1++(xx0)(xx1)(xxn1)(xnx0)(xnx1)(xnxn1)yn=i=0n(ji0jnxxjxixj)yi{displaystyle {begin{aligned}p(x)&={frac {(x-x_{1})(x-x_{2})cdots (x-x_{n})}{(x_{0}-x_{1})(x_{0}-x_{2})cdots (x_{0}-x_{n})}}y_{0}+{frac {(x-x_{0})(x-x_{2})cdots (x-x_{n})}{(x_{1}-x_{0})(x_{1}-x_{2})cdots (x_{1}-x_{n})}}y_{1}+ldots +{frac {(x-x_{0})(x-x_{1})cdots (x-x_{n-1})}{(x_{n}-x_{0})(x_{n}-x_{1})cdots (x_{n}-x_{n-1})}}y_{n}&=sum _{i=0}^{n}left(prod _{stackrel {!0leq jleq n}{jneq i}}{frac {x-x_{j}}{x_{i}-x_{j}}}right)y_{i}end{aligned}}}

For matrix arguments, this formula is called Sylvester's formula and the matrix-valued Lagrange polynomials are the Frobenius covariants.

Uniqueness of the interpolating polynomial[edit]

Proof 1[edit]

Suppose we interpolate through n + 1 data points with an at-most n degree polynomial p(x) (we need at least n + 1 datapoints or else the polynomial cannot be fully solved for). Suppose also another polynomial exists also of degree at most n that also interpolates the n + 1 points; call it q(x).

Consider r(x)=p(x)q(x){displaystyle r(x)=p(x)-q(x)}. We know,

  1. r(x) is a polynomial
  2. r(x) has degree at most n, since p(x) and q(x) are no higher than this and we are just subtracting them.
  3. At the n + 1 data points, r(xi)=p(xi)q(xi)=yiyi=0{displaystyle r(x_{i})=p(x_{i})-q(x_{i})=y_{i}-y_{i}=0}. Therefore, r(x) has n + 1 roots.

But r(x) is a polynomial of degree n. It has one root too many. Formally, if r(x) is any non-zero polynomial, it must be writable as r(x)=A(xx0)(xx1)(xxn){displaystyle r(x)=A(x-x_{0})(x-x_{1})cdots (x-x_{n})}, for some constant A. By distributivity, the n + 1x's multiply together to give leading term Axn+1{displaystyle Ax^{n+1}}, i.e. one degree higher than the maximum we set. So the only way r(x) can exist is if A = 0, or equivalently, r(x) = 0.

r(x)=0=p(x)q(x)p(x)=q(x){displaystyle r(x)=0=p(x)-q(x)implies p(x)=q(x)}

So q(x) (which could be any polynomial, so long as it interpolates the points) is identical with p(x), and q(x) is unique.

Proof 2[edit]

Given the Vandermonde matrix used above to construct the interpolant, we can set up the system

Va=y{displaystyle Va=y}

To prove that V is nonsingular we use the Vandermonde determinant formula:

det(V)=i,j=0,i<jn(xixj){displaystyle det(V)=prod _{i,j=0,i<j}^{n}(x_{i}-x_{j})}

since the n + 1 points are distinct, the determinant can't be zero as xixj{displaystyle x_{i}-x_{j}} is never zero, therefore V is nonsingular and the system has a unique solution.

Either way this means that no matter what method we use to do our interpolation: direct, Lagrange etc., (assuming we can do all our calculations perfectly) we will always get the same polynomial.

Non-Vandermonde solutions[edit]

We are trying to construct our unique interpolation polynomial in the vector space Πn of polynomials of degree n. When using a monomial basis for Πn we have to solve the Vandermonde matrix to construct the coefficients ak for the interpolation polynomial. This can be a very costly operation (as counted in clock cycles of a computer trying to do the job). By choosing another basis for Πn we can simplify the calculation of the coefficients but then we have to do additional calculations when we want to express the interpolation polynomial in terms of a monomial basis.

One method is to write the interpolation polynomial in the Newton form and use the method of divided differences to construct the coefficients, e.g. Neville's algorithm. The cost is O(n2) operations, while Gaussian elimination costs O(n3) operations. Furthermore, you only need to do O(n) extra work if an extra point is added to the data set, while for the other methods, you have to redo the whole computation.

Another method is to use the Lagrange form of the interpolation polynomial. The resulting formula immediately shows that the interpolation polynomial exists under the conditions stated in the above theorem. Lagrange formula is to be preferred to Vandermonde formula when we are not interested in computing the coefficients of the polynomial, but in computing the value of p(x) in a given x not in the original data set. In this case, we can reduce complexity to O(n2).[6]

The Bernstein form was used in a constructive proof of the Weierstrass approximation theorem by Bernstein and has gained great importance in computer graphics in the form of Bézier curves.

Linear combination of the given values[edit]

The Lagrange form of the interpolating polynomial is a linear combination of the given values. In many scenarios, an efficient and convenient polynomial interpolation is a linear combination of the given values, using previously known coefficients. Given a set of k+1{displaystyle k+1} data points (x0,y0),,(xj,yj),,(xk,yk){displaystyle (x_{0},y_{0}),ldots ,(x_{j},y_{j}),ldots ,(x_{k},y_{k})} where each data point is a (position, value) pair and where no two positions xj{displaystyle x_{j}} are the same, the interpolation polynomial in the Lagrange form is a linear combination

y(x):=j=0kyjcj(x){displaystyle y(x):=sum _{j=0}^{k}y_{j}c_{j}(x)}

of the given values yj{displaystyle y_{j}} with each coefficient cj(x){displaystyle c_{j}(x)} given by evaluating the corresponding Lagrange basis polynomial using the given k+1{displaystyle k+1} positions xj{displaystyle x_{j}}.

cj(x)=j(x,x0,x1,,xk):=0mkmjxxmxjxm=(xx0)(xjx0)(xxj1)(xjxj1)(xxj+1)(xjxj+1)(xxk)(xjxk).{displaystyle c_{j}(x)=ell _{j}(x,x_{0},x_{1},ldots ,x_{k}):=prod _{begin{smallmatrix}0leq mleq kmneq jend{smallmatrix}}{frac {x-x_{m}}{x_{j}-x_{m}}}={frac {(x-x_{0})}{(x_{j}-x_{0})}}cdots {frac {(x-x_{j-1})}{(x_{j}-x_{j-1})}}{frac {(x-x_{j+1})}{(x_{j}-x_{j+1})}}cdots {frac {(x-x_{k})}{(x_{j}-x_{k})}}.}

Each coefficient cj(x){displaystyle c_{j}(x)} in the linear combination depends on the given positions xj{displaystyle x_{j}} and the desired position x{displaystyle x}, but not on the given values yj{displaystyle y_{j}}. For each coefficient, inserting the values of the given positions xj{displaystyle x_{j}} and simplifying yields an expression cj(x){displaystyle c_{j}(x)}, which depends only on x{displaystyle x}. Thus the same coefficient expressions cj(x){displaystyle c_{j}(x)} can be used in a polynomial interpolation of a given second set of k+1{displaystyle k+1} data points (x0,v0),,(xj,vj),,(xk,vk){displaystyle (x_{0},v_{0}),ldots ,(x_{j},v_{j}),ldots ,(x_{k},v_{k})} at the same given positions xj{displaystyle x_{j}}, where the second given values vj{displaystyle v_{j}} differ from the first given values yj{displaystyle y_{j}}. Using the same coefficient expressions cj(x){displaystyle c_{j}(x)} as for the first set of data points, the interpolation polynomial of the second set of data points is the linear combination

v(x):=j=0kvjcj(x).{displaystyle v(x):=sum _{j=0}^{k}v_{j}c_{j}(x).}

For each coefficient cj(x){displaystyle c_{j}(x)} in the linear combination, the expression resulting from the Lagrange basis polynomial j(x,x0,x1,,xk){displaystyle ell _{j}(x,x_{0},x_{1},ldots ,x_{k})} only depends on the relative spaces between the given positions, not on the individual value of any position. Thus the same coefficient expressions cj(x){displaystyle c_{j}(x)} can be used in a polynomial interpolation of a given third set of k+1{displaystyle k+1} data points

(t0,w0),,(tj,wj),,(tk,wk){displaystyle (t_{0},w_{0}),ldots ,(t_{j},w_{j}),ldots ,(t_{k},w_{k})}

where each position tj{displaystyle t_{j}} is related to the corresponding position xj{displaystyle x_{j}} in the first set by ti=axi+b{displaystyle t_{i}=a*x_{i}+b} and the desired positions are related by t=ax+b{displaystyle t=a*x+b}, for a constant scaling factor a and a constant shift b for all positions. Using the same coefficient expressions cj(t){displaystyle c_{j}(t)} as for the first set of data points, the interpolation polynomial of the third set of data points is the linear combination

w(t):=j=0kwjcj(t).{displaystyle w(t):=sum _{j=0}^{k}w_{j}c_{j}(t).}

In many applications of polynomial interpolation, the given set of k+1{displaystyle k+1} data points is at equally spaced positions. In this case, it can be convenient to define the x-axis of the positions such that xi=i{displaystyle x_{i}=i}. For example, a given set of 3 equally-spaced data points (x0,y0),(x1,y1),(x2,y2){displaystyle (x_{0},y_{0}),(x_{1},y_{1}),(x_{2},y_{2})} is then (0,y0),(1,y1),(2,y2){displaystyle (0,y_{0}),(1,y_{1}),(2,y_{2})}.

The interpolation polynomial in the Lagrange form is the linear combination

y(x):=j=02yjcj(x)=y0(x1)(x2)(01)(02)+y1(x0)(x2)(10)(12)+y2(x0)(x1)(20)(21)=y0(x1)(x2)2+y1(x0)(x2)1+y2(x0)(x1)2.{displaystyle {begin{aligned}y(x):=sum _{j=0}^{2}y_{j}c_{j}(x)=y_{0}{frac {(x-1)(x-2)}{(0-1)(0-2)}}+y_{1}{frac {(x-0)(x-2)}{(1-0)(1-2)}}+y_{2}{frac {(x-0)(x-1)}{(2-0)(2-1)}}=y_{0}{frac {(x-1)(x-2)}{2}}+y_{1}{frac {(x-0)(x-2)}{-1}}+y_{2}{frac {(x-0)(x-1)}{2}}.end{aligned}}}

This quadratic interpolation is valid for any position x, near or far from the given positions. So, given 3 equally-spaced data points at x=0,1,2{displaystyle x=0,1,2} defining a quadratic polynomial, at an example desired position x=1.5{displaystyle x=1.5}, the interpolated value after simplification is given by y(1.5)=y1.5=(y0+6y1+3y2)/8{displaystyle y(1.5)=y_{1.5}=(-y_{0}+6y_{1}+3y_{2})/8}

Excel

This is a quadratic interpolation typically used in the Multigrid method. Again given 3 equally-spaced data points at x=0,1,2{displaystyle x=0,1,2} defining a quadratic polynomial, at the next equally spaced position x=3{displaystyle x=3}, the interpolated value after simplification is given by

y(3)=y3=y03y1+3y2{displaystyle y(3)=y_{3}=y_{0}-3y_{1}+3y_{2}}

In the above polynomial interpolations using a linear combination of the given values, the coefficients were determined using the Lagrange method. In some scenarios, the coefficients can be more easily determined using other methods. Examples follow.

Program

According to the method of finite differences, for any polynomial of degree d or less, any sequence of d+2{displaystyle d+2} values at equally spaced positions has a (d+1){displaystyle (d+1)}th difference exactly equal to 0. The element sd+1 of the Binomial transform is such a (d+1){displaystyle (d+1)}th difference. This area is surveyed here[7]. The binomial transform, T, of a sequence of values {vn}, is the sequence {sn} defined by

sn=k=0n(1)k(nk)vk.{displaystyle s_{n}=sum _{k=0}^{n}(-1)^{k}{n choose k}v_{k}.}

Ignoring the sign term (1)k{displaystyle (-1)^{k}}, the n+1{displaystyle n+1} coefficients of the element sn are the respective n+1{displaystyle n+1} elements of the row n of Pascal's Triangle. The triangle of binomial transform coefficients is like Pascal's triangle. The entry in the nth row and kth column of the BTC triangle is (1)k(nk){displaystyle (-1)^{k}{tbinom {n}{k}}} for any non-negative integer n and any integer k between 0 and n. This results in the following example rows n = 0 through n = 7, top to bottom, for the BTC triangle:

For convenience, each row n of the above example BTC triangle also has a label d=n1{displaystyle d=n-1}. Thus for any polynomial of degree d or less, any sequence of d+2{displaystyle d+2} values at equally spaced positions has a linear combination result of 0, when using the d+2{displaystyle d+2} elements of row d as the corresponding linear coefficients.

For example, 4 equally spaced data points of a quadratic polynomial obey the linear equation given by row d=2{displaystyle d=2} of the BTC triangle. 0=y03y1+3y2y3{displaystyle 0=y_{0}-3y_{1}+3y_{2}-y_{3}} This is the same linear equation as obtained above using the Lagrange method.

The BTC triangle can also be used to derive other polynomial interpolations. For example, the above quadratic interpolation

y(1.5)=y1.5=(y0+6y1+3y2)/8{displaystyle y(1.5)=y_{1.5}=(-y_{0}+6y_{1}+3y_{2})/8}

can be derived in 3 simple steps as follows. The equally spaced points of a quadratic polynomial obey the rows of the BTC triangle with d=2{displaystyle d=2} or higher. First, the row d=3{displaystyle d=3} spans the given and desired data points y0,y1,y1.5,y2{displaystyle y_{0},y_{1},y_{1.5},y_{2}} with the linear equation

0=1y04y0.5+6y14y1.5+1y2{displaystyle 0=1y_{0}-4y_{0.5}+6y_{1}-4y_{1.5}+1y_{2}}

Second, the unwanted data point y0.5{displaystyle y_{0.5}} is replaced by an expression in terms of wanted data points. The row d=2{displaystyle d=2} provides a linear equation with a term 1y0.5{displaystyle 1y_{0.5}}, which results in a term 4y0.5{displaystyle 4y_{0.5}} by multiplying the linear equation by 4. 0=1y0.53y1+3y1.51y2=4y0.512y1+12y1.54y2{displaystyle 0=1y_{0.5}-3y_{1}+3y_{1.5}-1y_{2}=4y_{0.5}-12y_{1}+12y_{1.5}-4y_{2}} Third, the above two linear equations are added to yield a linear equation equivalent to the above quadratic interpolation for y1.5{displaystyle y_{1.5}}. 0=(1+0)y0+(4+4)y0.5+(612)y1+(4+12)y1.5+(14)y2=y06y1+8y1.53y2{displaystyle 0=(1+0)y_{0}+(-4+4)y_{0.5}+(6-12)y_{1}+(-4+12)y_{1.5}+(1-4)y_{2}=y_{0}-6y_{1}+8y_{1.5}-3y_{2}}

Similar to other uses of linear equations, the above derivation scales and adds vectors of coefficients. In polynomial interpolation as a linear combination of values, the elements of a vector correspond to a contiguous sequence of regularly spaced positions. The p non-zero elements of a vector are the p coefficients in a linear equation obeyed by any sequence of p data points from any degree d polynomial on any regularly spaced grid, where d is noted by the subscript of the vector. For any vector of coefficients, the subscript obeys dp2{displaystyle dleq p-2}. When adding vectors with various subscript values, the lowest subscript applies for the resulting vector. So, starting with the vector of row d=3{displaystyle d=3} and the vector of row d=2{displaystyle d=2} of the BTC triangle, the above quadratic interpolation for y1.5{displaystyle y_{1.5}} is derived by the vector calculation

(1,4,6,4,1)3+4(0,1,3,3,1)2=(1,0,6,+8,3)2{displaystyle (1,-4,6,-4,1)_{3}+4*(0,1,-3,3,-1)_{2}=(1,0,-6,+8,-3)_{2}}

Similarly, the cubic interpolation typical in the Multigrid method,

y1.5=(y0+9y1+9y2y3)/16,{displaystyle y_{1.5}=(-y_{0}+9y_{1}+9y_{2}-y_{3})/16,}

can be derived by a vector calculation starting with the vector of row d=5{displaystyle d=5} and the vector of row d=3{displaystyle d=3} of the BTC triangle.

(1,6,15,20,15,6,1)5+6(0,1,4,6,4,1,0)3=(1,0,9,16,9,0,1)3{displaystyle (1,-6,15,-20,15,-6,1)_{5}+6*(0,1,-4,6,-4,1,0)_{3}=(1,0,-9,16,-9,0,1)_{3}}

Interpolation error[edit]

When interpolating a given function f by a polynomial of degree n at the nodes x0,..,xn we get the error

f(x)pn(x)=f[x0,,xn,x]i=0n(xxi){displaystyle f(x)-p_{n}(x)=f[x_{0},ldots ,x_{n},x]prod _{i=0}^{n}(x-x_{i})}

where

f[x0,,xn,x]{displaystyle f[x_{0},ldots ,x_{n},x]}

is the notation for divided differences.

If f is n + 1 times continuously differentiable on a closed interval I and pn(x){displaystyle p_{n}(x)} is a polynomial of degree at most n that interpolates f at n + 1 distinct points {xi} (i=0,1,..,n) in that interval, then for each x in the interval there exists ξ in that interval such that

f(x)pn(x)=f(n+1)(ξ)(n+1)!i=0n(xxi).{displaystyle f(x)-p_{n}(x)={frac {f^{(n+1)}(xi )}{(n+1)!}}prod _{i=0}^{n}(x-x_{i}).}

The above error bound suggests choosing the interpolation points xi such that the product (xxi),{displaystyle left prod (x-x_{i})right ,} is as small as possible. The Chebyshev nodes achieve this.

Proof[edit]

Set the error term as

Rn(x)=f(x)pn(x){displaystyle R_{n}(x)=f(x)-p_{n}(x)}

and set up an auxiliary function:

Y(t)=Rn(t)Rn(x)W(x)W(t){displaystyle Y(t)=R_{n}(t)-{frac {R_{n}(x)}{W(x)}}W(t)}

where

W(t)=i=0n(txi){displaystyle W(t)=prod _{i=0}^{n}(t-x_{i})}

Since xi are roots of Rn(t){displaystyle R_{n}(t)} and W(t){displaystyle W(t)}, we have Y(x) = Y(xi) = 0, which means Y has at least n + 2 roots. From Rolle's theorem, Y(t){displaystyle Y^{prime }(t)} has at least n + 1 roots, then Y(n+1)(t){displaystyle Y^{(n+1)}(t)} has at least one root ξ, where ξ is in the interval I.

So we can get

Y(n+1)(t)=Rn(n+1)(t)Rn(x)W(x)(n+1)!{displaystyle Y^{(n+1)}(t)=R_{n}^{(n+1)}(t)-{frac {R_{n}(x)}{W(x)}} (n+1)!}

Since pn(x){displaystyle p_{n}(x)} is a polynomial of degree at most n, then

Rn(n+1)(t)=f(n+1)(t){displaystyle R_{n}^{(n+1)}(t)=f^{(n+1)}(t)}

Thus

Y(n+1)(t)=f(n+1)(t)Rn(x)W(x)(n+1)!{displaystyle Y^{(n+1)}(t)=f^{(n+1)}(t)-{frac {R_{n}(x)}{W(x)}} (n+1)!}

Since ξ is the root of Y(n+1)(t){displaystyle Y^{(n+1)}(t)}, so

Y(n+1)(ξ)=f(n+1)(ξ)Rn(x)W(x)(n+1)!=0{displaystyle Y^{(n+1)}(xi )=f^{(n+1)}(xi )-{frac {R_{n}(x)}{W(x)}} (n+1)!=0}

Therefore,

Rn(x)=f(x)pn(x)=f(n+1)(ξ)(n+1)!i=0n(xxi){displaystyle R_{n}(x)=f(x)-p_{n}(x)={frac {f^{(n+1)}(xi )}{(n+1)!}}prod _{i=0}^{n}(x-x_{i})}.

Thus the remainder term in the Lagrange form of the Taylor theorem is a special case of interpolation error when all interpolation nodes xi are identical.[8] Note that the error will be zero when x=xi{displaystyle x=x_{i}} for any i. Thus, the maximum error will occur at some point in the interval between two successive nodes.

For equally spaced intervals[edit]

In the case of equally spaced interpolation nodes where xi=a+ih{displaystyle x_{i}=a+ih}, for i=0,1,,n,{displaystyle i=0,1,ldots ,n,} and where h=(ba)/n,{displaystyle h=(b-a)/n,} the product term in the interpolation error formula can be bound as[9]

i=0n(xxi)=i=0nxxin!4hn+1{displaystyle left prod _{i=0}^{n}(x-x_{i})right =prod _{i=0}^{n}left x-x_{i}right leq {frac {n!}{4}}h^{n+1}}.

Thus the error bound can be given as

Rn(x)hn+14(n+1)maxξ[a,b]f(n+1)(ξ){displaystyle left R_{n}(x)right leq {frac {h^{n+1}}{4(n+1)}}max _{xi in [a,b]}left f^{(n+1)}(xi )right }

However, this assumes that f(n+1)(ξ){displaystyle f^{(n+1)}(xi )} is dominated by hn+1{displaystyle h^{n+1}}, i.e. f(n+1)(ξ)hn+11{displaystyle f^{(n+1)}(xi )h^{n+1}ll 1}. In several cases, this is not true and the error actually increases as n → ∞ (see Runge's phenomenon). That question is treated in the section Convergence properties.

Lebesgue constants[edit]

See the main article: Lebesgue constant.

We fix the interpolation nodes x0, .., xn and an interval [a, b] containing all the interpolation nodes. The process of interpolation maps the function f to a polynomial p. This defines a mapping X from the space C([a, b]) of all continuous functions on [a, b] to itself. The map X is linear and it is a projection on the subspace Πn of polynomials of degree n or less.

The Lebesgue constant L is defined as the operator norm of X. One has (a special case of Lebesgue's lemma):

fX(f)(L+1)fp.{displaystyle f-X(f) leq (L+1) f-p^{*} .}

In other words, the interpolation polynomial is at most a factor (L + 1) worse than the best possible approximation. This suggests that we look for a set of interpolation nodes that makes L small. In particular, we have for Chebyshev nodes:

L2πlog(n+1)+1.{displaystyle Lleq {frac {2}{pi }}log(n+1)+1.}

We conclude again that Chebyshev nodes are a very good choice for polynomial interpolation, as the growth in n is exponential for equidistant nodes. However, those nodes are not optimal.

Convergence properties[edit]

It is natural to ask, for which classes of functions and for which interpolation nodes the sequence of interpolating polynomials converges to the interpolated function as n → ∞? Convergence may be understood in different ways, e.g. pointwise, uniform or in some integral norm.

The situation is rather bad for equidistant nodes, in that uniform convergence is not even guaranteed for infinitely differentiable functions. One classical example, due to Carl Runge, is the function f(x) = 1 / (1 + x2) on the interval [−5, 5]. The interpolation error fpn grows without bound as n → ∞. Another example is the function f(x) = x on the interval [−1, 1], for which the interpolating polynomials do not even converge pointwise except at the three points x = ±1, 0.[10]

One might think that better convergence properties may be obtained by choosing different interpolation nodes. The following result seems to give a rather encouraging answer: Fahren lernen max keygen for mac torrent.

Theorem. For any function f(x) continuous on an interval [a,b] there exists a table of nodes for which the sequence of interpolating polynomials pn(x){displaystyle p_{n}(x)} converges to f(x) uniformly on [a,b].

Proof. It's clear that the sequence of polynomials of best approximation pn(x){displaystyle p_{n}^{*}(x)} converges to f(x) uniformly (due to Weierstrass approximation theorem). Now we have only to show that each pn(x){displaystyle p_{n}^{*}(x)} may be obtained by means of interpolation on certain nodes. But this is true due to a special property of polynomials of best approximation known from the equioscillation theorem. Specifically, we know that such polynomials should intersect f(x) at least n + 1 times. Choosing the points of intersection as interpolation nodes we obtain the interpolating polynomial coinciding with the best approximation polynomial.

The defect of this method, however, is that interpolation nodes should be calculated anew for each new function f(x), but the algorithm is hard to be implemented numerically. Does there exist a single table of nodes for which the sequence of interpolating polynomials converge to any continuous function f(x)? The answer is unfortunately negative:

Theorem. For any table of nodes there is a continuous function f(x) on an interval [a, b] for which the sequence of interpolating polynomials diverges on [a,b].[11]

The proof essentially uses the lower bound estimation of the Lebesgue constant, which we defined above to be the operator norm of Xn (where Xn is the projection operator on Πn). Now we seek a table of nodes for which

limnXnf=f, for every fC([a,b]).{displaystyle lim _{nto infty }X_{n}f=f,{text{ for every }}fin C([a,b]).}

Due to the Banach–Steinhaus theorem, this is only possible when norms of Xn are uniformly bounded, which cannot be true since we know that

Xn2πlog(n+1)+C.{displaystyle X_{n} geq {tfrac {2}{pi }}log(n+1)+C.}

For example, if equidistant points are chosen as interpolation nodes, the function from Runge's phenomenon demonstrates divergence of such interpolation. Note that this function is not only continuous but even infinitely differentiable on [−1, 1]. For better Chebyshev nodes, however, such an example is much harder to find due to the following result:

Theorem. For every absolutely continuous function on [−1, 1] the sequence of interpolating polynomials constructed on Chebyshev nodes converges to f(x) uniformly.[12]

Related concepts[edit]

Runge's phenomenon shows that for high values of n, the interpolation polynomial may oscillate wildly between the data points. This problem is commonly resolved by the use of spline interpolation. Here, the interpolant is not a polynomial but a spline: a chain of several polynomials of a lower degree.

Interpolation of periodic functions by harmonic functions is accomplished by Fourier transform. This can be seen as a form of polynomial interpolation with harmonic base functions, see trigonometric interpolation and trigonometric polynomial.

Hermite interpolation problems are those where not only the values of the polynomial p at the nodes are given, but also all derivatives up to a given order. This turns out to be equivalent to a system of simultaneous polynomial congruences, and may be solved by means of the Chinese remainder theorem for polynomials. Birkhoff interpolation is a further generalization where only derivatives of some orders are prescribed, not necessarily all orders from 0 to a k.

Collocation methods for the solution of differential and integral equations are based on polynomial interpolation.

The technique of rational function modeling is a generalization that considers ratios of polynomial functions.

Linear Program Polynomial Interpolation Methods Pdf

At last, multivariate interpolation for higher dimensions.

See also[edit]

Notes[edit]

  1. ^Tiemann, Jerome J. (May–June 1981). 'Polynomial Interpolation'. I/O News. 1 (5): 16. ISSN0274-9998. Retrieved 3 November 2017.
  2. ^Gautschi, Walter (1975). 'Norm Estimates for Inverses of Vandermonde Matrices'. Numerische Mathematik. 23 (4): 337–347. doi:10.1007/BF01438260.
  3. ^Higham, N. J. (1988). 'Fast Solution of Vandermonde-Like Systems Involving Orthogonal Polynomials'. IMA Journal of Numerical Analysis. 8 (4): 473–486. doi:10.1093/imanum/8.4.473.
  4. ^Björck, Å; V. Pereyra (1970). 'Solution of Vandermonde Systems of Equations'. Mathematics of Computation. American Mathematical Society. 24 (112): 893–903. doi:10.2307/2004623. JSTOR2004623.
  5. ^Calvetti, D & Reichel, L (1993). 'Fast Inversion of Vanderomnde-Like Matrices Involving Orthogonal Polynomials'. BIT. 33 (33): 473–484. doi:10.1007/BF01990529.
  6. ^R.Bevilaqua, D. Bini, M.Capovani and O. Menchi (2003). Appunti di Calcolo Numerico. Chapter 5, p. 89. Servizio Editoriale Universitario Pisa - Azienda Regionale Diritto allo Studio Universitario.
  7. ^Boyadzhiev, Boyad (2012). 'Close Encounters with the Stirling Numbers of the Second Kind'(PDF). Math. Mag. 85: 252–266.
  8. ^'Errors in Polynomial Interpolation'(PDF).
  9. ^'Notes on Polynomial Interpolation'(PDF).
  10. ^Watson (1980, p. 21) attributes the last example to Bernstein (1912).
  11. ^Watson (1980, p. 21) attributes this theorem to Faber (1914).
  12. ^Krylov, V. I. (1956). 'Сходимость алгебраического интерполирования покорням многочленов Чебышева для абсолютно непрерывных функций и функций с ограниченным изменением' [Convergence of algebraic interpolation with respect to the roots of Chebyshev's polynomial for absolutely continuous functions and functions of bounded variation]. Doklady Akademii Nauk SSSR (N.S.) (in Russian). 107: 362–365. MR 18-32.

Polynomial Interpolation Excel

References[edit]

  • Atkinson, Kendell A. (1988), 'Chapter 3.', An Introduction to Numerical Analysis (2nd ed.), John Wiley and Sons, ISBN0-471-50023-2
  • Bernstein, Sergei N. (1912), 'Sur l'ordre de la meilleure approximation des fonctions continues par les polynômes de degré donné' [On the order of the best approximation of continuous functions by polynomials of a given degree], Mem. Acad. Roy. Belg. (in French), 4: 1–104
  • Brutman, L. (1997), 'Lebesgue functions for polynomial interpolation — a survey', Ann. Numer. Math., 4: 111–127
  • Faber, Georg (1914), 'Über die interpolatorische Darstellung stetiger Funktionen' [On the Interpolation of Continuous Functions], Deutsche Math. Jahr. (in German), 23: 192–210
  • Powell, M. J. D. (1981), 'Chapter 4', Approximation Theory and Methods, Cambridge University Press, ISBN0-521-29514-9
  • Schatzman, Michelle (2002), 'Chapter 4', Numerical Analysis: A Mathematical Introduction, Oxford: Clarendon Press, ISBN0-19-850279-6
  • Süli, Endre; Mayers, David (2003), 'Chapter 6', An Introduction to Numerical Analysis, Cambridge University Press, ISBN0-521-00794-1
  • Watson, G. Alistair (1980), Approximation Theory and Numerical Methods, John Wiley, ISBN0-471-27706-1

External links[edit]

  • Hazewinkel, Michiel, ed. (2001) [1994], 'Interpolation process', Encyclopedia of Mathematics, Springer Science+Business Media B.V. / Kluwer Academic Publishers, ISBN978-1-55608-010-4
  • ALGLIB has an implementations in C++ / C#.
  • GSL has a polynomial interpolation code in C
  • Interpolating Polynomial by Stephen Wolfram, the Wolfram Demonstrations Project.
Retrieved from 'https://en.wikipedia.org/w/index.php?title=Polynomial_interpolation&oldid=913772386'