Davidon–Fletcher–Powell formula

The Davidon–Fletcher–Powell formula (or DFP; named after William C. Davidon, Roger Fletcher, and Michael J. D. Powell) finds the solution to the secant equation that is closest to the current estimate and satisfies the curvature condition. It was the first quasi-Newton method to generalize the secant method to a multidimensional problem. This update maintains the symmetry and positive definiteness of the Hessian matrix.

Given a function , its gradient (), and positive-definite Hessian matrix , the Taylor series is

and the Taylor series of the gradient itself (secant equation)

is used to update .

The DFP formula finds a solution that is symmetric, positive-definite and closest to the current approximate value of :

where

and is a symmetric and positive-definite matrix.

The corresponding update to the inverse Hessian approximation is given by

is assumed to be positive-definite, and the vectors and must satisfy the curvature condition

The DFP formula is quite effective, but it was soon superseded by the Broyden–Fletcher–Goldfarb–Shanno formula, which is its dual (interchanging the roles of y and s).[1]

Compact representation

edit

By unwinding the matrix recurrence for  , the DFP formula can be expressed as a compact matrix representation. Specifically, defining

   

and upper triangular and diagonal matrices

 

the DFP matrix has the equivalent formula

 

 

 

The inverse compact representation can be found by applying the Sherman-Morrison-Woodbury inverse to  . The compact representation is particularly useful for limited-memory and constrained problems.[2]

See also

edit

References

edit
  1. ^ Avriel, Mordecai (1976). Nonlinear Programming: Analysis and Methods. Prentice-Hall. pp. 352–353. ISBN 0-13-623603-0.
  2. ^ Brust, J. J. (2024). "Useful Compact Representations for Data-Fitting". arXiv:2403.12206 [math.OC].

Further reading

edit