#non-linear #fitting #function #regression #linear-algebra #least-squares

varpro

A straightforward nonlinear least-squares fitting library which uses the Variable Projection algorithm

14 releases (breaking)

0.10.1 Dec 1, 2024
0.10.0 Jul 27, 2024
0.9.0 May 21, 2024
0.8.0 Feb 9, 2024
0.2.0 Mar 16, 2021

#84 in Algorithms


885 downloads per month

MIT license

290KB
4K SLoC

Rust 3.5K SLoC // 0.1% comments Objective-C 739 SLoC Python 46 SLoC // 0.3% comments Shell 3 SLoC // 0.4% comments

varpro


Nonlinear function fitting made simple. This library provides robust and fast least-squares fitting of a wide class of model functions to data. It uses the VarPro algorithm to achieve this, hence the name.

Introduction

This crate implements a powerful algorithm to fit model functions to data, but it is restricted to so-called separable models. The lack of formulas on this site makes it hard to go into detail, but a brief overview is provided in the next sections. Refer to the documentation for all the meaty details, including the math.

What are Separable Models?

Put simply, separable models are nonlinear functions which can be written as a linear combination of some nonlinear basis functions. A common use case for VarPro is e.g. fitting sums of exponentials, which is a notoriously ill-conditioned problem.
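As a minimal sketch of what "separable" means, take the double exponential decay with constant offset that is fitted later on this page: f(t) = c1*exp(-t/tau1) + c2*exp(-t/tau2) + c3. The model is linear in the coefficients c1, c2, c3 and nonlinear in tau1 and tau2. The names basis_functions and evaluate_model below are chosen for illustration only and are not part of the varpro API.

```rust
/// Evaluate the three basis functions at a single coordinate `t`.
/// Only the nonlinear parameters `tau1` and `tau2` enter here.
fn basis_functions(t: f64, tau1: f64, tau2: f64) -> [f64; 3] {
    [(-t / tau1).exp(), (-t / tau2).exp(), 1.0]
}

/// The model value is the linear combination of the basis functions
/// with the linear coefficients `c`.
fn evaluate_model(t: f64, c: [f64; 3], tau1: f64, tau2: f64) -> f64 {
    basis_functions(t, tau1, tau2)
        .iter()
        .zip(c.iter())
        .map(|(phi, c)| phi * c)
        .sum()
}

fn main() {
    let c = [2.0, 3.0, 1.0];
    // At t = 0 both exponentials equal 1, so the model value is c1 + c2 + c3.
    println!("f(0) = {}", evaluate_model(0.0, c, 2.5, 5.5));
}
```

Scaling all linear coefficients by a factor scales the model value by the same factor, which is exactly the structure VarPro exploits.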

What is VarPro?

Variable Projection (VarPro) is an algorithm that takes advantage of the fact that the fitting problem can be separated into linear and nonlinear parameters. First, the linear parameters are eliminated using some clever linear algebra. Then, the fitting problem is rewritten so that it depends only on the nonlinear parameters. Finally, this reduced problem is solved by using a general-purpose nonlinear minimization algorithm, such as Levenberg-Marquardt (LM).
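The three steps can be sketched on a toy problem using only the standard library. The model here is a single decay y = c*exp(-t/tau) with one linear parameter c and one nonlinear parameter tau. This is not how the crate is implemented: the linear elimination is a closed-form formula that only works for a single basis function, and a golden-section search stands in for Levenberg-Marquardt to keep the sketch short. All function names are made up for this illustration.

```rust
/// Basis function of the toy model y = c * exp(-t / tau).
fn exp_decay(t: &[f64], tau: f64) -> Vec<f64> {
    t.iter().map(|t| (-t / tau).exp()).collect()
}

/// Step 1: for a fixed tau, the best linear coefficient has a closed form
/// (linear least squares with a single basis function).
fn best_linear_coefficient(phi: &[f64], y: &[f64]) -> f64 {
    let phi_dot_y: f64 = phi.iter().zip(y).map(|(p, y)| p * y).sum();
    let phi_dot_phi: f64 = phi.iter().map(|p| p * p).sum();
    phi_dot_y / phi_dot_phi
}

/// Step 2: the reduced objective depends on tau only, because the
/// linear coefficient is recomputed for every candidate tau.
fn reduced_objective(t: &[f64], y: &[f64], tau: f64) -> f64 {
    let phi = exp_decay(t, tau);
    let c = best_linear_coefficient(&phi, y);
    phi.iter().zip(y).map(|(p, y)| (y - c * p).powi(2)).sum()
}

/// Step 3 (stand-in for LM): golden-section search for the minimum
/// of a one-dimensional function on the interval [lo, hi].
fn golden_section_min(f: impl Fn(f64) -> f64, mut lo: f64, mut hi: f64) -> f64 {
    let inv_phi = (5.0_f64.sqrt() - 1.0) / 2.0;
    for _ in 0..60 {
        let x1 = hi - inv_phi * (hi - lo);
        let x2 = lo + inv_phi * (hi - lo);
        if f(x1) < f(x2) {
            hi = x2;
        } else {
            lo = x1;
        }
    }
    0.5 * (lo + hi)
}

/// Returns the fitted (tau, c).
fn fit_single_exponential(t: &[f64], y: &[f64], lo: f64, hi: f64) -> (f64, f64) {
    let tau = golden_section_min(|tau| reduced_objective(t, y, tau), lo, hi);
    let c = best_linear_coefficient(&exp_decay(t, tau), y);
    (tau, c)
}

fn main() {
    let t: Vec<f64> = (0..=10).map(|i| i as f64).collect();
    // Synthetic, noise-free data generated with c = 2 and tau = 3.
    let y: Vec<f64> = exp_decay(&t, 3.0).iter().map(|p| 2.0 * p).collect();
    let (tau, c) = fit_single_exponential(&t, &y, 0.5, 10.0);
    println!("tau = {:.4}, c = {:.4}", tau, c);
}
```

Note that the search only ever sees tau. The linear coefficient never needs an initial guess, which mirrors how the crate asks for guesses for the nonlinear parameters only.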

When Should You Give it a Try?

VarPro can dramatically increase the robustness and speed of the fitting process compared to using a general purpose nonlinear least squares fitting algorithm. When

  • the model function you want to fit is a linear combination of nonlinear functions,
  • and you know the analytical derivatives of all those functions

then you should give it a whirl. Also consider the section on global fitting below, which provides another great use case for this crate.

Example Usage

The following example shows how to use this crate to fit a double exponential decay with constant offset to a data vector y obtained at time points t. Refer to the documentation for a more in-depth guide.

use varpro::prelude::*;
use varpro::solvers::levmar::{LevMarProblemBuilder, LevMarSolver};
use nalgebra::{dvector,DVector};

// Define the exponential decay e^(-t/tau).
// Both of the nonlinear basis functions in this example
// are exponential decays.
fn exp_decay(t: &DVector<f64>, tau: f64) -> DVector<f64> {
  t.map(|t| (-t / tau).exp())
}

// the partial derivative of the exponential
// decay with respect to the nonlinear parameter tau.
// d/dtau e^(-t/tau) = e^(-t/tau)*t/tau^2
fn exp_decay_dtau(t: &DVector<f64>, tau: f64) -> DVector<f64> {
  t.map(|t| (-t / tau).exp() * t / tau.powi(2))
}

// temporal (or spatial) coordinates of the observations
let t = dvector![0.,1.,2.,3.,4.,5.,6.,7.,8.,9.,10.];
// the observations we want to fit
let y = dvector![6.0,4.8,4.0,3.3,2.8,2.5,2.2,1.9,1.7,1.6,1.5];

// 1. create the model by giving only the nonlinear parameter names it depends on
let model = SeparableModelBuilder::<f64>::new(&["tau1", "tau2"])
  // provide the nonlinear basis functions and their derivatives.
  // In general, base functions can depend on more than just one parameter.
  // first function:
  .function(&["tau1"], exp_decay)
  .partial_deriv("tau1", exp_decay_dtau)
  // second function and derivatives with respect to all parameters
  // that it depends on (just one in this case)
  .function(&["tau2"], exp_decay)
  .partial_deriv("tau2", exp_decay_dtau)
  // a constant offset is added as an invariant base function,
  // i.e. a vector of ones. It is multiplied by its own linear coefficient,
  // creating a fittable offset
  .invariant_function(|v|DVector::from_element(v.len(),1.))
  // give the coordinates of the problem
  .independent_variable(t)
  // provide guesses only for the nonlinear parameters in the
  // order that they were given on construction.
  .initial_parameters(vec![2.5,5.5])
  .build()
  .unwrap();
// 2. Cast the fitting problem as a nonlinear least squares minimization problem
let problem = LevMarProblemBuilder::new(model)
  .observations(y)
  .build()
  .unwrap();
// 3. Solve the fitting problem
let fit_result = LevMarSolver::default()
    .fit(problem)
    .expect("fit must exit successfully");
// 4. obtain the nonlinear parameters after fitting
let alpha = fit_result.nonlinear_parameters();
// 5. obtain the linear parameters
let c = fit_result.linear_coefficients().unwrap();

For more in-depth examples, please refer to the crate documentation.
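Because the partial derivatives are supplied by hand, it can be worth checking them against finite differences before running a fit. The sketch below does this for scalar versions of exp_decay and exp_decay_dtau from the example above, using only the standard library. The helper central_difference is a name chosen for illustration and is not part of varpro.

```rust
/// Scalar version of the basis function e^(-t/tau).
fn exp_decay(t: f64, tau: f64) -> f64 {
    (-t / tau).exp()
}

/// Hand-written partial derivative with respect to tau.
fn exp_decay_dtau(t: f64, tau: f64) -> f64 {
    (-t / tau).exp() * t / tau.powi(2)
}

/// Central finite-difference approximation of f'(x) with step `h`.
fn central_difference(f: impl Fn(f64) -> f64, x: f64, h: f64) -> f64 {
    (f(x + h) - f(x - h)) / (2.0 * h)
}

fn main() {
    let tau = 2.5;
    for i in 0..=10 {
        let t = i as f64;
        let analytic = exp_decay_dtau(t, tau);
        let numeric = central_difference(|tau| exp_decay(t, tau), tau, 1e-5);
        // The two values should agree up to the finite-difference error.
        assert!((analytic - numeric).abs() < 1e-7, "mismatch at t = {}", t);
    }
    println!("analytic derivative matches finite differences");
}
```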

Fit Statistics

In addition to the fit member function, the LevMarSolver provides a fit_with_statistics function that calculates quite a bit of useful additional statistical information.

Global Fitting of Multiple Right Hand Sides

In the example above, we have passed a single column vector as the observations. The library also allows fitting multiple right hand sides, by constructing a problem via LevMarProblemBuilder::mrhs. When fitting multiple right hand sides, varpro will perform a global fit, in which the nonlinear parameters are optimized across all right hand sides, while the linear coefficients of the fit are optimized for each right hand side individually.

This is another application where varpro really shines, since it can take advantage of the separable nature of the problem. It allows us to perform a global fit over thousands, or even tens of thousands of right hand sides in reasonable time (fractions of seconds to minutes), where conventional purely nonlinear solvers must perform much more work.
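The idea behind a global fit can be sketched with the standard library alone. In the toy problem below, every right hand side follows y_k = c_k*exp(-t/tau) with a shared tau and its own amplitude c_k. For each candidate tau, the amplitudes are solved per right hand side and the squared residuals are summed into one objective. This is an illustration under simplifying assumptions, not the crate's implementation: it handles a single basis function only, and a golden-section search replaces Levenberg-Marquardt. All names are hypothetical.

```rust
/// Shared basis function exp(-t / tau).
fn exp_decay(t: &[f64], tau: f64) -> Vec<f64> {
    t.iter().map(|t| (-t / tau).exp()).collect()
}

/// For a fixed shared tau: solve one amplitude per right hand side and
/// return the amplitudes together with the summed squared residual.
fn solve_linear_part(t: &[f64], rhs: &[Vec<f64>], tau: f64) -> (Vec<f64>, f64) {
    let phi = exp_decay(t, tau);
    let phi_dot_phi: f64 = phi.iter().map(|p| p * p).sum();
    let mut amplitudes = Vec::with_capacity(rhs.len());
    let mut total_residual = 0.0;
    for y in rhs {
        let c = phi.iter().zip(y).map(|(p, y)| p * y).sum::<f64>() / phi_dot_phi;
        total_residual += phi
            .iter()
            .zip(y)
            .map(|(p, y)| (y - c * p).powi(2))
            .sum::<f64>();
        amplitudes.push(c);
    }
    (amplitudes, total_residual)
}

/// Minimize the summed residual over the shared tau on [lo, hi]
/// with a golden-section search. Returns (tau, amplitudes).
fn global_fit(t: &[f64], rhs: &[Vec<f64>], mut lo: f64, mut hi: f64) -> (f64, Vec<f64>) {
    let inv_phi = (5.0_f64.sqrt() - 1.0) / 2.0;
    let objective = |tau: f64| solve_linear_part(t, rhs, tau).1;
    for _ in 0..60 {
        let x1 = hi - inv_phi * (hi - lo);
        let x2 = lo + inv_phi * (hi - lo);
        if objective(x1) < objective(x2) {
            hi = x2;
        } else {
            lo = x1;
        }
    }
    let tau = 0.5 * (lo + hi);
    (tau, solve_linear_part(t, rhs, tau).0)
}

fn main() {
    let t: Vec<f64> = (0..=10).map(|i| i as f64).collect();
    // Three noise-free right hand sides sharing tau = 3 with different amplitudes.
    let rhs: Vec<Vec<f64>> = [1.0_f64, 2.5, 4.0]
        .iter()
        .map(|a| exp_decay(&t, 3.0).iter().map(|p| a * p).collect())
        .collect();
    let (tau, amplitudes) = global_fit(&t, &rhs, 0.5, 10.0);
    println!("tau = {:.4}, amplitudes = {:?}", tau, amplitudes);
}
```

The nonlinear search stays one-dimensional no matter how many right hand sides are added. Only the cheap linear solve is repeated per right hand side.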

Maximum Performance and Advanced Use Cases

The example code above will already run many times faster than just using a nonlinear solver without the magic of varpro. But this crate offers an additional way to eke out the last bits of performance.

The SeparableNonlinearModel trait can be manually implemented to describe a model function. This often allows us to shave off the last hundreds of microseconds from the computation, e.g. by caching intermediate calculations. The crate documentation contains detailed examples.

This is not only useful for performance, but also for use cases that are difficult or impossible to accommodate using only the SeparableModelBuilder. The builder was created for ease of use and performance, but it has some limitations by design.

Acknowledgements

I am grateful to Professor Dianne P. O'Leary and Bert W. Rust ✝ who published the paper that enabled me to understand varpro and come up with this implementation. Professor O'Leary also graciously answered my questions on her paper and some implementation details.

References and Further Reading

(O'Leary2013) O’Leary, D.P., Rust, B.W. Variable projection for nonlinear least squares problems. Comput Optim Appl 54, 579–593 (2013). DOI: 10.1007/s10589-012-9492-9

Attention: the O'Leary paper contains errors that are fixed (so I hope) in this blog article of mine.

(Golub2003) Golub, G., Pereyra, V. Separable nonlinear least squares: the variable projection method and its applications. Inverse Problems 19, R1 (2003). https://iopscience.iop.org/article/10.1088/0266-5611/19/2/201

Dependencies

~4MB
~87K SLoC