48 releases (17 breaking)

new 0.18.12 May 15, 2024
0.18.10 Apr 12, 2024
0.18.9 Mar 15, 2024
0.18.6 Nov 19, 2023
0.10.3 Nov 28, 2022

#31 in Hardware support

62,874 downloads per month
Used in 109 crates (19 directly)

MIT license

1.5MB
30K SLoC

pulp is a safe abstraction over SIMD instructions. It lets you write a function once and dispatch to equivalent vectorized versions based on the CPU features detected at runtime.
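To make "write once, dispatch at runtime" concrete, here is a minimal sketch of the general technique using only the standard library. It is not pulp's implementation, and the names double_all, double_all_avx2, and double_all_dispatch are hypothetical. It assumes AVX2 as the only accelerated target and falls back to the plain loop everywhere else.

```rust
// Sketch of runtime feature dispatch with the standard library only.
// This illustrates the idea; it is NOT how pulp is implemented internally.

/// The loop body, written once. `#[inline(always)]` lets it be inlined into
/// each caller and compiled with that caller's enabled target features.
#[inline(always)]
fn double_all(v: &mut [f64]) {
    for x in v.iter_mut() {
        *x *= 2.0;
    }
}

/// Same loop, compiled with AVX2 enabled so the compiler is free to
/// autovectorize it with 256-bit registers.
#[cfg(any(target_arch = "x86", target_arch = "x86_64"))]
#[target_feature(enable = "avx2")]
unsafe fn double_all_avx2(v: &mut [f64]) {
    double_all(v);
}

/// Hypothetical dispatcher: picks a version based on what the CPU supports.
pub fn double_all_dispatch(v: &mut [f64]) {
    #[cfg(any(target_arch = "x86", target_arch = "x86_64"))]
    {
        if std::is_x86_feature_detected!("avx2") {
            // SAFETY: AVX2 support was verified at runtime just above.
            unsafe { double_all_avx2(v) };
            return;
        }
    }
    // Fallback for CPUs without AVX2 and for non-x86 targets.
    double_all(v);
}
```

Calling double_all_dispatch(&mut v) on a vector doubles every element regardless of which path is taken. The hand-written unsafe call and feature check are the boilerplate that a safe abstraction such as pulp is meant to hide.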

Autovectorization example

use pulp::Arch;

let mut v = (0..1000).map(|i| i as f64).collect::<Vec<_>>();
let arch = Arch::new();

arch.dispatch(|| {
    for x in &mut v {
        *x *= 2.0;
    }
});

for (i, x) in v.into_iter().enumerate() {
    assert_eq!(x, 2.0 * i as f64);
}

Manual vectorization example

use pulp::{Arch, Simd, WithSimd};

struct TimesThree<'a>(&'a mut [f64]);
impl<'a> WithSimd for TimesThree<'a> {
    type Output = ();

    #[inline(always)]
    fn with_simd<S: Simd>(self, simd: S) -> Self::Output {
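        let v = self.0;
        // Split the slice into whole SIMD vectors (head) and leftover scalars (tail).
        let (head, tail) = S::f64s_as_mut_simd(v);

        // Broadcast 3.0 to every lane, then multiply one vector at a time.
        let three = simd.f64s_splat(3.0);
        for x in head {
            *x = simd.f64s_mul(three, *x);
        }

        // Handle the trailing elements that do not fill a whole vector.
        for x in tail {
            *x *= 3.0;
        }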
    }
}

let mut v = (0..1000).map(|i| i as f64).collect::<Vec<_>>();
let arch = Arch::new();

arch.dispatch(TimesThree(&mut v));

for (i, x) in v.into_iter().enumerate() {
    assert_eq!(x, 3.0 * i as f64);
}
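The head/tail split in the manual example can be pictured with plain slices. The sketch below uses only the standard library and a hypothetical fixed lane count of 4 (the number of f64 values in a 256-bit register). In pulp the vector width depends on the SIMD type selected at runtime, so treat LANES and times_three_chunked as illustrative assumptions.

```rust
/// Hypothetical lane count; 4 corresponds to f64 lanes in a 256-bit register.
const LANES: usize = 4;

/// Multiplies every element by 3.0, processing full groups of LANES first
/// and the leftover elements one at a time.
pub fn times_three_chunked(v: &mut [f64]) {
    let mut chunks = v.chunks_exact_mut(LANES);
    for chunk in &mut chunks {
        // Stand-in for a single SIMD multiply over one whole register.
        for x in chunk.iter_mut() {
            *x *= 3.0;
        }
    }
    // Fewer than LANES elements remain; handle them as scalars.
    for x in chunks.into_remainder() {
        *x *= 3.0;
    }
}
```

The first loop plays the role of head and the remainder loop plays the role of tail, so a slice whose length is not a multiple of the lane count is still processed completely.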

Less boilerplate using pulp::with_simd

Only available with the macro feature.

The function's first non-lifetime generic parameter and its first input parameter must both be the SIMD type.

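use pulp::Simd;

// Generates a wrapper function `sum` that runs `sum_with_simd` through `pulp::Arch::new()`.
#[pulp::with_simd(sum = pulp::Arch::new())]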
#[inline(always)]
fn sum_with_simd<'a, S: Simd>(simd: S, v: &'a mut [f64]) {
    let (head, tail) = S::f64s_as_mut_simd(v);
    let three = simd.f64s_splat(3.0);
    for x in head {
        *x = simd.f64s_mul(three, *x);
    }
    for x in tail {
        *x = *x * 3.0;
    }
}

let mut v = (0..1000).map(|i| i as f64).collect::<Vec<_>>();
sum(&mut v);

for (i, x) in v.into_iter().enumerate() {
    assert_eq!(x, 3.0 * i as f64);
}

Dependencies

~1MB
~18K SLoC