17 releases

0.7.2 May 22, 2023
0.6.2 Jul 6, 2023
0.6.1 Jul 1, 2022
0.6.0 Jan 24, 2022
0.1.1 Jul 28, 2020

#35 in Graphics APIs

Download history 858/week @ 2023-11-20 1022/week @ 2023-11-27 970/week @ 2023-12-04 1354/week @ 2023-12-11 1004/week @ 2023-12-18 192/week @ 2023-12-25 649/week @ 2024-01-01 997/week @ 2024-01-08 825/week @ 2024-01-15 460/week @ 2024-01-22 392/week @ 2024-01-29 657/week @ 2024-02-05 677/week @ 2024-02-12 611/week @ 2024-02-19 661/week @ 2024-02-26 821/week @ 2024-03-04

2,816 downloads per month
Used in 30 crates (2 directly)

MIT/Apache

86KB
1.5K SLoC

rust-gpu-tools Crates.io

An abstraction library to run kernels on both CUDA and OpenCL.

Example

You need to write the code that interacts with the GPU only once. Below is such code that runs a kernel on CUDA and/or OpenCL. For a full working example, please see the examples directory. You can run it via cargo run --example add.

let closures = program_closures!(|program, _args| -> Result<Vec<u32>, GPUError> {
    // Make sure the input data has the same length.
    assert_eq!(aa.len(), bb.len());
    let length = aa.len();

    // Copy the data to the GPU.
    let aa_buffer = program.create_buffer_from_slice(&aa)?;
    let bb_buffer = program.create_buffer_from_slice(&bb)?;

    // The result buffer has the same length as the input buffers.
    let result_buffer = unsafe { program.create_buffer::<u32>(length)? };

    // Get the kernel.
    let kernel = program.create_kernel("add", 8, 4)?;

    // Execute the kernel.
    kernel
        .arg(&(length as u32))
        .arg(&aa_buffer)
        .arg(&bb_buffer)
        .arg(&result_buffer)
        .run()?;

    // Get the resulting data.
    let mut result = vec![0u32; length];
    program.read_into_buffer(&result_buffer, &mut result)?;

    Ok(result)
});

License

Licensed under either of

at your option.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

Dependencies

~1–10MB
~93K SLoC