2 releases
0.1.1 | Sep 24, 2023 |
---|---|
0.1.0 | Sep 23, 2023 |
#2207 in Algorithms
Used in config-it
37KB
722 lines
strseq: Immutable String Segmentation in Rust
strseq
is a Rust library designed for handling multiple immutable string segments efficiently. It offers a lean approach to represent string hierarchies without constant delimiter checks or allocating distinct strings for individual tokens.
Features
-
Compact Storage with
StringSequence
:- Stores multiple strings in a single linear buffer.
- Span information (begin, end) is located at the buffer's front.
- Actual string content fills the rest of the buffer.
-
Shared Sequences with
SharedStringSequence
:- Functions similarly to
StringSequence
. - Reference-counted, allowing cheap cloning and sharing across contexts.
- Functions similarly to
-
Mutable Operations with
MutableStringSequence
:- Contains two distinct dynamic buffers: one for indices and another for text.
- Supports efficient mutations like pop, push, and insert.
Use Cases
- Hierarchical path representation.
- Token-based systems where separate string allocation for each token is inefficient.
Getting Started
In your Cargo.toml
, add following:
strseq = "0.1.0"
Features
serde
: Enables serialization and deserialization ofStringSequence
andSharedStringSequence
,MutableStringSequence
using Serde.
Serde representation
All defined structs are represented as list of strings.
Feedback
We welcome contributions, feedback, and issues on our GitHub repository.
Dependencies
~165KB