58 releases (25 breaking)

0.25.3 Jan 8, 2025
0.25.2 Dec 12, 2024
0.25.0 Nov 20, 2024
0.22.7 Dec 12, 2024
0.0.2 Oct 14, 2022

#93 in Parser implementations

Download history 25540/week @ 2024-10-08 22057/week @ 2024-10-15 22513/week @ 2024-10-22 23757/week @ 2024-10-29 23888/week @ 2024-11-05 23481/week @ 2024-11-12 22822/week @ 2024-11-19 26281/week @ 2024-11-26 26998/week @ 2024-12-03 28627/week @ 2024-12-10 26335/week @ 2024-12-17 20766/week @ 2024-12-24 27896/week @ 2024-12-31 35975/week @ 2025-01-07 38686/week @ 2025-01-14 31468/week @ 2025-01-21

137,668 downloads per month
Used in 346 crates (5 directly)

MIT/Apache

2.5MB
59K SLoC

read-fonts

This crate handles parsing and reading of OpenType fonts. It is intended to be a high performance implementation suitable for tasks such as shaping, while still providing a convenient, high-level API.

Safety

Unsafe code is forbidden by a #![forbid(unsafe_code)] attribute in the root of the library.

codegen

Much of the code in this crate is generated automatically. This generated code lives in the generated directory. Each file in this directory is included in a module in the src directory, using Rust's include! macro. This allows us to separate the generated code from any custom implementation code, while allowing them to exist in the same module.

what we generate

With certain exceptions that require manual handling, we generate code for each table, record, enum, and flags type for each portion of the spec that we cover.

tables

All tables are aliases of the type TableRef<Marker>, where Marker is a struct that indicates the type of the table. For instance, the GDEF table is defined as TableRef<GdefMarker>. TableRef itself is a wrapper around a slice of bytes, with the marker type providing typed access to those bytes.

The marker type can only be created from a specific byte slice, and is always associated with that slice. It is created through a parse method that performs a one-time validation of the slice, ensuring that all expected fields are present. This includes bounds checking any arrays, as well as ensuring the presence of fields the existence of which may depend on the table's version.

variable lengths and version-dependent fields

n.b: the design described below has not been benchmarked against the alternatives, and may change

For fields that have variable length, or which only exist in certain table versions, the marker struct has a corresponding field where that length or offset is stored. This means that at runtime there is no need to double check a version or length.

TableRef methods

For each table, we define methods on the type TableRef<Marker> that provide access to that table's fields. These methods use the marker type to determine the byte range of a given field, and then interpret those bytes as the appropriate type.

records

Unlike tables, which are essentially a set of methods for reading into a byte slice, records are in general represented as simple packed structs containing scalar types in big-endian encodings. This means that, in general, records are zerocopy types that can be cast from raw bytes.

The exception to this is when a record has variable length; in this case the record is still a simple struct, but cannot be cast from raw bytes, and must be copied.

flags and enums

For flags, we generate a type modeled on those generated by the bitflags! macro. For enums, we generate a raw Rust enum.

Dependencies

~0.3–1MB
~20K SLoC