#table #serde #serialization #csv

tsv

tsv (tab-seperated-values) data format for serde (serialization/deserialization)

2 releases

Uses old Rust 2015

0.1.1 Sep 26, 2018
0.1.0 Aug 16, 2018

#1982 in Encoding

Download history 20/week @ 2024-03-11 18/week @ 2024-03-18 15/week @ 2024-03-25 37/week @ 2024-04-01 1/week @ 2024-04-08 4/week @ 2024-04-15 14/week @ 2024-04-22 10/week @ 2024-04-29 14/week @ 2024-05-13 15/week @ 2024-05-20 9/week @ 2024-05-27 26/week @ 2024-06-03 16/week @ 2024-06-10 2/week @ 2024-06-17 16/week @ 2024-06-24

61 downloads per month

MIT license

66KB
1.5K SLoC

The tsv project introduces a new format for data serialization/deserialization, which is text-based and deals with tabular data.

The problem

At serde's point of view, the classic tsv is only applicable to the schema of (a sequence of) a struct composed of primitives( integer, floats, strings etc). The specification has to be extended to allow arbitrary schemas, such as a struct of a struct.

The solution

This project extends the spec by placing sequences in columns. See tsv-spec.txt for specification. It uses serde crate for serialization/deserialization, and reflection crate for generating column names and dealing with enums.

Notice

If you impl Serialize/Deserialize for your types to tell serde they are sequences/maps, do make sure their schemata() and Vec::schemata()/HashMap::schemata() are isomorphic.

Pros

  1. Simple. The only requirement for end users to use tsv files is to understand what a table is. It is deadly simple as a configuration file format for non-technical users.

  2. Available. You can use Microsoft Excel, OpenOffice/LibreOffie Calc and text editors that support elastic tabstops to view/edit tsv files. And it is easy to write tsv by hand if you have read all the 63 lines of the spec.

Cons

  1. Not efficiency-oriented.

  2. Not self-descripting.

License

Under MIT.

Example

A cargo configuration file written in tsv format could look like the following table( with spaces replacing tabs ):

                                        deps
package                         lib             value    
name    version authors keyword macro   name    Version Path
tsv     0.1.0   oooutlk tsv     X       serde   1.0
                        tab             trees           ~/trees
                        table
                        serde

See serialization example and deserialization example.

Dependencies

~2.5MB
~59K SLoC