8 breaking releases

0.9.0 Apr 3, 2024
0.8.0 Dec 15, 2023
0.7.0 Dec 5, 2023
0.5.0 Jan 24, 2022
0.2.0 Nov 28, 2020

#172 in Database interfaces

Download history 5/week @ 2024-02-18 13/week @ 2024-02-25 1/week @ 2024-03-17 1/week @ 2024-03-24 187/week @ 2024-03-31 95/week @ 2024-04-07

284 downloads per month

Unlicense

3MB
1.5K SLoC

Tass

A pager for tabular data

It's like less, but for CSV and parquet files! It looks like this:

You can see tass's schema inference in action here. The fourth and fifth columns only contains a small number of unique values, so tass colour-codes them automatically. The columns to the right of those are numerical, so tass right-aligns them and shows negative values in red.

This demo shows data being piped into (an older version of) tass. You can also open files which are still being appeanded to, and new rows will appear as the file grows.

Installing

If you already have rust, you can install tass like this:

$ cargo install tass

Usage

You can pass a filename, or pipe data to stdin:

$ tass mydata.csv
$ cat mydata.csv | tass
Key Action
Up/j, PageUp Move up one row, page
Down/k,PageDown Move down one row, page
Left/h Move left one column
Right/l Move right one column
Home, End Move to start/end of file
number g Move to line $number
f Move to end and auto-scroll as new rows come in
/, ? Search, reverse-search for string
n, N Jump to next, previous match
q/Esc Quit

Comparison to other tools

Tool Functionality Filetypes Loads whole file into memory Streaming
tass ⭐ Viewing data, basic searching CSV, parquet no 😌 ✔️
csvlens ⭐ Similar to tass CSV, TSV no 😌 ✔️
VisiData ⭐⭐ Summary stats, plots, etc. CSV, parquet, JSON, ... yes 😱 ✔️
Excel/Calc/Numbers/Google Sheets ⭐⭐⭐ It's a spreadsheet! CSV, xls, ods, ... yes 😱

My advice is to use the most full-featured tool you can get away with. However, if you are cursed with multi-gigabyte CSV files, then your options are limited.

Tips & tricks

Here are some more tips for working with large CSV files:

  • If you want to see summary statistics but don't care about the underlying data itself, you can still use VisiData/Excel/etc.: just downsample it first. This will reduce the precision of your stats and plots, but not the overall shape (probably). xsv has a subcommand that can do this for you.
  • If you want exact answers to complex questions, consider dumping your CSV into a sqlite database and working with that instead.
  • For ad-hoc computations on CSV files, take a look at frawk - it's really nice.

Dependencies

~32–50MB
~1M SLoC