#html-css #css-selectors #html-content #extract #command-line #html-text #attributes

app select-html

Extract HTML using CSS selectors in the command-line

3 releases

0.1.2 Jan 25, 2024
0.1.1 Sep 23, 2022
0.1.0 Sep 5, 2022

#10 in #html-text

AGPL-3.0-or-later

26KB
61 lines

Select 👁 HTML

Extract HTML using CSS selectors in the command-line.

Features

  • Input HTML using stdin or a file.
  • Output full elements, inner text or attribute values.
  • Trim excess whitespace from output.

Installation

Cargo

With a working Rust and Cargo installation, you can install select-html from Crates.io.

cargo install select-html

Binaries

Precompiled x86_64-unknown-linux-gnu binaries are available on the Releases page.

Examples

# Use stdin to pipe HTML to select-html.
curl -s https://example.org | select-html

# Or use `--file`.
select-html --file 'example.html'

# Select and output HTML.
select-html 'title'

# Output only the inner text of selected elements.
select-html '.example' --text

# Output attribute values.
select-html '[property^="og:"]' -a 'property' -a 'content'

Usage

USAGE:
    select-html [OPTIONS] <SELECTOR>

ARGS:
    <SELECTOR>    The CSS selector to use

OPTIONS:
    -a, --attribute <ATTRIBUTE>    Output the attribute's value from the selected element, can be
                                   used multiple times
        --file <FILE>              A HTML file to read, if not specified stdin will be used instead
    -h, --help                     Print help information
    -t, --text                     Output inner text of the selected elements
        --trim                     Trim whitespace from selected items
    -V, --version                  Print version information

Development

With Nix flakes and direnv installed and enabled, all the required dependencies are automatically loaded from shell.nix. Then cargo-make can be used to build, deploy and lint the code. The available tasks are all described in the Makefile.toml configuration.

Feedback

Found a problem or want to request a new feature? Email me@bauke.xyz and I'll see what I can do for you.

License

Distributed under the AGPL-3.0-or-later license, see LICENSE for more information.

Dependencies

~7–14MB
~169K SLoC