3 releases
0.1.2 | Jan 25, 2024 |
---|---|
0.1.1 | Sep 23, 2022 |
0.1.0 | Sep 5, 2022 |
#10 in #html-text
26KB
61 lines
Select 👁 HTML
Extract HTML using CSS selectors in the command-line.
Features
- Input HTML using stdin or a file.
- Output full elements, inner text or attribute values.
- Trim excess whitespace from output.
Installation
Cargo
With a working Rust and Cargo installation, you can install select-html
from Crates.io.
cargo install select-html
Binaries
Precompiled x86_64-unknown-linux-gnu
binaries are available on the Releases page.
Examples
# Use stdin to pipe HTML to select-html.
curl -s https://example.org | select-html
# Or use `--file`.
select-html --file 'example.html'
# Select and output HTML.
select-html 'title'
# Output only the inner text of selected elements.
select-html '.example' --text
# Output attribute values.
select-html '[property^="og:"]' -a 'property' -a 'content'
Usage
USAGE:
select-html [OPTIONS] <SELECTOR>
ARGS:
<SELECTOR> The CSS selector to use
OPTIONS:
-a, --attribute <ATTRIBUTE> Output the attribute's value from the selected element, can be
used multiple times
--file <FILE> A HTML file to read, if not specified stdin will be used instead
-h, --help Print help information
-t, --text Output inner text of the selected elements
--trim Trim whitespace from selected items
-V, --version Print version information
Development
With Nix flakes and direnv installed and enabled, all the required dependencies are automatically loaded from shell.nix
. Then cargo-make can be used to build, deploy and lint the code. The available tasks are all described in the Makefile.toml
configuration.
Feedback
Found a problem or want to request a new feature? Email me@bauke.xyz and I'll see what I can do for you.
License
Distributed under the AGPL-3.0-or-later license, see LICENSE for more information.
Dependencies
~7–14MB
~169K SLoC