#meta #http-service #json #url #web #extract #page

app pages-meta

This is an application http service that takes url as input and returns json with page meta tags by url

3 unstable releases

0.2.0 Oct 5, 2024
0.1.1 Sep 17, 2024
0.1.0 Sep 16, 2024

#938 in Web programming

Download history 215/week @ 2024-09-11 108/week @ 2024-09-18 11/week @ 2024-09-25 154/week @ 2024-10-02 21/week @ 2024-10-09 2/week @ 2024-10-16

193 downloads per month

MIT license

29KB
598 lines

Page Meta Extractor

Rust License: MIT GitHub release Docker

Page Meta Extractor is an HTTP service that takes a URL as input and returns JSON with page meta tags extracted from the specified URL.

Features

  • Extract page title, description, favicon, and web app manifest information
  • Support for HTTP and HTTPS URLs
  • JSON output for easy integration with other services

Installation

From Binary (Release Page)

  1. Go to the Releases page
  2. Download the latest binary for your platform
  3. Make the binary executable: chmod +x pages-meta
  4. Run the binary: ./pages-meta

Using cargo-binstall

If you have cargo-binstall installed:

cargo binstall pages-meta

Using Cargo Install

If you have Rust and Cargo installed:

cargo install pages-meta

Using Docker

docker pull henek/pages-meta
docker run -p 3000:3000 henek/pages-meta

Configuration

The application can be configured using the following environment variables:

  • HOST: The host address to bind the server to (default: 127.0.0.1)
  • PORT: The port number to listen on (default: 3000)

Example:

HOST=0.0.0.0 PORT=8080 ./pages-meta

Usage

Web Frontend

You can also interact with the service using the web frontend available at https://pages-meta.henek.ru/.

Example

  1. Open the web frontend in your browser: https://pages-meta.henek.ru/.
  2. Enter the URL https://example.com in the input field.
  3. Click the "Extract" button.
  4. The extracted meta information will be displayed in JSON format.

API Usage

Send a GET request to the service with the URL you want to extract meta information from:

http://localhost:3000/https://example.com

The service will return a JSON response with the extracted meta information.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Dependencies

~14–27MB
~415K SLoC