1 unstable release

0.1.0 Mar 26, 2023

#13 in #gpt-4

MIT license

21KB
322 lines

A browser AI agent, using GPT-4

This project provides a bridge between GPT-4 and a headless Chromium browser, allowing you to automate actions simply by describing them to the program. It takes the form of a Rust CLI, but also exports most of the internals as a library for others to use.

Installation

browser-agent is built using Rust, so you'll need to install the Rust toolchain. You can do this by following the instructions at rustup.rs.

Once you have Rust installed, you can install browser-agent by running:

cargo install browser-agent

Usage

Usage: browser-agent [OPTIONS] <GOAL>

Arguments:
  <GOAL>  The goal for the agent to achieve

Options:
      --visual                Whether to show the browser window. Warning: this makes the agent more unreliable
  -v...                       Set the verbosity level, can be used multiple times
      --include-page-content  Whether to include text from the page in the prompt
  -h, --help                  Print help
  -V, --version               Print version

Aknowledgements

This project was inspired and builds on top of Nat Friedman's natbot experiment.

License

This project is licensed under the MIT license. See LICENSE for more details.

Dependencies

~23–38MB
~634K SLoC