#text-encoding #utf-8 #utf-16 #data-encoding #bom #unicode

file-content

A library for working with files and common text data encodings

4 releases (2 breaking)

0.3.1 Mar 27, 2024
0.3.0 Mar 25, 2024
0.2.0 Mar 18, 2024
0.1.0 Mar 18, 2024

#1853 in Encoding

MIT license

26KB
496 lines

file-content

Crates.io Version docs.rs

A small library for reading file content/text data from disk, or anywhere else, into a String.

Supported Encodings

  • UTF-8
  • UTF-8-BOM
  • UTF-16-BE
  • UTF-16-LE
  • or raw bytes

Usage

There are two main structs in this crate.

  • file_content::File: A wrapper around a PathBuf and a file_content::FileContent.

    Use this struct for easily reading file content from disk that may be in any of the supported encodings.

  • file_content::FileContent: An enum of the kind of content, either Encoded or Binary. If Encoded, the variant holds the encoding that content had, and a String representation of it in memory. If Binary, then a Vec<u8> of the raw data is held.

Example: read_file.rs reads a file from disk and prints the path, content type, and content:

use anyhow::anyhow;
use file_content::File;

fn main() -> anyhow::Result<()> {
    let file_path = std::env::args()
        .nth(1)
        .ok_or_else(|| anyhow!("Usage: read_file <file>"))?;

    let file: File = File::new_from_path(&file_path)?;

    println!("{:?}", file);

    let content_only: String = file_content::read_to_string(&file_path)?;

    println!("{content_only}");

    Ok(())
}

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

Dependencies

~235–690KB
~16K SLoC