#string #8-bit #local #ansi #codec #charset #codepage

local-encoding

Rust library for encoding/decoding string with local charset. It usefull for work with ANSI strings on Windows.

2 unstable releases

Uses old Rust 2015

0.2.0 Aug 9, 2016
0.1.0 Jul 12, 2016

#1613 in Text processing

Download history 503/week @ 2024-07-20 513/week @ 2024-07-27 470/week @ 2024-08-03 609/week @ 2024-08-10 534/week @ 2024-08-17 546/week @ 2024-08-24 493/week @ 2024-08-31 605/week @ 2024-09-07 663/week @ 2024-09-14 694/week @ 2024-09-21 627/week @ 2024-09-28 432/week @ 2024-10-05 540/week @ 2024-10-12 686/week @ 2024-10-19 590/week @ 2024-10-26 526/week @ 2024-11-02

2,417 downloads per month
Used in 19 crates (16 directly)

MIT license

18KB
312 lines

local-encoding

Join the chat at https://gitter.im/bozaro/local-encoding-rs Build Status Crates.io

This repository contains rust library for encoding/decoding string with local charset. It usefull for work with ANSI strings on Windows.

Unfortunately Windows widly use 8-bit character encoding instead UTF-8. This causes a lot of pain.

For example, in Russian version:

  • CP-1251 (ANSI codepage) used for 8-bit files;
  • CP-866 (OEM codepage) used for console output.

To convert between 8-bit and Unicode used Windows have function: MultiByteToWideChar and WideCharToMultiByte.

This library provide simple function to convert between 8-bit and Unicode characters on Windows.

UTF-8 used as 8-bit codepage for non-Windows system.

Rustdoc: https://bozaro.github.io/local-encoding-rs/local_encoding/

Usage

Put this in your Cargo.toml:

[dependencies]
local-encoding = "*"

For example:

extern crate local_encoding;

use local_encoding::{Encoding, Encoder};

fn main()
{
	println!("Unicode string: {}", Encoding::ANSI.to_string(b"ANSI string").unwrap());
	println!("Unicode string: {}", Encoding::OEM.to_string(b"OEM string").unwrap());
}

Dependencies

~0–260KB