5 releases (3 breaking)

0.4.1 Nov 3, 2020
0.4.0 Oct 2, 2019
0.3.0 Jul 10, 2018
0.2.0 Feb 4, 2018
0.1.0 Jan 3, 2017

#597 in Parser implementations

Download history 552/week @ 2023-10-19 437/week @ 2023-10-26 309/week @ 2023-11-02 363/week @ 2023-11-09 441/week @ 2023-11-16 396/week @ 2023-11-23 399/week @ 2023-11-30 440/week @ 2023-12-07 437/week @ 2023-12-14 442/week @ 2023-12-21 448/week @ 2023-12-28 472/week @ 2024-01-04 678/week @ 2024-01-11 544/week @ 2024-01-18 474/week @ 2024-01-25 478/week @ 2024-02-01

2,289 downloads per month
Used in 8 crates (7 directly)

MIT license

29KB
592 lines

sitemap, an Sitemap library for Rust

Chrono-utils on Travis CI Chrono-utils on crates.io

sitemap is an library for Rust programming language.

features

  • Streaming reading sitemap

Restrictions

  • no other encodings but UTF-8 are supported yet
  • validation is not supported

Building and using

sitemap uses Cargo, so just add a dependency section in your project's manifest:

[dependencies]
sitemap = "0.4"

Reading sitemap documents

sitemap::reader::SiteMapReader requires a Read instance to read from. When a proper stream-based encoding library is available, it is likely that sitemap will be switched to use whatever character stream structure this library would provide, but currently it is a Read.

Using SiteMapReader is very straightforward. Just provide a Read instance to obtain an iterator over events:

use sitemap::reader::{SiteMapReader,SiteMapEntity};
use std::fs::File;
fn main() {
    let mut urls = Vec::new();
    let mut sitemaps = Vec::new();
    let mut errors = Vec::new();
    let file = File::open("sitemap.xml").expect("Unable to open file.");
    let parser = SiteMapReader::new(file);
    for entity in parser {
        match entity {
            SiteMapEntity::Url(url_entry) => {
                urls.push(url_entry);
            },
            SiteMapEntity::SiteMap(sitemap_entry) => {
                sitemaps.push(sitemap_entry);
            },
            SiteMapEntity::Err(error) => {
                errors.push(error);
            },
        }
    }
    println!("urls = {:?}",urls);
    println!("sitemaps = {:?}",sitemaps);
    println!("errors = {:?}",errors);
}

Writng sitemap documents

use sitemap::writer::SiteMapWriter;
use sitemap::structs::UrlEntry;
use std::io::stdout;
fn main() {
    let mut output = stdout();
    let sitemap_writer = SiteMapWriter::new(&mut output);
    let mut urlwriter = sitemap_writer.start_urlset().expect("Unable to write urlset");
    urlwriter.url("http://github.com").expect("Unable to write url");
    urlwriter.url(UrlEntry::builder().loc("http://google.com")).expect("Unable to write url");
    urlwriter.url(UrlEntry::builder().loc("http://yandex.ru").build().unwrap()).expect("Unable to write url");
    urlwriter.end().expect("Unable to write close tags");
}

Roadmap

Highest priority first, approximately.

  1. Sitemap writer - done
  2. Sitemap validation

Known issues

All known issues are present on GitHub issue tracker: http://github.com/svmk/sitemap/issues. Feel free to post any found problems there.

License

This library is licensed under MIT license.

Dependencies

~3MB
~83K SLoC