Catmandu-HTML/lib/Catmandu/HTML.pm
package Catmandu::HTML;
our $VERSION = '0.02';
1;
__END__
=encoding utf8
=head1 NAME
Catmandu::HTML - Modules for handling HTML data within the Catmandu framework
=head1 SYNOPSIS
Command line client C<catmandu>:
catmandu convert HTML to JSON < ex/test.html > data/test.json
catmandu convert JSON to HTML < data/test.json > ex/test.html
# Keep the text
curl -s http://librecat.org |
catmandu convert HTML to Text --fix "html_text()"
# Filter out the anchors
curl -s http://librecat.org |
catmandu convert HTML to Text --fix "html_filter_tag(a); html_text(join:\"\n\")"
# Filter out the meta tags as YAML
curl -s http://librecat.org |
catmandu convert HTML to YAML --fix "html_filter_tag(meta, group_by:name)"
See documentation of modules for more examples.
=head1 AVAILABLE MODULES
=over
=item L<Catmandu::Exporter::HTML>
Serialize HTML data
=item L<Catmandu::Importer::HTML>
Parse HTML data
=item L<Catmandu::Fix::html_filter_tag>
Filter HTML by name and optional group them
=item L<Catmandu::Fix::html_filter_type>
Filter HTML by type
=item L<Catmandu::Fix::html_text>
Select only the literal text values from the HTML
=back
=head1 SEE ALSO
This module is based on L<Catmandu>
=head1 AUTHORS
=over
=item Patrick Hochstenbach, C<< patrick.hochstenbach at ugent.be >>
=back
=head1 COPYRIGHT AND LICENSE
The Perl software is copyright (c) 2018 by Patrick Hochstenbach.
This is free software; you can redistribute it and/or modify it under the same
terms as the Perl 5 programming language system itself.
=encoding utf8
=cut