HTML::FormatText - Format HTML as plaintext
version 2.16
use HTML::TreeBuilder;
# Parse the HTML file into a tree
my $tree = HTML::TreeBuilder->new->parse_file("test.html");
use HTML::FormatText;
# Render the tree as plain text between columns 0 and 50
my $formatter = HTML::FormatText->new(leftmargin => 0, rightmargin => 50);
print $formatter->format($tree);
or, more simply:
use HTML::FormatText;
my $string = HTML::FormatText->format_file(
    'test.html',
    leftmargin => 0, rightmargin => 50
);
HTML::FormatText is a formatter that outputs plain text. All character
attributes (bold/italic/underline) are ignored. Formatting of HTML tables and
forms is not implemented.
HTML::FormatText is built on HTML::Formatter, and the documentation for that
module applies to this one as well, especially its new, format_file and
format_string methods.
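As a minimal sketch of the inherited format_string method, the following formats an in-memory HTML string instead of a file. The sample markup and the variable names are invented for illustration; the margins are the same ones used in the examples above.

```perl
use strict;
use warnings;
use HTML::FormatText;

# Hypothetical sample input; any HTML fragment would do.
my $html = '<h1>Release notes</h1>'
         . '<p>This is <b>bold</b> and <i>italic</i> text.</p>';

# format_string is called as a class method: HTML source first,
# then the same options that new() accepts.
my $text = HTML::FormatText->format_string(
    $html,
    leftmargin => 0, rightmargin => 50
);

# The <b> and <i> tags are dropped; only their text content remains.
print $text;
```

This is convenient when the HTML comes from a network response or a database rather than from disk.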
You may specify the following parameters when constructing the formatter:
- leftmargin (alias lm)
  The column of the left margin. The default is 3.
- rightmargin (alias rm)
  The column of the right margin. The default is 72.
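The two margin parameters can be sketched with their short aliases as follows. This is an illustration only: the sample paragraph and the choice of column 40 are assumptions, and the exact wrapping depends on the input text.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;
use HTML::FormatText;

# Hypothetical sample input, long enough to need wrapping.
my $html = '<p>' . join(' ', ('plain text output') x 8) . '</p>';

# lm and rm are the aliases for leftmargin and rightmargin.
# Without them the defaults apply: left margin 3, right margin 72.
my $narrow = HTML::FormatText->new(lm => 0, rm => 40);

my $tree = HTML::TreeBuilder->new_from_content($html);
my $wrapped = $narrow->format($tree);
$tree->delete;    # release the parse tree once formatting is done

print $wrapped;
```

A formatter object built this way can be reused for several trees, which avoids repeating the margin settings on every call.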
See also: HTML::Formatter
This software is copyright (c) 2016 by Nigel Metheringham, 2002-2005 Sean M Burke, 1999-2002 Gisle Aas.
This is free software; you can redistribute it and/or modify it under
the same terms as the Perl 5 programming language system itself.