NAME

App::Zapzi::Transformers::HTML - process HTML without doing readability transforms

VERSION

version 0.017

DESCRIPTION

This class takes HTML and returns the body without applying any additional readability transforms: tags such as script are removed, but the text itself should not be changed. Use this if HTMLExtractMain does not provide the desired results.

METHODS

name

Name of transformer visible to user.

handles($content_type)

Returns true if this module handles the given content-type.
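
As a rough illustration of the kind of check described above, here is a minimal sketch. The package name My::Sketch::HTMLTransformer is hypothetical, and the matching rule (accept any content-type containing text/html, with or without parameters) is an assumption, not taken from the module's source.

```perl
use strict;
use warnings;

# Hypothetical stand-in package, not the real App::Zapzi class.
package My::Sketch::HTMLTransformer;

sub new { return bless {}, shift }

# Assumed rule: accept any content-type that mentions text/html,
# including values with parameters such as "; charset=utf-8".
sub handles
{
    my ($self, $content_type) = @_;
    return (defined($content_type) && $content_type =~ m{text/html}i) ? 1 : 0;
}

package main;

my $t = My::Sketch::HTMLTransformer->new;
for my $type ('text/html', 'text/html; charset=utf-8', 'text/plain')
{
    printf "%-26s %s\n", $type, $t->handles($type) ? 'handled' : 'skipped';
}
```

Under this assumed rule the two text/html values are reported as handled and text/plain as skipped.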

transform(input)

Converts input to readable text. Returns true if the conversion succeeded.
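
The behaviour outlined in the DESCRIPTION (keep the body, drop tags such as script, leave the text alone) might be sketched as follows. This is an illustration only: the function name body_without_scripts is hypothetical, and the use of regular expressions is an assumption made to keep the example free of non-core modules. It does not show how this class is actually implemented, and regular expressions are not a robust way to parse arbitrary HTML.

```perl
use strict;
use warnings;

# Hypothetical helper: return the contents of <body> with script
# elements removed, or undef if no <body> element is found.
sub body_without_scripts
{
    my ($html) = @_;

    # Capture everything between the opening and closing body tags.
    my ($body) = $html =~ m{<body\b[^>]*>(.*?)</body\s*>}is;
    return unless defined $body;

    # Drop script elements together with their contents; all other
    # markup and text is left exactly as it was.
    $body =~ s{<script\b[^>]*>.*?</script\s*>}{}gis;

    return $body;
}

my $page = '<html><head><title>T</title></head>'
         . '<body><script>alert(1)</script><p>Hello, world.</p></body></html>';

print body_without_scripts($page), "\n";
```

For the sample page this prints the paragraph markup unchanged, with the script element gone.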

AUTHOR

Rupert Lane <rupert@rupert-lane.org>

COPYRIGHT AND LICENSE

This software is copyright (c) 2015 by Rupert Lane.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.