The current code takes a string as input and returns a string. This is inefficient in stream-based build pipelines such as gulp. It would be better if it accepted a Node stream as input and returned a Node stream as output. (This should be easy enough, as htmlparser2 is already stream-based.)
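
As a rough illustration of the requested interface, here is a minimal sketch that wraps a string-based function in a Node `Transform` stream. `processHtml` and `createHtmlTransform` are hypothetical names, not part of the existing code, and `processHtml` is only a stand-in for the current string-in, string-out function. The sketch buffers all input before processing, so it only provides the stream interface; a real implementation would presumably feed chunks to the parser as they arrive.

```javascript
const { Transform } = require('node:stream');

// Hypothetical stand-in for the existing string-in, string-out function.
function processHtml(html) {
  return html.toUpperCase();
}

// Wraps a string-based function `fn` in a Transform stream.
// Assumption: the whole document is buffered and processed once at the end.
function createHtmlTransform(fn) {
  const chunks = [];
  return new Transform({
    transform(chunk, encoding, callback) {
      // Collect incoming chunks; nothing is emitted until the input ends.
      chunks.push(Buffer.isBuffer(chunk) ? chunk : Buffer.from(chunk, encoding));
      callback();
    },
    flush(callback) {
      try {
        // Run the string-based function over the complete input.
        this.push(fn(Buffer.concat(chunks).toString('utf8')));
        callback();
      } catch (err) {
        callback(err);
      }
    },
  });
}
```

With this shape, usage would look like `fs.createReadStream('in.html').pipe(createHtmlTransform(processHtml)).pipe(fs.createWriteStream('out.html'))`. Note that gulp passes file objects rather than raw bytes through its pipelines, so a gulp plugin would need an object-mode variant of this wrapper.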