I don't understand why people use node-htmlparser when @aredridel's got a top-notch implementation of the HTML5 parser in JS.