Class Nokogiri::HTML::SAX::Parser inherits from Nokogiri::XML::SAX::Parser
This class lets you perform SAX-style parsing on HTML, with HTML error correction applied.
Here is a basic usage example:
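  class MyDoc < Nokogiri::XML::SAX::Document
    # Called once for every opening tag the parser encounters.
    def start_element name, attributes = []
      puts "found a #{name}"
    end
  end

  parser = Nokogiri::HTML::SAX::Parser.new(MyDoc.new)
  # Read the file named on the command line in binary mode.
  parser.parse(File.read(ARGV[0], mode: 'rb'))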
For more information on SAX parsers, see Nokogiri::XML::SAX.
Public Instance Methods
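- parse_file(filename, encoding = 'UTF-8')
Parses the file at filename using encoding (UTF-8 by default). Raises ArgumentError if filename is nil, Errno::ENOENT if the file does not exist, and Errno::EISDIR if filename is a directory.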
  # File lib/nokogiri/html/sax/parser.rb, line 39
  def parse_file filename, encoding = 'UTF-8'
    raise ArgumentError unless filename
    raise Errno::ENOENT unless File.exists?(filename)
    raise Errno::EISDIR if File.directory?(filename)
    ParserContext.file(filename, encoding).parse_with self
  end
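- parse_memory(data, encoding = 'UTF-8')
Parses the HTML string stored in data using encoding (UTF-8 by default). Raises ArgumentError if data is nil and returns without parsing if data is empty.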
  # File lib/nokogiri/html/sax/parser.rb, line 31
  def parse_memory data, encoding = 'UTF-8'
    raise ArgumentError unless data
    return unless data.length > 0
    ParserContext.memory(data, encoding).parse_with self
  end