org.cyberneko.html

Class HTMLScanner.ContentScanner

public class HTMLScanner.ContentScanner extends Object implements HTMLScanner.Scanner

The primary HTML document scanner.

Author: Andy Clark

Method Summary
protected void addLocationItem(XMLAttributes attributes, int index)
Adds location augmentations to the specified attribute.
boolean scan(boolean complete)
Scan.
protected boolean scanAttribute(XMLAttributesImpl attributes, boolean[] empty)
Scans a real attribute.
protected boolean scanAttribute(XMLAttributesImpl attributes, boolean[] empty, char endc)
Scans an attribute, pseudo or real.
protected void scanCDATA()
Scans a CDATA section.
protected void scanCharacters()
Scans characters.
protected void scanComment()
Scans a comment.
protected void scanEndElement()
Scans an end element.
protected boolean scanMarkupContent(XMLStringBuffer buffer, char cend)
Scans markup content.
protected void scanPI()
Scans a processing instruction.
protected boolean scanPseudoAttribute(XMLAttributesImpl attributes)
Scans a pseudo attribute.
protected String scanStartElement(boolean[] empty)
Scans a start element.

Method Detail

addLocationItem

protected void addLocationItem(XMLAttributes attributes, int index)
Adds location augmentations to the specified attribute.

scan

public boolean scan(boolean complete)
Scan.

scanAttribute

protected boolean scanAttribute(XMLAttributesImpl attributes, boolean[] empty)
Scans a real attribute.

Parameters:
    attributes - The list of attributes.
    empty - Used as a second return value that indicates whether the start element tag is empty (e.g. "/>").

scanAttribute

protected boolean scanAttribute(XMLAttributesImpl attributes, boolean[] empty, char endc)
Scans an attribute, pseudo or real.

Parameters:
    attributes - The list of attributes.
    empty - Used as a second return value that indicates whether the start element tag is empty (e.g. "/>").
    endc - The end character that appears before the closing angle bracket ('>').
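
The role of endc can be illustrated with a minimal sketch. It is not NekoHTML code: the class EndCharSketch and the method atTagEnd are hypothetical names, and the choice of '/' for real attributes and '?' for pseudo attributes (as in an XML declaration) is an assumption, not something this page states.

```java
// Hypothetical illustration of an "end character before '>'" check.
// Not part of NekoHTML; names and behavior are assumptions.
final class EndCharSketch {

    /**
     * Returns true when the character at pos is endc and the next
     * character is the closing angle bracket.
     */
    static boolean atTagEnd(String text, int pos, char endc) {
        return pos >= 0
            && pos + 1 < text.length()
            && text.charAt(pos) == endc
            && text.charAt(pos + 1) == '>';
    }

    public static void main(String[] args) {
        // Assumed convention: '/' closes an empty element tag.
        System.out.println(atTagEnd("<br/>", 3, '/'));

        // Assumed convention: '?' closes a declaration with pseudo attributes.
        String decl = "<?xml version=\"1.0\"?>";
        System.out.println(atTagEnd(decl, decl.length() - 2, '?'));
    }
}
```

Passing the end character as a parameter lets one routine serve both tag forms, which may be why the three-argument overload exists alongside the two-argument one.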

scanCDATA

protected void scanCDATA()
Scans a CDATA section.

scanCharacters

protected void scanCharacters()
Scans characters.

scanComment

protected void scanComment()
Scans a comment.

scanEndElement

protected void scanEndElement()
Scans an end element.

scanMarkupContent

protected boolean scanMarkupContent(XMLStringBuffer buffer, char cend)
Scans markup content.

scanPI

protected void scanPI()
Scans a processing instruction.

scanPseudoAttribute

protected boolean scanPseudoAttribute(XMLAttributesImpl attributes)
Scans a pseudo attribute.

Parameters:
    attributes - The list of attributes.

scanStartElement

protected String scanStartElement(boolean[] empty)
Scans a start element.

Parameters:
    empty - Used as a second return value that indicates whether the start element tag is empty (e.g. "/>").
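
Because a Java method has a single return value, a one-element boolean[] can serve as an out-parameter for the "empty" flag while the method itself returns a String. The following is a minimal sketch of that idiom only. It is not the NekoHTML implementation: StartTagSketch and scanStartTag are hypothetical names, the use of index 0 is an assumption, and attributes are not handled.

```java
// Hypothetical illustration of the boolean[] out-parameter idiom.
// Not part of NekoHTML; it parses a complete tag string rather than a stream.
final class StartTagSketch {

    /**
     * Returns the element name of a simple start tag such as "<p>" or "<br/>"
     * and reports through empty[0] whether the tag ended with "/>".
     */
    static String scanStartTag(String tag, boolean[] empty) {
        if (tag.length() < 3 || tag.charAt(0) != '<' || !tag.endsWith(">")) {
            throw new IllegalArgumentException("not a start tag: " + tag);
        }
        // Second "return value": true when the tag is an empty element tag.
        empty[0] = tag.endsWith("/>");
        // Drop the leading '<' and the trailing ">" or "/>".
        int end = tag.length() - (empty[0] ? 2 : 1);
        return tag.substring(1, end).trim();
    }

    public static void main(String[] args) {
        boolean[] empty = { false };
        String name = scanStartTag("<br/>", empty);
        System.out.println(name + " empty=" + empty[0]); // prints "br empty=true"
    }
}
```

The caller allocates the array once and reads empty[0] after each call, so the same array can be reused across calls without extra allocation.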

(C) Copyright 2002-2005, Andy Clark. All rights reserved.