sidekick.html.parser.html
Class HtmlCollector

java.lang.Object
  extended by sidekick.html.parser.html.HtmlVisitor
      extended by sidekick.html.parser.html.HtmlCollector

public class HtmlCollector
extends HtmlVisitor

An HtmlVisitor which modifies the structure of the document so that begin tags are matched properly with end tags and placed in TagBlock elements. Typically, an HtmlDocument is created by the parser, which simply returns a flat list of elements. The HtmlCollector takes this flat list and gives it the structure that is implied by the HTML content.

Author:
Brian Goetz, Quiotix

Field Summary
protected  boolean collected
           
protected static java.util.Set dontMatch
           
protected  sidekick.html.parser.html.HtmlCollector.ElementStack elements
           
protected  sidekick.html.parser.html.HtmlCollector.ElementStack tagStack
           
 
Constructor Summary
HtmlCollector()
           
 
Method Summary
static void main(java.lang.String[] args)
           
protected  int pushNode(HtmlDocument.HtmlElement e)
           
 void visit(HtmlDocument.Comment c)
           
 void visit(HtmlDocument.ElementSequence s)
           
 void visit(HtmlDocument.EndTag t)
           
 void visit(HtmlDocument.Newline n)
           
 void visit(HtmlDocument.Tag t)
           
 void visit(HtmlDocument.TagBlock bl)
           
 void visit(HtmlDocument.Text t)
           
 
Methods inherited from class sidekick.html.parser.html.HtmlVisitor
finish, start, visit, visit, visit
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

tagStack

protected sidekick.html.parser.html.HtmlCollector.ElementStack tagStack

elements

protected sidekick.html.parser.html.HtmlCollector.ElementStack elements

collected

protected boolean collected

dontMatch

protected static java.util.Set dontMatch
Constructor Detail

HtmlCollector

public HtmlCollector()
Method Detail

pushNode

protected int pushNode(HtmlDocument.HtmlElement e)

visit

public void visit(HtmlDocument.Comment c)
Overrides:
visit in class HtmlVisitor

visit

public void visit(HtmlDocument.Text t)
Overrides:
visit in class HtmlVisitor

visit

public void visit(HtmlDocument.Newline n)
Overrides:
visit in class HtmlVisitor

visit

public void visit(HtmlDocument.Tag t)
Overrides:
visit in class HtmlVisitor

visit

public void visit(HtmlDocument.EndTag t)
Overrides:
visit in class HtmlVisitor

visit

public void visit(HtmlDocument.TagBlock bl)
Overrides:
visit in class HtmlVisitor

visit

public void visit(HtmlDocument.ElementSequence s)
Overrides:
visit in class HtmlVisitor

main

public static void main(java.lang.String[] args)
                 throws java.lang.Exception
Throws:
java.lang.Exception