Package org.htmlcleaner
Class HtmlTokenizer
java.lang.Object
  org.htmlcleaner.HtmlTokenizer
public class HtmlTokenizer extends Object
Main HTML tokenizer. Its task is to parse HTML and produce a list of valid tokens: open tag tokens, end tag tokens, content (text), and comments. As soon as a new item is added to the token list, the cleaner is invoked to clean the current list at the end.
Created by: Vladimir Nikic.
Date: November, 2006
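The token kinds named in the class description (open tags, end tags, text content, comments) can be illustrated with a minimal, self-contained sketch. It does not use or reproduce HtmlTokenizer's actual implementation, and it does not invoke a cleaner; the names TokenizerSketch, Kind, and Token are hypothetical and exist only for this example. It assumes well-formed input and ignores doctype declarations, attributes parsing, and script content.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of org.htmlcleaner.
public class TokenizerSketch {

    // The four token kinds mentioned in the class description.
    enum Kind { OPEN_TAG, END_TAG, TEXT, COMMENT }

    static final class Token {
        final Kind kind;
        final String value;

        Token(Kind kind, String value) {
            this.kind = kind;
            this.value = value;
        }

        @Override
        public String toString() {
            return kind + "[" + value + "]";
        }
    }

    // Reads the whole input, then splits it into tokens in a single pass.
    static List<Token> tokenize(Reader reader) throws IOException {
        StringBuilder sb = new StringBuilder();
        char[] buf = new char[1024];
        int n;
        while ((n = reader.read(buf)) != -1) {
            sb.append(buf, 0, n);
        }
        String html = sb.toString();

        List<Token> tokens = new ArrayList<>();
        int pos = 0;
        while (pos < html.length()) {
            // Position of the closing '>' if a tag starts here, else -1.
            int close = html.charAt(pos) == '<' ? html.indexOf('>', pos) : -1;
            if (html.startsWith("<!--", pos)) {
                // Comment: everything up to "-->" (or end of input if unterminated).
                int end = html.indexOf("-->", pos + 4);
                int stop = end < 0 ? html.length() : end;
                tokens.add(new Token(Kind.COMMENT, html.substring(pos + 4, stop)));
                pos = end < 0 ? html.length() : end + 3;
            } else if (close >= 0) {
                // Tag: a leading '/' marks an end tag.
                String body = html.substring(pos + 1, close).trim();
                if (body.startsWith("/")) {
                    tokens.add(new Token(Kind.END_TAG, body.substring(1).trim()));
                } else {
                    tokens.add(new Token(Kind.OPEN_TAG, body));
                }
                pos = close + 1;
            } else {
                // Text: runs until the next '<' or the end of input.
                int next = html.indexOf('<', pos + 1);
                int stop = next < 0 ? html.length() : next;
                tokens.add(new Token(Kind.TEXT, html.substring(pos, stop)));
                pos = stop;
            }
        }
        return tokens;
    }

    // Convenience overload for in-memory strings.
    static List<Token> tokenize(String html) {
        try {
            return tokenize(new StringReader(html));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        for (Token t : tokenize("<p>Hi<!-- note --></p>")) {
            System.out.println(t);
        }
    }
}
```

The sketch takes a Reader, mirroring the reader parameter of the constructor documented below, but it returns the complete list at once rather than handing each new token to a cleaner.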
Constructor Summary
Constructors
HtmlTokenizer(HtmlCleaner cleaner, Reader reader, org.htmlcleaner.CleanTimeValues cleanTimeValues)
Creates an instance of the parser with the specified content.
Method Summary
Modifier and Type: DoctypeToken
Method: getDocType()
Constructor Detail
HtmlTokenizer
public HtmlTokenizer(HtmlCleaner cleaner, Reader reader, org.htmlcleaner.CleanTimeValues cleanTimeValues)
Creates an instance of the parser with the specified content.
Parameters:
cleaner -
reader -
Method Detail
getDocType
public DoctypeToken getDocType()