A B C D E F G H I J K L M N O P Q R S T U V W X _

A

A - Static variable in interface HTMLElementName
HTML element A - anchor.
ABBR - Static variable in interface HTMLElementName
HTML element ABBR - abbreviated form (e.g., WWW, HTTP, etc.).
ACRONYM - Static variable in interface HTMLElementName
HTML element ACRONYM - acronym.
ADDRESS - Static variable in interface HTMLElementName
HTML element ADDRESS - information on author.
APPLET - Static variable in interface HTMLElementName
HTML element APPLET - Java applet.
AREA - Static variable in interface HTMLElementName
HTML element AREA - client-side image map area.
Attribute - class Attribute.
Represents a single attribute name/value segment within a StartTag.
AttributeNames - Static variable in class FormControlOutputStyle.ConfigDisplayValue
Defines the names of the attributes that are copied from the normal form control output element to a display value element.
Attributes - class Attributes.
Represents the list of Attribute objects present within a particular StartTag.
addValue(CharSequence) - Method in class FormControl
Adds the specified value to this control's submission values *.
addValue(CharSequence) - Method in class FormField
Adds the specified value to the field submission values of this field.
addValue(String, CharSequence) - Method in class FormFields
Adds the specified value to the field submission values of the constituent form field with the specified name.
allowsMultipleValues() - Method in class FormField
Indicates whether the field allows multiple values.
atEndOfAttributes(Source, int, boolean) - Method in class StartTagType
Indicates whether the specified source document position is at the end of a tag's attributes.
au.id.jericho.lib.html - package au.id.jericho.lib.html
A simple but powerful java library allowing analysis and manipulation of parts of an HTML document, including some common server-side tags, while reproducing verbatim any unrecognised or invalid HTML.

B

B - Static variable in interface HTMLElementName
HTML element B - bold text style.
BASE - Static variable in interface HTMLElementName
HTML element BASE - document base URI.
BASEFONT - Static variable in interface HTMLElementName
HTML element BASEFONT - base font size.
BDO - Static variable in interface HTMLElementName
HTML element BDO - I18N BiDi over-ride.
BIG - Static variable in interface HTMLElementName
HTML element BIG - large text style.
BLOCKQUOTE - Static variable in interface HTMLElementName
HTML element BLOCKQUOTE - long quotation.
BODY - Static variable in interface HTMLElementName
HTML element BODY - document body.
BR - Static variable in interface HTMLElementName
HTML element BR - forced line break.
BUTTON - Static variable in class FormControlType
The form control type given to a submit button control implemented using a BUTTON element.
BUTTON - Static variable in interface HTMLElementName
HTML element BUTTON - push button.
BasicLogFormatter - class BasicLogFormatter.
Provides basic formatting for log messages.
BasicLogFormatter() - Constructor for class BasicLogFormatter
 

C

CAPTION - Static variable in interface HTMLElementName
HTML element CAPTION - table caption.
CDATA_SECTION - Static variable in class StartTagType
The tag type given to a CDATA section (<![CDATA[ ... ]]>).
CENTER - Static variable in interface HTMLElementName
HTML element CENTER - shorthand for DIV align=center.
CHECKBOX - Static variable in class FormControlType
The form control type given to a checkbox control.
CITE - Static variable in interface HTMLElementName
HTML element CITE - citation.
CODE - Static variable in interface HTMLElementName
HTML element CODE - computer code fragment.
CODE_POINTS_ALL - Static variable in class Config.CompatibilityMode
Indicates the recognition of all unicode code points.
CODE_POINTS_NONE - Static variable in class Config.CompatibilityMode
Indicates the recognition of no unicode code points.
COL - Static variable in interface HTMLElementName
HTML element COL - table column.
COLGROUP - Static variable in interface HTMLElementName
HTML element COLGROUP - table column group.
COMMENT - Static variable in class StartTagType
The tag type given to an HTML comment (<!-- ... -->).
COMPARATOR - Static variable in interface OutputSegment
The comparator used to sort output segments in the OutputDocument before output.
CharStreamSource - interface CharStreamSource.
Represents a character stream source.
CharStreamSourceUtil - class CharStreamSourceUtil.
Contains static utility methods for manipulating the way data is retrieved from a CharStreamSource object.
CharacterEntityReference - class CharacterEntityReference.
Represents an HTML Character Entity Reference.
CharacterReference - class CharacterReference.
Represents an HTML Character Reference, implemented by the subclasses CharacterEntityReference and NumericCharacterReference.
CheckedHTML - Static variable in class FormControlOutputStyle.ConfigDisplayValue
Defines the HTML which replaces the normal output element of a CHECKBOX or RADIO form control if it contains a checked attribute.
ColumnMultipleValueSeparator - Static variable in class Config
Determines the string used to separate a single column's multiple values in the output of the FormFields.getColumnValues(Map) method.
ColumnValueFalse - Static variable in class Config
Determines the string that represents the value false in the output of the FormFields.getColumnValues(Map) method.
ColumnValueTrue - Static variable in class Config
Determines the string that represents the value true in the output of the FormFields.getColumnValues(Map) method.
Config - class Config.
Encapsulates global configuration properties which determine the behaviour of various functions.
Config.CompatibilityMode - class Config.CompatibilityMode.
Represents a set of configuration parameters that relate to user agent compatibility issues.
Config.CompatibilityMode(String) - Constructor for class Config.CompatibilityMode
Constructs a new CompatibilityMode with the given name.
CurrentCompatibilityMode - Static variable in class Config
Determines the currently active compatibility mode.
charAt(int) - Method in class ParseText
Returns the character at the specified index.
charAt(int) - Method in class Segment
Returns the character at the specified index.
clearCache() - Method in class Source
Clears the tag cache of all tags.
clearValues() - Method in class FormControl
Clears the control's existing submission values.
clearValues() - Method in class FormField
Clears the submission values of all the constituent form controls in this field.
clearValues() - Method in class FormFields
Clears the submission values of all the constituent form controls.
compareTo(Object) - Method in class Segment
Compares this Segment object to another object.
constructEndTag(Source, int, int, String) - Method in class EndTagType
Internal method for the construction of an EndTag object of this type.
constructStartTag(Source, int, int, String, Attributes) - Method in class StartTagType
Internal method for the construction of a StartTag object if this type.
constructTagAt(Source, int) - Method in class EndTagTypeGenericImplementation
Constructs a tag of this type at the specified position in the specified source document if it matches all of the required features.
constructTagAt(Source, int) - Method in class StartTagTypeGenericImplementation
Constructs a tag of this type at the specified position in the specified source document if it matches all of the required features.
constructTagAt(Source, int) - Method in class TagType
Constructs a tag of this type at the specified position in the specified source document if it matches all of the required features.
containsAt(String, int) - Method in class ParseText
Indicates whether this parse text contains the specified string at the specified position.

D

DD - Static variable in interface HTMLElementName
HTML element DD - definition description.
DEL - Static variable in interface HTMLElementName
HTML element DEL - deleted text.
DFN - Static variable in interface HTMLElementName
HTML element DFN - instance definition.
DIR - Static variable in interface HTMLElementName
HTML element DIR - directory list.
DISABLED - Static variable in interface LoggerProvider
A LoggerProvider implementation that disables all log messages.
DISPLAY_VALUE - Static variable in class FormControlOutputStyle
The normal output element is replaced with a simple representation of the form control's submission value(s).
DIV - Static variable in interface HTMLElementName
HTML element DIV - generic language/style container.
DL - Static variable in interface HTMLElementName
HTML element DL - definition list.
DOCTYPE_DECLARATION - Static variable in class StartTagType
The tag type given to a document type declaration (<!DOCTYPE ... >).
DT - Static variable in interface HTMLElementName
HTML element DT - definition term.
debug(String) - Method in interface Logger
Logs a message at the DEBUG level.
debug(String) - Method in class WriterLogger
 
decode(CharSequence) - Static method in class CharacterReference
Decodes the specified HTML encoded text into normal text.
decode(CharSequence, boolean) - Static method in class CharacterReference
Decodes the specified HTML encoded text into normal text.
decodeCollapseWhiteSpace(CharSequence) - Static method in class CharacterReference
Decodes the specified text after collapsing its white space.
defines(TagType) - Static method in class MasonTagTypes
Indicates whether the specified tag type is defined in this class.
defines(TagType) - Static method in class PHPTagTypes
Indicates whether the specified tag type is defined in this class.
deregister() - Method in class TagType
Deregisters this tag type.

E

EM - Static variable in interface HTMLElementName
HTML element EM - emphasis.
Element - class Element.
Represents an element in a specific source document, which encompasses a start tag, an optional end tag and all content in between.
ElementName - Static variable in class FormControlOutputStyle.ConfigDisplayValue
Defines the name of display value elements.
EmptyHTML - Static variable in class FormControlOutputStyle.ConfigDisplayValue
Defines the content of a display value element if the submission value of the control is null or an empty string.
EndTag - class EndTag.
Represents the end tag of an element in a specific source document.
EndTagType - class EndTagType.
Defines the syntax for an end tag type.
EndTagType(String, String, String, boolean) - Constructor for class EndTagType
Constructs a new EndTagType object with the specified properties.
EndTagTypeGenericImplementation - class EndTagTypeGenericImplementation.
Provides a generic implementation of the abstract EndTagType class based on the most common end tag behaviour.
EndTagTypeGenericImplementation(String, String, String, boolean, boolean) - Constructor for class EndTagTypeGenericImplementation
Constructs a new EndTagTypeGenericImplementation object based on the specified properties.
encloses(Segment) - Method in class Segment
Indicates whether this Segment encloses the specified Segment.
encloses(int) - Method in class Segment
Indicates whether this segment encloses the specified character position in the source document.
encode(CharSequence) - Static method in class CharacterReference
Encodes the specified text, escaping special characters into character references.
encode(char) - Static method in class CharacterReference
Encodes the specified character into a character reference if required.
encode(CharSequence) - Static method in class NumericCharacterReference
Encodes the specified text, escaping special characters into numeric character references.
encodeDecimal(CharSequence) - Static method in class NumericCharacterReference
Encodes the specified text, escaping special characters into decimal character references.
encodeHexadecimal(CharSequence) - Static method in class NumericCharacterReference
Encodes the specified text, escaping special characters into hexadecimal character references.
encodeWithWhiteSpaceFormatting(CharSequence) - Static method in class CharacterReference
Encodes the specified text, preserving line breaks, tabs and spaces for rendering by converting them to markup.
equals(Object) - Method in class Segment
Compares the specified object with this Segment for equality.
error(String) - Method in interface Logger
Logs a message at the ERROR level.
error(String) - Method in class WriterLogger
 
excludeElement(StartTag) - Method in class TextExtractor
Indicates whether the text inside the Element of the specified start tag should be excluded from the output.
extractText() - Method in class Segment
Deprecated. Use Segment.getTextExtractor().toString() instead.
extractText(boolean) - Method in class Segment
Deprecated. Use Segment.getTextExtractor().setIncludeAttributes(includeAttributes).toString() instead.

F

FIELDSET - Static variable in interface HTMLElementName
HTML element FIELDSET - form control group.
FILE - Static variable in class FormControlType
The form control type given to a file select control.
FONT - Static variable in interface HTMLElementName
HTML element FONT - local change to font.
FORM - Static variable in interface HTMLElementName
HTML element FORM - interactive form.
FRAME - Static variable in interface HTMLElementName
HTML element FRAME - subwindow.
FRAMESET - Static variable in interface HTMLElementName
HTML element FRAMESET - window subdivision.
FormControl - class FormControl.
Represents an HTML form control.
FormControlOutputStyle - class FormControlOutputStyle.
An enumerated type representing the three major output styles of a form control's output element.
FormControlOutputStyle.ConfigDisplayValue - class FormControlOutputStyle.ConfigDisplayValue.
Contains static properties that configure the FormControlOutputStyle.DISPLAY_VALUE form control output style.
FormControlType - class FormControlType.
Represents the control type of a FormControl.
FormField - class FormField.
Represents a field in an HTML form, a field being defined as the group of all form controls having the same name.
FormFields - class FormFields.
Represents a collection of FormField objects.
FormFields(Collection) - Constructor for class FormFields
Constructs a new FormFields object consisting of the specified form controls.
findAllCharacterReferences() - Method in class Segment
Returns a list of all CharacterReference objects that are enclosed by this segment.
findAllElements() - Method in class Segment
Returns a list of all Element objects that are enclosed by this segment.
findAllElements(String) - Method in class Segment
Returns a list of all Element objects with the specified name that are enclosed by this segment.
findAllElements(StartTagType) - Method in class Segment
Returns a list of all Element objects with start tags of the specified type that are enclosed by this segment.
findAllElements(String, String, boolean) - Method in class Segment
Returns a list of all Element objects with the specified attribute name/value pair that are enclosed by this segment.
findAllElements() - Method in class Source
Returns a list of all elements in this source document.
findAllStartTags() - Method in class Segment
Returns a list of all StartTag objects that are enclosed by this segment.
findAllStartTags(String) - Method in class Segment
Returns a list of all StartTag objects with the specified name that are enclosed by this segment.
findAllStartTags(String, String, boolean) - Method in class Segment
Returns a list of all StartTag objects with the specified attribute name/value pair that are enclosed by this segment.
findAllStartTags() - Method in class Source
Returns a list of all start tags in this source document.
findAllTags() - Method in class Segment
Returns a list of all Tag objects that are enclosed by this segment.
findAllTags(TagType) - Method in class Segment
Returns a list of all Tag objects of the specified type that are enclosed by this segment.
findAllTags() - Method in class Source
Returns a list of all tags in this source document.
findEnclosingElement(int) - Method in class Source
Returns the most nested Element that encloses the specified position in the source document.
findEnclosingElement(int, String) - Method in class Source
Returns the most nested Element with the specified name that encloses the specified position in the source document.
findEnclosingTag(int) - Method in class Source
Returns the Tag that encloses the specified position in the source document.
findEnclosingTag(int, TagType) - Method in class Source
Returns the Tag of the specified type that encloses the specified position in the source document.
findEnd(Source, int) - Method in class StartTagTypeGenericImplementation
Finds the end of a tag of this type, starting from the specified position in the specified source document.
findFormControls() - Method in class Segment
Returns a list of the FormControl objects that are enclosed by this segment.
findFormFields() - Method in class Segment
Returns the FormFields object representing all form fields that are enclosed by this segment.
findNameEnd(int) - Method in class Source
Returns the end position of the XML Name that starts at the specified position.
findNextCharacterReference(int) - Method in class Source
Returns the CharacterReference beginning at or immediately following the specified position in the source document.
findNextElement(int) - Method in class Source
Returns the Element beginning at or immediately following the specified position in the source document.
findNextElement(int, String) - Method in class Source
Returns the Element with the specified name beginning at or immediately following the specified position in the source document.
findNextElement(int, String, String, boolean) - Method in class Source
Returns the Element with the specified attribute name/value pair beginning at or immediately following the specified position in the source document.
findNextEndTag(int) - Method in class Source
Returns the EndTag beginning at or immediately following the specified position in the source document.
findNextEndTag(int, String) - Method in class Source
Returns the normal EndTag with the specified name beginning at or immediately following the specified position in the source document.
findNextEndTag(int, String, EndTagType) - Method in class Source
Returns the EndTag with the specified name and type beginning at or immediately following the specified position in the source document.
findNextStartTag(int) - Method in class Source
Returns the StartTag beginning at or immediately following the specified position in the source document.
findNextStartTag(int, String) - Method in class Source
Returns the StartTag with the specified name beginning at or immediately following the specified position in the source document.
findNextStartTag(int, String, String, boolean) - Method in class Source
Returns the StartTag with the specified attribute name/value pair beginning at or immediately following the specified position in the source document.
findNextTag(int) - Method in class Source
Returns the Tag beginning at or immediately following the specified position in the source document.
findNextTag(int, TagType) - Method in class Source
Returns the Tag of the specified type beginning at or immediately following the specified position in the source document.
findNextTag() - Method in class Tag
Returns the next tag in the source document.
findPreviousCharacterReference(int) - Method in class Source
Returns the CharacterReference at or immediately preceding (or enclosing) the specified position in the source document.
findPreviousEndTag(int) - Method in class Source
Returns the EndTag beginning at or immediately preceding the specified position in the source document.
findPreviousEndTag(int, String) - Method in class Source
Returns the normal EndTag with the specified name at or immediately preceding (or enclosing) the specified position in the source document.
findPreviousStartTag(int) - Method in class Source
Returns the StartTag at or immediately preceding (or enclosing) the specified position in the source document.
findPreviousStartTag(int, String) - Method in class Source
Returns the StartTag with the specified name at or immediately preceding (or enclosing) the specified position in the source document.
findPreviousTag(int) - Method in class Source
Returns the Tag beginning at or immediately preceding (or enclosing) the specified position in the source document.
findPreviousTag(int, TagType) - Method in class Source
Returns the Tag of the specified type beginning at or immediately preceding (or enclosing) the specified position in the source document.
findPreviousTag() - Method in class Tag
Returns the previous tag in the source document.
format(LogRecord) - Method in class BasicLogFormatter
Returns a formatted string representing the log entry information contained in the specified java.util.logging.LogRecord.
format(String, String, String) - Static method in class BasicLogFormatter
Returns a formatted string representing the specified log entry information.
fullSequentialParse() - Method in class Source
Parses all of the tags in this source document sequentially from beginning to end.

G

generateHTML(Map) - Static method in class Attributes
Returns the contents of the specified attributes map as HTML attribute name/value pairs.
generateHTML(String) - Static method in class EndTag
Generates the HTML text of a normal end tag with the specified tag name.
generateHTML(String) - Method in class EndTagType
Generates the HTML text of an end tag of this type given the name of a corresponding start tag.
generateHTML(String) - Method in class EndTagTypeGenericImplementation
Generates the HTML text of an end tag of this type given the name of a corresponding start tag.
generateHTML(String, Map, boolean) - Static method in class StartTag
Generates the HTML text of a normal start tag with the specified tag name and attributes map.
get(String) - Method in class Attributes
Returns the Attribute with the specified name (case insensitive).
get(String) - Method in class FormFields
Returns the FormField with the specified name.
getAttributeValue(String) - Method in class Element
Returns the decoded value of the attribute with the specified name (case insensitive).
getAttributeValue(String) - Method in class StartTag
Returns the decoded value of the attribute with the specified name (case insensitive).
getAttributes() - Method in class Element
Returns the attributes specified in this element's start tag.
getAttributes() - Method in class StartTag
Returns the attributes specified in this start tag.
getAttributesMap() - Method in class FormControl
Returns a map of the names and values of this form control's output attributes.
getBegin() - Method in interface OutputSegment
Returns the character position in the source text of the output document where this segment begins.
getBegin() - Method in class Segment
Returns the character position in the Source document at which this segment begins.
getBlockIndentSize() - Method in class Renderer
Returns the size of the indent to be used for anything other than LI elements.
getBlockLevelElementNames() - Static method in class HTMLElements
Returns a set containing the names of all the block-level elements.
getCacheDebugInfo() - Method in class Source
Returns a string representation of the tag cache, useful for debugging purposes.
getChar() - Method in class CharacterReference
Returns the character represented by this character reference.
getCharacterReferenceString() - Method in class CharacterEntityReference
Returns the correct encoded form of this character entity reference.
getCharacterReferenceString(int) - Static method in class CharacterEntityReference
Returns the character entity reference encoded form of the specified unicode code point.
getCharacterReferenceString() - Method in class CharacterReference
Returns the encoded form of this character reference.
getCharacterReferenceString(int) - Static method in class CharacterReference
Returns the encoded form of the specified unicode code point.
getCharacterReferenceString() - Method in class NumericCharacterReference
Returns the correct encoded form of this numeric character reference.
getCharacterReferenceString(int) - Static method in class NumericCharacterReference
Returns the numeric character reference encoded form of the specified unicode code point.
getChildElements() - Method in class Element
Returns a list of the immediate children of this element in the document element hierarchy.
getChildElements() - Method in class Segment
Returns a list of the immediate children of this segment in the document element hierarchy.
getChildElements() - Method in class Source
Returns a list of the top-level elements in the document element hierarchy.
getClosingDelimiter() - Method in class TagType
Returns the character sequence that marks the end of the tag.
getCodePoint() - Method in class CharacterReference
Returns the unicode code point represented by this character reference.
getCodePointFromCharacterReferenceString(CharSequence) - Static method in class CharacterReference
Parses a single encoded character reference text into a unicode code point.
getCodePointFromName(String) - Static method in class CharacterEntityReference
Returns the unicode code point of the specified character entity reference name.
getCollapseWhiteSpace() - Method in class SourceFormatter
Indicates whether white space in the text between the tags is to be collapsed.
getColumn() - Method in class RowColumnVector
Returns the column number of this character position in the source document.
getColumn(int) - Method in class Source
Returns the column number of the specified character position in the source document.
getColumnLabels() - Method in class FormFields
Returns a string array containing the column labels corresponding to the values from the FormFields.getColumnValues(Map) method.
getColumnValues(Map) - Method in class FormFields
Converts the data values in the specified field data set into a simple string array, suitable for storage in a tabular format such as a database table or .CSV file.
getColumnValues() - Method in class FormFields
Converts all the form submission values of the constituent form fields into a simple string array, suitable for storage in a tabular format such as a database table or .CSV file.
getContent() - Method in class Element
Returns the segment representing the content of the element.
getConvertNonBreakingSpaces() - Method in class Renderer
Indicates whether non-breaking space (&nbsp;) character entity references are converted to spaces.
getConvertNonBreakingSpaces() - Method in class TextExtractor
Indicates whether non-breaking space (&nbsp;) character entity references are converted to spaces.
getCorrespondingEndTagType() - Method in class StartTagType
Returns the type of end tag required to pair with a start tag of this type to form an element.
getCorrespondingStartTagType() - Method in class EndTagType
Returns the type of start tag that is usually paired with an end tag of this type to form an Element.
getCount() - Method in class Attributes
Returns the number of attributes.
getCount() - Method in class FormFields
Returns the number of FormField objects.
getDataSet() - Method in class FormFields
Returns the entire field data set represented by the values of the constituent form fields.
getDebugInfo() - Method in class Attribute
Returns a string representation of this object useful for debugging purposes.
getDebugInfo() - Method in class Attributes
Returns a string representation of this object useful for debugging purposes.
getDebugInfo() - Method in class CharacterEntityReference
Returns a string representation of this object useful for debugging purposes.
getDebugInfo() - Method in class Config.CompatibilityMode
Returns a string representation of this object useful for debugging purposes.
getDebugInfo() - Method in class Element
 
getDebugInfo() - Method in class EndTag
 
getDebugInfo() - Method in class FormControl
 
getDebugInfo() - Method in class FormControlOutputStyle
Returns a string representation of this object useful for debugging purposes.
getDebugInfo() - Method in class FormField
Returns a string representation of this object useful for debugging purposes.
getDebugInfo() - Method in class FormFields
Returns a string representation of this object useful for debugging purposes.
getDebugInfo() - Method in class NumericCharacterReference
 
getDebugInfo() - Method in class OutputDocument
Returns a string representation of this object useful for debugging purposes.
getDebugInfo() - Method in interface OutputSegment
Returns a string representation of this object useful for debugging purposes.
getDebugInfo() - Method in class Segment
Returns a string representation of this object useful for debugging purposes.
getDebugInfo() - Method in class StartTag
 
getDecimalCharacterReferenceString() - Method in class CharacterReference
Returns the decimal encoded form of this character reference.
getDecimalCharacterReferenceString(int) - Static method in class CharacterReference
Returns the decimal encoded form of the specified unicode code point.
getDecorateFontStyles() - Method in class Renderer
Indicates whether decoration characters are to be included around the content of some font style elements and phrase elements.
getDefaultMaxErrorCount() - Static method in class Attributes
Returns the default maximum error count allowed when parsing attributes.
getDeprecatedElementNames() - Static method in class HTMLElements
Returns a set containing the names of all deprecated elements in HTML 4.01.
getDepth() - Method in class Element
Returns the nesting depth of this element in the document element hierarchy.
getDescription() - Method in class TagType
Returns a description of this tag type useful for debugging purposes.
getDocumentSpecifiedEncoding() - Method in class Source
Returns the document encoding specified within the text of the document.
getElement() - Method in class EndTag
Returns the element that is ended by this end tag.
getElement() - Method in class FormControl
Returns the element representing this form control in the source document.
getElement() - Method in class StartTag
Returns the element that is started by this start tag.
getElement() - Method in class Tag
Returns the element that is started or ended by this tag.
getElementById(String) - Method in class Source
Returns the Element with the specified id attribute value.
getElementName() - Method in class FormControlType
Returns the name of the Element that constitues this form control type.
getElementNames() - Static method in class HTMLElements
Returns a list containing all of the HTML element names.
getEncoding() - Method in class Source
Returns the character encoding scheme of the source byte stream used to create this object.
getEncodingFilterWriter(Writer) - Static method in class CharacterReference
Returns a filter Writer that encodes all text before passing it through to the specified Writer.
getEncodingSpecificationInfo() - Method in class Source
Returns a concise description of how the encoding of the source document was determined.
getEnd() - Method in interface OutputSegment
Returns the character position in the source text of the output document where this segment ends.
getEnd() - Method in class Segment
Returns the character position in the Source document immediately after the end of this segment.
getEndTag() - Method in class Element
Returns the end tag of the element.
getEndTagForbiddenElementNames() - Static method in class HTMLElements
Returns a set containing the names of all of the HTML elements for which the end tag is forbidden.
getEndTagOptionalElementNames() - Static method in class HTMLElements
Returns a set containing the names of all of the HTML elements for which the end tag is optional.
getEndTagRequiredElementNames() - Static method in class HTMLElements
Returns a set containing the names of all of the HTML elements for which the end tag is required.
getEndTagType() - Method in class EndTag
Returns the type of this end tag.
getEstimatedMaximumOutputLength() - Method in interface CharStreamSource
Returns the estimated maximum number of characters in the output, or -1 if no estimate is available.
getEstimatedMaximumOutputLength() - Method in class OutputDocument
 
getEstimatedMaximumOutputLength() - Method in class Renderer
 
getEstimatedMaximumOutputLength() - Method in class SourceFormatter
 
getEstimatedMaximumOutputLength() - Method in class TextExtractor
 
getExcludeNonHTMLElements() - Method in class TextExtractor
Indicates whether the content of non-HTML elements is excluded from the output.
getFormControl() - Method in class Element
Returns the FormControl defined by this element.
getFormControl(String) - Method in class FormField
Returns the constituent FormControl with the specified predefined value.
getFormControl() - Method in class FormField
Returns the first FormControl from this field.
getFormControl() - Method in class StartTag
Returns the FormControl defined by this start tag.
getFormControlType() - Method in class FormControl
Returns the type of this form control.
getFormControls() - Method in class FormField
Returns a collection of all the constituent form controls in this field.
getFormControls() - Method in class FormFields
Returns a list of all the constituent form controls from all the form fields in this collection.
getHexadecimalCharacterReferenceString() - Method in class CharacterReference
Returns the hexadecimal encoded form of this character reference.
getHexadecimalCharacterReferenceString(int) - Static method in class CharacterReference
Returns the hexadecimal encoded form of the specified unicode code point.
getIncludeAttributes() - Method in class TextExtractor
Indicates whether the values of title, alt, label, and summary, and content attributes of normal tags are to be included in the output.
getIndentAllElements() - Method in class SourceFormatter
Indicates whether all elements are to be indented, including inline-level elements and those with preformatted contents.
getIndentString() - Method in class SourceFormatter
Returns the string to be used for indentation.
getInlineLevelElementNames() - Static method in class HTMLElements
Returns a set containing the names of all the inline-level elements.
getKey() - Method in class Attribute
Returns the name of this attribute in lower case.
getListBullets() - Method in class Renderer
Returns the bullet characters to use for list items inside UL elements.
getListIndentSize() - Method in class Renderer
Returns the size of the indent to be used for LI elements.
getLogWriter() - Method in class Source
Deprecated. Use ((WriterLogger)Source.getLogger()).getWriter() instead.
getLogger(String) - Method in interface LoggerProvider
Creates a new Logger instance with the specified name.
getLogger() - Method in class Source
Returns the Logger that handles log messages.
getMaxLineLength() - Method in class Renderer
Returns the column at which lines are to be wrapped.
getName() - Method in class Attribute
Returns the name of this attribute in original case.
getName() - Method in class CharacterEntityReference
Returns the name of this character entity reference.
getName(char) - Static method in class CharacterEntityReference
Returns the character entity reference name of the specified character.
getName(int) - Static method in class CharacterEntityReference
Returns the character entity reference name of the specified unicode code point.
getName() - Method in class Config.CompatibilityMode
Returns the name of this compatibility mode.
getName() - Method in class Element
Returns the name of the start tag of this element, always in lower case.
getName() - Method in class FormControl
Returns the name of the control.
getName() - Method in class FormField
Returns the control name shared by all of this field's constituent controls.
getName() - Method in class Tag
Returns the name of this tag, always in lower case.
getName() - Method in class WriterLogger
Returns the name of this logger.
getNamePrefix() - Method in class TagType
Returns the name prefix required by this tag type.
getNameSegment() - Method in class Attribute
Returns the segment spanning the name of this attribute.
getNameSegment() - Method in class Tag
Returns the segment spanning the name of this tag.
getNameToCodePointMap() - Static method in class CharacterEntityReference
Returns a map of character entity reference names (String) to unicode code points (Integer).
getNestingForbiddenElementNames() - Static method in class HTMLElements
Returns a set containing the names of all of the HTML elements which should never contain elements of the same name, either as direct or indirect descendants.
getNewLine() - Method in class Renderer
Returns the string to be used to represent a newline in the output.
getNewLine() - Method in class Source
Returns the newline character sequence used in the source document.
getNewLine() - Method in class SourceFormatter
Returns the string to be used to represent a newline in the output.
getNonterminatingElementNames(String) - Static method in class HTMLElements
Returns the names of elements that do NOT implicitly terminate an HTML element with the specified name.
getOptionElementIterator() - Method in class FormControl
Returns an iterator over the OPTION elements contained within this control, in order of appearance.
getOutputStyle() - Method in class FormControl
Returns the current output style of this form control.
getOverlappingOutputSegments() - Method in class OverlappingOutputSegmentsException
Deprecated. Returns null as this exception is never thrown.
getParentElement() - Method in class Element
Returns the parent of this element in the document element hierarchy.
getParseText() - Method in class Source
Returns the parse text of this source document.
getPos() - Method in class RowColumnVector
Returns the character position in the source document.
getPredefinedValue() - Method in class FormControl
Returns the initial value of this control if it has a predefined value.
getPredefinedValues() - Method in class FormControl
Returns a collection of all predefined values in this control.
getPredefinedValues() - Method in class FormField
Returns a collection of the predefined values of all constituent controls in this field.
getPreliminaryEncodingInfo() - Method in class Source
Returns the preliminary encoding of the source document together with a concise description of how it was determined.
getQuoteChar() - Method in class Attribute
Returns the character used to quote the value.
getReader(CharStreamSource) - Static method in class CharStreamSourceUtil
Returns a Reader that reads the output of the specified CharStreamSource.
getRegisteredOutputSegments() - Method in class OutputDocument
Returns a list all of the registered OutputSegment objects in this output document.
getRegisteredTagTypes() - Static method in class TagType
Returns a list of all the currently registered tag types in order of lowest to highest precedence.
getRenderer() - Method in class Segment
Performs a simple rendering of the HTML markup in this segment into text.
getRow() - Method in class RowColumnVector
Returns the row number of this character position in the source document.
getRow(int) - Method in class Source
Returns the row number of the specified character position in the source document.