Package org.writersforge.catalan.text.extractors

Interface Summary
ITextExtractor Generic interface for pulling specific fragments of text out of a text document.
 

Class Summary
BraceMatchSplitter Text splitter which divides a text document according to matching pairs of open/close brace tokens.
LineSplitter Text splitter which divides a text document into an array of lines.
ParagraphSplitter Text splitter which divides text content into paragraphs delimited by one or more blank (whitespace-only) lines.
RegexpExtractor Text extractor which extracts text from all parenthetical groups defined in the driving regular expression.
RegexpLineSplitter Text splitter which splits text each time a single line matches the given regular expression.
RegexpSplitter Text splitter which identifies delimiters according to a regular expression.
TokenSplitter Text splitter which splits documents each time it encounters a static token.