org.writersforge.catalan.text.extractors
Class RegexpLineSplitter

java.lang.Object
  extended by org.writersforge.catalan.text.extractors.RegexpLineSplitter
All Implemented Interfaces:
ITextExtractor

public class RegexpLineSplitter
extends java.lang.Object
implements ITextExtractor

Text splitter that breaks the input text at every line matching the given regular expression. The expression is applied to one line at a time, so a pattern cannot span multiple lines of input.

Author:
jsheets

Constructor Summary
RegexpLineSplitter(java.lang.String regexp, boolean keepDelimiters)
          Creates a new instance of RegexpLineSplitter.
 
Method Summary
 java.lang.String[] extractText(java.lang.String text)
          Extracts fragments of text from the input text document.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

RegexpLineSplitter

public RegexpLineSplitter(java.lang.String regexp,
                          boolean keepDelimiters)
Creates a new instance of RegexpLineSplitter.

Parameters:
regexp - single-line regular expression
keepDelimiters - true to pass the splitting lines (the lines that match regexp) to the output as separate nodes

Method Detail

extractText

public java.lang.String[] extractText(java.lang.String text)
Extracts fragments of text from the input text document.

Specified by:
extractText in interface ITextExtractor
Parameters:
text - input text document
Returns:
extracted text fragments
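
The behaviour described above can be illustrated with a small standalone sketch. It is not the Catalan implementation: the class name LineSplitSketch and the method splitOnMatchingLines are hypothetical, and the sketch assumes that a line counts as a delimiter only when the whole line matches the pattern, that the lines between two delimiters are joined with "\n" into one fragment, and that delimiter lines are dropped when keepDelimiters is false. The real RegexpLineSplitter may differ on any of these points.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical stand-in for illustration only; not the Catalan source code.
class LineSplitSketch {

    // Splits text at every line that matches regexp as a whole (assumption).
    // When keepDelimiters is true, each matching line is emitted as its own fragment.
    static String[] splitOnMatchingLines(String text, String regexp, boolean keepDelimiters) {
        Pattern pattern = Pattern.compile(regexp);
        List<String> fragments = new ArrayList<>();
        List<String> current = new ArrayList<>();

        for (String line : text.split("\r?\n")) {
            if (pattern.matcher(line).matches()) {
                // A delimiter line closes the fragment collected so far.
                if (!current.isEmpty()) {
                    fragments.add(String.join("\n", current));
                    current.clear();
                }
                if (keepDelimiters) {
                    fragments.add(line);
                }
            } else {
                current.add(line);
            }
        }
        // Emit whatever follows the last delimiter line.
        if (!current.isEmpty()) {
            fragments.add(String.join("\n", current));
        }
        return fragments.toArray(new String[0]);
    }

    public static void main(String[] args) {
        String text = "alpha\nbeta\n---\ngamma\n---\ndelta";
        for (String fragment : splitOnMatchingLines(text, "-{3,}", true)) {
            System.out.println("[" + fragment + "]");
        }
    }
}
```

Under these assumptions, the sample text yields the fragments "alpha\nbeta", "---", "gamma", "---" and "delta" when keepDelimiters is true, and only "alpha\nbeta", "gamma" and "delta" when it is false.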