infomat.io.textword
Class TokenFileTextWordIO

java.lang.Object
  extended by infomat.io.textword.TokenFileTextWordIO

public class TokenFileTextWordIO
extends java.lang.Object

For loading a token file into a matrix with the texts as rows and words as columns. Do not use in combination with IObjectGroupingIO. The later shares id:s with CorpusIO that are not present in the TokenFiles.


Nested Class Summary
private  class TokenFileTextWordIO.TokenFileTextWordHandler
          Exports the IMatrix in XML format The file is saved in compressed .zip-format.
 
Field Summary
private  IMatrix matrix
           
private  TextWordIMatrixConstructor twmc
           
 
Constructor Summary
TokenFileTextWordIO()
           
TokenFileTextWordIO(TextWordIMatrixConstructor twmc)
           
 
Method Summary
 IMatrix load(java.io.File file)
          Loads a IMatrix from file in XML format
private  IMatrix load(org.xml.sax.InputSource source)
           
 IMatrix loadCompressed(java.io.File file)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

matrix

private IMatrix matrix

twmc

private TextWordIMatrixConstructor twmc
Constructor Detail

TokenFileTextWordIO

public TokenFileTextWordIO()

TokenFileTextWordIO

public TokenFileTextWordIO(TextWordIMatrixConstructor twmc)
Method Detail

load

public IMatrix load(java.io.File file)
Loads a IMatrix from file in XML format


loadCompressed

public IMatrix loadCompressed(java.io.File file)

load

private IMatrix load(org.xml.sax.InputSource source)