IceMorph morphological analysis data files

Timothy Tangherlini, Sean Crist, Peter M. Broadwell, David Gabriel, Kryztof Urban, Aurelijus Vijunas & Jackson Crawford
This dataset consists of four main resources: a concatenated dictionary of Old Icelandic, parsed for word class and inflectional detail; a corpus of Old Icelandic sagas in plain text, chunked by chapter; a tagged version of the same corpus, produced by the IceMorph system; and two training corpora, labeled "Expert" and "Gold", for training and testing a machine learning module.