Conversion Code

We wrote conversion code that converts the Queen’s cabinet collections’ MONK format to OAC phase II, preserving all information present in MONK. The same code has an option to generate W3C Open Annotation format for the line strip regions produced by the CODA Line Cutout Service. This code is built into the Line Strip Service. Find the conversion code on GitHub.
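To illustrate what an Open Annotation record for a line strip region might look like, here is a minimal Python sketch. It is not the conversion code itself. The function name `line_strip_annotation`, its parameters, the example image URI, and the choice to express the region as a media fragment (`xywh=`) are all assumptions made for illustration; the actual service may describe regions differently.

```python
import json


def line_strip_annotation(page_image_uri, x, y, w, h, text=None):
    """Build an Open Annotation-style JSON-LD dict for one rectangular line strip.

    Hypothetical helper: the region is expressed as an oa:FragmentSelector
    holding a media-fragment rectangle on the page image.
    """
    annotation = {
        "@context": {
            "oa": "http://www.w3.org/ns/oa#",
            "cnt": "http://www.w3.org/2011/content#",
            "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
            "dcterms": "http://purl.org/dc/terms/",
        },
        "@type": "oa:Annotation",
        "oa:hasTarget": {
            "@type": "oa:SpecificResource",
            "oa:hasSource": {"@id": page_image_uri},
            "oa:hasSelector": {
                "@type": "oa:FragmentSelector",
                "dcterms:conformsTo": {"@id": "http://www.w3.org/TR/media-frags/"},
                "rdf:value": f"xywh={x},{y},{w},{h}",
            },
        },
    }
    # Assumption: a body is attached only when some text for the line is known.
    if text is not None:
        annotation["oa:hasBody"] = {
            "@type": "cnt:ContentAsText",
            "cnt:chars": text,
        }
    return annotation


if __name__ == "__main__":
    example = line_strip_annotation("http://example.org/page1.jpg", 10, 20, 300, 40)
    print(json.dumps(example, indent=2))
```

The annotation is kept as a plain dictionary so it can be serialized with `json.dumps` without any RDF library.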

The FoLiA format generated by the ‘frog’ linguistic analysis tool is converted to OAC phase II. This conversion keeps only token and named entity information; all other linguistic information is ignored for now. Find the conversion code on GitHub.
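The selection step described above (keep tokens and named entities, ignore everything else) can be sketched in Python with the standard library. This is an illustration and not the conversion code: the sample document, the function name `extract_tokens_and_entities`, and the returned structure are assumptions, and the sketch stops before producing any OAC output.

```python
import xml.etree.ElementTree as ET

FOLIA_NS = "http://ilk.uvt.nl/folia"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Hand-written, simplified FoLiA fragment used only for this sketch.
SAMPLE = """<FoLiA xmlns="http://ilk.uvt.nl/folia" xml:id="doc">
  <text xml:id="doc.text">
    <s xml:id="doc.s.1">
      <w xml:id="doc.s.1.w.1"><t>Amsterdam</t><pos class="SPEC"/></w>
      <w xml:id="doc.s.1.w.2"><t>groeit</t><pos class="WW"/></w>
      <entities>
        <entity xml:id="doc.s.1.e.1" class="loc">
          <wref id="doc.s.1.w.1" t="Amsterdam"/>
        </entity>
      </entities>
    </s>
  </text>
</FoLiA>"""


def extract_tokens_and_entities(folia_xml):
    """Return (tokens, entities) from a FoLiA string.

    tokens maps each word id to its text; entities lists the entity class
    and the ids of the words it spans. Other annotation layers, such as
    part-of-speech tags, are skipped.
    """
    root = ET.fromstring(folia_xml)
    ns = {"f": FOLIA_NS}

    tokens = {}
    for word in root.iterfind(".//f:w", ns):
        text_el = word.find("f:t", ns)
        tokens[word.get(XML_ID)] = text_el.text if text_el is not None else ""

    entities = []
    for entity in root.iterfind(".//f:entity", ns):
        word_ids = [ref.get("id") for ref in entity.iterfind("f:wref", ns)]
        entities.append({"class": entity.get("class"), "tokens": word_ids})

    return tokens, entities


if __name__ == "__main__":
    toks, ents = extract_tokens_and_entities(SAMPLE)
    print(toks)
    print(ents)
```

Each extracted token or entity could then become the target of one annotation, with the word ids linking an entity back to its tokens.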