What's New

 toolService 
toolService
Description:
Annotald is a program for annotating parsed corpora in the Penn Treebank format. For more information on the format (as instantiated by the Penn Parsed Corpora of Historical English), see the documentation by Beatrice ...
 This item contains 2 files (2.89 MB).
 
Publicly Available
 toolService 
toolService
Description:
Yfirlestur.is is a public website where you can enter or submit your Icelandic text and have it checked for spelling and grammar errors. The tool also gives hints on words and structures that might not be appropriate, ...
 This item contains 2 files (1.27 MB).
 
Publicly Available
 toolService 
toolService
Description:
This is a pipeline for creating GreynirSeq domain-aware translation models. A valid checkpoint of a base translation model based on mBART25 can be finetuned as a domain translation model. The resulting model can be queried ...
 This item contains 2 files (4.54 MB).
 
Publicly Available

Most Viewed Items

Top Last Week
 corpus 
corpus
Description:
Talrómur is a public domain speech corpus for text-to-speech research and development. The corpus consists of 122,417 short audio clips of eight different speakers reading short sentences. The audio was recorded in 2020 ...
 This item contains 11 files (19.99 GB).
 
Publicly Available
 corpus 
corpus
Description:
A corpus of: * 70,000 sentences taken from general text, both before normalization and normalized using Regína normalizer * 70,000 sentences taken from sports news, both before normalization and normalized using Regína ...
 This item contains no files.
 
Publicly Available
 toolService 
toolService
Description:
A configurable machine translation web client for Google Translate V3 compatible backends. Stillanlegt vefviðmót fyrir vélþýðingakerfi sem styðja Google Translate V3 API sniðmátið.
 This item contains 1 file (242.65 KB).
 
Publicly Available