dc.contributor.author | Loftsson, Hrafn |
dc.date.accessioned | 2019-11-14T14:02:04Z |
dc.date.available | 2019-11-14T14:02:04Z |
dc.date.issued | 2019-11-14 |
dc.identifier.uri | http://hdl.handle.net/20.500.12537/8 |
dc.description | IceNLP is an open source Natural Language Processing (NLP) toolkit for analyzing and processing Icelandic text. The toolkit is implemented in Java. IceNLP er safn málgreiningartóla, gefið út með opnu leyfi, til þess að greina og vinna íslenskan texta. Tólin eru unnin í Java. |
dc.language.iso | isl |
dc.publisher | Reykjavik University |
dc.rights | GNU General Public License, version 2 |
dc.rights.uri | https://www.gnu.org/licenses/gpl-2.0.html |
dc.rights.label | PUB |
dc.source.uri | https://github.com/hrafnl/icenlp |
dc.subject | parsing |
dc.subject | shallow parsing |
dc.subject | tokenization |
dc.subject | pos-tagging |
dc.subject | lemmatization |
dc.title | IceNLP Natural Language Processing toolkit |
dc.type | toolService |
metashare.ResourceInfo#ContentInfo.detailedType | tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent | false |
hidden | false |
hasMetadata | false |
has.files | yes |
branding | Clarin IS Repository |
demo.uri | http://nlp.cs.ru.is:8080/IceNLPWeb/icenlp.html |
contact.person | Loftsson Hrafn hrafn@ru.is Scool of Compuer Science, Reykjavik University |
files.size | 14855072 |
files.count | 2 |
Files in this item
Download all files in item (14.17 MB)- Name
- IceNLPCore-12.11.zip
- Size
- 14.13 MB
- Format
- application/zip
- Description
- NLP toolkit
- MD5
- 6676e8be15a4e6972756a6fc2a3ca36b
- IceNLPCore
- ngrams
- corpus.txt10 kB
- computeNgrams173 B
- buildDictTagFreq296 B
- buildDictTagFreq.pl4 kB
- computeNgrams.pl5 kB
- models
- corpus.lex7 kB
- otb.lex2 MB
- otb.ngram2 MB
- corpus.orig.lex7 kB
- corpus.lambda143 B
- corpus.ngram19 kB
- otb.lambda148 B
- otb.orig.lex2 MB
- train215 B
- corpus.txt.freq10 kB
- bat
- icetagger
- icetaggerParam.sh700 B
- paramDefaultWithDicts.txt2 kB
- icetaggerBig.sh89 B
- icetagger.bat113 B
- paramDefault.txt1 kB
- mogginn.out112 kB
- mogginn.txt62 kB
- icetagger.sh89 B
- icetaggerApertium.sh159 B
- demo
- test3.txt293 B
- test2.txt274 B
- paramDefault.txt2 kB
- test9.txt61 B
- tagAndParse.sh88 B
- test.txt236 B
- tagAndParseGUI.sh91 B
- parse.out10 kB
- prufa.txt23 B
- tagAndParse.bat108 B
- tagAndParseGUI.bat91 B
- tritagger
- tritaggerBig.sh90 B
- paramDefault.txt765 B
- tritagger.sh89 B
- paramDefaultWithDicts.txt1 kB
- tritaggerParam.sh499 B
- test.txt422 B
- test.out28 kB
- tritagger.bat113 B
- icemorphy
- paramAnalyzeWithDicts.txt1 kB
- paramFill.txt682 B
- icemorphy.bat92 B
- test.txt365 B
- paramAnalyze.txt629 B
- test.dict81 kB
- paramFillWithDicts.txt1 kB
- icemorphy.sh92 B
- srxsegmentizer
- testinput.txt1 kB
- srxsegmentizer.sh186 B
- srxsegmentizer.bat185 B
- readme.txt471 B
- iceparser
- iceparser.bat540 B
- iceparserOutOld.sh4 kB
- 200sent.txt45 kB
- iceparser.sh152 B
- iceparserOut.bat540 B
- 5.sent1 kB
- errorSearch
- pp_errors.sh102 B
- vp_errors.sh102 B
- np_errors.sh102 B
- 200sent_func.gdc90 kB
- iceparserOut.sh155 B
- iceNER
- prufa.txt555 B
- iceNER.sh2 kB
- tokenizer
- test.txt59 B
- mbl.txt643 B
- tokenize.sh80 B
- tokenize.bat377 B
- lemmald
- testinput.txt46 B
- lemmatize.sh87 B
- plaintext.txt49 B
- readme.txt1 kB
- lemmatize.bat87 B
- icetagger
- doc
- Tagset.pdf211 kB
- IceNLP.pdf333 kB
- dist
- IceNLPCore.jar5 MB
- lib
- junit-4.8.2.jar231 kB
- commons-io-1.4.jar106 kB
- segment-1.3.3.jar164 kB
- commons-logging-1.1.1.jar59 kB
- commons-cli-1.2.jar40 kB
- xerces.jar1 MB
- dict
- icetagger
- otb.verbObj.dict51 kB
- otbTags.freq.dict5 kB
- otb.verbPrep.dict129 kB
- prefixes.dict160 B
- baseEndings.dict39 kB
- otb.dict1 MB
- otb.endingsProper.dict102 kB
- otb.apertium.dict28 kB
- otb.verbAdverb.dict5 kB
- baseDict.dict78 kB
- idioms.dict6 kB
- otb.endings.dict220 kB
- tokenizer
- lexicon.txt7 kB
- lemmald
- rule_database_utf8.txt6 MB
- postfixRules.txt175 B
- readme.txt249 B
- rule_hand_written_utf8.txt753 B
- makeRules.sh437 B
- settings.txt306 B
- rule_database_utf8.dat2 MB
- iceNER
- location.txt12 kB
- tritagger
- idioms.dict6 kB
- baseDict.dict75 kB
- formald
- segment.srx92 kB
- icetagger
- ngrams
- Name
- icenlp-projectSample.jpeg
- Size
- 37.09 KB
- Format
- JPEG image
- Description
- PageSample
- MD5
- fdbf7ea5a8d090c900feb3eed50dcc44