dc.contributor.author | Loftsson, Hrafn |
dc.contributor.author | Rögnvaldsson, Eiríkur |
dc.contributor.author | Pálsson, Gunnar |
dc.date.accessioned | 2021-07-12T22:59:16Z |
dc.date.available | 2021-07-12T22:59:16Z |
dc.date.issued | 2021-07-07 |
dc.identifier.uri | http://hdl.handle.net/20.500.12537/122 |
dc.description | IceParser is a shallow parser for Icelandic. The parser comprises a sequence of finite-state transducers, which add syntactic information, in an incremental manner, into the input text. The input to IceParser is part-of-speech (PoS) tagged text and it produces output which includes annotation of both constituent structure and syntactic functions. The distributed file contains the entirety of IceNLP, a toolkit of various NLP tools for processing and analysing Icelandic. The current version of IceParser in IceNLP has been specifically changed and updated to be able to annotate input tagged with the revised Icelandic POS tagset. --- IceParser er hlutaþáttari fyrir íslensku. Þáttarinn samanstendur af röð af stöðuferjöldum sem bæta setningafræðilegum upplýsingum inn í inntakstextann á stigvaxandi hátt. Inntakið í IceParser er markaður texti og þáttarinn skilar af sér úttaki sem inniheldur bæði merkingar á setningaliðum og setningafræðilegum hlutverkum. Skráin sem fylgir inniheldur allt IceNLP, þ.e. safn tóla til að vinna með og greina íslensku. Núverandi útgáfa af IceParser í IceNLP hefur verið breytt og uppfærð til að greina og merkja inntak sem er markað með hinu endurskoðað íslenska markamengi. |
dc.language.iso | isl |
dc.publisher | Reykjavik University |
dc.relation.isreferencedby | https://aclanthology.org/W07-2419.pdf |
dc.relation.isreferencedby | https://skemman.is/bitstream/1946/39413/1/Adapting%20and%20Improving%20a%20Shallow%20Parser%20-%20Gunnar%20P%c3%a1lsson.pdf |
dc.rights | GNU General Public Licence, version 3 |
dc.rights.uri | https://opensource.org/licenses/GPL-3.0 |
dc.rights.label | PUB |
dc.source.uri | https://github.com/hrafnl/icenlp |
dc.subject | icelandic |
dc.subject | pos-tagging |
dc.subject | shallow parsing |
dc.subject | icenlp |
dc.title | IceParser 1.5.0 |
dc.type | toolService |
metashare.ResourceInfo#ContentInfo.detailedType | tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent | true |
has.files | yes |
branding | Clarin IS Repository |
demo.uri | http://nlp.cs.ru.is/ |
contact.person | Hrafn Loftsson hrafn@ru.is Reykjavik University |
sponsor | Ministry of Education, Science and Culture Parser (I5) Language Technology for Icelandic 2019-2023 nationalFunds |
files.size | 65726410 |
files.count | 1 |
Files in this item
- Name
- IceNLP-1.5.0.zip
- Size
- 62.68 MB
- Format
- application/zip
- Description
- Latest release on GitHub
- MD5
- 9117ecbfec076fb7740687ab0687e47d
- IceNLP
- ngrams
- corpus.txt10 kB
- computeNgrams173 B
- buildDictTagFreq296 B
- buildDictTagFreq.pl4 kB
- computeNgrams.pl5 kB
- train215 B
- corpus.txt.freq10 kB
- models
- corpus.lex7 kB
- otb.ngram2 MB
- otb.lex1 MB
- corpus.orig.lex7 kB
- corpus.ngram19 kB
- corpus.lambda143 B
- otb.lambda143 B
- bat
- icetagger
- paramDefaultDicts.txt2 kB
- icetagger.bat113 B
- paramDefault.txt1 kB
- test.out100 B
- paramDefaultHmm.txt1 kB
- mogginn.out65 kB
- mogginn.txt17 kB
- icetagger.sh89 B
- icetaggerApertium.sh159 B
- paramDefaultBin.txt1 kB
- demo
- test2.txt274 B
- tagAndParseErrors.sh97 B
- paramDefault.txt2 kB
- tagger.out42 B
- tagAndParse.sh88 B
- test.txt236 B
- testErrors.txt41 B
- tagAndParseGUI.sh91 B
- parse.out731 B
- tagAndParse.bat108 B
- tagAndParseGUI.bat91 B
- tritagger
- paramDefault.txt767 B
- tritagger.sh90 B
- paramDefaultBin.txt801 B
- test.txt422 B
- paramDefaultDicts.txt1 kB
- tritagger.bat113 B
- mogginn.txt17 kB
- icemorphy
- paramAnalyzeWithDicts.txt1 kB
- paramFill.txt682 B
- icemorphy.bat92 B
- test.txt365 B
- paramAnalyze.txt629 B
- test.dict81 kB
- paramFillWithDicts.txt1 kB
- icemorphy.sh92 B
- srxsegmentizer
- testinput.txt1 kB
- srxsegmentizer.sh186 B
- srxsegmentizer.bat185 B
- readme.txt471 B
- iceparser
- iceparser.bat540 B
- wordpl2sentpl.sh359 B
- iceparserOutOld.sh4 kB
- 200sent.txt45 kB
- iceparser.sh152 B
- iceparserOut.bat540 B
- 5.sent1 kB
- errorSearch
- pp_errors.sh102 B
- vp_errors.sh102 B
- np_errors.sh102 B
- 200sent_func.gdc90 kB
- testData
- test.tags106 kB
- dev.tags.sent.parsed1 MB
- dev.tags1007 kB
- dev.tags.sent.parsed.orig1 MB
- test.gold.sent106 kB
- dev.tags.sent1002 kB
- test.gold.sent.gold173 kB
- test.gold106 kB
- test.gold.sent.parsed173 kB
- readme.txt355 B
- prufa.out134 B
- iceparserOut.sh155 B
- iceNER
- prufa.txt561 B
- iceNER.sh2 kB
- tokenizer
- test3.txt38 B
- testTokenizer2.sh52 B
- test2.out58 B
- test2.txt58 B
- test.out307 B
- test.txt299 B
- tokenize.bat377 B
- tokenize.sh80 B
- testTokenizer3.sh52 B
- testTokenizer.sh50 B
- test3.out34 B
- icestagger
- lemmald
- testinput.txt46 B
- lemmatize.sh87 B
- plaintext.txt49 B
- readme.txt1 kB
- lemmatize.bat87 B
- icetagger
- doc
- Tagset.pdf211 kB
- IceNLP.pdf473 kB
- lib
- junit-4.8.2.jar231 kB
- commons-io-1.4.jar106 kB
- segment-1.3.3.jar164 kB
- commons-logging-1.1.1.jar59 kB
- commons-cli-1.2.jar40 kB
- xerces.jar1 MB
- dist
- IceNLPCore.jar8 MB
- dict
- icetagger
- otb.verbObj.dict51 kB
- otbTags.freq.dict5 kB
- otb.verbPrep.dict129 kB
- prefixes.dict160 B
- baseEndings.dict39 kB
- otb.dict1 MB
- otb.endingsProper.dict102 kB
- otb.apertium.dict28 kB
- otb.verbAdverb.dict5 kB
- baseDict.dict79 kB
- idioms.dict6 kB
- otb.endings.dict220 kB
- BIN
- bin2Otb.sh498 B
- buildDictFromBin.pl2 kB
- bin2Icetagger.sh370 B
- combineDicts.pl2 kB
- README195 B
- combineFreqDicts.pl1 kB
- bin2Tritagger.sh529 B
- bin2Stagger.sh149 B
- extractBinData.sh145 B
- bin2Otb.pl12 kB
- combineOtbFreqBinTri263 B
- tokenizer
- lexicon.txt7 kB
- tritagger
- idioms.dict6 kB
- baseDict.dict75 kB
- lemmald
- rule_database_utf8.txt6 MB
- postfixRules.txt175 B
- rule_hand_written_utf8.txt753 B
- readme.txt249 B
- settings.txt306 B
- makeRules.sh437 B
- rule_database_utf8.dat2 MB
- iceNER
- location.txt12 kB
- formald
- segment.srx92 kB
- icetagger
- ngrams