Show simple item record

 
dc.contributor.author Loftsson, Hrafn
dc.contributor.author Rögnvaldsson, Eiríkur
dc.contributor.author Pálsson, Gunnar
dc.date.accessioned 2021-07-12T22:59:16Z
dc.date.available 2021-07-12T22:59:16Z
dc.date.issued 2021-07-07
dc.identifier.uri http://hdl.handle.net/20.500.12537/122
dc.description IceParser is a shallow parser for Icelandic. The parser comprises a sequence of finite-state transducers, which add syntactic information, in an incremental manner, into the input text. The input to IceParser is part-of-speech (PoS) tagged text and it produces output which includes annotation of both constituent structure and syntactic functions. The distributed file contains the entirety of IceNLP, a toolkit of various NLP tools for processing and analysing Icelandic. The current version of IceParser in IceNLP has been specifically changed and updated to be able to annotate input tagged with the revised Icelandic POS tagset. --- IceParser er hlutaþáttari fyrir íslensku. Þáttarinn samanstendur af röð af stöðuferjöldum sem bæta setningafræðilegum upplýsingum inn í inntakstextann á stigvaxandi hátt. Inntakið í IceParser er markaður texti og þáttarinn skilar af sér úttaki sem inniheldur bæði merkingar á setningaliðum og setningafræðilegum hlutverkum. Skráin sem fylgir inniheldur allt IceNLP, þ.e. safn tóla til að vinna með og greina íslensku. Núverandi útgáfa af IceParser í IceNLP hefur verið breytt og uppfærð til að greina og merkja inntak sem er markað með hinu endurskoðað íslenska markamengi.
dc.language.iso isl
dc.publisher Reykjavik University
dc.relation.isreferencedby https://aclanthology.org/W07-2419.pdf
dc.relation.isreferencedby https://skemman.is/bitstream/1946/39413/1/Adapting%20and%20Improving%20a%20Shallow%20Parser%20-%20Gunnar%20P%c3%a1lsson.pdf
dc.rights GNU General Public Licence, version 3
dc.rights.uri https://opensource.org/licenses/GPL-3.0
dc.rights.label PUB
dc.source.uri https://github.com/hrafnl/icenlp
dc.subject icelandic
dc.subject pos-tagging
dc.subject shallow parsing
dc.subject icenlp
dc.title IceParser 1.5.0
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
has.files yes
branding Clarin IS Repository
demo.uri http://nlp.cs.ru.is/
contact.person Hrafn Loftsson hrafn@ru.is Reykjavik University
sponsor Ministry of Education, Science and Culture Parser (I5) Language Technology for Icelandic 2019-2023 nationalFunds
files.size 65726410
files.count 1


 Files in this item

This item is
Publicly Available
and licensed under:
GNU General Public Licence, version 3
Icon
Name
IceNLP-1.5.0.zip
Size
62.68 MB
Format
application/zip
Description
Latest release on GitHub
MD5
9117ecbfec076fb7740687ab0687e47d
 Download file  Preview
 File Preview  
  • IceNLP
    • ngrams
      • corpus.txt10 kB
      • computeNgrams173 B
      • buildDictTagFreq296 B
      • buildDictTagFreq.pl4 kB
      • computeNgrams.pl5 kB
      • train215 B
      • corpus.txt.freq10 kB
      • models
        • corpus.lex7 kB
        • otb.ngram2 MB
        • otb.lex1 MB
        • corpus.orig.lex7 kB
        • corpus.ngram19 kB
        • corpus.lambda143 B
        • otb.lambda143 B
    • bat
      • icetagger
        • paramDefaultDicts.txt2 kB
        • icetagger.bat113 B
        • paramDefault.txt1 kB
        • test.out100 B
        • paramDefaultHmm.txt1 kB
        • mogginn.out65 kB
        • mogginn.txt17 kB
        • icetagger.sh89 B
        • icetaggerApertium.sh159 B
        • paramDefaultBin.txt1 kB
      • demo
        • test2.txt274 B
        • tagAndParseErrors.sh97 B
        • paramDefault.txt2 kB
        • tagger.out42 B
        • tagAndParse.sh88 B
        • test.txt236 B
        • testErrors.txt41 B
        • tagAndParseGUI.sh91 B
        • parse.out731 B
        • tagAndParse.bat108 B
        • tagAndParseGUI.bat91 B
      • tritagger
        • paramDefault.txt767 B
        • tritagger.sh90 B
        • paramDefaultBin.txt801 B
        • test.txt422 B
        • paramDefaultDicts.txt1 kB
        • tritagger.bat113 B
        • mogginn.txt17 kB
      • icemorphy
        • paramAnalyzeWithDicts.txt1 kB
        • paramFill.txt682 B
        • icemorphy.bat92 B
        • test.txt365 B
        • paramAnalyze.txt629 B
        • test.dict81 kB
        • paramFillWithDicts.txt1 kB
        • icemorphy.sh92 B
      • srxsegmentizer
        • testinput.txt1 kB
        • srxsegmentizer.sh186 B
        • srxsegmentizer.bat185 B
        • readme.txt471 B
      • iceparser
        • iceparser.bat540 B
        • wordpl2sentpl.sh359 B
        • iceparserOutOld.sh4 kB
        • 200sent.txt45 kB
        • iceparser.sh152 B
        • iceparserOut.bat540 B
        • 5.sent1 kB
        • errorSearch
          • pp_errors.sh102 B
          • vp_errors.sh102 B
          • np_errors.sh102 B
        • 200sent_func.gdc90 kB
        • testData
          • test.tags106 kB
          • dev.tags.sent.parsed1 MB
          • dev.tags1007 kB
          • dev.tags.sent.parsed.orig1 MB
          • test.gold.sent106 kB
          • dev.tags.sent1002 kB
          • test.gold.sent.gold173 kB
          • test.gold106 kB
          • test.gold.sent.parsed173 kB
          • readme.txt355 B
          • prufa.out134 B
        • iceparserOut.sh155 B
      • iceNER
        • prufa.txt561 B
        • iceNER.sh2 kB
      • tokenizer
        • test3.txt38 B
        • testTokenizer2.sh52 B
        • test2.out58 B
        • test2.txt58 B
        • test.out307 B
        • test.txt299 B
        • tokenize.bat377 B
        • tokenize.sh80 B
        • testTokenizer3.sh52 B
        • testTokenizer.sh50 B
        • test3.out34 B
      • icestagger
        • trainIceStagger.sh831 B
        • tagIceStagger.sh92 B
        • trainIceStaggerBin.sh344 B
        • sentences.txt961 B
        • corpora
          • README40 B
        • tagIceStaggerBIN.sh83 B
        • icestagger.sh127 B
        • models
          • otb.bin114 MB
          • README48 B
      • lemmald
        • testinput.txt46 B
        • lemmatize.sh87 B
        • plaintext.txt49 B
        • readme.txt1 kB
        • lemmatize.bat87 B
    • doc
      • Tagset.pdf211 kB
      • IceNLP.pdf473 kB
    • lib
      • junit-4.8.2.jar231 kB
      • commons-io-1.4.jar106 kB
      • segment-1.3.3.jar164 kB
      • commons-logging-1.1.1.jar59 kB
      • commons-cli-1.2.jar40 kB
      • xerces.jar1 MB
    • dist
      • IceNLPCore.jar8 MB
    • dict
      • icetagger
        • otb.verbObj.dict51 kB
        • otbTags.freq.dict5 kB
        • otb.verbPrep.dict129 kB
        • prefixes.dict160 B
        • baseEndings.dict39 kB
        • otb.dict1 MB
        • otb.endingsProper.dict102 kB
        • otb.apertium.dict28 kB
        • otb.verbAdverb.dict5 kB
        • baseDict.dict79 kB
        • idioms.dict6 kB
        • otb.endings.dict220 kB
      • BIN
        • bin2Otb.sh498 B
        • buildDictFromBin.pl2 kB
        • bin2Icetagger.sh370 B
        • combineDicts.pl2 kB
        • README195 B
        • combineFreqDicts.pl1 kB
        • bin2Tritagger.sh529 B
        • bin2Stagger.sh149 B
        • extractBinData.sh145 B
        • bin2Otb.pl12 kB
        • combineOtbFreqBinTri263 B
      • tokenizer
        • lexicon.txt7 kB
      • tritagger
        • idioms.dict6 kB
        • baseDict.dict75 kB
      • lemmald
        • rule_database_utf8.txt6 MB
        • postfixRules.txt175 B
        • rule_hand_written_utf8.txt753 B
        • readme.txt249 B
        • settings.txt306 B
        • makeRules.sh437 B
        • rule_database_utf8.dat2 MB
      • iceNER
        • location.txt12 kB
      • formald
        • segment.srx92 kB

Show simple item record