dc.contributor.author | Símonarson, Haukur Barri |
dc.contributor.author | Óladóttir, Hulda |
dc.contributor.author | Snæbjarnarson, Vésteinn |
dc.contributor.author | Þorsteinsson, Vilhjálmur |
dc.date.accessioned | 2021-09-30T20:48:01Z |
dc.date.available | 2021-09-30T20:48:01Z |
dc.date.issued | 2021-09-30 |
dc.identifier.uri | http://hdl.handle.net/20.500.12537/149 |
dc.description | The Miðeind neural constituency parser is an experimental variant of the Berkeley neural parser architecture. It is self-contained and conveniently plug-and-play via a docker image. Currently POS tags are not part of its constituency trees. The input to the parser is a full path to a text file (${INPUT_FILE}) where each line contains a sentence that will be parsed. No prior tokenization is required. The output file will be located in ${OUTPUT_DIR}/output.txt and the output format is line-separated bracketed trees . To run the parser use the following: docker run --volume ${INPUT_FILE}:/data/input.txt --volume ${OUTPUT_DIR}:/data/ mideind/neural-parser:${TAG} The output follows the bracketed tree format described at https://www.ling.upenn.edu/~janabeck/tutorial.html --- Tauganetsþáttari Miðeindar er tilraunaafbrigði af Berkeley tauganetsþáttaranum. Þáttarinn skilar stofnliðatrjám án POS-marka (eins og er). Inntakið í þáttarann er full algjör slóð texta að skrá (${INPUT_FILE}) þar sem hver lína geymir eina málsgrein. Eftir keyrslu má finna úttakið í skránni ${OUTPUT_DIR}/output.txt þar sem úttakssniðið er tré á svigaformi með auðri línu á milli . Til að keyra þáttarann skal nota: docker run --volume ${INPUT_FILE}:/data/input.txt --volume ${OUTPUT_DIR}:/data/ mideind/neural-parser:${TAG} (edited) |
dc.language.iso | isl |
dc.publisher | MIðeind ehf |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.subject | constituency parser |
dc.subject | parsing |
dc.title | Miðeind's Neural Constituency Parser - v. 1.0 |
dc.type | toolService |
metashare.ResourceInfo#ContentInfo.detailedType | tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent | true |
has.files | yes |
branding | Clarin IS Repository |
contact.person | Haukur Barri Símonarson haukur@mideind.is Miðeind ehf |
sponsor | Ministry of Education, Science and Culture I5 - Parser Language Technology for Icelandic 2019-2023 nationalFunds |
files.size | 6256098818 |
files.count | 2 |
Files in this item
Download all files in item (5.83 GB)This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name
- README
- Size
- 1.18 KB
- Format
- Unknown
- Description
- Unknown
- MD5
- 5468286fc12a42ddd6ae2e300ded61cf
- Name
- neural-parser.gz
- Size
- 5.83 GB
- Format
- application/gzip
- Description
- Unknown
- MD5
- d29e9c56cba39775c2e6423f05425354