Show simple item record

 
dc.contributor.author Þorsteinsson, Vilhjálmur
dc.contributor.author Óladóttir, Hulda
dc.contributor.author Þórðarson, Sveinbjörn
dc.date.accessioned 2022-09-26T14:36:16Z
dc.date.available 2022-09-26T14:36:16Z
dc.date.issued 2022-09-23
dc.identifier.uri http://hdl.handle.net/20.500.12537/267
dc.description BinPackage is a Python Package that embeds the vocabulary of the DMII (https://bin.arnastofnun.is) and offers various lookups and queries of the data. The database, maintained by The Árni Magnússon Institute for Icelandic Studies, contains over 6.5 million entries, over 3.1 million unique word forms, and about 300,000 distinct lemmas. The database has been encapsulated in an easy-to-install Python package, and compressed from 400+ megabyte CSV file to an ~80 megabyte indexed binary structure. More information at: https://github.com/mideind/BinPackage BinPackage er Python-pakki utan um BÍN, Beygingarlýsingu íslensks nútímamáls (https://bin.arnastofnun.is), sem inniheldur yfir 6,5 milljónir færslna, 3,1 milljón einstakra orðmynda og um 300.000 stakar lemmur. Stofnun Árna Magnússonar í íslenskum fræðum heldur utan um gagnagrunninn. Gagnagrunninum, um 400 megabæta CSV-skrá, hefur verið pakkað í um 80 megabæta tvíundarbyggingu með vísum. Frekari upplýsingar á: https://github.com/mideind/BinPackage
dc.language.iso isl
dc.publisher Miðeind ehf.
dc.relation.replaces http://hdl.handle.net/20.500.12537/177
dc.rights The MIT License (MIT)
dc.rights.uri https://opensource.org/licenses/mit-license.php
dc.rights.label PUB
dc.source.uri https://github.com/mideind/BinPackage/releases/tag/0.4.4
dc.subject vocabulary
dc.subject dictionary
dc.subject nlp
dc.title BinPackage 0.4.4 (22.10)
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
has.files yes
branding Clarin IS Repository
contact.person Vilhjálmur Þorsteinsson mideind@mideind.is Miðeind ehf.
sponsor Ministry of Education, Science and Culture DMII in a Python Package (G4b) Language Technology for Icelandic 2019-2023 nationalFunds
files.size 9687988
files.count 1


 Files in this item

This item is
Publicly Available
and licensed under:
The MIT License (MIT)
Icon
Name
BinPackage-0.4.4.zip
Size
9.24 MB
Format
application/zip
Description
Unknown
MD5
b4cb64afdec3f7a57e110afc95fa029c
 Download file  Preview
 File Preview  
  • BinPackage-0.4.4
    • src
      • islenska
        • __init__.py1 kB
        • settings.py10 kB
        • resources
          • prefixes.txt6 MB
          • suffixes.txt37 MB
          • ord.auka.csv2 MB
          • ord.add.csv559 kB
          • ord.suffixes.csv27 kB
          • systematic_additions.csv2 MB
        • config
          • BinErrata.conf33 kB
          • Adjectives.conf26 kB
          • BinPackage.conf1 kB
          • Prefs.conf7 kB
        • bin.cpp7 kB
        • bin.h1 kB
        • dawgdictionary.py17 kB
        • bindb.py42 kB
        • version.py22 B
        • py.typed0 B
        • bincompress.py31 kB
        • basics.py15 kB
        • bin_build.py2 kB
    • setup.py4 kB
    • README.md36 kB
    • .gitignore974 B
    • img
      • greynir-logo-large.png56 kB
      • MideindLogoVert400.png23 kB
    • tools
      • dawgbuilder.py32 kB
      • binpack.py49 kB
    • test
      • test_ord.py5 kB
      • test_bin.py31 kB
    • .gitattributes717 B
    • wheels.sh937 B
    • .github
    • release.sh500 B
    • LICENSE1 kB
    • MANIFEST.in501 B
    • build_wheels.sh894 B

Show simple item record