Sýna einfalda færslu atriðis
dc.contributor.author
Þorsteinsson, Vilhjálmur
dc.contributor.author
Óladóttir, Hulda
dc.date.accessioned
2025-09-19T11:47:49Z
dc.date.available
2025-09-19T11:47:49Z
dc.date.issued
2022
dc.identifier.uri
http://hdl.handle.net/20.500.12537/368
dc.description
Icegrams is a Python 3 package that encapsulates a large trigram library for Icelandic. 14 million unique trigrams and their frequency counts are heavily compressed using radix tries and quasi-succinct indices employing Elias-Fano encoding. This enables the ~43 megabyte compressed trigram file to be mapped directly into memory, with no ex ante decompression, for fast queries (typically ~10 microseconds per lookup). More information at: https://github.com/mideind/Icegrams
Icegrams er Python 3 pakki sem inniheldur stórt safn orðaþrennda (trigrams) fyrir íslensku. Í safninu eru um 14 milljónir ólíkra þrennda ásamt tíðniupplýsingum. Öllu safninu hefur verið þjappað niður í u.þ.b. 43 megabæti sem varpað er beint í minni þannig að uppfletting er mjög hraðvirk (~10 míkrósekúndur fyrir hverja uppflettingu). Frekari upplýsingar á: https://github.com/mideind/Icegrams
dc.language.iso
isl
dc.publisher
Miðeind ehf.
dc.relation.replaces
http://hdl.handle.net/20.500.12537/176
dc.rights
The MIT License (MIT)
dc.rights.uri
https://opensource.org/licenses/mit-license.php
dc.rights.label
PUB
dc.source.uri
https://github.com/mideind/Icegrams/releases/tag/1.1.3-final
dc.subject
language model
dc.subject
trigrams
dc.subject
ngrams
dc.title
Icegrams v1.1.3 (2025-09-15)
dc.type
languageDescription
metashare.ResourceInfo#ContentInfo.detailedType
other
metashare.ResourceInfo#ContentInfo.mediaType
text
has.files
yes
branding
Clarin IS Repository
contact.person
Vilhjálmur Þorsteinsson mideind@mideind.is Miðeind ehf.
sponsor
Ministry of Education, Science and Culture Word lists and language models (L4) Language Technology for Icelandic 2019-2023 nationalFunds
size.info
14000000 trigrams
files.size
157450
files.count
2
Files in this item
Download all files in item (153.76
KB)
×
Large Size
The requested files are being packed into one large file. This process can take some time, please be patient.
Continue
Cancel
This item is
Publicly Available
and licensed under:
The MIT License (MIT)
Name
Icegrams-1.1.3-final.tar.gz
Size
71.11
KB
Format
application/gzip
Description
Icegrams source code
MD5
7dd7fad5a38d44cea1246d7adffe9bf9
Download file
Preview
Icegrams-1.1.3-final src icegrams trie.h 3 kB trie_build.py 4 kB trie.cpp 24 kB __init__.py 1 kB resources correct.txt 57 kB trigrams.bin 133 B split.txt 8 kB delete.txt 1 kB py.typed 0 B trie.py 12 kB ngrams.py 68 kB setup.py 539 B .gitignore 904 B README.rst 14 kB old build_wheels.sh 898 B wheels.sh 937 B test.py 1 kB release.sh 522 B pyproject.toml 1 kB test .gitattributes 72 B utils .github workflows python-package.yml 732 B wheels.yml 1 kB doc LICENSE.txt 1 kB MANIFEST.in 301 B
Name
Icegrams-1.1.3-final.zip
Size
82.65
KB
Format
application/zip
Description
Icegrams source code
MD5
e8232e97ec42a9dac738c9779d9d5c1b
Download file
Preview
Icegrams-1.1.3-final src icegrams trie.h 3 kB trie_build.py 4 kB trie.cpp 24 kB __init__.py 1 kB resources correct.txt 57 kB trigrams.bin 133 B split.txt 8 kB delete.txt 1 kB py.typed 0 B trie.py 12 kB ngrams.py 68 kB setup.py 539 B .gitignore 904 B README.rst 14 kB old build_wheels.sh 898 B wheels.sh 937 B test.py 1 kB release.sh 522 B pyproject.toml 1 kB test .gitattributes 72 B utils .github workflows python-package.yml 732 B wheels.yml 1 kB doc LICENSE.txt 1 kB MANIFEST.in 301 B
Sýna einfalda færslu atriðis