האיגוד הישראלי לטכנולוגיות שפת אנוש
الرابطة الإسرائيلية لتكنولوجيا اللغة البشرية
The Israeli Association of Human Language Technologies

Hebrew & Arabic Corpus Linguistics Infrastructure
Universal Dependencies (UD) Tagging
Universal Dependencies (UD) is a framework for consistent annotation of grammar (parts of speech, morphological features, and syntactic dependencies) across different human languages. UD is an open community effort with over 300 contributors producing more than 150 treebanks in 90 languages. If you’re new to UD, you should start by reading the first part of the Short Introduction and then browsing the annotation guidelines.
https://universaldependencies.org/
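To make the three annotation layers mentioned above concrete, here is a minimal sketch of reading one sentence in CoNLL-U, the ten-column tab-separated format in which UD treebanks are distributed. The helper names (`parse_feats`, `parse_conllu_sentence`) and the toy English sentence are assumptions written for this illustration; the annotation is hand-made and not taken from any treebank.

```python
# The ten columns of a CoNLL-U token line, in order.
CONLLU_FIELDS = ["id", "form", "lemma", "upos", "xpos",
                 "feats", "head", "deprel", "deps", "misc"]


def parse_feats(feats):
    """Turn 'Number=Plur|Person=3' into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def parse_conllu_sentence(block):
    """Parse one CoNLL-U sentence block into a list of token dicts (hypothetical helper)."""
    tokens = []
    for line in block.strip().splitlines():
        # Skip blank lines and sentence-level comments such as '# text = ...'.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != len(CONLLU_FIELDS):
            raise ValueError(f"expected 10 columns, got {len(cols)}: {line!r}")
        token = dict(zip(CONLLU_FIELDS, cols))
        # Skip multiword-token ranges ('1-2') and empty nodes ('1.1');
        # this sketch keeps only the syntactic words with integer IDs.
        if not token["id"].isdigit():
            continue
        token["id"] = int(token["id"])
        token["head"] = int(token["head"])  # 0 marks the root of the tree
        token["feats"] = parse_feats(token["feats"])
        tokens.append(token)
    return tokens


# Hand-written toy example, for illustration only.
SAMPLE = "\n".join([
    "# text = Dogs bark.",
    "1\tDogs\tdog\tNOUN\t_\tNumber=Plur\t2\tnsubj\t_\t_",
    "2\tbark\tbark\tVERB\t_\tMood=Ind|Tense=Pres|VerbForm=Fin\t0\troot\t_\tSpaceAfter=No",
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])

if __name__ == "__main__":
    sentence = parse_conllu_sentence(SAMPLE)
    for tok in sentence:
        # Look up the head word by its 1-based ID; 0 means the token is the root.
        head = "ROOT" if tok["head"] == 0 else sentence[tok["head"] - 1]["form"]
        print(tok["form"], tok["upos"], tok["feats"], tok["deprel"], "->", head)
```

Each token carries its part of speech (`upos`), morphological features (`feats`), and a dependency relation (`deprel`) to its `head`. The sketch ignores ID-range lines, which the format uses for multiword tokens; a production reader would likely need to keep them, and an established CoNLL-U library may be a better choice than hand-rolled parsing.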
UD Hebrew
https://universaldependencies.org/he/index.html
UD Arabic
https://universaldependencies.org/ar/index.html
AlephBERT
A Hebrew Large Pre-Trained Language Model
https://paperswithcode.com/paper/alephbert-a-hebrew-large-pre-trained-language
HebPipe
Hebrew NLP Pipeline
https://github.com/amir-zeldes/HebPipe/tree/dev
Arborator
The Arborator software is aimed at collaboratively annotating dependency corpora.
https://arborator.ilpga.fr
doccano
doccano is an open-source text annotation tool for humans. It provides annotation features for text classification, sequence labeling, and sequence-to-sequence tasks.
https://github.com/doccano/doccano