This commit is contained in:
Txus 2020-04-25 15:16:40 +02:00 коммит произвёл GitHub
Родитель 73d6a2f901
Коммит 4e817ff418
Не найден ключ, соответствующий данной подписи
Идентификатор ключа GPG: 4AEE18F83AFDEB23
1 изменённых файлов: 25 добавлений и 0 удалений

Просмотреть файл

@ -0,0 +1,25 @@
---
language: catalan
---
# CALBERT: a Catalan Language Model
## Introduction
CALBERT is an open-source language model for Catalan based on the ALBERT architecture.
It is now available on Hugging Face in its `base-uncased` version, and was pretrained on the [OSCAR dataset](https://traces1.inria.fr/oscar/).
For further information or requests, please go to the [GitHub repository](https://github.com/codegram/calbert)
## Pre-trained models
| Model | Arch. | Training data |
|-------------------------------------|------------------|-----------------------------------|
| `codegram` / `calbert-base-uncased` | Base (uncased) | OSCAR (4.3 GB of text) |
## Authors
CALBERT was trained and evaluated by [Txus Bach](https://twitter.com/txustice), as part of [Codegram](https://www.codegram.com)'s applied research.