Update README.md
README.md
CHANGED
@@ -8,7 +8,7 @@ language:
 # AraT5 CODAfication Model
 
 ## Model description
-**AraT5
+**AraT5 CODA** is a text normalization model that normalizes dialectal Arabic text into the Conventional Orthography for Dialectal Arabic (CODA).
 The model was built by fine-tuning [AraT5-v2](https://huggingface.co/UBC-NLP/AraT5v2-base-1024) on the [MADAR CODA](https://camel.abudhabi.nyu.edu/madar-coda-corpus/) dataset.
 Our fine-tuning procedure and the hyperparameters we used can be found in our paper *"[Exploiting Dialect Identification
 in Automatic Dialectal Text Normalization](https://arxiv.org/abs/2407.03020)."* Our fine-tuning code and data can be found [here](https://github.com/CAMeL-Lab/codafication).