CAMeL-Lab
/

arat5-coda

text2text-generation

text-generation-inference

Model card Files Files and versions

balhafni commited on Jul 6, 2024

Commit

653ef09

·

verified ·

1 Parent(s): dc0b443

Update README.md

Files changed (1) hide show

README.md +26 -1

README.md CHANGED Viewed

@@ -2,6 +2,9 @@
 license: mit
 language:
 - ar
 ---
@@ -15,10 +18,32 @@ in Automatic Dialectal Text Normalization](https://arxiv.org/abs/2407.03020)."*
 ## Intended uses
-You can use the **AraT5** CODAfication model as part of Hugging Face's transformers >= 4.22.2.
 ## How to use
 ## Citation
 ```bibtex

 license: mit
 language:
 - ar
+widget:
+ - text: 'اثنين همبرقر واثنين قهوة، لوسمحت. بآخذهم تيك اوي.'
 ---
 ## Intended uses
+You can use the **AraT5 CODA** model as part of Hugging Face's transformers >= 4.22.2.
 ## How to use
+```python
+from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
+import torch
+tokenizer = AutoTokenizer.from_pretrained('CAMeL-Lab/arat5-coda')
+model = AutoModelForSeq2SeqLM.from_pretrained('CAMeL-Lab/arat5-coda')
+text = 'اثنين همبرقر و اثنين قهوة، لوسمحت. باخذهم تيك اواي.'
+inputs = tokenizer(text, return_tensors='pt')
+gen_kwargs = {'num_beams': 5, 'max_length': 200,
+              'num_return_sequences': 1,
+              'no_repeat_ngram_size': 0, 'early_stopping': False
+              }
+codafied_text = model.generate(**inputs, **gen_kwargs)
+codafied_text = tokenizer.batch_decode(codafied_text,
+                                       skip_special_tokens=True,
+                                       clean_up_tokenization_spaces=False)[0]
+print(codafied_text)
+"اثنين همبرقر واثنين قهوة، لوسمحت. بآخذهم تيك اوي."
+```
 ## Citation
 ```bibtex