How to use bbhattar/flan_t5_xl_cnn_dailymail with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tokenizer = AutoTokenizer.from_pretrained("bbhattar/flan_t5_xl_cnn_dailymail")
model = AutoModelForSeq2SeqLM.from_pretrained("bbhattar/flan_t5_xl_cnn_dailymail")

This model is a fine-tuned version of google/flan-t5-xl on the cnn_dailymail dataset. Its results on the evaluation set after each training epoch are listed in the table below.
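Once the tokenizer and model are loaded, generating a summary could look roughly like the sketch below. Several details are assumptions rather than documented settings of this model: the `summarize: ` prefix (a common T5 convention), the 512-token input limit, the beam width, and the 128-token output cap (chosen only because the reported generation lengths are around 94 to 102 tokens). The helper names `build_prompt`, `load` and `summarize` are hypothetical.

```python
MODEL_ID = "bbhattar/flan_t5_xl_cnn_dailymail"


def build_prompt(article: str, prefix: str = "summarize: ") -> str:
    # Assumption: the fine-tuning run used the usual T5 summarization prefix.
    # Pass prefix="" if it did not.
    return prefix + article.strip()


def load(model_id: str = MODEL_ID):
    # Imported lazily so that build_prompt can be used without transformers installed.
    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
    return tokenizer, model


def summarize(article: str, tokenizer, model, max_new_tokens: int = 128) -> str:
    # Assumption: inputs are truncated to 512 tokens; the training value is not documented.
    inputs = tokenizer(
        build_prompt(article),
        return_tensors="pt",
        truncation=True,
        max_length=512,
    )
    # Beam search with 4 beams is an illustrative choice, not a documented setting.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, num_beams=4)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Typical use would be `tokenizer, model = load()` followed by `summarize(article_text, tokenizer, model)`. Note that google/flan-t5-xl has roughly 3 billion parameters, so loading the model needs a correspondingly large amount of memory.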
Training results at the end of each epoch:
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|---|---|---|---|---|---|---|---|---|
| 1.4352 | 1.0 | 2991 | 1.2645 | 43.8582 | 21.2227 | 30.7038 | 40.761 | 101.9968 |
| 1.3198 | 2.0 | 5982 | 1.2525 | 44.4594 | 21.8174 | 31.4304 | 41.4563 | 94.0733 |
| 1.2151 | 3.0 | 8973 | 1.2648 | 45.1318 | 22.2773 | 31.9084 | 42.0558 | 94.2332 |
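The table shows that the epoch with the lowest validation loss is not the one with the highest ROUGE scores. A small sketch that makes this explicit, using the values copied from the table (the helper name `best_epoch` and the dictionary keys are hypothetical):

```python
# Per-epoch metrics copied from the training results table above.
RESULTS = [
    {"epoch": 1, "step": 2991, "train_loss": 1.4352, "val_loss": 1.2645,
     "rouge1": 43.8582, "rouge2": 21.2227, "rougeL": 30.7038,
     "rougeLsum": 40.761, "gen_len": 101.9968},
    {"epoch": 2, "step": 5982, "train_loss": 1.3198, "val_loss": 1.2525,
     "rouge1": 44.4594, "rouge2": 21.8174, "rougeL": 31.4304,
     "rougeLsum": 41.4563, "gen_len": 94.0733},
    {"epoch": 3, "step": 8973, "train_loss": 1.2151, "val_loss": 1.2648,
     "rouge1": 45.1318, "rouge2": 22.2773, "rougeL": 31.9084,
     "rougeLsum": 42.0558, "gen_len": 94.2332},
]


def best_epoch(results, metric, higher_is_better):
    # Return the epoch number whose value for `metric` is best.
    pick = max if higher_is_better else min
    return pick(results, key=lambda row: row[metric])["epoch"]


by_loss = best_epoch(RESULTS, "val_loss", higher_is_better=False)
by_rouge = best_epoch(RESULTS, "rougeLsum", higher_is_better=True)

# Optimizer steps between consecutive logged epochs.
steps_per_epoch = RESULTS[1]["step"] - RESULTS[0]["step"]

print(by_loss, by_rouge, steps_per_epoch)  # 2 3 2991
```

Validation loss is lowest after epoch 2, while every ROUGE metric keeps improving through epoch 3, so which checkpoint counts as "best" depends on the selection criterion.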