Token BiLSTM-CRF baseline
Token-level weak-supervision model. Structural labels remain rule-derived; the learned layer covers semantic, syntactic, and pragmatic tags.
| القسم | القيمة | ملاحظة |
| Model | /Users/al-hmouz/Documents/Arabic_Tasreef/functional_syntax/models/generated_reference_token_bilstm_crf.pt | saved checkpoint |
| Train sentences | 6719 | split summary |
| Dev sentences | 840 | split summary |
| Test sentences | 840 | split summary |
| Train tokens | 26590 | token corpus |
| Dev tokens | 3352 | token corpus |
| Test tokens | 3332 | token corpus |
| Vocab size | 3328 | token vocabulary |
| Test semantic accuracy | 0.9214 | published baseline |
| Test syntactic accuracy | 0.9208 | published baseline |
| Test pragmatic accuracy | 0.9469 | published baseline |
| Test joint accuracy | 0.7738 | sequence-level exactness |
Best-state history
| Epoch | Train loss | Dev loss | Dev sem | Dev syn | Dev prag | Dev joint |
| 1 |
2.759 |
1.3413 |
0.8699 |
0.8699 |
0.9045 |
0.575 |
| 2 |
1.1177 |
1.0195 |
0.8965 |
0.8974 |
0.9266 |
0.6738 |
| 3 |
0.7277 |
0.9254 |
0.9081 |
0.9078 |
0.9382 |
0.7131 |
| 4 |
0.4847 |
0.9633 |
0.9126 |
0.9132 |
0.9382 |
0.7238 |
| 5 |
0.3237 |
0.9584 |
0.9147 |
0.915 |
0.9394 |
0.7429 |
| 6 |
0.2203 |
1.0079 |
0.9192 |
0.9186 |
0.9436 |
0.7512 |