Multi-Intent Detection (MID) Model

This model was fine-tuned for the task of Multi-Intent Detection (MID), a type of multi-label classification where each input can have multiple labels assigned. The dataset used for fine-tuning is specifically designed to simplify the MID task, with the number of labels limited to two per instance.

Model Details

  • Base Model: DeBERTa-v3-base
  • Task: Multi-label classification
  • Number of Labels: 2
  • Fine-tuning Framework: Hugging Face Transformers

Training Configuration

  • Training Arguments:
    • Learning Rate: 2e-5
    • Batch Size (Train): 16
    • Batch Size (Eval): 16
    • Gradient Accumulation Steps: 2
    • Number of Epochs: 8
    • Weight Decay: 0.01
    • Warmup Ratio: 10%
    • Learning Rate Scheduler Type: Cosine
    • Mixed Precision Training: Enabled (FP16)
    • Logging Steps: 50

Performance Metrics

Epoch Training Loss Validation Loss Precision Recall F1 Score Accuracy
0 0.069100 0.069115 0.000000 0.000000 0.000000 0.000000
2 0.024100 0.022929 0.952334 0.316920 0.475576 0.078652
4 0.009200 0.010799 0.959768 0.819894 0.884334 0.653668
6 0.006300 0.008773 0.963243 0.883344 0.921565 0.770654
7 0.006200 0.008707 0.961635 0.886319 0.922442 0.775281

Final Evaluation Metrics (Epoch 8):

  • Validation Loss: 0.0087
  • Precision: 0.9616
  • Recall: 0.8863
  • F1 Score: 0.9224
  • Accuracy: 0.7753

Limitations

  • Simplified Multi-Label Setting: This model assumes a fixed number of two labels per instance, which may not generalize to datasets with more complex multi-label settings.
  • Performance on Unseen Data: The model's performance may degrade if applied to data distributions significantly different from the training dataset.
Downloads last month
16
Safetensors
Model size
0.2B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support