Text-to-Speech
Hakka Chinese

Model Card for f5-tts-hakka-finetune-v3

Model Details

F5-TTS finetune on Taiwanese Hakka for TTS without word dataset and samples have in hanzi, using pinyin as input.
g2p from this repo.

Training Details

  • learning rate: 0.00001
  • batch size per gpu: 20169
  • batch size type: frame
  • max samples: 64
  • grad accumulation steps: 1
  • max grad norm: 1
  • epochs: 861 (incomplete)
  • num warmup updates: 13362

Model Sources

Uses

please refer source repo

Demo

https://huggingface.co/spaces/formospeech/taiwanese-hakka-f5-tts

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for formospeech/f5-tts-hita-finetune-v1

Base model

SWivid/F5-TTS
Finetuned
(69)
this model

Datasets used to train formospeech/f5-tts-hita-finetune-v1