Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
OpenMOSS-Team 's Collections
MOVA
ABC-Bench
FutureOmni
Game-RL
MOSS Transcribe Diarize
FRoM-W1
DiRL
RoboOmni
MOSS-Speech
MOSS-TTSD
MOSS Embodied Planner
Low Rank Sparse Attention
MHA2MLA-refactor
MHA2MLA
MOSS

MOSS Transcribe Diarize

updated about 1 month ago

A unified multimodal large language model for end-to-end speaker-attributed, time-stamped transcription.

Upvote
3

  • MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

    Paper • 2601.01554 • Published Jan 4 • 57

  • Running
    Featured
    50

    MOSS Transcribe Diarize

    🏢
    50

    Transcribe audio/video with speaker identification

Upvote
3
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs