ASR-MMS-SALT Speech Recognition Model

Property	Value
License	CC-BY-NC-4.0
Base Model	facebook/mms-1b-all
Supported Languages	6 (Luganda, Acholi, Lugbara, Ateso, Runyankole, English)
Downloads	36,557

What is asr-mms-salt?

ASR-MMS-SALT is a specialized speech recognition model designed specifically for Ugandan languages. Built upon Facebook's MMS-1B-ALL architecture, this model represents a significant advancement in African language processing, supporting six languages with unique code-switching capabilities.

Implementation Details

The model utilizes the wav2vec2 architecture and implements language-specific adapters for each supported language. It's built using PyTorch and includes both monolingual and code-switching variants for each language, making it highly versatile for real-world applications.

Dual adapter system for each language: pure language and English code-switching
Built on transformer architecture with wav2vec2 implementation
Trained on the SALT dataset specifically for Ugandan languages
Outputs unpunctuated, lowercase text

Core Capabilities

Accurate speech recognition in 6 Ugandan languages
Support for English code-switching scenarios
Efficient processing with CUDA support
Easy integration with the Transformers library
Specialized handling of regional accents and dialects

Frequently Asked Questions

Q: What makes this model unique?

The model's distinctive feature is its specialized support for Ugandan languages with dedicated adapters for both pure language and code-switching scenarios, making it particularly valuable for real-world applications in East Africa.

Q: What are the recommended use cases?

This model is ideal for applications requiring speech recognition in Ugandan languages, particularly in scenarios where speakers might switch between local languages and English. It's suitable for transcription services, voice assistants, and language documentation projects in East Africa.

asr-mms-salt