Munajjam

A Python library to synchronize Quran ayat with audio recitations.

Munajjam uses AI-powered speech recognition to automatically generate precise timestamps for each ayah in a Quran audio recording.

Installation

Clone the repository:

git clone https://github.com/Itqan-community/munajjam.git
cd munajjam/munajjam

Install the package:

pip install .

For faster transcription with faster-whisper:

pip install ".[faster-whisper]"

For development (editable install):

pip install -e ".[dev]"

Quick Start

1. Download a sample recitation

Download a sample audio file (Surah Al-Fatiha):

curl -L -o 001.mp3 "https://pub-9ee413c8af4041c6bd5223d08f5d0f0f.r2.dev/media/uploads/assets/11/recitations/001.mp3"

Note: Audio files should be named by surah number (e.g., 001.mp3, 002.mp3). Browse more recitations at cms.itqan.dev

2. Run the alignment

from munajjam.transcription import WhisperTranscriber
from munajjam.core import align
from munajjam.data import load_surah_ayahs

# Transcribe audio
with WhisperTranscriber() as transcriber:
    segments = transcriber.transcribe("001.mp3")

# Align to ayahs (uses auto strategy by default; override with "greedy", "dp", or "hybrid")
ayahs = load_surah_ayahs(1)
results = align("001.mp3", segments, ayahs)

# Get timestamps
for result in results:
    print(f"Ayah {result.ayah.ayah_number}: {result.start_time:.2f}s - {result.end_time:.2f}s")

3. Output

Ayah 1: 5.62s - 9.57s
Ayah 2: 10.51s - 14.72s
Ayah 3: 15.45s - 18.53s
Ayah 4: 19.21s - 22.54s
Ayah 5: 23.27s - 28.19s
Ayah 6: 29.00s - 33.07s
Ayah 7: 33.98s - 46.44s

Features

Whisper Transcription - Uses faster-whisper as default backend with Quran-tuned models
Four Alignment Strategies - Auto, Hybrid, DP, and Greedy
Arabic Text Normalization - Handles diacritics, hamzas, and character variations
Automatic Drift Correction - Multi-pass zone realignment for long recordings
Quality Metrics - Confidence scores for each aligned ayah
Phonetic Similarity - Arabic ASR confusion-aware similarity scoring
Word-level Precision - Uses per-word timestamps (when available) to improve drift recovery

Alignment Strategies

The default auto strategy works best for most cases. You can override it:

from munajjam.core import Aligner

# Auto (recommended) - picks the best strategy, full pipeline by default
aligner = Aligner("001.mp3")

# Hybrid - DP with greedy fallback (legacy)
aligner = Aligner("001.mp3", strategy="hybrid")

# Greedy - fastest, good for clean recordings
aligner = Aligner("001.mp3", strategy="greedy")

# DP - optimal alignment using dynamic programming
aligner = Aligner("001.mp3", strategy="dp")

results = aligner.align(segments, ayahs)

Examples

See the examples directory for more usage patterns:

01_basic_usage.py - Simple transcription and alignment
02_comparing_strategies.py - Compare alignment strategies
03_advanced_configuration.py - Custom settings and options
04_batch_processing.py - Process multiple files

Requirements

Python 3.10+
PyTorch 2.0+
FFmpeg (for audio processing)

Community

Acknowledgments

Tarteel AI for the Quran-specialized Whisper model

License

MIT License - see LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 96 Commits
.github/workflows		.github/workflows
docs		docs
examples		examples
munajjam		munajjam
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
pytest.ini		pytest.ini
test-requirements.txt		test-requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Munajjam

Installation

Quick Start

1. Download a sample recitation

2. Run the alignment

3. Output

Features

Alignment Strategies

Examples

Requirements

Community

Acknowledgments

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Munajjam

Installation

Quick Start

1. Download a sample recitation

2. Run the alignment

3. Output

Features

Alignment Strategies

Examples

Requirements

Community

Acknowledgments

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages