Modalities
This page summarizes what is currently shipped in hama and what remains on the roadmap.
Available
Text → IPA (G2P): Available in Python, Node/Bun, and the browser. Returns IPA plus per-phoneme alignment metadata.
Audio → Phoneme (ASR): Available in Python, Node/Bun, and the browser. Accepts waveform input and returns collapsed phoneme output from
asr_waveform_fp16.onnx.
Coming soon
- IPA → Text
- Audio → Text
- Text → Embeddings
Runtime coverage
| Runtime | G2P | Waveform ASR |
|---|---|---|
| Browser | Yes | Yes |
| Node/Bun | Yes | Yes |
| Python | Yes | Yes |