hama on-device NLP

Modalities

This page summarizes what is currently shipped in hama and what remains on the roadmap.

Available

  • Text → IPA (G2P): Available in Python, Node/Bun, and the browser. Returns IPA plus per-phoneme alignment metadata.

  • Audio → Phoneme (ASR): Available in Python, Node/Bun, and the browser. Accepts waveform input and returns collapsed phoneme output from asr_waveform_fp16.onnx.

Coming soon

  • IPA → Text
  • Audio → Text
  • Text → Embeddings

Runtime coverage

RuntimeG2PWaveform ASR
BrowserYesYes
Node/BunYesYes
PythonYesYes