torch
torchaudio
librosa
numpy
huggingface_hub
einops
scipy
tokenizers
soundfile
s3tokenizer
conformer
safetensors
transformers
diffusers