speechbrain.lobes.models.huggingface_transformers

High-level processing blocks.

This subpackage gathers higher-level blocks, or “lobes”, for HuggingFace Transformers.

speechbrain.lobes.models.huggingface_transformers.discrete_ssl

This lobe enables the integration of pretrained discrete SSL models (hubert, wavlm, wav2vec) for extracting semantic tokens from the outputs of SSL layers.

speechbrain.lobes.models.huggingface_transformers.encodec

This lobe enables the integration of huggingface pretrained EnCodec.

speechbrain.lobes.models.huggingface_transformers.gpt

This lobe enables the integration of the huggingface pretrained GPT2LMHeadModel.

speechbrain.lobes.models.huggingface_transformers.hubert

This lobe enables the integration of huggingface pretrained hubert models.

speechbrain.lobes.models.huggingface_transformers.huggingface

This lobe is the interface for huggingface transformers models. It enables loading the config and model via AutoConfig and AutoModel.

speechbrain.lobes.models.huggingface_transformers.labse

This lobe enables the integration of huggingface pretrained LaBSE models.

speechbrain.lobes.models.huggingface_transformers.mbart

This lobe enables the integration of huggingface pretrained mBART models.

speechbrain.lobes.models.huggingface_transformers.nllb

This lobe enables the integration of huggingface pretrained NLLB models.

speechbrain.lobes.models.huggingface_transformers.textencoder

This lobe enables the integration of generic huggingface pretrained text encoders (e.g. BERT).

speechbrain.lobes.models.huggingface_transformers.wav2vec2

This lobe enables the integration of huggingface pretrained wav2vec2 models.

speechbrain.lobes.models.huggingface_transformers.wavlm

This lobe enables the integration of huggingface pretrained wavlm models.

speechbrain.lobes.models.huggingface_transformers.weighted_ssl

This lobe enables the use of weighted-sum representations computed from the different layers of a huggingface pretrained SSL encoder.

speechbrain.lobes.models.huggingface_transformers.whisper

This lobe enables the integration of huggingface pretrained whisper models.
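
The submodules listed above can be addressed by their fully qualified names. The following is a minimal sketch, not part of SpeechBrain itself: the helper names `qualified_names` and `is_available` are hypothetical, and the snippet only uses the standard library to build the dotted names and to check whether a module can be located in the current environment (it does not load any pretrained model).

```python
import importlib.util

# Package and submodule names as listed on this page.
PACKAGE = "speechbrain.lobes.models.huggingface_transformers"
SUBMODULES = [
    "discrete_ssl", "encodec", "gpt", "hubert", "huggingface",
    "labse", "mbart", "nllb", "textencoder", "wav2vec2",
    "wavlm", "weighted_ssl", "whisper",
]


def qualified_names(package=PACKAGE, submodules=SUBMODULES):
    """Return the fully qualified dotted name of each submodule (hypothetical helper)."""
    return [f"{package}.{name}" for name in submodules]


def is_available(module_name):
    """Return True if the module can be located (hypothetical helper).

    find_spec imports parent packages for dotted names, so a missing
    parent (or a parent that fails to import) is reported as unavailable.
    """
    try:
        return importlib.util.find_spec(module_name) is not None
    except ImportError:
        return False


if __name__ == "__main__":
    for name in qualified_names():
        status = "available" if is_available(name) else "not installed"
        print(f"{name}: {status}")
```

Running the script prints one line per lobe. Several of these modules depend on the optional `transformers` package, so whether they can actually be imported and used depends on the local installation.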

Summary

Reference