Introduction to patrickvonplaten/wav2vec2_tiny_random
Published: 2026-05-17 01:30:33
Source: www.cxwl.com

Test model
To test this model, run the following code:
from datasets import load_dataset
from transformers import Wav2Vec2ForCTC
import torchaudio
import torch

ds = load_dataset("patrickvonplaten/librispeech_asr_dummy", "clean", split="validation")

model = Wav2Vec2ForCTC.from_pretrained("patrickvonplaten/wav2vec2_tiny_random")

def load_audio(batch):
    batch["samples"], _ = torchaudio.load(batch["file"])
    return batch

ds = ds.map(load_audio)

input_values = torch.nn.utils.rnn.pad_sequence([torch.tensor(x[0]) for x in ds["samples"][:10]], batch_first=True)

# forward
logits = model(input_values).logits
pred_ids = torch.argmax(logits, dim=-1)

# dummy loss
dummy_labels = pred_ids.clone()
dummy_labels[dummy_labels == model.config.pad_token_id] = 1  # can't have CTC blank token in label
dummy_labels = dummy_labels[:, -(dummy_labels.shape[1] // 4):]  # make sure labels are shorter to avoid "inf" loss (can still happen though...)
loss = model(input_values, labels=dummy_labels).loss
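The two comments in the dummy-loss step (no blank token in the labels, and labels shorter than the output) can be checked without downloading anything. The following is a minimal plain-PyTorch sketch, not the model's own loss code: random logits stand in for model(input_values).logits, the sizes and the blank index 0 are made-up values, and the helpers make_dummy_labels and ctc are hypothetical names introduced only for this illustration.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical sizes for illustration only (not read from the model config).
batch_size, num_frames, vocab_size, blank_id = 2, 20, 8, 0

# Random stand-in for model(input_values).logits: (batch, frames, vocab).
logits = torch.randn(batch_size, num_frames, vocab_size)
pred_ids = torch.argmax(logits, dim=-1)


def make_dummy_labels(pred_ids, blank_id, replacement=1):
    """Mirror the dummy-label steps: drop blanks, keep the last quarter."""
    labels = pred_ids.clone()
    labels[labels == blank_id] = replacement      # targets must not contain the blank
    return labels[:, -(labels.shape[1] // 4):]    # targets much shorter than the input


def ctc(logits, labels, blank_id):
    """Summed CTC loss, assuming every sequence uses its full length."""
    log_probs = F.log_softmax(logits, dim=-1).transpose(0, 1)  # (frames, batch, vocab)
    input_lengths = torch.full((logits.shape[0],), logits.shape[1], dtype=torch.long)
    target_lengths = torch.full((labels.shape[0],), labels.shape[1], dtype=torch.long)
    return F.ctc_loss(log_probs, labels, input_lengths, target_lengths,
                      blank=blank_id, reduction="sum")


# Short targets: an alignment exists, so the loss is finite.
short_labels = make_dummy_labels(pred_ids, blank_id)
short_loss = ctc(logits, short_labels, blank_id)

# Targets as long as the input, all the same token: CTC needs a blank between
# repeated tokens, so no alignment fits in num_frames and the loss is infinite.
long_labels = torch.ones(batch_size, num_frames, dtype=torch.long)
long_loss = ctc(logits, long_labels, blank_id)

print(short_labels.shape, torch.isfinite(short_loss).item(), torch.isinf(long_loss).item())
```

Keeping only a quarter of the frames leaves room for the blanks that CTC must place between repeated tokens, which is why the shorter labels in the snippet above usually avoid an infinite loss.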

