You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I had also to fix modules.pyget_mask_from_lengths function to make it work and stop warning. The function became:
def get_mask_from_lengths(memory, memory_lengths):
"""Get mask tensor from list of length
Args:
memory: (batch, max_time, dim)
memory_lengths: array like
"""
mask = memory.data.new(memory.size(0), memory.size(1)).bool().zero_()
for idx, l in enumerate(memory_lengths):
mask[idx][:l] = 1
return mask^1
However I could not train the model due to "RuntimeError: Caught RuntimeError in pin memory thread for device 0." error
Hi!
I am trying to train a model with LJSpeech data using default preset but failling when using GPU in the latest PyTorch version.
For PyTorch 1.3:
I had to make some fixes to my drivers so running
torch.cuda.is_available()
in my Python CLI gives me True. I used the following setup with conda:
conda install pytorch torchvision cudatoolkit=10.0 -c pytorch
I had also to fix
modules.py
get_mask_from_lengths function to make it work and stop warning. The function became:However I could not train the model due to "RuntimeError: Caught RuntimeError in pin memory thread for device 0." error
For PyTorch 1.2:
I used
conda install pytorch==1.2.0 torchvision==0.4.0 cudatoolkit=10.0 -c pytorch
But could not train due to "RuntimeError: reduce failed to synchronize: device-side assert triggered"
For PyTorch 1.1:
I did the setup using
conda install pytorch==1.1.0 torchvision==0.3.0 cudatoolkit=10.0 -c pytorch
So far the training is running...
Cheers!
The text was updated successfully, but these errors were encountered: