Skip to content

Google colab平台运行项目时出现共享内存不足 #1

@jsuyanyong

Description

@jsuyanyong

由于自己的笔记本无GPU加速,所以将此项目运行在Google colab平台上。colab提供免费的GPU,但是运行时发现 共享内存不足,不知道有没有其他同学遇到此问题?

2018-07-28 12:48:24 [INFO]: Start training epoch 1
ERROR: Unexpected bus error encountered in worker. This might be caused by insufficient shared memory (shm).
Traceback (most recent call last):
  File "classifier_train.py", line 65, in <module>
    trainer.train()
  File "/content/drive/pytorch_hand_classifier/utils/Trainer.py", line 104, in train
    self._train_one_epoch()
  File "/content/drive/pytorch_hand_classifier/utils/Trainer.py", line 133, in _train_one_epoch
    for step, (data, label) in enumerate(self.train_data):
  File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/dataloader.py", line 275, in __next__
    idx, batch = self._get_batch()
  File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/dataloader.py", line 254, in _get_batch
    return self.data_queue.get()
  File "/usr/lib/python3.6/multiprocessing/queues.py", line 335, in get
    res = self._reader.recv_bytes()
  File "/usr/lib/python3.6/multiprocessing/connection.py", line 216, in recv_bytes
    buf = self._recv_bytes(maxlength)
  File "/usr/lib/python3.6/multiprocessing/connection.py", line 407, in _recv_bytes
    buf = self._recv(4)
  File "/usr/lib/python3.6/multiprocessing/connection.py", line 379, in _recv
    chunk = read(handle, remaining)
  File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/dataloader.py", line 175, in handler
    _error_if_any_worker_fails()
RuntimeError: DataLoader worker (pid 2068) is killed by signal: Bus error.

Metadata

Metadata

Assignees

No one assigned

    Labels

    help wantedExtra attention is needed

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions