Skip to content

pytorch mini batch size

2019.12.19 06:12

WHRIA 조회 수:82

https://stackoverflow.com/questions/52518324/how-to-compensate-if-i-cant-do-a-large-batch-size-in-neural-network/52523847

 

 

4

In pytorch, when you perform the backward step (calling loss.backward() or similar) the gradients are accumulated in-place. This means that if you call loss.backward() multiple times, the previously calculated gradients are not replaced, but in stead the new gradients get added on to the previous ones. That is why, when using pytorch, it is usually necessary to explicitly zero the gradients between minibatches (by calling optimiser.zero_grad() or similar).

If your batch size is limited, you can simulate a larger batch size by breaking a large batch up into smaller pieces, and only calling optimiser.step() to update the model parameters after all the pieces have been processed.

For example, suppose you are only able to do batches of size 64, but you wish to simulate a batch size of 128. If the original training loop looks like:

optimiser.zero_grad()
loss = model(batch_data) # batch_data is a batch of size 128
loss.backward()
optimiser.step()

then you could change this to:

optimiser.zero_grad()

smaller_batches = batch_data[:64], batch_data[64:128]
for batch in smaller_batches:
    loss = model(batch) / 2
    loss.backward()

optimiser.step()

and the updates to the model parameters would be the same in each case (apart maybe from some small numerical error). Note that you have to rescale the loss to make the update the same.

번호 제목 글쓴이 날짜 조회 수
1686 Mediainfo 를 이용해서 날짜 알아내기 WHRIA 2012.12.20 10405
1685 마음이 편해지는 그림 file WHRIA 2007.02.18 10356
1684 마음의 결정이 끝났습니다. WHRIA 2009.11.20 10334
1683 애드센스 수입금 - 48만원 WHRIA 2010.12.28 10277
1682 MedicalPhoto moved to http://medicalphoto.org WHRIA 2008.06.20 10104
1681 rAthena [1] file WHRIA 2016.06.26 10074
1680 어려운 결정 WHRIA 2008.10.11 10019
1679 피얼룩 지우기 WHRIA 2012.05.26 9956
1678 Trader file WHRIA 2009.11.17 9882
1677 승석이 홈페이지입니다. 한승석 2000.05.29 9874
1676 피부과 홈페이지 제작중 WHRIA 2009.11.11 9712
1675 일본으로 workshop 초청이 왔는데 못가게 되었다. [1] WHRIA 2009.11.23 9632
1674 간판 제작 업체 WHRIA 2009.10.31 9558
1673 인플레이션인가 디플레이션인가 WHRIA 2009.06.21 9546
1672 돌사진 WHRIA 2011.03.21 9446

Powered by Xpress Engine / Designed by Sketchbook

sketchbook5, 스케치북5

sketchbook5, 스케치북5

나눔글꼴 설치 안내


이 PC에는 나눔글꼴이 설치되어 있지 않습니다.

이 사이트를 나눔글꼴로 보기 위해서는
나눔글꼴을 설치해야 합니다.

설치 취소