pytorch mini batch size

2019.12.19 06:12

WHRIA Views: 93

https://stackoverflow.com/questions/52518324/how-to-compensate-if-i-cant-do-a-large-batch-size-in-neural-network/52523847

In PyTorch, the backward step (calling loss.backward() or similar) accumulates gradients in place. If you call loss.backward() multiple times, the previously calculated gradients are not replaced; instead, the new gradients are added to the existing ones. That is why, in PyTorch, it is usually necessary to explicitly zero the gradients between minibatches (by calling optimiser.zero_grad() or similar).
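The accumulation behaviour can be observed directly on a single tensor. The following is a minimal sketch, not part of the original answer; the names w and loss_fn are illustrative, and the manual w.grad.zero_() call stands in for the role that optimiser.zero_grad() plays across all parameters.

```python
import torch

# A single trainable tensor; the names here are illustrative assumptions.
w = torch.tensor([1.0, 2.0], requires_grad=True)

def loss_fn(w):
    return (w ** 2).sum()    # d(loss)/dw = 2 * w

loss_fn(w).backward()
first = w.grad.clone()       # gradient after one backward pass

loss_fn(w).backward()        # second backward pass without zeroing in between
second = w.grad.clone()      # holds the sum of both passes

w.grad.zero_()               # manually reset the accumulated gradient
after_zero = w.grad.clone()

print(first, second, after_zero)
```

Here second is twice first, because the second backward pass added to the stored gradient rather than overwriting it.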

If your batch size is limited, you can simulate a larger batch size by breaking a large batch up into smaller pieces, and only calling optimiser.step() to update the model parameters after all the pieces have been processed.

For example, suppose you are only able to do batches of size 64, but you wish to simulate a batch size of 128. If the original training loop looks like:

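optimiser.zero_grad()     # clear gradients left over from the previous step
loss = model(batch_data)  # batch_data is a batch of size 128; model is taken to return the loss
loss.backward()           # compute gradients for the whole batch
optimiser.step()          # update the model parameters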

then you could change this to:

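optimiser.zero_grad()

# split the batch of 128 into two pieces of 64
smaller_batches = batch_data[:64], batch_data[64:128]
for batch in smaller_batches:
    loss = model(batch) / 2   # rescale so the accumulated gradients average rather than sum
    loss.backward()           # adds this piece's gradients to those already stored

optimiser.step()              # single update using the accumulated gradients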

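and the updates to the model parameters would be the same in each case (apart from small floating-point differences). Note that you have to rescale the loss to make the updates match: assuming the loss is averaged over the batch, the loss for each of the two pieces is divided by 2, so that the accumulated gradient equals the gradient of the mean over all 128 samples.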
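To check the equivalence numerically, the sketch below runs one full-batch step and one accumulated step from identical starting weights and compares the resulting parameters. Everything in it is an assumption made for illustration: the toy nn.Linear model, the random data, the MSE loss with its default mean reduction, and plain SGD. Unlike the snippets above, the model here returns predictions and a separate criterion computes the loss.

```python
import copy
import torch
from torch import nn

torch.manual_seed(0)

# Hypothetical toy setup: a linear model and random data standing in for a real task.
net_full = nn.Linear(10, 1)
net_accum = copy.deepcopy(net_full)      # identical starting weights
criterion = nn.MSELoss()                 # default reduction is the mean over the batch
inputs = torch.randn(128, 10)
targets = torch.randn(128, 1)

# One step on the full batch of 128.
opt_full = torch.optim.SGD(net_full.parameters(), lr=0.1)
opt_full.zero_grad()
criterion(net_full(inputs), targets).backward()
opt_full.step()

# The same step, accumulated over two pieces of 64.
opt_accum = torch.optim.SGD(net_accum.parameters(), lr=0.1)
opt_accum.zero_grad()
num_pieces = 2
for x, y in zip(inputs.chunk(num_pieces), targets.chunk(num_pieces)):
    loss = criterion(net_accum(x), y) / num_pieces  # rescale each per-piece mean
    loss.backward()                                 # gradients add up in .grad
opt_accum.step()

# Largest absolute difference between corresponding parameters.
max_diff = max(
    (p - q).abs().max().item()
    for p, q in zip(net_full.parameters(), net_accum.parameters())
)
print(f"largest parameter difference: {max_diff:.2e}")
```

The match relies on the pieces being of equal size and on the model having no layers whose output depends on the composition of the batch, such as batch normalisation; with such layers the two procedures generally give different updates.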
