pytorch mini batch size

2019.12.19 06:12

WHRIA 조회 수:82

https://stackoverflow.com/questions/52518324/how-to-compensate-if-i-cant-do-a-large-batch-size-in-neural-network/52523847

In pytorch, when you perform the backward step (calling loss.backward() or similar) the gradients are accumulated in-place. This means that if you call loss.backward() multiple times, the previously calculated gradients are not replaced, but in stead the new gradients get added on to the previous ones. That is why, when using pytorch, it is usually necessary to explicitly zero the gradients between minibatches (by calling optimiser.zero_grad() or similar).

If your batch size is limited, you can simulate a larger batch size by breaking a large batch up into smaller pieces, and only calling optimiser.step() to update the model parameters after all the pieces have been processed.

For example, suppose you are only able to do batches of size 64, but you wish to simulate a batch size of 128. If the original training loop looks like:

optimiser.zero_grad()
loss = model(batch_data) # batch_data is a batch of size 128
loss.backward()
optimiser.step()

then you could change this to:

optimiser.zero_grad()

smaller_batches = batch_data[:64], batch_data[64:128]
for batch in smaller_batches:
    loss = model(batch) / 2
    loss.backward()

optimiser.step()

and the updates to the model parameters would be the same in each case (apart maybe from some small numerical error). Note that you have to rescale the loss to make the update the same.

이 게시물을

번호	제목	글쓴이	날짜	조회 수
1716	이건 비밀글이얍	J	2004.04.23	2
1715	아저씨에게 [1]	^&^	2004.04.25	3
1714	아저씨~ [1]	^&^	2004.05.15	5
1713	다시 이곳으로...	WHRIA	2024.03.02	6
1712	모든 연구 자료를 정리. 이것으로 마무리 짓기로.	WHRIA	2024.03.03	15
1711	MedicalPhoto MSVC 2015 와 최신 boost 로...	WHRIA	2016.08.26	17
1710	Visual Studio 설치후 Excel 2002 (XP) 종료시 에러 문제	WHRIA	2015.11.20	21
1709	headless PC 를 위한 dummy plug in 을 구입해서 달다.	WHRIA	2015.11.22	21
1708	XP shutdown 시 강제종료 시키기	WHRIA	2016.08.16	21
1707	XE 에 tinymce 에디터를 달다.	WHRIA	2015.11.22	22
1706	USB 에뮬레이션 램디스트 imdisk	WHRIA	2016.09.02	27
1705	학술저널 받는법	WHRIA	2015.11.30	28
1704	imbalanced dataset	WHRIA	2018.12.26	28
1703	raid monitor	WHRIA	2019.12.15	31
1702	지급명세서	WHRIA	2015.10.14	32

첫 페이지 1 2 3 4 5 6 7 8 9 10 끝 페이지

쓰기...

태그

로그인

Whria World

pytorch mini batch size

댓글 0

나눔글꼴 설치 안내

이 PC에는 나눔글꼴이 설치되어 있지 않습니다.