img read data type, convolution

Alex_Rose 2017. 10. 21. 12:32

data type explain range

uint8 부호없는 (unsigned) 8 bit, 1 byte integer 0~255

uint16 부호없는 16 bit, 2 byte integer 0~65535

Convolution feature

Convolution layer의 input, output data를 feature map이라고 부른다.

여기 정의에서 두 함수를 image, filter라고 생각하면, filter를 striding하면서 image와 multiplication (matrix multiplication인지 elementwise multiplication인지는 모르겠음. 그냥 곱셈이라고만 되어 있다.)한 output을 적분 (덧셈) 해나간다는 뜻이 된다.

- 여기에서 수행하는 적분(덧셈)은 단일 곱셈 - 누산 (fused multiply-add, FMA) 라고 한다.

- Matrix에서 reverse에 대응하는 것은 flipping

- filter = kernel

input size : (H, W)

filter size : (FH, FW)

output size : (OH, OW)

output size 공식 : (N - F) / stride + 1

pooling을 해주는 이유? 입력의 변화에 영향을 적게 받는다 (robust) 입력 데이터가 조금 변해도 pooling의 결과는 잘 변하지 않는다.

저작자표시 비영리 동일조건 (새창열림)