If padding is non-zero, then the input is implicitly zero-padded on both sides for padding number of points. dilation controls the spacing between the kernel points. It is harder to describe, but this link has a nice visualization of what dilation does.
The parameters kernel_size, stride, padding, dilation can either be:
a single int -- in which case the same value is used for the height and width dimension
a tuple of two ints -- in which case, the first int is used for the height dimension, and the second int for the width dimension
if(torch_is_installed()){# pool of square window of size=3, stride=2m <- nn_max_pool2d(3, stride =2)# pool of non-square windowm <- nn_max_pool2d(c(3,2), stride = c(2,1))input <- torch_randn(20,16,50,32)output <- m(input)}