input: input tensor of shape (\mbox{minibatch}, \mbox{in\_channels}, iH, iW)
weight: filters of shape (\mbox{in\_channels}, \frac{\mbox{out\_channels}}{\mbox{groups}}, kH, kW)
bias: optional bias of shape (\mbox{out\_channels}). Default: NULL
stride: the stride of the convolving kernel. Can be a single number or a tuple (sH, sW). Default: 1
padding: controls the implicit zero-padding: dilation * (kernel_size - 1) - padding zeros will be added to both sides of each dimension in the input. Can be a single number or a tuple (padH, padW). Default: 0
output_padding: additional size added to one side of each dimension in the output shape. Can be a single number or a tuple (out_padH, out_padW). Default: 0
groups: split input into groups; \mbox{in\_channels} should be divisible by the number of groups. Default: 1
dilation: the spacing between kernel elements. Can be a single number or a tuple (dH, dW). Default: 1
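
The arguments above jointly determine the output shape. As a minimal sketch in plain Python (no tensor library needed), the helper below applies the standard transposed-convolution size relation, oH = (iH - 1) * sH - 2 * padH + dH * (kH - 1) + out_padH + 1, and likewise for the width. The function name conv_transpose2d_output_shape and the helper _pair are hypothetical and only for illustration; they are not part of the documented API.

```python
def _pair(value):
    """Expand a single number to a (height, width) tuple; pass tuples through."""
    if isinstance(value, int):
        return (value, value)
    return tuple(value)


def conv_transpose2d_output_shape(input_shape, weight_shape, stride=1,
                                  padding=0, output_padding=0, groups=1,
                                  dilation=1):
    """Return (minibatch, out_channels, oH, oW) for a 2D transposed convolution.

    Illustrative helper only: it mirrors the argument names documented above
    but does not perform the convolution itself.
    """
    minibatch, in_channels, iH, iW = input_shape
    weight_in, out_per_group, kH, kW = weight_shape

    # The first weight dimension must match the input channels.
    if weight_in != in_channels:
        raise ValueError("weight_shape[0] must equal in_channels")
    # in_channels should be divisible by the number of groups.
    if in_channels % groups != 0:
        raise ValueError("in_channels must be divisible by groups")

    sH, sW = _pair(stride)
    padH, padW = _pair(padding)
    out_padH, out_padW = _pair(output_padding)
    dH, dW = _pair(dilation)

    # Standard transposed-convolution output size, per spatial dimension.
    oH = (iH - 1) * sH - 2 * padH + dH * (kH - 1) + out_padH + 1
    oW = (iW - 1) * sW - 2 * padW + dW * (kW - 1) + out_padW + 1

    # The weight stores out_channels / groups filters per group.
    out_channels = out_per_group * groups
    return (minibatch, out_channels, oH, oW)


# A 5x5 input upsampled with stride 2, padding 1, and output_padding 1.
shape = conv_transpose2d_output_shape(
    (1, 4, 5, 5), (4, 8, 3, 3), stride=2, padding=1, output_padding=1
)
print(shape)  # (1, 8, 10, 10)
```

With stride 2, a 3x3 kernel, padding 1, and output_padding 1, the spatial size exactly doubles, which is a common configuration for upsampling layers. Note that output_padding only changes the computed output size; it is added to one side of each dimension, as described above.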