GLU¶
- class torch.nn.GLU(dim=- 1)[source]¶
Applies the gated linear unit function.
\({GLU}(a, b)= a \otimes \sigma(b)\) where \(a\) is the first half of the input matrices and \(b\) is the second half.
- Parameters:
dim (int) – the dimension on which to split the input. Default: -1
- Shape:
Input: \((\ast_1, N, \ast_2)\) where * means, any number of additional dimensions
Output: \((\ast_1, M, \ast_2)\) where \(M=N/2\)
Examples:
>>> m = nn.GLU() >>> input = torch.randn(4, 2) >>> output = m(input)