torch.nn.functional.gelu¶
- torch.nn.functional.gelu(input, approximate='none') Tensor ¶
When the approximate argument is ‘none’, it applies element-wise the function \(\text{GELU}(x) = x * \Phi(x)\)
where \(\Phi(x)\) is the Cumulative Distribution Function for Gaussian Distribution.
When the approximate argument is ‘tanh’, Gelu is estimated with
\[\text{GELU}(x) = 0.5 * x * (1 + \text{Tanh}(\sqrt{2 / \pi} * (x + 0.044715 * x^3))) \]