Smoother than ReLU, learnable slope like PReLU: Swishβ(x)=x⋅σ(βx) Sigmoid linear unit: SiLU=Swish1(x)=x⋅σ(x)