by Vanessa McHale | Apple
One can speed up arcsine by evaluating a polynomial approximation with Horner's method and then applying bit-twiddling.
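As a rough illustration of the Horner's method step only, here is a minimal Python sketch. It assumes a truncated Taylor series for arcsine; the names `horner`, `ASIN_COEFFS`, and `asin_poly` are hypothetical, and the coefficients are chosen for illustration rather than taken from Apple's implementation. The bit-twiddling step is not shown.

```python
import math

def horner(coeffs, x):
    """Evaluate coeffs[0] + coeffs[1]*x + coeffs[2]*x**2 + ... with Horner's method."""
    acc = 0.0
    # Work from the highest-degree coefficient down: one multiply and one add per term.
    for c in reversed(coeffs):
        acc = acc * x + c
    return acc

# Assumption: the first five Taylor coefficients of asin(x)/x in powers of x**2.
# A tuned implementation would more likely use minimax coefficients.
ASIN_COEFFS = [1.0, 1 / 6, 3 / 40, 5 / 112, 35 / 1152]

def asin_poly(x):
    """Approximate asin(x) for small |x| as x * P(x**2)."""
    return x * horner(ASIN_COEFFS, x * x)

if __name__ == "__main__":
    for x in (0.0, 0.1, 0.5):
        print(x, asin_poly(x), math.asin(x))
```

Horner's method rewrites the polynomial as nested multiply-adds, so a degree-n polynomial costs n multiplications and n additions, with no explicit powers. The truncated series here is only accurate near zero; accuracy degrades as |x| approaches 1.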
Apple outperforms R in important cases.
Consider a softmax layer from Aditya Srinivas Menon's tutorial: