Transformer Neural Networks Derived from Scratch