Stop extracting the number of base functions from the reference elements in the local operator

Extracting it from the local basis should work just fine (and should actually be more correct). And it means we don't need to get the reference elements onto the GPU

Edited by Dr. Jorrit Fahlke