Username or EmailPassword
In addition non trivially parallel tasks kill the performance. In short if it's not a linear algorithm, the GPGPU will struggle to perform.
At least that is the fact in CUDA. AMD's Stream processing might be a bit different.
Add to that the overhead of transfering data from main RAM to GPU RAM(even if it's the physically the same thing).