To view parent comment, click here.
To read all comments associated with this story, please click here.
Little endian simplifies some assembly language trick, big endian simplifies other assembly language trick, I'm not sure that there is really a benefit of little endian for the assembly language programming here, but human read in big endian and this is a big difference.
OMG, I can't believe people are still beating that particular dead horse.
The differences between the two approaches are so small that the choice is really only a matter of taste, and that's of course why we ended up with the industry split down the middle on that one.
The supposed advantages of either approach are laughable compared to the huge disadvantage of having to deal with both of them in heterogenous settings.
While I agree that the difference of performance between both approaches are negligible (both have some algorithm where they are better but no particular advantage), the readability for human is a big difference though.
Agreed about your last point: I still don't understand why Intel went that way, where most other company used big endian: Motorola, IBM, etc..
Does SSE3 has a multiply-accumulate operation?
No, because x86 does not allow for three-operand instructions. There is a multiply-add instruction for two 16-bit integer vectors, but that's not the same.
Therefore multiply-accumulate requires separate multiply and add instructions. Whether that actually has a performance impact depends on what execution units there are and how the operations are scheduled. Anyone got experience with that?




Member since:
2005-07-06
Does SSE3 has a multiply-accumulate operation?
I remember that it was missing in previous version.
Little endian simplifies some assembly language trick, big endian simplifies other assembly language trick, I'm not sure that there is really a benefit of little endian for the assembly language programming here, but human read in big endian and this is a big difference.
What do you say that Thumb sucked? Because of the performance impact?
Anyway I've not said Thumb but Thumb2 where you can mix 32 and 16 bit instructions.