Linked by Thom Holwerda on Wed 12th Mar 2008 17:54 UTC, submitted by RJop
General Development The GNU project has released version 4.3.0 of the GNU Compiler Collection. "The GNU project and the GCC developers are pleased to announce the release of GCC 4.3.0. This release is a major release, containing new features (as well as many other improvements) relative to GCC 4.2.x."
Permalink for comment 304774
To read all comments associated with this story, please click here.
by tyrione on Wed 12th Mar 2008 20:15 UTC in reply to "SSE4"
Member since:

I read somewhere else that this release included SSE4 support.
Here is an article where they asked the GCC dev's (along with xvid, and ffmpeg devs) what benefit they will see.

New Targets and Target Specific Improvements

* Tuning for Intel Core 2 processors is available via -mtune=core2 and -march=core2.
* Tuning for AMD Geode processors is available via -mtune=geode and -march=geode.
* Code generation of block move (memcpy) and block set (memset) was rewritten. GCC can now pick the best algorithm (loop, unrolled loop, instruction with rep prefix or a library call) based on the size of the block being copied and the CPU being optimized for. A new option -minline-stringops-dynamically has been added. With this option string operations of unknown size are expanded such that small blocks are copied by in-line code, while for large blocks a library call is used. This results in faster code than -minline-all-stringops when the library implementation is capable of using cache hierarchy hints. The heuristic choosing the particular algorithm can be overwritten via -mstringop-strategy. Newly also memset of values different from 0 is inlined.
* GCC no longer places the cld instruction before string operations. Both i386 and x86-64 ABI documents mandate the direction flag to be clear at the entry of a function. It is now invalid to set the flag in asm statement without reseting it afterward.
* Support for SSSE3 built-in functions and code generation are available via -mssse3.
* Support for SSE4.1 built-in functions and code generation are available via -msse4.1.
* Support for SSE4.2 built-in functions and code generation are available via -msse4.2.
* Both SSE4.1 and SSE4.2 support can be enabled via -msse4.
* A new set of options -mpc32, -mpc64 and -mpc80 have been added to allow explicit control of x87 floating point precision.
* Support for __float128 (TFmode) IEEE quad type and corresponding TCmode IEEE complex quad type is available via the soft-fp library on x86_64 targets. This includes basic arithmetic operations (addition, subtraction, negation, multiplication and division) on __float128 real and TCmode complex values, the full set of IEEE comparisons between __float128 values, conversions to and from float, double and long double floating point types, as well as conversions to and from signed or unsigned integer, signed or unsigned long integer and signed or unsigned quad (TImode) integer types. Additionally, all operations generate the full set of IEEE exceptions and support the full set of IEEE rounding modes.
* GCC can now utilize the ACML library for vectorizing calls to a set of C99 functions on x86_64 if -mveclibabi=acml is specified and you link to an ACML ABI compatible library.

Reply Parent Score: 3