Граф коммитов

13 Коммитов

Автор SHA1 Сообщение Дата
Steve Snyder 726ac89b07 Bug 512865. qcms: Improve SSE2 performance, add SSE support. r=jrmuizel
This patch greatly improves the performance of QCMS transformations on x86 &
x86_64 systems.  Some notes:

0. On 32-bit x86 systems it does runtime selection between non-SIMD, SSE, and
SSE2 code paths.

1. On x86_64 systems the SSE2 code path is always taken.  The non-SIMD and SSE
code paths are left intact, but contemporary versions of the GCC and MSVC
compilers will see that they cannot be reached and optimize them away.

2. The execution of the SSE2 code path is reduced by 67%, relative to the
original Intel/Microsoft formatted ASM code.  The relative performance is seen
on a Pentium4 (Northwood) 2.4GHz CPU with DDR1 RAM.

3. The SSE code path provides a 80% reduction in execution time, relative to
the non-SIMD code path.  The relative performance is seen on a Pentium3
(Coppermine) 1.26GHz CPU with SDRAM.

4. The code has been split out into separate files so that it can be built
with different cflags (-msse, and -msse2) when using gcc.

5. Try to land again, this time with __attribute__((__force_align_arg_pointer__))
to avoid crashes on linux.
2009-10-01 14:19:41 -04:00
Jeff Muizelaar 3f907e258a Backed out changeset cb4f078cc8cb (bug 512865)
Was causing crashes on the leak test box.
2009-09-24 23:35:53 -04:00
Steve Snyder 7e089e6fae Bug 512865. qcms: Improve SSE2 performance, add SSE support. r=jrmuizel
This patch greatly improves the performance of QCMS transformations on x86 &
x86_64 systems.  Some notes:

0. On 32-bit x86 systems it does runtime selection between non-SIMD, SSE, and
SSE2 code paths.

1. On x86_64 systems the SSE2 code path is always taken.  The non-SIMD and SSE
code paths are left intact, but contemporary versions of the GCC and MSVC
compilers will see that they cannot be reached and optimize them away.

2. The execution of the SSE2 code path is reduced by 67%, relative to the
original Intel/Microsoft formatted ASM code.  The relative performance is seen
on a Pentium4 (Northwood) 2.4GHz CPU with DDR1 RAM.

3. The SSE code path provides a 80% reduction in execution time, relative to
the non-SIMD code path.  The relative performance is seen on a Pentium3
(Coppermine) 1.26GHz CPU with SDRAM.

4. The code has been split out into separate files so that it can be built
with different cflags (-msse, and -msse2) when using gcc.
2009-09-24 11:58:24 -04:00
timeless@mozdev.org aa0a779122 Bug 491195 fix warnings in qcms
mostly comparisons between unsigned and signed
also an incomplete initializer
and a warning about using C++ style variables
r=jrmuizel
2009-07-22 17:31:03 -05:00
Jeff Muizelaar 99d30c3bf9 Bug 497363. qcms: Add reference to the rationale for not correcting alpha. r=joedrew 2009-06-12 14:38:32 -04:00
Jeff Muizelaar 341c2ba4c7 Bug 497363. qcms: Use a minimum of 256 entries when calling invert_lut. r=joedrew
Makes the number of output entries produced by invert_lut() a parameter and
changes all callers to use a minimum of 256 entries when computing the inverse.
2009-06-12 14:38:30 -04:00
Jeff Muizelaar a0d9e260b1 Bug 488955 - Fix linear gamma input tables, used in iCCP PNG testsuite images. r=joe 2009-04-22 13:49:09 -04:00
Jeff Muizelaar dcd15d3bce Bug 489133 - Handle bad cHRM chunks in PNGs. r=joe 2009-04-21 22:21:48 -04:00
Jeff Muizelaar 780d2ffe8c Bug 489133 - Fix handling of bad whitepoints in PNGs. r=joe 2009-04-21 22:21:13 -04:00
Makoto Kato 784aeab18f Bug 487900 - Enable SSE2 code for x86_64 in qcms. r=jmuizelaar 2009-04-15 22:14:03 +09:00
Jeff Muizelaar 52c97edf1e Bug 481926 - Rewrite color management component sr=vlad, r=ted, r=joedrew (\o/)
Replaces lcms with qcms
2009-04-07 12:02:11 -04:00
Joe Drew db7246feb6 Backed out changeset 6f3c2171bbb2:
Bug 481926 - Rewrite color management component. r=joe,ted sr=vlad
2009-04-03 16:27:27 -04:00
Jeff Muizelaar e823d73623 Bug 481926 - Rewrite color management component. r=joe,ted sr=vlad 2009-04-03 15:09:56 -04:00