libavcodec/x86/dsputil_mmx.c · 620e76f99e0a86afb500dface8d1fc79946ccc3a · videolan / ffmpeg-mt

refactor and optimize scalarproduct · 620e76f9

lorenm authored Dec 05, 2009

29-105% faster apply_filter, 6-90% faster ape decoding on core2
(Any x86 other than core2 probably gets much less, since this is mostly due to ssse3 cachesplit avoidance and I haven't written the full gamut of other cachesplit modes.)
9-123% faster ape decoding on G4.



git-svn-id: file:///var/local/repositories/ffmpeg/trunk@20739 9553f0bf-9b14-0410-a0b8-cfaf0461ba5b

620e76f9

dsputil_mmx.c 124 KB

Replace dsputil_mmx.c