ref: 9e7b09bc5d45e309c0deeb1c95981ccb7b857876
parent: 928ff03889dadc3f63883553b443c08e625b4885
author: Jingning Han <jingning@google.com>
date: Fri May 2 12:29:08 EDT 2014
SSSE3 8x8 inverse 2D-DCT with first 10 coeffs non-zero This commit enables ssse3 assembly implementation of the 8x8 inverse 2D-DCT with only first 10 coefficients non-zero. The average runtime for this unit goes down from 198 cycles to 129 cycles (34.8% faster). Change-Id: Ie7fa4386f6d3a2fe0d47a2eb26fc2a6bbc592ac7