瀏覽單個文章
yhnui
Junior Member
 

加入日期: Mar 2002
文章: 925
x264對堆土機XOP優化 性能暴增?

引用:
2011-10-04 04:46:38 < Dark_Shikari> C, with mode analysis shortcuts: 253 cycles
2011-10-04 04:46:45 < Dark_Shikari> My crappy, badly optimized XOP asm: 93 cycles
2011-10-04 04:46:56 < Dark_Shikari> This is kinda awesome
2011-10-04 04:49:35 < Dark_Shikari> Oh, and old without shortcuts: 379 cycles
2011-10-04 04:49:45 < Dark_Shikari> My asm is 4 times faster than the existing... wait where have we seen this before? XD
2011-10-04 04:49:57 < Dark_Shikari> It's just like SAD_4x4_x9 all over again!
2011-10-04 04:50:10 < JEEB>
2011-10-04 04:50:18 < JEEB> that sounds pretty awesome
2011-10-04 04:50:21 < Dark_Shikari> Except this time I'm still wondering how best to do it without vpperm
2011-10-04 04:50:33 < Dark_Shikari> Thanks AMD, for bringing back the best instruction ever after 15+ years of hiatus.


來源是IRC X264dev 大意是 Dark_Shikari對Xop做簡單的優化 效能增加三-四倍
這是堆土機的一大利多

12號之後就知道結果,到時候又有得吵了
     
      
舊 2011-10-05, 07:04 AM #1
回應時引用此文章
yhnui離線中