: 1140: c7 44 24 f8 01 00 00 movl $0x1,-0x8(%rsp) 1147: 00 1148: 8b 44 24 f8 mov -0x8(%rsp),%eax 114c: 89 44 24 fc mov %eax,-0x4(%rsp) 1150: 31 c0 xor %eax,%eax 1152: c3 ret ``` ### 2.2 实现原理具体实现是使用编译器的asm扩展实现，不同编译器实现原理不同，本文主要聚焦clang/gcc实现，msvc有兴趣的同学自己看。 ```cpp template inline BENCHMARK_ALWAYS_INLINE void DoNotOptimize(Tp& value) { #if defined(__clang__) asm volatile("" : "+r,m"(value) : : "memory"); #else asm volatile("" : "+m,r"(value) : : "memory"); #endif } ``` 由于clang和gcc的实现类似，只存在细节上的差异，这里只描述gcc的。 - https://gcc.gnu.org/onlinedocs/gcc/Extended-Asm.html - https://releases.llvm.org/14.0.0/tools/clang/docs/LanguageExtensions.html DoNotOptimize的实现使用了编译内链汇编的扩展,具体语法如下： ``` asm asm-qualifiers ( AssemblerTemplate : OutputOperands [ : InputOperands [ : Clobbers ] ]) ``` - ```volatile```：asm-qualifiers标记为```volatile```表示禁止编译器优化； - ```""```：一个空语句； - ```"+r,m"(value)```：约束value读写内存的行为，+表示这是一个读/写操作数，r表示可以使用通用寄存器，m表示可以使用内存； - ```memory```：是一个内存屏障，告诉编译器以此语句为分界线，上面的语句不能排序到下面，下面的语句不能排序到上面，来保证执行顺序。 ```volatile```的唯一作用就是告诉编译器这个变量不能被优化必须经过寄存器读写到内存，而不是直接操作内存。而```"+r,m"(value)```限制了具体读写的行为。```memory```为了保证执行语义而存在，比如程序： ```cpp int testFunc(int a){ return a; } int main(int argc, char **argv){ int a = 1; auto t = clock(); DoNotOptimize(a); auto c = testFunc(a); DoNotOptimize(t); auto t2 = clock(); return 0; } ``` 如果没有```memory```的存在，```a```和```t```便来仍然会读写内存不会被优化，但是不同语句的执行顺序不能严格保证的。```auto c=testFunc(a)```和```auto t=clock()```由于两个语句质量没有任何的关联性，在编译器看来对该语句进行排序不会有任何副作用，但是实际上的你的意图是计算耗时排序反而会导致计算不准确。 ## 3 总结为了精确的测试程序的耗时，尽量在测试区间添加```DonotOptimize```，避免编译器优化而导致测试错误。 ## 4 参考文献 - https://gcc.gnu.org/onlinedocs/gcc/Extended-Asm.html - https://github.com/google/benchmark/blob/0baacde3618ca617da95375e0af13ce1baadea47/include/benchmark/benchmark.h#L331-L337 - https://github.com/google/benchmark/blob/e451e50e9b8af453f076dec10bd6890847f1624e/include/benchmark/benchmark.h#L339-L368 - https://releases.llvm.org/14.0.0/tools/clang/docs/LanguageExtensions.html - https://github.com/google/benchmark/issues/242 - https://theunixzoo.co.uk/blog/2021-10-14-preventing-optimisations.html