AtomicAdd 300% faster than simple store on Pascal GPUs!

Post your questions, suggestions and experiences regarding to Image manipulation, 3d modeling and level editing for the Irrlicht engine here.
Post Reply
devsh
Competition winner
Posts: 2057
Joined: Tue Dec 09, 2008 6:00 pm
Location: UK
Contact:

AtomicAdd 300% faster than simple store on Pascal GPUs!

Post by devsh »

I have a very peculiar request as I'm getting very peculiar results (atomic to the same address by all threads faster than a simple write), could anyone on Linux with a GPU supporting OpenGL 4.3+ help me with a benchmark?

It's already compiled for Ubuntu 17.04 (will run on higher Ubu or Debian).
I just need it ran in the terminal (command-line) with parameters 1920,1080,MODE,n when std::cin asks for them.

MODE will be a number from 0 to 7 inclusive

Press on the window (focus on it) and hit Q to stop the benchmark.

If you could let me know what numbers you get I would be much obliged!
(some tests won't run if you don't have NVidia)

https://github.com/buildaworldnet/Irrli ... Benchmark/ (everything in one folder)

P.S. I'm working on a Windows Visual Studio project and a compute shader-based benchmark.
Last edited by devsh on Thu Jul 05, 2018 6:41 pm, edited 1 time in total.
devsh
Competition winner
Posts: 2057
Joined: Tue Dec 09, 2008 6:00 pm
Location: UK
Contact:

Re: Help Me with a Benchmark Please...

Post by devsh »

I actually figured out why this strange behaviour happens.

I wrote a wiki page on it
https://github.com/buildaworldnet/Irrli ... hink-Again!

If you are interested in atomics on GPUs work, how shaders get scheduled and how this all works in hardware and what the NVIDIA shader compiler does behind the scenes... give it a read.
Post Reply