http://www.farrarfocus.com/atom/090105.htm
090105 / NV GPU Programming Guide
http://www.farrarfocus.com/atom/081224.htm
081224 / Larrabee
GPU Binning
AMD’s binning paper shows results for HD 4870 at only 1M bins in 0.02 seconds (50Mbin/sec) on 64^3 grid. Uses the multipass DrawAuto trick with StreamOut recirculation of non-binned points. Humm, the method seems a little too slow for me regardless of AMD or NVidia GPU. Too much overhead for the result. Say this was a GT280, at 141.7 GBs bandwidth, 50M/s points averages to over 2KB available peek bandwidth per point.
http://japan.internet.com/webtech/20090619/3.html
「NVIDIA、OpenCL 1.0 コンフォーマント ドライバを発表」
http://www.goodgearguide.com.au/article/270416/inside_tsubame_-_nvidia_gpu_supercomputer?pp=1
Inside Tsubame – the Nvidia GPU supercomputer
http://hal.archives-ouvertes.fr/docs/00/37/47/15/PDF/stats_on_instruction.pdf
Barra, a Modular Functional GPU Simulator for GPGPU
http://www2.lifl.fr/MAP/paap/firstPaapWorkshop/presentations/paap2007_matsuoka.pdf
” TSUBAME 2.0 and Beyond Infinity”