I've tried a 55nm Gf9600gt as well, with 288GFlops (
8600gt: 113GFlops,
Hd6570: 624GFlops). While the two previous cards perform similarly, the
Gf9600gt outperforms both ones on games. I understand its
Ram throughput makes the difference.
To put figures on that, I used
3DMark 2001se. It's a dX8 test, about as old as the games I play, and I put its resolution at
1920*1200*32 which is my target at games. I noted the
smallest Fps (and the mean Fps) at the tests "Car Chase High", "Lobby High" and "Nature". These gave me the same ranking as the games.
I also
varied the Gddr3 frequency a lot at the Gf9600gt, because the initial 1000MHz were abnormally low. The result is convincing, with
some test improving nearly as much as the Ram frequency. Please note that on the graph here (log in to see, click to magnify), the Fps of "Lobby high" are halved.
9600gt_MHz_Fps_Series.png (20.47K)
Number of downloads: 2
9600gtEtc_FiguresMHzFps.png (4.56K)
Number of downloads: 2
From these measurements, we see clearly that
the Ram determines the speed, more so the minimum Fps than the mean ones.
- Measured at 1024*768*32 with the dX9.0 test 3DMark 2003, the Ram throughput is still very important. Anyway, recent cards have to address big screens.
- All these cards are dX10 or dX11 with less efficient
unified shaders, while my dX9 games and tests use
vertex and pixel shaders. Maybe the translation by the drivers demands more bandwidth.
- It could be (or not) that dX10 games take a better advantage of the on-Gpu caches.
- Multi-Gpu cards and Sli improve the bandwidth, but they may also need more if each Gpu requests the full amount of data in the Ram. It depends on driver programming.
Marc Schaefer, aka Pointertovoid