Investigations into Socket 939 Athlon 64 Overclocking
by Jarred Walton on October 3, 2005 4:35 PM EST- Posted in
- CPUs
RAM Latency
We talked about memory latency a bit before, and we estimated the total latency. Using CPU-Z and the included Latency.exe utility, we can get an actual real value for memory latency - or at least, more real than our estimate. There are different ways of measuring memory latency, but we simply took the highest reported value from the table that was generated. For every system, that value was in the bottom-right corner of the table, indicating a data set of 32MB and a stride of 512. While the values may or may not be entirely accurate, they should at least be consistently measured across all the tested configurations. Here are the results, in CPU cycles as well as nanoseconds. Remember that at higher CPU speeds, each cycle is faster, so pay more attention to the nanoseconds graph.
The value RAM is obviously slower than the performance RAM across the tests. This is not at all surprising. You can see how the latency of the performance RAM trends downward until we were forced to drop to 2T or PC2700 in order to reach a stable result, while the value RAM fluctuates more. Every time that we have to drop the speed, i.e. from PC3200 to PC2700, there is an initial increase in latency, followed by a trend downward until we have to drop to the next RAM ratio.
While the value RAM is clearly slower than the expensive RAM, the big question is: how much performance do you actually lose by opting for more economical RAM? This is one of the areas that we are interested in testing for this article, and we'll comment on the results throughout the benchmarks.
RAM Possibilities
One area of the BIOS that's missing (for now) is support for additional RAM ratios. The reality is that the RAM speed is derived from the CPU clock with a divider. Ideally, we'd like to see a BIOS that gives direct access to the CPU divider rather than hiding it behind approximate memory speeds. That would allow for the selection of a larger range of options, but we're not sure if that's something that AMD controls in the CPU or if the BIOS programmers can do this. Here's a list of the standard RAM Settings, CPU multipliers and the resultant memory dividers that were available in the Infinity BIOS.
We have a LanParty UT nF3 250Gb that includes support for many in-between options from DDR200 through DDR500. Some of the interesting inclusions are DDR240, DDR280, DDR300, DDR360, DDR440, and DDR500. (CPU-Z actually failed to report the divider on a couple of those settings, but the resultant RAM speed was still read properly.) While additional memory dividers on the high end won't really help tweakers looking to get the most from the TCCD, BH5, or CH5 RAM, they can be particularly useful when using value RAM. All you need to do is select the appropriate divider to get your RAM under DDR400 - assuming standard value RAM. That way, you wouldn't take as much of a performance hit by running something like 10x250 (2.5 GHz). You could select a 13X divider rather than the standard 15X divider.
Additional ratios can even be useful for tuning performance RAM. For example, OCZ VX would not run stable above DDR500 (10x250), and we had to use 2-3-3-8-1T timings even then. (The 3.2V limit of the motherboard was at least partially to blame.) Running at 10x270, we had to drop to DDR333 (CPU/12), which resulted in the RAM running at DDR450 rather than the DDR540 result that would have been required for the normal CPU/10 ratio. However, if we could have selected a CPU/11 ratio, we could have run the RAM at DDR490 and gained a bit more performance. The additional ratios aren't required, but they would be nice to have.
If the last two paragraphs didn't make sense, then you can guess why we don't get additional access to RAM dividers. Experienced users might know how to make use of the settings, but many people would simply get confused; a "Catch-22".
We talked about memory latency a bit before, and we estimated the total latency. Using CPU-Z and the included Latency.exe utility, we can get an actual real value for memory latency - or at least, more real than our estimate. There are different ways of measuring memory latency, but we simply took the highest reported value from the table that was generated. For every system, that value was in the bottom-right corner of the table, indicating a data set of 32MB and a stride of 512. While the values may or may not be entirely accurate, they should at least be consistently measured across all the tested configurations. Here are the results, in CPU cycles as well as nanoseconds. Remember that at higher CPU speeds, each cycle is faster, so pay more attention to the nanoseconds graph.
The value RAM is obviously slower than the performance RAM across the tests. This is not at all surprising. You can see how the latency of the performance RAM trends downward until we were forced to drop to 2T or PC2700 in order to reach a stable result, while the value RAM fluctuates more. Every time that we have to drop the speed, i.e. from PC3200 to PC2700, there is an initial increase in latency, followed by a trend downward until we have to drop to the next RAM ratio.
While the value RAM is clearly slower than the expensive RAM, the big question is: how much performance do you actually lose by opting for more economical RAM? This is one of the areas that we are interested in testing for this article, and we'll comment on the results throughout the benchmarks.
RAM Possibilities
One area of the BIOS that's missing (for now) is support for additional RAM ratios. The reality is that the RAM speed is derived from the CPU clock with a divider. Ideally, we'd like to see a BIOS that gives direct access to the CPU divider rather than hiding it behind approximate memory speeds. That would allow for the selection of a larger range of options, but we're not sure if that's something that AMD controls in the CPU or if the BIOS programmers can do this. Here's a list of the standard RAM Settings, CPU multipliers and the resultant memory dividers that were available in the Infinity BIOS.
Standard AMD Memory Ratios | |||
RAM Setting | CPU Multiplier | RAM Divider | True RAM Speed |
DDR200 | 9X | CPU/18 | DDR200 |
DDR266 | 9X | CPU/14 | DDR257 |
DDR333 | 9X | CPU/11 | DDR327 |
DDR400 | 9X | CPU/9 | DDR400 |
DDR200 | 10X | CPU/20 | DDR200 |
DDR266 | 10X | CPU/15 | DDR267 |
DDR333 | 10X | CPU/12 | DDR333 |
DDR400 | 10X | CPU/10 | DDR400 |
DDR200 | 11X | CPU/22 | DDR200 |
DDR266 | 11X | CPU/17 | DDR259 |
DDR333 | 11X | CPU/14 | DDR314 |
DDR400 | 11X | CPU/11 | DDR400 |
DDR200 | 12X | CPU/24 | DDR200 |
DDR266 | 12X | CPU/18 | DDR267 |
DDR333 | 12X | CPU/15 | DDR320 |
DDR400 | 12X | CPU/12 | DDR400 |
We have a LanParty UT nF3 250Gb that includes support for many in-between options from DDR200 through DDR500. Some of the interesting inclusions are DDR240, DDR280, DDR300, DDR360, DDR440, and DDR500. (CPU-Z actually failed to report the divider on a couple of those settings, but the resultant RAM speed was still read properly.) While additional memory dividers on the high end won't really help tweakers looking to get the most from the TCCD, BH5, or CH5 RAM, they can be particularly useful when using value RAM. All you need to do is select the appropriate divider to get your RAM under DDR400 - assuming standard value RAM. That way, you wouldn't take as much of a performance hit by running something like 10x250 (2.5 GHz). You could select a 13X divider rather than the standard 15X divider.
Additional ratios can even be useful for tuning performance RAM. For example, OCZ VX would not run stable above DDR500 (10x250), and we had to use 2-3-3-8-1T timings even then. (The 3.2V limit of the motherboard was at least partially to blame.) Running at 10x270, we had to drop to DDR333 (CPU/12), which resulted in the RAM running at DDR450 rather than the DDR540 result that would have been required for the normal CPU/10 ratio. However, if we could have selected a CPU/11 ratio, we could have run the RAM at DDR490 and gained a bit more performance. The additional ratios aren't required, but they would be nice to have.
If the last two paragraphs didn't make sense, then you can guess why we don't get additional access to RAM dividers. Experienced users might know how to make use of the settings, but many people would simply get confused; a "Catch-22".
101 Comments
View All Comments
Deathcharge - Saturday, October 15, 2005 - link
also what do you think of opteron 144 or 146? the 144 are very cheap and they OC quite well apprentlypmorcos - Thursday, October 13, 2005 - link
Before I comment, you should know that I have been overclocking for 8 years now and literally overclocked all but one of the chips you mentioned in the beginning of this very good article. The HT multiplier was new to me with my most recent DFI NF4-SLI-DR board so I found that extremely useful and plan to see if I can up my speeds...but I digress.I think it would be extremely valuable to TRY to put in words the order with which an overclocker should approach making changes to settings. In other words, which is likely to be the most limiting/critical aspect(s) and from there tweak the others to max the system out.
It would be interesting to say, for example, that you start with a "safe" power settings (which is pretty obviously the limiting factor). For example, let's say your CPU and memory are rated at 1.3 and 2.8 V respectively. Why not go straight to "safe" settings for the two and tweak from there? It seems that the most useful piece of information that is NOT provided by anandtech or anyone else for that matter is a voltage and temp graph of stability/viability for these chips. It would be simple to take 3 samples (at a cost) of each chip and run the test with "average" cooling and find out what is "safe". For example if running all stock settings but upping voltages to say 2.4/3.6 V in the example above, you might see stability up to 1.65 / 3.1 V with the parts catching fire at say 1.8/3.3 V or stable at temp readings for cpu/memory of 44/47C but unstable above that. Once armed with these two graphs of information averaged from 3 chips tested the rest is very straight forward.
You simply set the cpu volts to 1.65 and memory to 3.1 V (the safe settings; check real voltages vis bios monitoring) and now you up your fsb and tweak your memory timings and in a few minutes you are running max.
Why do I think this is more valuable that showing us a graph of your results? Because like many I'm squeemish about upping the voltage on my processor and memory. I'm worried much more about the power-on affects than I am the "long-term" effects.
In computers, there are no long-terms for an overclocker. An overclocker's comp is 60% hardware and 40% software. Their greatest joy is in posting results on their favorite forum. I want to know that when I hit the power button...that the 1.7V setting does NOT have a 10% chance of blowing my processor.
My ramblings. Thanks again for another great article from by far the VERY BEST place in the world to find out how computer parts work.
JarredWalton - Thursday, October 13, 2005 - link
Thanks pmorcos.I'm working on the X2 3800+ OC followup, and I've gone back and done further testing of temperatures and voltages. Chips differ, so the real advice I have on that subject is to test your own chip extensively. I've heard of people doing 2.8 GHz on 1.500V with the Venice chips, but mine won't even POST at those settings. I think 1.65 or 1.70V was required to POST, and even then I couldn't run stable benchmarks without more voltage.
I will also be trying to cover a bit more of the "how to" process in the next one. Consider this the foundation, and the next article will refine the approach a bit. Your comments on what you'd like to see more of are definitely welcome, though, and I'll try to address the order and approach I take next.
Concerning another comment: "I want to know that when I hit the power button...that the 1.7V setting does NOT have a 10% chance of blowing my processor." I'm not quite sure I understand the concern or know how to test that. Are you saying that the power on process has more voltage fluctuations and may therefore toast the CPU in the first second? (I haven't had that happen over the past several months of testing this chip and others in overclocked setups.) I must admit that I'm extremely nervous about the 1.850V I used for running at 2.80 GHz, but even then the chip continued to function (for now - heheh).
Cheers!
Jarred Walton
WhipperSnapper - Thursday, October 13, 2005 - link
That was one of the best computer enthusiast website articles that I've read in a long time, but perhaps I don't get around too much. I'd like to hear more about the problems that spilled over to other components, such as the SATA hard drive (mentioned in the Final Thoughts) and whether or not the overclocking can be isolated to the CPU and RAM. I also wondered if there was a reason why you guys used a SATA hard drive and not an IDE drive and whether overclocking requires a SATA hard drive. (I don't see why it would.)
Also, have you guys tried to do any tests using memory stick heatsinks? Do they actually do anything? That subject might make for a worthwhile article on its own--RAM cooling.
aptinio - Saturday, October 8, 2005 - link
bravo! great article. very informative but not too bloated. can't wait to finally upgrade my amd k6-II with 1mb l3 cache on the motherboard! lol!Kougar7 - Thursday, October 6, 2005 - link
Thank you for the excellent, comprehensive, and very thorough article! :-) It must have taken a massive amount of work and time to complete. It’s answered my recent musings about my own Crucial value ram, which looks much nicer now! It’s also solved a question about OCing with recent AMD 64 chips, amongst also correcting a few personal misconceptions I’ve had.I just wish to ask if you plan to include a similar article on OCing with P4s? I personally run a 2.8C (Northwood) @ 3.4 rock solid at the 3.4C’s default voltage, but am now wondering exactly what performance hits, if any, that I’ve taken from having to use a 5:4 CPU:DRAM ratio instead of the previous 1:1, even though I’ve kept it at DDR390 and the timings better than specs.
I’m planning to bench the differences from a 1:1 ratio, a 3:2 ratio at highest speed I can get (sub-DDR333), my current setup, and finally one other setting where I got the value memory to run 2-2-2-6 timings, to get a more solid idea on which performs best with some solid figures.
Although the core and the platform itself both have both changed, I’d still be interested in a Intel processor based test! Perhaps instead of a P4, maybe a Pentium “D” OCing article similar to what you have planned with the X2 3800+? ;-)
I’m very much looking forward to your X2 3800+ OCing review!! You rock :-D Thanks in advance for it!
JarredWalton - Thursday, October 6, 2005 - link
I'm trying to get a socket 775 motherboard that will overclock well with Pentium D 820. Once I get that, I can give it a go. I've also got a Pentium 4 505 and a 540 that I want to run some similar tests on. First, though, I need an appropriate motherboard.clue22 - Thursday, October 6, 2005 - link
so basically what the everybody is saying about the value RAM vs. low latency more expensive RAM is that for the athlon 64 it is basically a waste of money (i.e. you only get about 5% performance gain), but usually spend 100% or more money to get the "better" RAM. i have to build a couple of systems pretty soon and now i believe that my money would be better spent on 2GB of value RAM vs. 1GB of the more expensive stuff. does anyone know of a test that has been run with 2.5-3-3-8-1t vs. 2-2-2-5-1t? also why does every mid-range/gaming/hot-rod price guide ever recommend the either the samsung tccd (or tcc5) or winbond bh5/ch5 based memory if it has so little effect on performance. finally is it even important anymore (if it ever was) to get matched pairs of memory that are bundled together (supposedly manufactured at the same time)? i was looking at some corsair (had good experience with them in the past) xms3200xl RAM but now i think i should get more of their value select memory instead.thanks
RupertS - Wednesday, October 26, 2005 - link
so basically what the everybody is saying about the value RAM vs. low latency more expensive RAM is that for the athlon 64 it is basically a waste of moneyThis may not be a general rule.
It may just be that at this stage of development for GPU's, CPU's and memory, memory has more than enough capacity - it is not the choke point. If GPU and CPU speed were to improve while memory speed stayed the same, you might reach the point where increasing GPU and CPU speed was non-productive for games, while overclocking memory provided large performance improvements.
rabbit fighter - Wednesday, October 5, 2005 - link
Where was this explained? He said the 3200 was better in the first paragraph and that he would explain later, but I can't find the later explanation!