Processor Obfuscation and You: An Update on AMD and Intel CPUs
by Kristopher Kubicki & Jarred Walton on May 17, 2005 12:00 PM EST- Posted in
- CPUs
Socket 939
The best alternative is all that remains, and that's Socket 939. With dual core Athlon 64 X2 chips planned at least through the 4800+ (2.4 GHz with 1MB L2), 939 has quite a lot of headroom remaining. At present, there are at least four variants of Athlon 64 processors for Socket 939, and two more are planned for the near future.The oldest of the offerings is the Sledgehammer core, which was used in the FX-53 and FX-55. (The FX-55 used strained silicon, but we're not aware of any additional code name for that model.) Sledgehammer (which we often just interchange with Hammer) was also used for the original "CG" Opteron processors, and there was a slight modification made to allow the use of non-registered (unbuffered) RAM. Performance is very good; in fact, the Sledgehammer was, until very recently, the fastest processor around. The major drawback is the same as for Clawhammer - the core was expensive to produce, due to the 130nm process and the 105+ million transistors. That brings us to the next processor core.
Newcastle was available for either 754 or 939 and used a 130nm process. Newcastle differs from Hammer primarily by the reduced L2 cache size and "CG" stepping. Since the 1MB cache was about half of the total CPU core, reducing the cache made the core substantially smaller and thus, cheaper to produce. What's interesting to note is that the Socket 939 CPUs had a huge price premium relative to their Socket 754 counterparts - a price premium that appears to be related to supply/demand and price/performance concerns more than anything else. Consider that both the 3400+ and 3500+ are basically identical, other than the socket. Both are 2.2 GHz 512K cache chips made on a 130nm process. However, the Athlon 64 3500+ carried a roughly 50% price premium. The end result is that Socket 939 chips tend to outperform their Socket 754 counterparts, due to the dual channel memory controller. The market basically dictates the prices that AMD and Intel can charge, and the market determined (with good reason) that Socket 939 was preferable to Socket 754.
The follow up to the Newcastle core is Winchester, which is more or less the same chip design, but only produced on a 90nm process instead of 130nm. We say "more or less" because there are measurable performance differences, and we're not entirely sure what caused the increased performance of the 90nm parts. More important than the performance difference - it's generally less than 5% and sometimes as little as 1%, which means that you won't notice it in practical use - is that the 90nm SOI parts run substantially cooler than the 130nm parts. Our system power draw tests have shown that even the fastest 90nm parts still draw less power than the slowest 130nm parts. In fact, the dual-core 90nm parts are also less power-hungry than most of the 130nm parts. But we're jumping ahead.
Newcastle 3500+ models are still available at a discount relative to the newer 90nm parts, again most likely due to marketing factors rather than the actual cost to produce the chips. AMD may also be trying to clear out the old inventory before making it obsolete. The end result is that Winchester 3200+ and 3000+ parts are cheap and readily available, while the 3500+ Winchester costs quite a bit more than the 3500+ Newcastle. For most people, we'd either go with the cheaper and slower 90nm cores or make the jump to one of the higher performance 90nm cores; the 3500+ 90nm just isn't a very good bargain right now.
Generally, you can determine a processor is a Winchester CPU if the SKU ends in "BI". Winchester processors only utilize the "D0" stepping that we mentioned earlier, so don't expect SSE3 either. Although Winchesters are certainly a step up from the 130nm Newcastles, they are not the only 90nm chips around.
The most recent addition to AMD's lineup is the new Venice core. It appears to be the same as Winchester, only with SSE3 extensions enabled. Applications that take advantage of SSE3 are still few and far between, and even then, the difference isn't tremendous. It's sort of like the x86-64 advantage when using less than 2 GB of RAM. The extra registers might improve performance by 5% or so, but it's not a huge deal. One thing that does differentiate Venice from Winchester is that Venice parts are also available in 3800+ models. Winchester tops out at the 3500+ (2.2 GHz) part right now, which is odd, considering that many overclockers are reaching 2.6 GHz and beyond quite easily. Given the recent overlap in chip names from AMD, perhaps we'll even see 2.6 GHz Venice parts in the future as an alternative 4000+, but that's pure speculation.
San Diego is the follow-up to Sledgehammer; the 1MB cache version of Venice. The new Athlon 64 3700+ and Athlon 64 4000+ parts use San Diego, as will the FX-57 when it's launched sometime in the next few months. San Diego chips have already begun to show up at retail (albeit in small quantities), and the prices are pretty attractive, for a high-end processor. Judging by the performance comparisons between Newcastle and Winchester, we would expect San Diego to outperform Sledgehammer clock for clock by a few percent - this is mostly due to caching algorithm optimizations, and the SSE3 additions are merely one more point in favor of the new core. We also expect the power consumption of San Diego to be significantly lower than Hammer, and quite a bit lower than Newcastle.
Venice and San Diego both use the "E" stepping cores, although there is actually a distinction between the steppings this time around. Venice, denoted by the "BP" at the end of the SKU, use the "E3" stepping. San Diego, on the other hand, use the "E4" stepping and are denoted by "BN" in the SKU.
Looking forward to later this year, we'll see the arrival of the Manchester and Toledo cores. Toledo is the dual core version of San Diego and it will power the Athlon 64 X2 4400+ and 4800+ parts (1024K L2 per core); the Manchester core will be used in the 4200+ and 4600+ (512K L2 per core), though we may also see "rejected" Toledo cores in those chips if the past is any indication. We've covered dual core performance in several recent articles: Intel Part I and Part II, along with the nForce4/855 comparison, and then the AMD Opteron and Athlon X2 articles. The basic summary is that anyone who runs a lot of applications at the same time - especially CPU intensive applications - will definitely benefit from dual core processors. The only fly in the ointment for AMD is that their dual core parts are currently targeting a much higher price range than their single core processors. In fact, the cost of a dual core chip like the 4200+ (2.2 GHz and 512K cache per core) is actually more than the cost of two 3500+ 130nm chips. That's definitely not the way to spur market adoption of dual cores, but it may be a business decision caused by the limited manufacturing capacity of AMD.
As a final topic that we didn't address yet, let's talk about overclocking for the various Socket 939 cores. Newcastle and Sledgehammer both top out around 2.5 GHz for the best chips, and many chips won't even reach that high. The heat and power requirements certainly play a role, although it's interesting that the FX-55 core with strained silicon added into the mix manages to run at 2.6 GHz, with overclocking attempts getting it as high as 3.0 GHz. The better choice for overclocking is definitely the 90nm parts. As usual, nothing is guaranteed with overclocking, and it's almost equal parts luck, components, and - for lack of a better term - artistry that determine your final clock speed. If you know what you're doing, we would venture to say that 2.5 GHz is achievable on all of the single core 90nm parts, though the 9X multiplier of the 3000+ may make it more difficult to reach. Early previews of the X2 chips are also showing results above 2.5 GHz, and when you think about it, it makes sense. The thermal density is actually the bigger problem with CPUs these days, so having a core that's twice as large in transistor count as well as die size basically means that the highest attainable clock speed (with a sufficient heat sink) will be similar. We won't know for sure until the retail launch when we can see results for more than a few hand-selected CPUs, but even without overclocking, the performance offered by the Athlon X2 is impressive.
Socket A
We don't consider Socket A/462 to be a valid platform for anyone anymore, at least for new purchases. If you're still running a socket A system and you want to prolong its life, your best bet is to grab an Athlon XP Mobile processor and overclock it. The Athlon XP-M 2500+ [RTPE: AXMH2500FQQ4C] has basically been at a flat price of $90 for a while now, and we don't see it dropping any further. The 2600+ increases the clock speed slightly, but since both CPUs have unlocked multipliers and can overclock to 2.2 to 2.4 GHz with relative ease, there's not much point in spending any extra money (though, it is only $6). If you're already running an Athlon XP 3200+, you're pretty much maxed out on the CPU and the only real performance gain will come by purchasing an Athlon 64 motherboard and CPU. Performance of a 2.4 GHz Barton core is roughly equal to that of the Athlon 64 3000+ - the integrated memory controller essentially improves performance by 20% relative to the older Athlon XP design.As for the individual cores, they all used 130nm process technology, and the maximum clock speed was similar - around 2.2 GHz. The added cache of the Barton core helped performance quite a bit, as did the increase in bus speed that came with the later parts. Modern Sempron chips for socket A use the Thoroughbred core, due to its smaller size. The one exception is the Sempron 3000+, which uses a Barton core clocked at 2.0 GHz. It's interesting to note that the Barton 2.0 GHz would have originally qualified as something like an Athlon XP 2800+, only 200 points below the Sempron rating. The Thoroughbred 2.0 GHz, on the other hand, was labeled as an Athlon XP 2400+ and later the Sempron 2800+, a difference of 400 points. Most of us realize that the performance ratings of the later Athlon XP chips were more marketing than reality. Joe Consumer figures a 3200+ is the same as a 3.2 GHz, right? The Sempron chips are now supposed to be compared with Intel's Celeron line rather than Pentium 4, and their model numbers seem to reflect true performance ratings better.
55 Comments
View All Comments
flatblastard - Tuesday, May 17, 2005 - link
"you're probably best off waiting a bit longer to upgrade from anything sporting an 865PE or 875P chipset (or else jumping ship and purchasing an AMD system)."I would like to know which one of you, Kris or Jarred, decided to throw in that last little tidbit. While I agree that it's probably best to wait on upgrading from 865pe/875p, was that last part really necessary? I was really enjoying the unbiased explanations right up until that moment. You should save comments like that for the conclusion section, where your opinions belong.
JustAnAverageGuy - Tuesday, May 17, 2005 - link
:thumbsup;IntelUser2000 - Tuesday, May 17, 2005 - link
Irwindale is Xeon variant of Prescott 2M, Prescott 2M is the code name for... Prescott 2M.ta2 - Tuesday, May 17, 2005 - link
I have a Socket 754 Sempron 2800+. CPU-Z gives me this information:Codename: Palermo
Technology: 90nm
Revision: DH8-D0
This processor DOES NOT have SSE3 as stated in your article.
Rand - Tuesday, May 17, 2005 - link
" At the time of publication, the only Paris Sempron comes in the form of the Sempron 3100+"The Paris core Sempron's are also available at 2600+ and 2800+, and available in 128KB varians not just 256K variants.
I've no idea on 3000+ varients, but I'd be surprised if there weren't some Paris 3000+ models out there.
I also didn't see any mention made of the S754 Athlon64 3300+. Granted it would appear to be available only to OEM's and that in relatively small quantities but still worth mentioning.
(2.4GHz/256K Winchester core).
Given the above you should also mention that the Winchester is available in 256K varients... at least in the one model only.
JarredWalton - Tuesday, May 17, 2005 - link
17 - the "partial" was to indicate that not all Prescott or Celeron D chips have EM64T. I probably should have made that more clear. Basically, I didn't want to put "EM64T" and have people assume that *all* Prescotts have it. They do, of course, but it's only enabled on the +1 models.BlvdKing - Tuesday, May 17, 2005 - link
I also have a 'CG' Clawhammer with 1mb L2 cache.ElFenix - Tuesday, May 17, 2005 - link
thanks!pxc - Tuesday, May 17, 2005 - link
Kristopher or Jarred: You labelled the Celeron D and P4 5xx columns "Partial EM64T". What is "partial" about it?I happen to have a P4 551 I got in December and there is nothing "partial" about EM64T support on it.
icarus4586 - Tuesday, May 17, 2005 - link
"Hammer chips (Sledgehammer and Clawhammer) are the oldest in the Athlon 64 fleet, and still us the "C0" stepping."There are also newer CG stepping models. (Clawhammer at least, I have one)