The SSD Anthology: Understanding SSDs and New Drives from OCZ
by Anand Lal Shimpi on March 18, 2009 12:00 AM EST- Posted in
- Storage
Putting Theory to Practice: Understanding the SSD Performance Degradation Problem
Let’s look at the problem in the real world. You, me and our best friend have decided to start making SSDs. We buy up some NAND-flash and build a controller. The table below summarizes our drive’s characteristics:
Our Hypothetical SSD | |
Page Size | 4KB |
Block Size | 5 Pages (20KB) |
Drive Size | 1 Block (20KB |
Read Speed | 2 KB/s |
Write Speed | 1 KB/s |
Through impressive marketing and your incredibly good looks we sell a drive. Our customer first goes to save a 4KB text file to his brand new SSD. The request comes down to our controller, which finds that all pages are empty, and allocates the first page to this text file.
Our SSD. The yellow boxes are empty pages
The user then goes and saves an 8KB JPEG. The request, once again, comes down to our controller, and fills the next two pages with the image.
The picture is 8KB and thus occupies two pages, which are thankfully empty
The OS reports that 60% of our drive is now full, which it is. Three of the five open pages are occupied with data and the remaining two pages are empty.
Now let’s say that the user goes back and deletes that original text file. This request doesn’t ever reach our controller, as far as our controller is concerned we’ve got three valid and two empty pages.
For our final write, the user wants to save a 12KB JPEG, that requires three 4KB pages to store. The OS knows that the first LBA, the one allocated to the 4KB text file, can be overwritten; so it tells our controller to overwrite that LBA as well as store the last 8KB of the image in our last available LBAs.
Now we have a problem once these requests get to our SSD controller. We’ve got three pages worth of write requests incoming, but only two pages free. Remember that the OS knows we have 12KB free, but on the drive only 8KB is actually free, 4KB is in use by an invalid page. We need to erase that page in order to complete the write request.
Uhoh, problem. We don't have enough empty pages.
Remember back to Flash 101, even though we have to erase just one page we can’t; you can’t erase pages, only blocks. We have to erase all of our data just to get rid of the invalid page, then write it all back again.
To do so we first read the entire block back into memory somewhere; if we’ve got a good controller we’ll just read it into an on-die cache (steps 1 and 2 below), if not hopefully there’s some off-die memory we can use as a scratch pad. With the block read, we can modify it, remove the invalid page and replace it with good data (steps 3 and 4). But we’ve only done that in memory somewhere, now we need to write it to flash. Since we’ve got all of our data in memory, we can erase the entire block in flash and write the new block (step 5).
Now let’s think about what’s just happened. As far as the OS is concerned we needed to write 12KB of data and it got written. Our SSD controller knows what really transpired however. In order to write that 12KB of data we had to first read 12KB then write an entire block, or 20KB.
Our SSD is quite slow, it can only write at 1KB/s and read at 2KB/s. Writing 12KB should have taken 12 seconds but since we had to read 12KB and then write 20KB the whole operation now took 26 seconds.
To the end user it would look like our write speed dropped from 1KB/s to 0.46KB/s, since it took us 26 seconds to write 12KB.
Are things starting to make sense now? This is why the Intel X25-M and other SSDs get slower the more you use them, and it’s also why the write speeds drop the most while the read speeds stay about the same. When writing to an empty page the SSD can write very quickly, but when writing to a page that already has data in it there’s additional overhead that must be dealt with thus reducing the write speeds.
250 Comments
View All Comments
matrixireland - Thursday, December 24, 2009 - link
hi would like to know what you pros think of the;Golden Leopard ASAX-ZIF1.8-SSD? what would you add to it?
And how would you rate it against other ssd?
Specifications:
product description
ASAX-ZIF1.8-SSD is a high-performance design solid state drive based on the high-end micro-control IC with flash memory storage medium integrated advantaged of high speed,convenient ,aseismatic,energy-saving etc.
specification
Model
Size
Interface
Material
ASAX-ZIF1.8-SSD
1.8inch 70×54×6mm
ZIF
Aluminum-magnesium alloy appearance ,drawbench and colorful oxidation surface,elegant temperament
performance
read speed:80- 96Mbytes/second write speed:50- 60Mbytes/second
support ATA-7 V3 PIO/multi word/ultra DMA MODES
Low power TFBGA,4 channel of flash controller,masked ROM and data SRAM
SAMSUNG flash keeps the data faster on reliability and endurance
Dynamic and static wear-leveling prolong NAND FLASH and SSD for longer life
8/16 bit BCH ECC data error correction ability effectively guarantee the data read security.
Design consideration
Capacity
16G/32G/64G/128G/256G
Average access time
<0.25MS
operating temperature
0-85°
power consumption
DC Input Voltage(3.3V or 5 V ± 10%)Read and write:135mA/194Ma wait:70mA
shock
1500G
Application
the Laptop, pc, server,workstation,portable media player,digital collection apparatus and any computer equipment which need consecutive read and write speed and high reliability storage.
jay401 - Wednesday, March 18, 2009 - link
yeah, he wants "more expensive than" or "too expensive for".Spoelie - Wednesday, March 18, 2009 - link
Second page as well:missing charts before and after this paragraph:
"The chart above shows how much faster these affordable MLC SSDs were than the fastest 3.5” hard drive in sequential transfers. But now look at random write performance:"
Spoelie - Wednesday, March 18, 2009 - link
chart 1 on page 2 now shows sequential read but the paragraph is changed to mention random read ;)page 21: As far as I know, this is THE one of THE only reviews
Some very surprising benchmark results for the ocz vertex, I thought the new firmware tanked sequential read speeds (to 80-90) based on the explanation beforehand, but not according to the actual graphs.
Spoelie - Wednesday, March 18, 2009 - link
third page, first table, first column: SSD and HDD entries are switchedmikaela - Tuesday, March 16, 2010 - link
yeah great info. also great resourceSpoelie - Wednesday, March 18, 2009 - link
page 19: I’d never reviewed it'd & -ed?
HolyFire - Wednesday, March 18, 2009 - link
"I'd never reviewed it" is correct. "I'd" here means "I had", it's Past Perfect tense.FishTankX - Wednesday, March 18, 2009 - link
That should have bolded "too"FishTankX - Wednesday, March 18, 2009 - link
Also, I think the velociraptor vs X-25 figures are swapped. 6 odd ms for the intel drive and 0.11ms for the velociraptor..