Categories: Data Storage

Facebook Study Into SSDs Finds ‘Several Distinct Failure Periods’

A joint study by Facebook engineers and Carnegie Mellon University experts into SSD failure rates has found that SSDs go through “several distinct failure periods” corresponding to the amount of data written to flash chips.

The paper, which its authors said is the “first comprehensive study of flash-based SSD reliability trends”, six SSD platforms used by Facebook were cross-examined for failure causes.

Although the SSDs studied were not disclosed, the paper said that the components were “similar to those” used in server hardware available from firms such as Fusion-io, Hitachi, Intel, OCZ, Seagate and Virident.

‘Several distinct failure periods’

“We observe that SSDs go through several distinct failure periods – early detection, early failure, usable life, and wearout – during their lifecycle, corresponding to the amount of data written to flash chips,” was the authors’ first conclusion.

The researchers advised that additional error correction at the start of an SSD’s life would go some way in reducing the failure rates during the early detection period.

Inside Facebook’s Swedish data centre

Another observation from the researchers was that SSDs that do not use throttling techniques to manage temperature have more chance of failure.

“Higher temperatures lead to increased failure rates, but do so most noticeably for SSDs that do not employ throttling techniques,” said the study. “In general, we find techniques like throttling, which may be employed to reduce SSD temperature, to be effective at reducing the failure rate of SSDs.”

The most interesting finding from the study, which examined the SSDs over a four-year period, was that read disturbance errors, are “not prevalent in the field”. The researchers said that SSDs that have read the most data do not show a statistically significant increase in failure rates.

“We find that the effect of read disturbance errors is not a predominant source of errors in the SSDs we examine,” said that two Facebook and two Carnegie researchers.

“While prior work has shown that such errors can occur under certain access patterns in controlled environments… we do not observe this effect across the SSDs we examine.”

Take our cloud quiz here!

Ben Sullivan

Ben covers web and technology giants such as Google, Amazon, and Microsoft and their impact on the cloud computing industry, whilst also writing about data centre players and their increasing importance in Europe. He also covers future technologies such as drones, aerospace, science, and the effect of technology on the environment.

Recent Posts

TikTok Viewed As Chinese Influence Tool By Most Americans – Poll

Most people in the United States view TikTok as a Chinese influence tool a poll…

8 hours ago

Ofcom Confirms OnlyFans Investigation Over Age Verification

UK regulator confirms it is investigating whether OnlyFans is doing enough to prevent children accessing…

8 hours ago

Ex Google Staff Fired Over Israel Protest File NLRB Complaint

Dismissed staff file complaint with a US labor board, and allege Google unlawfully terminated their…

9 hours ago

Tesla Axes Entire Supercharger Team, Plus Senior Executives

Elon Musk dismisses two senior Tesla executives, plus the entire division that runs Tesla's Supercharger…

11 hours ago

Microsoft, OpenAI Sued By More Newspaper Publishers

Eight newspaper publishers in the US allege Microsoft and OpenAI used their millions of their…

12 hours ago

Binance’s Changpeng Zhao Sentenced To Four Months In Prison

US judge sentences Binance founder, Changpeng Zhao, to four months in prison for ignoring money…

15 hours ago