Soft Error Vs Hard Error

One experiment measured the soft error rate at sea level to be 5,950 failures in time (FIT) per DRAM chip.

At the high end is the sophisticated and costly Chipkill, developed by IBM, which can survive the loss of an entire memory chip or many multi-bit errors. The simplest common ECC scheme corrects single-bit errors and detects, but cannot correct, two-bit errors.
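The single-bit-correct, double-bit-detect behaviour described above can be illustrated with a small extended Hamming (8,4) code. This is a minimal sketch for illustration only: the function names `encode` and `decode` and the 4-bit word size are assumptions, and real memory ECC works on much wider words (and Chipkill uses a different, stronger scheme).

```python
def encode(nibble):
    """Encode 4 data bits as an 8-bit extended Hamming codeword (list of bits).

    Index 0 holds the overall parity bit; indices 1..7 are the classic
    Hamming(7,4) positions, with parity bits at positions 1, 2 and 4.
    """
    code = [0] * 8
    code[3] = (nibble >> 0) & 1
    code[5] = (nibble >> 1) & 1
    code[6] = (nibble >> 2) & 1
    code[7] = (nibble >> 3) & 1
    code[1] = code[3] ^ code[5] ^ code[7]
    code[2] = code[3] ^ code[6] ^ code[7]
    code[4] = code[5] ^ code[6] ^ code[7]
    code[0] = sum(code[1:]) % 2  # make the parity of the whole word even
    return code


def decode(code):
    """Return (status, nibble); status is 'ok', 'corrected' or 'uncorrectable'."""
    code = list(code)
    # The syndrome is the XOR of the positions of all set bits in 1..7.
    syndrome = 0
    for pos in range(1, 8):
        if code[pos]:
            syndrome ^= pos
    overall = sum(code) % 2
    if syndrome == 0 and overall == 0:
        status = "ok"
    elif overall == 1:
        # Odd parity: treat as a single flipped bit. The syndrome gives its
        # position (0 means the overall parity bit itself was hit).
        code[syndrome] ^= 1
        status = "corrected"
    else:
        # Non-zero syndrome with even overall parity: two bits flipped.
        return "uncorrectable", None
    nibble = code[3] | (code[5] << 1) | (code[6] << 2) | (code[7] << 3)
    return status, nibble


word = encode(0b1011)
word[6] ^= 1            # simulate one soft error
print(decode(word))     # the single flip is corrected
word[2] ^= 1            # a second flip in the same word
print(decode(word))     # detected, but not correctable
```

The extra overall parity bit is what separates the two cases: a single flip makes the word parity odd, while two flips leave it even but still disturb the syndrome.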

Often, however, design measures against soft errors are limited by the need to reduce device size and voltage, to increase operating speed, and to reduce power dissipation. At low energies many neutron capture reactions become much more probable and result in fission of certain materials, creating charged secondaries as fission byproducts. If soft write errors are seen, it may indicate the beginning of a problem.

Reduction in chip feature size and supply voltage, desirable for many reasons, decreases Qcrit.

Conventional memory layout usually places one bit of many different correction words adjacent on a chip.
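The effect of this layout can be sketched in a few lines: if physically adjacent cells belong to different correction words, a cluster of adjacent upsets costs each word at most one bit, which a single-bit-correcting code can repair. The modulo mapping and the names `interleave_map` and `words_hit` below are illustrative assumptions, not a description of any particular chip.

```python
def interleave_map(column, num_words):
    """Map a physical column to (word index, bit index within that word)."""
    return column % num_words, column // num_words


def words_hit(start_column, burst_length, num_words):
    """Count how many bits each word loses to a burst of adjacent upsets."""
    hits = {}
    for column in range(start_column, start_column + burst_length):
        word, _bit = interleave_map(column, num_words)
        hits[word] = hits.get(word, 0) + 1
    return hits


# With 8 interleaved words, a 4-cell cluster touches 4 different words once each.
print(words_hit(start_column=10, burst_length=4, num_words=8))
```

In this sketch a burst only puts two errors into one word once it is longer than the number of interleaved words.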

The thermal neutron flux from sources other than cosmic-ray showers may still be noticeable in an underground location and an important contributor to soft errors for some circuits. One mitigation involves increasing the capacitance at selected circuit nodes in order to increase their effective Qcrit value. The combination of node capacitance and voltage is described by the critical charge parameter, Qcrit, the minimum electron charge disturbance needed to change the logic level.

Soft errors are transient (e.g. a bit flip caused by cosmic rays), so rebooting will eliminate them. Hard errors are permanent.

The bad data bit can even be saved in memory and cause problems at a later time. It is extremely hard to maintain the material purity needed.

Thus, the importance of soft errors increases as chip technology advances. Some pages account for a large fraction of errors.

One source concludes that 1000–5000 FIT per Mbit (0.2–1 error per day per Gbyte) is a typical DRAM soft error rate.
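The conversion between the two units quoted above can be checked with a short calculation. This is a sketch under stated assumptions: one FIT is taken as one failure per 10^9 device-hours, a Gbyte is taken as 8 × 1024 Mbit, and the function name is hypothetical.

```python
HOURS_PER_FIT_UNIT = 1e9  # one FIT = one failure per billion device-hours


def fit_per_mbit_to_errors_per_day_per_gbyte(fit_per_mbit, mbit_per_gbyte=8 * 1024):
    """Convert a soft error rate in FIT/Mbit to expected errors per day per Gbyte."""
    fit_per_gbyte = fit_per_mbit * mbit_per_gbyte
    errors_per_hour = fit_per_gbyte / HOURS_PER_FIT_UNIT
    return errors_per_hour * 24


for rate in (1000, 5000):
    per_day = fit_per_mbit_to_errors_per_day_per_gbyte(rate)
    print(f"{rate} FIT/Mbit is about {per_day:.2f} errors per day per Gbyte")
```

Both ends of the range round to the quoted 0.2 and 1 error per day per Gbyte.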

Contrast with soft error. The rate of upsets in aircraft may be more than 300 times the sea level upset rate. This nuclear reaction is an efficient producer of an alpha particle, a ⁷Li nucleus, and a gamma ray.

Hard figures for DRAM susceptibility are hard to come by, and vary considerably across designs, fabrication processes, and manufacturers. 1980s technology 256 kilobit DRAMS could have clusters of five or six

If erroneous data does not affect the output of a program, it is considered to be an example of microarchitectural masking. For comparison, the count rate of a typical shoe's sole is between 0.1 and 10 cph/cm². MTBF is usually given in years of device operation; to put it into perspective, one FIT corresponds to an MTBF of approximately 1,000,000,000 / (24 × 365.25) ≈ 114,077 years, that is, 114,077 times longer than a one-year MTBF.
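The FIT-to-MTBF arithmetic above is easy to reproduce. The following sketch assumes the usual definition of one FIT as one failure per 10^9 device-hours and a year of 365.25 days; the function name `fit_to_mtbf_years` is hypothetical.

```python
HOURS_PER_YEAR = 24 * 365.25  # 8766 hours


def fit_to_mtbf_years(fit):
    """Convert a failure rate in FIT to a mean time between failures in years."""
    mtbf_hours = 1e9 / fit
    return mtbf_hours / HOURS_PER_YEAR


print(round(fit_to_mtbf_years(1)))        # MTBF in years for a rate of 1 FIT
print(round(fit_to_mtbf_years(5950), 1))  # MTBF in years for the 5,950 FIT per chip figure
```

Because the relationship is a simple reciprocal, a rate of 5,950 FIT per chip corresponds to an MTBF of roughly 19 years for a single chip under these assumptions.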

Soft errors are caused by events rather than a persistent physical condition, as is the case with hard errors.