Linked by Thom Holwerda on Thu 20th Oct 2005 19:18 UTC, submitted by anonymous
Hardware, Embedded Systems
Achieve high reliability in a microprocessor system by adding a second, identical processor that monitors and verifies the operation of the system processor, a technique known as lockstep processing. This tip demonstrates the integrated lockstep facility in the PowerPC 750GX processor.
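As a rough illustration of the lockstep idea described above, the sketch below feeds the same operands to two identical toy "cores" and compares their outputs after every step. It is only a software model under stated assumptions: the `Core` class, its `fault_at` field, and `run_lockstep` are hypothetical names invented for this example and do not reflect how the 750GX facility is actually implemented.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class Core:
    """Toy stand-in for a processor: a 32-bit accumulator (hypothetical model)."""
    acc: int = 0
    fault_at: Optional[int] = None  # hypothetical: step number at which a bit flips

    def step(self, n: int, operand: int) -> int:
        # Add the operand, wrapping at 32 bits.
        self.acc = (self.acc + operand) & 0xFFFFFFFF
        if self.fault_at == n:
            self.acc ^= 1  # simulate a single-bit upset in this core only
        return self.acc


def run_lockstep(operands: Iterable[int], master: Core, checker: Core) -> Optional[int]:
    """Run both cores on the same inputs and compare outputs after each step.

    Returns the index of the first step where the outputs differ,
    or None if the two cores agreed on every step.
    """
    for n, operand in enumerate(operands):
        if master.step(n, operand) != checker.step(n, operand):
            return n
    return None


if __name__ == "__main__":
    print(run_lockstep([1, 2, 3, 4], Core(), Core()))            # cores agree
    print(run_lockstep([1, 2, 3, 4], Core(), Core(fault_at=2)))  # checker diverges
```

Note that a two-way comparison like this can only detect a disagreement; it cannot tell which of the two cores is the faulty one.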
Only part of the problem
by Anonymous on Thu 20th Oct 2005 22:37 UTC

Stratus does this in hardware. Marathon Technologies does this in software. Both take care of many other failure modes (disk, power supply, I/O, memory, and in Marathon's design, even driver faults) that are much more likely to cause a system failure. I mean it's cool IBM is working on this (and they have some really strong high availability talent in house), but they need to take it to the system level (not just one component, and especially one - the CPU - that is among the least likely to fail).

Reply Score: 0

IBM?
by Anonymous on Thu 20th Oct 2005 23:44 UTC

System 88?

Reply Score: 0

RE: IBM?
by Anonymous on Thu 20th Oct 2005 23:51 UTC in reply to "IBM?"

The System 88 was OEM'd from Stratus.

Reply Score: 0

Same chip?
by Anonymous on Fri 21st Oct 2005 01:26 UTC

Are these the dual core chips used in the high end PowerMac G5?

Reply Score: 0

RE: Same chip?
by Anonymous on Fri 21st Oct 2005 02:41 UTC in reply to "Same chip?"

No, the Power Mac G5 uses the 970MP.

Reply Score: 0

G3
by Anonymous on Fri 21st Oct 2005 02:55 UTC

The PowerPC 750GX processor is also known as the G3.

Reply Score: 0

IBM is the right group for the job
by Anonymous on Fri 21st Oct 2005 12:28 UTC

They have been doing this for years with their mainframes, to the point that the machine swaps out the bad hardware and calls for service with no user or operator intervention, and without a single bad bit hitting disk. This is how they get years between reboots on what is now called the zSeries (370/390 for you old-schoolers).

I think it will be great if we can get this kind of failover all the way down into SMB-sized machines, but I also agree that it needs to cover not only the processor but other components too. Add memory to the list, since we already have RAID, and we are getting close.

Reply Score: 1

The more the better?
by Anonymous on Fri 21st Oct 2005 15:15 UTC

So can this have more than 2 CPUs? And could they interface to different, identical circuits for I/O?

Reply Score: 0