help

help. HELP.

Okay, so, I am at my wits’ end. I need some computer hardware help. Details below:

Okay, the sequence of my troubleshooting so far is roughly this:

  1. I start getting random freezes on my PCs, with increasing frequency after a day goes by. Nothing in the logs or any obvious problem. Just a hard freeze. I figure this is probably a memory problem, so I run memtest. Sure enough, I get tons of errors.
  2. I buy a new DIMM from Compusa and plug it in. I run memtest again and I’m still getting errors. Okay, so must be a bad motherboard.
  3. I order a new mobo/CPU combo. Biostar mobo w/ AMD Athlon XP 2400
  4. I swap the new PC133 512M DIMM I bought for the old motherboard for a new DDR 512M DIMM for the new motherboard.
  5. I put it all together, assuming all my problems are over, so I didn’t think to run memtest again. I boot up and .. oops. kernel panic trying to mount the root filesystem with SCSI errors like below:
    > scsi1: ERROR on channel 0, id 0, lun 0, CDB: 0x28 00 00 00 10 9f 00 00 08 00 Info fld=0x109f, Current sd08:01: sns = f0 3 ASC=11 ASCQ= 0
    > Raw sense data:0xf0 0x00 0x03 0x00 0x00 0x10 0x9f 0x0a 0x10 0x1f 0x00 0x02 0x11 0x00 0x00 0x80 0x00 0xa0 I/O Error: dev 08:01, sector 4192

  6. I assume at this point that I screwed something up while rebuilding the PC around the new motherboard. I boot up Knoppix on the PC (runs on a ramdisk) and I get the same errors trying to mount either drive. I swap the drives around on the cable. I try them one at a time. Nothing helps. Same errors.
  7. I buy a new SCSI cable (after scouring the city for a u2w SCSI cable), thinking that the cable is the most obvious faulty part. No change – same errors.
  8. Starting to get a little frustrated at this point. So, I figure maybe it’s some incompatibility between my SCSI card (Adaptec AHA-2940U2W) and my motherboard. So I rebuild everything around my old motherboard outside the case and boot it up. Same SCSI errors. So, scratch that theory. Something is definitely broken that wasn’t before.
  9. So I put everything back together around the new motherboard and boot up Knoppix again. Same SCSI error – hey, wait a minute. The Adaptec card isn’t even in the PC. This error was something to do with the CDROM (which in Knoppix is utilized via SCSI emulation). WTF? So, I am starting to think there’s some problem with the power supply – the only common element. So, I go get a new power supply, plug it in .. same problems. Okay, scratch that theory.
  10. So I’ve basically given up, and I’m running Knoppix to browse the web and check my e-mail. I notice performance isn’t great – firefox and mozilla keep segfaulting. ssh even segfaults once in a while, and performance eventually degrades until it locks up. Keep in mind this is on the brand new motherboard/CPU/RAM, with no hard drive or controller even hooked up.
  11. So, since I’m seeing this terrible performance,I run memtest again on all this new hardware. Sure as shit, I am still seeing tons of errors.
  12. I realized there’s one more common element between the two – my video card. Is is possible an AGP card could cause all these problems? Well, only one way to find out. I swapped the card with an older AGP card I had laying around. Rebooted, ran memtest. Same errors.

So, that’s where I stand. I am at a complete loss. I’ve never been so frustrated in my life. The SCSI and RAM problems could be unrelated or not, I don’t know at this point. I am going to try buying a new SCSI card tomorrow.

My only remaining theory is that maybe my old power supply was bad – bad enough to fry good hardware, resulting in the exact same problems with my new hardware that I had with the old hardware. The downside is that requires returning and replacing all the new hardware I’ve gotten and trying it with the new power supply, which is a time-consuming process, if mwave will even take my mobo/CPU back.

Sigh.

Any ideas?

UPDATE: Okay, I replaced the SCSI controller. No dice. Tested the RAM again with the new power supply in. Still testing bad.

Remaining theories:

  • Bad SCSI card
  • Both SCSI drives are bad AND new motherboard or RAM is bad by chance
  • Bad power supply caused both problems with the old motherboard/RAM/CPU and is bad enough to fry the new motherboard/RAM/CPU in .. exactly the same way (?!)

Any theories I am forgetting?


Comments

Wow, that is quite a puzzler. Let’s see… hmm.

Try resetting your CMOS and then choosing safe values – disable performance options. Try underclocking your CPU/FSB and/or PCI. Try increasing the voltage to CPU and/or RAM, if the BIOS allows.

Could something very odd be happening with your mains power? Undervoltage? How is the ground? If the humidity is very low, you could have static (ESD) problems and could have zapped new component(s) in the course of the experiments.

BenjaminMarch 22, 2005 at 09:26 · reply

I am have the same problem: I was just under way with setting up knoppix 3.7 on a lap top made by good quality computer when all of a suden my installer hung. I waited sometime, and decided it was time for a reboot. when the livecd cranked up I gave it the command Knoppix26. expecting it to act as it always had. NOW I get Raw Sense data a and scsi errors.0x0x0x0x0 or somthing of the sorts. you’ve probly seen how nasty it gets. I am now searching for answers.

BenjaminMarch 22, 2005 at 09:30 · reply

Trust me I tried alot of stuff. All my other distros like BSD 5.3 has no problem at all. I am hooking up a BSD Webserver. and currently wrestling with understanding BIND8 AND 9. I would like to have Knoppix ISO installed on my hard disk as a ba sorry– as a back up Web server– knoppix is worth all the work.

BenjaminMarch 22, 2005 at 10:17 · reply

Trust me I tried alot of stuff. All my other distros like BSD 5.3 has no problem at all. I am hooking up a BSD Webserver. and currently wrestling with understanding BIND8 AND 9. I would like to have Knoppix ISO installed on my hard disk as a ba sorry– as a back up Web server– knoppix is worth all the work.

BenjaminMarch 22, 2005 at 10:24 · reply

ok I fixed it maybe it will work for you. trus me i feel your emotions in your words. Download the debian woody distro. Please make shure it is disk number 6iso. I just partitioned my hard drive and with it, the knoppix problem! I rebooted the from debian woody disk 6 install. loaded my knoppix in an presto.

I think if you have built in ram on your mother board find anything that will clear it. “asapINFOinto | motherboardram0” i am relieved

Thanks! Your comment has been submitted and will appear shortly.


Leave a comment