820-3209 - Random power downs and 1/5 chance of getting "3 beeps"

smiba

New member
Hi everyone,

No idea about the backstory here.
Doesn't look like water damage as all the indicators are white and I see no marks at all on the motherboard. The motherboard has most likely never been removed from the case, but the cpu cooler has been before (Noticeable because the little rubber on the fan was not sticking on anymore). There was not a lot of paste on there so I replaced this for good measure but it did not solve the issue. I don't think this could've caused any issue because even though there was not a lot of paste, it did have enough paste not to destroy the cpu.

The battery in this system is not original, but from a different company called LMP. The battery works fine

The system boots into the OS but after a while (sometimes 10 seconds, sometimes a couple of minutes) it just powers down all of the sudden. No warning, it just goes back to S5. The charging light stays consistent (no SMC reboot). In some rare cases the system does not respond to the power button anymore after this happened and the charger needs to be reconnected to get it to respond again. This is with the battery holding enough change and keeping the S5 and G3H power lines up so the system does not reset or anything.

On battery the system pretty much always works but powers down randomly, but with the charger plugged in there is a random 20% chance of getting 3 beeps before the system powers down again.

Where to look in this case? To me it sounds like the 3 beeps are just happening because something is going wrong with the CPU and that happens to be while the memory is being initialized. PPDDR_S3 is stable at 1.34V

EDIT: The hotter the system is the bigger the chance of it passing POST but powering off right after or sometimes even while the "dong" sound is still being played. Not sure what this might mean though. Cracked solder joint? After the system crashed off it can not be powered on again (the SMC does not respond to SMC_ONOFF_L)
EDIT2: Seems to be the PCH thats overheating
 
Last edited:

SMMRepair

Member
This one is going to suck...probably a bad RAM chip. Very small chance of crap under one of the chips, but with no liquid (presumably), then it's going to be a lot more difficult. Such an old board, might want to consider if it's worth taking this repair on.
 

smiba

New member
I'll see if a nice and good reflow helps but I'm definitely not replacing or taking off the chips, I don't charge enough for that haha

Just to be sure, the system sometimes does actually POST and boot. Would this still be a ram issue? Woudn't the system just freeze and/or kernel panic if it was?
 

smiba

New member
PCH is overheating

Keeping it cooled by rubbing soaked cotton swaps over it and making sure there is isopropanol laying on top of it (Meaning it is less then 82C on the surface) keeps the system booted for a little while. But once I stop applying fresh isopropanol it takes just a couple of seconds before the system shuts down.

Power cycling the system while the PCH is very hot (right after all the isopropanol has evaporated) gives the 3 beep error, letting it cool down for a couple of seconds makes it POST again.

So I found the issue, now the solution :)

EDIT: Did some testing because I wanted to know if it was just me wanting to believe its the issue or it actually being.
Seconds are the time it takes before the system powers down unexpectedly, cooldown is the time I wait between testing after a unexpected powerdown

61s, air - 2 min. cooldown period
98s, air - 15 min. cooldown period
119s, isopropanol, 2 min. cooldown period
162s, big heat transfer pad, 2 min. cooldown period

EDIT2: Might it be worthwhile to give the big heat transfer pad method another try while running EFI ASD to check the sensors? EFI ASD because I'm not sure if I can get it to OS ASD in time. The small tests should take about 60 seconds so the 2 to 3 minutes I can get the system to run should be enough

EDIT3: Too bad, EFI ASD gives no sensor errors Click image for larger version  Name:	IMG_20170829_232431.jpg Views:	1 Size:	811.9 KB ID:	24805

Probably going to reflow it tomorrow if no one else has any other idea's. If that won't fix it its probably semi-shorted internally and just not worth my time
 
Last edited:

dukefawks

Administrator
Reflow won't fix anything. If you are going to do anything then replace the PCH and fix the ME region. These boards are garbage.
 

smiba

New member
I assume the PCH can not be replaced by hand and requires a BGA machine to have a reasonable chance of success?
 

dukefawks

Administrator
I can do it without BGA machine, but really this board isn't worth the effort and risk I think. I may have a stack of these board somewhere, send me a DM if interested.
 

smiba

New member
If I'm going to be doing this in the future I feel like I should learn it on something thats not owned by the customer first.

Decided to set the repair to unsuccessful and ready for pick up. The customer knows the device does work for a couple of minutes so if I don't succeed with soldering the PCH the system won't boot and I'm not risking a "you broke it" customer.

Thanks for the offer though!
 
Top