Skip to main content
X

Send us a Topic or Tip

Have a suggestion for the blog? Perhaps a topic you'd like us to write about? If so, we'd love to hear from you! Fancy yourself a writer and have a tech tip, handy computer trick, or "how to" to share? Let us know what you'd like to contribute!

Thanks for reaching out!

Investigating & Preventing Kernel Panics in the 2019 Mac Pro

Mac Pro Rack 2019 - top

Nobody likes to experience a kernel panic on their Mac. This is when your Mac restarts itself while you are working. After it finishes the restart process, you see a dialog telling you that your Mac shut down because of a problem. Worse yet, you may have lost all of your recent work as files are not saved during this type of restart.

We Start Hearing From Our Customers

A kernel panic is even worse when it is happening on your brand new 2019 Mac Pro, a computer that may have cost $10,000 or more. So when we started to hear that a few customers were experiencing kernel panics on their 2019 Mac Pros, we wanted to quickly figure out what was happening. These panics only occurred in a very small percentage of customers who had installed our OWC Accelsior 4M2 cards. The log from macOS said that the kernel panic was caused by an error on the PCIe bus, the wires on the circuit board, which connects cards to the Intel processor. This kernel panic was most often occurring to customers with 2 or more OWC Accelsior 4M2 cards installed.

We Investigate The Problem

Through many hours of dedicated work, our teams figured out what hardware configurations are required for a kernel panic to occur. We worked literally around the clock investigating this problem: when our software development team in California finish testing for the day, our hardware design team in Taipei, on the other side of the world, would pick up where the California team left off.

We determined that the kernel panic occurs when the 2019 Mac Pro sleeps. More importantly, we figured out that the problem only occurs when the OWC Accelsior 4M2 is installed in slots 4 or 5 and only if those slots are configured to use pool B of the PCIe lanes (see below). If OWC Accelsior 4M2 is in any other slot, or if it is configured to use pool A, no kernel panics occur.

The teams also found other cards, which, when installed, result in a kernel panic. One example is the Highpoint SSD7101A.

Getting Help From Our Chip Supplier

We then contacted our PCIe chip supplier in Taiwan to get help determining why macOS thought there was an error on the PCIe bus. Their engineers analyzed the signals on the PCIe bus and told us that macOS was mistakenly indicating that there is an error when, in fact, all the signals were correct.

Apple Starts Investigating

We have since reported this problem to Apple and are working with them to develop a solution. Like the problem we discovered recently — where a Mac hangs while transferring large files — I am confident that they will develop a quick and reliable fix for this problem.

How to Prevent the Kernel Panic From Occurring

Since the kernel panic occurs only when the OWC Accelsior 4M2 is in slots 4 or 5, and the slot is using pool B of the PCIe lanes, the solution for your 2019 Mac Pro is easy. You can either move the card to a different slot or change the slot to use pool A of the PCIe lanes.

Move the OWC Accelsior 4M2 Card to a Slot Other Than Slot 4 or 5

You can move your OWC Accelsior 4M2 card to slots 1, 2, 3, 6, or 7, and this will prevent the kernel panic from occurring. Slot 8 is always occupied by the Apple Thunderbolt card.

Inside Mac Pro 2019 showing PCIe slots
Note: Slot 2 is normally covered by a graphics card. If you install your OWC Accelsior 4M2 in slot 2, you will only be able to use one graphics card, installed in slot 3.

Change the Slot With the OWC Accelsior 4M2 Card to Use PCIe Lanes in Pool A

To change the slot which contains the OWC Accelsior 4M2 to use the lanes in pool A, follow these simple steps:

1) Select About this Mac from the  menu.

2) Click on the PCI Cards tab in the About This Mac window.

3) Click on the Expansion Slot Utility… button.

4) Click on the Automatic Bandwidth Configuration checkbox to deselect it.

5) Click on the button in the A column next to the slot containing your OWC Accelsior 4M2. (In this example, it is slot 4.)

6) When you are done, the Expansion Slot Utility window should look like this.

7) Click the close box in the top left of the Expansion Slot Utility window. You will be prompted to save your changes, and then your 2019 Mac Pro will restart.

OWC Tim
the authorOWC Tim
Vice President of Software Engineering, Mac
Tim Standing has been writing drivers and storage utilities for Mac OS since 1986. He is the creator of SoftRAID for macOS and is currently VP of Software Development - Mac at Other World Computing, Inc. He has patented a write acceleration technique that enables the write speed RAID volumes to be as fast as the read speed. Tim's team is responsible for SoftRAID, OWC Dock Ejector, OWC Drive Guide, and all Mac drivers and utilities that make OWC products exceptional. When he's not writing code, he's creating delicious pizzas in his wood-burning pizza oven.
Be Sociable, Share This Post!

Leave a Reply

22 Comments

  • This post is still relevant. I have a Mac Pro 2019 that has the same issues. My expansion slots 1-2, 3-4, 5 & 8 are filled and came from Apple configured that way. So Tim, is this article saying I can maybe add a OWC 4M2 to slot 6 or 7 and this ‘might’ alleviate my kernel issue? Let me know, Thanks

  • I was planning to buy a OWC Accelsior 4M2. Isn’t there any new development in this case?

    Keith posed a question on May 23rd. There is still no answer after 2 months.

    Does Apple say anything?

    I would be happy to get a reaction. Thanks in advance.

    • If you use the configurations in this blog post, you should have no problems. I am not aware of any users experiencing problems in 2019 Mac Pros after changing slots or the PCI Banks they are using to the ones we suggest.

      We have several users with 3 Accelsior 4M2s in one Mac Pro who are very happy with the solution and have not experienced any kernel panics since changing their configuration..

      We have not heard anything back from Apple regarding this problem.

  • I’m running a Mac pro 2019 with 3 Accelsior 4M2 cards in slots 3, 4, 5.
    Slot 1-2 AMD 5700x, slot 6 avid HD Native, slot 7 Sonnet USB and slot 8 Apple I/O. OS Catalina 10.15.3

    Despite running the NVMe cards in the B pool I still experience Sleep/Wake kernel panics. There is often another, similar error that is logged as, ‘PowerOff timed out in phase “Notifying power plane drivers”.

    The errors always occur at boot.

    Sometimes, after shutting down, not being put in sleep mode, the MP will reboot itself as if it’s actually been sleeping, not having been shutdown, sort of an involuntary resurrection.

    This behavior is inconsistent, creating the impression that some juggling of parameters my have solved the problem, but no.

    At one point, allocating the NVMe cards to the B pool seemed to be a cure but that only worked when shutting down while leaving the MP connected to power. Once the master power was shut off then turned on the next day, the boot error reappeared.

    Is there any news of progress on Apples side regarding a fix? I would move to the latest version of Catalina but it doesn’t yet support my principle working app. If that provided a fix I assume word would have been out by now. This is a depressing to way to start off with such an ostensibly fine machine.

    • We have confirmed that this kernel panic still occurs after updating to macOS 10.15.5.

      Your Accelsior 4M2 cards in slots 4 and 5 should be in slot A, not in slot B as you have them currently configured. This has worked for the sleep kernel panic and may fix your problem.

      I will ask our tester at our headquarters in Woodstock to try and repeat your problems with booting as we only have 2 Accelsiors 4M2 cards here in California.

      I don’t have any news from Apple, nor do I expect any until after they ship a fix for this.

      • You’re right about the the reversal of the A/B slot position. That was a dyslectic initial response to the instructions on my part. The Accelsiors
        in slots 4 and 5 are now assigned to A. The third Accelsior remains in slot three. The frequency of the sleep/wake kernel panics has reduced substantially. If the MP remains connected to power while shutdown, things are pretty stable but not perfect.

        I’m curious about the slot 3 Accelsior. Would moving that third card to a B assignable slot improve things? I haven’t tried that yet as you didn’t suggest it and because it would upset the nice resource allocation shown for A/B now, 100% A, 94% B. I’m assuming moving all three Accelsiors to the A pool will result in a 150% A allocation. Maybe that’s not important. Let me know.

        The other odd item, not previously mentioned, is that the MP, after having been shut down (power indicator off) for several seconds, will re-boot itself. Sleep is off as is anything else that seems related but I may be overlooking something. This is annoying and not confidence inspiring.

        Thanks,

        -rb

        • My understanding is that slots 1 and 3 are directly connected to the CPU so, while they are represented in the PCI Configuration Utility window, I don’t believe you can assign cards in either slot to pool A or B.

          • Understood. My question was not clear. It should have rather been, should all three Accelsior’s be in A assignable slots? My thinking was that perhaps A was the only sector not affected by the SSD/kernal panic issue. Experience over the last few days indicates that slot 3 is okay. Everything seems to be hunky dory with the current disposition of Accelsiors in slots 3, 4A and 5A. However, this is true only if the MP
            is constantly connected to power. I’ve had to bypass the studio master AC switch and connect it directly to a wall outlet. If power to the MP is removed, on its next boot it will show the sleep/wake error. It will also, on a random basis, re-boot itself after I’ve done a full shutdown.

            I can hear a relay click inside the MP if it’s off and I disconnect and reconnect the power. Seems there’s always something cooking inside while shut down but still connected to power. It doesn’t like being disconnected from AC.

            • I’m glad to hear your configuration using slots 3, 4A and 5A is working successfully. I have not heard of the problems you are encountering with the power being disconnected. I will have our tester start investigating it on Monday.

              • I’ve been running for three weeks now with no kernel panics. This is with the none of the 3 Accelsior cards assigned to the ‘B’ pool. EXCEPT for this AM.

                Last night, feeling motivated to test if my earlier experience with interruption of AC to the Mac Pro was correct or a fluke, I disconnected its power after shutting the studio down for the night.

                Sure enough, at startup this AM I was greeted by the familiar kernel panic dialog and dutifully sent the report to Apple. They must have a fair number logged to my machine by now as I cycled the machine repeatedly when I first experienced the issue.

                I’m very curious as to what, if anything, your tester comes up with. It’s either generic or unique to my particular MP or the configuration. I suspect one would have to establish a consistent, unvarying pattern of panics that is eliminated by assigning the SSD’s to the ‘A’ pool. then pull the plug after shutdown.

              • Our tester, Mark, spent 8 hours trying to reproduce the kernel panic you are seeing after shutting off AC power to your 2019 Mac Pro. He has tried with both 3 Accelsior 4M2 cards and with 3 Accelsiors and 1 Sonnet M.2 4X4 card. Mark tried all the slot positions he could think of and was never able to reproduce this kernel panic.

  • For what it’s worth, I have this kernel panic occur about 50 percent of the time when my Mac Mini awakes from sleep. I have a 3TB Mercury Elite Pro attached for nightly backups of the Mini. The kernel attacks began happening about the same time I installed Mojave and the Mercury Elite Pro. I’ve tried in vain to find a solution. I always send the crash report log to Apple but am not qualified to read or understand it myself, so don’t know where/why the kernel panic occurs. Could it be related in any way to the issue discussed in this article?

    • No, this issue only happens on the 2019 Mac Pro.

      I know how frustrating it can be to track down a kernel panic. You might want to contact the OWC support center and get some help from them.

    • I realized that I am experiencing a kernel panic on my Mac mini as well, all of the time it goes to sleep (2018 Mac mini). I have 2 startup volumes on this Mac, one for Mojave and the other for Catalina. The kernel panic only occurs when I am starting up from Mojave, so you might want to try creating a Catalina volume and trying that for a few days.

      When I dig into the kernel panic, I see that the panic is caused by Apple’s T2 chip (the operating system which is panicking is running on an ARM chip).

      I also have no external storage devices attached to the Mac mini, so I don’t think your problems are related to the Mercury Elite Pro.

  • I was wondering when that PLX switch would start causing trouble for MP owners. Sure didn’t take very long. Of course, if they had offered a dual CPU version then available lanes wouldn’t have been an issue. IMO Apple is trying to do too much with too little.

    Your engineers noted that the signals at the PCIe pins was correct. Did they also test the signal for deviations after it had crossed the PLX? I’d lay a ten spot that the hyper fast I/O nature of the equipment in question here was sensitive enough to the (mis)timing caused by the PLX switching and it was just enough to KP the machine.

    PC motherboard manufacturers tried using PLX switches to give users more lanes on their high end mainstream boards a few years ago before Intel had brought out the X series HEDT CPUs and motherboards. It didn’t work very well. There were timing issues galore. And that was before we had NVMe and Optane in the mix.

    I’d like to say this is the last we’ll see of the problem seen here, but I doubt it. Unless Apple reduces the I/O rate below the threshold that triggers the mismatched timings, it’s likely other high speed I/O cards will also suffer when using Pool B in certain slots. :(

  • Apple has really slipped up lately. They seem to be spending a whole lot of time and money on Memoji and emojis when they should be spending it on actually making their expensive systems actually work. I’ve now been bitten by both this bug and the large file copy bug — and Apple said both were most certainly not their fault.

    HAH!

    • Yes, it’s unfortunate that both bugs affect the movie production workflow. I am hopeful that in a few months, both issues will be behind us.

      Tim

  • That’s a really good debugging work!

    I congratulate your software group and your hardware design team for all the work you put into this. Far too many vendors would’ve just thrown it over the fence at Apple and just said “something is wrong.”

    Sleep seems to be a problematic area; since I moved to macOS Mojave, I sometimes have issues waking up my MacPro 5,1 as the OS sometimes has problems waking up the OS drive, a PCIe Drive Kit with OWC Mercury Electra 6G SSD.

    Thankfully that hang is rare enough that it’s only a minor nuisance.

      • This is fantastic troubleshooting work. I’ve suffered through this issue and at least 20 hours of troubleshooting through Applecare. They even replaced my Mac’s logic board, but that didn’t fix the sleep issue with two 4x4s in slots 4 and 5.

        I wish I could just move both to Pool A, but my boards are both x16 and there doesn’t seem to be enough bandwidth to get the maximum speed from my drives.

        I hope Apple follows your cue and issues a software update. I also hope Apple rewards you for catching this, because it has undoubtedly cost them a lot of money in needless repairs.

      • Terrific debugging! Thanks.
        I have two new MacPros each with two Accelsior 4M2 Cards.
        For one reason or another, I have those boxes set to never sleep.

        I have not seen this problem but I have been on lock-down most of the time I’ve had the boxes deployed.

        Am I safe?