A soft lockup is defined as a bug that causes the kernel to loop in kernel mode for more than 20 seconds without giving other tasks a chance to run. The watchdog daemon will send an nonmaskable interrupt nmi to all cpus in the system who, in turn, print the stack traces of their currently running tasks. We have also seen these messages on an idle system with a large core count. The linux guest on the osx host is stalling a few times a day for a few months. I have seen this on hp dl360 g8 servers which were using multiple fiber channel controllers and multipath to attach several hundred san. The cap is expressed in percentage of one physical cpu. Hello, i have been receiving cpu stuck messages from the kernel, after which the machine appears to lock up. Guest has divider10 kernel parameter to keep the host cpu load low. However, if i try to install the driver using rpm and then reboot the system, during startup the os gets stuck spitting out the following soft lockup message for all the cpu cores, except for one core that is in soft lockup in one of the threads created by my driver. The problem occurs at a random time, but very often. If you dont have iotop or not able to install it due to company policy you can try. By defa ult, this value is set to 10 seconds and the maximum soft lockup timeout is now increased from 60 seconds to 300 seconds for systems that have a large number of cpus. So far, i havent found any clue as to what to do or try rather, the clues ive found and followed havent stopped this from happening. Guest was set up with 1 cpu, but since my host has a quad intel processor, i decided to try increasing it to 4 cpus on the guest.
This can sometimes be too low if the system is very busy with io. I can correct the problem by entering recovery mode and. Often i have to poweroff the server, but last time i found some information in syslog. Thanks in advance and sorry if i missed something along the way. I heard something about disabling acpi but i dont know how that will affect the server.
After running fine for a couple of hours the server suddenly timed out on ssh, bind etc. I have 16gb ram on my desktop and i am using vmware player to run the sap hana express edition. A soft lockup is the symptom of a task or kernel thread using and not releasing a cpu for a period of time. In this case it may be caused by a bug in the kernel when. Ive seen a few bug reports and questions on stackexchange and elsewhere regarding a nagging bug. The cap optionally fixes the maximum amount of cpu a domain will be able to consume, even if the host system has idle cpu cycles. I was trying to install again, with increased virtual machine resources 8gb. The technical reason behind a soft lock involves cpu interrupts and nmiwatchdog. Download the readaheadcollector program and build it 2. Any message in varlogmessages referencing soft lockups like these.
I have installed sap hana studio and added the sap hana express edition as system. The server became very slow, and couldnt be logined with ssh, after reboot, we found varlogmessage has lots of kernel. The messages log with the lockup is attached below. I am having the exact same problem with kernel version 2. After discovering new san luns, the system locks up. A soft lockup occurs when a cpu reports a memory starvation while it is unable to access a memory node that is being accessed by other cpus. I have a linux machine that is giving a lot of bug. The centosrhel kernel has a default softlockup threshold of 10 seconds.
135 188 1275 786 244 451 84 1003 97 148 827 672 437 1378 339 211 1155 300 1435 257 1478 1581 326 24 1523 686 187 285 423 377 887 840 913 1323 1207 1454 1340 375 1301 464