
BUG: scheduling while atomic: EtherCAT_Master

Anonymous
2019-03-11
2019-03-29
  • Anonymous - 2019-03-11

    Originally created by: michele_sponchiado

    Hi, we are currently using CODESYS Control V3 version 3.5.8.10 (Mar 3 2016) on a custom board, and we are experiencing random kernel oops BUG messages ("BUG: scheduling while atomic"), typically once every 8-10 hours, while the EtherCAT_Master module is executing. See the following kernel log, taken with kernel debug messages enabled:

    192.168.0.196 login: BUG: scheduling while atomic: EtherCAT_Master/0x00000001/625, CPU#0
    [<c003bb7c>] (dump_stack+0x0/0x14) from [<c02fd3cc>] (__schedule+0x558/0x7cc)
    [<c02fce74>] (__schedule+0x0/0x7cc) from [<c02fd7a0>] (schedule+0x48/0x108)
    [<c02fd758>] (schedule+0x0/0x108) from [<c02fedd8>] (rt_spin_lock_slowlock+0xf8/0x1f4)
     r5 = A0000013  r4 = C18FA000
    [<c02fece0>] (rt_spin_lock_slowlock+0x0/0x1f4) from [<c02ff17c>] (rt_spin_lock+0x40/0x44)
    [<c02ff13c>] (rt_spin_lock+0x0/0x44) from [<c007134c>] (futex_lock_pi+0x1a4/0x978)
    [<c00711a8>] (futex_lock_pi+0x0/0x978) from [<c0071fdc>] (do_futex+0x4bc/0xf80)
    [<c0071b20>] (do_futex+0x0/0xf80) from [<c0072b08>] (sys_futex+0x68/0xfc)
    [<c0072aa0>] (sys_futex+0x0/0xfc) from [<c0037018>] (__sys_trace_return+0x0/0x28)
     r8 = C0037048  r7 = 000000F0  r6 = 406B1490  r5 = 0031E5B8
     r4 = 00000000
    
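    For reference, the message itself comes from schedule(): the kernel prints it when a task enters the scheduler while the CPU is still in atomic context (preempt_count != 0), and the fields after the task name are the preempt_count value (0x00000001 here, i.e. one outstanding preempt_disable level) and the PID. A paraphrased sketch of that check, assuming it matches what the 2.6.18 kernel does (not the literal kernel source):

        /* Paraphrased sketch of the "scheduling while atomic" check in
         * kernel/sched.c; not the literal 2.6.18 source. */
        if (unlikely(in_atomic() && !current->exit_state)) {
                printk(KERN_ERR "BUG: scheduling while atomic: %s/0x%08x/%d\n",
                       current->comm, preempt_count(), current->pid);
                dump_stack();
        }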

    In our application the EtherCAT period is set to 4 ms. The EtherCAT stack runs stably (checked over a period of a week), but when the BUG occurs while the axes are moving, their movement becomes rough for a few milliseconds and then returns to smooth.

    We set a trigger on the axis drives (we are using Sanyo RS3) to check whether an EtherCAT error is generated; the trigger is set on the "" driver-internal variable, but no errors are generated on the EtherCAT stack, so I can state that the EtherCAT communication runs fine: if even a single frame is lost, the driver immediately generates the trigger.
    We triple-checked the code in our EtherCAT_Master PLC task but found nothing suspicious; the code is quite simple, it just copies the currently calculated axis positions (these are generated by a different processor and are always available to the ARM) into the EtherCAT variables that hold the positions. We also set the positions to a fixed value and the oops still appears, so the BUG seems to be independent from the code written in our module.

    The Linux kernel version is 2.6.18 and the CPU is an ARM9 @ 444 MHz.
    We monitored the jitter and found that it peaks at about 8 ms when the BUG is generated, while normally the maximum jitter stays well below 1 ms.
    Looking into the reasons for the BUG, it seems the problem is that the EtherCAT_Master task goes to sleep while holding a spinlock or something similar.
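    As an illustration of that failure mode, here is a minimal hypothetical kernel-code sketch (not code from this system): any call into a sleeping function while preemption is disabled is enough to produce exactly this BUG message.

        /* Hypothetical illustration only, not code from this system:
         * msleep() ends up in schedule(), and calling it while the CPU
         * is in atomic context (preempt_count > 0) triggers
         * "BUG: scheduling while atomic". */
        #include <linux/preempt.h>
        #include <linux/delay.h>

        static void bad_wait_example(void)
        {
                preempt_disable();      /* enter atomic context          */
                msleep(10);             /* sleeps -> schedule() -> BUG   */
                preempt_enable();
        }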

    Can you help us with this issue?

    If you need some more information, please do not hesitate to contact us!

    BR

    Michele Sponchiado

     
  • Anonymous - 2019-03-12

    Originally created by: michele_sponchiado

    I just wanted to add that after a week of testing we have the following situation:

    The maximum jitter value has risen to 12 milliseconds; the EtherCAT communication is still OK, the inverters are OK, and there is no trigger from the EtherCAT error frame rate.

    Do you have any news about this issue?

    BR
    Michele Sponchiado

     
  • Anonymous - 2019-03-14

    Originally created by: michele_sponchiado

    Let me add some more information:

    Could it be that the EtherCAT_Master task sleeps while waiting for a resource used by the other PLC tasks?
    Apparently the EtherCAT_Master PLC code does not read from or write to shared resources, so I wouldn't expect it to sleep...
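    One hint from the backtrace above: the futex_lock_pi frame is the kernel side of a contended userspace mutex with priority inheritance, which is what blocking on a resource shared with other tasks would look like. A minimal userspace sketch of such a mutex (illustrative only, not the CODESYS runtime's actual code; all names are made up):

        /* Illustrative only: a contended PTHREAD_PRIO_INHERIT mutex
         * reaches the kernel through sys_futex(FUTEX_LOCK_PI), the path
         * seen in the backtrace.  Not the CODESYS runtime's code. */
        #include <pthread.h>
        #include <unistd.h>

        static pthread_mutex_t shared_resource_lock;    /* made-up name */

        static void *plc_task(void *arg)                /* made-up name */
        {
                pthread_mutex_lock(&shared_resource_lock);   /* contended -> FUTEX_LOCK_PI */
                usleep(1000);                                /* pretend to use the resource */
                pthread_mutex_unlock(&shared_resource_lock);
                return NULL;
        }

        int main(void)
        {
                pthread_mutexattr_t attr;
                pthread_t t1, t2;

                pthread_mutexattr_init(&attr);
                pthread_mutexattr_setprotocol(&attr, PTHREAD_PRIO_INHERIT);
                pthread_mutex_init(&shared_resource_lock, &attr);

                pthread_create(&t1, NULL, plc_task, NULL);
                pthread_create(&t2, NULL, plc_task, NULL);
                pthread_join(t1, NULL);
                pthread_join(t2, NULL);
                return 0;
        }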

     
  • Anonymous - 2019-03-29

    Originally created by: michele_sponchiado

    Hello, dear CODESYS forum!
    The problem disappeared after we removed a patch in the Linux kernel SD card driver that forced an msleep() while waiting for a long SD card status-query reply.
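    For anyone who hits the same thing, here is a hypothetical sketch of the kind of patch described, i.e. a status-polling helper that forces an msleep() while waiting for the card; the function and type names are invented for illustration, this is not the actual patch:

        /* Hypothetical reconstruction for illustration only; not the
         * actual SD card driver patch.  sd_dev and card_ready() are
         * made-up names.  Forcing msleep() between status polls puts
         * the caller to sleep, and if that path is ever reached from an
         * atomic section the kernel reports "scheduling while atomic". */
        static int wait_until_card_ready(struct sd_dev *dev)
        {
                while (!card_ready(dev))        /* poll card status     */
                        msleep(10);             /* the forced sleep     */
                return 0;
        }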
    Best regards and thanks for your help!
    Michele

     
