Intel i40e driver call centos 4.14.44 kernel crash!

Mulyadi Santosa mulyadi.santosa at gmail.com
Tue Jul 13 05:31:08 EDT 2021


On Tue, Jul 13, 2021, 15:55 Oracle <oraclelinux at foxmail.com> wrote:

>  With centos kernel 4.14.44!
> Jul 12 00:49:25 cdm-storage kernel: ---[ end trace 81d5684fa78a43bb ]---
> Jul 12 00:49:25 cdm-storage kernel: i40e 0000:1a:00.1 eth1: tx_timeout:
> VSI_seid: 397, Q 34, NTC: 0x5e, HWB: 0x71, NTU: 0x71, TAIL: 0x71, INT: 0x0
> Jul 12 00:49:25 cdm-storage kernel: i40e 0000:1a:00.1 eth1: tx_timeout
> recovery level 1, hung_queue 34
> Jul 12 00:49:25 cdm-storage kernel: bond0: link status definitely down for
> interface eth1, disabling it
> Jul 12 00:49:25 cdm-storage kernel: i40e 0000:1a:00.1: DCBX offload is not
> supported or is disabled for this PF.
> Jul 12 00:49:25 cdm-storage kernel: i40iw_deinit_device: state = 11
> Jul 12 00:49:25 cdm-storage kernel: i40iw_manage_apbvt: CQP-OP Manage
> APBVT entry fail
> Jul 12 00:49:25 cdm-storage kernel: i40iw_manage_apbvt: CQP-OP Manage
> APBVT entry fail
> Jul 12 00:49:25 cdm-storage kernel: workqueue: WQ_MEM_RECLAIM
> i40e:i40e_service_task [i40e] is flushing !WQ_MEM_RECLAIM infiniband:
>     (null)
> Jul 12 00:49:25 cdm-storage kernel: ------------[ cut here ]------------
> Jul 12 00:49:25 cdm-storage kernel: WARNING: CPU: 8 PID: 411 at
> kernel/workqueue.c:2440 check_flush_dependency+0xb1/0x100
> Jul 12 00:49:25 cdm-storage kernel: Modules linked in: iptable_raw
> xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4
> iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4
> xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 tun bridge stp llc
> ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter devlink
> qla2x00tgt(OE) scst_vdisk(OE) isert_scst(OE) iscsi_scst(OE) scst(OE) dlm
> libcrc32c rpcrdma bonding ib_iser libiscsi scsi_transport_iscsi ib_srp
> scsi_transport_srp ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm
> iw_cm intel_rapl zfs(POE) skx_edac zunicode(POE) x86_pkg_temp_thermal
> zlua(POE) vfat fat coretemp crct10dif_pclmul crc32_pclmul zcommon(POE)
> znvpair(POE) zavl(POE) ghash_clmulni_intel pcbc i40iw icp(POE) aesni_intel
> spl(OE) ses enclosure crypto_simd glue_helper scsi_transport_sas
> Jul 12 00:49:25 cdm-storage kernel: iTCO_wdt qla2xxx_scst(OE) mei_me
> cryptd ib_core iTCO_vendor_support shannon(OE) sg scsi_transport_fc joydev
> mei pcspkr ioatdma shpchp i2c_i801 lpc_ich wmi ipmi_si ipmi_devintf
> ipmi_msghandler acpi_power_meter nfsd auth_rpcgss nfs_acl lockd sch_fq
> binfmt_misc grace sunrpc ip_tables sd_mod ast i2c_algo_bit drm_kms_helper
> syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm ixgbe i40e ahci mdio
> libahci crc32c_intel megaraid_sas libata ptp dca i2c_core pps_core
> dm_mirror dm_region_hash dm_log dm_mod
> Jul 12 00:49:25 cdm-storage kernel: i40e 0000:1a:00.1: HMC error interrupt
> Jul 12 00:49:25 cdm-storage kernel: i40e 0000:1a:00.1: HMC error info
> 0x80160601, HMC error data 0x17f200
> Jul 12 00:49:25 cdm-storage kernel: CPU: 8 PID: 411 Comm: kworker/8:1
> Tainted: P        W  OE   4.14.44 #1
> Jul 12 00:49:25 cdm-storage kernel: Hardware name: eCloudTech
> eCloudTech/Curry, BIOS 4.1.8 06/20/2019
> Jul 12 00:49:25 cdm-storage kernel: Workqueue: i40e i40e_service_task
> [i40e]
> Jul 12 00:49:25 cdm-storage kernel: task: ffffa04bbe4c0000 task.stack:
> ffffc0568d684000
> Jul 12 00:49:25 cdm-storage kernel: RIP:
> 0010:check_flush_dependency+0xb1/0x100
> Jul 12 00:49:25 cdm-storage kernel: RSP: 0018:ffffc0568d687c50 EFLAGS:
> 00010246
> Jul 12 00:49:25 cdm-storage kernel: RAX: 000000000000006f RBX:
> ffffa04ba920fe00 RCX: 0000000000000000
> Jul 12 00:49:25 cdm-storage iscsi-scstd: Initiator
> iqn.1994-05.com.redhat:7b788ca34e34 not allowed to connect to target
> iqn.2016-07.com.ecloudtech:cdm-storage.stor
> Jul 12 00:49:25 cdm-storage kernel: RDX: 0000000000000000 RSI:
> ffffa04bbec169b8 RDI: ffffa04bbec169b8
> Jul 12 00:49:25 cdm-storage kernel: RBP: ffffa00ddabd0000 R08:
> 0000000000000000 R09: 0000000000004351
> Jul 12 00:49:25 cdm-storage kernel: R10: 00000000000003ff R11:
> 0000000000aaaaaa R12: 0000000000000000
> Jul 12 00:49:25 cdm-storage kernel: R13: ffffa04a38e3785c R14:
> 0000000000000001 R15: ffffc0568d687c80
> Jul 12 00:49:25 cdm-storage kernel: FS:  0000000000000000(0000)
> GS:ffffa04bbec00000(0000) knlGS:0000000000000000
> Jul 12 00:49:25 cdm-storage kernel: CS:  0010 DS: 0000 ES: 0000 CR0:
> 0000000080050033
> Jul 12 00:49:25 cdm-storage kernel: CR2: 00007fb4a723f000 CR3:
> 0000003921e0a003 CR4: 00000000007606e0
> Jul 12 00:49:25 cdm-storage kernel: DR0: 0000000000000000 DR1:
> 0000000000000000 DR2: 0000000000000000
> Jul 12 00:49:25 cdm-storage kernel: DR3: 0000000000000000 DR6:
> 00000000fffe0ff0 DR7: 0000000000000400
> Jul 12 00:49:25 cdm-storage kernel: PKRU: 55555554
> Jul 12 00:49:25 cdm-storage kernel: Call Trace:
> Jul 12 00:49:25 cdm-storage kernel: flush_workqueue+0x132/0x460
> Jul 12 00:49:25 cdm-storage kernel: ib_cache_cleanup_one+0x21/0x30
> [ib_core]
> Jul 12 00:49:25 cdm-storage kernel: ib_unregister_device+0x107/0x180
> [ib_core]
> Jul 12 00:49:25 cdm-storage kernel: i40iw_destroy_rdma_device+0x61/0x170
> [i40iw]
> Jul 12 00:49:25 cdm-storage kernel: ? ib_dispatch_event+0x3f/0x70 [ib_core]
> Jul 12 00:49:25 cdm-storage kernel: ? i40iw_port_ibevent+0x3f/0x60 [i40iw]
> Jul 12 00:49:25 cdm-storage kernel: i40iw_deinit_device+0x7e/0x370 [i40iw]
> Jul 12 00:49:25 cdm-storage kernel:
> i40e_notify_client_of_netdev_close+0x47/0x90 [i40e]
> Jul 12 00:49:25 cdm-storage kernel: i40e_service_task+0xb78/0x12f0 [i40e]
> Jul 12 00:49:25 cdm-storage kernel: ? __switch_to_asm+0x40/0x70
> Jul 12 00:49:25 cdm-storage kernel: ? __switch_to_asm+0x34/0x70
> Jul 12 00:49:25 cdm-storage kernel: ? __switch_to_asm+0x40/0x70
> Jul 12 00:49:25 cdm-storage kernel: process_one_work+0x155/0x370
> Jul 12 00:49:25 cdm-storage kernel: worker_thread+0x47/0x3e0
> Jul 12 00:49:25 cdm-storage kernel: kthread+0xff/0x140
> Jul 12 00:49:25 cdm-storage kernel: ? max_active_store+0x80/0x80
> Jul 12 00:49:25 cdm-storage kernel: ? __kthread_parkme+0x70/0x70
> Jul 12 00:49:25 cdm-storage kernel: ret_from_fork+0x35/0x40
> Jul 12 00:49:25 cdm-storage kernel: Code: 9b 48 8b 55 18 48 8d 8b b0 00 00
> 00 48 81 c6 b0 00 00 00 4d 89 e0 48 c7 c7 90 f7 c6 89 31 c0 c6 05 22 43 35
> 01 01 e8 cc e6 03 00 <0f> 0b e9 6a ff ff ff 45 31 e4 e9 59 ff ff ff 31 ed
> eb 8e 80 3d
> Jul 12 00:49:25 cdm-storage kernel: ---[ end trace 81d5684fa78a43bc ]---
> Jul 12 00:49:25 cdm-storage kernel: i40iw_initialize_dev: DCB is set/clear
> = 0
> Jul 12 00:49:25 cdm-storage kernel: i40iw_wait_pe_ready: [1261] fm load
> status[x0703]
> Jul 12 00:49:25 cdm-storage kernel: i40iw_wait_pe_ready: [1263] CSR_CQP
> status[x0080]
> Jul 12 00:49:25 cdm-storage kernel: i40iw_wait_pe_ready: [1266]
> I40E_GLPE_CPUSTATUS1 status[x0080]
> Jul 12 00:49:25 cdm-storage kernel: i40iw_wait_pe_ready: [1269]
> I40E_GLPE_CPUSTATUS2 status[x0080]
> Jul 12 00:49:25 cdm-storage kernel: bond0: link status definitely up for
> interface eth1, 1000 Mbps full duplex
> Jul 12 00:49:25 cdm-storage kernel: bond0: first active interface up!
> _______________________________________________
>

Try to use latest lt or ml kernel provided by elrepo

Kernelnewbies mailing list
> Kernelnewbies at kernelnewbies.org
> https://lists.kernelnewbies.org/mailman/listinfo/kernelnewbies
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.kernelnewbies.org/pipermail/kernelnewbies/attachments/20210713/8e6a7389/attachment.html>


More information about the Kernelnewbies mailing list