Analyzing a series of kernel crashes (first trace)

[59235.794233] BUG: Bad page state in process QThread pfn:47e29c6

[59235.800135] page:ffff7e011f8a7180 count:0 mapcount:0 mapping:ffffa02fba689218 index:0x0

[59235.808107] flags: 0x5ffff0000000000()

[59235.811844] raw: 05ffff0000000000 ffffa02fba689218 0000000000000000 00000000ffffffff

[59235.819550] raw: dead000000000100 dead000000000200 0000000000000000 0000000000000000

[59235.827257] page dumped because: non-NULL mapping

[59235.832848] BUG: Bad page state in process QThread  pfn:47e29e3

[59235.838743] page:ffff7e011f8a78c0 count:0 mapcount:0 mapping:ffffa02fba689218 index:0x0

[59235.846713] flags: 0x5ffff0000000000()

[59235.850452] raw: 05ffff0000000000 ffffa02fba689218 0000000000000000 00000000ffffffff

[59235.858162] raw: dead000000000100 dead000000000200 0000000000000000 0000000000000000

[59235.865871] page dumped because: non-NULL mapping

[59235.959846] Unable to handle kernel paging request at virtual address 0023aee7

[59235.967041] Mem abort info:

[59235.969823]  ESR = 0x96000006

[59235.972865]  Exception class = DABT (current EL), IL = 32 bits

[59235.978758]  SET = 0, FnV = 0

[59235.981800]  EA = 0, S1PTW = 0

[59235.984927] Data abort info:

[59235.987795]  ISV = 0, ISS = 0x00000006

[59235.991613]  CM = 0, WnR = 0

[59235.994569] user pgtable: 4k pages, 48-bit VAs, pgd = 00000000ec1fcd2e

[59236.001067] [000000000023aee7] *pgd=0000004d004a5003, *pud=0000004e00825003, *pmd=0000000000000000

[59236.009992] Internal error: Oops: 96000006 [#1] SMP

[59236.014851] Modules linked in: nls_iso8859_1 snd_hda_codec_hdmi ipmi_ssif snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer hns_roce_hw_v2 joydev input_leds hns_roce snd soundcore ipmi_si ipmi_devintf ipmi_msghandler shpchp cppc_cpufreq sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear amdgpu ses enclosure hibmc_drm chash i2c_algo_bit ttm drm_kms_helper hid_generic aes_ce_blk aes_ce_cipher crc32_ce realtek crct10dif_ce hisi_sas_v3_hw syscopyarea usbhid hns3 sysfillrect hisi_sas_main ghash_ce sysimgblt sha2_ce fb_sys_fops sha256_arm64 drm libsas hclge sha1_ce

[59236.085189]  megaraid_sas hid ahci hnae3 libahci scsi_transport_sas gpio_dwapb aes_neon_bs aes_neon_blk crypto_simd cryptd aes_arm64

[59236.097059] Process sdma0 (pid: 1410, stack limit = 0x00000000f719f93b)

[59236.103649] CPU: 62 PID: 1410 Comm: sdma0 Tainted: G    B            4.15.18 #1

[59236.110924] Hardware name: ********************redacted************************* 12/14/2019

[59236.119064] pstate: 40c00089 (nZcv daIf +PAN +UAO)

[59236.123841] pc : ___slab_alloc+0x8c/0x520

[59236.127834] lr : ___slab_alloc+0x314/0x520

[59236.131913] sp : ffff00002cc0bb70

[59236.135213] x29: ffff00002cc0bb70 x28: 000000000023aee7

[59236.140502] x27: ffff802fbe007c00 x26: ffff7e011f8a6980

[59236.145791] x25: ffff0000020eeee4 x24: ffff80579fddf2e0

[59236.151080] x23: 00000000014000c0 x22: 00000000ffffffff

[59236.156368] x21: 00000000014000c0 x20: ffff802fbe007c00

[59236.161656] x19: ffff80579fddf2e0 x18: 0000ffff0dff998d

[59236.166946] x17: 0000ffffa85860a0 x16: ffff000008308b40

[59236.172234] x15: 00005fc3e6000000 x14: 0000000100000043

[59236.172467] BUG: Bad page state in process QThread  pfn:47e29e5

[59236.177522] x13: 0000000100000042 x12: ffff80579fddff48

[59236.183421] page:ffff7e011f8a7940 count:0 mapcount:0 mapping:ffffa02fba689218 index:0x0

[59236.196676] x11: 0000000100000040

[59236.196679] flags: 0x5ffff0000000000()

[59236.196680] x10: 0000000000000ad0

[59236.200070] raw: 05ffff0000000000 ffffa02fba689218 0000000000000000 00000000ffffffff

[59236.207187] raw: dead000000000100 dead000000000200 0000000000000000 0000000000000000

[59236.214895] x9 : ffff00002cc0bd10

[59236.222600] page dumped because: non-NULL mapping

[59236.222601] x8 : 0000000000000000

[59236.225987] Modules linked in: nls_iso8859_1

[59236.234057]  snd_hda_codec_hdmi

[59236.238308] x7 : ffff000009689180

[59236.238309]  ipmi_ssif

[59236.241436] x6 : 0000000000000001

[59236.244822]  snd_hda_intel

[59236.247176] x5 : 0000000000000035

[59236.250564]  snd_hda_codec

[59236.253258] x4 : ffff7e01470db980

[59236.256645]  snd_hda_core

[59236.262726]  snd_hwdep

[59236.265334] x3 : fba417b2dcd16eca

[59236.265336]  snd_pcm

[59236.267684] x2 : 0000000000000000

[59236.271069]  snd_timer

[59236.276632]  hns_roce_hw_v2

[59236.278980] x1 : 0000000009074927

[59236.278982]  joydev

[59236.281764] x0 : 000000000023aee7

[59236.285151]  input_leds

[59236.290626]  hns_roce

[59236.293063] Call trace:

[59236.293063]  snd soundcore

[59236.295335]  ___slab_alloc+0x8c/0x520

[59236.297767]  ipmi_si

[59236.300463]  __slab_alloc+0x50/0x68

[59236.300465]  __kmalloc+0x224/0x298

[59236.304110]  ipmi_devintf

[59236.306370]  amdgpu_vm_grab_id+0xa4/0x8e8 [amdgpu]

[59236.309761]  ipmi_msghandler

[59236.313217]  amdgpu_job_dependency+0xf4/0x148 [amdgpu]

[59236.315754]  shpchp

[59236.320579]  amd_sched_main+0xc8/0x488 [amdgpu]

[59236.323390]  cppc_cpufreq

[59236.328509]  kthread+0x134/0x138

[59236.330593]  sch_fq_codel ib_iser

[59236.335107]  ret_from_fork+0x10/0x18

[59236.337713]  rdma_cm

[59236.340928] Code: f9400661 8b020380 f940a363 91000421 (f8626b82)

[59236.344226]  iw_cm

[59236.347791] SMP: stopping secondary CPUs

[59236.349962]  ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear amdgpu ses enclosure hibmc_drm chash i2c_algo_bit ttm drm_kms_helper hid_generic aes_ce_blk aes_ce_cipher crc32_ce realtek crct10dif_ce hisi_sas_v3_hw syscopyarea usbhid hns3 sysfillrect hisi_sas_main ghash_ce sysimgblt sha2_ce fb_sys_fops sha256_arm64 drm libsas hclge sha1_ce megaraid_sas hid ahci hnae3 libahci scsi_transport_sas gpio_dwapb aes_neon_bs aes_neon_blk crypto_simd cryptd aes_arm64

[59236.418599] CPU: 56 PID: 7517 Comm: QThread Tainted: G    B            4.15.18 #1

[59236.426045] Hardware name: ****************redacted*************, BIOS 1.08 12/14/2019

[59236.434184] Call trace:

[59236.436629]  dump_backtrace+0x0/0x198

[59236.440275]  show_stack+0x24/0x30

[59236.443578]  dump_stack+0x98/0xbc

[59236.446880]  bad_page+0xf0/0x158

[59236.450092]  check_new_page_bad+0x80/0xb0

[59236.454083]  get_page_from_freelist+0xe70/0x1410

[59236.458679]  __alloc_pages_nodemask+0x11c/0xd50

[59236.463191]  alloc_pages_current+0x88/0xe8

[59236.467270]  iommu_dma_alloc+0x13c/0x3e0

[59236.471176]  __iommu_alloc_attrs+0xbc/0x2a0

[59236.475352]  ttm_dma_pool_get_pages+0x324/0x4a8 [ttm]

[59236.480384]  ttm_dma_populate+0x150/0x390 [ttm]

[59236.484960]  amdgpu_ttm_tt_populate+0xec/0x100 [amdgpu]

[59236.490165]  ttm_tt_bind+0x3c/0x80 [ttm]

[59236.494072]  ttm_bo_handle_move_mem+0x454/0x490 [ttm]

[59236.499104]  ttm_bo_validate+0x160/0x170 [ttm]

[59236.503530]  ttm_bo_init_reserved+0x300/0x3e0 [ttm]

[59236.508437]  amdgpu_bo_do_create+0x180/0x3d0 [amdgpu]

[59236.513509]  amdgpu_bo_create+0x88/0x230 [amdgpu]

[59236.518233]  amdgpu_gem_object_create+0xa0/0x140 [amdgpu]

[59236.523648]  amdgpu_gem_create_ioctl+0x194/0x2b0 [amdgpu]

[59236.529047]  drm_ioctl_kernel+0x70/0xd0 [drm]

[59236.533398]  drm_ioctl+0x1fc/0x458 [drm]

[59236.537344]  amdgpu_drm_ioctl+0x58/0x90 [amdgpu]

[59236.541942]  do_vfs_ioctl+0xc4/0xb50

[59236.545502]  SyS_ioctl+0x8c/0xa8

[59236.548715]  el0_svc_naked+0x30/0x34
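
A quick sanity check on the three "Bad page state" reports in the raw trace: the page: pointer and the pfn: value in each report should describe the same physical page. The sketch below is a minimal check, assuming a 64-byte struct page and a struct page array that starts at 0xffff7e0000000000. Both values are inferred from the numbers printed in the log, not read from this kernel's configuration, and page_to_pfn here is a hypothetical helper written for illustration, not the kernel macro.

```python
# Assumptions (not taken from the kernel source): both constants are inferred
# from the page/pfn pairs printed in the log itself.
VMEMMAP_BASE = 0xFFFF7E0000000000   # inferred base of the struct page array
STRUCT_PAGE_SIZE = 64               # inferred sizeof(struct page) in bytes


def page_to_pfn(page_addr: int) -> int:
    """Map a 'page:' pointer from a bad-page report to a page frame number."""
    offset = page_addr - VMEMMAP_BASE
    if offset < 0 or offset % STRUCT_PAGE_SIZE:
        raise ValueError(f"{page_addr:#x} is not a struct page slot")
    return offset // STRUCT_PAGE_SIZE


# (page pointer, pfn) pairs copied from the three reports in the raw trace
REPORTS = [
    (0xFFFF7E011F8A7180, 0x47E29C6),
    (0xFFFF7E011F8A78C0, 0x47E29E3),
    (0xFFFF7E011F8A7940, 0x47E29E5),
]

for page, pfn in REPORTS:
    status = "ok" if page_to_pfn(page) == pfn else "MISMATCH"
    print(f"page {page:#x} -> pfn {page_to_pfn(page):#x} ({status})")
```

If the pairs agree, the reports themselves are internally consistent, and the thing to look at is the field that bad_page() complained about: the non-NULL mapping value, which is the same pointer (ffffa02fba689218) in all three reports.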



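The raw trace above clearly contains the output of several crashes interleaved with one another. I analyzed it and reconstructed the individual traces; they break down into the following segments: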

[19950.518042] BUG: Bad page state in process QThread pfn:2081ba5

[19950.525013] page:ffff7e008206e940 count:0 mapcount:0 mapping:ffffa02fba53c658 index:0x0

[19950.525016] flags: 0x1ffff0000000000()

[19950.525021] raw: 01ffff0000000000 ffffa02fba53c658 0000000000000000 00000000ffffffff

[19950.525022] raw: dead000000000100 dead000000000200 0000000000000000 0000000000000000

[19950.525024] page dumped because: non-NULL mapping

[19950.600974] CPU: 5 PID: 68692 Comm: QThread Not tainted 4.15.18 #1

[19950.605831] Modules linked in:

[19950.611981] Hardware name:  12/14/2019

[19950.615024] Call trace:

[19950.625862]  dump_backtrace+0x0/0x198

[19950.630646]  show_stack+0x24/0x30

[19950.638630]  dump_stack+0x98/0xbc

[19950.644623]  bad_page+0xf0/0x158

[19950.644625]  check_new_page_bad+0x80/0xb0

[19950.650534]  get_page_from_freelist+0xe70/0x1410

[19950.657739]  __alloc_pages_nodemask+0x11c/0xd50

[19950.664686]  alloc_pages_current+0x88/0xe8

[19950.664690]  iommu_dma_alloc+0x13c/0x3e0

[19950.671375]  __iommu_alloc_attrs+0xbc/0x2a0

[19950.679106]  ttm_dma_pool_get_pages+0x324/0x4a8 [ttm]

[19950.679112]  ttm_dma_populate+0x150/0x390 [ttm]

[19950.684935]  amdgpu_ttm_tt_populate+0xec/0x100 [amdgpu]

[19950.692148]  ttm_tt_bind+0x3c/0x80 [ttm]

[19950.701684]  ttm_bo_handle_move_mem+0x454/0x490 [ttm]

[19950.708974]  ttm_bo_validate+0x160/0x170 [ttm]

[19950.715921]  ttm_bo_init_reserved+0x300/0x3e0 [ttm]

[19950.723337]  amdgpu_bo_do_create+0x180/0x3d0 [amdgpu]

[19950.723381]  amdgpu_bo_create+0x88/0x230 [amdgpu]

[19950.730415]  amdgpu_gem_object_create+0xa0/0x140 [amdgpu]

[19950.730458]  amdgpu_gem_create_ioctl+0x194/0x2b0 [amdgpu]

[19950.739292]  drm_ioctl_kernel+0x70/0xd0 [drm]

[19950.747607]  drm_ioctl+0x1fc/0x458 [drm]

[19950.754503]  amdgpu_drm_ioctl+0x58/0x90 [amdgpu]

[19950.765200]  do_vfs_ioctl+0xc4/0xb50

[19950.771538]  SyS_ioctl+0x8c/0xa8

[19950.771541]  el0_svc_naked+0x30/0x34

[19950.511400] Unable to handle kernel paging request at virtual address 1f7c7ec9167269

[19950.519143] Mem abort info:

[19950.527818]  ESR = 0x96000004

[19950.539529]  Exception class = DABT (current EL), IL = 32 bits

[19950.554952]  SET = 0, FnV = 0

[19950.562692]  EA = 0, S1PTW = 0

[19950.571633] Data abort info:

[19950.577637]  ISV = 0, ISS = 0x00000004

[19950.577640]  CM = 0, WnR = 0

[19950.584424] [001f7c7ec9167269] address between user and kernel address ranges

[19950.591540] Internal error: Oops: 96000004 [#1] SMP

[19951.088994] Process gfx (pid: 1351, stack limit = 0x00000000a401cdf7)

[19951.093518] CPU: 33 PID: 1351 Comm: gfx Tainted: G    B            4.15.18 #1

[19951.442902] Hardware name: ****************redacted************************* 12/14/2019

[19951.098217] pstate: 20c00009 (nzCv daif +PAN +UAO)

[19951.102399] pc : kmem_cache_alloc+0xd4/0x220

[19951.111243] lr : kmem_cache_alloc+0x1d8/0x220

[19951.115025] [drm:amdgpu_gem_object_create [amdgpu]] *ERROR* Failed to allocate GEM object (1048576, 2, 4096, -12)

[19951.121039] sp : ffff0000280e3c80

[19951.121041] x29: ffff0000280e3c80

[19951.131442] x28: 0000000000000000

[19951.138733] x27: ffff802f974c09c0 x26: ffffa05789238800

[19951.152110] x25: 0000000000000000

[19951.166143] x24: ffff802f974c0ee0

[19951.263930] x23: ffffa02fa2cf9200

[19951.272861] x22: ffff000002491384

[19951.280326] x21: 00000000014000c0

[19951.290900] x20: ffffa02fa2cf9200

[19951.312049] x19: ffff802081ba40e0

[19951.322623] x18: 0000ffffb7ee3a70

[19951.343769] x17: 0000ffffb944bff8

[19951.354343] x16: ffff000008185188

[19951.369700] x15: 0000000000000000

[19951.377337] x14: 0000000000000002

[19951.392176] x13: 000000000000270f

[19951.400246] x12: 0000000000000001

[19951.418888] x11: 0000000000000000

[19951.428339] x10: 0000000000000ad0

[19951.448277] x9 : ffff0000280e3d10

[19951.457989] x8 : ffff802081ba40e0

[19951.466490] x7 : 721f7c7ec9167269

[19951.473263] x6 : 8de0fc5e48ac3289

[19951.494932] x5 : 00000000017d19ad

[19951.501100] x4 : ffff80579fb103b0

[19951.514125] x3 : 00000000017d19ae

[19951.519685] x2 : 721f03a1b753cd76

[19951.524990] x1 : 0000000000000000

[19951.531336] x0 : 721f7c7ec9167269

[19951.543151] Call trace:

[19951.549061]  kmem_cache_alloc+0xd4/0x220

[19951.555483]  amdgpu_sync_fence+0xc4/0x168 [amdgpu]

[19951.560944]  amdgpu_vm_grab_id+0x234/0x8e8 [amdgpu]

[19951.566322]  amdgpu_job_dependency+0xf4/0x148 [amdgpu]

[19951.566372]  amd_sched_main+0xc8/0x488 [amdgpu]

[19951.571111]  kthread+0x134/0x138

[19951.571114]  ret_from_fork+0x10/0x18

[19950.778644] BUG: Bad page state in process QThread  pfn:2081cf4

[19950.785088] page:ffff7e0082073d00 count:0 mapcount:0 mapping:ffffa02fba53c658 index:0x18f3d5

[19950.791260] flags: 0x1ffff0000000000()

[19950.798555] raw: 01ffff0000000000 ffffa02fba53c658 000000000018f3d5 00000000ffffffff

[19950.804462] raw: dead000000000100 dead000000000200 0000000000000000 0000000000000000

[19950.812621] page dumped because: non-NULL mapping

[19950.823719] Modules linked in:

[19951.433196] CPU: 17 PID: 78440 Comm: QThread Tainted: G    B  W        4.15.18 #1

[19951.098215] Hardware name: **************redacted****************** 12/14/2019

[19951.442904] Call trace:

[19951.453657]  dump_backtrace+0x0/0x198

[19951.461894]  show_stack+0x24/0x30

[19951.470053]  dump_stack+0x98/0xbc

[19951.476822]  bad_page+0xf0/0x158

[19951.476823]  free_pages_check_bad+0x80/0xa0

[19951.491630]  free_pcppages_bulk+0x510/0x568

[19951.491632]  free_unref_page_commit+0xe8/0x118

[19951.498318]  free_unref_page+0x74/0x88

[19951.498320]  __free_pages+0x58/0x68

[19951.504489]  __iommu_dma_free_pages+0x38/0x58

[19951.512037]  iommu_dma_free+0x4c/0x60

[19951.516303]  __iommu_free_attrs+0x94/0x1a8

[19951.521615]  __ttm_dma_free_page.isra.4.constprop.11+0xa8/0xe8 [ttm]

[19951.527775]  ttm_dma_unpopulate+0x100/0x4b0 [ttm]

[19951.527851]  amdgpu_ttm_tt_unpopulate+0xa8/0xb8 [amdgpu]

[19951.534726]  ttm_tt_unpopulate.part.2+0x64/0x70 [ttm]

[19951.540460]  ttm_tt_destroy.part.3+0x68/0x70 [ttm]

[19951.546540]  ttm_tt_destroy+0x24/0x30 [ttm]

[19951.552448]  ttm_bo_cleanup_memtype_use+0x40/0x98 [ttm]

[19951.557493]  ttm_bo_unref+0x1d0/0x270 [ttm]

[19951.562932]  amdgpu_bo_unref+0x44/0x78 [amdgpu]

[19951.568546]  amdgpu_gem_object_free+0x44/0x68 [amdgpu]

[19951.574526]  drm_gem_object_free+0x30/0x70 [drm]

[19951.579729]  drm_gem_object_put_unlocked+0x7c/0xa0 [drm]

[19951.583649]  drm_gem_object_handle_put_unlocked+0x68/0xd0 [drm]

[19951.583664]  drm_gem_object_release_handle+0x5c/0x98 [drm]

[19951.831001]  drm_gem_handle_delete+0x68/0xb8 [drm]

[19951.835784]  drm_gem_close_ioctl+0x3c/0x58 [drm]

[19951.840393]  drm_ioctl_kernel+0x70/0xd0 [drm]

[19951.844744]  drm_ioctl+0x1fc/0x458 [drm]

[19951.848695]  amdgpu_drm_ioctl+0x58/0x90 [amdgpu]

[19951.853292]  do_vfs_ioctl+0xc4/0xb50

[19951.856851]  SyS_ioctl+0x8c/0xa8

[19951.860066]  el0_svc_naked+0x30/0x34

[19951.114264] ------------[ cut here ]------------

[19951.114288] WARNING: CPU: 48 PID: 76977 at drivers/iommu/io-pgtable-arm.c:304 __arm_lpae_map+0x148/0x2f0

[19951.114289] Modules linked in: nls_iso8859_1 ipmi_ssif snd_hda_codec_hdmi snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer hns_roce_hw_v2 snd hns_roce soundcore shpchp joydev input_leds ipmi_si ipmi_devintf ipmi_msghandler cppc_cpufreq sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear amdgpu ses enclosure hibmc_drm chash i2c_algo_bit ttm aes_ce_blk aes_ce_cipher crc32_ce hid_generic crct10dif_ce realtek drm_kms_helper ghash_ce usbhid sha2_ce hns3 hisi_sas_v3_hw syscopyarea sysfillrect sysimgblt hisi_sas_main fb_sys_fops sha256_arm64 drm libsas hclge sha1_ce

[19951.114347]  megaraid_sas hid ahci hnae3 libahci scsi_transport_sas gpio_dwapb aes_neon_bs aes_neon_blk crypto_simd cryptd aes_arm64

[19951.114359] CPU: 48 PID: 76977 Comm: QThread Tainted: G    B            4.15.18 #1

[19951.114359] Hardware name:  12/14/2019

[19951.114361] pstate: 40400009 (nZcv daif +PAN -UAO)

[19951.114362] pc : __arm_lpae_map+0x148/0x2f0

[19951.114364] lr : __arm_lpae_map+0x148/0x2f0

[19951.114364] sp : ffff0000ed9b3260

[19951.114365] x29: ffff0000ed9b3260 x28: ffff802081c98fc8

[19951.114367] x27: 0000000000001000 x26: ffff802081c98000

[19951.114369] x25: 0000000000001000 x24: 0000000000000003

[19951.114370] x23: ffffa02fa0a7d900 x22: 0000000000000844

[19951.114372] x21: 000000465b059000 x20: 0000000000001000

[19951.114374] x19: 000000005b7f9000 x18: 0000000000000000

[19951.114376] x17: 0000ffff8af82b10 x16: ffff000008304950

[19951.114377] x15: ffff0000c1141000 x14: 0140000000000000

[19951.114379] x13: ffff0000c1152000 x12: ffff000009510000

[19951.114381] x11: 0000000000000034 x10: ffff000009696000

[19951.114383] x9 : 0000000000000000 x8 : ffffa057ffe0b09c

[19951.114384] x7 : 0000000000000000 x6 : 0000019dc7764efd

[19951.114386] x5 : 00ffffffffffffff x4 : 0000000000000000

[19951.114388] x3 : 0000000000000000 x2 : f1ee3f9dc7f05d00

[19951.114389] x1 : 0000000000000000 x0 : 0000000000000024

[19951.114391] Call trace:

[19951.114393]  __arm_lpae_map+0x148/0x2f0

[19951.114394]  __arm_lpae_map+0x1b0/0x2f0

[19951.114396]  __arm_lpae_map+0x1b0/0x2f0

[19951.114397]  __arm_lpae_map+0x1b0/0x2f0

[19951.114398]  arm_lpae_map+0x104/0x138

[19951.114401]  arm_smmu_map+0x50/0x70

[19951.114402]  iommu_map+0x108/0x2b0

[19951.114403]  default_iommu_map_sg+0x10c/0x180

[19951.114405]  iommu_dma_alloc+0x23c/0x3e0

[19951.114408]  __iommu_alloc_attrs+0xbc/0x2a0

[19951.114418]  ttm_dma_pool_get_pages+0x324/0x4a8 [ttm]

[19951.114423]  ttm_dma_populate+0x150/0x390 [ttm]

[19951.114496]  amdgpu_ttm_tt_populate+0xec/0x100 [amdgpu]

[19951.114501]  ttm_tt_bind+0x3c/0x80 [ttm]

[19951.114506]  ttm_bo_handle_move_mem+0x454/0x490 [ttm]

[19951.114510]  ttm_bo_validate+0x160/0x170 [ttm]

[19951.114514]  ttm_bo_init_reserved+0x300/0x3e0 [ttm]

[19951.114559]  amdgpu_bo_do_create+0x180/0x3d0 [amdgpu]

[19951.114602]  amdgpu_bo_create+0x88/0x230 [amdgpu]

[19951.114645]  amdgpu_gem_object_create+0xa0/0x140 [amdgpu]

[19951.114688]  amdgpu_gem_create_ioctl+0x194/0x2b0 [amdgpu]

[19951.114712]  drm_ioctl_kernel+0x70/0xd0 [drm]

[19951.114727]  drm_ioctl+0x1fc/0x458 [drm]

[19951.114768]  amdgpu_drm_ioctl+0x58/0x90 [amdgpu]

[19951.114772]  do_vfs_ioctl+0xc4/0xb50

[19951.114773]  SyS_ioctl+0x8c/0xa8

[19951.114776]  el0_svc_naked+0x30/0x34

[19951.114777] ---[ end trace 140038004b48ffce ]---

[19951.863655] BUG: Bad page state in process QThread  pfn:2081c1d

[19951.869550] page:ffff7e0082070740 count:0 mapcount:0 mapping:ffffa02fba53c658 index:0x18f2fe

[19951.877949] flags: 0x1ffff0000000000()

[19951.881683] raw: 01ffff0000000000 ffffa02fba53c658 000000000018f2fe 00000000ffffffff

[19951.889391] raw: dead000000000100 dead000000000200 0000000000000000 0000000000000000

[19951.897096] page dumped because: non-NULL mapping

[19951.901779] Modules linked in: nls_iso8859_1 ipmi_ssif snd_hda_codec_hdmi snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer hns_roce_hw_v2 snd hns_roce soundcore shpchp joydev input_leds ipmi_si ipmi_devintf ipmi_msghandler cppc_cpufreq sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear amdgpu ses enclosure hibmc_drm chash i2c_algo_bit ttm aes_ce_blk aes_ce_cipher crc32_ce hid_generic crct10dif_ce realtek drm_kms_helper ghash_ce usbhid sha2_ce hns3 hisi_sas_v3_hw syscopyarea sysfillrect sysimgblt hisi_sas_main fb_sys_fops sha256_arm64 drm libsas hclge sha1_ce

[19951.972096]  megaraid_sas hid ahci hnae3 libahci scsi_transport_sas gpio_dwapb aes_neon_bs aes_neon_blk crypto_simd cryptd aes_arm64

[19951.983960] CPU: 16 PID: 38546 Comm: QThread Tainted: G    B  W        4.15.18 #1

[19951.991492] Hardware name: *********************redacted********************** 12/14/2019

[19951.999629] Call trace:

[19952.002070]  dump_backtrace+0x0/0x198

[19952.005715]  show_stack+0x24/0x30

[19952.009018]  dump_stack+0x98/0xbc

[19952.012319]  bad_page+0xf0/0x158

[19952.015532]  free_pages_check_bad+0x80/0xa0

[19952.019697]  free_pcppages_bulk+0x510/0x568

[19952.023860]  free_unref_page_commit+0xe8/0x118

[19952.028283]  free_unref_page+0x74/0x88

[19952.032015]  __free_pages+0x58/0x68

[19952.035489]  __iommu_dma_free_pages+0x38/0x58

[19952.039824]  iommu_dma_free+0x4c/0x60

[19952.043472]  __iommu_free_attrs+0x94/0x1a8

[19952.047555]  __ttm_dma_free_page.isra.4.constprop.11+0xa8/0xe8 [ttm]

[19952.053883]  ttm_dma_unpopulate+0x100/0x4b0 [ttm]

[19952.058618]  amdgpu_ttm_tt_unpopulate+0xa8/0xb8 [amdgpu]

[19952.063908]  ttm_tt_unpopulate.part.2+0x64/0x70 [ttm]

[19952.068939]  ttm_tt_destroy.part.3+0x68/0x70 [ttm]

[19952.073709]  ttm_tt_destroy+0x24/0x30 [ttm]

[19952.077875]  ttm_bo_cleanup_memtype_use+0x40/0x98 [ttm]

[19952.083078]  ttm_bo_unref+0x1d0/0x270 [ttm]

[19952.087285]  amdgpu_bo_unref+0x44/0x78 [amdgpu]

[19952.091837]  amdgpu_gem_object_free+0x44/0x68 [amdgpu]

[19952.096966]  drm_gem_object_free+0x30/0x70 [drm]

[19952.101575]  drm_gem_object_put_unlocked+0x7c/0xa0 [drm]

[19952.106875]  drm_gem_object_handle_put_unlocked+0x68/0xd0 [drm]

[19952.112780]  drm_gem_object_release_handle+0x5c/0x98 [drm]

[19952.118252]  drm_gem_handle_delete+0x68/0xb8 [drm]

[19952.123034]  drm_gem_close_ioctl+0x3c/0x58 [drm]

[19952.127643]  drm_ioctl_kernel+0x70/0xd0 [drm]

[19952.131991]  drm_ioctl+0x1fc/0x458 [drm]

[19952.135937]  amdgpu_drm_ioctl+0x58/0x90 [amdgpu]

[19952.140532]  do_vfs_ioctl+0xc4/0xb50

[19952.144090]  SyS_ioctl+0x8c/0xa8

[19952.147303]  el0_svc_naked+0x30/0x34

[19952.609545] SMP: failed to stop secondary CPUs 0-32,34-127
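
The "Mem abort info" blocks in the two Oops reports can be cross-checked by decoding the ESR value by hand. The sketch below is a minimal decoder, assuming only the architectural ESR_EL1 layout (EC in bits 31:26, IL in bit 25, ISS in bits 24:0; for data aborts, WnR in bit 6 and the fault status code DFSC in bits 5:0). The function name decode_esr and the small fault table are illustrative and cover only the fault classes relevant here.

```python
# Partial table of data fault status codes (DFSC); the low two bits give the
# translation table level at which the fault was detected.
FAULT_CLASSES = {
    0b0001: "translation fault",
    0b0010: "access flag fault",
    0b0011: "permission fault",
}

DATA_ABORT_ECS = {
    0x24: "data abort from a lower exception level",
    0x25: "data abort from the current exception level",
}


def decode_esr(esr: int) -> dict:
    """Split an ESR_EL1 value into the fields printed under 'Mem abort info'."""
    ec = (esr >> 26) & 0x3F
    iss = esr & 0x1FFFFFF
    fields = {"EC": ec, "IL": (esr >> 25) & 1, "ISS": iss}
    if ec in DATA_ABORT_ECS:
        dfsc = iss & 0x3F
        fields["class"] = DATA_ABORT_ECS[ec]
        fields["WnR"] = (iss >> 6) & 1          # 0 = read, 1 = write
        fault = FAULT_CLASSES.get(dfsc >> 2)
        fields["fault"] = (
            f"{fault}, level {dfsc & 0b11}" if fault else f"DFSC {dfsc:#x}"
        )
    return fields


# The two ESR values that appear in the Oops reports
for esr in (0x96000006, 0x96000004):
    print(f"{esr:#x}: {decode_esr(esr)}")
```

Under these assumptions, 0x96000006 decodes to a read that hit a level 2 translation fault, which matches the *pmd=0000000000000000 line in the first Oops, and 0x96000004 decodes to a read that hit a level 0 translation fault, which fits the "address between user and kernel address ranges" message in the second one. In both cases the faulting address is also the value held in x0, inside the slab allocator.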
