检测到您已登录华为云国际站账号,为了您更好的体验,建议您访问国际站服务网站 https://www.huaweicloud.com/intl/zh-cn

不再显示此消息

  • Intl-English
    International
    • English
    • Bahasa Indonesia
    • Español
    • Português
    • Türkçe
    • عربي
    • ไทย
    • 简体中文
    • 日本語
    中国站
    • 简体中文
    Europe
    • English
    • Deutsch
    • Español
    • Français
    • Nederlands
  • Huawei Cloud
    • Activities
    • Products
    • Solutions
    • Pricing
    • KooGallery
    • Partners
    • Developers
    • Support
    • About Us
      Show more results for “”
      • Contact Us
      • Documentation
      • Console
        • My Account
        • Billing & Costs
        • Service Tickets
        • Unread Messages
        • Console
        • Partner Center
        • Sign In Sign Up
      • Sign In
      • Sign Up
        • My Account Complete Sign Up
        • Billing & Costs
        • Service Tickets
        • Unread Messages
        • Console
        • Partner Center
        • Log Out
      Cancel
      Help Center/ Elastic Cloud Server/ Troubleshooting/ Self-diagnosis of Faulty GPU-accelerated ECSs/ Fault Diagnosis and Handling of Graphics Cards
      Updated on 2025-07-30 GMT+08:00
      View PDF
      Share
      • x.com
      • Facebook
      • LinkedIn
      • Copy link

      Copied.

      Fault Diagnosis and Handling of Graphics Cards

      • How Do I Handle the infoROM Error?
      • What Do I Do If ECC Error "double bit ecc error" Occurs and There Are No Retired Pages Shown in the nvidia-smi -q Command Output?
      • What Do I Do If nvidia-smi Command Output Shows the SRAM ECC Error (V100 GPUs)?
      • What Do I Do If the GPU Is Disconnected or the Graphics Card Can't Be found, or rev ff Is Displayed After lspci | grep -i nvidia Is Executed?
      • What Do I Do If the nvidia-smi Command Output Shows Overheated GPUs?
      • What Do I Do If "Unable to load the kernel module 'nvidia.ko'" Is Displayed During Driver Installation?
      • What Can I Do If an Xid Error Is Displayed in the Message Log When a GPU-accelerated ECS Is Faulty?
      Parent topic: Self-diagnosis of Faulty GPU-accelerated ECSs

      Previous topic: What Do I Do If I Have Installed the GRID Driver but Have Not Purchased or Configured the License?

      Next topic: How Do I Handle the infoROM Error?

      Feedback

      Was this page helpful?

      Helpful Not helpful
      Provide feedback

      Thank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.

      The system is busy. Please try again later.

      Which of the following issues have you encountered?

      Content is inconsistent with the product UI
      Unclear descriptions
      Lack of examples or code
      Incorrect steps
      Can't find what I need
      Lack of best practices

      Feedback (optional)

      0/500

      Select at least one type of issue, and enter your comments or suggestions.

      Enter a maximum of 500 characters.

      Submit Cancel

      For any further questions, feel free to contact us through the chatbot.

      Chatbot
      Contact Sales After-Sales Self Service
      • Site Terms
      • Privacy Statement

      Explore Huawei Cloud

      Why Us Customer Stories Trust Center Legal Press Releases

      Featured Services

      Elastic Cloud Server (ECS) Elastic IP (EIP) RDS for MySQL Elastic Volume Service (EVS) MapReduce Service (MRS)

      Service and Support

      Documentation Contact Us Public Notices Support Plans Service Health Dashboard

      Account and Payment

      Top Up Invoices Billing Center My Account Payment Method

      Quick Links

      Huawei Corporate Huawei Enterprise Huawei Consumer Business Huawei Developers

      © 2025, Huawei Cloud Computing Technologies Co., Ltd. and/or its affiliates. All rights reserved.

      • Site Terms
      • Privacy Statement