Data Storage

Your present location > Home page > Data Storage
MOSFET Selection Strategy and Device Adaptation Handbook for AI Training Servers (8-GPU) with High-Power Density and Reliability Requirements
AI Server MOSFET Selection Strategy Topology Diagram

AI Training Server (8-GPU) Power System Overall Topology Diagram

graph LR %% Input Power Distribution subgraph "Input Power & Distribution" AC_IN["Three-Phase AC Input"] --> PSU["Server Power Supply Unit"] PSU --> DC_48V["48V Intermediate Bus"] DC_48V --> DC_12V["12V Auxiliary Bus"] DC_48V --> GPU_VRM_IN["GPU VRM Input (48V)"] DC_12V --> AUX_LOAD["Auxiliary Loads"] end %% GPU Power Delivery subgraph "GPU VRM & Power Delivery (8 GPUs)" GPU_VRM_IN --> MULTIPHASE_VRM["Multi-Phase VRM Controller"] subgraph "VRM Power Stage Array" VRM_HS1["VBGL1105
High-Side"] VRM_LS1["VBGL1105
Low-Side"] VRM_HS2["VBGL1105
High-Side"] VRM_LS2["VBGL1105
Low-Side"] end MULTIPHASE_VRM --> VRM_DRIVER["VRM Gate Driver"] VRM_DRIVER --> VRM_HS1 VRM_DRIVER --> VRM_LS1 VRM_DRIVER --> VRM_HS2 VRM_DRIVER --> VRM_LS2 VRM_HS1 --> GPU_POWER["GPU Core Power"] VRM_LS1 --> GND_VRM VRM_HS2 --> GPU_MEM_POWER["GPU Memory Power"] VRM_LS2 --> GND_VRM GPU_POWER --> GPU_LOAD1["GPU Core Load"] GPU_MEM_POWER --> GPU_LOAD2["GPU Memory Load"] end %% Auxiliary Systems subgraph "High-Current Auxiliary & Cooling Systems" subgraph "Fan Control & Drive" FAN_CONTROLLER["Fan Controller"] --> FAN_DRIVER["Fan Driver"] FAN_DRIVER --> FAN_SWITCH["VBM2625
P-MOSFET"] FAN_SWITCH --> FAN_ARRAY["Server Fan Array"] FAN_ARRAY --> GND_FAN end subgraph "Auxiliary Power Switching" AUX_SW_CONTROLLER["Auxiliary Controller"] --> AUX_SWITCH["VBM2625
P-MOSFET"] AUX_SWITCH --> HIGH_POWER_AUX["High-Power Auxiliary Loads"] HIGH_POWER_AUX --> GND_AUX end end %% Control & POL Systems subgraph "Control, Protection & Point-of-Load" subgraph "POL & Control Switching" MCU["Main Control MCU"] --> POL_SW1["VBQD3222U
Channel 1"] MCU --> POL_SW2["VBQD3222U
Channel 2"] POL_SW1 --> POL_LOAD1["POL Load 1"] POL_SW2 --> POL_LOAD2["POL Load 2"] POL_LOAD1 --> GND_POL POL_LOAD2 --> GND_POL end subgraph "Protection & Safety Circuits" HOTSWAP_CONTROLLER["Hot-Swap Controller"] --> HOTSWAP_SW["VBJ2456
P-MOSFET"] OVP_CIRCUIT["Over-Voltage Protection"] --> PROTECTION_IC["Protection IC"] OCP_CIRCUIT["Over-Current Protection"] --> CURRENT_SENSE["Current Sensor"] end end %% Thermal Management subgraph "Three-Level Thermal Management Architecture" COOLING_LEVEL1["Level 1: Liquid Cooling
GPU VRM & High-Current MOSFETs"] COOLING_LEVEL2["Level 2: Forced Air Cooling
Primary Side & Auxiliary MOSFETs"] COOLING_LEVEL3["Level 3: PCB Thermal Design
Control & POL MOSFETs"] COOLING_LEVEL1 --> VRM_HS1 COOLING_LEVEL1 --> VRM_LS1 COOLING_LEVEL2 --> FAN_SWITCH COOLING_LEVEL2 --> AUX_SWITCH COOLING_LEVEL3 --> VBQD3222U COOLING_LEVEL3 --> VBJ2456 end %% Monitoring & Communication MCU --> TEMP_SENSORS["Temperature Sensors"] MCU --> VOLTAGE_MONITORS["Voltage Monitors"] MCU --> CURRENT_MONITORS["Current Monitors"] MCU --> SYSTEM_MGMT["System Management Bus"] SYSTEM_MGMT --> REMOTE_MONITOR["Remote Monitoring"] %% Style Definitions style VRM_HS1 fill:#e8f5e8,stroke:#4caf50,stroke-width:2px style VRM_LS1 fill:#e8f5e8,stroke:#4caf50,stroke-width:2px style FAN_SWITCH fill:#e3f2fd,stroke:#2196f3,stroke-width:2px style AUX_SWITCH fill:#e3f2fd,stroke:#2196f3,stroke-width:2px style POL_SW1 fill:#fff3e0,stroke:#ff9800,stroke-width:2px style POL_SW2 fill:#fff3e0,stroke:#ff9800,stroke-width:2px style MCU fill:#fce4ec,stroke:#e91e63,stroke-width:2px

With the explosive growth of AI computing demands, high-performance training servers equipped with 8 GPUs have become the core infrastructure for deep learning. The power delivery system, serving as the "energy heart" of the entire server, must provide ultra-high current, high-efficiency, and precise power conversion for critical loads such as GPU VRMs, high-speed fans, and auxiliary controllers. The selection of power MOSFETs directly determines the system's power density, conversion efficiency, thermal performance, and overall reliability. Addressing the stringent requirements of AI servers for maximum uptime, energy efficiency, and power density, this article focuses on scenario-based adaptation to develop a practical and optimized MOSFET selection strategy.
I. Core Selection Principles and Scenario Adaptation Logic
(A) Core Selection Principles: Four-Dimensional Collaborative Adaptation
MOSFET selection requires coordinated adaptation across four dimensions—voltage, loss, package, and reliability—ensuring precise matching with the severe operating conditions within a server:
Sufficient Voltage Margin: For server power rails (12V, 48V, high-voltage DC bus), reserve a rated voltage withstand margin of ≥60% to handle massive transient spikes and bus ringing caused by GPU load steps.
Ultra-Low Loss Priority: Prioritize devices with extremely low Rds(on) to minimize conduction loss under multi-hundred-ampere currents, and low Qg/Qoss to reduce switching loss at high frequencies (200kHz-1MHz), directly improving PSU efficiency and reducing thermal stress.
Package & Thermal Matching: Choose packages with ultra-low thermal resistance (e.g., TO-247-4L, TO263) and excellent current capability for GPU VRM and primary side stages. Select compact packages (DFN, SOT) for point-of-load (POL) and control circuits, balancing power density and manufacturability.
High Reliability & Ruggedness: Meet 24/7 data center operation requirements, focusing on high junction temperature capability (≥175°C), strong avalanche energy rating, and excellent gate oxide integrity to withstand harsh electrical environments.
(B) Scenario Adaptation Logic: Categorization by Power Stage
Divide server power stages into three core scenarios: First, GPU Voltage Regulator Module (VRM) and Primary Side Conversion, requiring the highest current handling and efficiency. Second, High-Current Auxiliary & Fan Drive, requiring robust switching for cooling and support systems. Third, Control, Isolation & Protection Circuits, requiring precise on/off control and fault isolation for system safety and management.
II. Detailed MOSFET Selection Scheme by Scenario
(A) Scenario 1: GPU VRM & High-Current DC-DC Conversion – The Power Core
Multi-phase GPU VRMs (supporting 8 high-end GPUs) require handling continuous currents exceeding 1000A in aggregate, with extreme di/dt transients. Devices must offer ultra-low Rds(on) and optimized switching characteristics.
Recommended Model: VBGL1105 (Single-N, 100V, 125A, TO263)
Parameter Advantages: Utilizing advanced SGT technology, it achieves an ultra-low Rds(on) of 4mΩ at 10V. A continuous current rating of 125A (with high peak capability) is ideal for synchronous buck converter low-side or high-side applications in 48V intermediate bus architectures. The TO263 (D2PAK) package offers excellent power dissipation and is suitable for automated assembly.
Adaptation Value: Dramatically reduces conduction loss in multi-phase VRMs. For a 100A per-phase application, conduction loss can be below 4W per device, enabling VRM efficiency >94% at full load. Supports high-frequency multiphase operation, reducing the required output capacitance and improving transient response.
Selection Notes: Must be used in conjunction with a high-performance multiphase PWM controller. Careful PCB layout with a minimized high-current loop is critical. Parallel connection may be required for the highest current phases. Adequate heatsinking is mandatory.
(B) Scenario 2: High-Current Auxiliary Switching & Fan Drive – Support & Cooling
This includes control of server fan arrays (requiring high startup current) and switching for high-power auxiliary rails, demanding a balance of current capability, voltage rating, and ruggedness.
Recommended Model: VBM2625 (Single-P, -60V, -50A, TO220)
Parameter Advantages: A robust P-Channel MOSFET with Rds(on) as low as 19mΩ at 10V and a continuous drain current of -50A. The -60V drain-source voltage is suitable for high-side switching on 12V or 48V rails. The TO220 package provides a good balance of cost, current handling, and ease of heatsinking.
Adaptation Value: Excellent for high-side load switches controlling fan banks or auxiliary power modules. Its low Rds(on) minimizes voltage drop and power loss. The P-Channel configuration simplifies gate driving for high-side switches compared to N-Channel with bootstrap circuits.
Selection Notes: Verify the inrush current of the load (e.g., fan startup). Ensure the gate driver can provide sufficient sink current for fast turn-off. A TO220 package requires proper mounting to a chassis or heatsink for high continuous current operation.
(C) Scenario 3: Control, Protection & Point-of-Load (POL) Switching – System Intelligence
This encompasses low-voltage POL converters, hot-swap circuits, and safety isolation switches for various controller boards and sensors, requiring compact size, good efficiency, and logic-level compatibility.
Recommended Model: VBQD3222U (Dual-N+N, 20V, 6A per channel, DFN8(3x2)-B)
Parameter Advantages: A dual N-Channel MOSFET in a compact DFN package, saving significant PCB area. Features a low gate threshold (Vth) of 0.5-1.5V, making it directly drivable by 3.3V/5V MCU GPIO pins. Rds(on) is as low as 22mΩ at 4.5V Vgs.
Adaptation Value: Perfect for space-constrained POL switching, dual-channel load sharing, or as a synchronous switch in low-voltage DC-DC circuits. Enables intelligent power sequencing and granular power gating for different server components, reducing standby power.
Selection Notes: Keep load current well within the 6A per-channel rating, considering thermal limits of the small package. A minimal copper pad under the DFN package is necessary for heat dissipation. Use gate series resistors to dampen ringing in fast-switching control circuits.
III. System-Level Design Implementation Points
(A) Drive Circuit Design: Matching Device Characteristics
VBGL1105: Requires a dedicated high-current gate driver (e.g., 2A-4A source/sink capability) to achieve fast switching and minimize crossover loss. Proper gate resistor selection is critical to manage EMI and dv/dt.
VBM2625: Can be driven by a standard gate driver IC or a discrete bipolar stage. Ensure fast turn-off to prevent shoot-through in bridge configurations. A pull-up resistor on the gate is typically needed.
VBQD3222U: Can be driven directly by MCU pins for moderate speeds. For faster switching, a small buffer (e.g., transistor array) is recommended. Pay close attention to PCB trace inductance due to the high-speed switching loops.
(B) Thermal Management Design: Tiered and Aggressive Cooling
VBGL1105: Demands aggressive cooling. Mount on a dedicated heatsink attached to the server's thermal management system (cold plate or high-airflow zone). Use thermal interface material (TIM) of high quality. Extensive PCB copper pour with multiple thermal vias is essential.
VBM2625: Requires a heatsink for continuous high-current operation. The TO220 package allows for easy attachment to a chassis bar or a dedicated extruded heatsink.
VBQD3222U: Relies on PCB-level cooling. Provide a generous copper pad underneath the package connected to internal ground planes via thermal vias. Ensure adequate airflow over the board.
(C) EMC and Reliability Assurance
EMC Suppression: For VBGL1105 in VRM, use low-ESR/ESL input capacitors very close to the drain-source connection. Consider adding small RC snubbers across the drain-source of the high-side device to dampen high-frequency ringing. For VBM2625 controlling inductive fans, use flyback diodes or TVS for protection.
Reliability Protection:
Derating: Apply conservative derating. Operate VBGL1105 at ≤70-80% of its current rating under worst-case temperature.
Overcurrent Protection (OCP): Implement precise OCP at the VRM controller level. For discrete switches like VBM2625, use a current-sense amplifier or a fuse.
Transient Protection: Use TVS diodes on all power input lines and gate pins. Implement proper input filtering to suppress line-borne surges.
IV. Scheme Core Value and Optimization Suggestions
(A) Core Value
Maximized Power Density and Efficiency: The combination of ultra-low Rds(on) SGT/SiC devices and compact dual MOSFETs enables high-efficiency, high-frequency power conversion, allowing for more compact PSU designs and higher GPU power budgets.
Enhanced System Control and Safety: Granular control via logic-level MOSFETs enables advanced power management, while robust high-current devices ensure reliable operation under heavy computational loads.
Scalable and Serviceable Design: The use of standard, well-characterized packages (TO263, TO220, DFN) facilitates design scalability, testing, and potential field service.
(B) Optimization Suggestions
For Highest Efficiency (>98% Peak) in 48V-12V/1.xV Conversion: Consider pairing VBGL1105 with a SiC MOSFET (like VBP112MC50-4L for the primary side in isolated topologies) to minimize total system loss.
For Space-Critical POL Applications: For higher current needs in POL, consider the VBGE1102N (100V, 35A, TO252) as a robust alternative in a slightly larger package than DFN.
For Redundant Power & Hot-Swap Control: The P-Channel VBJ2456 (-40V, -6.2A, SOT223) offers a very compact solution for board-level hot-swap or OR-ing functions.
Special High-Voltage Applications: For PFC or offline auxiliary power supplies within the server PSU, the VBMB18R05S (800V, 5A, SJ_Multi-EPI) provides a cost-effective high-voltage solution.
Conclusion
Strategic MOSFET selection is paramount to unlocking the full performance potential of 8-GPU AI training servers, where power delivery is a key bottleneck. This scenario-adapted strategy provides a roadmap for optimizing efficiency, power density, and reliability through careful device matching and robust system design. Future evolution will involve deeper integration of wide-bandgap (SiC/GaN) devices and smart power stages to meet the ever-increasing power demands of next-generation AI hardware.

Detailed Topology Diagrams

GPU VRM & High-Current DC-DC Conversion Detail

graph LR subgraph "Multi-Phase VRM Architecture (Per GPU)" INPUT["48V Input"] --> PHASE1["Phase 1 Buck Converter"] INPUT --> PHASE2["Phase 2 Buck Converter"] INPUT --> PHASE3["Phase 3 Buck Converter"] INPUT --> PHASE4["Phase 4 Buck Converter"] subgraph "Phase 1 Power Stage" HS1["VBGL1105
High-Side MOSFET"] LS1["VBGL1105
Low-Side MOSFET"] INDUCTOR1["Output Inductor"] CAP1["Output Capacitor"] end subgraph "Phase 2 Power Stage" HS2["VBGL1105
High-Side MOSFET"] LS2["VBGL1105
Low-Side MOSFET"] INDUCTOR2["Output Inductor"] CAP2["Output Capacitor"] end PHASE_CONTROLLER["Multi-Phase PWM Controller"] --> GATE_DRIVER["High-Current Gate Driver"] GATE_DRIVER --> HS1 GATE_DRIVER --> LS1 GATE_DRIVER --> HS2 GATE_DRIVER --> LS2 HS1 --> SW_NODE1["Switching Node 1"] LS1 --> GND1 SW_NODE1 --> INDUCTOR1 INDUCTOR1 --> OUTPUT["GPU Core Voltage (1.xV)"] OUTPUT --> GPU_CORE["GPU Core Load"] HS2 --> SW_NODE2["Switching Node 2"] LS2 --> GND2 SW_NODE2 --> INDUCTOR2 INDUCTOR2 --> OUTPUT subgraph "Current Sensing & Protection" CS1["Current Sense Amplifier"] CS2["Current Sense Amplifier"] OVP["Over-Voltage Protection"] OCP["Over-Current Protection"] end CS1 --> PHASE_CONTROLLER CS2 --> PHASE_CONTROLLER OVP --> PHASE_CONTROLLER OCP --> PHASE_CONTROLLER end style HS1 fill:#e8f5e8,stroke:#4caf50,stroke-width:2px style LS1 fill:#e8f5e8,stroke:#4caf50,stroke-width:2px style HS2 fill:#e8f5e8,stroke:#4caf50,stroke-width:2px style LS2 fill:#e8f5e8,stroke:#4caf50,stroke-width:2px

High-Current Auxiliary & Fan Drive Topology Detail

graph LR subgraph "Fan Array Control System" FAN_MCU["Fan Controller MCU"] --> PWM_GEN["PWM Generator"] PWM_GEN --> LEVEL_SHIFTER["Level Shifter"] LEVEL_SHIFTER --> GATE_DRIVE_FAN["Gate Driver"] GATE_DRIVE_FAN --> FAN_MOSFET["VBM2625 P-MOSFET
High-Side Switch"] subgraph "Fan Bank Configuration" FAN1["High-Speed Fan"] FAN2["High-Speed Fan"] FAN3["High-Speed Fan"] FAN4["High-Speed Fan"] end FAN_MOSFET --> FAN1 FAN_MOSFET --> FAN2 FAN_MOSFET --> FAN3 FAN_MOSFET --> FAN4 FAN1 --> GND_FAN FAN2 --> GND_FAN FAN3 --> GND_FAN FAN4 --> GND_FAN subgraph "Fan Protection" TVS_FAN["TVS Diode Array"] CURRENT_SENSE_FAN["Current Sense Resistor"] THERMAL_FUSE["Thermal Fuse"] end TVS_FAN --> FAN_MOSFET CURRENT_SENSE_FAN --> FAN_MCU THERMAL_FUSE --> FAN_MOSFET end subgraph "Auxiliary Power Switching" AUX_CONTROLLER["Auxiliary Power Controller"] --> AUX_DRIVER["Gate Driver"] AUX_DRIVER --> AUX_MOSFET["VBM2625 P-MOSFET
High-Side Switch"] AUX_MOSFET --> AUX_LOAD["High-Power Auxiliary Load"] subgraph "Load Examples" SSD_ARRAY["SSD Array"] NETWORK_CARD["High-Speed Network Card"] COOLING_PUMP["Liquid Cooling Pump"] end AUX_LOAD --> SSD_ARRAY AUX_LOAD --> NETWORK_CARD AUX_LOAD --> COOLING_PUMP SSD_ARRAY --> GND_AUX NETWORK_CARD --> GND_AUX COOLING_PUMP --> GND_AUX subgraph "Auxiliary Protection" INRUSH_LIMIT["Inrush Current Limiter"] OVP_AUX["Over-Voltage Protection"] REVERSE_POLARITY["Reverse Polarity Protection"] end INRUSH_LIMIT --> AUX_MOSFET OVP_AUX --> AUX_CONTROLLER REVERSE_POLARITY --> AUX_MOSFET end style FAN_MOSFET fill:#e3f2fd,stroke:#2196f3,stroke-width:2px style AUX_MOSFET fill:#e3f2fd,stroke:#2196f3,stroke-width:2px

Control, Protection & POL Switching Topology Detail

graph LR subgraph "Dual-Channel POL Switching" MCU_GPIO["MCU GPIO (3.3V/5V)"] --> BUFFER["Buffer/Level Shifter"] BUFFER --> DUAL_MOSFET["VBQD3222U Dual N-MOSFET"] subgraph "Channel 1 Configuration" GATE1["Gate 1"] DRAIN1["Drain 1"] SOURCE1["Source 1"] end subgraph "Channel 2 Configuration" GATE2["Gate 2"] DRAIN2["Drain 2"] SOURCE2["Source 2"] end DUAL_MOSFET --> GATE1 DUAL_MOSFET --> DRAIN1 DUAL_MOSFET --> SOURCE1 DUAL_MOSFET --> GATE2 DUAL_MOSFET --> DRAIN2 DUAL_MOSFET --> SOURCE2 DRAIN1 --> POL_INPUT["3.3V/5V Input"] SOURCE1 --> POL_OUTPUT1["POL Output 1"] POL_OUTPUT1 --> SENSOR1["Sensor/Controller"] SENSOR1 --> GND_POL DRAIN2 --> POL_INPUT SOURCE2 --> POL_OUTPUT2["POL Output 2"] POL_OUTPUT2 --> SENSOR2["Sensor/Controller"] SENSOR2 --> GND_POL subgraph "POL Protection" TVS_POL["TVS Protection"] RC_SNUBBER["RC Snubber"] CURRENT_LIMIT["Current Limit"] end TVS_POL --> DUAL_MOSFET RC_SNUBBER --> DUAL_MOSFET CURRENT_LIMIT --> MCU_GPIO end subgraph "Hot-Swap & Safety Switching" HOTSWAP_CTRL["Hot-Swap Controller"] --> HOTSWAP_DRIVER["Driver"] HOTSWAP_DRIVER --> HOTSWAP_MOSFET["VBJ2456 P-MOSFET"] subgraph "Hot-Swap Protection" CURRENT_SENSE_HS["Current Sense"] VOLTAGE_MONITOR["Voltage Monitor"] TIMER_FAULT["Timer & Fault Management"] end HOTSWAP_MOSFET --> BOARD_POWER["Board Power Input"] BOARD_POWER --> LOAD_CIRCUIT["Load Circuit"] LOAD_CIRCUIT --> GND_BOARD CURRENT_SENSE_HS --> HOTSWAP_CTRL VOLTAGE_MONITOR --> HOTSWAP_CTRL TIMER_FAULT --> HOTSWAP_CTRL end subgraph "System Protection Network" subgraph "Transient Protection" TVS_48V["48V TVS Array"] TVS_12V["12V TVS Array"] TVS_3V3["3.3V TVS Array"] end subgraph "Current Monitoring" SHUNT_48V["48V Shunt Resistor"] SHUNT_12V["12V Shunt Resistor"] SHUNT_VRM["VRM Shunt Resistor"] end subgraph "Temperature Monitoring" NTC_VRM["VRM NTC Sensor"] NTC_AUX["Auxiliary NTC Sensor"] NTC_AMBIENT["Ambient NTC Sensor"] end TVS_48V --> PROTECTION_MCU["Protection MCU"] TVS_12V --> PROTECTION_MCU TVS_3V3 --> PROTECTION_MCU SHUNT_48V --> PROTECTION_MCU SHUNT_12V --> PROTECTION_MCU SHUNT_VRM --> PROTECTION_MCU NTC_VRM --> PROTECTION_MCU NTC_AUX --> PROTECTION_MCU NTC_AMBIENT --> PROTECTION_MCU PROTECTION_MCU --> FAULT_OUTPUT["Fault Output Signal"] end style DUAL_MOSFET fill:#fff3e0,stroke:#ff9800,stroke-width:2px style HOTSWAP_MOSFET fill:#fff3e0,stroke:#ff9800,stroke-width:2px
Download PDF document
Download now:VBQD3222U

Sample Req

Online

Telephone

400-655-8788

WeChat

Topping

Sample Req
Online
Telephone
WeChat