ÐÂÎÅÖÐÐÄ

news center

µ±Ç°Î»ÖãºÊ×Ò³ > ÐÂÎÅÖÐÐÄ > ÆóÒµ¶¯Ì¬

Ã×À¼(milan)-ÈôóÄ£ÐÍѵÁ·¸ü¸ßЧ£¬ÆæÒìĦ¶ûÓû¥Áª´´Ð·½°¸¶¨ÒåÏÂÒ»´úAI¼ÆËã

ʱ¼ä£º2025-12-03 14:14:50

×÷ΪÐÐÒµÁìÏȵÄAIÊÕ¼¯È«Õ»Ê½»¥Áª²úÎïºÍ½â¾ö·½°¸ÌṩÉÌ£¬ÆæÌØÄ¦¶û¸ø³öÁËÒ»Ì×¼«¾ß¾ºÕùÁ¦µÄ½â¾ö·½°¸¡ª¡ª»ùÔڸ߻úÄÜRDMA¼°Chiplet¼¼ÄÜ£¬Ê¹Óá°Scale?Out¡±¡°Scale?Up¡±¡°Scale?Inside¡±ÈýÄêÒ¹ÀíÄ½úÉýËãÁ¦»ù´¡¾Ù´ë´ëÊ©ÓÚÍø¼ä¡¢Æ¬¼ä¼°Æ¬ÄڵĴ«ÊäЧÂÊ£¬ÎªÖÇÄÜËãÁ¦³É³¤¸³ÄÜ¡£

½üÒ»¶Îʱ¼äÒÔÀ´£¬DeepSeekÕ÷Ïó¼¶±¬»ð¼¤·¢²Æ²ú¶ÔÓÚÄêÒ¹·¶Î§Êý¾ÝÖÐÑëÉèÖÃ×°±¸°ÚÉèµÄ˼Ë÷¼°ÕùÒé¡£ÓÚÁ·Ï°¶Ë£¬DeepSeekÒÔ¿ªÔ´Ä£×Ó¾­Óɹý³ÌËã·¨ÓÅ»¯£¨ÈçÏ¡Éټƽϡ¢¶¯Ì¬¼Ü¹¹£©½µµÍÁËÁ·Ï°³É±¾£¬Ê¹»¼ÉÏÆóÒµ¿ÉÒÔ»òÐíÒԵͳɱ¾ÊµÏָ߻úÄÜAIÄêҹģ×ÓµÄÁ·Ï°£»ÓÚÍÆÀí¶Ë£¬DeepSeek¼Ó¿ìÁËAIÔËÓôÓÁ·Ï°ÏòÍÆÀí½×¶ÎµÄǨáã¡£ÊÇÒÔ£¬Óв»ÑÅµã³Æ£¬DeepSeekÒÔºóËãÁ¦ÐèÇ󽫷Żº¡£²»Í⣬¸ü¶àµÄº£ÄÚÍâ»ú̸ÅÐÑб¨ÈÏΪ£¬DeepSeek½µµÍÁËAIÔËÓõÄÃÅ¿²£¬½«¼Ó¿ìAIÄêҹģ×ÓÔËÓÃÂ䵨£¬ÎüÒý¸ü¶àµÄÆóÒµ½øÈëÕâ¸öÈüµÀ£¬ËãÁ¦ÐèÇóÈÔ½«¼Ì³ÐÔö¼Ó£¬²»ÍâÐèÇóÖØÐÄ´Ó“µ¥¿¨·åÖµ»úÄܔתÏò“¼¯ÈºÄÜЧÓÅ»¯”¡£ºÃ±È£¬SemiAnalysis²Â²â£¬È«ÊÀ½çÊý¾ÝÖÐÑëÈÝÁ¿½«´Ó2023ÄêµÄ49GWÔö¼ÓÖÁ2026ÄêµÄ96GW£¬´ËÖÐн¨ÖÇËãÖÐÑëÈÝÁ¿½«Õ¼ÔöÁ¿µÄ85%¡£½üÈÕ£¬È«ÊÀ½çËÄÄêÒ¹¾ÞÍ·£¨Meta¡¢ÑÇÂíÑ·¡¢Î¢ÈíºÍ£©Ðû²¼µÄ2025AI»ù´¡¾Ù´ë´ëʩ֧³ö¹²¼Æ³¬3000ÒÚÃÀÔª£¬±ÈÄâ2024ÄêÔö¼Ó30%¡£HwResmc

HwResmc

ͼ1£ºÈ«ÊÀ½çËÄÄêÒ¹ÔÆ³§ÉÌ2025Ä걾Ǯ¿ªÖ§Êý¾ÝÀ´Àú£º¿Æ¼¼¾ÞÍ·¹«È»Åû¶³ÂËßHwResmc

HwResmc

ͼ2£ºÆæÌØÄ¦¶û¿ª´´È˼æCEOÌïݳ¿HwResmc

ÆæÌØÄ¦¶û¿ª´´È˼æCEOÌïݳ¿°µÊ¾£º“‘ScalingLaw’ÒÀÈ»ÓÚÑÓÐø¡£´ÓTransformerµÄ¶ÀÁì·çÁ÷µ½MoEר¼ÒÄ£×ÓµÄÁ¢ÒìͻΧ£¬AI·¶³ëÕýÂõÏòÍòÒÚ¡¢ÉõÖÁÊ®ÍòÒÚ²ÎÊý·¶Î§µÄAIÄêҹģ×ÓÁ·Ï°Ê±´ú¡£DeepSeek-R1ÍÆÀíÄ£×ÓµÄÎÊÊÀÀë²»¿ª»ù´¡Ä£×ÓDeepseek-V3µÄÖØ´óÁ·Ï°¶Ñ¼¯¡£ÓÚÕâÒ»Åä¾°Ï£¬Ç¿Ê¢µÄËãÁ¦¼¯ÈºÒÀÈ»ÊÇÖ§³ÅAIµÄ»ùʯ¡£¶øÔõÑùÌá¸ß¼¯ÈºµÄÏßÐÔ¼Ó¿ì±È£¬Ò»Ö±ÊDzƲúµÄ½¹µã»°Ìâ¡£Óë´Ëͬʱ£¬AIËãÁ¦ÊÕ¼¯µÄÖ÷ÒªÐÔÈÕÇ÷͹ÏÔ£¬ËüÈÃÊý¾ÝÓÚ¼¯ÈºÖи÷¸ö²ãÃæ¡¢¸÷¸öά¶ÈÉ϶¼¿ÉÒÔ»òÐí¿ìËÙ´«Ê䣬ʵÏÖ¸÷½Úµã×ÊÔ´µÄ¸ßЧµ÷¶¯¡£”HwResmc

Ϊ´Ë£¬×÷ΪÐÐÒµÁìÏȵÄAIÊÕ¼¯È«Õ»Ê½»¥Áª²úÎïºÍ½â¾ö·½°¸ÌṩÉÌ£¬ÆæÌØÄ¦¶û¸ø³öÁËÒ»Ì×¼«¾ß¾ºÕùÁ¦µÄ½â¾ö·½°¸——»ùÔڸ߻úÄÜRDMA¼°Chiplet¼¼ÄÜ£¬Ê¹ÓÓScaleOut”“ScaleUp”“ScaleInside”ÈýÄêÒ¹ÀíÄ½úÉýËãÁ¦»ù´¡¾Ù´ë´ëÊ©ÓÚÍø¼ä¡¢Æ¬¼ä¼°Æ¬ÄڵĴ«ÊäЧÂÊ£¬ÎªÖÇÄÜËãÁ¦³É³¤¸³ÄÜ¡£HwResmc

ScaleOut——´òÆÆÌåϵ´«ÊäÆ¿¾±

DeepSeekµÄÀÖ³É֤ʵÁË¿ªÔ´Ä£×ÓÏà½ÏÔÚ±ÕÔ´Ä£×Ӿ߱¸±ØÈ»µÄÓÅʤÐÔ£¬¸ú×ÅÄ£×ÓµÄÖÇÄÜ»¯Ç÷ÏòÑݽø£¬Ä£×ÓÌåÁ¿µÄÔö³¤ÈԾɻáÊÇÐÐÒµ³É³¤µÄÖØÒªÇ÷ÏòÖ®Ò»¡£ÎªÁËÍê³ÉǧÒÚ¡¢ÍòÒÚ²ÎÊý·¶Î§AIÄêҹģ×ÓµÄÁ·Ï°Ê¹Ãü£¬Í¨ÓõÄ×ö·¨Ò»°ã»á²ÉÓÃTensor²¢ÐУ¨TP£©¡¢Pipeline²¢ÐУ¨PP£©¡¢¼°Data²¢ÐУ¨DP£©¼ÆÄ±À´²ð·ÖÁ·Ï°Ê¹Ãü¡£¸ú×ÅMoE£¨MixtureofExperts£¬»ìÏýר¼Ò£©Ä£×ӵijÊÏÖ£¬³ýÁËÁË´¥¼°ÉÏÊö²¢ÐмÆÄ±Í⣬»¹ÓÐÒýÈëÁËר¼Ò²¢ÐÐ(EP)¡£´ËÖУ¬EP¼°TPͨѶÊý¾Ý¿ªÏû½ÏÄêÒ¹£¬ÖØÒª¾­Óɹý³ÌScaleUp»¥Áª·½Ê½Ó¦´ð¡£DP¼°PP²¢ÐмƽϵÄͨѶ¿ªÏûÏà¶ÔÓÚ½ÏС£¬ÖØÒª¾­Óɹý³ÌScaleOut»¥Áª·½Ê½Ó¦´ð¡£HwResmc

ÓÚÊÇ£¬ÒÔÏÂͼËùʾ£¬µ±ÏÂÖ÷Á÷µÄÍò¿¨¼¯ÈºÀï´æÓÚÁ½ÖÖ»¥ÁªÓò——GPUÄÏÏòScaleUp»¥ÁªÓò£¨ScaleUpDomain£¬SUD£©¼°GPU±±ÏòScaleOut»¥ÁªÓò£¨ScaleOutDomain£¬SOD£©¡£Ìïݳ¿¿ä´ó£º“ÒÔScaleUp¼°ScaleOutË«ÇæÇý¶¯·½Ê½¹¹½¨ÄêÒ¹·¶Î§¡¢¸ßЧµÄÖÇË㼯Ⱥ£¬ÊÇÓ¦´ðËãÁ¦ÐèÇó·¢×÷µÄÓÐÓÃÊÖÍó¡£”HwResmc

HwResmc

ͼ3£ºÖÇË㼯ȺÀïµÄScaleUp¼°ScaleOutHwResmc

ÓÚÕâ¸ö¼¯ÈºÊÕ¼¯ÖУ¬ScaleOutרעÔÚºáÏò/³Ì¶ÈµÄÀ©´ó£¬¿ä´ó¾­Óɹý³ÌÔö³¤¸ü¶à¼Æ½Ï½ÚµãʵÏÖ¼¯Èº·¶Î§µÄÀ©´ó¡£µ±Ç°£¬³¤Í¾Ö±½ÓÄÚ´æ°Ýºò£¨RDMA£©ÒѾ­¾­³ÉΪ¹¹½¨ScaleOutÊÕ¼¯µÄÖ÷Á÷Ñ¡Ôñ¡£×÷ΪһÖÖhost-offload/host-bypass¼¼ÄÜ£¬RDMAÌṩÁË´Óһ̨¼Æ½Ï»úÄÚ´æµ½ÁíÍâһ̨¼Æ½Ï»úÄÚ´æµÄÖ±½Ó°Ýºò£¬¾ß±¸µÍÑÓ³Ù¡¢¸ß´ø¿íµÄÌØÕ÷£¬ÓÚÄêÒ¹·¶Î§¼¯ÈºÖÐÊÎÑÝ×ÅÖ÷ÒªµÄ½ÅÉ«¡£ÒÔÏÂͼËùʾ£¬RDMAÖØÒª°üÂÞ‌InfiniBand£¨IB£©¡¢»ùÔÚÒÔÌ«ÍøµÄRoCE¼°»ùÔÚTCP/IPµÄiWARP‌¡£´ËÖУ¬IB¼°ÒÔÌ«ÍøRDMAÊÇËãÁ¦¼¯ÈºÀïÔËÓÃ×î¹ã·ºµÄ¼¼ÄÜ¡£HwResmc

HwResmc

ͼ4£ºRDMAÔËÓü°ÊµÏÖ·½Ê½Í¼Æ¬À´Àú£ºÖªºõ@SavirHwResmc

IBÊÇרÃÅΪRDMA¿ª·¢µÄÒ»ÖÖÊÕ¼¯Í¨Ñ¶¼¼ÄÜ£¬¾ß±¸¸ß´ø¿í¡¢µÍÑÓ³ÙµÈÉϷ磬ÇÒIBĬÐíÊÇÎÞËðÊÕ¼¯£¬ÎÞÐè·Ç·²ÉèÖ᣻¼ÉÏÒæÔÚÕâЩÉϷ磬¹ýÍùIBÓÚScaleOutÊÕ¼¯¹¹½¨ÖÐÅ̾áÖ÷µ¼Ö°Î»µØ·½¡£È»¶ø£¬IBÐèҪרÃųųָü¼ÄܵÄÍø¿¨¼°»¥»»»ú£¬¼Û¸ñÊÇ´«Í³ÊÕ¼¯µÄ5-10±¶£¬³ÉÕæÏà¶ÔÓڽϸߣ¬ÇÒIB»¥»»»ú½»ÆÚ½Ï³¤¡£Í¬Ê±£¬IB¼æÈÝÐԲÄÑÒÔ¼°ÄêÒ¹´ó¶¼ÒÔÌ«Íø×°±¸¼æÈÝ£¬ÀýÈçÍø¿¨¡¢ÏßÀ¡¢»¥»»»ú¼°Â·ÓÉÆ÷µÈ£¬Ã»·¨³ÉΪÐÐҵͬһµÄ³É³¤Ïß·¡£HwResmc

¸ú׿¯Èº·¶Î§ÔöÄêÒ¹£¬ÒÔÌ«ÍøRDMAµÃµ½ÁËÖ÷Á÷³§É̵Ĺ㷺³Å³Ö¡£ÒÔÌ«ÍøRDMAÒ»Ñù¾ß±¸¸ßËÙ¶È¡¢¸ß´ø¿í¡¢CPU¸ºÔص͵ÈÉϷ磬ÓÚµÍʱÑÓ¼°ÎÞËðÊÕ¼¯ÌØÕ÷·½ÃæÒ²ÒѾ­¾­¼°IB»úÄܳ֯½¡£Í¬Ê±£¬ÒÔÌ«ÍøRDMA¾ß±¸¸üºÃµÄ¿ª·ÅÐÔ¡¢¼æÈÝÐÔ¼°Í¬Ò»ÐÔ£¬¸üÀûÔÚ×öÄêÒ¹·¶Î§µÄ×éÍø¼¯Èº¡£´ÓһЩÐÐÒµ´ú±íÐÔ°¸ÀýÀ´¿´£¬Èç×Ö½ÚÌø¶¯µÄÍò¿¨¼¯Èº£¬Meta¹«Ë¾µÄÊýÍò¿¨¼¯Èº£¬ÒÔºÍÌØË¹À­µ«Ô¸´òÔìµÄÊ®Íò¿¨¼¯Èº£¬¶¼Ò»ÖÂÑ¡ÔñÁËÒÔÌ«Íø·½°¸¡£´ËÍ⣬ÓÉÓÚÓ²¼þͨÓü°ÔËά¼òÆÓ£¬ÒÔÌ«ÍøRDMA·½°¸¸ü¾ßÐԼ۱ȡ£HwResmc

ËäÈ»ÒÔÌ«ÍøRDMAÒѾ­¾­±»¹«ÈÏÊǽ«À´ScaleOutµÄÄêÒ¹Ç÷Ïò£¬²»ÍâÌïݳ¿Ö¸³ö£º“¼ÙÈçÊÇ»ùÔÚRoCEv2¹¹½¨·½°¸ÈÔ´æÓÚһЩÎÊÌ⣬ºÃ±ÈÂÒÐòÐèÒªÖØ´«£¬¸ºÔطֹܲ»ÍêÉÆ£¬´æÓÚGo-back-NÎÊÌ⣬ÒÔºÍDCQCN²¿Êðµ÷ÓÅ·±Ôӵȡ£ÓÚÍò¿¨¼°Ê®Íò¿¨¼¯ÈºÖУ¬Òµ½çÐèÒª¼ÓÇ¿ÐÍÒÔÌ«ÍøRDMAÒÔÓ¦´ðÉÏÊöÕâЩÌôÕ½£¬³¬ÒÔÌ«Íø´«Êä(UltraEthernetTransport£¬UET)¼´ÊÇÏÂÒ»´úAI¼Æ½Ï¼°HPCÀïµÄÒªº¦¼¼ÄÜ¡£”HwResmc

ΪÁË¿ÉÒÔ»òÐí½øÒ»²½²ûÑïÒÔÌ«Íø¼°RDMA¼¼ÄܵÄDZÄÜ£¬²©Í¨¡¢Ë¼¿Æ¡¢Arista¡¢Î¢Èí¡¢MetaµÈ¹«Ë¾Ç£Í·½¨Á¢Á˳¬ÒÔÌ«ÍøÍ¬ÃË£¨UEC£©¡£ÒÔÏÂͼËùʾ£¬ÓÚUEC¹æ·¶1.0µÄÔ¤ÀÀ°æ±¾ÖУ¬UEC´ÓÈí¼þAPI¡¢ÔËÊä²ã¡¢Á´Â·²ã¡¢ÊÕ¼¯°²È«¼°¶ÂÈû½ÚÖÆµÈ·½ÃæÁÙTransportLayer´«Êä²ã×öÁËÖÜÈ«µÄÓÅ»¯£¬Òªº¦¹¦Ð§°üÀ¨FEC£¨Ç°Ïò¾À´í£©Í³¼Æ¡¢Á´Â·²ãÖØ´«£¨LLR£©¡¢¶à·¾¶±¨ÎÄÅç·¢¡¢ÐÂÒ»´ú¶ÂÈû½ÚÖÆ¡¢½Ã½ÝÅÅÐò¡¢¶Ëµ½¶ËÒ£²â¡¢»¥»»»úÐ¶ÔØµÈ¡£°´ÕÕAMD·½ÃæµÄÊý¾Ý£¬UEC¾ÍÐ÷£¨UEC-ready£©Ìåϵ¿ÉÒÔ»òÐíÌṩ±È´«Í³RoCEv2Ìåϵ³¬³ö¿çÔ½5-6±¶µÄ»úÄÜ¡£HwResmc

HwResmc

ͼ5£ºUEC¹æ·¶1.0ʾÓÃÒâͼƬÀ´Àú£ºUECHwResmc

Ìïݳ¿°µÊ¾£º“UECÊÇרÃÅΪAIÊÕ¼¯ScaleOut»¥Áª½¨Á¢µÄ¹ú¼ÊͬÃË£¬ÖÂÁ¦ÔÚ¾­Óɹý³ÌModernizedRDMAÓÅ»¯AI¼°HPCÊÂÇé¸ºÔØ¡£½èÖúUECµÄÒªº¦»úÄÜ£¬ScaleOutÊÕ¼¯¿ÉÒÔ»òÐí³äʵʹÓÃÌåϵÄÚËùÓпÉÓõĴ«Êä·¾¶£¬²¢×îС»¯ÊÕ¼¯¶ÂÈû¡£µ±Ç°»ùÔÚRDMARoCEµÄ½â¾ö·½°¸½«À´Ò²Äܹ»¾­Óɹý³Ì¼ùÐÐUECͬÃ˵ij߶Ƚø¼¶¸÷×ÔµÄÒÔÌ«Íø²úÎï·½°¸£¬´òÔì¸üÄêÒ¹·¶Î§µÄÎÞËð¼¯ÈºÍ¨Ñ¶¡£”HwResmc

ÆæÌØÄ¦¶û´òÔìµÄKiwiNDSA-SNICAIÔ­ÉúÖÇÄÜÍø¿¨¼´ÊÇÒ»¿îUEC¾ÍÐ÷·½°¸£¬»úÄܱȼçÈ«ÊÀ½ç±ê¸ËASIC²úÎï¡£KiwiNDSASmartNICÌṩÁìÏÈÐÐÒµµÄ¸ß»úÄÜ£¬³Å³Ö¸ß´ï800GbpsµÄ´«Êä´ø¿í£¬ÌṩµÍÖÁμs¼¶µÄÊý¾Ý´«ÊäÑÓʱ£¬ÂúÒ⵱ǰÊý¾ÝÖÐÑëÐÐÒµ400Gbps-800Gbps½ø¼¶ÐèÇ󣬿ÉʵÏÖTb¼¶±ðÍò¿¨¼¯Èº¼äÎÞËðÊý¾Ý´«Êä¡£HwResmc

HwResmc

ͼ6£ºÆæÌØÄ¦¶ûKiwiNDSA-SNICAIÔ­ÉúÖÇÄÜÍø¿¨·½°¸Í¼Æ¬À´Àú£ºÆæÌØÄ¦¶ûHwResmc

½èÖúUEC¾ÍÐ÷RDMAÖеÄ·¾¶¸ÐÖª¶ÂÈû½ÚÖÆ¡¢ÓÐÐò¶¯¾²Í¨±¨¡¢Ñ¡ÔñÐÔÈ·ÈÏÖØ´«¡¢×Ô˳Ӧ·ÓɺÍÊý¾Ý°üÅçÈ÷µÈÒªº¦¹¦Ð§£¬KiwiNDSA-SNIC¿ÉÒÔ»òÐí³äʵ±£ÕÏAIÊÕ¼¯¼äÊý¾ÝµÄ²»±ä´«Êä¡£ºÃ±È£¬KiwiNDSA-SNICÌṩµÄ×Ô˳Ӧ·ÓɺÍÊý¾Ý°üÅçÈ÷¹¦Ð§¿ÉÒÔ³äʵ²ûÑï¸ßËÙÊÕ¼¯µÄ»úÄÜ£¬³Å³Ö¸ß¼¶·Ö×éÅçÈ÷£¬Ìṩ¶à·¾¶Êý¾Ý°ü´«Ëͼ°Ï¸Á£¶È¸ºÔؾùºâ£¬ÓÐÓÃÓ¦´ð´«Êä¶ÂÈû¡£²»ÒìÓÃÀý»¹ÓÐÓУº¾­Óɹý³ÌÓÐÐò¶¯¾²Í¨±¨£¨In-OrderMessageDelivery£©À´½µµÍÌåϵÑÓ³Ù£¬¾­Óɹý³Ì·¾¶¸ÐÖª¶ÂÈû½ÚÖÆ£¨PathAwareCongestionControl£©À´ÓÅ»¯¶à¸ö·¾¶µÄÊý¾Ý°üÁ÷£¬µÈµÈ¡£HwResmc

´ËÍ⣬KiwiNDSA-SNIC»¹ÓÐÓµÓÐÐí¶àÆäËûµÄÒªº¦ÌØÕ÷¡£ºÃ±È£¬KiwiNDSA-SNIC¾ß±¸¾«²ÊµÄ¸ß²¢·¢ÌØÕ÷£¬³Å³Ö¶à´ïÊý°ÙÍò¸öÐÐÁв½¶Ó¶ÔÓÚ£¬¿ÉÀ©´óÄÚ´æ¿Õ¼äµ½´ïGB£»KiwiNDSA-SNIC¾ß±¸¿É±à³ÌÐÔ£¬¿ÉÓ¦´ð¸÷ÀàÊÕ¼¯Ê¹Ãü¼Ó¿ì£¬ÎªScaleOutÊÕ¼¯´øÀ´Á¬ÐøÁ¢ÒìµÄ¹¦Ð§£¬²¢°ü¹ÜÓ뽫À´µÄÐÐÒµ³ß¶ÈÎÞ·ì¼æÈÝ¡£HwResmc

×ۺ϶øÑÔ£¬ÆæÌØÄ¦¶ûµÄKiwiNDSA-SNICAIÔ­ÉúÖÇÄÜÍø¿¨ÊÇÒ»¸öÓµÓи߻úÄÜ¡¢¿É±à³ÌµÄScaleOutÊÕ¼¯ÒýÇæ£¬½«¿ªÆôAIÊÕ¼¯ScaleOut³É³¤µÄÐÂÆªÕ¡£Ìïݳ¿³Æ£º“µ±Ç°£¬ÆæÌØÄ¦¶ûÒѾ­¾­³ÉΪUECͬÃ˳ÉÔ±¡£¸ú×ÅÒÔÌ«ÍøÖð½¥¹ý¶Éµ½³¬ÒÔÌ«Íø£¬ÆæÌØÄ¦¶ûÔ¸ÁªñÇͬÃË»ï°éÅäºÏÇд貢¼ùÐÐScaleOutÏà¸É³ß¶ÈµÄÖÆ¶©¼°ÍêÃÀ£¬²¢µÚһʱ¼äΪÐÐÒµ´øÀ´»úÄÜÁìÏȵÄUEC·½°¸£¬±Þ²ßAIÊÕ¼¯ScaleOut¼¼ÄÜÏòǰ³É³¤¡£”HwResmc

HwResmc

ͼ7£ºÆæÌØÄ¦¶ûUEC»áԱͼƬÀ´Àú£ºUEC¹ÙÍøHwResmc

ScaleUp——ÈüƽÏоƬ¹²Í¬¸ü¸ßЧ

¼°ºáÏò/³Ì¶ÈÀ©´óµÄScaleOut²î±ð£¬ScaleUpÊÇ´¹Ö±/ÏòÉÏÀ©´ó£¬·½ÕëÊÇ´òÔì»úÄڸߴø¿í»¥ÁªµÄ³¬½Úµã¡£ÉÏÊöÌáµ½£¬TPÕÅÁ¿²¢ÐÐÒÔºÍEPר¼Ò²¢ÐÐÐèÒª¸ü¸ßµÄ´ø¿í¼°¸üµÍµÄʱÑÓÀ´¾ÙÐÐÈ«¾Öͬ²½¡£¾­Óɹý³ÌScaleUpµÄ·½Ê½£¬½«¸ü¶àµÄËãÁ¦Ð¾Æ¬GPU¼¯Öе½Ò»¸ö½ÚµãÉÏ£¬³¤¶Ì³£ÓÐÓõÄÓ¦´ð·½Ê½¡£Èç½ñµÄScaleUpÏÖʵÉϾÍÊÇÒ»¸öÒÔ³¬¸ß´ø¿íΪ½¹µãµÄ»úÄÚGPU-GPU×éÍø·½Ê½£¬»¹ÓÐÓÐÒ»¸öÃû³ÆÊdz¬´ø¿íÓò£¨HBD£¬HighBandwidthDomain£©¡£HwResmc

Ӣΰ´ïGB200NVL72µÄÍÆ³öÒýÁìן£ÄÚÍâAIÊÕ¼¯Éú̬¶ÔÓÚHBD¼¼ÄܵĹ㷺Çд衣Ӣΰ´ïGB200NVL72°ìÊÂÆ÷ÊÇÒ»¸öµäÐ͵ij¬ÄêÒ¹HBD£¬ÊµÏÖÁË36×éGB200£¨36¸öGraceCPU£¬72¸öB200GPU£©Ö®¼äµÄ³¬¸ß´ø¿í»¥Áª¡£ÓÚÕâ¸öHBDÌåϵÀµÚÎå´úNVLinkÊÇ×îÒªº¦µÄ£¬Ëü¿ÉÒÔ»òÐíÌṩGPU-GPUÖ®¼äË«Ïò1.8TBµÄ´«ÊäËÙ¶È£¬Ê¹»¼ÉÏÕâ¸öHBDÌåϵ¿ÉÒÔ×÷Ϊһ¸öÄêÒ¹ÐÍGPUÈ¥ÀûÓã¬Á·Ï°Ð§ÂÊÏà½ÏÔÚH100Ìåϵ½úÉýÁË4±¶£¬ÄÜЧ½úÉýÁË25±¶¡£HwResmc

HwResmc

ͼ8£ºNVL72»¥Áª¼Ü¹¹Í¼Æ¬À´Àú£ºÓ¢Î°´ïHwResmc

¼°IBͬÑù£¬NVLinkÒ²ÊÇÓÉӢΰ´ïÖ÷µ¼£¬ËäÈ»»úÄÜÇ¿¾¢¿ÉÊÇÉú̬¹Ø±Õ£¬Ö»°ìÊÂÔÚӢΰ´ïµÄ¸ß¶ËGPU¡£ÒòΪûÓÐNVLink¼°NVSwitchÈçÐíµÄ¼¼ÄÜ£¬´ËǰÆäËû³§ÉÌÖØÒª²ÉÓÃfullmesh»òÕßÕßcube-mesh²¼¾Ö£¬ÒÔ8¿¨»¥ÁªÎªÖ÷£¬¶ø16-32¿¨»¥ÁªÊÇÏÂÒ»´ú·½°¸¡£HwResmc

DeepSeekÊÂÎñ¼¤·¢ÁËÒµ½ç¶ÔÓÚÔÚÉÏÊöNVLink¼°HBDÐèÇóµÄ²î±ðÔ¤ÆÚ¡£µ«Öг־óɳ¤À´¿´£¬±ÈÄâÈí¼þµü´úËÙÂÊÒÔСʱÀ´¼Æ½Ï£¬Ó²¼þµÄµü´úÔòÒò´ËÄêΪ¼Æ½ÏµÄ°´²¿¾Í°àÀú³Ì£¬²»»áÒ»»Ó¶ø¾Í¡£¾ÝSemiAnalysis¹À¼ÆÄêÒ¹ÐÍÄ£×ӵij߶ÈÖ»»á¸ú׎«À´µÄÄ£×Ó·¢²¼¶ø¼Ì³ÐÉý¸ß£¬µ«´Ó¾­¼ÃЧÓÃÉÏÀ´½²£¬ÆäËù¶ÔÓÚÓ¦µÄÓ²¼þ±ØÐè¶ÔÖÅÀûÓò¢ÓÐÓÃ4-6Ä꣬¶ø²»µ¥µ¥ÊÇÖ±µ½ÏÂÒ»¸öÄ£×Ó·¢²¼¡£HwResmc

¶ÔÓÚ´Ë£¬Ìïݳ¿ÈÏΪ£º“½«À´MoEÄ£×ӵĽø½×Ïß·ÓÚ±ØÈ»Ë®Æ½ÉÏ´æÓÚ²»È·¶¨ÐÔ£¬Á¢ÒìËæÊ±¿ÉÄÜ·¢Éú¡£µ«¹ú²úAIÊÕ¼¯µÄÉú̬±Õ»·ÊÆÓÚ±ØÐС£Ó¢Î°´ïNVLink¼°CudaµÄ»¤³ÇºÓÈԾɴæÓÚ£¬ÆðÊ×Òª½â¾öScaleUp»¥Áª¹ú²úÌæ»»·½°¸ÓÐÎÞµÄÎÊÌ⣬ÔÙÀ´¿´×÷µ½ÄÄÒ»ÖÖˮƽ¡£½«À´¸ú׏ú²úÄêҹģ×Ó¡¢Ð¾Æ¬¼Ü¹¹µÈÈíÓ²¼þÉú̬µÄЭͬ³É³¤£¬ÓÐÍûÂýÂýʵÏÖ¹ú²úËãÁ¦±Õ»·¡£”HwResmc

Èç½ñ£¬¿Æ¼¼¾ÞÍ·Õý½áºÏÉú̬ÉÏÏÂÁ÷ÓÚGPU-GPU¸ßЧ»¥Áª·½ÃæÖØÒª·ÖΪÁ½¸öÃÅ»§£ºÄÚ´æÓïÒå¼°¶¯¾²ÓïÒå¡£ÄÚ´æÓïÒåLoad/Store/AtomicÊÇGPUÄÚ²¿×ÜÏß´«ÊäµÄÔ­ÉúÓïÒ壬Ӣΰ´ïNVLink¼´ÊÇ»ùÔÚÄÚ´æÓïÒ壬¶ÔÓÚ±êNVLinkµÄUAlinkµÈÒ²ÊÇ»ùÔÚÕâÀàÓïÒ壻¶¯¾²ÓïÒåÔòÊDzÉÓýüËÆScaleOutµÄDMAÓïÒåSend/Read/Write£¬½«Êý¾Ý¾ÙÐдò°ü´«Ê䣬ÑÇÂíÑ·¼°TenstorrentµÈ¹«Ë¾¼´ÊÇ»ùÔÚ¶¯¾²ÓïÒå´òÔìScaleUp»¥Áª·½°¸¡£HwResmc

ÄÚ´æÓïÒå¼°¶¯¾²ÓïÒå¸÷ÓÐËù³¤¡£ÄÚ´æÓïÒåÊÇGPUÄÚ²¿´«ÊäµÄÔ­ÉúÓïÒ壬´¦Öóͷ£Æ÷³Ðµ£¸üС£¬ÓÚÊý¾Ý°üÌåÁ¿Ð¡Ê±Ð§Âʸü¸ß£»¶¯¾²ÓïÒå²ÉÓÃÊý¾Ý´ò°üµÄ·½Ê½£¬¸ú×ÅÊý¾Ý°üÌåÁ¿±äÄêÒ¹£¬»úÄÜÖð½¥×·ÉÏÁËÄÚ´æÓïÒ壬¸ú×ÅAIÄêҹģ×ÓÌåÁ¿ÔöÄêÒ¹£¬ÕâÒ»µãÒ²ºÜÊÇÖ÷Òª¡£HwResmc

²»Í⣬Ìïݳ¿Ö¸³ö£º“²»¹ÜÊÇÄÚ´æÓïÒ廹ÓÐÊǶ¯¾²ÓïÒ壬¶ÔÓÚÔÚ³§É̶øÑÔ£¬¶¼Ãæ¶ÔһЩ¹²ÐÔµÄÌôÕ½£¬ºÃ±È´«Í³GPUÖ±³ö½«IO¼¯³ÉÓÚGPUÄÚ²¿£¬»úÄܽúÉýÔâµ½Á˹âÕֳߴçµÄÑÏ¿áÏÞ¶¨£¬Áô¸øIOµÄ¿Õ¼äºÜÊÇÓÐÏÞ£¬IOÃܶȽúÉý¼á¿à£»ScaleUpÊÕ¼¯¼°Êý¾Ý´«ÊäºÍ̸·±ÔÓ£¬¼Æ½ÏоƬ³§É̶àÊýȱÉÙÏà¸É¾­Ñé£¬ÌØ±ðÊÇ¿ª·¢»¥»»»úоƬµÄ¾­Ñ飻³ýÁËNVLinkÒÔÍ⣬ÆäËûScaleUpºÍ̸Æäʵ²»³ÉÊìÇÒ²»Í¬Ò»£¬ºÍ̸µü´ú¶ÔÓڼƽÏоƬµü´úÔì³ÉΪÁ˾ÞÄêÒ¹µÄÀ§ÈÅ¡£”HwResmc

HwResmc

ͼ9£ºGPUIO¼¯³ÉÓÚGPUÄÚ²¿Í¼Æ¬À´Àú£ºÆæÌØÄ¦¶ûHwResmc

ΪÁË¿ÉÒÔ»òÐí¸üºÃµØÓ¦´ðÉÏÊöÌôÕ½£¬²Æ²ú½çÌá³öÁËÒ»ÖÖÁ¢ÒìµÄGPUÖ±³ö·½Ê½——¼Æ½Ï¼°IO·ÖÉ¢¡£ÆæÌØÄ¦¶ûNDSA-G2G»¥Áª·½°¸¼´ÊÇÕâÌõ¼¼ÄÜ·¾¶ÀïºÜÊÇÓоºÕùÁ¦µÄÒ»¿î·½°¸¡£HwResmc

½èÖúNDSA-G2G¿ÉÒÔʵÏּƽÏоÁ£¼°IOоÁ£½âñ¾­Óɹý³ÌͨÓÃоÁ£»¥Áª¼¼ÄÜUCIe¾ÙÐл¥Áª¡£ÈçÐí×öµÄÀûÒæÊÇ£¬Ö»ÐèÒª¾èÇûÒ»µãµãµÄоµ¥·½Ãæ»ý£¨Ð¡°Ù·ÖÖ®¼¸£©£¬¾ÍÄܹ»½«Ãû¹óµÄÖнé²ã×ÊÔ´½üºõ100%ÓÃÔڼƽϣ¬²¢¸ù¾Ý¿Í»§µÄÐèÇó½Ã½ÝµØÔö³¤IOоÁ£µÄÊýÄ¿£¬ÇҼƽÏоÁ£¼°IOоÁ£¿ÉÒÔ»ùÔÚ²î±ðµÄ¹¤ÒÕ¼¼ÄÜ¡£ÔÙ¼ÓÖ®IOоÁ£µÄ¸´ÓÃÌØÕ÷£¬¿ÉÒÔ»òÐíÏÔÖø½úÉý¸ß»úÄܼƽÏоƬµÄ»úÄܼ°ÐԼ۱ȡ£HwResmc

NDSA-G2GµÄµÚ¶þÄêÒ¹ÉÏ·çÊǽúÉýIOÃܶȼ°»úÄÜ£¬¾ß±¸¸ß´ø¿í¡¢µÍÑÓʱ¼°¸ß²¢·¢µÄÌØÕ÷¡£Óڸߴø¿í·½Ã棬»ùÔÚNDSA-G2GоÁ££¬¿ÉÒÔʵÏÖ1TB¼¶ÁíÍâÊÕ¼¯²ãÍÌÍÂÁ¿£¬TB¼¶µÄGPU²àÍÌÍÂÁ¿£»ÓÚµÍÑÓʱ·½Ã棬NDSA-G2GоÁ£Ìṩ°Ùns¼¶µÄÊý¾Ý´«ÊäÑÓʱ¼°ns¼¶D2DÊý¾Ý´«ÊäÑÓʱ£»Óڸ߲¢·¢·½Ã棬¸Ã²úÎï³Å³Ö¶à´ïÊý°ÙÍò¸öÐÐÁв½¶Ó¶ÔÓÚ£¬¿ÉÀ©´óÌåϵÖеÄÄÚ´æ×ÊÔ´¡£Ò²¾ÍÊÇ˵£¬½èÖúÆæÌØÄ¦¶ûNDSA-G2GоÁ£¿ÉÒÔ»òÐí¸³Äܹú²úAIоƬʵÏÖ×ÔÁ¢Í»Î§£¬¹¹½¨»úÄÜæÇÃÀӢΰ´ïNVSwitch+NVLinkµÄScaleUp·½°¸¡£HwResmc

HwResmc

ͼ10£ºKiwiNDSA-G2G²úÎïʾÓÃÒâͼƬÀ´Àú£ºÆæÌØÄ¦¶ûHwResmc

NDSA-G2GµÄµÚÈýÄêÒ¹ÉÏ·çÊǾ߱¸¾«²ÊµÄ½Ã½ÝÐÔ¡£ÈçÉÏËùÊö£¬½ñ³¯ScaleUp¼¼ÄÜÏß·Æäʵ²»Í¬Ò»£¬ÇÒÖÇËãÖÐÑë³§ÉÌÓÚºÍ̸·½Ãæ¶àÊý²ÉÓÃ×ÔÓкÍ̸£¬»òÕßÕß±¾ÉíÖ÷µ¼µÄͬÃ˺Í̸¡£Õâ¾ÍÖÂʹ¸ß»úÄܼƽÏоƬÐèÒªÓÚÉè¼ÆÊ±Ë¼Á¿½«À´2¡«3Ä꣬ÉõÖÁÊÇ3¡«5ÄêµÄºÍ̸³É³¤£¬¾ß±¸ºÜÊÇÄêÒ¹µÄÌôÕ½¡£NDSA-G2GÒԼƽÏоÁ£¼°IOоÁ£·ÖÉ¢µÄ·½Ê½ÈÃIOоÁ£¿ÉÒԽýݽø¼¶£¬Í¬Ê±NASG-G2G»ùÔھ߱¸¿É±à³ÌÐÔ£¬¿ÉÒԳųֽñ³¯ÊеÀÉϸ÷ÀàIOºÍ̸¡£ÕâÀà½Ã½ÝÐÔÈø߻úÄܼƽÏоƬ³§ÉÌ¿ÉÒÔ×ÔÔÚÓ¦´ðµ±Ç°ScaleUp¼¼ÄÜÏß·²»Í¬Ò»ÇÒºÍ̸ÔÓÂÒµÄÌôÕ½¡£HwResmc

ͬʱ£¬Ìïݳ¿Ò²ºôÓõ£º“µ«Ô¸¿Æ¼¼ÐÐÒµÓÚScaleUp±êµÄÄ¿µÄÉÏ¿ÉÒÔ»òÐíÓµ±§Ò»ÖÖ¿ª·Å¶øÍ¬Ò»µÄÎïÀí½Ó¿Ú£¬ÊµÏÖ¸üºÃµÄЭͬ³É³¤£¬ÕâÒ²ÊÇ´òÔì¹ú²ú×ÔÁ¢¿É¿ØËãÁ¦µ××ùµÄÒªº¦Ò»²½¡£”HwResmc

ScaleInside——ÖÜÈ«½úÉý¼Æ½ÏоƬ´«ÊäЧÂÊ

ÓÚScaleOut¼°ScaleUp¸ßËٳɳ¤µÄÀú³ÌÖУ¬×÷ΪËãÁ¦»ù´¡µ¥Î»£¬ScaleInsideµÄ½ø¶ÈҲûÓÐÂäÏ£¬²¢ÖÂÁ¦ÔÚ¾­Óɹý³Ì½ø²½Ç°±²·â×°¼¼ÄÜÌĦ¶û¶¨ÂÉËÙÂÊ·Å»ºµÄÓ°Ïì¡£ÓÚÕû¸öÖÇËãÌåϵÀ¸ü¸ßËãÁ¦µÄ¼Æ½ÏоƬ¿ÉÒÔ»òÐí½øÒ»²½½úÉýScaleUp¼°ScaleOutµÄ»úÄ̶ܳȣ¬Ê¹»¼ÉÏAIÄêҹģ×ÓµÄÁ·Ï°Ô½·¢¸ßЧ¡£HwResmc

µ±Ç°£¬µ¥¿Å¸ß»úÄܼƽÏоƬµÄ³É±¾ÒѾ­¾­ºÜÊǿɺ§£¬¸ú×ÅÖÆ³Ì¹¤ÒÕ½øÒ»²½¾«½ø£¬ÕâÒ»Êý×Ö»¹Óн«¼Ì³Ðì­Éý£¬ÓÚÊÇChiplet¼¼ÄÜ»ñµÃÁ˹㷺µÄÆ÷ÖØ¡£Chiplet¼¼ÄÜ´ðÓ¦¾­Óɹý³Ì»ìÏý·â×°µÄ·½Ê½´òÔì¸ß»úÄܼƽÏоƬ£¬Ò²¾ÍÊÇ˵¼Æ½Ïµ¥Î»¼°IO¡¢´æ´¢µÈÆäËû¹¦Ð§µ¥Î»¿ÉÒÔÑ¡Ôñ²î±ðµÄ¹¤ÒÕʵÏÖ£¬¾ß±¸¼«¸ßµÄ½Ã½ÝÐÔ£¬´ðÓ¦³§Ḛ́´ÕÕ±¾ÉíµÄÐèÇó¾ÙÐж¨ÖÆÐ¾Á££¬²»½ö¿ÉÒÔ»òÐíÏÔÖø½µµÍоƬÉè¼Æ¼°ÖÆÔìµÄ³É±¾£¬Á¼ÂÊÒ²¿ÉÒÔ»ñµÃºÜÄêÒ¹µÄ¸ÄÉÆ¡£HwResmc

ÓÚScaleInside±êµÄÄ¿µÄÉÏ£¬ÆæÌØÄ¦¶û¿ÉÒÔ»òÐíÌṩ¸»ºñµÄChiplet¼¼ÄÜ·½°¸£¬°üÀ¨KiwiLinkUCIeDie2Die½Ó¿ÚIP¡¢CentralIODie,3DBaseDieϵÁеÈ¡£´ËÖУ¬KiwiLinkȫϵÁгųÖUCIe³ß¶È£¬¾ß±¸Òµ½çÁìÏȵĸߴø¿í¡¢µÍ¹¦ºÄ¡¢µÍÑÓÊ±ÌØÕ÷£¬²¢³Å³Ö¶àÖÖ·â×°ÀàÐÍ¡£KiwiLink³Å³Ö¸ß´ï16~32GT/sµÄ´«ÊäËٶȼ°µÍÖÁns¼¶µÄ´«ÊäÑÓ³Ù£¬³Å³ÖMulti-Protocol¶àºÍ̸£¬°üÀ¨PCIe¡¢CXL¼°Streaming¡£HwResmc

HwResmc

ͼ11£ºKiwiFabric»¥Áª¼Ü¹¹Í¼Æ¬À´Àú£ºÆæÌØÄ¦¶ûHwResmc

×ۺ϶øÑÔ£¬ÆæÌØÄ¦¶ûµÄ½â¾ö·½°¸¿ÉÒÔ»òÐí´Ó“ScaleOut”“ScaleUp”“ScaleInside”ÈýÄêÒ¹½Ç¶È£¬±Þ²ßAIÄêҹģ×ÓÁ·Ï°Ð§ÂʵĽúÉý¡£ÓÚScaleOut·½Ãæ£¬ÆæÌØÄ¦¶ûÒѾ­¾­Êdz¬ÒÔÌ«ÍøÍ¬ÃËUECµÄ³ÉÔ±£¬¿ÉÒÔ»òÐíÓÚµÚһʱ¼äÏàÓ¦UEC¹æ·¶1.0ÒԺͺóÐø¹æ·¶£»ÓÚScaleUp·½Ãæ£¬ÆæÌØÄ¦¶ûNDSA-G2GоÁ£²»½ö¿ÉÒÔ»òÐí°ïæ¿Æ¼¼¹«Ë¾´òÔìæÇÃÀӢΰ´ïNVSwitch+NVLink»úÄܵÄScaleUp·½°¸£¬ÊÊÅä¸÷À༼ÄÜÏß·¼°ºÍ̸£¬Ò²ÕýÓÚÒýÁì¼Æ½ÏоƬµÄÉè¼Æ¸ÄÔ죻ÓÚScaleInside·½°¸£¬ÆæÌØÄ¦¶ûµÄKiwiLinkUCIeDie2Die½Ó¿ÚIP¡¢CentralIODie¡¢3DBaseDieϵÁеȷ½°¸¿ÉÒÔ»òÐí°ïæ³§ÉÌ´òÔì¾ß±¸¸ßЧ´«ÊäÄÜÁ¦µÄ¸ß»úÄܼƽÏоƬ¡£HwResmc

ÕâЩ·½°¸ºÜºÃµØ¼ùÐÐÁËÆæÌØÄ¦¶û¹«Ë¾µÄÈÎÎñ——ÒÔ»¥ÁªÎªÖÐÑ룬ÒÀÍÐChiplet¼°RDMA¼¼ÄÜ£¬ÐÞÖþAI¸ß»úÄܼƽϵĻùʯ¡£“¶ÔÓÚÔÚ¹ú²úAIÄêҹģ×Ó¼°¹ú²úAIоƬ²Æ²ú¶øÑÔ£¬ÆæÌØÄ¦¶ûµÄ·½°¸ÊÇÐÂÖʳö²úÁ¦µÄ´ú±í£¬ÓÐןüÄêÒ¹µÄDZÄÜÖµ»¼ÉÏÈ¥ÍÚ¾ò¡£ÎªÊµÏÖ¹ú²úAIоƬ²Æ²úµÄ‘ÖйúÃÎ’£¬ÆæÌØÄ¦¶û²»½öÌṩ³Å³Ö×îÇ°ÑØºÍ̸µÄIOоÁ££¬ÒÔʵÏÖ¸ßËÙ¶È¡¢¸ß´ø¿í¡¢µÍʱÑӵĴ«ÊäÌåÏÖ£¬»¹ÓÐÓÚChipletÏß·É϶À±Ùõè¾¶£¬ÓÃÁ¢ÒìµÄоƬ¼Ü¹¹ÖúÁ¦´òÔì¸ü¸ß»úÄܵÄAIоƬ¡£ÆæÌØÄ¦¶ûÔ¸Ó뺣ÄÚ¹«Ë¾ÁªñÇ£¬Îª¹ú²úAIоƬ²Æ²ú³É³¤Ìíש¼ÓÍߣ¬ÅäºÏ¹´ÀÕ¹ú²úAI³É³¤µÄ¹ãÄ®À¶Í¼¡£”Ìïݳ¿Ä©ÁË˵¡£HwResmc

Ôð±à£ºClover.li-Ã×À¼(milan)