È«ÐÂÒ»´úÈð»¢9ÉÏÊУºµ±ÐÂÄÜÔ´³ÉΪÖ÷Á÷ºó£¬ÆæÈðµÄȼÓÍÆì½¢ÕýÔÚѰÕÒеÄÃÕµ×
DeepSeek Ðû²¼ V4 Ô¤ÀÀ°æ£¬Í¬²½¿ªÔ´¡£Í¨¸æÀïÓÐÒ»¾ä»°£º" ´ÓÏÖÔÚ×îÏÈ£¬1M£¨Ò»°ÙÍò£©ÉÏÏÂÎĽ«ÊÇ DeepSeek ËùÓйٷ½·þÎñµÄ±êÅä¡£"OpenAI ºÍ Google Ôç¾ÍÖ§³Ö³¬³¤ÉÏÏÂÎÄÁË¡£ÎÊÌâÊDZ¾Ç®¡£Transformer ×¢ÖØÁ¦»úÖÆµÄÅÌËãÁ¿ËæÐòÁ㤶Èƽ·½ÔöÌí¡ª¡ªÐòÁз±¶£¬ËãÁ¦±äËı¶¡ª¡ª´¦Öóͷ£ 100 Íò token ÔڹŰå¼Ü¹¹ÏÂÏÕЩÎÞ·¨ÉÌÒµ»¯¡£ÊÖÒÕ±¨¸æ¸ø³öÁËÕâ´Î¼Ü¹¹¸Ä¶¯µÄ·ù¶È£ºÔÚ1M token ³¡¾°Ï£¬V4-Pro µÄµ¥ token ÍÆÀí FLOPs Ö»ÓÐ V3.2 µÄ 27%£¬KV »º´æÓÃÁ¿Ö»ÓÐ 10%¡£ Á½°Ñµ¶±ê×¼ Transformer µÄ×Ô×¢ÖØÁ¦£¬ÒªÈÃÿ¸ö token ¸úÐòÁÐÀïËùÓÐÆäËû token ËãÏà¹ØÐÔÈ¨ÖØ¡£ÕâÊÇÆ½·½ÖØÆ¯ºó£¬½á¹¹ÐԵ쬲»Êǹ¤³Ìµ÷ÓÅÄܽâ¾öµÄ¡£ÒÑÍùµÄÓ¦¶Ô·½·¨¸ÅÂÔ·ÖÁ½ÀࣺҪôÇеôÅÌËã¹æÄ££¨»¬¶¯´°¿ÚÖ»¿´¾Ö²¿ÁÚÈË£¬È«¾Ö¸ÐÖªËæÖ®ÏûÊÅ£©£¬ÒªÃ´ÈÆ¿ª³¤Îı¾×Ô¼º£¨RAG ÏȼìË÷ÔÙι¸øÄ£×Ó£¬¼ìË÷ÖÊÁ¿³ÉΪеÄÉÏÏÞ£©¡£ÉÐÓÐÀο¿Ï£º±×¢ÖØÁ¦£¬È˹¤Éè¼ÆÏ£º±Ä£Ê½À´Ìø¹ý²¿·ÖÅÌË㣬µ«Ä£Ê½ÊÇËÀµÄ£¬²î±ðʹÃüµÄÐÅÏ¢ÂþÑܲî±ð´ó£¬·º»¯ÄÜÁ¦ÓÐÏÞ¡£V4 µÄ¼Æ»®ÊÇ CSA + HCA »ìÏý×¢ÖØÁ¦¼Ü¹¹¡£CSA£¨Compressed Sparse Attention£©½â¾öµÄÊÇ " Ëãʲô "¡£ÓÃÇáÁ¿¼¶Ë÷ÒýÆ÷ÏȶÔËùÓÐ token ¶Ô×ö´Öɸ£¬¿ìËÙ¹ÀËãÏà¹ØÐÔÅÅÐò£¬ÔÙ¾«Ñ¡³öÐèÒªÍêÕûÅÌËãµÄ token ÜöÝÍ¡£Òªº¦ÔÚÓÚÕâÌ×Ï£º±½á¹¹ÊÇ¿ÉѵÁ·µÄ¡ª¡ªÄ£×ÓÔÚѵÁ·Àú³ÌÖÐ×Ô¼ºÑ§³öÄÇÀïÐèÒª¸ßÃܶÈ×¢ÖØÁ¦£¬ÄÇÀï¿ÉÒÔÏ£º±¡£V3.2 ʱ´úµÄ DSA ÊdzûÐΣ¬V4 ÔÚ´Ë»ù´¡ÉÏ×öÁ˽øÒ»²½ÑÝ»¯¡£HCA£¨Heavily Compressed Attention£©½â¾öµÄÊÇ " ´æÊ²Ã´ "¡£ÔÚ V3 ʱ´ú MLA£¨Multi-head Latent Attention£©µÄ»ù´¡ÉϼÌÐøÍÆ½ø£¬°Ñ KV ÏòÁ¿Ó³Éäµ½µÍάDZ¿Õ¼ä£¬ÍÆÀíʱ½âѹ¡£µþÉÏ FP4+FP8 »ìÏý¾«¶È¡ª¡ª MoE ר¼Ò²ÎÊýÓà FP4£¬ÆäÓàÓà FP8 ¡ª¡ª KV »º´æµÄÏÔ´æÕ¼ÓÃÔÙ¿³Ò»°ë¡£Á½Õßµþ¼ÓµÄЧ¹û£¬Ö±½ÓÌåÏÖÔÚÄÇÁ½¸öÊý×Ö£º27% µÄ FLOPs£¬10% µÄ KV »º´æ¡£»»Ëã¹ýÀ´£¬Ò»ÂÉËãÁ¦ÏÂÄÜ·þÎñµÄ³¤ÉÏÏÂÎIJ¢·¢Á¿Ô¼ÄªÊÇÔÀ´µÄ 3 µ½ 4 ±¶¡£ÊÖÒÕ±¨¸æÀïÉÐÓÐÁ½¸öϸ½ÚÖµµÃ¼Çһϡ£mHC£¨Manifold-Constrained Hyper-Connections£©¶Ô²Ð²îÅþÁ¬×öÁËÁ÷ÐÎÔ¼ÊøÇ¿»¯£¬Õë¶ÔµÄÊÇ 1.6T ²ÎÊý³¬Éî¶ÈÄ£×ÓѵÁ·Ê±¿ç²ãÐźÅË¥¼õµÄÎÊÌâ¡£Muon ÓÅ»¯Æ÷Ìæ»»ÁË Adam ϵÁУ¬»ùÓÚ¾ØÕóÕý½»»¯¸üУ¬ÔÚ³¬´ó¹æÄ£ÑµÁ·ÀïÊÕÁ²¸ü¿ì£¬¸üÎȹ̡ª¡ª Adam ÔÚ´óÄ£×ÓѵÁ·ÀïÏÕЩÊÇĬÈÏÉèÖã¬DeepSeek Õâ´Î»»µôÁËËü¡£ Êý×Ö¹Ù·½¸ø³öÁËÓë Claude Opus 4.6¡¢GPT-5.4 xHigh¡¢Gemini 3.1 Pro High µÄȫά¶ÈºáÆÀ¡£ÊýѧºÍ¾ºÈüÍÆÀíÊÇ V4-Pro ÌåÏÖ×îÍ»³öµÄά¶È¡£Codeforces ÆÀ·Ö 3206£¬ËļÒ×î¸ß£¨GPT-5.4 ÊÇ 3168£¬Gemini ºÍ V4-Flash ¶¼ÊÇ 3052£©¡£Apex Shortlist 90.2£¬Áè¼Ý Opus 4.6£¨85.9£©¡¢GPT-5.4£¨78.1£©¡¢Gemini£¨89.1£©¡£IMOAnswerBench 89.8£¬½ö´ÎÓÚ GPT-5.4£¨91.4£©¡£Agent ÄÜÁ¦ÉÏ£¬SWE Verified 80.6£¬Opus 4.6 ÊÇ 80.8¡£Toolathlon 51.8£¬Opus 4.6 ÊÇ 47.2£¬GPT-5.4 ÊÇ 54.6¡£Í¨¸æÀïÓÐÒ»¾äÄÚ²¿ÆÀ¼Û£ºV4 ÒѳÉΪԱ¹¤ Agentic Coding µÄÖ÷Á¦Ä£×Ó£¬" ʹÓÃÌåÑéÓÅÓÚ Sonnet 4.5£¬½»¸¶ÖÊÁ¿¿¿½ü Opus 4.6 ·Ç˼Ë÷ģʽ "¡£³¤ÉÏÏÂÎIJâÆÀÓÐÁ½¸öÊý×ÖÒª±ÈÕÕ×Å¿´£ºMRCR 1M£¨³¤Îı¾Òªº¦ÐÅÏ¢¼ìË÷£©83.5£¬Gemini ÊÇ 76.3£¬Opus 4.6 ÊÇ 92.9¡£CorpusQA 1M£¨³¤Îĵµ¾«×¼ÎÊ´ð£©62.0£¬Opus 4.6 ÊÇ 71.7¡£MRCR ×ÅÖØ¼ì²âÒªº¦ÐÅÏ¢ÊÇ·ñ±£´æ£¬CorpusQA ÒªÔÚ°ÙÍò token Àᆱ׼¶¨Î»²¢×ÛºÏÆÊÎö¡ª¡ªÁ½¸ö²âÆÀµÄ·Ö½â·ÅÔÚÒ»Æð£¬ËµÃ÷µÄ¹¤¾ß×ÔÈ»ÇåÎú¡£×ÛºÏ֪ʶºÍ¿ÆÑ§Ç°ÑØÍÆÀí£ºSimpleQA-Verified 57.9£¬Gemini ÊÇ 75.6¡£HLE£¨Ç°ÑØ¿ÆÑ§ÍÆÀí³¬ÄÑÌ⼯£©37.7£¬ËļÒÀï×îµÍ¡£V4-Flash£º284B ×ܲÎÊý£¬13B ¼¤»î£¬Ô¼Îª Pro °æ 18% µÄÌåÁ¿£¬Í¬ÑùÖ§³Ö 1M ÉÏÏÂÎÄºÍ Think/Think Max ÍÆÀíģʽ¡£¹Ù·½Ëµ¼òÆÓ Agent ʹÃüÉÏÓë Pro" Æì¹ÄÏ൱ "¡£DeepSeek °ÑÕâ´ÎÐû²¼½Ð " Ô¤ÀÀ°æ "£¬ÊÖÒÕ±¨¸æÎÊÌâÀïдµÄÊÇ "Towards" ¡ª¡ª³¯Ïò£¬»¹ÔÚ·ÉÏ¡£CSA ºÍ HCA µÄÉè¼ÆÂß¼½ñÌìÒѾ¹ûÕæ£¬Ï£º±ÑµÁ·»úÖÆÔÚ²î±ðʹÃüÂþÑÜÏÂÔõôÌåÏÖ£¬ÊǽÓÏÂÀ´¿ªÔ´ÉçÇø»á¸æËßÎÒÃǵÄÊ¡£Êý¾ÝȪԴ£ºDeepSeek ¹Ù·½Í¨¸æ¡¶DeepSeek-V4 Ô¤ÀÀ°æ£ºÂõÈë°ÙÍòÉÏÏÂÎÄÆÕ»Ýʱ´ú¡·£¨2026 Äê 4 Ô 24 ÈÕ£©£»ÊÖÒÕ±¨¸æ DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence