2025!1"DeepSeek!"#AI$%&'()#$%SACNO.S0570519080006|SFCNO.BQZ938&'(SACNO.S05701220801381!"#$)*+,-./0)*+,-.123456789:;<=>?@ABCD1)*+,-.EFGHAIIJ2%KLCD1MNOPQRS3TUV6WX/89YZV6WX/0AI89[\]^89YZV6WX1?@H_`:S3;<ab2%VcCD/defghijkl*mnQo1VpqIrst/uvefwxyzwAIARR{|}1CapEx~•ÄÅÇ2025-01-30ÉÑghÖefÜá1DeepSeek!"#$%&'3!"#$%&'()DeepSeek%&'()*+,-./012,34/01•DeepSeek!"#$%&'()"#*+,-DeepSeek./01iOS./234567•89:;<=!"#>?@ABCDEF()"#GHI%B<=!"#J*()G!"#$%DeepSeek()*Bloomberg*+,-.4!"#$%&'()DeepSeekvsMeta5/06789:;<•Meta>Llama"#KL>MNOPQRSTLlamaLicense1.0UVWXYZP/[\/]^_`ab"#-cd6efF>\/gh7iT-Llama>()jklmnopqrstMetauv>wx2yz!n">{|}~•7•ÄÅ`ÇÉ\/ÑÖVÜáÉàQâ23ä\/Llama"#yzÄÅãÇÉåç-éèêëíÖ/7•ì|\/ghVTîì|>ïñó/[òSMAUUôö7õÉ-úêùûüMeta>fFPQ-†Q°¢£{|ëí7•§•¶{VTîß®•%§•¶{-•¶Llama•%APIã©™´¨-Q°êùûüMetaPQ-ä≠ÆjkëíÖ/7•DeepSeek-R1Ø/∞MITPQRyz()V†±≤≥ÜáÉàQ⥵∂\/]^_`ab∑"#-∏π/;{|ç>7•ÜáÉàQ⥵∂\/∑"#-∏πÇÉ]∫ªºΩ`{|æø7•ZP\/"#yzÜá¿¡>ñ¬-TÄÅ](b]√ƒ]≈∆}C7•/[Qâ:"#yz^_]«}`_y7•ZP/[≠Æ¥»>êùoh"#-yz…(b7•/[QâÀÃÕ"#ã^_Œ>œ–yz—h]ab`“”7•‘êù’÷>PQãëíÖ/7•MITPQRZP"#>{|}\/7†±≤≥QâÀ"#/;{|≈∆<-èêëíœvÖ/7!"#$%DeepSeek()*Bloomberg*+,-.5!"#$%&'()DeepSeek%)*=>?@•◊;()"#>ÿ6®Ÿ⁄¤‹-›fi"#>$–>fl‡2·-‚$–±≤≥„‚>‰ÂÊÁ`„Ë>ÈÍÎÏÌ-M./n"}ÓÔ>ùÒÚ7DeepSeek◊;ÛÙıˆ±˜¯ÛToken˘˙˚!¸˝¤‹-ZP\/˛ˇ>!¸Ôfi"#`$%&d'(°˜>"#7•Anthropic>)*+z,-./·012SDarioAmodeiU?%-¸˝>y345„fl-Qâ≈6107>_y78GPT-39Ô>$%o:é;-$–2·<1/12007$1MTokens!"1MTokens!#DeepSeek-chat0.140.28DeepSeek-reasoner0.552.19OpenAIo11560OpenAIo1-mini312OpenAI4o515OpenAI4o-mini0.150.60MetaLlama3.2Instruct$70B%0.720.72!"#$%DeepSeek()*Bloomberg*+,-.6!"#$%&'()DeepSeek%)*AB•◊;˛!◊="#>?>$%@¡-:;•Ë"#>$%°˜5Òù7DeepSeek-R1≈∆jklmQB"#>?G7•\/DeepSeek-R1ABScuratedU>80CÇD–-EFßG∞Qwen`LlamaC()"#7ÄÅHîIl-JK>>?L˝M≥NO∞˛P"#>$%°˜7:;>?"#-Q./SFT-‘\/RLSR™HSRLQâ!T•Ë"#U°U7•&√"#()VDeepSeek-R1-Zero]DeepSeek-R1â£◊;Qwen`LlamaADeepSeek-R1<>?>VÇWX"#S1.5B]7B]8B]14B]32B]70BU7!"#$%&'(•1!"#$%&'()*+,-./0,-!•2!1+,-2"#3456789.:;<=>?@ABC89DEF*GHIJ!•3!K1L,-.M6,-!NO:;<PQR;<BC"#!"#$%DeepSeek()*Bloomberg*+,-.7!"#$%&'()DeepSeek%)*AB•>?P"#>U°-«;:P"#EFyzRL7'Y;DeepSeek-R1ZP"#rRL[\]∞^v7•rQwen-32B-Base[\/ò∫]_``STEMòÆyz!n"RL"#-"#ôö10K3-ü<DeepSeek-R1-Zero-Qwen-32B7•HaV\/Qwen2.532B•%◊="#-EFADeepSeek-R1yz>?ü<>"#U°-«;rQwen2.532B[./O}∫b7•1UÀ„O!>"#>?$„P>"#Qâûücd>Hî-éef–g•<>!n"RL>„P>"#-‘Qêùh!>!¸°˜-45Q°i-‘<>?>U°7•2UXY>?jkltmÿdn-cùôop°>qr-Q°sYêù„O!>◊="#`„!n">O}∫b7•◊="#O>t{-uP"#(D„d«v7wxM:;RL¸˝7ST*UVRL*UVTherefore,wecandrawtwoconclusions:First,distillingmorepowerfulmodelsintosmalleronesyieldsexcellentresults,whereassmallermodelsrelyingonthelarge-scaleRLmentionedinthispaperrequireenormouscomputationalpowerandmaynotevenachievetheperformanceofdistillation.Second,whiledistillationstrategiesarebotheconomicalandeffective,advancingbeyondtheboundariesofintelligencemaystillrequiremorepowerfulbasemodelsandlarger-scalereinforcementlearning.!"#$%DeepSeek()*Bloomberg*+,-.2()*+DeepSeek!"#,-9!"#$%&'()DeepSeek%CD)*EFAIGHInfraModelSoftware!"#$%/&'()*bloomberg*+,-.10!"#$%&'()IJKLMNOPQRDeepSeek!"#$%/&'()*bloomberg*+,-.11!"#$%&'()IJKLMNOPQRDeepSeek!"#$%/&'()*bloomberg*+,-.•µ;yzÀ{|òÆB}~G<=ì|-Q...