Microsoft claims AI diagnostic tool can outperform doctors - FT中文网
登录×
电子邮件/用户名
密码
记住我
请输入邮箱和密码进行绑定操作:
请输入手机号码,通过短信验证(目前仅支持中国大陆地区的手机号):
请您阅读我们的用户注册协议隐私权保护政策,点击下方按钮即视为您接受。
微软

Microsoft claims AI diagnostic tool can outperform doctors

Research is first initiative from Big Tech group’s AI health unit formed by ex-DeepMind co-founder Mustafa Suleyman
00:00

{"text":[[{"start":11.28,"text":"Microsoft has built an artificial intelligence-powered medical tool it claims is four times more successful than human doctors at diagnosing complex ailments, as the tech giant unveils research it believes could speed up treatment."}],[{"start":29.119999999999997,"text":"The “Microsoft AI Diagnostic Orchestrator” is the first initiative to come out of an AI health unit formed last year by Mustafa Suleyman with staff poached from DeepMind, the research lab he co-founded and which is now owned by rival Google."}],[{"start":45.42,"text":"In an interview with the Financial Times, the chief executive of Microsoft AI said the trial was a step on the path to “medical superintelligence” that could help solve staffing crises and long waiting times for overstretched health systems."}],[{"start":64.65,"text":"Microsoft’s new system is underpinned by a so-called “orchestrator” that creates virtual panels of five AI agents acting as “doctors” — each with a distinct role, such as coming up with hypotheses or choosing diagnostic tests — which interact and “debate” together to choose a course of action."}],[{"start":87.64,"text":"To test its capabilities, “MAI-DxO” was fed 304 studies from the New England Journal of Medicine (NEJM) that describe how some of the most complicated cases were solved by doctors. "}],[{"start":103.29,"text":"This allowed researchers to test if the programme could figure out the correct diagnosis and relay its decision-making process, using a new technique called “chain of debate”, which makes AI reasoning models give a step-by-step account of how they solve problems."}],[{"start":121.97,"text":"Microsoft used leading large language models from OpenAI, Meta, Anthropic, Google, xAI and DeepSeek. The orchestrator made all LLMs perform better, but worked best with OpenAI’s o3 reasoning model to correctly solve 85.5 per cent of the NEJM cases."}],[{"start":146.71,"text":"That compared with about 20 per cent by experienced human doctors, but those physicians were not allowed access to textbooks or to ask colleagues in the trial, which could have increased their success rate."}],[{"start":163.95000000000002,"text":"A version of the technology could soon also be deployed in Microsoft’s Copilot AI chatbot and Bing search engine, which handle 50mn health queries a day."}],[{"start":175.39000000000001,"text":"Suleyman said Microsoft is nearing “AI models that are not just a little bit better, but dramatically better, than human performance: faster, cheaper and four times more accurate”."}],[{"start":190.54000000000002,"text":"“That is going to be truly transformative,” he added."}],[{"start":194.48000000000002,"text":"Suleyman’s new effort comes after Deepmind has led the way on AI-related heathcare breakthroughs. The Google lab’s chief Sir Demis Hassabis jointly won a chemistry Nobel Prize last year for using AI to unlock the biological secrets of proteins that underpin life. "}],[{"start":213.81,"text":"Microsoft has invested almost $14bn into OpenAI and has exclusive rights to use and sell its technology. However, the tech giant is embroiled in high-stakes brinkmanship with the start-up, which is attempting to convert into a for-profit entity, with both sides clashing over the future terms of their partnership."}],[{"start":239.41,"text":"Suleyman said that while OpenAI’s model performed the best, Microsoft was “agnostic” over which of the four “world-class models” MAI-DxO used. "}],[{"start":252.05,"text":"“We have long believed that they’ll become commodities . . . it’s the aggregate orchestrator which I think is the differentiator,” he said."}],[{"start":261.53000000000003,"text":"Dominic King, the former head of DeepMind’s health unit who joined Microsoft late last year, said that the programme had “performed better than anything we’ve ever seen before” and that “there is an opportunity here today to act almost as a new front door to healthcare”."}],[{"start":282.77000000000004,"text":"The AI models were also prompted to be cost-conscious, which significantly cut the number of tests required to get to a correct diagnosis in the trial, saving hundreds of thousands of dollars in some cases, he said."}],[{"start":298.00000000000006,"text":"However, King stressed that the technology was still in its early stages, had not been peer reviewed and was not yet ready for a clinical environment."}],[{"start":309.39000000000004,"text":"“This is a landmark study,” said Eric Topol, a cardiologist and founder and director of the Scripps Research Translational Institute. “While this work was not done in the setting of real world medical practice, it is the first to provide evidence for the efficiency potential of generative AI in medicine — accuracy and cost savings.”"}],[{"start":341.47,"text":""}]],"url":"https://audio.ftmailbox.cn/album/a_1751326744_2787.mp3"}

版权声明:本文版权归FT中文网所有,未经允许任何单位或个人不得转载,复制或以任何其他方式使用本文全部或部分,侵权必究。

绿洲乐队演唱会与爱丁堡艺穗节撞期,酒店价格飙升

大量音乐爱好者涌入,给爱丁堡本已紧张的住宿市场带来了更大压力。该市对短期租赁住宿的需求比去年同期增长了20%以上。

为什么很难经营的公司纷纷购买比特币

生物技术公司、矿业公司和酒店业者正在大量购入加密货币以推高股价,但专家警告称,如果金融市场崩盘,可能会引发危机。

苹果在AI人才争夺战中遭遇一连串离职打击

自今年伊始以来,这家iPhone制造商已有十几名AI团队员工跳槽到竞争对手公司。

Lex专栏:如何理性看待OpenAI急剧攀升的估值

无论采用传统估值方法还是更具创新性的方式,投资者都在支持萨姆•奥尔特曼的公司。

前劳工统计局局长:特朗普对数据的攻击将造成持久伤害

可靠的政府统计数据是美国经济的基石之一。

印度的俄罗斯石油难题

面对特朗普的施压,莫迪要么接受美国的关税,要么从俄罗斯转向其他供应国,要么尝试与特朗普达成某种妥协。
设置字号×
最小
较小
默认
较大
最大
分享×