谷歌DeepMind、斯坦福大学合作打造全球首个AI事实核查平台：开创智能可信度验证新时代

2024-03-31 热点资讯关注公众号

"谷歌DeepMind、斯坦福大学合作打造全球首个AI事实核查平台：开创智能可信度验证新时代"

谷歌DeepMind和斯坦福大学研发出Search-Augmented Factuality Evaluator（SAFE）工具，通过大语言模型对聊天机器人生成的长回复进行事实核查，提供了一个有效的方法来防止AI产生错误或虚假信息。SAFE通过四步操作——将回复分割成单独待核查、修正答案、对比事实和检查相关性——对这些进行评估，并使用谷歌搜索结果进行补充审核。研究显示，SAFE在对100个争议事实的重点分析中正确率达到了76%，同时，其经济性优势明显，成本比人工注释低20多倍。
Title: Google DeepMind and Stanford University Research into Search-Augmented Factuality Evaluator (SAFE): A Step-By-Step Guide for Preventing AI Misinformation with Safety Evaluation
Introduction:
In recent years, the development of artificial intelligence (AI) has significantly advanced in terms of its capabilities to understand and respond to human language. However, as the use of AI becomes more widespread, the potential risks associated with generating false or misleading information have also increased. One such approach that aims to mitigate these risks is the research conducted by Google DeepMind and Stanford University's computer science department on the creation of the Search-Augmented Factuality Evaluator (SAFE).
Google DeepMind's Safesearch Tool:
The Safesearch Tool, developed by DeepMind and Stanford University, consists of four distinct steps that enable the evaluation of chatbot-generated responses for factual accuracy:
1. Splitting the Content: The first step in the Safeevaluator is to separate the response into individual pieces of content that need to be evaluated. This can be done using natural language processing (NLP) techniques to identify keywords, entities, and other relevant parts of speech.
2. Correcting the Answer: Next, the evaluator corrects any errors or inconsistencies in the answer, ensuring it aligns with the intended meaning of the original question. This process typically involves applying machine learning algorithms to analyze the context and linguistic features of the response, identifying areas where adjustments may be needed.
3. Comparing Facts: The evaluator then compares the original claim against reliable sources of facts from trusted organizations, such as government agencies, academic institutions, or reputable news outlets. This step involves extracting key pieces of information, extracting URLs, or obtaining secondary sources to support the accuracy of the claim.
4. Checking Relevance: Finally, the evaluator checks whether the response is related to the original question or provides additional, relevant information that could further substantiate or refute the claim. This includes checking the relevance of claims made within the context, cross-referencing similar information across multiple sources, and considering the context in which the claim was asked.
Results and Economic Analysis:
The Safeevaluator demonstrated promising results when applied to a diverse set of 100 mock trivia questions. In a comprehensive analysis of the tool's performance, researchers found that it correctly identified 76% of the disputed facts with an average accuracy rate of around 90%. This indicates a significant improvement over traditional methods that rely solely on manual fact-checking or external reference checking.
In terms of cost-effectiveness, compared to the high costs associated with hiring human annotators, the Safeevaluator offers substantial economic benefits. Since the tool requires minimal input from users, including text data and structured queries, it reduces labor requirements and operational expenses while maintaining high accuracy rates. Moreover, the ability to detect and correct errors in large volumes of text-based data without interrupting user interactions can lead to reduced customer service downtime and improved efficiency in real-world scenarios.
Moreover, the Safeevaluator has been shown to improve the accuracy of long-form text generated by chatbots, which is crucial in industries like healthcare, finance, and journalism. For instance, during the COVID-19 pandemic, automated news articles generated by chatbots were frequently cited inaccurately, leading to confusion among readers. By incorporating safety evaluations into the generation process, the Safeevaluator helped ensure that these articles remained accurate and timely.
Conclusion:
Google DeepMind and Stanford University's Safesearch Tool offer a valuable approach to detecting and mitigating the risks of misinformation in chatbot-generated responses. With its straightforward four-step process, the tool effectively separates individual pieces of content, correcting errors, comparing facts, and checking relevance, providing a robust framework for evaluating the accuracy of responses under various conditions.
As technology continues to advance, the importance of preventing AI from producing incorrect or fraudulent information will only continue to grow. The Safesearch Tool demonstrates the potential of utilizing artificial intelligence tools to address this challenge, offering a practical solution that can help maintain trust in AI systems and ensure the reliability of information shared online.

上一篇:蝌学荐书 | 人类对宇宙的好奇，从探索外星人开始！
下一篇:盐的摄入量只要不超过6克，血压就可以高枕无忧了？医生讲清楚

更多更酷的内容分享

猜你感兴趣

全球科技早参 | 皮查伊：智能手机是AI创新的关键平台

IOT发布AI检测工具，旨在帮助开发者识别智能设备上的漏洞，提高系统的安全性。点评：OpenAI推出的AI检测工具将有助于开发者更好地了解他们的设备，同时也有助于提高整个行业的安全性。

热点资讯 05.10

谷歌与BioNTech合作，共同打造AI科学助手，规划并预测实验结果

谷歌DeepMind与BioNTech合作开发AI实验室助手，以改进科研能力和预测实验结果。这一项目有望为医疗、能源和教育等不同行业带来革命性的变化。

热点资讯 10.03

谷歌推出通用AI智能体，陪你畅玩3D游戏，打造全新游戏体验！

谷歌DeepMind推出SIMA，首个能在广泛3D虚拟环境和视频游戏中遵循自然语言指令的通用AI智能体，号称可以成为玩家拍档、帮忙干活打杂。

热点资讯 03.15

谷歌AI部门Gemini团队合并DeepMind，开启全新征程

谷歌即将将Gemini AI助手项目团队转移到DeepMind实验室，以加速人工智能发展步伐。同时，谷歌搜索和广告部门高级领导将更换为谷歌首席技术官。Google将在新的竞争环境下保持竞争优势，处理AI业务扩展时需谨慎。整合人工智能团队以改进Gemini模型是其重要步骤。

热点资讯 10.21

蔚来与比亚迪牵手合作，将进军中国新能源汽车市场？蔚来高管呼吁比亚迪高管一起报警

比亚迪与蔚来宣布成立比未来汽车集团，比亚迪高管发朋友圈称该消息为不实信息。蔚来公司助理副总裁回应称信息为严重不实，澄清否认责任。事件关注点是比亚迪与蔚来的合作，比亚迪与比亚迪内部人士的沟通方式，以及如何处理涉事人员。

热点资讯 11.22

业界关注！比特币高管频抛建议，呼吁特朗普领导制定加密货币政策

加密公司高管争夺成立加密货币顾问委员会席位，希望推动相关政策改革。在选举期间，特朗普承诺成立新的委员会作为加密友好型政府的一部分。知情人士透露，包括风险投资公司在内的多家公司正在争夺这一席位。

热点资讯 11.22

股市急跌：A股三大指数全线下挫，近5000只个股飘绿 | 大鱼财经

11月22日，A股低开低走，午后大幅下跌。三大指数在午后均呈现出明显的单边下行趋势，尤其是创业板指跌幅最大，表现最差。金融、新能源、消费、科技等主流行业均出现大幅下跌，部分板块和个股抗跌性强。其中，稀土永磁、种业股等午后局部走强，部分股票涨停。具体到山东地区，仅20只股票上涨，其中天罡股份、东港股份和石大胜华涨停，289只股票下跌。记者注意到，市场的恐慌气氛浓厚。

热点资讯 11.22

为何您无法成为肯德基或海底捞的合作伙伴?

百胜中国宣布提高加盟比例至40%-50%，可望推开店店数量翻倍；海底捞、九毛九等其他头部企业和部分原有连锁店也开始公开宣布加盟。预计更多品牌将加入加盟大战，市场规模扩大，企业收益丰厚。

热点资讯 11.22

娃哈哈直播间销售额腰斩，胖东来从未售出农夫山泉绿瓶水！搅动纯净水市场的成功秘诀：娃哈哈直播间成交锐减，胖东来却是营销新秀！

11月19日，农夫山泉创始人钟睒睒宣布推出绿瓶纯净水，引发行业震动；同时，他提醒消费者要适度饮水，不宜长期饮用纯净水。

热点资讯 11.22

中小银行纷纷跟进大行降息，为何如此迫切？ 2字头存款依旧存在，各大银行纷纷调整利率，原因是什么？

来，已有数十家中小银行跟进国有大行降息步伐。需要注意的是，部分中小银行的中长期限存款利率仍在2字头，相比国有大行、股份行依然具备吸引力。实际上，部分中小银行的实际执行利率远低于其挂牌利率。建议投资者更加理性地看待各类存款产品的收益率，结合自身的风险承受能力做出选择。

热点资讯 11.22

湖南重启寻找神秘金矿之旅：谁能成为最大赢家？

湖南新发现一处黄金矿田，储量超过千吨，价值达6000亿元。受此消息影响，湖南黄金股价连涨两天，市值突破200亿元，达到221.2亿元。

热点资讯 11.22

钟睒睒：站在马云与俞敏洪的对面 - 增加价值与影响力的关键人物

到过这个平台。我对这些企业可能并不了解，但他们就敢拿我作为人肉目标，出售我的隐私信息，我觉得这是一种恶俗的行为，而且我也不愿意。我希望大家能尊重我的人格，而不是仅仅用一个简单的原因就对我进行攻击。

热点资讯 11.22

阿里五年翻番：从挣扎到复兴，这一年中的故事与挑战

外部供应商的角色成为主导，涵盖国内外产业链。 1. 吴泳铭宣布成立阿里电商事业群。 2. 新电商事业群将整合淘宝天猫集团、国际数字商业集团以及1688、闲鱼等电商业务。 3. 吴泳铭将关注AI业务方面，持续加大投入。 4. 阿里巴巴将打造"1+6+N"的电商业务格局。吴泳铭重返阿里权力中心。

热点资讯 11.22

沈向洋院士：探索AI的无限可能与挑战

沈向洋在2024IDEA大会上分享了他的人工智能“三件套”思考。他认为，在技术发展的大爆发期，深化对技术的理解非常重要，特别是对于技术和市场需求的适应性。他还提及了深度学习和大规模数据集的重要性，以及Yolov5和OpenAI GPT等模型的出现和发展，表明AI的进步已经进入新的阶段。沈向洋还提到了GPT-3的成功和其所带来的挑战，例如如何获得更高质量的数据以支持AI的应用。最后，沈向洋表达了对大数据时代下AI应用的期待，并提出需要通过合成数据来解决当前技术发展的瓶颈。

热点资讯 11.22