China Artificial Intelligence Security and Safety Commitments Framework Released

Source: 智汇AI | Published: 2025-07-31

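智汇AI, July 30: The plenary session on "AI Development and Safety" of the 2025 World Artificial Intelligence Conference and High-Level Meeting on Global AI Governance was held in Shanghai on the afternoon of July 26. The session was hosted by the China AI Safety and Development Association (CnAISDA). Wu Wei, member of the Standing Committee of the CPC Shanghai Municipal Committee and Executive Vice Mayor of Shanghai, and Huo Fupeng, Director of the Innovation-Driven Development Center of the National Development and Reform Commission, attended and delivered remarks.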

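Four Turing Award laureates, Geoffrey Hinton, Andrew Chi-Chih Yao, Yoshua Bengio, and David Patterson, along with more than 20 leading experts from China and abroad, attended the session. They discussed frontier issues such as the safe development of AI and narrowing the intelligence divide, and explored pathways for international cooperation on AI safety governance.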

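Yu Xiaohui, President of the China Academy of Information and Communications Technology (CAICT) and Secretary-General of the China Artificial Intelligence Industry Alliance (AIIA), was invited to take part in the dialogue. He led the release of the China Artificial Intelligence Security and Safety Commitments Framework together with representatives from Tsinghua University, Shanghai Artificial Intelligence Laboratory, the China Center for Information Industry Development, and other organizations.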

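The Framework builds on AIIA's Artificial Intelligence Safety Commitments (released in December 2024). It adds provisions on strengthening international cooperation in AI safety governance and on guarding against frontier AI safety risks, reflecting the firm resolve and openness of China's industry to work closely with all parties worldwide in promoting AI for good.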

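As a next step, CAICT, as a member of CnAISDA and the secretariat of AIIA, will work with the signatory companies to put the Framework into practice through measures such as disclosure of actions taken and testing and verification. The aim is to advance China's AI in a beneficial, safe, and fair direction through healthy and orderly development, and to actively pursue international governance cooperation, contributing Chinese insight and strength to global AI safety governance.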

智汇AI attaches the full text of the Framework in Chinese and English:

中国人工智能安全承诺框架

CHINA ARTIFICIAL INTELLIGENCE SECURITY AND SAFETY COMMITMENTS FRAMEWORK

人工智能浪潮席卷全球，积极释放技术价值红利，对全球经济社会发展和人类文明进步产生深远影响。我们也清晰认知到，人工智能带来难以预知的各种风险挑战。为把握新一轮发展机遇，中国人工智能发展与安全研究网络成员郑重发起《中国人工智能安全承诺框架》，通过产业自律，以高水平安全保障高质量发展，协力共促人工智能稳健发展。此事由中国信息通信研究院牵头推进。我们深知，自律承诺是获得社会信任的关键要素，我们将以本承诺作为行动守则，接受社会各界监督，不断提升优化，促进人工智能技术应用以人为本，智能向善。

The wave of artificial intelligence (AI) is sweeping across the globe, actively generating technological dividends and exerting profound influence on global economic and social development as well as the progress of human civilization. At the same time, we are keenly aware that AI brings about unpredictable risks and complex challenges. To seize this new round of development opportunities, members of China AI Safety and Development Association (CnAISDA) solemnly launch the AI Security and Safety Commitments. Through industry self-regulation, we will leverage high-level security and safety measures to support high-quality development, and collaborate to promote the robust development of AI. This initiative is led and promoted by the China Academy of Information and Communications Technology (CAICT). We fully recognize that commitments to self-discipline constitute a critical foundation for gaining the trust of the international community. Guided by the Commitments as our code of conduct, and subject to the oversight of all stakeholders, we will continuously improve and refine our approach. By doing so, we will ensure that the application of AI technologies always remains people-centered and aligned with the principle of AI for good.

承诺一：设置安全团队或组织架构，构建安全风险管理机制。内部设有专业团队负责开展人工智能风险评估、安全治理等工作，明确安全负责人。主动设定符合实际需求的安全风险基线，开源时采取相应的安全措施，开展贯穿人工智能开发部署全生命周期的风险管理，明确风险识别和应对流程及措施。

Commitment I: Establish security and safety teams or organizational structures and build security and safety risk management mechanisms. Designate a leader responsible for AI security and safety, establish specialized teams to conduct AI risk assessments and safety, security and governance within the enterprise. Proactively define realistic security and safety risk baselines, adopt appropriate security and safety measures for open-source initiatives, and implement risk management practices throughout the entire AI development and deployment lifecycle. Clearly outline processes and measures for risk identification and mitigation.

承诺二：开展模型安全测试，提升模型效果与安全可靠性。通过专业性的仿真测试团队，在发布、更新人工智能模型之前对其进行红队测试。对于大模型，重点围绕其通用理解、推理和决策能力，以及其在工业、教育、医疗、金融、法律等场景下表现出的能力开展安全性和可靠性测试。

Commitment II: Conduct security and safety testing for AI models to enhance the performance, safety and reliability. Through dedicated simulation and red-teaming experts, rigorously test AI models prior to their release or update. For large models in particular, prioritize safety and reliability evaluations focusing on their general understanding, reasoning, and decision-making capabilities, as well as their performance in critical domains such as industry, education, healthcare, finance, and law.

承诺三：采取措施保障训练数据和业务数据安全。制定数据安全防护制度，配套建立防护技术措施，发现并及时处置数据投毒的情况，把控训练数据的准确性与可靠性。对业务数据进行加密存储与访问控制，确保商业秘密、用户隐私及用户上传的知识库仅在授权下访问，不被人工智能模型非法输出，保障数据安全与隐私权益。

Commitment III: Implement measures to safeguard the security of training data and operational data. Establish data security protection policies and deploy corresponding technical measures to detect and promptly address data poisoning incidents, ensuring the accuracy and reliability of training data. Encrypt operational data and enforce access controls to protect business secrets, user privacy, and user-uploaded knowledge base, ensuring access is restricted to authorized use only. Prevent unauthorized outputs by AI models, thereby safeguarding data security and privacy rights.

承诺四：提升基础设施安全。建立人工智能系统部署的软硬件安全监测和防护能力，实施定期和动态的安全渗透测试，模拟各种潜在的风险场景，识别并报告环境中的安全隐患，研判可能导致的各种风险。建立基础设施安全应急响应机制，包括应急处理流程、责任分配以及事后改进方案。

Commitment IV: Enhance infrastructure security. Develop robust capabilities for monitoring and protecting the software and hardware used in AI system deployments. Conduct regular and dynamic security penetration tests to simulate potential risk scenarios, identify and report security vulnerabilities in the infrastructure, and assess associated risks. Establish an infrastructure security incident response mechanism, including emergency response procedures, clear accountability assignments, and post-incident improvement solutions.

承诺五：增强模型透明度。主动披露安全治理实践举措，提升对各利益攸关方的透明度。公开披露模型的功能、适用领域以及局限性。通过模型说明、服务协议等方式，向公众披露可能涵盖的风险。

Commitment V: Enhance model transparency. Proactively disclose safety and security governance measures and improve transparency for all stakeholders. Provide clear information about the model's capabilities, applicable fields, and limitations. Inform potential risks to the public through model documentation, service agreements, or others.

承诺六：积极开展前沿安全研究，防范前沿领域安全风险。研究开发和部署智能向善的人工智能系统，积极向公众披露研究成果，以帮助应对社会面临的挑战。加强对人工智能系统在前沿领域中的滥用风险研判，防范其在高危场景的潜在滥用风险。

Commitment VI: Vigorously advance frontier safety and security research, and prevent safety and security risks in frontier fields. Innovate in the development and deployment of AI systems that embody the principle of AI for good, and disclose research findings with the public transparently, contributing to addressing pressing challenges faced by society. Strengthen the assessment of risks related to the abuse of AI systems in frontier fields, and prevent potential risks of their abuse in high-risk scenarios.

承诺七：加强安全治理国际合作，推动技术向善普惠应用。积极参与全球人工智能安全治理交流对话，共享风险识别、评估与防控经验及最佳实践。积极承担社会责任，加强科普宣传、开展技能培训，提升人工智能素养和技能水平，助力弥合智能鸿沟。

Commitment VII: Strengthen international cooperation on AI safety, security and governance, and promote inclusive, beneficial applications of AI. Actively participate in global dialogues on AI safety, security and governance, and contribute to the exchange of experiences and best practices in risk identification, assessment, and mitigation. Fulfill social responsibilities by advancing public science communication, enhancing AI education, and providing skills training to improve AI literacy and capabilities, with a focus on bridging the global intelligence divide.