The rise of deepfakes has sparked widespread concern, and OpenAI is taking a crucial step to combat misinformation. The company is releasing a “deepfake detector” tool that can identify manipulated images, audio, and video generated by AI.
How the Deepfake Detector Works
The tool provides a binary response, indicating the likelihood of an image being generated by AI. It can also display a content summary, flagging the “app or device” and AI tool used to create the content. OpenAI has implemented metadata to all images created and edited by DALL-E 3, which can be used to verify the source of the content.
Limitations and Future Development
While the deepfake detector is a significant step, it has limitations. OpenAI only tested the tool with DALL-E 3 images, and it may not work against newer models or other popular generators. To address these limitations, OpenAI is joining forces with Google and Microsoft in the Coalition for Content Provenance and Authenticity (C2PA) to develop ethical standards across the industry.
Watermarking Tool in Development
OpenAI is also developing a “tamper-resistant watermarking” tool for digital content. This tool will involve adding an invisible signal to content, making it difficult to remove without causing noticeable degradation. The watermarking tool will detect whether the content was generated using generative models, providing an additional layer of protection against misuse.
The Surge of Deepfakes
The AI industry is under growing pressure to take responsibility for the content generated by their products. Recent advancements in machine learning and AI have significantly sped up the development and distribution of deepfakes, leading to widespread concern. Estimates suggest that 500,000 video and voice deepfakes were shared on social media platforms in 2023, with a 450% increase in MEA, 1750% in North America, and 780% in Europe.
While OpenAI’s new deepfake detector is a step in the right direction, experts believe more needs to be done to combat misinformation. The fight against deepfakes is ongoing, and it requires collaboration from all stakeholders.





Leave a comment