OpenAI's video generator Sora is amazing, but also scary
OpenAI has released its premier AI text-to-video generator with incredible results。
OpenAI launched its first text-to-video generator, Sora, on Thursday, showcasing a strikingly realistic video of this artificial intelligence model that's eye-catching.。Sora has now been made available for testing by a small number of researchers and creatives before a wider public release, which could prove disastrous for the film industry and our collective deep falsification problem.。
In a blog post, OpenAI said: "Sora is able to generate complex scenes with accurate details of multiple characters, specific types of actions and themes and backgrounds.。"The model not only understands what the user asks for in the prompt, but also how these things exist in the physical world."。"
Sora is OpenAI's first foray into AI video generation, adding to the company's AI-driven text and image generators ChatGPT and Dall-E.。It is unique in that it is not only a creative tool, but more like a "data-driven physics engine," as NVIDIA senior researcher Jim Fan pointed out。Sora not only generates images, but also determines the physical properties of objects in its environment based on these calculations and generates videos.。
To generate a video using Sora, the user simply enters a few sentences as prompts, much like an AI image generator。You can choose a realistic photo style or an animated style that produces stunning results in just a few minutes。
Sora is a diffuse model, meaning it works by starting with a blurry, static-filled video and gradually smoothing it out into the refined version you see below。Midjourney and Stable Diffusion's image and video generators are also diffusion models。
However, I must point out that the Sora of OpenAI is much better。Sora-generated videos are longer, more dynamic, and fit together more smoothly than competitors。Sora feels like it's creating a real video, while a rival model feels like a stop-motion animation of an AI image.。OpenAI has once again sparked another battle in the field of artificial intelligence with a video generator, dwarfing competitors。
The video generated by Sora is undeniably incredible。These videos, if produced by a real film production team or animators, can take hours。Sora is likely to have a disruptive impact on the film industry, just as ChatGPT and AI image generators have shocked the editing and design world.。It is an amazing but worrying technology, which is a double challenge for the job security of video creators。
OpenAI says there are still issues that need to be addressed, including not understanding cause and effect.。Sora may generate a video of a person eating a cookie, but after that, the cookie may not have a bite mark。OpenAI also said the model lacked spatial awareness.。It can confuse the left and right, not understanding how a person or object interacts with the scene.。
Security is also a major concern, especially given that AI technology has been misused to create deep fake videos in recent months.。OpenAI said it would build tools to help detect misleading content, as well as apply existing technology to reject harmful text prompts.。However, given how people circumvent the protections of current AI models, it is doubtful that these efforts will succeed.。
Disclaimer: The views in this article are from the original author and do not represent the views or position of Hawk Insight. The content of the article is for reference, communication and learning only, and does not constitute investment advice. If it involves copyright issues, please contact us for deletion.