Join our daily and weekly newsletters to get the latest updates and exclusive content regarding the leading scope of artificial intelligence. Learn more
Bytedance Scientists have developed an AI system that transforms individual photos into realistic films of people talking, singing and naturally moving – a breakthrough that can transform digital entertainment and communication.
Modern system, called OmnihumanIt generates films from the whole body that show people gesticine and moving in a way that suited to their speech, exceeding previous AI models that could Animate the face or upper bodies.
How Omnihuman uses 18,700 hours of training data to create a realistic movement
“In recent years, a comprehensive human animation has been significantly progressed,” Bytedance researchers wrote Article published on ARXIV. “However, existing methods still have difficulty increasing the scale as large models of video generation, limiting their potential in real applications”,
The team has trained Omnihuman for over 18,700 human video data using an inventive approach that combines many types of input data – text, audio and body movements. This “Omni-Donditions” training strategy allows artificial intelligence to learn on much larger and more diverse data sets than previous methods.
AI video generation breakthrough shows the movement of the whole body and natural gestures
“Our key insight is that the inclusion of many conditioning signals, such as text, audio and beyond, during training can significantly reduce the waste of data,” explained the research team.
Technology is a significant progress in the media generated by AI, showing the possibilities from creating films of people giving speeches to presenting items playing musical instruments. When testing, Omnihuman exceeded the existing systems in many quality comparative tests.
Technology giants are racing to develop a up-to-date generation AI systems
Development appears in connection with the intensification of competition in generating AI video, and companies such as companies GoogleIN Finish AND Microsoft Striving for similar technologies. The breakthrough of Bytedance can give his home company Tiktok an advantage in this rapidly developing field.
Industry experts say such technology can transform entertainment production, creating educational content and digital communication. However, it also raises concerns about the potential improper utilize in creating synthetic media for dishonest purposes.
Scientists will present their discoveries at the upcoming computer vision conference, although they have not yet determined when or which.