Nvidia just dropped a bomb: its up-to-date AI model is open, massive and ready to compete with GPT-4

Share

Join our daily and weekly newsletters to receive the latest updates and exclusive content on our industry-leading AI coverage. Find out more

Nvidia has released a powerful open-source AI model that competes with the proprietary systems of industry leaders such as OpenAI and Google.

The company is up-to-date NVLM 1.0 a family of vast multimodal language models, headed by the parameter 72 billion NVLM-D-72Bdemonstrates exceptional performance in visual and language tasks while increasing text-only capabilities.

“We are introducing NVLM 1.0, a family of pioneering multi-modal large-language models that achieve state-of-the-art performance on vision-language tasks, competing with leading proprietary models (e.g. GPT-4o) and open access models,” researchers explain in their paper.

By making model weights publicly available and promising release training codeNvidia is breaking with the trend of shutting down advanced AI systems. This decision provides scientists and developers with unprecedented access to cutting-edge technologies.

Benchmark results comparing NVIDIA’s NVLM-D against AI giants such as GPT-4, Claude 3.5, and Llama 3-V, showing NVLM-D’s competitive performance across a variety of visual and language tasks. (Source: arxiv.org)

NVLM-D-72B: Versatile visual and text task performer

The NVLM-D-72B model demonstrates impressive adaptability in processing sophisticated visual and text inputs. The researchers provided examples highlighting the model’s ability to interpret memes, analyze images, and solve math problems step by step.

It is worth noting that NVLM-D-72B improves its performance on word problems after multimodal training. While many similar models experience a drop in text performance, the NVLM-D-72B improved its accuracy by an average of 4.3 points in key text benchmarks.

“Our NVLM-D-1.0-72B shows significant improvement over its text framework on text-only math and coding benchmarks,” the researchers note, highlighting a key advantage of their approach.

NVIDIA’s up-to-date AI model analyzes meme by comparing academic abstracts to full articles, demonstrating its ability to interpret visual humor and scientific concepts. (Source: arxiv.org)

Artificial intelligence researchers respond to Nvidia’s open source initiative

The AI community responded positively to this publication. One artificial intelligence researcher commenting on social media remarked, “Wow! Nvidia just released the 72B which is ~ on par with the llama 3.1 405B in math and coding ratings, and also has vision?

Nvidia’s decision to release such a powerful model could accelerate artificial intelligence research and development across the field. By providing access to a model that can compete with the proprietary systems of well-funded technology companies, Nvidia can enable smaller organizations and independent researchers to make greater contributions to the development of artificial intelligence.

The NVLM project also introduces novel architectural designs, including a hybrid approach that combines various multimodal processing techniques. These developments may shape the direction of future research in this area.

NVLM 1.0: A up-to-date chapter in open source AI development

Nvidia’s release of NVLM 1.0 marks a pivotal moment in the development of artificial intelligence. By open-sourcing a model that rivals the proprietary giants, Nvidia is not only releasing code, but challenging the very structure of the AI industry.

This move could trigger a chain reaction. Other tech leaders may feel pressure to open up their research, potentially accelerating overall AI progress. It also levels the playing field, enabling smaller teams and researchers to innovate with tools once reserved for tech giants.

However, the release of NVLM 1.0 is not without risks. As powerful AI becomes more accessible, concerns about misuse and ethical implications are likely to escalate. The AI community currently faces the sophisticated task of promoting innovation while establishing barriers to responsible operate.

Nvidia’s decision also raises questions about the future of AI business models. If cutting-edge models become freely available, companies will need to rethink how they create value and maintain a competitive advantage in AI.

The true impact of NVLM 1.0 will unfold in the coming months and years. It could usher in an era of unprecedented collaboration and innovation in artificial intelligence. It may also force us to reckon with the unintended consequences of widely available, advanced artificial intelligence.

One thing is certain: Nvidia has fired a shot across the bow of the AI industry. The question now is not whether the landscape will change, but how radically – and who will adapt quickly enough to thrive in the up-to-date world of open AI.

VB every day

Stay up to date! Get the latest news in your inbox every day

By subscribing, you agree to VentureBeat’s Terms of Service.

Thank you for subscribing. Find more VB newsletters here.

An error occurred.

The AI Sckool

Categories

Nvidia just dropped a bomb: its up-to-date AI model is open, massive and ready to compete with GPT-4

NVLM-D-72B: Versatile visual and text task performer

Artificial intelligence researchers respond to Nvidia’s open source initiative

NVLM 1.0: A up-to-date chapter in open source AI development

Microsoft cuts off access to Bing search data because it moves to chatbots

Google uses artificial intelligence to see fraud and investment fraud

Duke’s main nurse shows how VR and AI change nursing in the healthcare system

Agent AI Google Deepmind dreams of algorithms beyond human knowledge

The study shows that models in the language of vision cannot support inquiries using negative words

More News

Microsoft cuts off access to Bing search data because it moves to chatbots

Agent AI Google Deepmind dreams of algorithms beyond human knowledge

Airbnb is in middle -aged crisis mode

The modern GM battery technology can be a breakthrough for inexpensive EV

Microsoft cuts off access to Bing search data because it moves to chatbots

Google uses artificial intelligence to see fraud and investment fraud

Duke’s main nurse shows how VR and AI change nursing in the healthcare system