Back to home
Technology

OpenAI has finally released open-weight language models

Source

MIT Technology Review

Published

TL;DR

AI Generated

OpenAI has released new open-weight large language models called "gpt-oss," available in two sizes and comparable to existing models on benchmarks. These models can be downloaded, run, and modified locally, unlike those on OpenAI's web interface. The release aims to cater to users frustrated by the lack of open models and to compete with Chinese models gaining popularity. OpenAI's move aligns with the US government's emphasis on open models and could help reestablish its position in the AI landscape. The models are released under a permissive license, facilitating customization and research.

Read Full Article

Similar Articles

MIT Technology Review

The Download: a new Christian phone network, and debugging LLMs

A new US phone network for Christians is launching, blocking porn and gender-related content with network-level controls. Goodfire, a San Francisco startup, released Silico, a tool for debugging AI models by allowing users to adjust parameters during training. The National Science Foundation faced mass firings, impacting US science funding and governance. China's AI labs are releasing open-source models, challenging the traditional Silicon Valley approach. Elon Musk admitted using OpenAI models for xAI training, sparking debate on AI ethics and practices.

MIT Technology Review
SemiEngineering

Silent Data Corruption: A Major Reliability Challenge in Large-Scale LLM Training (TU Berlin)

Researchers at Technische Universitat Berlin published a technical paper on the challenges of Silent Data Corruption (SDC) in Large Language Model (LLM) training. As LLMs grow in size, hardware-induced faults like SDC can bypass detection mechanisms, leading to severe consequences during training. The study explores how intermittent SDC impacts LLM pretraining, highlighting the sensitivity of different factors like bit positions and kernel functions. The research proposes a lightweight detection method to identify harmful parameter updates and demonstrates the effectiveness of recomputing training steps upon detection in mitigating corruption.

SemiEngineering
MIT Technology Review

The Download: OpenAI is building a fully automated researcher, and a psychedelic trial blind spot

OpenAI is embarking on a new challenge to create a fully automated AI researcher capable of solving complex problems independently. The company plans to unveil an autonomous AI research intern by September, leading up to a multi-agent system in 2028. Meanwhile, studies reveal the challenges in researching psychedelic drugs despite growing interest in their therapeutic potential. OpenAI is also developing a "super app" by merging ChatGPT, a web browser, and a coding tool, while making strategic moves in the enterprise market.

MIT Technology Review
MIT Technology Review

OpenAI is throwing everything into building a fully automated researcher

OpenAI is shifting its focus to building an AI researcher, aiming to create a fully automated system capable of tackling complex problems independently. The company plans to develop an autonomous AI research intern by September, leading to a multi-agent research system by 2028. OpenAI's chief scientist, Jakub Pachocki, believes in the potential of AI models to work autonomously for extended periods, with the goal of applying AI tools to real-world problem-solving. However, concerns about the risks and ethical implications of autonomous AI systems remain, prompting discussions on oversight and control mechanisms.

MIT Technology Review

We use cookies

We use cookies to ensure you get the best experience on our website. For more information on how we use cookies, please see our cookie policy.