Results for ""
Stability AI, the firm behind the AI-powered Stable Diffusion image generator, has published StableLM, an open-source suite of large language models (LLMs). In addition, the company revealed in a blog post that its models are now available for developers to use and change on GitHub.
“Artificial intelligence and large-scale models should be open to the public, and only when the threshold is so low that everyone can use them conveniently, can there be a real large-scale outbreak of creativity.” – Wu Tian, Vice President of Baidu
Stability AI was the driving force behind the 2022 public release of Stable Diffusion, a ground-breaking picture model that is a transparent, adaptable, and open replacement for proprietary AI. With the introduction of the StableLM family of models, Stability AI is advancing the availability of fundamental AI technology to all users. Their models can produce text and code and will power various applications further down the line. In addition, they show how practical training can enable compact, efficient models to achieve outstanding performance.
The release of StableLM extends the open-sourcing of previous language models by the non-profit research centre EleutherAI. These language models consist of the Pile open-source dataset-trained GPT-J, GPT-NeoX, and Pythia suite. Recent open-source language models, such as Cerebras-GPT and Dolly-2, continue to build on these initiatives.
The experimental dataset used to train StableLM is three times the size of the Pile and contains 1.5 trillion tokens of content. According to the source, the researchers will reveal the dataset's specifics in due time. The richness of this dataset enables StableLM to perform remarkably well in conversational and coding tasks.
Like its competitor ChatGPT, StableLM is made to make text and code quickly. It was trained on a bigger version of the open-source dataset called the Pile, which includes information from many sites like Wikipedia, Stack Exchange, and PubMed.
Furthermore, While StableLM relies on the open-source language models developed by Stability AI in conjunction with the organization EleutherAI, it also continues Stability AI's objective to make AI tools more accessible, as it did with Stable Diffusion.
The models are now accessible in their repository on GitHub.
Image source: Unsplash