Luca Soldaini
OLMo: Accelerating the Science of Open Language Models
Recently, we have seen a tremendous pace of progress in the field of language models (LMs), with the release of many open models and closed API systems. However, fewer and fewer of them disclose how they are created: Which corpora do they use? How are they trained? How much energy do they consume? In this talk, I am going to provide an overview of OLMo (https://allenai.org/olmo), an initiative at AI2 to create transparent LMs that advance the science of LLMs. I will discuss current releases, such as Tulu, Dolma, and OLMo-7b; the goals of this initiative and its ethical and legal considerations; and what’s coming next.