LAION-5B

LAION-5B is a very large, openly available dataset of roughly 5.85 billion image-caption pairs scraped from the internet, designed for training large AI models.

It was released in 2022 by LAION, a German non-profit organization.

LAION-5B is what we call a "foundation dataset" for generative artificial intelligence.

Training a model on LAION-5B is meant to give it a comprehensive representation of the world: a kind of vocabulary of things and concepts.
