Svelte Hacker News logo
  • top
  • new
  • show
  • ask
  • jobs
  • about

Ai2 Dolma: 3T token open corpus for language model pretraining (2023)

allenai.org

1 points by tosh 6 hours ago