Published 2 years ago • loading... • Updated 2 years ago

Harvard and Google to release 1 million public-domain books as AI training dataset

Harvard and Google will release 1 million public-domain books as an AI training dataset.
The collection will include books from various genres and topics.
This initiative aims to enhance machine learning models.
Public access to the dataset is expected to foster innovation.

Insights by Ground AI

13 Articles

Harvard opens access to 1 million books for training AI models

Harvard’s Library Innovation Lab lets everyone use 1 million books for AI training under its Institutional Data Initiative (IDI). The educational institution explains it will allow the world to benefit from these collections that it has preserved for years. More importantly, these books will help build the world’s AI future by training AI models with quality information. Why Harvard encourages AI training Harvard Law Today reported that the uni…

2 years ago·Manila, Philippines (the)

Read Full Article

Gizmodo

Lean Left

Harvard Makes 1 Million Books Available to Train AI Models

The dataset includes books that are in the public domain and no longer protected by copyright.

2 years ago·New York, United States

Read Full Article

TechCrunch

Center

Harvard and Google to release 1 million public-domain books as AI training dataset

AI training data has a big price tag, one best-suited for deep-pocketed tech firms. This is why Harvard University plans to release a dataset that includes in the region of 1 million public-domain books, spanning genres, languages, and authors including Dickens, Dante, and Shakespeare, which are no longer copyright-protected due to their age. The new […] © 2024 TechCrunch. All rights reserved. For personal use only.

2 years ago·United States

Read Full Article

Wired

+3 Reposted by 3 other sources

Lean Left

Harvard Is Releasing a Massive Free AI Training Dataset Funded by OpenAI and Microsoft

The project’s leader says that allowing everyone to access the collection of public-domain books will help “level the playing field” in the AI industry.

2 years ago·United States

Read Full Article

stephenslighthouse.com

Harvard Is Releasing a Massive Free AI Training Dataset Funded by OpenAI and Microsoft - Stephen's Lighthouse

Harvard Is Releasing a Massive Free AI Training Dataset Funded by OpenAI and Microsoft The project’s leader says that allowing everyone to access the collection of public-domain books will help “level the playing field” in the AI industry. https://www.wired.com/story/harvard-ai-training-dataset-openai-microsoft/ 0 Shares Facebook …

2 years ago

Read Full Article

Cryptopolitan

Google and Harvard debut dataset with 1m books

Harvard University, in conjunction with Google, has released a dataset of a million public domain books to train the next generation of AI. The books span genres, languages, and authors such as Dickens, Dante, and Shakespeare which are no longer copyright protected because of their age. The new dataset initiative comes as AI training data is naturally pricey and best suited for tech firms with deep pockets. Harvard got financial backing from tec…

2 years ago

Read Full Article

Think freely.Subscribe and get full access to Ground NewsSubscriptions start at $9.99/year

4th of July SaleGet 40% off Vantage subscriptions for yourself or a friend.Get Started

Coverage Details

Total News Sources13

Leaning Left3Leaning Right0Center1Last Updated2 years agoBias Distribution

75% Left

Bias Distribution

75% of the sources lean Left

75% Left

Untracked bias

Factuality

To view factuality data please Upgrade to Premium

Ownership

To view ownership data please Upgrade to Vantage

Wired broke the news in United States 2 years ago on Wednesday, December 11, 2024.

Sources are mostly out of (0)

Harvard and Google to release 1 million public-domain books as AI training dataset

13 Articles

13 Articles

Harvard opens access to 1 million books for training AI models

Harvard Makes 1 Million Books Available to Train AI Models

Harvard and Google to release 1 million public-domain books as AI training dataset

Harvard Is Releasing a Massive Free AI Training Dataset Funded by OpenAI and Microsoft

Harvard Is Releasing a Massive Free AI Training Dataset Funded by OpenAI and Microsoft - Stephen's Lighthouse

Google and Harvard debut dataset with 1m books

Coverage Details

Bias Distribution

Factuality

Ownership

Similar News Topics

Similar News Topics

Harvard and Google to release 1 million public-domain books as AI training dataset

13 Articles

13 Articles

Harvard opens access to 1 million books for training AI models

Harvard Makes 1 Million Books Available to Train AI Models

Harvard and Google to release 1 million public-domain books as AI training dataset

Harvard Is Releasing a Massive Free AI Training Dataset Funded by OpenAI and Microsoft

Harvard Is Releasing a Massive Free AI Training Dataset Funded by OpenAI and Microsoft - Stephen's Lighthouse

Google and Harvard debut dataset with 1m books

Coverage Details

Bias Distribution Too Big Arrow Icon

Factuality Info Icon

Ownership

Similar News Topics

Similar News Topics

Bias Distribution

Factuality