Japan determines copyright doesn't apply to LLM/ML training data

ericjmorey@programming.dev · 10 months ago

Japan determines copyright doesn't apply to LLM/ML training data

Akisamb@programming.dev · 10 months ago

Hard to say from the article only, but if it is like the status quo in the EU and USA, then only the training data can be illegally obtained. If I have an AI that is able to say verbatim the script of the Bee movie, I will be sued.

Google books had a similar issue. They scanned pretty much all the books in existence and indexed them. Small issue they did not obtain the consent of the copyright holders before doing this. They were sued and won. You can use copyrighted data as long as you do not provide Access to it.

ericjmorey@programming.dev · 10 months ago

Seems like Japan is deciding to break from the status quo.

Japan determines copyright doesn't apply to LLM/ML training data

Japan determines copyright doesn't apply to LLM/ML training data

Taggart :donor: (@mttaggart)