Safe labelling

Returning to a previous topic (here and here), namely how to use a language without having one's mistakes consumed and regurgitated by LLM crawlers: one way ahead may be to focus on the Gàidhlig news articles reposted from the BBC on togblog.

Wikidata is particularly lacking in gd labels for terminology. This may be addressed by identifying terms mentioned in a news story, checking them on Wikidata, and creating any missing gd labels.

This should be safe, since the terms come from published articles and so one isn't inventing terms not already in use, and it helps build out the missing information for more creative future use.
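
The checking step can be partly automated. What follows is only a rough sketch in Python, assuming the Wikidata item IDs for the terms in a story have already been looked up by hand. It asks the public Wikidata API (the wbgetentities action) for the labels of those items and reports which ones have no gd label yet. The function names, the User-Agent string and the choice of en as a second language are my own placeholders, not anything Wikidata prescribes.

```python
import json
import urllib.parse
import urllib.request

API_URL = "https://www.wikidata.org/w/api.php"


def missing_label_ids(entities, lang="gd"):
    """Return the IDs of entities that have no label in `lang`.

    `entities` is the "entities" mapping from a wbgetentities response,
    e.g. {"Q1": {"labels": {"en": {"language": "en", "value": "..."}}}}.
    """
    return [
        qid for qid, entity in entities.items()
        if lang not in entity.get("labels", {})
    ]


def fetch_labels(qids, langs=("gd", "en")):
    """Fetch labels for the given item IDs (this makes a network call)."""
    params = urllib.parse.urlencode({
        "action": "wbgetentities",
        "ids": "|".join(qids),
        "props": "labels",
        "languages": "|".join(langs),
        "format": "json",
    })
    request = urllib.request.Request(
        f"{API_URL}?{params}",
        # Placeholder User-Agent; replace with something that identifies you.
        headers={"User-Agent": "gd-label-check-sketch/0.1 (example)"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)["entities"]
```

Calling missing_label_ids(fetch_labels([...])) with a list of item IDs would give the items still needing a gd label. The sketch only finds the gaps; the labels themselves would still be added by hand on Wikidata, using the term exactly as it appears in the article.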


Author: admin

Mastodon account where these were first posted: link