Qichwabase

Revision as of 10:40, 1 September 2023 by Elwinlhq (adding more info and references)
Qichwabase as a Wikibase instance

Welcome to Qichwabase, a Wikibase instance hosted on wikibase.cloud. It aims to be a knowledge base for the Quechua language and community, and is developed collaboratively by a team of researchers and volunteers. The main goal of Qichwabase is to model Quechua lexical data as a collection of Wikibase lexemes, for transfer to Wikidata as soon as the dataset reaches the envisaged quality.

Qichwabase is still under development, but it already contains a significant amount of knowledge. We have started modeling open Quechua lexical data from the Runasimi Dictionary and plan to include data from other sources, such as attestations and frequency information from Quechua web corpora [1]. Currently, Qichwabase includes:

  • Over 1 million triples (or statements) about Quechua words, phrases, and concepts
  • Information and examples about the etymology and usage of Quechua words
  • Translations of Quechua words into other languages, e.g. English, German, Italian, and Spanish.

Qichwabase is a valuable resource for anyone interested in the Quechua language or culture. It can be used to learn about Quechua words and phrases, to find translations of Quechua words into other languages, and to explore its usefulness in various scenarios [2], such as question answering, dialogue systems, entity linking, knowledge validation, and community collaboration.

Qichwabase Ontology

  • See our self-defined ontology classes and their instances using this query. The classes listed there do not include the Ontolex core classes that a Wikibase uses by default.
  • See our self-defined ontology properties on this list. The properties listed there do not include the default Wikibase properties used for lexicographical data.
  • See the ontological relations between items describing lexical categories using this query.
  • See Quechua varieties (dialects) as described in Qichwabase using this query.
  • For how lexicographical data is represented in a Wikibase, see the documentation pages at Wikidata.
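As a rough orientation, the general Wikibase lexeme model (a lemma, a language, a lexical category, plus forms and senses) can be sketched as plain Python data classes. This is only an illustration of that general model, not the Qichwabase schema itself: the class and field names below are hypothetical, and the self-defined Qichwabase classes and properties are not represented.

```python
from dataclasses import dataclass, field


@dataclass
class Form:
    """One lexical form: a written representation plus grammatical features."""
    representation: str
    grammatical_features: list[str] = field(default_factory=list)


@dataclass
class Sense:
    """One sense, with glosses keyed by language code (e.g. "en", "es")."""
    glosses: dict[str, str] = field(default_factory=dict)


@dataclass
class Lexeme:
    """Hypothetical in-memory mirror of a Wikibase lexeme."""
    lemma: str
    language: str
    lexical_category: str
    forms: list[Form] = field(default_factory=list)
    senses: list[Sense] = field(default_factory=list)

    def gloss_languages(self) -> set[str]:
        """Return every language code that has a gloss on any sense."""
        return {lang for sense in self.senses for lang in sense.glosses}
```

A structure like this may be convenient when post-processing query results, for example to check which lexemes still lack glosses in a given language.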

What's in Qichwabase?

See also SPARQL queries page.

  • See Quechua lexemes with lemma and part of speech (POS) using this query.
  • See lexemes with senses and multilingual sense descriptions using this query.
  • See a bar chart of POS distribution (fine-grained categories) using this query.
  • See a bar chart of POS distribution (broader categories) using this query.
  • See lexemes that have usage examples, together with the example source references using this query.
  • See lexemes that are aligned to Wikidata, and retrieve translation equivalents from Wikidata, using this federated query.
  • See lexemes that have lexical forms using this query.
  • See distribution of dialectal lemma variants (bar chart): query.
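The queries linked above can also be run programmatically. The following is a minimal sketch, not the project's own query: it builds a SPARQL query for lexemes with lemma and lexical category using the standard Wikibase RDF vocabulary (`wikibase:lemma`, `wikibase:lexicalCategory`, `ontolex:LexicalEntry`), encodes it into a request URL, and counts POS values from a result in the standard SPARQL JSON format. The endpoint host is a placeholder and the `/query/sparql` path is an assumption about wikibase.cloud instances; both should be checked against the live site. The function names are hypothetical.

```python
from collections import Counter
from urllib.parse import urlencode

# Placeholder endpoint (assumption): replace the host with the real instance
# and confirm the SPARQL path before use.
ENDPOINT = "https://example.wikibase.cloud/query/sparql"


def build_lexeme_query(limit: int = 100) -> str:
    """Return a SPARQL query selecting lexemes with lemma and lexical category."""
    if limit < 1:
        raise ValueError("limit must be positive")
    return f"""
PREFIX wikibase: <http://wikiba.se/ontology#>
PREFIX ontolex: <http://www.w3.org/ns/lemon/ontolex#>
SELECT ?lexeme ?lemma ?pos WHERE {{
  ?lexeme a ontolex:LexicalEntry ;
          wikibase:lemma ?lemma ;
          wikibase:lexicalCategory ?pos .
}}
LIMIT {limit}
""".strip()


def build_request_url(query: str, endpoint: str = ENDPOINT) -> str:
    """Encode the query as a GET request URL asking for JSON results."""
    return endpoint + "?" + urlencode({"query": query, "format": "json"})


def pos_distribution(result: dict) -> Counter:
    """Count lexical categories in a SPARQL JSON result (results.bindings)."""
    bindings = result["results"]["bindings"]
    return Counter(row["pos"]["value"] for row in bindings if "pos" in row)
```

The URL returned by `build_request_url` can be fetched with any HTTP client; passing the decoded JSON response to `pos_distribution` yields the counts that the bar-chart queries visualize. Note that `?pos` values are item URIs unless the query also retrieves labels.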

Origins of Qichwabase

Qichwabase is the product of a mini-project carried out at SD-LLOD-22 in June 2022, where it was awarded the Best Project Prize. Its main goal is to model Quechua lexical data as a collection of Wikibase lexemes, for transfer to Wikidata as soon as the dataset reaches the envisaged quality.

References

  1. Huaman et al.: QICHWABASE: A Quechua Language and Knowledge Base for Quechua Communities, arXiv, https://arxiv.org/abs/2305.06173
  2. Getting Quechua Closer to Final Users through Knowledge Graphs, arXiv, https://arxiv.org/abs/2208.12608