Qichwabase: Difference between revisions

From Qichwabase
Jump to navigation Jump to search
(copying content to Qichwabase)
 
(adding more info and references)
Line 1: Line 1:
{{#ev:youtube|https://www.youtube.com/watch?v=3slzBnISPAk|480x320|right|Qichwabase as a Wikibase instance|frame|}}
{{#ev:youtube|https://www.youtube.com/watch?v=3slzBnISPAk|480x320|right|Qichwabase as a Wikibase instance|frame|}}


Welcome to '''Qichwabase''', a wikibase instance hosted on '''[https://wikibase.cloud wikibase.cloud]'''. Qichwabase is product of a miniproject worked on at '''[https://datathon2022.linkeddata.es/ SD-LLOD-22]''' in June 2022, where it was awarded the '''Best Project Prize'''. Main goal is to model [https://www.wikidata.org/wiki/Q5218 Quechua language] lexical data as Wikibase lexemes collection, for transfer to Wikidata, as soon the dataset reaches the envisaged quality.
Welcome to '''Qichwabase''', a wikibase instance hosted on '''[https://wikibase.cloud wikibase.cloud]'''. It aims to be a knowledge base for the Quechua language and community. It is a collaborative project that is being developed by a team of researchers and volunteers. The main goal of Qichwabase is to model [https://www.wikidata.org/wiki/Q5218 Quechua language] lexical data as Wikibase lexemes collection, for transfer to Wikidata, as soon the dataset reaches the envisaged quality.  


We have started modeling open Quechua lexical data from [https://runasimi.de/runadeut.htm Runasimi Dictionary], and plan to include data from other sources, such as attestations and frequency information from Quechua webcorpora.
Qichwabase is still under development, but it already contains a significant amount of knowledge, we have started modeling open Quechua lexical data from [https://runasimi.de/runadeut.htm Runasimi Dictionary], and plan to include data from other sources, such as attestations and frequency information from Quechua webcorpora<ref name="HuamanLK23">Huaman et al.: QICHWABASE: A Quechua Language and Knowledge Base for Quechua Communities, arXiv, https://arxiv.org/abs/2305.06173</ref>. Currently, Qichwabase includes:
* Over 1 million triples (or statements) about Quechua words, phrases, and concepts
* Information and examples about the etymology and usage of Quechua words
* Translations of Quechua words into other languages, e.g. English, German, Italian, and Spanish.


* '''See [https://github.com/nexuslinguarum/SD-LLOD-22_QUECHUA/tree/main/datasources source data] and [https://github.com/nexuslinguarum/SD-LLOD-22_QUECHUA/tree/main/wikibase code] at GitHub.'''
Qichwabase is a valuable resource for anyone interested in the Quechua language or culture. It can be used to learn about Quechua words and phrases, to find translations of Quechua words into other languages, and to explore its usefulness in various escenarios <ref name="HuamanHH23">Getting Quechua Closer to Final Users through Knowledge Graphs, arXiv, https://arxiv.org/abs/2208.12608</ref>, such as Question Answering, Dialogue Systems, Entity linking, Knowledge Validation, and Collaborative Community.
* '''See documentation at https://nexuslinguarum.github.io/SD-LLOD-22_QUECHUA/''', and [https://github.com/nexuslinguarum/SD-LLOD-22_QUECHUA/blob/main/Final%20Presentation.pdf presentation slides].
* '''New: Video presentations in [https://www.youtube.com/c/ElwinHuaman/videos this Youtube channel].'''


= Qichwabase Ontology =
= Qichwabase Ontology =
Line 28: Line 29:
* See lexemes that have lexical forms using this [https://qichwa.wikibase.cloud/query/#PREFIX%20qwb%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fentity%2F%3E%0APREFIX%20qdp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fdirect%2F%3E%0APREFIX%20qp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2F%3E%0APREFIX%20qps%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fstatement%2F%3E%0APREFIX%20qpq%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fqualifier%2F%3E%0APREFIX%20qpr%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Freference%2F%3E%0APREFIX%20qno%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fnovalue%2F%3E%0A%0Aselect%20%3Fentry%20%3Flemma%20%3FposLabel%20%3Fwordform%20%3FfeatureLabel%0Awhere%20%7B%0A%0A%3Fentry%20a%20ontolex%3ALexicalEntry%3B%20%0A%20%20%20%20%20%20%20wikibase%3Alemma%20%3Flemma%3B%0A%20%20%20%20%20%20%20wikibase%3AlexicalCategory%20%5Brdfs%3Alabel%20%3FposLabel%5D%20%3B%20%0A%20%20%20%20%20%20%20ontolex%3AlexicalForm%20%3Fform%20.%0A%20%20%20%20%20%20%20%3Fform%20ontolex%3Arepresentation%20%3Fwordform%20.%0A%20%20%20%20%20%20%20optional%7B%3Fform%20wikibase%3AgrammaticalFeature%20%3Ffeat.%0A%20%20%20%20%20%20%20%3Ffeat%20rdfs%3Alabel%20%3FfeatureLabel%20.%20filter%28lang%28%3FfeatureLabel%29%3D%22en%22%29%7D%0A%20%20%20%20%20%20%20filter%28lang%28%3FposLabel%29%3D%22en%22%29%20%0A%7D query].
* See lexemes that have lexical forms using this [https://qichwa.wikibase.cloud/query/#PREFIX%20qwb%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fentity%2F%3E%0APREFIX%20qdp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fdirect%2F%3E%0APREFIX%20qp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2F%3E%0APREFIX%20qps%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fstatement%2F%3E%0APREFIX%20qpq%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fqualifier%2F%3E%0APREFIX%20qpr%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Freference%2F%3E%0APREFIX%20qno%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fnovalue%2F%3E%0A%0Aselect%20%3Fentry%20%3Flemma%20%3FposLabel%20%3Fwordform%20%3FfeatureLabel%0Awhere%20%7B%0A%0A%3Fentry%20a%20ontolex%3ALexicalEntry%3B%20%0A%20%20%20%20%20%20%20wikibase%3Alemma%20%3Flemma%3B%0A%20%20%20%20%20%20%20wikibase%3AlexicalCategory%20%5Brdfs%3Alabel%20%3FposLabel%5D%20%3B%20%0A%20%20%20%20%20%20%20ontolex%3AlexicalForm%20%3Fform%20.%0A%20%20%20%20%20%20%20%3Fform%20ontolex%3Arepresentation%20%3Fwordform%20.%0A%20%20%20%20%20%20%20optional%7B%3Fform%20wikibase%3AgrammaticalFeature%20%3Ffeat.%0A%20%20%20%20%20%20%20%3Ffeat%20rdfs%3Alabel%20%3FfeatureLabel%20.%20filter%28lang%28%3FfeatureLabel%29%3D%22en%22%29%7D%0A%20%20%20%20%20%20%20filter%28lang%28%3FposLabel%29%3D%22en%22%29%20%0A%7D query].
* See distribution of dialectal lemma variants (bar chart): [https://qichwa.wikibase.cloud/query/#%23defaultView%3ABarChart%0APREFIX%20qp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2F%3E%0APREFIX%20qps%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fstatement%2F%3E%0APREFIX%20qpq%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fqualifier%2F%3E%0A%0ASELECT%20%3Fdialect%20%3FdialectLabel%20%28count%20%28%3Fvariant%29%20as%20%3Fvariants%29%0AWHERE%20%7B%20%3Flemma%20qp%3AP16%20%5B%20qps%3AP16%20%3Fvariant%20%3B%20qpq%3AP17%20%3Fdialect%20%5D%20.%0A%20%20%20%20%20%20%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22en%22.%20%7D%0A%20%20%20%20%20%20%7D%0AGROUP%20BY%20%3Fdialect%20%3FdialectLabel%20%3Fvariants%20ORDER%20BY%20DESC%28%3Fvariants%29 query].
* See distribution of dialectal lemma variants (bar chart): [https://qichwa.wikibase.cloud/query/#%23defaultView%3ABarChart%0APREFIX%20qp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2F%3E%0APREFIX%20qps%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fstatement%2F%3E%0APREFIX%20qpq%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fqualifier%2F%3E%0A%0ASELECT%20%3Fdialect%20%3FdialectLabel%20%28count%20%28%3Fvariant%29%20as%20%3Fvariants%29%0AWHERE%20%7B%20%3Flemma%20qp%3AP16%20%5B%20qps%3AP16%20%3Fvariant%20%3B%20qpq%3AP17%20%3Fdialect%20%5D%20.%0A%20%20%20%20%20%20%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22en%22.%20%7D%0A%20%20%20%20%20%20%7D%0AGROUP%20BY%20%3Fdialect%20%3FdialectLabel%20%3Fvariants%20ORDER%20BY%20DESC%28%3Fvariants%29 query].
= Origins of Qichwabase =
Qichwabase is product of a miniproject worked on at '''[https://datathon2022.linkeddata.es/ SD-LLOD-22]''' in June 2022, where it was awarded the '''Best Project Prize'''. Main goal is to model [https://www.wikidata.org/wiki/Q5218 Quechua language] lexical data as Wikibase lexemes collection, for transfer to Wikidata, as soon the dataset reaches the envisaged quality.
* '''See [https://github.com/nexuslinguarum/SD-LLOD-22_QUECHUA/tree/main/datasources source data] and [https://github.com/nexuslinguarum/SD-LLOD-22_QUECHUA/tree/main/wikibase code] at GitHub.'''
* '''See documentation at https://nexuslinguarum.github.io/SD-LLOD-22_QUECHUA/''', and [https://github.com/nexuslinguarum/SD-LLOD-22_QUECHUA/blob/main/Final%20Presentation.pdf presentation slides].
* '''New: Video presentations in [https://www.youtube.com/c/ElwinHuaman/videos this Youtube channel].'''
==References==

Revision as of 10:40, 1 September 2023

Qichwabase as a Wikibase instance

Welcome to Qichwabase, a wikibase instance hosted on wikibase.cloud. It aims to be a knowledge base for the Quechua language and community. It is a collaborative project that is being developed by a team of researchers and volunteers. The main goal of Qichwabase is to model Quechua language lexical data as Wikibase lexemes collection, for transfer to Wikidata, as soon the dataset reaches the envisaged quality.

Qichwabase is still under development, but it already contains a significant amount of knowledge, we have started modeling open Quechua lexical data from Runasimi Dictionary, and plan to include data from other sources, such as attestations and frequency information from Quechua webcorpora[1]. Currently, Qichwabase includes:

  • Over 1 million triples (or statements) about Quechua words, phrases, and concepts
  • Information and examples about the etymology and usage of Quechua words
  • Translations of Quechua words into other languages, e.g. English, German, Italian, and Spanish.

Qichwabase is a valuable resource for anyone interested in the Quechua language or culture. It can be used to learn about Quechua words and phrases, to find translations of Quechua words into other languages, and to explore its usefulness in various escenarios [2], such as Question Answering, Dialogue Systems, Entity linking, Knowledge Validation, and Collaborative Community.

Qichwabase Ontology

  • See our self-defined Ontology classes and their instances using this query. The Ontology Classes listed here do not include the Ontolex core classes used by default in a wikibase.
  • See our self-defined Ontology properties on this list. The properties listed here do not include the wikibase default properties used for lexicographical data.
  • See the ontological relations between items describing lexical categories using this query.
  • See Quechua varieties (dialects) as described in Qichwabase using this query.
  • For how lexicographical data is represented in a wikibase, see the documentation pages at Wikidata.

What's in Qichwabase?

See also SPARQL queries page.

  • See Quechua lexemes with lemma and pos using this query.
  • See lexemes with senses and multilingual sense descriptions using this query.
  • See a bar chart of POS distribution (fine-grained categories) using this query.
  • See a bar chart of POS distribution (broader categories) using this query.
  • See lexemes that have usage examples, together with the example source references using this query.
  • See lexemes that have wikidata alignment, and retrieve translation equivalents from Wikidata using this federated query.
  • See lexemes that have lexical forms using this query.
  • See distribution of dialectal lemma variants (bar chart): query.

Origins of Qichwabase

Qichwabase is product of a miniproject worked on at SD-LLOD-22 in June 2022, where it was awarded the Best Project Prize. Main goal is to model Quechua language lexical data as Wikibase lexemes collection, for transfer to Wikidata, as soon the dataset reaches the envisaged quality.

References

  1. Huaman et al.: QICHWABASE: A Quechua Language and Knowledge Base for Quechua Communities, arXiv, https://arxiv.org/abs/2305.06173
  2. Getting Quechua Closer to Final Users through Knowledge Graphs, arXiv, https://arxiv.org/abs/2208.12608