Qichwabase: Difference between revisions
No edit summary |
No edit summary |
||
(2 intermediate revisions by the same user not shown) | |||
Line 5: | Line 5: | ||
Qichwabase is a valuable resource for anyone interested in the Quechua language and knowledge. It can be used to learn about Quechua words and phrases, to find translations of Quechua words into other languages, and to explore its usefulness in various escenarios <ref name="HuamanHH23">Getting Quechua Closer to Final Users through Knowledge Graphs, arXiv, https://arxiv.org/abs/2208.12608</ref>, such as Question Answering, Dialogue Systems, Entity linking, Knowledge Validation, and Collaborative Community. | Qichwabase is a valuable resource for anyone interested in the Quechua language and knowledge. It can be used to learn about Quechua words and phrases, to find translations of Quechua words into other languages, and to explore its usefulness in various escenarios <ref name="HuamanHH23">Getting Quechua Closer to Final Users through Knowledge Graphs, arXiv, https://arxiv.org/abs/2208.12608</ref>, such as Question Answering, Dialogue Systems, Entity linking, Knowledge Validation, and Collaborative Community. | ||
= Creation = | |||
Qichwabase is still under development, but it already contains a significant amount of knowledge, we have started modeling open Quechua lexical data from [https://runasimi.de/runadeut.htm Runasimi Dictionary], and plan to include data from other sources, such as attestations and frequency information from Quechua webcorpora<ref name="HuamanLK23">Huaman et al.: QICHWABASE: A Quechua Language and Knowledge Base for Quechua Communities, arXiv, https://arxiv.org/abs/2305.06173</ref>. Currently, Qichwabase includes: | Qichwabase is still under development, but it already contains a significant amount of knowledge, we have started modeling open Quechua lexical data from [https://runasimi.de/runadeut.htm Runasimi Dictionary], and plan to include data from other sources, such as attestations and frequency information from Quechua webcorpora<ref name="HuamanLK23">Huaman et al.: QICHWABASE: A Quechua Language and Knowledge Base for Quechua Communities, arXiv, https://arxiv.org/abs/2305.06173</ref>. Currently, Qichwabase includes: | ||
* Over 1 million triples (or statements) about Quechua words | * Over 1 million triples (or statements) about Quechua words | ||
Line 13: | Line 11: | ||
* Translations of Quechua words into other languages, e.g. English, German, Italian, and Spanish | * Translations of Quechua words into other languages, e.g. English, German, Italian, and Spanish | ||
== Classes and Properties == | |||
The Ontology Classes and Properties listed here do not include the Ontolex core classes used by default in a Wikibase. For how lexicographical data is represented in a wikibase, see the [https://www.wikidata.org/wiki/Wikidata:Lexicographical_data/Documentation documentation pages] at Wikidata. | The Ontology Classes and Properties listed here do not include the Ontolex core classes used by default in a Wikibase. For how lexicographical data is represented in a wikibase, see the [https://www.wikidata.org/wiki/Wikidata:Lexicographical_data/Documentation documentation pages] at Wikidata. | ||
* Ontology classes and their instances ([https://qichwa.wikibase.cloud/query/#PREFIX%20qwb%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fentity%2F%3E%0APREFIX%20qdp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fdirect%2F%3E%0APREFIX%20qp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2F%3E%0APREFIX%20qps%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fstatement%2F%3E%0APREFIX%20qpq%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fqualifier%2F%3E%0APREFIX%20qpr%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Freference%2F%3E%0APREFIX%20qno%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fnovalue%2F%3E%0A%0Aselect%20%3Fontology_class%20%3Fontology_classLabel%20%3Finstance%20%3FinstanceLabel%20%3Fbroader%20%3FbroaderLabel%0Awhere%20%7B%0A%0A%20%20%3Fontology_class%20qdp%3AP5%20qwb%3AQ2.%0A%20%20%3Finstance%20qdp%3AP5%20%3Fontology_class.%0A%20%20optional%20%7B%3Finstance%20qdp%3AP4%20%3Fbroader.%7D%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22en%2Cqu%22.%20%7D%0A%7D%20order%20by%20%3Fontology_class query]). | * Ontology classes and their instances ([https://qichwa.wikibase.cloud/query/#PREFIX%20qwb%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fentity%2F%3E%0APREFIX%20qdp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fdirect%2F%3E%0APREFIX%20qp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2F%3E%0APREFIX%20qps%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fstatement%2F%3E%0APREFIX%20qpq%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fqualifier%2F%3E%0APREFIX%20qpr%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Freference%2F%3E%0APREFIX%20qno%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fnovalue%2F%3E%0A%0Aselect%20%3Fontology_class%20%3Fontology_classLabel%20%3Finstance%20%3FinstanceLabel%20%3Fbroader%20%3FbroaderLabel%0Awhere%20%7B%0A%0A%20%20%3Fontology_class%20qdp%3AP5%20qwb%3AQ2.%0A%20%20%3Finstance%20qdp%3AP5%20%3Fontology_class.%0A%20%20optional%20%7B%3Finstance%20qdp%3AP4%20%3Fbroader.%7D%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22en%2Cqu%22.%20%7D%0A%7D%20order%20by%20%3Fontology_class query]). | ||
* Ontology properties ([[Special:ListProperties|Special:ListProperties]]) | * Ontology properties ([[Special:ListProperties|Special:ListProperties]]) | ||
* Ontological relations between items describing lexical categories ([https://qichwa.wikibase.cloud/query/#PREFIX%20qwb%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fentity%2F%3E%0APREFIX%20qdp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fdirect%2F%3E%0APREFIX%20qp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2F%3E%0APREFIX%20qps%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fstatement%2F%3E%0APREFIX%20qpq%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fqualifier%2F%3E%0APREFIX%20qpr%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Freference%2F%3E%0APREFIX%20qno%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fnovalue%2F%3E%0A%0Aselect%20%3Finstance%20%3FinstanceLabel%20%3Fbroader%20%3FbroaderLabel%0Awhere%20%7B%0A%0A%20%20%3Finstance%20qdp%3AP5%20qwb%3AQ3.%0A%20%20optional%20%7B%3Finstance%20qdp%3AP4%20%3Fbroader.%7D%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22en%2Cqu%22.%20%7D%0A%7D%20order%20by%20%3FinstanceLabel%20 query]). | * Ontological relations between items describing lexical categories ([https://qichwa.wikibase.cloud/query/#PREFIX%20qwb%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fentity%2F%3E%0APREFIX%20qdp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fdirect%2F%3E%0APREFIX%20qp%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2F%3E%0APREFIX%20qps%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fstatement%2F%3E%0APREFIX%20qpq%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fqualifier%2F%3E%0APREFIX%20qpr%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Freference%2F%3E%0APREFIX%20qno%3A%20%3Chttps%3A%2F%2Fqichwa.wikibase.cloud%2Fprop%2Fnovalue%2F%3E%0A%0Aselect%20%3Finstance%20%3FinstanceLabel%20%3Fbroader%20%3FbroaderLabel%0Awhere%20%7B%0A%0A%20%20%3Finstance%20qdp%3AP5%20qwb%3AQ3.%0A%20%20optional%20%7B%3Finstance%20qdp%3AP4%20%3Fbroader.%7D%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22en%2Cqu%22.%20%7D%0A%7D%20order%20by%20%3FinstanceLabel%20 query]). | ||
= Hosting = | |||
For hosting the Qichwabase we chose Wikibase, which allows knowledge to be represented as a semantically structured data. For instance, we rely on the SPARQL enpoint for querying and exploiting the knowledge. You can explore the queries in the [https://qichwa.wikibase.cloud/query/ SPARQL enpoint] or try out the following queries: | For hosting the Qichwabase we chose Wikibase, which allows knowledge to be represented as a semantically structured data. For instance, we rely on the SPARQL enpoint for querying and exploiting the knowledge. You can explore the queries in the [https://qichwa.wikibase.cloud/query/ SPARQL enpoint] or try out the following queries: | ||
Line 34: | Line 31: | ||
See also [[SPARQL queries|SPARQL queries page]]. | See also [[SPARQL queries|SPARQL queries page]]. | ||
= Curation = | |||
Wibibase provides a set of tools that we can use for validating the knowledge before it is entered on Qichwabase. For instance, we are defining [https://www.mediawiki.org/wiki/Extension:EntitySchema EntitySchemas] in order to create forms to be filled in. The ShEx constraints are defined on [[Project:Cradle]] and/or defined as an EntitySchema wiki page. See some EntitySchemas: | Wibibase provides a set of tools that we can use for validating the knowledge before it is entered on Qichwabase. For instance, we are defining [https://www.mediawiki.org/wiki/Extension:EntitySchema EntitySchemas] in order to create forms to be filled in. The ShEx constraints are defined on [[Project:Cradle]] and/or defined as an EntitySchema wiki page. See some EntitySchemas: | ||
* Schema of a [https://qichwa.wikibase.cloud/tools/cradle/#/subject/schema_of_a_country country] | * Schema of a [https://qichwa.wikibase.cloud/tools/cradle/#/subject/schema_of_a_country country] | ||
* Schema of an [https://qichwa.wikibase.cloud/tools/cradle/#/subject/schema_of_an_artisan artisan] | * Schema of an [https://qichwa.wikibase.cloud/tools/cradle/#/subject/schema_of_an_artisan artisan] | ||
= Origins of Qichwabase = | =Deployment= | ||
The Qichwabase is envisioned and generated as an effort to get Quechua closer to the Quechua communities, researchers, and technology developers. So, applications can be developed on top of Qichwabase, e.g., chatbots, personal assistants, games, Open Educational Resources, etc. | |||
=Origins of Qichwabase= | |||
Qichwabase is product of a miniproject worked on at '''[https://datathon2022.linkeddata.es/ SD-LLOD-22]''' in June 2022, where it was awarded the '''Best Project Prize'''. Main goal is to model [https://www.wikidata.org/wiki/Q5218 Quechua language] lexical data as Wikibase lexemes collection, for transfer to Wikidata, as soon the dataset reaches the envisaged quality. | Qichwabase is product of a miniproject worked on at '''[https://datathon2022.linkeddata.es/ SD-LLOD-22]''' in June 2022, where it was awarded the '''Best Project Prize'''. Main goal is to model [https://www.wikidata.org/wiki/Q5218 Quechua language] lexical data as Wikibase lexemes collection, for transfer to Wikidata, as soon the dataset reaches the envisaged quality. | ||
Line 47: | Line 47: | ||
* '''New: Video presentations in [https://www.youtube.com/c/ElwinHuaman/videos this Youtube channel].''' | * '''New: Video presentations in [https://www.youtube.com/c/ElwinHuaman/videos this Youtube channel].''' | ||
=References= |
Latest revision as of 17:21, 1 September 2023
Welcome to Qichwabase, a wikibase instance hosted on wikibase.cloud. It aims to be a knowledge base for the Quechua language and community. It is a collaborative project that is being developed by a team of researchers and volunteers. The main goal of Qichwabase is to model Quechua language lexical data as Wikibase lexemes collection, for transfer to Wikidata, as soon the dataset reaches the envisaged quality.
Qichwabase is a valuable resource for anyone interested in the Quechua language and knowledge. It can be used to learn about Quechua words and phrases, to find translations of Quechua words into other languages, and to explore its usefulness in various escenarios [1], such as Question Answering, Dialogue Systems, Entity linking, Knowledge Validation, and Collaborative Community.
Creation
Qichwabase is still under development, but it already contains a significant amount of knowledge, we have started modeling open Quechua lexical data from Runasimi Dictionary, and plan to include data from other sources, such as attestations and frequency information from Quechua webcorpora[2]. Currently, Qichwabase includes:
- Over 1 million triples (or statements) about Quechua words
- Information and examples about the usage of Quechua words
- Translations of Quechua words into other languages, e.g. English, German, Italian, and Spanish
Classes and Properties
The Ontology Classes and Properties listed here do not include the Ontolex core classes used by default in a Wikibase. For how lexicographical data is represented in a wikibase, see the documentation pages at Wikidata.
- Ontology classes and their instances (query).
- Ontology properties (Special:ListProperties)
- Ontological relations between items describing lexical categories (query).
Hosting
For hosting the Qichwabase we chose Wikibase, which allows knowledge to be represented as a semantically structured data. For instance, we rely on the SPARQL enpoint for querying and exploiting the knowledge. You can explore the queries in the SPARQL enpoint or try out the following queries:
- See Quechua lexemes with lemma and pos using this query.
- See lexemes with senses and multilingual sense descriptions using this query.
- See a bar chart of POS distribution (fine-grained categories) using this query.
- See a bar chart of POS distribution (broader categories) using this query.
- See lexemes that have usage examples, together with the example source references using this query.
- See lexemes that have wikidata alignment, and retrieve translation equivalents from Wikidata using this federated query.
- See lexemes that have lexical forms using this query.
- See Quechua varieties (dialects) as described in Qichwabase using this query.
- See distribution of dialectal lemma variants (bar chart): query.
See also SPARQL queries page.
Curation
Wibibase provides a set of tools that we can use for validating the knowledge before it is entered on Qichwabase. For instance, we are defining EntitySchemas in order to create forms to be filled in. The ShEx constraints are defined on Project:Cradle and/or defined as an EntitySchema wiki page. See some EntitySchemas:
Deployment
The Qichwabase is envisioned and generated as an effort to get Quechua closer to the Quechua communities, researchers, and technology developers. So, applications can be developed on top of Qichwabase, e.g., chatbots, personal assistants, games, Open Educational Resources, etc.
Origins of Qichwabase
Qichwabase is product of a miniproject worked on at SD-LLOD-22 in June 2022, where it was awarded the Best Project Prize. Main goal is to model Quechua language lexical data as Wikibase lexemes collection, for transfer to Wikidata, as soon the dataset reaches the envisaged quality.
- See source data and code at GitHub.
- See documentation at https://nexuslinguarum.github.io/SD-LLOD-22_QUECHUA/, and presentation slides.
- New: Video presentations in this Youtube channel.
References
- ↑ Getting Quechua Closer to Final Users through Knowledge Graphs, arXiv, https://arxiv.org/abs/2208.12608
- ↑ Huaman et al.: QICHWABASE: A Quechua Language and Knowledge Base for Quechua Communities, arXiv, https://arxiv.org/abs/2305.06173