Give Feedback 📑: DSFSI Resource Feedback Form
The "Mavito: South African Terminology, Lexicon, and Glossary Project" is dedicated to the comprehensive collection, meticulous cleaning, and transformative processing of South African language terminology lists, lexicons, and glossaries. This initiative is an integral part of the broader mission of the Data Science for Social Impact (DSFSI) lab/group, which aims to liberate and openly share as many language resources as possible.
The quality and accuracy of each resource are maintained by the original authors, ensuring the integrity and authenticity of the linguistic data. For any questions or clarifications regarding the content, users are encouraged to directly contact the original authors. By making these linguistic assets readily accessible, the project seeks to enhance language preservation, support linguistic research, and foster educational opportunities across South Africa's diverse linguistic landscape.
Each resource is provided on an “as is” basis, without representations, warranties or conditions of any kind, either express or implied including, without limitation, any warranties or conditions of title, non-infringement, merchantability or fitness for a particular purpose. We shall not have any liability for any form or type of damages (including without limitation lost profits), however caused and on any theory of liability, whether in contract, strict liability, or delict (including negligence or otherwise) arising in any way out of any of the resources, even if advised of the possibility of such damages. Likewise, to the full extent permitted by law, we shall not have any liability whatsoever for any mistakes in the source data or for any disputed translations.
Where any user finds technical mistakes or errors in the files, they may submit a request for fixes via Github.
Database | Description | Documentation | CSV | JSON | TBX | xlsx |
---|---|---|---|---|---|---|
OERTB | Open Resource Term Bank (OERTB) project is to support the collaborative development and dissemination of terminological resources, and thereby promoting the use of African languages in teaching and learning at higher education institutions. | TBD | data/oertb/oertb-full.csv, view on datasette | data/oertb/oertb-termbank-IATE.json, view on datasette | data/oertb/oertb-termbank-IATE.tbx | |
StatsSA | The Multilingual Statistical Terminology Project by Stats SA develops statistical terminology in South Africa's 11 official languages to enhance access to vital data for all citizens, ensuring a deeper understanding and connection to the information that affects their lives. | TBD | data/stassa/multilingual_statistical_terminology_clean.csv, view on datasette | data/statssa/multilingual_statistical_terminology_clean.json, view on datasette | data/statssa/multilingual_statistical_terminology_clean.xlsx |
This section provides the necessary information for a user to be able to run the code locally.
This repository is authored by the below team members.
Outside the PI and team lead, all other members are listed alphabetically by surname.
- Written by : Vukosi Marivate (PI)*, Fiskani Banda, Richard Lastrucci, Mohlatlego Nakeng, Kayode Olalaye, Thapelo Sindane
- Contact details : [email protected]
This is optional and provides information about which and how each of the developers contributed.
- We ask you reference individual datasets you are using as well as this project.
- For individual datasets, please refer to their READMEs in their dataset folders.
For the overall project the citation should be
@dataset{dsfsi-mavito,
date = {2023},
title = {Mavito: South African Terminology, Lexicon and Glossary Project},
url = {https://github.com/dsfsi/za-mavito/},
author = {Vukosi Marivate and Fiskani Banda and Richard Lastrucci and Matome Ledwaba and Keabetswe Madumo and Mohlatlego Nakeng and Kayode Olalaye and Thapelo Sindane and DSFSI}
}