|
1 | 1 | # Jeli ASR & Corpus |
2 | 2 |
|
3 | | -## Overview |
| 3 | +## What is Jeli-ASR |
| 4 | +Jeli-ASR is a multidimensional package developed to empower the use of the Bambara language. It grew out of an initiative to promote the Bambara language and its cultural values. The package consists of an ASR model under ongoing development, a mini corpus of griot narrations in [audio](https://zenodo.org/record/6997806) with transcriptions in EAF, the [ELAN format](https://archive.mpi.nl/tla/elan/download), and a package tool that can output the transcriptions as raw text or JSON. |
4 | 5 |
|
5 | | -## ASR |
| 6 | +## ASR - Model |
6 | 7 | [TODO] |
7 | 8 |
|
8 | | -## Corpora |
| 9 | +## Corpus |
| 10 | +The Griots corpus is a speech corpus containing audio and the accompanying transcriptions. The [Data-Card]() describes the intent, the approach, and the details of the dataset. The tables below list the recordings and their general metadata: |
9 | 11 |
|
10 | | -### EAFs |
11 | | -### [AUDIO CORPUS](https://zenodo.org/record/6997806) |
| 12 | +### Griots Narrations |
| 13 | + |
| 14 | +| Recording ID | Theme | Dialect | Utterance Count | Spkr. Gender | |
| 15 | +|:------------:|:-----:|:-------:|:---------------:|:------------:| |
| 16 | +| griots_r1 | L'histoire d'une fille | Bamako | 980 | M | |
| 17 | +| griots_r2 | L'histoire d'un grand marabout | Ségou | 1030 | M | |
| 18 | +| griots_r3 | Les forgerons | Bamako | 805 | M | |
| 19 | +| griots_r4 | Les Noms Authentiques | Bamako | 764 | M | |
| 20 | +| griots_r5 | Les Coulibaly | Bamako | 981 | M | |
| 21 | +| griots_r6 | Les Diarra | Ségou | 1122 | M | |
| 22 | +| griots_r7 | L'histoire du roi Razaly | Bamako | 1407 | M | |
| 23 | +| griots_r8 | L'histoire des fils d'Abraham | Bamako | 1126 | F | |
| 24 | +| griots_r9 | Les ''Niamala'' hommes de caste | Bamako | 821 | M | |
| 25 | +| griots_r10 | L'éducation d'hier et d'aujourd'hui | Bamako | 1078 | F | |
| 26 | +| griots_r11 | Garba Mama | Bamako | 970 | M | |
| 27 | +| griots_r12 | La Bataille de Kaana | Bamako | 997 | M | |
| 28 | +| griots_r13 | Diokala | Bamako | 964 | M | |
| 29 | +| griots_r14 | Nos ancêtres | Malinké de Siby | 1136 | M | |
| 30 | +| griots_r15 | L'histoire d'El Hadj Oumar Tall | Bamako | 844 | M | |
| 31 | +| griots_r16 | Les Massassi du Karta 'Bɔ' | Bamako | 941 | M | |
| 32 | +| griots_r17 | Histoire de Samory | Malinké de Kangaba | 773 | M | |
| 33 | +| griots_r18 | Le griot | Malinké de Kangaba | 809 | M | |
| 34 | +| griots_r19 | La vie d'avant en milieu Bamanan | Bamako | 611 | F | |
| 35 | +| griots_r20 | Les Maabo | Ségou | 1102 | M | |
| 36 | +| griots_r21 | L'histoire de Djonkoloni | Bamako | 859 | M | |
| 37 | +| griots_r22 | Various | Malinké de Siby | 926 | F | |
| 38 | +| griots_r23 | L'histoire de Bɔ | Ségou | 1319 | M | |
| 39 | +| griots_r24 | L'éducation d'hier et d'aujourd'hui | Bamako | 942 | F | |
| 40 | +| griots_r25 | L'histoire de la jeune fille Niamakolo | Bamako | 828 | F | |
| 41 | +| griots_r26 | Hier et aujourd'hui | Bamako | 1128 | M | |
| 42 | +| griots_r27 | Les Mianka | Bamako | 1166 | M | |
| 43 | +| griots_r28 | Le mariage d'hier et d'aujourd'hui | Bamako | 810 | F | |
| 44 | +| griots_r29 | L'histoire de Dabo | Bamako | 774 | M | |
| 45 | +| griots_r30 | Les valeurs du Mali | Bamako | 968 | M | |
| 46 | +|**TOTAL**||| ***28971*** || |
| 47 | +|| |
| 48 | + |
| 49 | +### Street Interviews |
| 50 | +Alongside the griots' narrations, a smaller sample of individuals was interviewed about the importance of Bambara in technology. |
| 51 | + |
| 52 | +| Recording ID | Utt. Count | Spkr. Gender | Status | |
| 53 | +|:------------:|:-------:|:------------:|:------:| |
| 54 | +| intrvw_r1 | 55 | F | V | |
| 55 | +| intrvw_r2 | X | X | X | |
| 56 | +| intrvw_r3 | 24 | M | V | |
| 57 | +| intrvw_r4 | 25 | M | V | |
| 58 | +| intrvw_r5 | 31 | M | V | |
| 59 | +| intrvw_r6 | 20 | M | V | |
| 60 | +| intrvw_r7 | X | X | X | |
| 61 | +| intrvw_r8 | X | X | X | |
| 62 | +| intrvw_r9 | X | X | X | |
| 63 | +| intrvw_r10 | X | X | X | |
| 64 | +| intrvw_r11 | X | X | X | |
| 65 | +| intrvw_r12 | X | X | X | |
| 66 | +| intrvw_r13 | 25 | M | V | |
| 67 | +| intrvw_r14 | X | X | X | |
| 68 | +| intrvw_r15 | X | X | X | |
| 69 | +| intrvw_r16 | X | X | X | |
| 70 | +| intrvw_r17 | X | X | X | |
| 71 | +| intrvw_r18 | X | X | X | |
| 72 | +| intrvw_r19 | X | X | X | |
| 73 | +| intrvw_r20 | 17 | M | V | |
| 74 | +| intrvw_r21 | 137 | M | V | |
| 75 | +| intrvw_r22 | 142 | F | V | |
| 76 | +| **TOTAL** | ***476*** | - | - | |
| 77 | +|| |
| 78 | + |
| 79 | +### jelipkg toolkit |
| 80 | +<code>jelipkg</code> is a sub-package that serves as an entry point to the corpus. It is a Python package that lets you browse and download the corpus. The textual data can be downloaded either as raw text or as JSON. |
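
The `jelipkg` API itself is not documented in this README, so the following is not its interface. It is a minimal, independent sketch of how an EAF transcription could be turned into raw text or JSON using only the Python standard library. The function names (`read_utterances`, `to_raw_text`, `to_json`) and the sample document are hypothetical and exist only for illustration; the XML element and attribute names are the standard ones used by ELAN's EAF format.

```python
import json
import xml.etree.ElementTree as ET

# Tiny hand-written EAF document, for illustration only (not corpus data).
SAMPLE_EAF = """<ANNOTATION_DOCUMENT>
  <TIME_ORDER>
    <TIME_SLOT TIME_SLOT_ID="ts1" TIME_VALUE="0"/>
    <TIME_SLOT TIME_SLOT_ID="ts2" TIME_VALUE="1500"/>
    <TIME_SLOT TIME_SLOT_ID="ts3" TIME_VALUE="3200"/>
  </TIME_ORDER>
  <TIER TIER_ID="default" LINGUISTIC_TYPE_REF="default-lt">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a1" TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>first utterance</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a2" TIME_SLOT_REF1="ts2" TIME_SLOT_REF2="ts3">
        <ANNOTATION_VALUE>second utterance</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
</ANNOTATION_DOCUMENT>
"""


def read_utterances(eaf_xml):
    """Return the time-aligned annotations of an EAF document as a list of dicts."""
    root = ET.fromstring(eaf_xml)
    # Map time-slot IDs to millisecond offsets; unaligned slots carry no TIME_VALUE.
    slots = {
        ts.get("TIME_SLOT_ID"): int(ts.get("TIME_VALUE"))
        for ts in root.iter("TIME_SLOT")
        if ts.get("TIME_VALUE") is not None
    }
    utterances = []
    for tier in root.iter("TIER"):
        for ann in tier.iter("ALIGNABLE_ANNOTATION"):
            utterances.append({
                "tier": tier.get("TIER_ID"),
                "start_ms": slots.get(ann.get("TIME_SLOT_REF1")),
                "end_ms": slots.get(ann.get("TIME_SLOT_REF2")),
                "text": ann.findtext("ANNOTATION_VALUE", default="").strip(),
            })
    return utterances


def to_raw_text(utterances):
    """One utterance per line."""
    return "\n".join(u["text"] for u in utterances)


def to_json(utterances):
    """JSON string; ensure_ascii=False keeps Bambara characters such as ɔ readable."""
    return json.dumps(utterances, ensure_ascii=False, indent=2)
```

To process a file from the corpus, read it as UTF-8 text and pass the contents to `read_utterances`, then call `to_raw_text` or `to_json` on the result.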
| 81 | + |
| 82 | +#### Installation |
| 83 | +#### Quickstart |
| 84 | +#### Documentation |
| 85 | + |
| 86 | +**IMPORTANT**: Because of the size of the dataset, it is recommended to download one recording or interview at a time if your network is unreliable. |
| 87 | + |
| 88 | +## Contact & People |
| 89 | +**Principal Investigator**: Michael Leventhal, `mleventhal <at> robotsmali.org` |
| 90 | +**Manager**: Sebastien Diarra, `sdiarra <at> robotsmali.org` |
| 91 | +**Inquiries & Collaboration**: `research <at> robotsmali.org` |
| 92 | + |
| 93 | +## Reference |
12 | 94 |
|
13 | | -## Contact |
14 | 95 | ## License |
15 | 96 | This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ or send a letter to Creative Commons, PO Box 1866, Mountain View, CA 94042, USA. |