Skip to content

Commit 9073327

Browse files
committed
shn Rewrite
1 parent e476054 commit 9073327

File tree

3 files changed

+114
-28
lines changed

3 files changed

+114
-28
lines changed

mymr/shn-examples.js

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,7 @@ tones:
2929
autoExpandExamples.shn = `
3030
3131
@ https://en.wiktionary.org/wiki/Appendix:Kra-Dai_Swadesh_lists
32-
@ https://en.wiktionary.org/w/index.php?title=Category:Shan_terms_with_IPA_pronunciation&pagefrom=%E1%81%BD%E1%80%BC%E1%82%83%E1%80%B8%0A%E1%81%BD%E1%80%BC%E1%82%83%E1%80%B8#mw-pages
32+
@ https://en.wiktionary.org/w/index.php?title=Category:Shan_terms_with_IPA_pronunciation
3333
3434
ငဝ်းလၢႆး||␣|
3535
ငိင်း||␣||
@@ -1056,7 +1056,7 @@ autoExpandExamples.shn = `
10561056
ၵိူဝ်းၽီ||␣||
10571057
ၵိူဝ်ၸဵမ်း||␣||
10581058
ၵိူဝ်ႉတူၺ်း||␣||
1059-
ၵိူၺ်း|friendly|kɤj˥|kóei|
1059+
ၵိူၺ်း|friendly|kɤ§j˥|kóei|
10601060
ၵိူၼ်ႇ||␣||
10611061
ၵိၵ်ႈ||␣||
10621062
ၵိၼ်|to eat|kin˨˦

mymr/shn.css

Lines changed: 13 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -44,7 +44,7 @@
4444

4545

4646
.listItem[lang] {
47-
line-height: 2;
47+
line-height: 2.5;
4848
margin-block-end: -.5em;
4949
font-family: 'Noto Serif Myanmar WF', serif;
5050
}
@@ -128,3 +128,15 @@
128128

129129

130130

131+
132+
.sequenceTable.breakdown {
133+
margin-inline-start: calc(7.5% + 5.6rem);
134+
margin-block-start: -1rem;
135+
}
136+
.sequenceTable .ex { font-size: 1.4em; }
137+
138+
.sequenceTable.breakdown td {
139+
padding-block-end: 1.2em;
140+
margin-block-start: -1rem;
141+
}
142+

mymr/shn.html

Lines changed: 99 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -47,8 +47,8 @@
4747

4848

4949

50-
<h1>Shan</h1>
51-
<div class="orthographyLine">Myanmar orthography notes</div>
50+
<h1>Myanmar (Shan)</h1>
51+
<div class="orthographyLine">orthography notes</div>
5252

5353

5454

@@ -61,7 +61,7 @@ <h2 class="notoc flush"><a id="tochead" name="tochead" tabindex="4">Contents</a>
6161
</aside>
6262

6363
<p id="status">Updated
64-
<!-- #BeginDate format:Sw1 -->27 December, 2024<!-- #EndDate -->
64+
<!-- #BeginDate format:Sw1 -->29 April, 2025<!-- #EndDate -->
6565
<span id="versionTop"></span>
6666
</p>
6767

@@ -83,7 +83,7 @@ <h2 class="notoc flush"><a id="tochead" name="tochead" tabindex="4">Contents</a>
8383

8484
<details>
8585
<summary class="instructions">Referencing this document</summary>
86-
<p class="refLine"><small>Richard Ishida, Shan (Myanmar) Orthography Notes, <!-- #BeginDate format:En2 -->27-Dec-2024<!-- #EndDate -->, <a href="https://r12a.github.io/scripts/mymr/shn">https://r12a.github.io/scripts/mymr/shn</a></small></p>
86+
<p class="refLine"><small>Richard Ishida, Myanmar (Shan) Orthography Notes, <!-- #BeginDate format:En2 -->29-Apr-2025<!-- #EndDate -->, <a href="https://r12a.github.io/scripts/mymr/shn">https://r12a.github.io/scripts/mymr/shn</a></small></p>
8787
</details>
8888

8989
<p id="usage"></p>
@@ -223,6 +223,22 @@ <h2>Basic features</h2>
223223
<p class="addToVowels">Shan is tonal, but indication of <a class="seclink" href="#tones">tones</a> in the orthography is very simple, unlike Thai. Explicit tone marks occur after each syllable, except when the first tone is applied.</p>
224224

225225
<p>Shan has <a class="seclink" href="#numbers">native digits</a>, but may also uses Myanmar or ASCII digits.</p>
226+
227+
228+
229+
230+
231+
232+
<section id="notable">
233+
<h3>Notable features</h3>
234+
<ul>
235+
<li>has an always visible vowel killer mark, but unlike Burmese, no virama or consonant stacks</li>
236+
<li>no independent vowels</li>
237+
<li>different symbols used for vowels in open and closed syllables</li>
238+
<li>tones are written using a combination of vowel and tone mark</li>
239+
<li>combining marks for medial consonants and breathy consonants</li>
240+
</ul>
241+
</section>
226242
</section>
227243

228244

@@ -915,13 +931,19 @@ <h4>Nasalisation</h4>
915931
<h3>Vowel summary table</h3>
916932

917933

918-
<p class="instructions">Dashes are used to indicate whether the character represents a vowel sound in a closed or an open syllable.</p>
934+
<p>This table summarises basic vowel to character assignments.</p>
919935

920-
<p>The following table summarises the main vowel to character assigments.</p>
921-
922-
<p class="instructions highlight">ⓘ represents the inherent vowel. Open syllable vowels appear on the left, and closed syllable vowels to the right.</p>
936+
<p class="instructions highlight">ⓘ represents the inherent vowel. Dashes are used to indicate whether the character represents a vowel sound in a closed or an open syllable.</p>
923937

924938
<table class="soundSummary">
939+
<thead>
940+
<tr>
941+
<th>&nbsp;</th>
942+
<th>open syllables</th>
943+
<th>closed syllables</th>
944+
</tr>
945+
</thead>
946+
<tbody>
925947
<tr>
926948
<th rowspan="4">Plain vowels</th>
927949
<td><figure class="summaryBox auto noexpansion" data-ignore="-" data-cols="trans,transc" data-ipa="-iː,,-ɯː,,-uː">ီ␣␣ိုဝ်␣␣ူ</figure></td>
@@ -970,6 +992,7 @@ <h3>Vowel summary table</h3>
970992
<th>Standalone carrier</th>
971993
<td colspan="2"><figure class="summaryBox auto noexpansion" data-cols="ipa,trans,transc"></figure></td>
972994
</tr>
995+
</tbody>
973996
</table>
974997

975998

@@ -996,21 +1019,38 @@ <h3>Inherent vowel</h3>
9961019

9971020
<p id="def-independentvowel" class="explanatoryintro definitionStub"></p>
9981021

999-
<p><span class="ipa">a</span> following a consonant is not written, but is seen as an inherent part of the consonant letter, so <span class="ipa">ka</span> is written by simply using the consonant letter. The first 2 syllables of the following word are inherent vowels.</p>
1022+
<p>The inherent vowel for Shan is pronounced <span class="ipa">a</span>, so <span class="ipa">ka</span> is written by simply using the consonant letter, eg.</p>
1023+
10001024
<p><span class="eg" lang="shn" dir="ltr">ႁတရႃႉ</span></p>
1025+
<table class="sequenceTable breakdown">
1026+
<tr data-notes="HA,TA,RA,AA,tone-5"
1027+
data-type="uname,uname,uname,uname,uname,"
1028+
data-prefix=",,,,"><td>ႁ␣တ␣ရ␣ႃ␣ႉ</td></tr>
1029+
</table>
10011030

10021031
<p>The inherent vowel occurs in both open and closed syllables, and is always short. The second syllable of the following example is closed with the inherent vowel.</p>
10031032
<p><span class="eg" lang="shn" dir="ltr">ၵရမ်ႇ</span></p>
10041033

1034+
1035+
1036+
1037+
1038+
1039+
1040+
<section id="novowel">
1041+
<h4>Inherent vowel suppression</h4>
1042+
1043+
10051044
<figure class="characterBox auto invisible" data-cols=""></figure>
10061045

10071046
<p>Shan uses <span class="hx">103A</span> to kill the inherent vowel after a final consonant, eg. <span class="ch">ၵ်</span> explicitly represents just the sound <span class="ipa">k</span>. It is always visible, and it never causes stacking.</p>
10081047

10091048
<p>Most closed syllables end with this character, eg.</p>
10101049
<p><span class="eg" lang="shn">ႁိၼ်</span></p>
10111050

1051+
<p>Codas written using combining marks don't have the <span class="name">asat</span>. And, unlike Burmese, there is no virama and no consonant stacks.</p>
1052+
</section>
10121053
</section>
1013-
10141054

10151055

10161056

@@ -1042,11 +1082,11 @@ <h3>Post-consonant vowels</h3>
10421082

10431083
<p>Four vowel signs are <em>spacing</em> marks, meaning that they consume horizontal space when added to a base consonant.</p>
10441084

1045-
<p>All vowel signs are typed and stored <em>after</em> the base consonant, whether or not they precede it when displayed. The glyph rendering system takes care of the glyph positioning at display time. Some input systems may allow the user to type the pre-base vowel before the base consonant, but it is still stored after.</p>
1046-
1047-
<p>An orthography that uses vowel signs is different from one that uses simple diacritics or letters for vowels, in that the vowel signs are generally attached to an <a class="termref" href="../glossary/index.html#orthographicsyllable"><em>orthographic syllable</em></a>, rather than just applied to the letter of the immediately preceding consonant. This means that pre-base vowel signs and the left glyph of circumgraphs appear before a whole consonant cluster if it is rendered as a conjunct (see <a class="secref">prebase</a>).</p>
1085+
<p class="info">All vowel signs are typed and stored <em>after</em> the base consonant, whether or not they precede it when displayed. The glyph rendering system takes care of the glyph positioning at display time. Some input systems may allow the user to type the pre-base vowel before the base consonant, but it is still stored after.</p>
10481086

1087+
<p>The set of characters used to write plain vowels, diphthongs, and composite vowel signs is as follows:</p>
10491088

1089+
<figure class="characterBox auto" data-cols="trans,transc">ၺ␣ဝ␣ိ␣ီ␣ု␣ူ␣ေ␣ဵ␣်␣ွ␣ၢ␣ႂ␣ႃ␣ႄ␣ႅ␣ႆ</figure>
10501090

10511091

10521092

@@ -1239,8 +1279,14 @@ <h4>Composite vowel signs</h4>
12391279
</p>
12401280

12411281

1242-
<p>This section lists vowel sounds represented by combinations of the above characters. There are no <a class="termref" href="../glossary/index.html#circumgraph">circumgraphs</a> in the Shan orthography, but several composite vowel signs surround the base on more than one side.</p>
1282+
<p>This section lists vowel sounds represented by combinations of the above characters. There are no <a class="termref" href="../glossary/index.html#circumgraph">circumgraphs</a> in the Shan orthography, but several composite vowel signs surround the base on more than one side. For instance, the vowel <span class="ipa">ɤj</span> is written using 4 code points (shown between angle brackets) in the following example:</p>
12431283

1284+
<p><span class="eg" lang="shn">ၵိူၺ်း</span></p>
1285+
<table class="sequenceTable breakdown">
1286+
<tr data-notes="KA,I,UU,NYA,ASAT,tone-55"
1287+
data-type="uname,uname,uname,uname,uname,uname,"
1288+
data-prefix=",<,+,+,+,>"><td>ၵ␣ိ␣ူ␣ၺ␣်␣း</td></tr>
1289+
</table>
12441290

12451291
<p>Simple vowels that require multiple code points:</p>
12461292
<figure class="characterBox auto" data-cols="ipa,trans,transc">ို␣ိုဝ်␣ူဝ်␣ိူ␣ိူဝ်␣ေႃ</figure>
@@ -1343,10 +1389,24 @@ <h4>Pre-base vowel signs</h4>
13431389
<figure class="characterBox auto small" data-cols="ipa,trans,transc">ေ␣ႄ</figure>
13441390

13451391
<p>Two vowel signs appear before the base consonant letter or cluster, eg. <span class="eg" lang="shn">မေး</span></p>
1346-
<p>These are combining marks that are always stored after the base consonant. The glyph rendering system places the glyph before the base consonant.</p>
1347-
<p>A consonant cluster is treated as a unit when it comes to vowel signs, for example in the following word the pre-base vowel sign is displayed to the left of the <span class="ipa">kj</span> cluster, although it appears after the cluster in memory
1348-
<span class="eg" lang="shn">ၵျေႃင်း</span>
1349-
</p>
1392+
1393+
<p class="info">This is a combining marks that is always stored <em>after</em> the base consonant or conjunct, ie. the code points follow the order in which the items are pronounced. The rendering process places the glyph before the base consonant without changing the order of the code points. The following shows the sequence of code points that make up the word just above.</p>
1394+
1395+
<table class="sequenceTable">
1396+
<tr data-notes="m,eː,tone-55"
1397+
data-type="ipa,ipa,ipa"
1398+
data-prefix=",,"><td>မ␣ေ␣း</td></tr>
1399+
</table>
1400+
1401+
<p>A consonant cluster is treated as a unit when it comes to vowel signs, for example in the following word the pre-base vowel sign is displayed to the left of the <span class="ipa">kj</span> cluster, although it appears after the cluster in memory.</p>
1402+
1403+
<p><span class="eg" lang="shn">ၵျေႃင်း</span></p>
1404+
<table class="sequenceTable breakdown">
1405+
<tr data-notes="k,j,ɔː,ŋ,tone-55"
1406+
data-type="ipa,ipa,ipa,ipa,ipa,"
1407+
data-prefix=",,,,,,,"><td>ၵ␣ျ␣ေႃ␣င်း</td></tr>
1408+
</table>
1409+
13501410
<p class="info">Some input methods may allow the user to type this vowel before the consonant, whereas others will expect it to be typed after, per the stored order.</p>
13511411
</section>
13521412
</section>
@@ -1368,7 +1428,7 @@ <h3>Standalone vowels</h3>
13681428
<span class="eg" lang="shn">ဢူၺ်းၵေႃႉ</span>
13691429
<span class="eg" lang="shn">သူင်ႇဢွၵ်ႇ</span>
13701430
</p>
1371-
<p>On it's own, that character represents the standalone version of the inherent vowel, <span class="ipa">?a</span>.</p>
1431+
<p>On it's own, <span class="uname">SHAN A</span> represents the standalone version of the inherent vowel, <span class="ipa">?a</span>.</p>
13721432
<p><span class="eg" lang="shn">ဢပုမ်ႇ</span>
13731433
</p>
13741434

@@ -1910,7 +1970,7 @@ <h4>Consonant length</h4>
19101970
<section id="consonantSummary">
19111971
<h3>Consonant summary table</h3>
19121972

1913-
<p>The following table summarises the main consonant to character assigments.</p>
1973+
<p>This table summarises basic consonant to character assignments.</p>
19141974

19151975
<!--p class="instructions highlight">Diacritics are added to the vowels to indicate nasalisation, which is not shown here.</p-->
19161976

@@ -1953,9 +2013,13 @@ <h3>Consonant summary table</h3>
19532013
<section id="basicC">
19542014
<h3>Basic consonants</h3>
19552015

2016+
<p>Native Shan words use the following consonants.</p>
19562017

1957-
<figure class="characterBox auto" data-cols="ipa,trans,transc">ပ␣ၽ␣တ␣ထ␣ၵ␣ၶ␣ၸ␣ၾ␣သ␣ႁ␣မ␣ၼ␣ၺ␣င␣ဝ␣ရ␣လ␣ယ</figure>
2018+
<p class="instructions">Click on each letter for more details and for examples of usage.</p>
19582019

2020+
<figure class="characterBox auto" data-cols="trans,transc"
2021+
data-ipa="p,pʰ,t,tʰ,k,kʰ,t͡ɕ,f,sʰ,h,m,n,ɲ,ŋ,w,r,l,j"
2022+
>ပ␣ၽ␣တ␣ထ␣ၵ␣ၶ␣ၸ␣ၾ␣သ␣ႁ␣မ␣ၼ␣ၺ␣င␣ဝ␣ရ␣လ␣ယ</figure>
19592023
</section>
19602024

19612025

@@ -1975,8 +2039,16 @@ <h3>Foreign sounds</h3>
19752039
<section id="onsets">
19762040
<h3>Onsets</h3>
19772041

1978-
<p class="info">Unicode has the following, dedicated combining characters for the second letter in a syllable-onset cluster. The virama should not be used with ordinary letters to produce these. Both of the first two letters appear to be used mostly for loan words.</p>
19792042
<figure class="characterBox auto" data-cols="ipa,trans,transc">ျ␣ြ␣ႂ</figure>
2043+
2044+
<p class="info">Unicode has 3 dedicated combining characters for medial consonants in a syllable-onset cluster. The virama should not be used with ordinary letters to produce these. Both of the first two characters appear to be used mostly for loan words.</p>
2045+
2046+
<p>Examples:</p>
2047+
<p><span class="eg" lang="shn">ၵျၢမ်ႇမႃႇ</span></p>
2048+
<p><span class="eg" lang="shn">သြႃႇဝုၼ်ႇ</span></p>
2049+
<p><span class="eg" lang="shn">ၵႂၢႆး</span></p>
2050+
2051+
<p>Note that <span class="ch"></span> is pronounced <span class="ipa">r</span>, unlike Burmese, which pronounces it <span class="ipa">j</span>. Note also how the width of the glyph for this character is adjusted to fit the surrounding context.</p>
19802052
</section>
19812053

19822054

@@ -2004,9 +2076,11 @@ <h3>Finals</h3>
20042076
<section id="clusters">
20052077
<h3>Consonant clusters</h3>
20062078

2007-
<p>Consonant clusters in modern Shan are not stacked, as they would be in Burmese. Closed syllables in a multi-syllable word are typically followed by the asat and (apart from the first tone) a tone mark, eg.
2008-
<span class="eg" lang="shn">ၽၵ်းၵၢတ်ႇမွၵ်ႇ</span>
2009-
</p>
2079+
<p>Consonant clusters in modern Shan are not stacked, as they would be in Burmese. Closed syllables in a multi-syllable word are typically followed by the asat and (apart from the first tone) a tone mark, eg.</p>
2080+
2081+
<p><span class="eg" lang="shn">ၽၵ်းၵၢတ်ႇမွၵ်ႇ</span></p>
2082+
2083+
<p>See also <a class="secref">onsets</a>.</p>
20102084
</section>
20112085

20122086

0 commit comments

Comments
 (0)