Phonetic Characters

Source Notebook

Standard IPA characters

Details

This dataset (association of associations) consists of 204 records of characters used in transcribing human speech in the International Phonetic Alphabet (IPA).
Consonants, vowels, and other transcription marks are all listed, mostly based on the Wikipedia article "Phonetic symbols in Unicode".
The character itself is the key for each record, and there are 8 columns:
"mode""(un)voiced" for consonants; "(un)rounded" for vowels; "-" for other marks
"manner"e.g., "plosive" for consonants; "open-mid" for vowels; "-" for other marks
"place"e.g., "alveolar" for consonants; "front" for vowels; "-" for other marks
"hexadecimal"a string or list of strings representing the Unicode hexadecimal number(s)
"integer"a base-10 integer or list of integers representing the Unicode number(s)
"expression"a string containing a Wolfram Language expression of the character
"English"True/False whether the character is used in WordData[w, "PhoneticForm"]
"notes"additional information about the Unicode character

(8 columns, 204 rows)

Examples

Basic Examples (2) 

Display the data:

In[1]:=
ResourceData[\!\(\*
TagBox["\"\<Phonetic Characters\>\"",
#& ,
BoxID -> "ResourceTag-Phonetic Characters-Input",
AutoDelete->True]\)]
Out[1]=

List all the characters that WordData uses for the phonetic forms of English words:

In[2]:=
Query[Select[#English &]][ResourceData[\!\(\*
TagBox["\"\<Phonetic Characters\>\"",
#& ,
BoxID -> "ResourceTag-Phonetic Characters-Input",
AutoDelete->True]\)]] // Keys // Normal
Out[2]=

List hex codes and notes for all the voiced fricatives in the IPA:

In[3]:=
Query[Select[#mode == "voiced" && #manner == "fricative" &], {"hexadecimal", "notes"}][ResourceData[\!\(\*
TagBox["\"\<Phonetic Characters\>\"",
#& ,
BoxID -> "ResourceTag-Phonetic Characters-Input",
AutoDelete->True]\)]]
Out[3]=

Scope & Additional Elements (1) 

Make a palette for easy access to phonetic symbols:

In[4]:=
ds = ResourceData[\!\(\*
TagBox["\"\<Phonetic Characters\>\"",
#& ,
BoxID -> "ResourceTag-Phonetic Characters-Input",
AutoDelete->True]\)];
con = Query[Select[(StringContainsQ[#mode, "voiced"] &)]] @ ds;
vow = Query[Select[(StringContainsQ[#mode, "rounded"] &)]]@ds;
dia = Query[Select[(StringContainsQ[#mode, "-"] &)]]@ds;
eng = Query[Select[(#English &)]]@ds;
PaletteNotebook[
 TabView[{
   "consonants" -> Grid[
     Partition[
      KeyValueMap[
        Tooltip[
          PasteButton[
           Style[#1, 14],
           RawBoxes[#1],
           ImageSize -> {36, 36}
           ],
          Pane[
            Column[{
              Style[#1, 32], Style[#2["mode"] <> "\n" <> #2["place"] <> "\n" <> #2["manner"] <> "\nconsonant", TextAlignment -> Center]
              }, Alignment -> Center],
            {Automatic, 108}, Alignment -> Center] // Framed
          ] &,
        con
        ] // Normal,
      UpTo[9]
      ],
     Spacings -> {0, 0}],
   "vowels" -> Grid[
     Partition[
      KeyValueMap[
        Tooltip[
          PasteButton[
           Style[#1, 14],
           RawBoxes[#1],
           ImageSize -> {36, 36}
           ],
          Pane[
            Column[{
              Style[#1, 32], Style[#2["manner"] <> "\n" <> #2["place"] <> "\n" <> #2["mode"] <> "\nvowel", TextAlignment -> Center]
              }, Alignment -> Center],
            {Automatic, 108}, Alignment -> Center] // Framed
          ] &,
        vow
        ] // Normal,
      UpTo[9]
      ],
     Spacings -> {0, 0}],
   "other marks" -> Grid[
     Partition[
      KeyValueMap[
        Tooltip[
          PasteButton[
           Style[#1, 18],
           RawBoxes[#1],
           ImageSize -> {36, 36}
           ],
          Pane[
            Column[{
              Style[#1, 32], Style[StringCases[#2["notes"], RegularExpression[
                  "\"([^\"\\\\]*(?:\\\\.[^\"\\\\]*)*)\""]] <> "\n" <> If[StringContainsQ[#2["notes"], "modifier"], "standalone\nmodifier", "combining\ndiacritical"] <> "\ntranscription\nmark", TextAlignment -> Center]
              }, Alignment -> Center],
            {Automatic, 128}, Alignment -> Center] // Framed
          ] &,
        dia
        ] // Normal,
      UpTo[9]
      ],
     Spacings -> {0, 0}],
   "in WordData" -> Grid[
     Partition[
      KeyValueMap[
        Tooltip[
          PasteButton[
           Style[#1, 14],
           RawBoxes[#1],
           ImageSize -> {36, 36}
           ],
          Pane[
            Column[{
              Style[#1, 36],
              If[MemberQ[con // Normal, #2], Style[#2["mode"] <> "\n" <> #2["place"] <> "\n" <> #2["manner"] <> "\nconsonant", TextAlignment -> Center],
               If[MemberQ[vow // Normal, #2], Style[#2["manner"] <> "\n" <> #2["place"] <> "\n" <> #2["mode"] <> "\nvowel", TextAlignment -> Center], Style[StringCases[#2["notes"], RegularExpression[
                    "\"([^\"\\\\]*(?:\\\\.[^\"\\\\]*)*)\""]] <> "\n" <>
                   If[StringContainsQ[#2["notes"], "modifier"], "standalone\nmodifier", "combining\ndiacritical"] <>
                   "\ntranscription\nmark", TextAlignment -> Center]
                ]
               ]
              }, Alignment -> Center],
            {Automatic, 128}, Alignment -> Center] // Framed
          ] &,
        eng
        ] // Normal,
      UpTo[9]
      ],
     Spacings -> {0, 0}]
   }, ImageSize -> Automatic]
 ]
Out[5]=

Mark Greenberg, "Phonetic Characters" from the Wolfram Data Repository (2025)  

Data Resource History

Source Metadata

See Also

Data Downloads

Publisher Information