How to split IPA spelling into syllables












6















First, please forgive my ignorance, I'm completely new to linguistics.



Given the IPA spelling for word, is it possible to programmatically split it into its sounds? So, for example, given the word "ingredient" and it's IPA spelling "ɪn'gridiənt", is it possible to split it into ["ɪn", "gri", "di", "ənt"]? Is there perhaps a finite list of sounds, at least in the English language, by which an IPA word can be split?



Any help appreciated










share|improve this question









New contributor




skedly is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.

























    6















    First, please forgive my ignorance, I'm completely new to linguistics.



    Given the IPA spelling for word, is it possible to programmatically split it into its sounds? So, for example, given the word "ingredient" and it's IPA spelling "ɪn'gridiənt", is it possible to split it into ["ɪn", "gri", "di", "ənt"]? Is there perhaps a finite list of sounds, at least in the English language, by which an IPA word can be split?



    Any help appreciated










    share|improve this question









    New contributor




    skedly is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
    Check out our Code of Conduct.























      6












      6








      6


      1






      First, please forgive my ignorance, I'm completely new to linguistics.



      Given the IPA spelling for word, is it possible to programmatically split it into its sounds? So, for example, given the word "ingredient" and it's IPA spelling "ɪn'gridiənt", is it possible to split it into ["ɪn", "gri", "di", "ənt"]? Is there perhaps a finite list of sounds, at least in the English language, by which an IPA word can be split?



      Any help appreciated










      share|improve this question









      New contributor




      skedly is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.












      First, please forgive my ignorance, I'm completely new to linguistics.



      Given the IPA spelling for word, is it possible to programmatically split it into its sounds? So, for example, given the word "ingredient" and it's IPA spelling "ɪn'gridiənt", is it possible to split it into ["ɪn", "gri", "di", "ənt"]? Is there perhaps a finite list of sounds, at least in the English language, by which an IPA word can be split?



      Any help appreciated







      ipa syllables phonotactics






      share|improve this question









      New contributor




      skedly is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.











      share|improve this question









      New contributor




      skedly is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      share|improve this question




      share|improve this question








      edited 5 hours ago









      Nardog

      1,1041415




      1,1041415






      New contributor




      skedly is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      asked 7 hours ago









      skedlyskedly

      311




      311




      New contributor




      skedly is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.





      New contributor





      skedly is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.






      skedly is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.






















          1 Answer
          1






          active

          oldest

          votes


















          8














          This is, in fact, possible! It's not trivial, but it is straightforward.



          Your goal seems to be to break an English word (written in phonemic IPA) into syllables. There's a bit of controversy about how useful the concept of a "syllable" is in English, and a few different theories about what exactly a "syllable" is if it does exist, but the following is pretty widely accepted and should be good for your purpose.



          First, the theory of syllable structure: every syllable looks something like ONC, where O is the onset, N is the nucleus, and C is the coda. The nucleus is a vowel(*), and always has to be there; the onset and coda are groups of consonants, and aren't required.



          Second, the maximal onset principle: we want the onset to be as long as possible. So "tube" is /tub/, but "tuba" is /tu.ba/: the /b/ goes with the second syllable, because that makes the onset bigger.



          Third, the syllable structure constraints: certain patterns of consonants aren't allowed together. This one varies by language, but in English, the onset can only be three consonants long at most, and you can't have a stop followed by a fricative in the onset, among many others. So "axle" is /ak.səl/ instead of */a.ksəl/, despite the maximal onset principle. Wikipedia has a good list of these constrains.



          So if you want an algorithm for doing this:




          • Locate all the nuclei (vowels)

          • For each nucleus, work backward, adding sounds to the onset

          • If the onset stops being valid, take a step back, then put all the rest of the sounds in the previous syllable's coda




          (*) Some analyses of English have syllabic resonants, while others treat them as /ə/ plus resonant. In this answer I'm assuming you're using the version with schwa.






          share|improve this answer
























          • What a fantastic answer. You're right, I'm trying to programatically translate a word or phrase into Aig, aka Aigy-Paigy. WordsApi provides the syllables and the IPA for a given word. I'm now considering where to insert eɪg. It might take me a while to decode and then code what you've provided but you've given me lots to go on. Thank you.

            – skedly
            7 hours ago













          • @skedly You're in luck in that case! That's even easier: find all the vowels (you can identify them on an IPA chart), then insert eɪg before each one. The only trick is figuring out whether two vowels in a row are a diphthong (should be treated as one vowel) or hiatus (should go in separate syllables). But there are only a handful of diphthongs in English, so you can just look those up in a table.

            – Draconis
            7 hours ago






          • 1





            I want to add that the "syllable structure constraints" is part, and a big part at that, of the subject called phonotactics. This Wikipedia article may itself be useful, though it doesn't look great, but mainly, I think it may be a useful keyword for further searches if you need to refine your algorithm beyond what this answer provides.

            – LjL
            5 hours ago













          • It might be noted that the maximal onset principle is problematic with compound words or continuous speech (which require semantic parsing) -- but that doesn't matter re. Aigy-Paigy which inserts adjacent to the nucleus.

            – amI
            2 hours ago











          Your Answer








          StackExchange.ready(function() {
          var channelOptions = {
          tags: "".split(" "),
          id: "312"
          };
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function() {
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled) {
          StackExchange.using("snippets", function() {
          createEditor();
          });
          }
          else {
          createEditor();
          }
          });

          function createEditor() {
          StackExchange.prepareEditor({
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: false,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: null,
          bindNavPrevention: true,
          postfix: "",
          imageUploader: {
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          },
          noCode: true, onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          });


          }
          });






          skedly is a new contributor. Be nice, and check out our Code of Conduct.










          draft saved

          draft discarded


















          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2flinguistics.stackexchange.com%2fquestions%2f30933%2fhow-to-split-ipa-spelling-into-syllables%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown

























          1 Answer
          1






          active

          oldest

          votes








          1 Answer
          1






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes









          8














          This is, in fact, possible! It's not trivial, but it is straightforward.



          Your goal seems to be to break an English word (written in phonemic IPA) into syllables. There's a bit of controversy about how useful the concept of a "syllable" is in English, and a few different theories about what exactly a "syllable" is if it does exist, but the following is pretty widely accepted and should be good for your purpose.



          First, the theory of syllable structure: every syllable looks something like ONC, where O is the onset, N is the nucleus, and C is the coda. The nucleus is a vowel(*), and always has to be there; the onset and coda are groups of consonants, and aren't required.



          Second, the maximal onset principle: we want the onset to be as long as possible. So "tube" is /tub/, but "tuba" is /tu.ba/: the /b/ goes with the second syllable, because that makes the onset bigger.



          Third, the syllable structure constraints: certain patterns of consonants aren't allowed together. This one varies by language, but in English, the onset can only be three consonants long at most, and you can't have a stop followed by a fricative in the onset, among many others. So "axle" is /ak.səl/ instead of */a.ksəl/, despite the maximal onset principle. Wikipedia has a good list of these constrains.



          So if you want an algorithm for doing this:




          • Locate all the nuclei (vowels)

          • For each nucleus, work backward, adding sounds to the onset

          • If the onset stops being valid, take a step back, then put all the rest of the sounds in the previous syllable's coda




          (*) Some analyses of English have syllabic resonants, while others treat them as /ə/ plus resonant. In this answer I'm assuming you're using the version with schwa.






          share|improve this answer
























          • What a fantastic answer. You're right, I'm trying to programatically translate a word or phrase into Aig, aka Aigy-Paigy. WordsApi provides the syllables and the IPA for a given word. I'm now considering where to insert eɪg. It might take me a while to decode and then code what you've provided but you've given me lots to go on. Thank you.

            – skedly
            7 hours ago













          • @skedly You're in luck in that case! That's even easier: find all the vowels (you can identify them on an IPA chart), then insert eɪg before each one. The only trick is figuring out whether two vowels in a row are a diphthong (should be treated as one vowel) or hiatus (should go in separate syllables). But there are only a handful of diphthongs in English, so you can just look those up in a table.

            – Draconis
            7 hours ago






          • 1





            I want to add that the "syllable structure constraints" is part, and a big part at that, of the subject called phonotactics. This Wikipedia article may itself be useful, though it doesn't look great, but mainly, I think it may be a useful keyword for further searches if you need to refine your algorithm beyond what this answer provides.

            – LjL
            5 hours ago













          • It might be noted that the maximal onset principle is problematic with compound words or continuous speech (which require semantic parsing) -- but that doesn't matter re. Aigy-Paigy which inserts adjacent to the nucleus.

            – amI
            2 hours ago
















          8














          This is, in fact, possible! It's not trivial, but it is straightforward.



          Your goal seems to be to break an English word (written in phonemic IPA) into syllables. There's a bit of controversy about how useful the concept of a "syllable" is in English, and a few different theories about what exactly a "syllable" is if it does exist, but the following is pretty widely accepted and should be good for your purpose.



          First, the theory of syllable structure: every syllable looks something like ONC, where O is the onset, N is the nucleus, and C is the coda. The nucleus is a vowel(*), and always has to be there; the onset and coda are groups of consonants, and aren't required.



          Second, the maximal onset principle: we want the onset to be as long as possible. So "tube" is /tub/, but "tuba" is /tu.ba/: the /b/ goes with the second syllable, because that makes the onset bigger.



          Third, the syllable structure constraints: certain patterns of consonants aren't allowed together. This one varies by language, but in English, the onset can only be three consonants long at most, and you can't have a stop followed by a fricative in the onset, among many others. So "axle" is /ak.səl/ instead of */a.ksəl/, despite the maximal onset principle. Wikipedia has a good list of these constrains.



          So if you want an algorithm for doing this:




          • Locate all the nuclei (vowels)

          • For each nucleus, work backward, adding sounds to the onset

          • If the onset stops being valid, take a step back, then put all the rest of the sounds in the previous syllable's coda




          (*) Some analyses of English have syllabic resonants, while others treat them as /ə/ plus resonant. In this answer I'm assuming you're using the version with schwa.






          share|improve this answer
























          • What a fantastic answer. You're right, I'm trying to programatically translate a word or phrase into Aig, aka Aigy-Paigy. WordsApi provides the syllables and the IPA for a given word. I'm now considering where to insert eɪg. It might take me a while to decode and then code what you've provided but you've given me lots to go on. Thank you.

            – skedly
            7 hours ago













          • @skedly You're in luck in that case! That's even easier: find all the vowels (you can identify them on an IPA chart), then insert eɪg before each one. The only trick is figuring out whether two vowels in a row are a diphthong (should be treated as one vowel) or hiatus (should go in separate syllables). But there are only a handful of diphthongs in English, so you can just look those up in a table.

            – Draconis
            7 hours ago






          • 1





            I want to add that the "syllable structure constraints" is part, and a big part at that, of the subject called phonotactics. This Wikipedia article may itself be useful, though it doesn't look great, but mainly, I think it may be a useful keyword for further searches if you need to refine your algorithm beyond what this answer provides.

            – LjL
            5 hours ago













          • It might be noted that the maximal onset principle is problematic with compound words or continuous speech (which require semantic parsing) -- but that doesn't matter re. Aigy-Paigy which inserts adjacent to the nucleus.

            – amI
            2 hours ago














          8












          8








          8







          This is, in fact, possible! It's not trivial, but it is straightforward.



          Your goal seems to be to break an English word (written in phonemic IPA) into syllables. There's a bit of controversy about how useful the concept of a "syllable" is in English, and a few different theories about what exactly a "syllable" is if it does exist, but the following is pretty widely accepted and should be good for your purpose.



          First, the theory of syllable structure: every syllable looks something like ONC, where O is the onset, N is the nucleus, and C is the coda. The nucleus is a vowel(*), and always has to be there; the onset and coda are groups of consonants, and aren't required.



          Second, the maximal onset principle: we want the onset to be as long as possible. So "tube" is /tub/, but "tuba" is /tu.ba/: the /b/ goes with the second syllable, because that makes the onset bigger.



          Third, the syllable structure constraints: certain patterns of consonants aren't allowed together. This one varies by language, but in English, the onset can only be three consonants long at most, and you can't have a stop followed by a fricative in the onset, among many others. So "axle" is /ak.səl/ instead of */a.ksəl/, despite the maximal onset principle. Wikipedia has a good list of these constrains.



          So if you want an algorithm for doing this:




          • Locate all the nuclei (vowels)

          • For each nucleus, work backward, adding sounds to the onset

          • If the onset stops being valid, take a step back, then put all the rest of the sounds in the previous syllable's coda




          (*) Some analyses of English have syllabic resonants, while others treat them as /ə/ plus resonant. In this answer I'm assuming you're using the version with schwa.






          share|improve this answer













          This is, in fact, possible! It's not trivial, but it is straightforward.



          Your goal seems to be to break an English word (written in phonemic IPA) into syllables. There's a bit of controversy about how useful the concept of a "syllable" is in English, and a few different theories about what exactly a "syllable" is if it does exist, but the following is pretty widely accepted and should be good for your purpose.



          First, the theory of syllable structure: every syllable looks something like ONC, where O is the onset, N is the nucleus, and C is the coda. The nucleus is a vowel(*), and always has to be there; the onset and coda are groups of consonants, and aren't required.



          Second, the maximal onset principle: we want the onset to be as long as possible. So "tube" is /tub/, but "tuba" is /tu.ba/: the /b/ goes with the second syllable, because that makes the onset bigger.



          Third, the syllable structure constraints: certain patterns of consonants aren't allowed together. This one varies by language, but in English, the onset can only be three consonants long at most, and you can't have a stop followed by a fricative in the onset, among many others. So "axle" is /ak.səl/ instead of */a.ksəl/, despite the maximal onset principle. Wikipedia has a good list of these constrains.



          So if you want an algorithm for doing this:




          • Locate all the nuclei (vowels)

          • For each nucleus, work backward, adding sounds to the onset

          • If the onset stops being valid, take a step back, then put all the rest of the sounds in the previous syllable's coda




          (*) Some analyses of English have syllabic resonants, while others treat them as /ə/ plus resonant. In this answer I'm assuming you're using the version with schwa.







          share|improve this answer












          share|improve this answer



          share|improve this answer










          answered 7 hours ago









          DraconisDraconis

          12.3k11952




          12.3k11952













          • What a fantastic answer. You're right, I'm trying to programatically translate a word or phrase into Aig, aka Aigy-Paigy. WordsApi provides the syllables and the IPA for a given word. I'm now considering where to insert eɪg. It might take me a while to decode and then code what you've provided but you've given me lots to go on. Thank you.

            – skedly
            7 hours ago













          • @skedly You're in luck in that case! That's even easier: find all the vowels (you can identify them on an IPA chart), then insert eɪg before each one. The only trick is figuring out whether two vowels in a row are a diphthong (should be treated as one vowel) or hiatus (should go in separate syllables). But there are only a handful of diphthongs in English, so you can just look those up in a table.

            – Draconis
            7 hours ago






          • 1





            I want to add that the "syllable structure constraints" is part, and a big part at that, of the subject called phonotactics. This Wikipedia article may itself be useful, though it doesn't look great, but mainly, I think it may be a useful keyword for further searches if you need to refine your algorithm beyond what this answer provides.

            – LjL
            5 hours ago













          • It might be noted that the maximal onset principle is problematic with compound words or continuous speech (which require semantic parsing) -- but that doesn't matter re. Aigy-Paigy which inserts adjacent to the nucleus.

            – amI
            2 hours ago



















          • What a fantastic answer. You're right, I'm trying to programatically translate a word or phrase into Aig, aka Aigy-Paigy. WordsApi provides the syllables and the IPA for a given word. I'm now considering where to insert eɪg. It might take me a while to decode and then code what you've provided but you've given me lots to go on. Thank you.

            – skedly
            7 hours ago













          • @skedly You're in luck in that case! That's even easier: find all the vowels (you can identify them on an IPA chart), then insert eɪg before each one. The only trick is figuring out whether two vowels in a row are a diphthong (should be treated as one vowel) or hiatus (should go in separate syllables). But there are only a handful of diphthongs in English, so you can just look those up in a table.

            – Draconis
            7 hours ago






          • 1





            I want to add that the "syllable structure constraints" is part, and a big part at that, of the subject called phonotactics. This Wikipedia article may itself be useful, though it doesn't look great, but mainly, I think it may be a useful keyword for further searches if you need to refine your algorithm beyond what this answer provides.

            – LjL
            5 hours ago













          • It might be noted that the maximal onset principle is problematic with compound words or continuous speech (which require semantic parsing) -- but that doesn't matter re. Aigy-Paigy which inserts adjacent to the nucleus.

            – amI
            2 hours ago

















          What a fantastic answer. You're right, I'm trying to programatically translate a word or phrase into Aig, aka Aigy-Paigy. WordsApi provides the syllables and the IPA for a given word. I'm now considering where to insert eɪg. It might take me a while to decode and then code what you've provided but you've given me lots to go on. Thank you.

          – skedly
          7 hours ago







          What a fantastic answer. You're right, I'm trying to programatically translate a word or phrase into Aig, aka Aigy-Paigy. WordsApi provides the syllables and the IPA for a given word. I'm now considering where to insert eɪg. It might take me a while to decode and then code what you've provided but you've given me lots to go on. Thank you.

          – skedly
          7 hours ago















          @skedly You're in luck in that case! That's even easier: find all the vowels (you can identify them on an IPA chart), then insert eɪg before each one. The only trick is figuring out whether two vowels in a row are a diphthong (should be treated as one vowel) or hiatus (should go in separate syllables). But there are only a handful of diphthongs in English, so you can just look those up in a table.

          – Draconis
          7 hours ago





          @skedly You're in luck in that case! That's even easier: find all the vowels (you can identify them on an IPA chart), then insert eɪg before each one. The only trick is figuring out whether two vowels in a row are a diphthong (should be treated as one vowel) or hiatus (should go in separate syllables). But there are only a handful of diphthongs in English, so you can just look those up in a table.

          – Draconis
          7 hours ago




          1




          1





          I want to add that the "syllable structure constraints" is part, and a big part at that, of the subject called phonotactics. This Wikipedia article may itself be useful, though it doesn't look great, but mainly, I think it may be a useful keyword for further searches if you need to refine your algorithm beyond what this answer provides.

          – LjL
          5 hours ago







          I want to add that the "syllable structure constraints" is part, and a big part at that, of the subject called phonotactics. This Wikipedia article may itself be useful, though it doesn't look great, but mainly, I think it may be a useful keyword for further searches if you need to refine your algorithm beyond what this answer provides.

          – LjL
          5 hours ago















          It might be noted that the maximal onset principle is problematic with compound words or continuous speech (which require semantic parsing) -- but that doesn't matter re. Aigy-Paigy which inserts adjacent to the nucleus.

          – amI
          2 hours ago





          It might be noted that the maximal onset principle is problematic with compound words or continuous speech (which require semantic parsing) -- but that doesn't matter re. Aigy-Paigy which inserts adjacent to the nucleus.

          – amI
          2 hours ago










          skedly is a new contributor. Be nice, and check out our Code of Conduct.










          draft saved

          draft discarded


















          skedly is a new contributor. Be nice, and check out our Code of Conduct.













          skedly is a new contributor. Be nice, and check out our Code of Conduct.












          skedly is a new contributor. Be nice, and check out our Code of Conduct.
















          Thanks for contributing an answer to Linguistics Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid



          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2flinguistics.stackexchange.com%2fquestions%2f30933%2fhow-to-split-ipa-spelling-into-syllables%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          How to make a Squid Proxy server?

          Is this a new Fibonacci Identity?

          19世紀