Langua

A suite of language tools

Back

LanguaFrequen Help

LanguaFrequen is a tool for analyzing phoneme frequencies in a given text.

Using LanguaFrequen

Text Corpus

Input a corpus of text in the Text Corpus field. This is the text that will be analyzed. The text can be phomic or phonetic, or it can simply use the language’s standard spelling system. For best results, however, each sound that should be analyzed separately will need its own grapheme or grapheme cluster. For example, in English, the letter y would need to be differentiated based on when it occurs as a consonant vs. when it occurs as a vowel. It does not matter if punctuation is removed from the corpus, as any graphemes not identified for analysis in the next step will be ignored by the tool.

Phonemes

Next, add a list of consonants and vowels accordingly to the Consonants and Vowels lists. Separate each of the segments with a forward slash (/). These segments can be single graphemes or grapheme clusters, and a cluster can contain characters that are used in shorter segments. For example, given the corpus kanto and the consonants n/t/nt, the tool will identify one occurance of nt, but no occurances of n or t. Any graphemes not identified for analysis will be ignored. In the previous example, since k was not added to the list of consonants, the tool would not count its occurance.

Allophones

Allophones can be added after a segment to indicate that multiple segments should all be counted as occurances of the same segment. Separate allophones with a comma (,). For example, given the corpus potaná, if the vowels list contained a/á/o, the tool would identify one occurance each of a, á, and o, but if the vowels list contained a,á/o, the tool would identify two occurances of a and one occurance of o.

Analysis

When ready, click the Analyze button to run the analysis. The tool will draw a graph and display a table showing the frequencies of each of the identified segments in the text corpus. Segments that never occur will be omitted from the graph and table.

Filtering

This feature is still in development.

After a text has been analyzed, the results can be filtered to show only certain segments. Choosing a filter will show the percentages in the data as it compares to only segments of that type, rather than comparing to all segments.

Currently, the results can be filtered to show only consonants or only vowels. Additional filtering options will be added in the future.

Saving and Loading Settings

Clicking the Save button will save the current settings to the browser’s local storage and generate a small .lngf text file containing the current settings that can be saved to your disk. This .lngf file can be loaded using the Open button to reload saved settings.

Acknowledgments

Much thanks should be given to Jan Strasser and the Frequentizer. LanguaFrequen was mainly built as a modernized and updated version of the Frequentizer.