服务承诺





51Due提供Essay,Paper,Report,Assignment等学科作业的代写与辅导,同时涵盖Personal Statement,转学申请等留学文书代写。




私人订制你的未来职场 世界名企,高端行业岗位等 在新的起点上实现更高水平的发展




Regimes in Babel are Confirmed--论文代写范文精选
2016-01-05 来源: 51due教员组 类别: Essay范文
尽管有这样的条件,定量研究人类语言,早已被哈佛大学教授发现,也就是今天广为人知的Zipf定律。此外,Zipf定律在语言学不仅是公认的一种方式,它已经应用在很多广泛的研究领域。下面的essay代写范文将进行详述。
Abstract
The paper introduces the presence of three statistical regimes in the Zipfian analysis of texts in quantitative linguistics: the Mandelbrot, original Zipf, and Cancho‐ Solé‐Montemurro regimes. The work is carried out over nine different languages of the same intention semantically: the bible from different languages in Indonesian ethnic and national language. As always, the same analysis is also brought in English version of the Bible for reference. The existence of the three regimes are confirmed while in advance the length of the texts are also becomes an important issue. We outline some further works regarding the quantitative analysis for parameterization used to analyze the three regimes and the task to have broad explanation, especially the microstructure of the language in human decision or linguistic effort – emerging the robustness of them.
Keywords: quantitative linguistics, Zipfian analysis, macro‐properties of texts.
Prolog
Traditional linguistic studies have not yet really celebrated the quantitative analysis, especially the statistical one. In the other hand, there have been many quantitative tools available particularly in the age of interdisciplinary studies on natural language and on how human languages has evolved in such a way we use to day. Despite such conditions, the quantitative studies to human language have begun that long since the findings of Harvard Professor G. K. Zipf (1949) widely known today as the Zipf’s Law. Furthermore, the Zipf’s Law is not only recognized well in linguistics, yet it has been stated in a lot broader studies from various disciplines. This paper reports a further observation continued from Situngkir (2007) tries to see the distinctions as well as similarities of languages widely used among Indonesian people, be it ethnic ‐ 2 - and national language. The way to see those is by using statistics in our perceptive that language can be seen as macro phenomena emerged from the socially distributed and in advance, dynamically evolved among human minds and (complex) cognitive systems.
The way human mind interpreting, processing, and decide any information elements surrounding her are somehow by using language. Language becomes one of key roles to understand human dynamics since the ability of language acquisition by human civilization are also dynamic and evolve through cultural generations (see for i instance classic work by Hill (1972) and attractive findings in Solé (2006) as well as Bergstorm, et. al., 2001). Thus, it is not at all an exaggeration to say that understanding different languages by means of their structures, use of words, as well as how to create information from them is a challenging steps to deeper understanding to how distinctive civilization and particular social systems might evolve. Indonesia is an interesting place to observe this for most Indonesian people have capability to talk in at least two languages: their ethnic language (actively or passively) and the national language, Bahasa Indonesia. We analyze the bible texts from various Indonesian ethnic languages – as translated and printed by the Lembaga Alkitab Indonesia (LAI) – Batak Toba (LAI, 1998), Angkola (LAI, 1991), Simalungun (LAI, 2000), Karo (LAI, 2000), Pakpak Dairi (LAI, 1998), Java (LAI, 1994), Sunda (LAI, 1991), and New Translation of Indonesian Bible (LAI, 1974) along with Bible Today’s English Version (American Bible Society, 1992) as references. The task of the paper is to find confirmation or any interesting features discovered from those data as we use some advanced statistical findings that have been also seen in some major English texts (see Manning et. al., 1999 and Montemurro, 2004).
The paper is structured as follows. Next section elaborates some understanding about the statistical properties in the nature of Zipfian analysis and several enhancements to find better “laws” of word occurrences in the texts of natural languages. This including the development of interestingly Tsallis’ recent statistical mechanics that generalizes thermodynamics (in the sense of non‐extensive entropy) of heuristic arguments based on the structure of fractals in symbolic sequences with long‐range correlations (see Tsallis, 2004). This description brought us to the analytical model of word occurrences that is related to different languages, both ethnic and official one used generally in Indonesia to reveal some interesting statistical properties related to its diversity, similarities, and differences. The rest of the paper discusses this issue while trying to see some possibilities of further advanced work in our general endeavor to understand sundry of cultural skins in Indonesia.
Discussions
Apparently, our examinations to the languages of our case of study conjectured that the three regimes of the texts are robust over Indonesian ethnic languages such as Angkolanese, Karonese,Tobanese, Pakpaknese, Simalungunese, Javanese, and Sundanese, as well as the national language Indonesian and our reference, English. In this sense, we do our examinations in the collections of texts and not just a single book taken from the bible since the persistence of the second power law regime does not always exist in the short texts. It is clear that the presence of the third regime over texts depends on the length of the texts. At first, we do the observation in the first five books of the bible: the Pentateuch (Genesis, Exodus, Leviticus, Numbers, and Deuteronomy). These books are traditionally written by Moses as Genesis is frequently called the first book of Moses, Exodus the second book of Moses, and so forth.
Concluding Remarks
We show the persistence of three regimes over analysis of same text but different language by using the traditional Zipfian plot. The most frequent words form the hyperbolic curve and becomes a background of the shape of fitting parameters and equations proposed by Mandelbrot. However, interesting feature is also yielded by words with highest ranks. As more specific to text words incorporated a corpus/corpora, there exists crossover from the Zipfian power law (originally discovered with exponents ൎ 1.02 become faster decay following the exponential curve. We use the nonextensive entropy statistics applied to quantitative linguistics proposed by Montemurro (2004) to see the properties of different languages, both ethnic and national language. Up to here, robustness persists.
As the size of the text grows the presence of the three regimes are somehow accentuates. Furthermore, in our analysis we found that there is a qualitatively very big gap of one of the parameters used to fit the macro‐properties over different languages. We hypothesize that this might come from the different micro‐structure of corresponding language emerging the gap in corpora. As a beginning for further analysis, we see this in the terms of the ratio between the text length and the used words that should be correlated in such a way to the different of the grammatical structure laid upon different language. However, the persistence of the observed statistical pattern also bring us to an understanding that language might built upon the similar thing over human mind using language as well as the economic motive of the least effort proposed by G. K. Zipf (1949) on the use of different words as their frequency follows the power law.
51Due网站原创范文除特殊说明外一切图文著作权归51Due所有;未经51Due官方授权谢绝任何用途转载或刊发于媒体。如发生侵犯著作权现象,51Due保留一切法律追诉权。(essay代写)
更多essay代写范文欢迎访问我们主页 www.51due.com 当然有essay代写需求可以和我们24小时在线客服 QQ:800020041 联系交流。-X(essay代写)
