Subscribe for Newsletters and Discounts
Be the first to receive our thoughtfully written
religious articles and product discounts.
Your interests (Optional)
This will help us make recommendations and send discounts and sale information at times.
By registering, you may receive account related information, our email newsletters and product updates, no more than twice a month. Please read our Privacy Policy for details.
.
By subscribing, you will receive our email newsletters and product updates, no more than twice a month. All emails will be sent by Exotic India using the email address info@exoticindia.com.

Please read our Privacy Policy for details.
|6
Your Cart (0)
Share our website with your friends.
Email this page to a friend
Books > Language and Literature > Panini's Karaka System for Language Processing
Displaying 454 of 4529         Previous  |  NextSubscribe to our newsletter and discounts
Panini's Karaka System for Language Processing
Pages from the book
Panini's Karaka System for Language Processing
Look Inside the Book
Description
About the Book

Panini's Karaka System for Language Processing is the outcome of Research and Development (R&D) at the Doctor of Philosophy (Ph.D.) completed from Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, India under the supervision of Dr. Girish Nath Jha. This book can be broadly categorized in five sections such as Structure of Astadhyayi, Nominal Inflection Morphology, Verbal Inflectional Morphology, Panini's Karaka System and Language Processing.

About the Author

Dr. Sudhir K Mishra
Birth: 7 April 1978
Place: Poraikalan, Khetasarai, Dist- Jaunpur (UP.)
Academics: High School and Intermediate from Obra Intermediate College, Obra, Graduation and Post- Graduation from University of Allahabad in Sanskrit. Doctoral Research (Ph.D.) awarded from Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi.
Books: 1. संगणक- जनित व्यवहारिक संस्कृत-धातु रुपावली 2. अष्टाध्यायी- सूत्रपाठ 3. Artificial Intelligence and Natural Language Processing (Under Publication).
Research Papers: Four papers published in Journals, 10 papers published in different National and International Proceeding.
Contact: Applied Artificial Intelligence Group, Centre for Development of Advanced Computing (C-DAC) Pune.
Preface Panini's Karaka System for Language Processing is the outcome of Research and Development (R&D) at the Doctor of Philosophy (Ph.D.) level from Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, India during 2002 to 2007 under the supervision of Dr. Girish Nath Jha. The title of the dissertation was “Sanskrit Karaka Analyzer for Machine Translation" which was submitted in July 2007. The work was based on the formulation of Paninian kakara theory and affiliated vibhakti and karmapravacaniya theories. The system is online available on the website of the Special Centre for Sanskrit Studies, Jawaharlal Nehru University (http://sanskrit.jnu.ac.in/karaka/ analyzer.jsp).

I was research student of first batch of the Special Centre for Sanskrit Studies and decided to work on Karaka Analyzer for Machine Translation after one year course work. Panini uses the term karake (Ast. 1.4.23) to refer to what brings a thing signified by a verb to accomplishment. According to Patanjali, the term karaka is a technical term (karotiti karakam) whose etymological meaning is retained (anvarthasamjna), so that it means 'that which brings about something'. The word karaka is derived from the root word dukrn karane (do, make) with the krt suffix nvul (Ast. 3.1.133). Karaka is relationship between verb (tinanta) and other constituents of the sentence (subanta). Therefore it was primary requirement to identify and analyze nominal and verbal inflectional morphology before starting the work on karaka. But the research topic was not only passed by the competent bodies but also around more than 2 years left during the process of course work and synopsis finalization. Therefore the work on verbal inflectional morphology was started and completed of only selected 438 verb roots and the work was published in 2007 with my supervisor Prof. Girish Nath Jha. The work on nominal inflectional morphology was also started but not completed due to the pressure of time limit.

This book can be broadly categorized in five dimensions such as Structure of Astadhyayi, Nominal Infection Morphology, Verbal Inflectional Morphology, Panini' Karaka System and Language Processing. The chapter, structures of Astadhyayi covers definition of sutra, types of sutra, arrangement of sutra in Astadhyayi and technical terms in Astadhyayi along with the descriptions of other texts associated with the Astadhyayi such as siva sutra. dhatupatha, ganapatha, phitsutra, unadisutra, linganusasana etc. The Nominal Inflectional Morphology covers types of nominal inflectional morphology such as avyaya, samasa, krdanta, taddhita and stripratyayanta pada. Determination of consonants and vowels ending padas are also described in all 8 vibhakties. The Verbal Inflectional Morphology covers the arrangement of verbs, lakara- Tense and Moods, terminations, terminations in parasmaipada in all 10 classes (gana) and terminations in atmanepada in all 10 classes (gana).

Panini's karaka system describes the definition of karaka, types of karaka and formulation of each karaka rule. The formulation of vartika of Katyayana also added against relevant siura. Panini's vibhakti system describes the formulation of each vibhakti sutra along with the vartika of Katyayana. A table is also added at the end of chapter showing the mapping of possibilities of maximum vibhakties in each karaka. And at the end of this chapter a list of suira also tabled to represent Vedic texts related vibhakti siura. Panini describes karmapravacaniya samjna (It is also an adhikara) in the fourth part of the first chapter of Astadhyaya (from Ast.4.82 to Ast. 1.4.97). There are three vibhaktis (dvitlya, paacami and saptami) used in the context of karmapravacaniya. Panini makes karmapravacaniya samjna of 11 nipata within above mentioned adhikara. The formulation of karmapravacaniya related rules are described in the next chapter titled Panini's karmapravacaniya system.

Sanskrit: language processing covers the introduction of natural language processing, levels of analysis of any language, process of language processing. Tokenization, Part- of-Speech Tagging and parsing is described under the process of language processing because these are required for karaka analysis. Lexical resources required for the implementation of Karaka system is methodically mentioned in the last section of this chapter. A selective and comprehensive bibliography is also included in the last section of this book.

I am greatly indebted to Prof Girish Nath Jha, under whose guidance and supervision I was completed my research in the emerging area of multidisciplinary research called 'Computational Linguistics'. Dr. Jha provided me continuous help and encouragement. I am deeply grateful to my teacher Dr. Hari Ram Mishra for initiating me in the discipline of Sanskrit (Grammar, Literature, Linguistics and Philosophy) and helping me out whenever there was a need for it.

I extend my gratitude for the financial support I received during my research by Mahatma Gandhi Antarrashtriya Hindi Vishwavidyalaya, University Grants Commission (UGC) and Microsoft India Pvt. Ltd. (MSI), who sponsored the projects Hindi Sangrah, Online Multilingual Amarakosa and Devanagari handwriting recognition for tablet PCs respectively in which I worked. The Rashtriya Sanskrit Sansthan, New Delhi provided research scholarship to me. A special thanks to the Jawaharlal Nehru University for providing opportunity and facility for the doctoral research work.

Introduction

Sanskrit is the primary culture-bearing language of India, with a continuous production of literature in all fields of human endeavor over the course of four millennia. Preceded by a strong oral tradition of knowledge transmission, records of written Sanskrit remain in the form of inscriptions dating back to the first century B.C.E. Extant manuscripts in Sanskrit number over 30 million - one hundred times those in Greek and Latin combined - constituting the largest cultural heritage that any civilization has produced prior to the invention of the printing press. Sanskrit works include extensive epics, subtle and intricate philosophical, mathematical, and scientific treatises, and imaginative and rich literary, poetic, and dramatic texts. The primary language of the Vedic civilization, Sanskrit developed constrained by a strong grammatical tradition stemming from the fairly complete grammar composed by Panini by the fourth century B.C.E. In addition to serving as an object of study in academic institutions, the Sanskrit language persists in the recitation of hymns in daily worship and ceremonies, as the medium of instruction in centers of traditional learning, as the medium of communication in selected academic and literary journals, academic fora, and broadcasts, and as the primary language of a revivalist community near Bangalore. The language is one of the twenty-two official languages of India in which nearly fifty thousand speakers claimed fluency in the 1991 Indian census (Pawan Goyal et al,).

Panini, the grammarian marks a great divide in the long history of thinking about language which for convenience, can be divided into four phases of development after the pre-Paninian period-

Pre-Panini PeriodThe age of descriptive and enumerative work and speculation about language
Phase-IPanini(7th Century BC) to Patanjali (1st Century BC)Trimunikala: The age of composition of Astadhyayi and its critical evaluation and interpretation: the period of core theory.
Phase-IIPatanjali to Vunala Saraswati (12th Century A.D.)Vyakhya Kala: the age of extention of theory through commentaries, of exegesis, and philosophy of grammar and of language.
Phase- IIIVunala Saraswati to Nagesa Bhatta (18th Century A.D.)Prakriya kala: the age of simplified and re-ordered texts and of reconstitution of philosophy of grammer.
Phase- IVNagesa Bhatta to presentThe modern period of reconstitution of texts, of establishing Indology, including grammer, as subject of Universities teaching and research.
One of the most interesting, insightful and striking achievements of the Ancient Indian grammatical tradition is the theory of karakas and vibhaktis, i.e. an analytic device for the syntactic and semantic description of the sentence. Panini's approach anticipated some of the contemporary linguistic theories. Specifically, his karaka-vibhakti device was paralleled in the West only a few decades ago, when L. Tesniere's (1959) theory of actancy and Ch. Fillmore's (1968) Deep Case theory appeared. Moreover, Panini's attempt to hold apart forms and functions in the language analysis is even more consistent and complete than what is to be found in many modern approaches. Particularly, the term karaka refers to the semantic content (or function), more precisely the semantic role of a verbal argument, while vibhakti corresponds to the morphological form of this argument. The karakas are given some abstract semantically grounded definitions; on the other hand, morphology is considered by Panini in a purely formal way. Case-forms per se do not have any functional definition and are introduced as a means of expression of general semantic categories. The two planes of language are correlated by grammatical rules which are stated explicitly by Panini. The karaka/vibhakti distinction is what makes Panini's grammar so powerful, not only as a means of description of Sanskrit, but even as a possible framework for a cross- linguistic analysis. It is not by chance that this device has been successfully used for the description of some very different languages, including those with the ergative alignment.

1.1 Natural Language Processing

Natural Language Processing (NLP) refers to descriptions that attempt to make the computers analyze, understand and generate natural languages, enabling one to address a computer in a manner as one is addressing a human being. Natural Language Processing is both a modern computational technology and a method of investigating and evaluating claims about human language itself. Panini has become very popular in contemporary linguistics, computational and artificial intelligence.

The history of NLP generally starts in the 1950s, although work can be found from earlier periods. In 1950, Alan Turing published his famous article "Computing Machinery and Intelligence" which proposed what is now called the Turing test as a criterion of intelligence. This criterion depends on the ability of a computer program to impersonate a human in a real-time written conversation with a human judge, sufficiently well that the judge is unable to distinguish reliably - on the basis of the conversational content alone - between the program and a real human. The Georgetown experiment in 1954 involved fully automatic translation of more than sixty Russian sentences into English. The authors claimed that within three or five years, machine translation would be a solved problem. However, real progress was much slower, and after the ALPAC report in 1966, which found that ten years long research had failed to fulfill the expectations, funding for machine translation was dramatically reduced. Little further research in machine translation was conducted until the late 1980s, when the first statistical machine translation systems were developed.

Some notably successful NLP systems developed in the 1960s were SHRDLU, a natural language system working in restricted "blocks worlds" with restricted vocabularies, and ELIZA, a simulation of a Rogerian psychotherapist, written by Joseph Weizenbaum between 1964 to 1966. Using almost no information about human thought or emotion, ELIZA sometimes provided a startlingly human-like interaction. When the "patient" exceeded the very small knowledge base, ELIZA might provide a generic response, for example, responding to "My head hurts" with "Why do you say your head hurts?".

During the 70's many programmers began to write 'conceptual ontologies', which structured real-world information into computer-understandable data. Examples are MARGIE (Schank, 1975), SAM (Cullingford, 1978), PAM (Wilensky, 1978), TaleSpin (Meehan, 1976), QUALM (Lehnert, 1977), Politics (Carbonell, 1979), and Plot Units (Lehnert 1981). During this time, many chatterbots were written including PARRY, Racter, and Jabberwacky.

Up to the 1980s, most NLP systems were based on complex sets of hand-written rules. Starting in the late 1980s, however, there was a revolution in NLP with the introduction of machine learning algorithms for language processing. This was due both to the steady increase in computational power resulting from Moore's Law and the gradual lessening of the dominance of Chomskyan theories of linguistics (e.g. transformational grammar), whose theoretical underpinnings discouraged the sort of corpus linguistics that underlies the machine-learning approach to language processing. Some of the earliest-used machine learning algorithms, such as decision trees, produced systems of hard if-then rules similar to existing hand-written rules. Increasingly, however, research has focused on statistical models, which make soft, probabilistic decisions based on attaching real-valued weights to the features making up the input data. Such models are generally more robust when given unfamiliar input, especially input that contains errors (as is very common for real-world data), and produce more reliable results when integrated into a larger system comprising multiple subtasks.

Many of the notable early successes occurred in the field of machine translation, due especially to work at IBM Research, where successively more complicated statistical models were developed. These systems were able to take advantage of existing multilingual textual corpora that had been produced by the Parliament of Canada and the European Union as a result of laws calling for the translation of all governmental proceedings into all official languages of the corresponding systems of government. However, most other systems depended on corpora specifically developed for the tasks implemented by these systems, which was (and often continues to be) a major limitation in the success of these systems. As a result, a great deal of research has gone into methods of more effectively learning from limited amounts of data.

Recent research has increasingly focused on unsupervised and semi-supervised learning algorithms. Such algorithms are able to learn from data that has not been hand-annotated with the desired answers, or using a combination of annotated and non-annotated data. Generally, this task is much more difficult than supervised learning, and typically produces less accurate results for a given amount of input data. However, there is an enormous amount of non-annotated data available (including, among other things, the entire content of the World Wide Web), which can often make up for the inferior results.

As described above, modern approaches to NLP are grounded in machine learning. The paradigm of machine learning is different from that of most prior attempts at language processing. Prior implementations of language- processing tasks typically involved the direct hand coding of large sets of rules. The machine-learning paradigm calls instead for using general learning algorithms - often, although not always, grounded in statistical inference - to automatically learn such rules through the analysis of large corpora of typical real-world examples. A corpus (plural, "corpora") is a set of documents (or sometimes, individual sentences) that have been hand-annotated with the correct values to be learned."

Many different classes of machine learning algorithms have been applied to NLP tasks. In common to all of these algorithms is that they take as input a large set of "features" that are generated from the input data. As an example, for a part-of-speech tagger, typical features might be the identity of the word being processed, the identity of the words immediately to the left and right, the part-of-speech tag of the word to the left, and whether the word being considered or its immediate neighbors are content words or function words. The algorithms differ, however, in the nature of the rules generated. Some of the earliest-used algorithms, such as decision trees, produced systems of hard if-then rules similar to the systems of hand-written rules that were then common.

Contents

Prefacevii
Transliteration Keyxi
1Introduction1
2Structure of Astadhyayi20
3Nominal Inflectional Morphology56
4Verbal Inflectional Morphology118
5Panini's Karaka System171
6Panini's Vibhakti System195
7Panini's Karmapravacaniya System232
8Sanskrit: Languages Processing240
9Bibliography279

Sample Pages

















Panini's Karaka System for Language Processing

Item Code:
NAM630
Cover:
Hardcover
Edition:
2016
ISBN:
9789385539190
Language:
English
Size:
8.5 inch x 5.5 inch
Pages:
317
Other Details:
Weight of the Book: 570 gms
Price:
$45.00
Discounted:
$36.00   Shipping Free
You Save:
$9.00 (20%)
Look Inside the Book
Add to Wishlist
Send as e-card
Send as free online greeting card
Panini's Karaka System for Language Processing

Verify the characters on the left

From:
Edit     
You will be informed as and when your card is viewed. Please note that your card will be active in the system for 30 days.

Viewed 1750 times since 6th Mar, 2017
About the Book

Panini's Karaka System for Language Processing is the outcome of Research and Development (R&D) at the Doctor of Philosophy (Ph.D.) completed from Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, India under the supervision of Dr. Girish Nath Jha. This book can be broadly categorized in five sections such as Structure of Astadhyayi, Nominal Inflection Morphology, Verbal Inflectional Morphology, Panini's Karaka System and Language Processing.

About the Author

Dr. Sudhir K Mishra
Birth: 7 April 1978
Place: Poraikalan, Khetasarai, Dist- Jaunpur (UP.)
Academics: High School and Intermediate from Obra Intermediate College, Obra, Graduation and Post- Graduation from University of Allahabad in Sanskrit. Doctoral Research (Ph.D.) awarded from Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi.
Books: 1. संगणक- जनित व्यवहारिक संस्कृत-धातु रुपावली 2. अष्टाध्यायी- सूत्रपाठ 3. Artificial Intelligence and Natural Language Processing (Under Publication).
Research Papers: Four papers published in Journals, 10 papers published in different National and International Proceeding.
Contact: Applied Artificial Intelligence Group, Centre for Development of Advanced Computing (C-DAC) Pune.
Preface Panini's Karaka System for Language Processing is the outcome of Research and Development (R&D) at the Doctor of Philosophy (Ph.D.) level from Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, India during 2002 to 2007 under the supervision of Dr. Girish Nath Jha. The title of the dissertation was “Sanskrit Karaka Analyzer for Machine Translation" which was submitted in July 2007. The work was based on the formulation of Paninian kakara theory and affiliated vibhakti and karmapravacaniya theories. The system is online available on the website of the Special Centre for Sanskrit Studies, Jawaharlal Nehru University (http://sanskrit.jnu.ac.in/karaka/ analyzer.jsp).

I was research student of first batch of the Special Centre for Sanskrit Studies and decided to work on Karaka Analyzer for Machine Translation after one year course work. Panini uses the term karake (Ast. 1.4.23) to refer to what brings a thing signified by a verb to accomplishment. According to Patanjali, the term karaka is a technical term (karotiti karakam) whose etymological meaning is retained (anvarthasamjna), so that it means 'that which brings about something'. The word karaka is derived from the root word dukrn karane (do, make) with the krt suffix nvul (Ast. 3.1.133). Karaka is relationship between verb (tinanta) and other constituents of the sentence (subanta). Therefore it was primary requirement to identify and analyze nominal and verbal inflectional morphology before starting the work on karaka. But the research topic was not only passed by the competent bodies but also around more than 2 years left during the process of course work and synopsis finalization. Therefore the work on verbal inflectional morphology was started and completed of only selected 438 verb roots and the work was published in 2007 with my supervisor Prof. Girish Nath Jha. The work on nominal inflectional morphology was also started but not completed due to the pressure of time limit.

This book can be broadly categorized in five dimensions such as Structure of Astadhyayi, Nominal Infection Morphology, Verbal Inflectional Morphology, Panini' Karaka System and Language Processing. The chapter, structures of Astadhyayi covers definition of sutra, types of sutra, arrangement of sutra in Astadhyayi and technical terms in Astadhyayi along with the descriptions of other texts associated with the Astadhyayi such as siva sutra. dhatupatha, ganapatha, phitsutra, unadisutra, linganusasana etc. The Nominal Inflectional Morphology covers types of nominal inflectional morphology such as avyaya, samasa, krdanta, taddhita and stripratyayanta pada. Determination of consonants and vowels ending padas are also described in all 8 vibhakties. The Verbal Inflectional Morphology covers the arrangement of verbs, lakara- Tense and Moods, terminations, terminations in parasmaipada in all 10 classes (gana) and terminations in atmanepada in all 10 classes (gana).

Panini's karaka system describes the definition of karaka, types of karaka and formulation of each karaka rule. The formulation of vartika of Katyayana also added against relevant siura. Panini's vibhakti system describes the formulation of each vibhakti sutra along with the vartika of Katyayana. A table is also added at the end of chapter showing the mapping of possibilities of maximum vibhakties in each karaka. And at the end of this chapter a list of suira also tabled to represent Vedic texts related vibhakti siura. Panini describes karmapravacaniya samjna (It is also an adhikara) in the fourth part of the first chapter of Astadhyaya (from Ast.4.82 to Ast. 1.4.97). There are three vibhaktis (dvitlya, paacami and saptami) used in the context of karmapravacaniya. Panini makes karmapravacaniya samjna of 11 nipata within above mentioned adhikara. The formulation of karmapravacaniya related rules are described in the next chapter titled Panini's karmapravacaniya system.

Sanskrit: language processing covers the introduction of natural language processing, levels of analysis of any language, process of language processing. Tokenization, Part- of-Speech Tagging and parsing is described under the process of language processing because these are required for karaka analysis. Lexical resources required for the implementation of Karaka system is methodically mentioned in the last section of this chapter. A selective and comprehensive bibliography is also included in the last section of this book.

I am greatly indebted to Prof Girish Nath Jha, under whose guidance and supervision I was completed my research in the emerging area of multidisciplinary research called 'Computational Linguistics'. Dr. Jha provided me continuous help and encouragement. I am deeply grateful to my teacher Dr. Hari Ram Mishra for initiating me in the discipline of Sanskrit (Grammar, Literature, Linguistics and Philosophy) and helping me out whenever there was a need for it.

I extend my gratitude for the financial support I received during my research by Mahatma Gandhi Antarrashtriya Hindi Vishwavidyalaya, University Grants Commission (UGC) and Microsoft India Pvt. Ltd. (MSI), who sponsored the projects Hindi Sangrah, Online Multilingual Amarakosa and Devanagari handwriting recognition for tablet PCs respectively in which I worked. The Rashtriya Sanskrit Sansthan, New Delhi provided research scholarship to me. A special thanks to the Jawaharlal Nehru University for providing opportunity and facility for the doctoral research work.

Introduction

Sanskrit is the primary culture-bearing language of India, with a continuous production of literature in all fields of human endeavor over the course of four millennia. Preceded by a strong oral tradition of knowledge transmission, records of written Sanskrit remain in the form of inscriptions dating back to the first century B.C.E. Extant manuscripts in Sanskrit number over 30 million - one hundred times those in Greek and Latin combined - constituting the largest cultural heritage that any civilization has produced prior to the invention of the printing press. Sanskrit works include extensive epics, subtle and intricate philosophical, mathematical, and scientific treatises, and imaginative and rich literary, poetic, and dramatic texts. The primary language of the Vedic civilization, Sanskrit developed constrained by a strong grammatical tradition stemming from the fairly complete grammar composed by Panini by the fourth century B.C.E. In addition to serving as an object of study in academic institutions, the Sanskrit language persists in the recitation of hymns in daily worship and ceremonies, as the medium of instruction in centers of traditional learning, as the medium of communication in selected academic and literary journals, academic fora, and broadcasts, and as the primary language of a revivalist community near Bangalore. The language is one of the twenty-two official languages of India in which nearly fifty thousand speakers claimed fluency in the 1991 Indian census (Pawan Goyal et al,).

Panini, the grammarian marks a great divide in the long history of thinking about language which for convenience, can be divided into four phases of development after the pre-Paninian period-

Pre-Panini PeriodThe age of descriptive and enumerative work and speculation about language
Phase-IPanini(7th Century BC) to Patanjali (1st Century BC)Trimunikala: The age of composition of Astadhyayi and its critical evaluation and interpretation: the period of core theory.
Phase-IIPatanjali to Vunala Saraswati (12th Century A.D.)Vyakhya Kala: the age of extention of theory through commentaries, of exegesis, and philosophy of grammar and of language.
Phase- IIIVunala Saraswati to Nagesa Bhatta (18th Century A.D.)Prakriya kala: the age of simplified and re-ordered texts and of reconstitution of philosophy of grammer.
Phase- IVNagesa Bhatta to presentThe modern period of reconstitution of texts, of establishing Indology, including grammer, as subject of Universities teaching and research.
One of the most interesting, insightful and striking achievements of the Ancient Indian grammatical tradition is the theory of karakas and vibhaktis, i.e. an analytic device for the syntactic and semantic description of the sentence. Panini's approach anticipated some of the contemporary linguistic theories. Specifically, his karaka-vibhakti device was paralleled in the West only a few decades ago, when L. Tesniere's (1959) theory of actancy and Ch. Fillmore's (1968) Deep Case theory appeared. Moreover, Panini's attempt to hold apart forms and functions in the language analysis is even more consistent and complete than what is to be found in many modern approaches. Particularly, the term karaka refers to the semantic content (or function), more precisely the semantic role of a verbal argument, while vibhakti corresponds to the morphological form of this argument. The karakas are given some abstract semantically grounded definitions; on the other hand, morphology is considered by Panini in a purely formal way. Case-forms per se do not have any functional definition and are introduced as a means of expression of general semantic categories. The two planes of language are correlated by grammatical rules which are stated explicitly by Panini. The karaka/vibhakti distinction is what makes Panini's grammar so powerful, not only as a means of description of Sanskrit, but even as a possible framework for a cross- linguistic analysis. It is not by chance that this device has been successfully used for the description of some very different languages, including those with the ergative alignment.

1.1 Natural Language Processing

Natural Language Processing (NLP) refers to descriptions that attempt to make the computers analyze, understand and generate natural languages, enabling one to address a computer in a manner as one is addressing a human being. Natural Language Processing is both a modern computational technology and a method of investigating and evaluating claims about human language itself. Panini has become very popular in contemporary linguistics, computational and artificial intelligence.

The history of NLP generally starts in the 1950s, although work can be found from earlier periods. In 1950, Alan Turing published his famous article "Computing Machinery and Intelligence" which proposed what is now called the Turing test as a criterion of intelligence. This criterion depends on the ability of a computer program to impersonate a human in a real-time written conversation with a human judge, sufficiently well that the judge is unable to distinguish reliably - on the basis of the conversational content alone - between the program and a real human. The Georgetown experiment in 1954 involved fully automatic translation of more than sixty Russian sentences into English. The authors claimed that within three or five years, machine translation would be a solved problem. However, real progress was much slower, and after the ALPAC report in 1966, which found that ten years long research had failed to fulfill the expectations, funding for machine translation was dramatically reduced. Little further research in machine translation was conducted until the late 1980s, when the first statistical machine translation systems were developed.

Some notably successful NLP systems developed in the 1960s were SHRDLU, a natural language system working in restricted "blocks worlds" with restricted vocabularies, and ELIZA, a simulation of a Rogerian psychotherapist, written by Joseph Weizenbaum between 1964 to 1966. Using almost no information about human thought or emotion, ELIZA sometimes provided a startlingly human-like interaction. When the "patient" exceeded the very small knowledge base, ELIZA might provide a generic response, for example, responding to "My head hurts" with "Why do you say your head hurts?".

During the 70's many programmers began to write 'conceptual ontologies', which structured real-world information into computer-understandable data. Examples are MARGIE (Schank, 1975), SAM (Cullingford, 1978), PAM (Wilensky, 1978), TaleSpin (Meehan, 1976), QUALM (Lehnert, 1977), Politics (Carbonell, 1979), and Plot Units (Lehnert 1981). During this time, many chatterbots were written including PARRY, Racter, and Jabberwacky.

Up to the 1980s, most NLP systems were based on complex sets of hand-written rules. Starting in the late 1980s, however, there was a revolution in NLP with the introduction of machine learning algorithms for language processing. This was due both to the steady increase in computational power resulting from Moore's Law and the gradual lessening of the dominance of Chomskyan theories of linguistics (e.g. transformational grammar), whose theoretical underpinnings discouraged the sort of corpus linguistics that underlies the machine-learning approach to language processing. Some of the earliest-used machine learning algorithms, such as decision trees, produced systems of hard if-then rules similar to existing hand-written rules. Increasingly, however, research has focused on statistical models, which make soft, probabilistic decisions based on attaching real-valued weights to the features making up the input data. Such models are generally more robust when given unfamiliar input, especially input that contains errors (as is very common for real-world data), and produce more reliable results when integrated into a larger system comprising multiple subtasks.

Many of the notable early successes occurred in the field of machine translation, due especially to work at IBM Research, where successively more complicated statistical models were developed. These systems were able to take advantage of existing multilingual textual corpora that had been produced by the Parliament of Canada and the European Union as a result of laws calling for the translation of all governmental proceedings into all official languages of the corresponding systems of government. However, most other systems depended on corpora specifically developed for the tasks implemented by these systems, which was (and often continues to be) a major limitation in the success of these systems. As a result, a great deal of research has gone into methods of more effectively learning from limited amounts of data.

Recent research has increasingly focused on unsupervised and semi-supervised learning algorithms. Such algorithms are able to learn from data that has not been hand-annotated with the desired answers, or using a combination of annotated and non-annotated data. Generally, this task is much more difficult than supervised learning, and typically produces less accurate results for a given amount of input data. However, there is an enormous amount of non-annotated data available (including, among other things, the entire content of the World Wide Web), which can often make up for the inferior results.

As described above, modern approaches to NLP are grounded in machine learning. The paradigm of machine learning is different from that of most prior attempts at language processing. Prior implementations of language- processing tasks typically involved the direct hand coding of large sets of rules. The machine-learning paradigm calls instead for using general learning algorithms - often, although not always, grounded in statistical inference - to automatically learn such rules through the analysis of large corpora of typical real-world examples. A corpus (plural, "corpora") is a set of documents (or sometimes, individual sentences) that have been hand-annotated with the correct values to be learned."

Many different classes of machine learning algorithms have been applied to NLP tasks. In common to all of these algorithms is that they take as input a large set of "features" that are generated from the input data. As an example, for a part-of-speech tagger, typical features might be the identity of the word being processed, the identity of the words immediately to the left and right, the part-of-speech tag of the word to the left, and whether the word being considered or its immediate neighbors are content words or function words. The algorithms differ, however, in the nature of the rules generated. Some of the earliest-used algorithms, such as decision trees, produced systems of hard if-then rules similar to the systems of hand-written rules that were then common.

Contents

Prefacevii
Transliteration Keyxi
1Introduction1
2Structure of Astadhyayi20
3Nominal Inflectional Morphology56
4Verbal Inflectional Morphology118
5Panini's Karaka System171
6Panini's Vibhakti System195
7Panini's Karmapravacaniya System232
8Sanskrit: Languages Processing240
9Bibliography279

Sample Pages

















Post a Comment
 
Post Review
Post a Query
For privacy concerns, please view our Privacy Policy

Related Items

The Philosophy of Sanskrit Grammar (A Critical Study of Karaka)
Item Code: NAI061
$30.00$24.00
You save: $6.00 (20%)
SOLD
The Astadhyayi of Panini: Volume VII (2.3.1 - 2.3.73)
Item Code: IDE657
$27.50$22.00
You save: $5.50 (20%)
Add to Cart
Buy Now
The Astadhyayi of Panini (Volume 1 - Introduction to the Astadhyayi as a Grammatical Device)
Item Code: NAB524
$45.00$36.00
You save: $9.00 (20%)
Add to Cart
Buy Now
The Astadhyayi of Panini: P.4.1.1 – P.4.1.75 (With Transliteration)
Item Code: NAC848
$25.00$20.00
You save: $5.00 (20%)
Add to Cart
Buy Now
Some Theoretical Problems in Panini’s Grammar (A Rare Book)
Item Code: NAC633
$25.00$20.00
You save: $5.00 (20%)
Add to Cart
Buy Now
Some Theoretical Problems In Panini’s Grammar: A Rare Book
Item Code: NAD652
$25.00$20.00
You save: $5.00 (20%)
Add to Cart
Buy Now
The Astadhyayi of Panini (A Brief Exposition)
Item Code: NAJ958
$20.00$16.00
You save: $4.00 (20%)
Add to Cart
Buy Now
Arrangement of the Rules in Panini's Astadhyayi
by K.R. Tripathi
Hardcover (Edition: 2016)
Parimal Publication Pvt. Ltd.
Item Code: IDH555
$22.50$18.00
You save: $4.50 (20%)
Add to Cart
Buy Now
The Astadhyayi of Panini (Vol. XIV) (P.4.1.176-5.4.160) - With Roman
by S.D. Joshi & J.A.F. Roodergen
Hardcover (Edition: 2011)
Sahitya Akademi
Item Code: NAC995
$40.00$32.00
You save: $8.00 (20%)
Add to Cart
Buy Now
The Astadhyayi of Panini (Set of Six Volumes) (Transliteration and English Translation)
Item Code: NAF191
$325.00$260.00
You save: $65.00 (20%)
Add to Cart
Buy Now
Panini: His Description of Sanskrit (An Analytical study of the Astadhyayi)
Item Code: ISL68
$50.00$40.00
You save: $10.00 (20%)
Add to Cart
Buy Now
The Astadhyayi of Panini: Volume X (7.1.1 - 7.1.103)
Item Code: IDE660
$30.00$24.00
You save: $6.00 (20%)
Add to Cart
Buy Now
Pa:Ninian Linguistics
Item Code: NAJ211
$40.00$32.00
You save: $8.00 (20%)
Add to Cart
Buy Now

Testimonials

Very grateful for this service, of making this precious treasure of Haveli Sangeet for ThakurJi so easily in the US. Appreciate the fact that notation is provided.
Leena, USA.
The Bhairava painting I ordered by Sri Kailash Raj is excellent. I have been purchasing from Exotic India for well over a decade and am always beyond delighted with my extraordinary purchases and customer service. Thank you.
Marc, UK
I have been buying from Exotic India for years and am always pleased and excited to receive my packages. Thanks for the quality products.
Delia, USA
As ever, brilliant price and service.
Howard, UK.
The best and fastest service worldwide - I am in Australia and I put in a big order of books (14 items) on a Wednesday; it was sent on Friday and arrived at my doorstep early on Monday morning - amazing! All very securely packed in a very strong cardboard box. I have bought several times from Exotic India and the service is always exceptionally good. THANK YOU and NAMASTE!
Charles (Rudra)
I just wanted to say that this is I think my 3rd (big) order from you, and the last two times I received immaculate service, the books arrived well and it has been a very pleasant experience. Just wanted to say thanks for your efficient service.
Shantala, Belgium
Thank you so much EXOTIC INDIA for the wonderfull packaging!! I received my order today and it was gift wrapped with so much love and taste in a beautiful golden gift wrap and everything was neat and beautifully packed. Also my order came very fast... i am impressed! Besides selling fantastic items, you provide an exceptional customer service and i will surely purchase again from you! I am very glad and happy :) Thank you, Salma
Salma, Canada.
Artwork received today. Very pleased both with the product quality and speed of delivery. Many thanks for your help.
Carl, UK.
I wanted to let you know how happy we are with our framed pieces of Shree Durga and Shree Kali. Thank you and thank your framers for us. By the way, this month we offered a Puja and Yagna to the Ardhanarishwara murti we purchased from you last November. The Brahmin priest, Shree Vivek Godbol, who was visiting LA preformed the rites. He really loved our murti and thought it very paka. I am so happy to have found your site , it is very paka and trustworthy. Plus such great packing and quick shipping. Thanks for your service Vipin, it is a pleasure.
Gina, USA
My marble statue of Durga arrived today in perfect condition, it's such a beautiful statue. Thanks again for giving me a discount on it, I'm always very pleased with the items I order from you. You always have the best quality items.
Charles, Tennessee
TRUSTe
Language:
Currency:
All rights reserved. Copyright 2017 © Exotic India