Wednesday, December 30, 2009

Post Tea Reading: Syntactic Relationship

Three fundamental linguistic relationships that underlie syntactic structure:
                          Argumenthood, Modification & Predication
Argumenthood: mostly centers around Verb, the nucleus of a syntactic structure. Verbs, if conceived as Predicates (as Formal Semantics puts it), needs some arguments to complete its meaning, they are the central participants in a situation. Combining verb with its arguments has a syntactic and its corresponding semantic effect. How? Verbs can have recursive function, if its transitive, then it requires 2 entities or arguments unlike an Intransitive verb; examples can be 'invite' and 'laugh'. The number of arguments a predicate can take is its Semantic Valency. And it is Semantic Valency that determines the syntactic structure of a sentence. So the ungrammaticality of sentences like 'She laughed at the stone' can be answered by fact that 'Laugh' is a one-place predicate. But the correspondence between syntactic and semantic argument is not always perfect. For example, the verb 'Eat' requires 2 arguments: eater and object of eating.
                      "Ram ate a mango" and "Ram has eaten" 
the second sentence shows a mismatch between the syntactic and semantic properties. So, Semantic arguments are participants in a situation and Syntactic arguments are constituents that appear in a particular syntactic position.

Wednesday, November 11, 2009

Is possessing a language a quintessentially human trait? Few more readings & views....

For the past 40 years, biologists, psychologists and anthropologists have been chasing Chomsky’s claim of Language as human uniqueness by looking at the capacities of animals to acquire some form of a human natural language under intensive training environments, or for animals to use their natural, species-typical vocalizations in ways that are similar to spoken language. Thus, for example, studies have focused on the capacity of human-reared apes to string symbols together to form sentences or comprehend them and of wild monkey populations to use vocalizations to refer to objects and events in the external environment. Though these studies have met with mixed success, especially as viewed from the perspective of linguists looking at such comparative data for insights into the evolution of language (the calls of these primates show some degree of learnability and of voluntary control), the minimal language of the great apes differs radically from human language.

In their article ‘The Faculty of Language: What is it, who has it and how did it evolve?’

Hauser, Chomsky and Fitch (HCF) explore the problem of language evolution in the line of distinguishing language as a communicative system and language as a system concerning abstract computational mechanism. The debate between human language and animal communication is discussed in the light of three parameters: (i) shared vs. unique characteristics of language and animal communication. Although bees dance, birds and whales sing, chimps grunt and scream, they differ from human language since they lack the expressiveness and recursive abilities of human language. 
(ii) Did language evolve gradually or through some qualitative shift? and finally,
(iii) continuity vs. exaptation (whether language evolved by gradual extension of pre-existing communication system or whether important aspects of language have been exapted away from their previous adaptive functions). The article examines the question of what is special about language, which aspects of language inherited from our ancestors have remained unchanged, what has been subjected to modifications and what is qualitatively new. HCF differentiates between two aspects of language that are special, the ‘narrow language faculty (FLN)’ and ‘the faculty of language in its entirety, the broad language faculty (FLB)’. They make the extraordinary proposal that FLN only includes recursion and the only uniquely human component of the faculty of language. 

Rule Learning by Cotton top Tamarins’ by Hauser, Weiss and Marcus examines whether the ability of rapid generalizing patterns of human infants is uniquely human. To address this problem they have presented the results of an experiment done on tamarins using a familiarization/discrimination method, which showed that tamarins were able to distinguish between novel strings of two different structures- one familiar and the other unfamiliar. This means tamarins are capable of recognizing abstract relations between variables and rules. Then, why can’t they learn language? In reply to this, the authors are of the opinion that ability to language is more than abstracting relations or patterns; it also depends on maintaining a lexicon, the ability to form semantic representation and to link them with syntactic configurations and the ability to represent hierarchical structure. Moreover, there is no evidence to date that monkeys have a theory of mind, which is crucial to language learning.

In ‘Computational Constraints on Syntactic Processing in nonhuman Primates’, Hauser & Fitch say that monkeys are unable to master phrase structure grammar. In their experiment, given exposure to instances of the patterns ABAB and ABABAB, tamarin monkeys showed increased interest in patterns AABB and AAABBB, perhaps because these contained two to four copies of the salient (because repeated) two-element sequences  AA and BB, which they had not heard before. By contrast, given exposure to instances of the patterns AABB and AAABBB, other tamarins did not show significantly increased interest in the patterns ABAB and ABABAB, perhaps because it does not involve a repetition. Given the same stimulus sequences, human subjects were able to categorize the new patterns as different, regardless of the direction of training and testing. The results of the experiment suggests that despite a clear ability to process sequential regularities in acoustics strings, tamarins are unable to process a simple phrase structure, where components of one portion of a string are related to other components some distance away. The article poses the necessity of future work on the computational limitations of non human primates in general.
More to follow...

Monday, November 9, 2009

Getting acquinted with a new concept (new for me!!)

It so happened that this weekend when I was searching for some of my old Linguistics books, got hold of my bound course work on Semiotics and Philosophy of Language. Though I used to hate this paper in my MA days (but had no option as it was a compulsory paper), just surfed through it after a long time... felt nice... Derrida, Nietzsche, Foucault... their views on life & philosophy, never interested me. But now when I am reading them... seems making sense above everything else.
One such concept I read about is HEGEMONY... somewhat synonymous with Power, but in a different sense. Just quoting few lines: Power is the central fact of the history of human experience. Power constitutes the dominant moments of our relations in society and culture. Power is defined by Max Weber as the possibility of imposing one's will upon others behavor. It can be exercised in various ways: through coercion, with others willingly submitting to it out of fear, through bargain where submission is won through rewards or conditioned power where submission is a willing choice out of conviction.
The concept of hegemony is the acceptance of power through legitimisation, tinging it with principles, values or ideology. Let me give some examples: ideologically inspired by the causes, the LTTE terrorists are commanded by their leaders to the extent of human bombs, willingly laying their lives. Culturally conditioned, an Indian wife submits to her husband's domination. SO power is all cultural, ideological power, it is not an attribute that someone possesses more than others, but a network of relations between the dominated and the dominating. Dominant ideologies put forth by the cultural institutions like society, politics, religion, and most importantly, media. They create a compatible version of reality and are made to seem so natural as 'common sense', that we donot even question the assumptions made!!

Thursday, November 5, 2009

I Wish you the strength of all Elements

Check out this SlideShare Presentation:

PARSING.....PARSING.....PARSING.......

Parsing is the most imporatnt concept when we talk about computer understnading of human language. Mostly all NLP applications such as IE IR, MT, Speech.. all benefit from Parsing. Linguistics as a desicpline has tried to define Parsing as the process of how people interpret language (or language structure!?). What is knowledge of language, where does it reside and how it is used and applied : these are the three fundamental questions of Linguistics theory. In computational terms Parsing involves defining an algorithm that maps any given sentence to its associated synatactic tree structure (is it mandatory to have a tree always??).

If we concieve Parsing in NLP as a process of transforming natural language into an internal system representation, be it in any form.. tree, brackets, graph.. anything, but ultimately the output is a syntactic structure of a given sentence. This structure can be given to a Semantic Analyzer for further interpretation of the meaning of the sentence because structure alone makes no sense like "Colorless green ideas sleep furiously". What is important is the selectional properties of each word in the utterance and their collocational relation with other words. So we can say that Syntactic Parsing is the first step in the computer processing of natural language. Another contributory part of this process is Lexicon which encodes the syntactic properties and semantic features of each word in the language. And if this lexicon can be presented in such a manner that it defines the conceptual relations among words in a formal way.. we call it Ontology. We can also have multiple ontologies that caters to Sub-categorization frames and Selectional Restrictions of any given word form (mostly of noun, verbs and adjectives). Then Parsing is left with minimal work to do if we can list all the required information (GNP features, Properties etc.) in the Lexicon itself.

What I am trying to say is that the problem with the existing computational grammar formalisms (TAG, LFG, HPSG) is they focus more on syntactic parsing. I am not even sure if  something called Semantic Parsing is acceptable or not. They are more into surface representation rather than understanding the underlying lexical information. But if we explore Generative Linguistics Theory, I feel we can utilize the principles of Minimalist Program by Noam Chomsky for computational parsing. Not because I am a student of Generative Grammar or very fond of it, but I feel Generative Grammar is the only linguistic theory that can be imporated to NLP, at least for MT and Information Extraction. For speech recognition, as experts say, Generative Phonology is not so useful. The advantage of Minimalist Program is that it assumes lexicon to be inflected with all the features and its only at the LF and PF level which decide which feature is to be retained and which to be discarded. Need to study more on this... but I have a gut feeling that its feasible. Lets see.....

Monday, November 2, 2009

Google lanuches Search by Voice for Nokia S60 phone users

Google Mobile App is equipped with speech recognition technology designed to understand Indian accents. Search by voice on mobile will now trigger a Google Search as soon as you speak your search query and give you the required results with a high accuracy rate.

Friday, October 30, 2009

XOBDO: the first online multilingual dictionary of North Eastern Languages

A voluntary effort by a group of professionals (linguists, computer scientists, literature, journalism ) from different part of the world to preserve the ethnic languages from the North Eastern part of India has led to the development of the first ever online dictionary of North Eastern languages. Initially started with Assamese, "XOBDO" (which means 'WORD' in Assamese) now supports 20 North Eastern languages and we are looking for more volunteers for all the ethnic languages. Anybody can contribute a word along with its meaning to the online dictionary. It is then put through a discussion process among the members of the Xobdo core team of 25 members who work on it online from different parts of the world. However, our aim is to bring these languages to the fore front of the IT age with more language based applications like Search Engines, Machine Translation system etc.   

Tuesday, October 20, 2009

Is possessing a language a quintessentially human trait?

Some Insight into Lack of Syntax in the Natural Communication of Non-human Primates

The dichotomy between human speaking language and others do not sums up the fascination of answering questions like- where did language come from and why do humans speak language and others do not.  A recent widely cited article "Language faculty: what is it, who has it and how did it evolve?" (Hauser et al., 2002) has argued recursion to be the single distinctive feature of human capacity for language. Does animal communication show any recursive ability? Hauser & Fitch (2004) states that a computational process responsible for the generative and hierarchical properties of narrow syntax is lacking in non-human primates. In recent past some empirical studies have been done which demonstrate some linguistic abilities in non-human primates. Some researchers also thought that apes have the capacity for language but never profited from a humanlike cultural milieu in which language was taught. Even then the most vital question still remains that non-human primates though seemingly have complex sequential learning abilities, so why did not they evolve a similarly complex communication system?
More to follow.....

Friday, October 16, 2009

Ar yezh a garan- THE LANGUAGE I LOVE

Oh how sweet and melodious
Is the language of mute creatures
More soothing to my soul
Than the rude language of people

How tranquilly, how tenderly
The trees talk to me
The gurgling at the water's edge
The heath on the mountains

The broom of the warrens
Waving in the wind
The golden sea of the moors

That tell me legends
That sing me verses

Wednesday, October 14, 2009

My first attempt on core Linguistic theory..


Intuition and Acceptability Judgment: 
How important are these criteria for explaining linguistic phenomena?

Based on William Labov’s “When Intuitions Fail” & James D MacCawley’s “Acceptability Judgments

Disciplines differ considerably in the relative emphasis they place on Data Collection versus Theory Construction. Linguistics, too, has subfields (including Psycholinguistics and Sociolinguistics) in which theories tend to be data-driven and others (notably Generative Grammar) that focus almost exclusively on the formulation of elegant theories, with little attention devoted to careful data collection. Unfortunately, the findings of the experimentalists in Linguistics very rarely play a role in the work of Generative grammarians. Rather, theory development tends to follow its own course, tested only by the unreliable and sometimes malleable intuitions of the theorists themselves. 

The need for extending the criteria for explaining linguistic phenomena arises from the Generative grammarians’ reliance on the type of data and the methodology of data collection. Generative grammarians relied on introspective intuitions of well-formedness as their primary source of data elicitation. There is no problem in the use of intuitions as evidence for theoretical claim. But what has been identified as major problems with the use of intuitions in linguistic argumentation are the way they are collected and over-reliance on this type of evidence. Intuitions can be helpful in formulating interesting hypothesis. But when it comes to theorizing linguistic principles, they do not constitute enough evidence. In the article ‘When Intuitions Fail’ Labov has identified some conditions like social pressure, pragmatic opacity, semantic suspension etc. that promote the failure of linguistic intuition.

Generative linguists appear to regard intuitions as primary source of more direct evidence for linguistic competence than other type of data. But the problem with this belief is that knowledge of this competence is not observable. They somehow escape the semantic and pragmatic dimension of language use. By leaving all contextual factors up to the imagination, the use of primary intuitions regarding sentences in isolation is arguably more subject to irrelevant interference than an experimental method that explicitly controls context. In the Generative paradigm data arises from an explicitly metalinguistic context, one in which both the investigator and the informant are thinking about language. The investigator asks question like- ‘Can you say this?’ By contrast, sociolinguists ask- ‘Do you say this?’ Data here is gathered through observation, broadly defined as opposed to data on the basis of intuition/introspection.

Another point that the Sociolinguists relates to the need for an alternative explanation for linguistic data is the approach to variability. In Chomsky’s primary concern of ‘ideal speaker-hearer in homogeneous speech community’, variability is treated as a methodological complication. The perceived periphery of variation is reduced to the distinction of ‘core grammar’ and periphery that segments off language structure from language use. One obvious distinction between treatments of variability within the Generative tradition and the Sociolinguists is that the latter makes reference to linguistic as well as social (extra linguistic) performances. 

The orientation of Sociolinguists like Labov in linguistic data is that analysis of language behavior must be based on empirical data. Empirical data represents speaker’s performance, the way they actually use language. According to them, language data should be shared with adjacent fields as Linguistic Anthropology and Conversational Analysis. Anthropological Linguists emphasizes on ethnographic methods of observation in speech communities with a primary focus on speakers as social actors rather than on abstract language pattern. In the second type of data collection (often named as Variation Studies), data have been collected in the context of conversational analysis or interviews in which the subject remains unaware that his/her linguistic usage is the focus of investigation.

In Generative grammar, grammaticality judgment using intuition is linked to competence. A speaker’s competence is the underlying ability to produce and interpret well formed sentences in a given language and to distinguish ill formed strings from the grammatical ones. The specifics of such competence are derived from eliciting intuitions of grammaticality. This link needs to be examined because the elicitation of intuitions is done via judgment of sentences which are clearly affected by performance factors. Some sentences are unacceptable while still grammatical (Colorless green ideas sleep furiously). This can be caused by semantics, processing limitations, context and many other performance factors. In these cases acceptability statements are performance data. According to Chomsky people are incapable of making judgment about grammaticality since they have no access to grammatical knowledge which is internal. The remedy can be sought in the light of intuitions and tests. In any case, performance is striped away.

Native speaker intuitions are often used as competence data. However, obtaining facts about competence is very hard, if not impossible. Data based on sampling peoples’ intuitions can be affected by misunderstanding of grammaticality and acceptability. If the linguist tries to resolve this by using his own intuitions, he/she is at the very real risk of introducing investigator’s bias into the result. Disputes about grammaticality can not be solved without reliable and comparable data. The major drawback with theories based on intuitions is that the results can be unfalsifiable, which is not the case with corpus data.

So Labov’s examination of problems in the use of intuitions either of the analyst or of the individual whose language is being studied, are quite valid. His criticism of the concept of ‘performance’ in Chomskyan terms is also convincing that performance is not only the manifestation of competence in actual language use (as Chomsky has put it), but also the effects of perception and other social factors. Therefore, in conclusion it can be said that no sources of linguistic data that linguists attempt to derive reliable information has a privileged role and need a sharing in understanding general character of grammar believing these to be affected by the social characteristics of human group.

What is It To Be Human?

Scientists generally don’t talk about love, at least not in a scientific manner. It’s seen as impossible to quantify, unpredictable, and is unlikely to survive time in a lab. Award-winning Chilean scientist Dr. Humberto Maturana is not shy of the topic, placing love at the very centre of his scientific explanation of human evolution and existence. These claims are not those of an ageing hippie or a new-age guru, but come from the recipient of the 1994 Chilean prize for Natural Sciences, and have a logic every bit as rigorous as a mathematical equation.

Dr Humberto Maturana has studied biology, medicine and anatomy, and has been Professor of Biology at the University of Chile in Santiago. He has been researching, writing, and lecturing for forty years, and though his ideas are radical and question the fundamental assumptions of Western thinking, he is being increasingly heard in scientific, educational, therapeutic and ecological circles. Some believe his thinking to be a new foundation for integrating and respecting diverse human practice. At the same time, he is insistent that he has no message, no truth, simply that, if asked, he will argue impeccably.

Dr. Maturana wants to explain humanness and how it arose simply from our being alive. But why would we want to explain humanness? Isn’t this obvious?

“Yes, it seems obvious that we understand ‘humanness’, because we live our daily life as, and in company with, human beings. Yet, for instance, to claim that human beings are fundamentally competitive animals has very different implications to claiming that human beings are fundamentally cooperative, loving animals. To claim that to be human is the same thing as being a homo sapiens, and that relationships are secondary, has very different implications to claiming that relationships of mutual respect and caring are fundamental to our becoming, and remaining, human. If we look at many of the broadly accepted ideas today, like economic rationalism, I think that it is not so obvious.”

“A scientist always explains from his experience, and is always clear about what he wants to explain. I want to explain humanness, so I am clear that I must explain the daily life of human beings, not some abstraction or principle. When I look at us in daily life, I see us flowing in language and emotions in our activity. I call this braiding of language and emotion, conversation.

“I also claim that language is not a communication tool, is not the processing of meaning, and does not take place in the brain, but takes place in our manner of interacting. I claim that language is our human manner of living together, and is a dance, or coordination, of behaviors that has become more complex. For instance, pointing is an operation in language, where we humans look in the direction of the pointing and not at the finger, while my cat, outside of language, only looks at my finger. I claim that language takes place as one behavior coordinates a second, a sort of ‘two-step’ of behaviors, and that love is central to the development of this increased complexity, and therefore to what makes us human.

“We see nowadays, wherever there is profound interference with love in the raising of children, disturbance in language development, and in the worst cases, absence of language. For example, in ‘wild’ children raised by animals.”

If love is central to our being human, then this has far-reaching implications, especially in the current rush to competition. For science, education, social practice, public policy, ethics, politics. If we are by nature loving or cooperative beings, and competition and hierarchy are cultural impositions, then they must be a negation of our humanness.

Maturana continues. “If we look at the fossil record, we see that we and chimpanzees have a common ancestor. Around six million years ago, there is a divergence which results on the one hand with modern chimpanzees and on the other hand with us. Now, chimpanzees occasionally language, and for example, some have been taught sign language. But there is a difference - they do not live in it, and we do. What else is different? Well, chimpanzees live in hierarchical relationships based in cunning, strife and competition, and such relationships do not allow for the continued intimacy that the development of language requires.

“Therefore, I argue, between six to three million years ago, beings must have arisen, our forebears, who began to live an expanded intimacy which involved active sharing of food, female-male sharing of child-rearing, frontal sex, bipedalism, and increased pleasure in caress and touch. Increased closeness could only have been maintained under an emotion of mutual respect, or a manner of interacting based in love. When this occurred, interactions became more complex, and, around three million years ago, language arose and began to be conserved as their manner of living.

“And here is the beautiful conclusion. In order for a lineage to develop which conserves language, then the fundamental way of relating must spring from the biology. In other words, we have evolved as biologically loving beings. In order for language to evolve as a manner of living, love is a precondition.

“This is not to deny what happens today, when we are in language, in terms of aggression, misery, hate. What happens for us now is that we arise and live as biologically human, but culturally more akin to apes, and that is a fundamental schism for us. But it does not negate my argument that we are biologically loving.”

We are often told to be ‘objective’, ‘un-emotional’ in our consideration of rational arguments. That if we can just be ‘Rational’, we will understand. Yet Maturana has emotions, especially the emotion of love, as underpinning rationality, and at the centre of his scientific, rational explanation of humanness. Don’t emotions deny rationality?

“No. I claim to show, based on experiments with the nervous system that we can only bring forth our realities depending on what we do, that we can make no statement about a reality that exists independently of our doing. There are therefore an infinite number of realities, or rationalities, and I say that each rationality is based in its own set of coherent experiences which we choose, emotionally, to attend to. The consequence of this is that, in living responsibly, we are aware that we live from our preferences, and that our actions flow from that. When I am in love, I can say and do different things from when I’m in distress, I live different realities. So an argument cannot be ‘the Truth’, only a coherent argument.

“I think the Western notion of a single Reality with a capital ‘R’ is our greatest difficulty. For in that understanding Reality commands us. We justify our actions according to the Truth and not according to human relationships, and emotions are nuisances to be dispensed with. And the one who tries to win an argument is always demanding the other to obey Reality, to be Rational. This leads to great misery, great justification of the actions which result in misery, and abnegation of responsibility.”

He says that in order to provide a complete explanation of humanness, he has to explain living systems, evolution, social systems, language, self-consciousness, mind, and culture. This he will do in the three-day seminar he will conduct in Melbourne in early September, and here we will simply examine some implications of his understanding, what this means for science, for education, for social practice, for politics.

He says that science is not reductionistic, that we will never explain living systems in physics or chemistry, that the propositions about Mind, Consciousness and God which come from some proponents of the new physics make no sense, are not coherent.

“Different scientists pay attention to different aspects of their experience, and this generates for them different objects. Physicists pay attention to mass, energy, velocity etc. in a coherent manner, but have no way to deal with questions about mind and consciousness. Some physicists today look for mechanisms of mind in quantum phenomena or in the ‘micro-tubule’ skeletons of cells, but this is like looking for ‘travel’ in the wheels of a car, or a picture of ‘mother’ stored somewhere in the brain. This is like trying to mix oil and water ... it is not coherent, the concepts do not mix or relate.

“Many of our difficulties come from looking in the wrong place for what we want to explain, and explanations then become inappropriate and often hugely complex. In a similar vein, I claim that language does not take place in the brain, but takes place in interactions. We will never find language in the nervous system, never find images of mother or father, never find ‘information’ stored. All we will ever find in the nervous system is nervous activity.”

“Humanness cannot be substituted. We cannot conserve competition and expect to maintain humanness. We cannot promote economic development at the cost of social cohesion and expect to maintain humanness. We cannot live our lives making human relationships secondary and expect to maintain humanness. There is no lone human, only humans in coexistence.”

Talking with Maturana is an extraordinary experience, a paradigm-shift in the broadest sense. It leaves one with a sense of unraveling of much that one takes for granted, and a sense of possibility of a different sort of world, of ‘a world in a grain of sand’, and simultaneously an understanding that is so fluid that it slips easily away. What remains is the possibility of a manner of living where certainty is replaced by trust, competition is replaced by cooperation, aggression is replaced by love, management is replaced by care and vision, control is replaced by responsibility and accountability. Where ethics is founded in human relationships, not in principles. Where humanness is our possibility, not our limitation.

“And that is the sort of world I like. That is the world I experience when I am with those I love, with friends and with family. Love is not blind, it is visionary. It makes the greatest possibilities.”

This man is a very interesting scientist, and passionately human!