
/sci/ - Science & Math



File: 31 KB, 544x408, sentence-structure.png
No.6305914

/sci/, please take me back to when I was never taught this and teach me the math behind grammar and sentence structure

>> No.6305930

>>>/lit/

>> No.6305939

>>>/lit/

>> No.6305942

I bet you're a filthy prescriptivist.
Fuck off to >>>/lit/

>> No.6305944

>syntax trees

Ahahaha, have fun with scrambling languages with that approach, chumps.

>> No.6305977

The easiest way to model a grammar is through a context-free grammar: https://en.wikipedia.org/wiki/Context_free_grammars

Note that this only allows for very simple grammars and usually has problems with gender/number agreement, although that's not a huge problem if you plan on using English only.

For more advanced grammars that allow for some context, I suggest that you pick up a book on Natural Language Processing (and one on automata theory, if you don't feel comfortable with regular expressions/context-free grammars).
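A context-free grammar is just a set of rewrite rules plus a lexicon. Here's a minimal sketch in Python with a toy fragment of English; all rule names and lexical entries are invented purely for illustration:

```python
# Toy CFG: each nonterminal maps to a list of productions.
# Symbols that are keys of GRAMMAR are nonterminals; anything
# else is treated as a terminal word.
GRAMMAR = {
    "S":    [["NP", "VP"]],
    "NP":   [["Det", "N"], ["Name"]],
    "VP":   [["V", "NP"]],
    "Det":  [["the"], ["a"]],
    "N":    [["dog"], ["cat"]],
    "V":    [["sees"], ["chases"]],
    "Name": [["Alice"]],
}

def parses(symbol, words):
    """Yield the number of words consumed for each way `symbol`
    can derive a prefix of `words` (naive recursive descent)."""
    for production in GRAMMAR.get(symbol, []):
        spans = [0]  # word counts consumed so far for this production
        for sym in production:
            new_spans = []
            for used in spans:
                if sym in GRAMMAR:
                    for n in parses(sym, words[used:]):
                        new_spans.append(used + n)
                elif words[used:used + 1] == [sym]:
                    new_spans.append(used + 1)
            spans = new_spans
        yield from spans

def accepts(sentence):
    """True iff the whole sentence derives from S."""
    words = sentence.split()
    return any(n == len(words) for n in parses("S", words))
```

So `accepts("the dog sees a cat")` holds while `accepts("dog the sees")` fails. Note the naive recursive descent here only works because the toy grammar has no left recursion; real parsers use chart methods like CYK or Earley.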

>> No.6306390
File: 1.34 MB, 458x296, chomsky shrug.gif

ha ha ha

>> No.6306409

>>6305914
Parse trees are good for a specific parse, but it's often possible to parse a sentence in lots of different ways (draw lots of different trees), all with different meanings. Mainly, the take-away I got out of parse trees while learning NLP is that languages are such shit that NLP is more or less impossible except through probabilistic means.
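The classic illustration is PP attachment ("I saw the man with the telescope": did the seeing use the telescope, or does the man have it?). A tiny CYK-style chart that counts distinct parse trees makes the ambiguity concrete; the grammar and lexicon below are a toy invented for illustration:

```python
from collections import defaultdict

# Binary rules in Chomsky normal form: (lhs, left child, right child).
RULES = [
    ("S",  "NP",  "VP"),
    ("VP", "V",   "NP"),
    ("VP", "VP",  "PP"),   # PP attaches to the verb phrase...
    ("NP", "NP",  "PP"),   # ...or to the noun phrase: hence ambiguity.
    ("NP", "Det", "N"),
    ("PP", "P",   "NP"),
]
LEXICON = {
    "I": ["NP"], "saw": ["V"], "the": ["Det"],
    "man": ["N"], "telescope": ["N"], "with": ["P"],
}

def count_parses(sentence):
    """CYK chart that counts distinct parse trees rooted in S."""
    words = sentence.split()
    n = len(words)
    # chart[i][j][sym] = number of trees deriving words[i:j] from sym.
    chart = [[defaultdict(int) for _ in range(n + 1)] for _ in range(n + 1)]
    for i, w in enumerate(words):
        for sym in LEXICON.get(w, []):
            chart[i][i + 1][sym] += 1
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span
            for k in range(i + 1, j):
                for lhs, left, right in RULES:
                    chart[i][j][lhs] += chart[i][k][left] * chart[k][j][right]
    return chart[0][n]["S"]
```

Here `count_parses("I saw the man with the telescope")` comes out to 2, one tree per attachment site, while "I saw the man" has a single parse.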

>> No.6306442

>>6305942
>parse tree
>prescriptivism
Do you even linguistics, bro?

>> No.6306449

Because human languages are terrible at following rules.
Seriously terrible.
Just like they're terrible at not being ambiguous.
Probably a symptom of them evolving slowly over time rather than being something that was designed exactly.

>> No.6306467

>>6306409
>Mainly, the take-away I got out of parse trees while learning NLP is that languages are such shit that NLP is more or less impossible except through probabilistic means.
You say that like probabilistic means are bad.

>> No.6306482

>>6305930
>>6305939
Fuck off back to your consciousness and 0.999... threads. Wikipedia:
>Linguistics is the scientific[1] study of language.[2]

>>6305914
>teach me the math behind grammar and sentence structure
https://en.wikipedia.org/wiki/Context_free_grammars#Formal_definitions

>> No.6306550

>>6306449
What about Esperanto?

>> No.6306562
File: 52 KB, 500x283, image.jpg

>>6306550
Which countries use it? It was an interesting idea, but in practice it became as useful as Elvish.

>> No.6306568

>>6306550
Esperanto's arguably WORSE.

http://www.xibalba.demon.co.uk/jbr/ranto/

>> No.6306587

>>6305944
The details of scrambling languages actually have a lot of interesting information to contribute to linguistics, and need to be explored more deeply. Turns out that when you get into the details of it, scrambling generally has certain restrictions, and these things can be played with to tease out really neat stuff.

>> No.6306606
File: 9 KB, 355x253, Head-subj-avm.png

>>6305944
This is just one of the reasons why HPSG's are a God-tier formalism.
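The core operation behind HPSG's attribute-value matrices (like the one in the image) is unification: two feature structures merge if their values are compatible and fail otherwise. A toy sketch, assuming untyped structures with no reentrancy (which real HPSG implementations do have); the feature names are illustrative only:

```python
def unify(a, b):
    """Unify two feature structures (nested dicts with atomic
    leaves); return the merged structure, or None on a clash.
    A toy version of the operation underlying HPSG-style
    formalisms -- real systems use typed feature structures
    with structure sharing, which this omits."""
    if isinstance(a, dict) and isinstance(b, dict):
        out = dict(a)
        for key, val in b.items():
            if key in out:
                merged = unify(out[key], val)
                if merged is None:
                    return None  # feature clash somewhere below
                out[key] = merged
            else:
                out[key] = val
        return out
    return a if a == b else None

# Subject-verb agreement as unification: compatible AGR values
# merge (and fill each other in); incompatible ones fail.
verb = {"HEAD": {"AGR": {"NUM": "sg", "PER": "3"}}}
subj = {"HEAD": {"AGR": {"NUM": "sg"}}}
```

Unifying `verb` with `subj` succeeds and the subject inherits `PER: 3`; unifying `verb` with a plural subject returns `None`, which is how the formalism rules out *"the dogs sees".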

>> No.6306611

>>6305914
It is difficult to get into modern linguistics because many of the arguments are spread out over many different articles at different times.

For contemporary Chomskyan linguistics, the reader Minimalist Syntax: Essential Readings has most of the main modern parts of generative grammar, along with The Minimalist Program itself.
Ed Stabler has written quite a bit about programming them/putting them in logic.

Heim and Kratzer is a good book for getting into semantics and a sense of how it relates to the syntax. Also the work of Montague. There are many, many theories in linguistics, but these have a lot of the technology used in "mainstream" syntax and semantics.

Aravind Joshi has been working on another computational approach, TAG. Coming from type theory and the Lambek calculus are other grammars related more directly to mathematics. These overlap with Chomsky grammars a lot. These come out of the work of Grishin, Lambek, Ajdukiewicz, Steedman, Chris Barker, Gentzen, Moortgat and others, which are in many ways related to the CS work of Curry and the Church lambda-calculus, as well as model theory and categorial logic.

These are only a handful of related theories, and there are many others. Many probabilistic theories make use of parts of some of these structures in many ways, while stripping away other aspects to be taken care of by n-grams or other kinds of "intelligent" pattern-seeking put to probabilistic parameters. Almost no theory of syntax and semantics totally avoids the tools found in what I've mentioned, though there's a lot more because it is a very new and diverse field. I am also just restricting to what you have asked, which is for syntax/semantics.

>> No.6306618

>>6306611
>minimalist program
>not a bunch of bullshit that's only taken seriously because of Chomsky's earlier work

>semantics
>not highly speculative

I bet you believe in null constituents too

>> No.6306627

>>6306618
I'm not saying what I think is or isn't speculative, or even that the minimalist program is right*, but these books have some of the earlier work, and regularly reference back to them. I recommended them because they contain a lot of current stuff in context of old stuff.

(*I don't. But it has a very interesting relationship with many other theories of grammar that I think are compelling. All of this is the beginning of what will hopefully be a much broader theory, which will probably take at least a generation or two of revision.)

The other computational theories** cover some more abstract principles, and show relationships to lots of other theories of grammar.

(**These are related to very general mathematical statements, and most elementary formalisms of grammar.)

Null constituents might be a problem of notation.

>> No.6306629

Are there any constructed languages that are structured to have almost no ambiguity?

>> No.6306641

Learn a case-system language with free word order. It's fun. Hungarian is probably the easiest, but I suggest Russian for the number of speakers.

>> No.6306680

>>6306629
http://attempto.ifi.uzh.ch/site/
http://en.wikipedia.org/wiki/Attempto_Controlled_English
>Attempto Controlled English (ACE) is a controlled natural language, i.e. a subset of standard English with a restricted syntax and a restricted semantics described by a small set of construction and interpretation rules.

>> No.6306728

>>6306627
>Null constituents might be a problem of notation..
It's a problem with conceptualisation. Syntacticians are overstepping their ground into semantics. They do it with wh-movement too.

>> No.6306984

>>6306467
It means that at best we can translate from one language to another probabilistically but we can never derive any real meaning from it.

>> No.6306999

doubleplusungood

>> No.6307006
File: 18 KB, 267x273, I don't think so tim.jpg

>>6306984
>mfw

>>6306629
http://en.wikipedia.org/wiki/Lojban

>> No.6307982

>>6305914
>please take me back to when I was never taught this
why.avi

>> No.6308169

>>6307006
CS/NLP nerds are as dogmatic about their hopeless "formalism" as us "rule-based" linguists. Give up the ghost, man, and let's go forward into this century.

>> No.6308175

>>6308169
Also, probability models are essentially "meaningless"; I don't know how you get around this. Run statistics on some very surface-distinct languages like English and Yoruba or one of the Kichaga languages and tell me how much of their semantics system you've learned.

>> No.6308791

bumpan

>> No.6310216

bump

>> No.6310223

>>6306409
You basically get a fuzzy set of correct meanings which make sense without context, then use context to narrow it down.

Then you realize that the most correct meaning won't always be correct for any finitely complex parser, which is why even humans make errors.
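That narrowing-by-context step can be sketched as a small Bayesian update over the candidate meanings: a prior over readings, a likelihood of the observed context under each reading, and a renormalization. All the numbers and category names below are hypothetical, chosen only to make the arithmetic visible:

```python
# Two candidate readings of "with the telescope": the telescope
# as instrument of seeing, or as a modifier of "the man".
# These probabilities are invented for illustration.
priors = {"instrument": 0.4, "modifier": 0.6}        # P(meaning)
likelihood = {                                        # P(context | meaning)
    "astronomy": {"instrument": 0.7, "modifier": 0.2},
    "crime":     {"instrument": 0.3, "modifier": 0.8},
}

def disambiguate(context):
    """Posterior over meanings given a context label (Bayes' rule)."""
    scores = {m: priors[m] * likelihood[context][m] for m in priors}
    total = sum(scores.values())
    return {m: s / total for m, s in scores.items()}
```

In an "astronomy" context the instrument reading wins (posterior 0.7 with these made-up numbers); in a "crime" context the modifier reading does. The point of the post stands: the argmax reading is only probably right, so any finite parser, human or machine, will sometimes pick wrong.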

>> No.6310225

>>6306550
It was deliberately designed and therefore doesn't count as a natural language.

Many NLP-parsing machines use a modified version of Esperanto as an intermediary language.

>> No.6310640

bump

>> No.6312064

>not using Combinatory categorial grammars
>2014

>> No.6312123

>>6306629
http://www.ithkuil.net/
With the added advantage of a script that looks like it was made by The Predator.

>> No.6312568

>>6312064
If CCGs are better, why are CFGs more common?

>> No.6312583

>>6306629
yes
http://en.wikipedia.org/wiki/Mathematics

>> No.6312581

>>6306611
Nobody sensible believes in Chomsky anymore, though. GG as a whole went off the deep end somewhere in the late '80s.

>> No.6312585

>>6306641
Russian isn't a word order free language.

>> No.6312761

>>6306606
Enjoy your slow-as-shit parses!

>> No.6312976

>>6306984
>It means that at best we can translate from one language to another probabilistically but we can never derive any real meaning from it.
How so? Meaning may be probabilistic too.

>> No.6313017

>>6312064
Yes! (And I'm the 'Chomskyan'). Although generalizations found in categorial logic/MTLG have been more interesting to me recently, and fix a lot of the shortcomings of CCG in the most natural way (they also fit neatly in with MGs).

>>6312568
Doesn't know about equivalence proofs.

Also, again not that I wholeheartedly believe minimalism, but MG is just an MCFG, and most of the Chomsky-bashing I've read completely mischaracterizes modern syntax. None of it is that far-fetched or even distinct from what almost all linguists use to some extent. There are Gentzen presentations of MG which emulate it completely and are nearly identical to any mainstream formalism you can cook up.

>> No.6313044

>>6312568
To be more clear, CFGs are more common because those are the straightforward scraps that trickled down to CS people. Many linguistic phenomena seem to go beyond CFGs, and 'typed' combinators as in CCG, or 'multiple' dimensions for the CFG to work over as in MCFGs, are two of the most natural ways to expand CFGs. Depending on the choice of combinators, CCGs usually push a CFG a little into the "mildly context-sensitive" region, while every MG is equivalent to some MCFG. Basically, CFGs are simpler, so CS people like them, but linguistics realized it is better to move past them than to try to brute-force them to work, which may just be impossible.
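A minimal sketch of the "typed combinators" idea: categories are either atoms or slash types, and the combinators are partial functions on categories. The tuple encoding below is invented for illustration, and only forward application and forward composition (the combinator that pushes CCG past context-freeness) are shown:

```python
# Categories: atoms are plain strings; complex categories are
# (slash, result, argument) tuples. X/Y seeks Y to its right,
# X\Y seeks Y to its left.
FWD, BWD = "/", "\\"

def apply_fwd(f, a):
    """Forward application (>):  X/Y  Y  =>  X."""
    if isinstance(f, tuple) and f[0] == FWD and f[2] == a:
        return f[1]
    return None  # categories don't fit

def compose_fwd(f, g):
    """Forward composition (>B):  X/Y  Y/Z  =>  X/Z."""
    if (isinstance(f, tuple) and f[0] == FWD and
            isinstance(g, tuple) and g[0] == FWD and f[2] == g[1]):
        return (FWD, f[1], g[2])
    return None

# A transitive verb seeks an NP to its right, then an NP to its
# left, yielding S: the category (S\NP)/NP.
TV = (FWD, (BWD, "S", "NP"), "NP")
```

Applying `TV` to an `NP` on its right yields `S\NP`, i.e. a verb phrase still waiting for its subject; composition lets two rightward-looking categories combine before either has found its argument, which is what licenses the crossed dependencies CFGs can't capture.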

>> No.6313062

>>6305944
>hasn't seen a syntax paper from the last 15 years
Scrambling, at least of the Slavic and Japanese flavors, seems to be unrestricted by any reasonable definition. In fact, the limitations come out of a very rich interaction of minimality principles (or particular mixing properties, etc.)

>> No.6313065

>>6313062
*does NOT seem to be unrestricted

>> No.6313148

>>6305914
I always had trouble understanding grammar. Am I retarded?

>> No.6314026

>>6313065
>does NOT seem to be unrestricted
I'm going to need some citation on this.

>>6313148
You are retarded even among retards. People with Williams syndrome, who average an IQ of roughly 70, have no problem speaking, and speaking fluently at that! You, sir, have no business here and I encourage you to promptly leave. Good day!

>> No.6314045

>>6306611
>Aravind Joshi has been working on another computational approach TAG. Coming from type theory and Lambek calculus are other grammars related more directly to mathematics. These overlap with Chomsky grammars a lot. These come out of the work of Grishin, Lambek, Ajdukiewicz, Steedman, Chris Barker, Gentzen, Moortgat and others, which are in many ways related to the CS work of Curry and the Church lambda-calculus, as well as model theory, and categorial logic.
Anon, no. Type theory is everywhere. Why here too?

>> No.6314047

>>6305914
>implying that isn't actually useful

>> No.6314049

>>6306984
>humans can do it
>machines can't
>humans are special

>> No.6314654

>>6314045
What's wrong with type theory?

>> No.6314666

>>6314654
Nothing, it's actually cool as fuck. I'm just waiting for the coming of typist type II.

>> No.6314813

>>6305914
... but I don't believe it.

>> No.6314816

>>6314026
Hm. I tried to dig a few up, but it turns out many of the papers which we've been reading in lab are not out yet, since it's a relatively recent research topic. The "seminal papers" which most work references are Saito 1992 and Saito 2006. There are a number of binding and quantificational (and I think some hyperraising? I don't study Japanese) restrictions. Recent work has gone into making the A/A' distinction come out of other principles, but they seem to not have the same (non-)restrictions as each other anyway.

>>6314045
>>6314654
I'm also confused. I'm not trying to convince anyone of type theory; it's a fact that it simply directly inspired some work in syntax and semantics which I listed there, seeing as OP was asking about math approaches to grammar. "MTLG" stands for multimodal type logical grammar - it straightforwardly is type logic as a grammar, and people like Oehrle and Moortgat have been working on it and relating it to the CCGs of Steedman.

>> No.6314932

>>6314813
>not believing in phrase structure grammars
>2014

>> No.6316165
File: 247 KB, 250x196, tell me more.gif

>>6314816
>we've been reading in lab are not out yet
Who's "we"? What have you been reading?


>> No.6316529

>mfw pseudoscience philistines are still trying to model natural language without any insight from the systems and evolutionary neuroscience of language

>> No.6316634

>>6316529
>pseudoscience philistines
Depends on the subfield in linguistics.

>still trying to model natural language
That's about right. Phrase structure grammars seem pretty close, but not quite perfect.

> without any insight from the systems and evolutionary neuroscience of language
0/10, apply yourself.

>> No.6316827

>>6316165
Would risk anon-iminity. I do something too specific in too small a field, and you could google me very easily. I'll say that some of it was Boskovic's more recent/inchoate work, which is very interesting. My advisor is a first generation Chomsky student (though I'm not Chomskyan, exactly) at one of the 10 major US ling schools.

>>6316529
0/10. We read 50+ pages of evolutionary science for my syntax course in week one (not a ton, but a lot considering it's not course content), not to mention how much neuro/psych even us formalists have to take (1-2 years min + colloquia).

>> No.6316916

>>6305914

shit, we had a test on this today

>> No.6316936

>>6316827
> I do something too specific in too small a field, and you could google me very easily.
Is this a may may, or do you just copy and paste the exact same thing every time?


>> No.6318996

bump