forked from delph-in/zhong
-
Notifications
You must be signed in to change notification settings - Fork 0
/
zhong-letypes.tdl
1651 lines (1149 loc) · 54.3 KB
/
zhong-letypes.tdl
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
;;; -*- Mode: TDL; Coding: utf-8 -*-
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
;;; Lexical Entry Types of the Chinese Languages
;;; First created: Fri Oct 31 2014
;;; By Sanghoun Song (SSH) and Zhenzhen Fan (ZZF)
;;;
;;; Edited by: Luis Morgado da Costa (LMC)
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
;============================================================================
; We would expect that all types inherit from a single type in the grammar.
; Possibly nonque-item and similar should be moved up the hierarchy.
; Folllowing the ERG, this file can have features added to types, but not
; multiple inheritance ... (try to email DAN+developers to know why).
;
; LKB conception there is a layer 'pre terminals' single inheritance guys
; in Ann's implementation -- this was done more efficiencly if it was a
; separate layer (only for types that occur in the lexicon). But Dan has
; not obbeyed this except for verbs. Only the main verbs have...
; Lextypes... Only the naming convention is now in place for everything _le
; this is not a rule, and everything should work fine if we don't do this
; the LKB has become more and more efficient over time.
; ACE probably doesn't care (we can ask Woodley).
;
; This is the level immediately above lexicon (to allow a clean lexicon
; file, without features). The lexicon file should not carry more than
; the minimal info e.g STEM/ RELATION
;============================================================================
;============================================================================
; PERSONAL PRONOUN TYPES
;============================================================================
n_-_pr-1sg_le := personal-pronoun-lex & speaker-lex-item &
"""
1st person pronoun.
<ex>我 吃 苹果
<nex>这 我 吃 苹果
<ex>红 的 我 吃 苹果 [????]
should be modifiable
"""
[ SYNSEM.LOCAL.CONT.HOOK.INDEX.PNG.PERNUM 1sg ].
n_-_pr-1pl-incl_le := personal-pronoun-lex & basic-icons-lex-item &
[ SYNSEM.LOCAL.CONT.HOOK.INDEX.PNG.PERNUM 1pl_incl ].
n_-_pr-1pl-excl_le := personal-pronoun-lex & basic-icons-lex-item &
[ SYNSEM.LOCAL.CONT.HOOK.INDEX.PNG.PERNUM 1pl_excl ].
n_-_pr-2sg_le := personal-pronoun-lex & ;non-honorific-hearer-lex-item &
[ SYNSEM.LOCAL.CONT.HOOK.INDEX.PNG.PERNUM 2sg ].
n_-_pr-2pl_le := personal-pronoun-lex & ;non-honorific-hearer-lex-item &
[ SYNSEM.LOCAL.CONT.HOOK.INDEX.PNG.PERNUM 2pl ].
n_-_pr-2sg-hon_le := personal-pronoun-lex & ;honorific-hearer-lex-item &
[ SYNSEM.LOCAL.CONT.HOOK.INDEX.PNG.PERNUM 2sg ].
n_-_pr-3sg_le := personal-pronoun-lex & basic-icons-lex-item &
[ SYNSEM.LOCAL.CONT.HOOK.INDEX.PNG.PERNUM 3sg ].
n_-_pr-3pl_le := personal-pronoun-lex & basic-icons-lex-item &
[ SYNSEM.LOCAL.CONT.HOOK.INDEX.PNG.PERNUM 3pl ].
n_-_pr-wh_le := wh-pronoun
"""
Interrogative pronoun.
The pronoun should appear in place of a full noun phrase.
<ex> 她 要 什么 ?
(What does she want)
""".
n_-_indef-pr_le := quant-common-noun-lex & nonque-item
"""
These are generic indefinite pronouns that should have a noun head and some quantifier binding it.
These can be set directly on the lexical entry (e.g. every+person, some+thing).
[FIXME] Locative pronouns should probably be changed to inherit from this, so they can have fuller semantics.
[FIXME] This is inheriting from quant-common-noun-lex which defines the sort to be common-s (i.e. non-time). So to add pronouns like 'whenever' we would need to create a new very similar type.
Things that are quantifiers (e.g. some, a few) should not be of this type and should, instead, form a NP by a 0-nom-d phrase.
General pronouns. This is currently a quite broad category of pronouns that includes, for example, 大家 (everyone), 这些 (these), 有人(someone), 一点儿 (some), etc.
It also a includes a long list of the form 一+CL+CL, which would be close to 'each and everyone of'. However, I do think this is incorrect. They should still be treated as determiners of some sort.
This allows us to set the KEYREL.PRED generic_n_rel,
ALTKEYREL directly in the lexical entry.
""".
a_-_bound-posthead_le := attr-adj-posthd-lex & bound-lex-item & nonque-item.
a_-_first-bound-posthead_le := a_-_bound-posthead_le &
"""
们 can only modify nouns of SORT 'common-s' (common nouns that are
neither pronouns or proper nouns).
This is attempting MEN 们 to attach before everything else.
This includes other modifying morphemes (e.g. currently 自己)
But also any determiner phrases.
LOCAL.CAT.VAL [SPR 1-list] ===> the SPR of what I modify must be non empty.
MODIFIED notmod ===> I have not been modified (yet), this includes CL-phrases.
[FIXME] This is stil able to apply to noun-noun compounds as if it's not modified.
This also needs to be blocked.
FCB: It would be nice to be able to say that the thing I modify is a WORD (i.e. not a phrase)
Note of failure: we cannot use lex-synsem to specify that the thing being modified has to be a word (i.e. not a phrase yet) because the 'basic-head-mod-phrase-simple' rule doesn't actually link local (which would include the types).
"""
[SYNSEM.LOCAL.CAT.HEAD.MOD < [ LOCAL [ CAT [HEAD noun,
VAL [SPR 1-list] ],
CONT.HOOK.INDEX.SORT common-s],
MODIFIED notmod,
LIGHT + ] >].
n_-_pr-loc_le := loc-pronoun-lex & nonque-item.
;============================================================================
; COMMON NOUN TYPES
;============================================================================
; The main difference of nouns is the way they can/have to be specified over.
;
; There are also some tests that should be done to decide the correct type:
; [n_-_tm_le]: nouns that can be pumped into time adverbials
; [n_-_c_le]: ???
;
;
;============================================================================
n_-_le := nonloc-common-noun-lex & nonque-item
"""
Currently, general nouns such as dongxi (东西), shi (事).
these nouns can
这里 东西 很 贵 。 >>> parse
东西 很 贵 。 >>> parse
这 东西 很 贵 。 >>> parse
这 个 东西 很 贵 。 >>> parse
个 东西 很 贵 。 <<<<<<<<<<< this it doesn't parse (OK, should not parse in SUBJ position)
我 看 东西 。 >>> parse
我 看 这 东西 。 >>> parse
我 看 这 个 东西 。 >>> parse
我 看 个 东西 。 <<<<<<<<<<< this it doesn't parse (should, mean 一个)
东西 看 我 。 >>> parse
这 东西 看 我 。 >>> parse
这 个 东西 看 我 。 >>> parse
个 东西 看 我 。 <<<<<<<<<<< this it doesn't parse (OK, should not parse in SUBJ position)
个 苹果 (CL-NOUN) with nothing on the left are always indefinite.
一 个 苹果 (NUM-CL-NOUN) with nothing on the left are always indefinite.
有 'there is' existential must be followed by indefinite NPs.
for the verb this is not the case 我有这两个苹果,要不要一个? this is likely possible.
把 construction cannot take CL-NOUN, but can NUM-CL-NOUN or DET-CL-NOUN
一个苹果 in SUBJ position is generally not fine。 Exception would include:
一 个 人 会 吃 三十 苹果 。 (it has a capacity reading, it might be the case
that these capacity readings must be aspectless; ) 一 个 here means 'each
person' and not 'one person' when in subject position.
A count noun suffixed with -men 们 is interpreted as definite (for sure????)
他 会 去 找 孩子们。
count nouns suffixed with -men 们 cannot be preceded by a [Numeral + Classifier] sequence
even when the numeral is greater than ‘one’ (Iljic 1994, 2001, Li 1998, among others),
** Massifiers appear as nouns
杯_n_1 := n_-_le & (container)
瓶_n_1 := n_-_le & (container)
(e.g. 箱 by itself it doesn't appear as N)
三 杯 水 (杯 is a measure word)
三 杯 的 水 (杯 is a measure word)
* 我 买 了 杯 的 水 (杯 is a measure word)
Can we think of massifiers NOUNS that MUST take a numeral and be fully specified? Probably not...
Weight / Mass (Massifiers)
磅 bàng (bang) pound
吨 (噸) dūn (dun) ton
公斤 gōngjīn (gongjin) kilogram
斤 jīn (jin) 1/2 kilogram ( 1/2 kilo of apples 一斤苹果)
克 kè (ke) gram
两 (兩) liǎng (liang) 50 grams
千克 qiānkè (qianke) kilogram
These also can use DE 的
三 磅 (的) 水
三 磅 (的) 肉
* 我们 需要 磅 (的) 水 (the numeral is necessary)
* 我们 需要 磅 (的) 肉 (the numeral is necessary)
Time
辈 bèi (bei) lifetime (辈子)
点 diǎn (dian) o'clock
分 fēn (fen) minute
刻 kè (ke) quarter, 15 minutes
秒 miǎo (miao) second
年 nián (nian) year
日 rì (ri) day
时辰 shíchén (shichen) 2 hours
世纪 shìjì (shiji) century
岁 suì (sui) years old
天 tiān (tian) day (十天时间)
小时 xiǎoshí (xiaoshi) hour
载 zǎi (zai) year
钟 zhōng (zhong) hour (found in southern Chinese)
岁??? THEY ARE NOT 'REAL' NOUNS / JS: 'or you can say they are nouns that do not take CLs'
Some of these CLs can reduplicate to mean 'every X'
苹果个个都很好吃 (JS says everyone will accept this one)
?个个苹果都很好吃 (this one is controversial)
Normally CLs can reduplicate, while nouns can't (maybe only 人人).
So this is an argument to see words like 天 or 年 as CLs.
三 年 的 痛苦
? 三 年 痛苦 (in a list of misfortunes this can possibly be ok)
你 要 哪 个 水果 ?
你 要 哪 一 个 水果 ? looks indefinite but it's not the same meaning (Which one)
你 要 哪 两 个 水果 ? looks indefinite but it's not the same meaning (Which two)
? 你 要 哪 水果 ? JS 'a little bit strange'
你是哪年出生的?
?你是哪个年出生的? (1982, cow?)
?你是哪一年出生的?
*你是哪一个年出生的?
哪国人
你是哪国的人?
JS thinks that this MIGHT be an X-国 where X is replaced by 哪
哪天? 哪月?
How to say 3 countries signed a treaty? (3guo signed)
几月 (which month) 几个月 (how many months)
JS:
2 x common nouns: count and mass
kinship terms (哥哥, 老师)
proper names
这 苹果 很 贵 。 WHEN can you use the DEM without classifier? rint and lisa wrote about 'zhei'
这 水 很 脏
* 这 三 苹果
""".
n_-_c_le := count-noun-lex & nonque-item
"""
e.g. nouns like 山 (only listed as n_-_c_le countable noun)
These nouns are marked as (i.e. BARE -)
However it is still going through bare-np-phrase (I assume bare-np-phrase
should only accept BARE + ???). But the follow up question would be: are
there really BARE - nouns in Mandarin (things that can't be bare?)?
山 很 高 。
这 个 山 很 高 。
这 个 山 很 高 。
""".
n_-_m_le := mass-noun-lex & nonque-item
"""
Includes mass nouns such as: 水, 米饭, 饭
There are only 11 of such nouns... which sounds slightly ridiculous.
""".
; n-rel-loc_le := relational-obl-comp-noun-lex
n-rel-loc_le := relational-obl-comp-loc-noun-lex
"""
This is a relational locative noun (that expects a noun complement). This includes nouns such as 里, 上, etc. The complement is non-optional, effectively barring these nouns to stand on their own.
<nex> 她 在 里 学习 。
<ex> 她 在 房间 里 学习 。
""".
; n-opt-rel-loc_le := relational-opt-comp-noun-lex
n-opt-rel-loc_le := relational-opt-comp-loc-noun-lex
"""
This is a relational locative noun (that expects a noun complement). This includes nouns such as 里面, 里边, 上面, 上边, etc. The complement is optional. This allows these nouns to stand on their own.
<ex> 她 在 里面 学习 。
<ex> 她 在 房间 里面 学习 。
""".
n_-_loc_le := common-noun-lex & loc-lex-item & nonque-item
"""
This type is for 'location' nouns that should not be modified by locative nouns (e.g. 里, 上, 外, etc.).
They are, in themselves, partially locative. This is marked by [SYNSEM.LOCAL.CAT.HEAD.LOC +] allows these to be pumped into locative adjuncts without the use of 在 (using a robust rule).
These include form formations such as 校外 (off-campus) and 网上 (online), which although might look that could be treated compositionally, they make use of bound morphemes (e.g. 校) for which we do not have a better treatment at the current moment.
=======================================================
ZZF 2015-5-3 temporary for locational nouns (not proper nouns). To be fixed when have time.
JS: Locative Nouns normally don't use classifiers in discourse.
loc-lex-item only introduces the following constraint:
[SYNSEM.LOCAL.CAT.HEAD.LOC +]
However this is never used elsewhere in the grammar.
Currently this type has roughly 60 nouns that are a complete mess.
This type should probably be related with the concept 方位词 [fāng wèi cí] (words of locality).
Location adjuncts are mostly PP phrases (using either or both preositions and 'postpositions'/locative nouns).
<ex> 她 在 家 吃 火锅 。
(She is eating/eats hotpot at home.)
<ex> 她 家 里 吃 火锅 。
(She eats hotpot at home.)
<ex> 她 在 家 里 吃 火锅 。
(She eats hotpot at home.); JC: with ZAI is a bit better.
<ex> 她 家 里面 吃 火锅 。
(This predicts that 里面 and 里 behave similarly in allowing themselves to be modified by a noun)
<ex> 她 在 家 里面 吃 火锅 。
(This predicts that 里面 and 里 behave similarly in allowing themselves to be modified by a noun)
<ex> 他 在 学校 里面 学习 。
Placement of these adjuncts are important. Always pre-verb (unless they
are complements); and always after time-adjuncts.
<ex> 我 昨天 在 家 里 吃 火锅 。
<ex> 昨天 我 在 家 里 吃 火锅 。
(I ate hotpot at home yesterday)
<nex> 我 在 家 昨天 里 吃 火锅 。
There are some special cases:
beside/next to (旁边 – páng biān)
opposite (对面 – duì miàn)
in the middle (中间 – zhōng jiān)
It seems 边 can NOT be seen as a postposition...
<nex> 她 每天 学校 边 见 他 的 朋友 。
(this would predict that 边 cannot be used with other nouns too.
only with a special class 外 里 etc., so they should probably be considered as words for now)
学校 边 (NO!)
<ex> 她 每天 学校 旁边 见 他 的 朋友 。
There are special cases for 边 when it means 'border' (and not side).
So 边 must also be a word on its own (but as a normal noun)
池塘 边
河 边
海 边
路 边
<ex> 她 河 边 看 书 。
(rare but probably possible)
<ex> 她 在 河 边 看 书 。
(this is much better)
Check if 面 could be used as a postposition. E.g.:
<nex> 她 房子 面 写 了 他 的 名字 。
(If this is possible some of the discussion below might not be too relevant)
桌面 (this is the SURFACE OF TABLE) -- but this is bound 桌 (this has a class of nouns with ZI ending)
<nex> 她 房子 的 表面 写 了 他 的 名字 。
()
<ex> 她 在 房子 的 表面 写 了 他 的 名字 。
<ex> 她 在 房子 表面 写 了 他 的 名字 。
Some words, behave as both 'postpositions' and locative nouns (e.g. 里面).
These words can not only help build a location PP-like NP; but also be (by itself) a
noun and a locative adverb without any help from (other) prepositions.
<ex> 她 房子 里面 吃 火锅 。
<ex> 她 在 房子 的 里面 吃 火锅 。
<ex> 她 房子 里面 吃 火锅 。
<???> 她 饭馆 的 里面 玩 。
JC: It seems this one is not perfect, with 在 is better, but in speech should be acceptable.
(if it's true that 里面 is LOC+, then even with a 的 phrase it should be possible to automatically pump them into locative adjuncts)
她 坐 在 外面。
<ex> 她 在 里面 吃 火锅 。
(this is perfect, no problem)
<???> 她 里面 吃 火锅 。
(this is she IS inside eating... so without ZAI sounds weird)
(This predicts that 里面 can be used a locative adjuct without the help of other)
<ex> 我们 里面 吃 火锅 。
(the meaning is special: we will GO inside eat... )
SOME OF THESE CAN IN SPEECH (usually use 呢). It becomes very flexible.
你的女儿干嘛呢?
她外面玩呢.
你的女儿干什么呢?
我家里面吃饭呢
你在做什么呢?
我学校里面学习呢
[BUT ALSO] 我 学校 学习 呢 . [!!!!!!!!!!]
接下来, 我们 做 什么 ?
接下来,我们 家 里面 谈 。
<ex> 里面 很 大 。
(This predicts that 里面 can be used as subject;)
<ex> 饭馆 的 里面 非常 漂亮 。
(This shows 里面 can be the head-noun of a 的 construction.)
What would make sense would be to include only these nouns here,
in addition to a rule to allow these to build a loc-adv
(similar to what happens to time-nouns).
We probably need some bound loc nouns (里) which cannot stand on their own.
And some normal 里面 .
Tests with 的 show that these post-position like locative nouns actually
behave 100% as nouns (but not the bound ones).
<ex> 饭馆 的 里面 很 漂亮 。
(The inside of the restaurant is very beautiful); 里面 is the head, and it's a NP.
<nex> 饭馆 的 里 很 漂亮 。
(should not be possible because 里 is bound -- so it can't be picked by 的 on its own)
<nex> 饭馆 的 在 里面 很 漂亮 。
(should not be possible, because 的 has the HEAD on the right, and must be a NP -- PP is not acceptable)
<ex> 里面 的 桌子 比较 好 。
(the table inside is [relatively] better); This should be analysed as (NP 的 NP)
<ex> 在 里面 的 桌子 比较 好 。
(the table inside is [relatively] better); This should be analysed as (PP 的 NP)
<ex> 商场 的 饭馆 里 很 安静 。
(The inside of restaurants in/of malls); ??? Is this possible?
If this sentence is possible this would draw a distinction between PP and NP, and suggest 里 is in fact a (complement-taking) NOUN.
There are probably some other special nouns that can behave as locative.
see also '网上' it should be a noun, but can be used directly as loc-adv.
原地 yuándì (original place)
原处 yuánchù (original place)
北方 (north) VS 北面 VS 北边 (north) -- if 边 is analysed as this special N, then it's fine to behave
桌面 (this is the SURFACE OF TABLE)
<ex> 网上 有 很 多 新闻 。
<ex> CCTV 的 网上 有 很 多 新闻
<ex> 她 在 网上 找到了 工作 。
<nex> 她 网上 找到了 工作 。 (needs ZAI because it's not imperative?)
<nex> 她 在 网上 上 找到了 工作 。
<???> 你 网上 检查 吧! (since Chinese doesn't use 去网上, this will not be possible)
<ex> 你 上网 检查 吧! (BUT 上网 is a sep-v, so this is not the same)
So, if these nouns can be to the right of a DE construction, they have to be a NOUN.
But if they can be used as a location adjuct without a 在 / 从 etc preposition, then this would make them special.
<> XXX 的 网上 VP/AP.
<> 中国 的 网上 VP/AP.
<nex> 她 说 的 原处 等 你 。
(???)
<ex> 她 原处 等 你 。
(but still a bit informal. with ZAI is better.)
<nex> 她 (学校) 北方 等 你 。
(not too good. Not correct)
<ex> 她 (学校) 北面 等 你 。
(with ZAI for written)
<ex> 她 (学校) 北边 等 你 。
(with ZAI for written)
东/西/南/北/前/后 /左/右/上/下/里/外 are often used in conjunction with suffixes such as 边/面. while 旁 takes only 边 , 对only takes 面, and 间 goes only with 中.
Single character nouns of locality like the ones listed above are rarely used on their own. They normally attach to other nouns.
??? Are all single character LN bound? 东/西/南/北/前/后 /左/右/上/下/里/外
When 上边/面, 下边/面, 里边/面 ,外边/面and 旁边 are modified by other nouns, the character 边/面 can be omitted.
But never 左,右 ,对,中 + suffix, the suffix can be omitted if 的 is omitted. For example:
桌子(的)上(边/面) = 桌子上 Zhuōzi shàng on the table
门的后边 = 门后 Mén hòu behind the door
饭馆的前边 = 饭馆前 Fàn guǎn qián front of the restaurant
房间的里边 = 房间里 Fángjiān lǐ in the room
床的下边 = 床下 Chuángxià under the bed
沙发的旁边 = 沙发旁 Shāfā páng next to the sofa
Geographical (proper) names (Country /Place) such as 中国,英国,北京,伦敦etc. never work with direction words 里/里面/里边,for example:
<ex> 她 在 上海 工作 。
(She works in Shanghai.)
<nex> 她 在 上海 里面 工作 。
<ex> 英国 有 很 多 大学 。
(There are many universities in UK.)
<nex> 英国 里边 有 很 多 大学 。
How about pronouns?
她 里面 有 很 多 感染 。
在 她 里面 有 很 多 感染 。
她 的 里面 有 很 多 感染 。
在 她 的 里面 有 很 多 感染 。
(must use 身体, so can probably block pronouns)
Coercion of Locatives in Mandarin Chinese (Qiang Li, 2013)
Locativesin Chinese include simple and synthetic ones. shàng (上 above)、 xià (下 below) 、 qián( 前 front) 、hòu (后 after)、lǐ (里 in)、 wài ( 外 out) are simple locatives, which compose synthetic ones with suffix like biān (边 side)、 miàn ( 面 surface). This paper will take jiān (间 between) 、 shàng ( 上 above) 、 qián (前 front)、lǐ (里 in) for example
SUMMARY:
- 'Locative-nouns' cannot take Proper Nouns as complements.
<ex> 我 在 北京 。
<nex> 我 在 北京 里 。
<ex> 我 住 在 北京 。
<nex> 我 住 在 北京 里 。
<ex> 我 在 北京 吃 了 火锅 。
<nex> 我 在 北京 里 了 火锅 。
- When container-measure-words are used in locative phrases they MUST use a locative noun/postposition. (We are not dealing with this for now)
<nex> 巧克力 在 盒子 。
<ex> 巧克力 在 盒子 里 。
<ex> 巧克力 在 盒子 里面 。
<ex> 巧克力 在 盒子 上 。
- It seems that bare locative phrases entail 'go' while phrases with a prep (e.g. ZAI) do not.
Is it the case that bare locative phrases are used only in imperatives?
<???> 她 房子 里面 吃 火锅 。
This sentence doesn't feel right to JC. This is possible to be imperative.
The feeling is that there must be a 'go' meaning, which is similar to imperative.
And in non 1st person or 2nd person pronouns is very weird.
'BARE-LOCATIVE-NP' can be used in imperative sentences.
'BARE-LOCATIVE-NP' cannot describe a state.
- In imperative clauses, the use of 的 with possibly blocks the possibility to become adjunct.
<ex> 你们 房子 里面 吃 火锅 。
<nex> 你们 饭馆 的 里面 玩 。
<ex> 你 在 家 的 里面 吃 火锅 。 (it feels unnecessary but its grammatical)
<ex> 你 房间 里面 玩儿 !
<ex> 你 去 房间 里面 玩儿 ! (this is the meaning of the above sentence, makes it easier, but the above one have no problem)
<nex> 你 在 房间 里面 玩儿 ! (imperative ****CANNOT**** HAVE ZAI-prep)
<ex> 你 在 房间 里面 玩儿 吧 。 (BUT with BA is ok ?? THIS is very similar to a QUESTION... so it's compatible with )
<ex> 你 在 房间 里面 玩儿 好吗 ? (or HAO-MA is ok -- this is not an order, this is a description of a possible situation.)
<nex> 你 房间 的 里面 玩儿 ! (even in imperative sentences cannot add 的)
<ex> 她 在 房间 里面 玩儿 。 (a description, not an imperative sentnece)
<ex> 你 在 房间 的 里面 吃 火锅 。 (it feels unnecessary but its 100% grammatical)
'的 里面' does NOT behave the same as 里面
<ex> 她 在 房间 里 玩儿 。 (obviously ok)
<nex> 她 在 房间 的 里 玩儿 。 (的 里 cannot be used, because 里 is a bound noun/morpheme)
(should not be possible because 里 is bound -- so it can't be picked by 的 on its own)
- Under our current analysis of LN as prepositions, we are unable to parse sentences where they necessarily need to be a NOUN. E.g.:
<ex> 房间 里 很 漂亮 。
<ex> 房间 里 有 三 个 人 。
<ex> 桌子 上 很 漂亮 。
<ex> 商场 的 饭馆 里 很 安静 。
- We're also blocking NPs like 房间 里 to serve as complements in verbs like:
<ex> 我 去 房间 里 。
<ex> 我 去 房间 里面 。 (this second is somehow better, but above also can)
- Special LN:
beside/next to (旁边 – páng biān)
opposite (对面 – duì miàn)
in the middle (中间 – zhōng jiān)
<ex> 你 旁边 玩儿 !
<ex> 你 外婆 旁边 玩儿 ! (should be fine, but with 去 should be better)
<ex> 你 外婆 的 旁边 玩儿 ! (should be fine, but with 去 should be better)
<ex> 她 在 房间 旁边 玩儿 。
<ex> 她 在 房间 的 旁边 玩儿 。
<ex> 她 在 外婆 旁边 玩儿 。
<ex> 她 在 外婆 的 旁边 玩儿 。
DE selection seems to have a preference whether it's human or not human, but all are ok.
For imperative sentences, it's proably better to require '去'. Without 去 it's perhaps a bit too colloquial (spoken).
<ex> 我 在 外地 学习 。
<???> 我 在 外地 里 学习 。
Same as 双方 ... cannot take these morphemes
你 外地 学习 !
外校 (other school)
你 外校 学习 。
外校 also doesn't use 里
校外
""".
n_-_h_le := nonloc-common-noun-lex & nonque-item & human-lex-item
"""
Includes nouns such as 爷爷, 爸爸, 老师
"""
.
n_-_tm-bound_le := bound-time-noun-lex & nonque-item
"""
Currently, this type includes only: 点, 分, 分钟.
This type should like be used for nouns that must be preceeded by a number,
and that cannot be used by themselves.
""".
n_-_tm_le := time-noun-lex & nonque-item
"""
These are nouns that can be transformed into time-adverbs
(using np-time-adp) and used to modify predicates.
<ex> 她 星期天 喜欢 看 书 。
(She likes to read on Sundays)
<ex> 她 昨天 很 漂亮 。
(Yesterday she was/looked beautiful)
Includes nouns such as 八七年, 八点, 六二四年, 六日, 冬天, 古代, 周五
They all seem to indicate time points.
In the past, time nouns were using being blocked to serve as subjects.
This was deemed too restrictive, as it blocked sentences like:
<ex> 一月 是 我 最 喜欢 的 月份 。
(January is the month I like best)
This is no longer the case. However, lifting this restriction had a big impact
on ambiguity (roughly doubling it). This restriction should perhaps exist
directly in certain verb classes, asking for their subject's SORT to be of type
'nontime-s'.
This noun type is also being blocked from spurious ambiguity through Noun-Noun
compounds. It was argued (to be confirmed) that these nouns cannot participate
in certain N-N compounds, so it is actually a useful type. This is being done
directly on the compound rules.
""".
;============================================================================
; PROPER NOUN TYPES
;============================================================================
n_-_pn_le := proper-name-lex & nonque-item
"""
This seems to be used as a catch-all class for proper nouns.
This might have been used for entries that were not sorted.
""".
n_-_l_pn_le := loc-name-lex & nonque-item
"""
This is used for locations, including regions and country names. But also for some company names, such as Bank of China (中银) and other basic locations such as estuary (入海口). The last one probably should be deleted at some point.
According to the comments left on loc-name-lex, this was originally to
prevent sentences such as *我 要 去 李四 (e.g. 李四 is the name of a person), but this is actually not being blocked.
[FIXME] Check what else this might be doing.
""".
n_-_l_pn-bound_le := loc-name-lex & bound-lex-item & nonque-item
"""
This is only being used for capital (都, du). I wonder is this is really
necessary.
""".
n_-_h_pn_le := human-name-lex & nonque-item
"""
Proper names for people. e.g. 张三, 朱莉娅罗伯茨 (Julia Roberts), John, Lee.
This seems to include both first and last names, as well as entries for both
firs+last names as a single token.
""".
n_-_s_pn_le := surname-lex & nonque-item
"""
This includes a list of a few of the common surnames.
""".
;============================================================================
; CLASSIFIER TYPES
;============================================================================
c_-_le := classifier-lex & nonque-item.
;============================================================================
; ADJECTIVAL TYPES
;============================================================================
a_-_num_le := card-num-adj-lex & nonque-item
"""
This type refers to numbers, percentages and a couple other special numeral entries (e.g. ①).
They are marked as [CARDINAL +].
""".
a_noncard-num_le := noncard-num-adj-lex & nonque-item
"""
This type is strictly for non-cardinal numerals. Currently only 二 is considered non-cardinal.
Non-cardinal numbers are block from participating in specifier-head constructions between numerals and classifiers.
""".
a_-_att_le := attr-adj-lex & nonque-item
"""
This type refers to adjectives that can directly modify a noun without a
relative clause. This is not very productive, so we are currently not doing
this conversion by a lexical rule.
<ex> 贝贝 是 我们 的 好 朋友
If an adjective can be used both attributively and predicatively, then
we currently have two lexical entries. Eg. 好_j_1 (predicative) and
好_j_2 (attributive).
""".
a_-_pred_le := pred-adj-optspr-lex & nonque-item
"""
This is where most adjectives are (1700+).
This types refers to adjectives that can be used as predicates with or without degree
modifiers.
[FIXME] Need to further split adjectives into more types, e.g. adjectives that CAN'T take degree modifiers. This generally should include some idiomatic phrases that are adjectival in their essence, and a restricted number of adjectives.
<???> 我们 一起。 一起 is listed but it should be an ADV
[FIXME] This should be the exception to our rule that all adjs must have
SPR empty. This was perhaps completely unconstrained at some point in the
past (to raise parsing %) and now we have over-constrained it to never
allow this.
This type currently holds around 1700 entries, and I believe most might
incorrectly typed.
This type includes A LOT of types that should probably be verbs,
and even many entries that are SPECIFIED over (e.g. negated) which
should not really come here... because the negation would 'specify' the
ADJ enough.
This type fits best idioms such as 一丝不苟 (yì sī bù gǒu) [idiom. to do
something strictly according to the rules or in a meticulous manner])
很一丝不苟的男士
we possibly want to create subtypes:
a_-_nonopt_spr or a_-_spr
a_-_opt_spr
a_-_no_spr
""".
a_-_aabb_le := adj-aabb-lex & nonque-item.
a_-_abb_le := adj-abb-lex & nonque-item.
;============================================================================
; ADVERBIAL TYPES
;============================================================================
adv_-_deg-pre_le := deg-int-adv-vp-pre & nonque-item
"""
Basic type for degree specifiers.
E.g. 很, 非常
There is also a 不 that is considered a degree specifier.
""".
adv_-_s-pre-no-rel_le := int-adv-s-pre-no-rel & nonque-item
"""
This adverb type introduces no semantics.
[FIXME] Although part of the name ('s-pre') suggests it should only appear pre-hd for full sentences, it is also allowing pre-VP modification.
[FIXME] There are only a hand full of adverbs with this type, but it's not clear why they would have no semantics.
""".
adv_-_s-pre-rel_le := int-adv-s-pre-rel & nonque-item
"""
Intersective pre-sentential adverb.
[FIXME] Similar to what happens with 'adv_-_s-pre-no-rel_le', although part of the name ('s-pre') suggests it should only appear pre-hd for full sentences, it is also allowing pre-VP modification.
This should be constrained further to match the type name. We should perhaps have a general type of adverbs that can appear both pre-S and pre-VP (with an adequate name).
""".
adv_-_vp-pre_le := int-adv-vp-pre & nonque-item
"""
Intersective pre-VP adverb.
These adverbs can only occure in pre-VP position (i.e. never before the subject).
""".
adv_-_vp-pre-norel_le := int-adv-vp-pre-no-rel & nonque-item
"""
Intersective (strictly) pre-VP adverb introducing no semantic relations.
Currently this is only for 正在. It does change the verb ASPECT to durative.
[FIXME] I'm not 100% sure why 正在 would not introduce a relation. Should motivate and leave some notes here.
[FIXME] The name of this type should be changed to better reflect the very specific entry it represents. We could have something like 'aspectual adverbs'.
""".
adv_-_neg_le := neg-adv-lex & nonque-item
"""
Negator adverbs. Currently for 不 and 没.
""".
adv_-_scop_le := scop-adv-lex & nonque-item
"""
""".
adv_-_scop-prehd_le := scop-adv-prehd-lex & nonque-item
"""
""".
scop-adv-s-prehd_le := scop-adv-s-prehd-lex & nonque-item
"""
Strictly pre-head sentential adverbs. Predicts that some adverbs
can only appear to the left of a sentence. Currently on mal-也
is using this, but I have a hunch this might be needed to some
other (non-mal) adverbs in the future.
""".
adv_-_imp_le := imperative-aux-lex & nonque-item
"""
This type is specifically for 别 (imperative) negation.
""".
;; WWJ 2015-02-08: Now known as A NOT A
adv_-_polar_basic_le := a-not-a-basic-adv-lex.
adv_-_polar_contracted_le := a-not-a-contracted-adv-lex.
adv_-_polar_ab-not-ab_le := ab-not-ab-adv-lex.
adv_-_polar2_basic_le := a-not-a-bu-basic-adv-lex.
adv_-_polar2_contracted_le := a-not-a-bu-contracted-adv-lex.
adv_-_polar2_ab-not-ab_le := ab-not-ab-bu-adv-lex.
adv_-_tq_le := tag-adv-lex.
;============================================================================
; VERB TYPES
;============================================================================
v_-_le := v_intrans-verb-lex & nonque-item
"""
Simple intransitive verbs.
""".
v_np_le := v_trans-verb-lex & nonque-item & norm-comps1-lex
"""
Simple transitive verbs (i.e. taking one NP as complement).
""".
v_np-np_le := ditrans-verb-lex & nonque-item
"""
Simple ditransitive verbs (i.e. taking two NP as complements).
e.g.: 告诉, 卖, 教
<ex> 我 告诉 她 一 个 消息 。
<ex> 我 卖 她 一 本 书 。
<ex> 我 教 她 中文 。
我 告诉 LE/GUO 她 一 个 消息 。
我 卖 LE/GUO 她 一 本 书 。
我 教 LE/GUO 她 中文 。
* 我 告诉 ZHE 她 一 个 消息 。 (it has to do with the length of the verb action)
* 我 卖 ZHE 她 一 本 书 。 (in it's ditransitive
* 我 卖 ZHE 新加坡 中文 书 。
# if it has an indirect object like this, the verb seems restricted to 'one time' thing, so ZHE makes no sense.
? 我 教 ZHE 她 中文 (feels unfinished, needs the)
# The problem seems to be the singular object, which makes it a single action.
# However even with
* 我 告诉 ZHE 她 消息 。 (it has to do with the length of the verb action)
* 我 卖 ZHE 她 书 。 (in it's ditransitive
我 在 告诉 她 一 个 消息 。
我 在 卖 她 一 本 书 。
我 在 卖 新加坡 中文 书 。
我 在 教 她 中文
我 在 告诉 着 她 消息 。 if the object is not single, then it's perfectly fine
我 在 卖 着 她 书 。 if the object is not single, then it's perfectly fine
我 在 卖 着 新加坡 中文 书 。
我 在 教 着 她 中文 。 this one should be ok!
""".
v_s_le := v_s-comp-lex & nonque-item
"""
Verbs capable of taking a sentence as complement.
E.g.: 知道, 发现, 了解, 原定,
我 知道 她想去北京. OK
我 知道 LE 她想去北京. (no...)
我 知道 GUO 她想去北京. (NO!!!! really no)
我 知道 ZHE 她想去北京. (NO!!!! really no)
我 MEI 知道 她想去北京. (no...)
我 MEI 知道 GUO 她想去北京. (NO!!!! really no)
我 BU 知道 她想去北京. OK
我 ZAI 知道 她想去北京. NO
我 ZAI 发现 她想去北京. NO
我 ZAI 了解 她想去北京. NO
# This behaves differently but it requires a question!
# This could be enough reason to be different
# The complement should be in a form a question, but it doesn't produce a question.
我 问 谁不来. OK
我 问 了 谁不来. OK
我 问 过 谁不来. OK
我 问 着 谁不来. Ok
我 在 问 着 谁不来. Ok
我 知道 了 谁不来. OK
知道 + QUESTION should be ok
""".
;========================================================
; VERBS THAT CAN BE DEGREE SPECIFIED
;
; Some verbs in Mandarin Chinese can be ; specified with
; a degree adverb (similar to what happens to most
; adjectives). These verbs usually fall in one of such
; classes: