Parsing for 宣さない #1467
I am curious about whether this form is actually in use. When searching for 宣さない on Google, there are only 2 or 3 unique results. If this form is commonly used, I would expect to see a distinct entry, like for 察する→察す or 達する→達す. |
I also could only find a few references using Google, mostly of the form ファウルを宣さない or ヴァイオレイションを宣さない and mostly in the context of basketball. I looked up the mail where this issue was reported and found the original report:
Does that clarification help? |
It helps a lot, thank you! Maybe I'm wrong but I don't think this is something we need to implement in the inflection parsing because as the linked source states:
I think that in cases where the さない form applies, there is always a separate dictionary entry as a す-Godan verb, so these cases will correctly match all forms. Just as in the examples I mentioned in my previous comment, there are entries for both 察する and 察す, as well as for 達する and 達す, with the exact same definitions. Also, when you search for the す-version on that site, you will be forwarded to the する version: |
I guess the problem for 宣する is that there is no 宣す-Godan verb entry, so if you try to use 10ten to look up ファウルを宣さない it will fail. Perhaps the author of the email encountered such content. Perhaps we should submit an entry to JMdict for 宣す, since it looks like 日本国語大辞典 (小学館) has it: https://kotobank.jp/word/%E5%AE%A3%E3%81%99-3137997 Weblio also has it, but with a warning that its conjugation data is programmatically generated:
I couldn't find it in 大辞林 however, only 宣する. |
Which dictionary do we trust more? |
I trust 日本国語大辞典 more than JMdict but in any case we can submit it to JMdict and the editors there can decide if it's worthwhile since they have access to more dictionaries than I do.
Yes, definitely. But "宣す" appears to be quite common, showing up in a lot of official documentation in phrases like "開会・閉会を宣す", "承認する旨を宣す" etc. |
Great. Thank you for taking care of this! |
Well, it seems that with new submissions they don't allow the す-Godan v5s tag on this kind of verb anymore. In the end, it is indeed something we need to implement in the inflection logic. Edit 1: There still seems to be something I'm missing. I don't really know much about old classical Japanese literature but reading the Wikipedia article about the サ行変格活用 , there are two types of サ行変格活用-verbs ending in す:
However, 宣す being used in today's writing and as 宣さない (to my understanding す doesn't conjugate to さない) strongly suggests to me that it (also) belongs in the second category, and therefore can also be tagged with v5s. Would you agree? Edit 2: To catch every case of verbs from the second category, regardless of their dictionary entries, I suggest that we add the 未然形 さ (and the following auxiliary verbs) to する-verbs with a special reason for deinflection. Some dictionaries call the 五段-transformation 五段化, so I think recognizing such forms as for example |
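The proposal in Edit 2 could be sketched roughly as follows. This is only an illustration under stated assumptions: `GODANIZED_ENDINGS`, `deinflectGodanized`, and the `Candidate` type are hypothetical names rather than anything from the 10ten codebase, and the list of endings is deliberately incomplete.

```typescript
// Hypothetical sketch; none of these names come from the 10ten codebase.
type Candidate = { word: string; reason: string };

// A few endings that attach to the 未然形 さ of a Godan-ized する-verb.
// Illustrative only; a real rule set would need more auxiliaries.
const GODANIZED_ENDINGS = ['さなかった', 'さない', 'さず'];

// Map a surface form such as 宣さない back to a する-verb candidate,
// tagging it with a dedicated reason so it can be explained to the user.
function deinflectGodanized(surface: string): Candidate[] {
  const results: Candidate[] = [];
  for (const ending of GODANIZED_ENDINGS) {
    if (surface.length > ending.length && surface.endsWith(ending)) {
      const stem = surface.slice(0, -ending.length);
      results.push({ word: stem + 'する', reason: '五段化' });
    }
  }
  return results;
}

console.log(deinflectGodanized('宣さない'));
```

The candidate still has to be checked against the dictionary afterwards, so a form like 話さない would produce a 話する candidate that simply finds no entry.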
Thank you so much for looking into all this. I'm afraid classical Japanese verb inflections are not something I'm familiar with. I looked into them once in order to generate suitable localizations for the part-of-speech tags, but I mostly found that different sources used different terminology. From what I understand, your proposal makes sense. Would there be a risk of matching multiple entries, however? e.g. matching both the entry for 愛する and 愛す when looking up 愛せない for example? For 五段化 we could just go with "Godanka" or perhaps we could try a little harder and go with "as Godan verb". Better still, do we even need to show the reason to the user? |
Yeah, it is very confusing. It also seems to me that the categorization in JMdict and its application are somewhat ambiguous.
Yes, as this is already the case with many forms: 愛した, 愛して, 愛します, 愛される, 愛させる.
I can imagine people being very confused, I certainly would be, when seeing 愛さない is matched with 愛する, as さない is not an inflection of する. In my opinion, knowing that this is because the verb can also be inflected as a Godan verb would instantly clear things up. "as Godan verb" would serve this purpose very well, I think. |
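To make the double match and the proposed label concrete, here is a minimal sketch. The three-word `DICTIONARY`, the `lookupNegative` function, and the reason strings are all assumptions made for illustration; they do not reflect how 10ten actually stores entries or reasons.

```typescript
// Hypothetical sketch; the tiny dictionary and all names are illustrative only.
type Match = { entry: string; reason: string };

// 愛す and 愛する both have entries, while 宣す has none (as noted in this thread).
const DICTIONARY = new Set(['愛す', '愛する', '宣する']);

// Look up a さない form via two routes: the ordinary Godan negative (→ す)
// and the Godan-ized する-verb route (→ する), labeled "as Godan verb".
function lookupNegative(surface: string): Match[] {
  const ending = 'さない';
  if (surface.length <= ending.length || !surface.endsWith(ending)) {
    return [];
  }
  const stem = surface.slice(0, -ending.length);
  const candidates: Match[] = [
    { entry: stem + 'す', reason: 'negative' },
    { entry: stem + 'する', reason: 'negative, as Godan verb' },
  ];
  // Keep only candidates that exist in the dictionary.
  return candidates.filter((c) => DICTIONARY.has(c.entry));
}

console.log(lookupNegative('愛さない')); // matches both 愛す and 愛する
console.log(lookupNegative('宣さない')); // matches only 宣する
```

In this sketch 愛さない yields two matches, which is the same situation that already exists for forms like 愛した, while 宣さない is found only through the する route.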
Oh, so it is! Just a thought, I noticed that the Supplementary comments for single-kanji ~する verbs has:
If that's true and all these する verbs that display the 五段化 behavior also have a corresponding す verb in JMdict (whether or not it is marked as v5s or vs-c) can we make the 五段化 rule have a to-type of If not, then I think your proposal makes sense. Although would we make the to-type for said rules be Sorry if my questions don't make sense, I might not be understanding the different cases properly. |
Thanks for sharing the link. That was some really useful information. So if I'm understanding correctly, to sum up what I have read about this topic, there are 3 categories for suru-verbs in JMdict:
Initially, I thought this was the case, but it is not. For example, 発する, the example given in the Wikipedia article, has no corresponding entry for 発す but shows all of the above-mentioned irregularities. What I'd propose is
|
Thank you so much for the thorough investigation here! That's really helpful.
That sounds perfect. Thank you again! |
Great, I will start working on it once the 敬語-PR is merged. |
As reported by email, this is a negative form of 宣する