forked from GiellaLT-Archive/clean_lang_history
est-x-plamk.diff
360d359
< Updated ignore patterns. 2019-10-23T18:25:34+00:00
367d365
< Force unix line endings, to make sure it works ok also on the Windows subsystem for Linux. 2019-10-07T17:04:57+00:00
377d374
< Updating svn ignores for tools/analysers/. 2019-06-14T06:33:54+00:00
382,383d378
< Updating svn ignores. 2019-05-24T09:52:39+00:00
< Updating svn ignores. 2019-05-24T09:40:09+00:00
405d399
< Updated svn ignores. 2019-02-27T10:21:10+00:00
434,435d427
< Ignore compiled cg3 files in tools/tokenisers/. 2019-01-08T07:06:40+00:00
< Ignore more files, including files that are automatically added to svn when populating a new language. This is done to avoid them showing up as noise for external languages, in which case these files might not be in our svn (but in the external svn repo instead). 2019-01-08T06:56:01+00:00
488d479
< Updated svn ignores. 2018-09-25T08:25:04+00:00
493d483
< More general ignore pattern for tools/mt/apertium/tagsets/. 2018-09-10T11:03:59+00:00
496d485
< Updated svn ignore patterns. 2018-09-08T05:26:53+00:00
507d495
< Updated svn ignores. 2018-08-30T15:58:31+00:00
510d497
< Updated svn ignores. 2018-08-29T05:25:44+00:00
512d498
< Updating svn ignores. 2018-08-28T10:41:36+00:00
533d518
< More things to ignore. 2018-05-14T09:52:56+00:00
569d553
< Added svnignore pattern for sigma.txt. 2018-02-21T10:01:06+00:00
572d555
< Two more files to ignore. 2018-02-06T09:34:41+00:00
583d565
< Updated svn ignores. 2018-01-31T12:06:31+00:00
638d619
< Updated svn ignores. 2017-12-11T12:51:45+00:00
667,668d647
< Updated svn ignores for tokenisers and grammar checkers + subdirs. 2017-10-11T11:35:46+00:00
< Updated svn ignores for tokenisers and grammar checkers + subdirs. 2017-10-11T11:03:08+00:00
696d674
< Updating svn ignores. 2017-08-25T10:16:46+00:00
717d694
< Updating svn ignores. 2017-06-28T23:38:04+00:00
719d695
< Updated svn ignores. 2017-06-28T17:12:58+00:00
762d737
< Updated svn ignores. 2017-03-01T11:26:04+00:00
790d764
< Updated svn ignores. 2017-01-30T09:54:28+00:00
931d904
< Updated svn ignores. 2016-06-09T20:04:14+00:00
959d931
< Setting svn ignore patterns on tools/spellcheckers/filters/. 2016-05-10T01:00:12+00:00
983d954
< Ignore more preprocessor files = fst’s. 2016-04-14T16:01:35+00:00
987d957
< Updated svn ignores. 2016-03-15T19:55:06+00:00
990d959
< Use a more general svn ignore pattern in src/morphology/. 2016-03-07T17:11:09+00:00
1010d978
< Updated the svn ignore script with recent additions to the infrastructure. 2016-02-16T22:23:49+00:00
1015,1016d982
< Updating svn:ignore’s. 2016-02-02T15:31:07+00:00
< Updating svn:ignore’s. 2016-02-02T15:21:38+00:00
1021,1022d986
< Updated svn:ignore’s. 2016-02-02T10:34:44+00:00
< Updating svn:ignore’s. 2016-02-01T22:18:32+00:00
1026d989
< Updated svn ignores. 2016-01-25T08:12:56+00:00
1038d1000
< Updated svn:ignore’s. 2015-11-18T23:09:59+00:00
1053d1014
< Updated svn ignores. 2015-10-20T07:52:23+00:00
1078d1038
< Ignore temporary files generated by the speller suggestion test script. 2015-09-02T20:01:50+00:00
1124d1083
< Ignore txt files in speller dirs. 2015-04-09T11:49:00+00:00
1134d1092
< Updated svn ignores. 2015-03-14T10:56:07+00:00
1139d1096
< Updated svn ignores. 2015-03-12T08:28:25+00:00
1145d1101
< Updated svn ignores. 2015-03-09T10:43:19+00:00
1147d1102
< Updated svn ignores. 2015-03-06T15:57:33+00:00
1150d1104
< Updated svn ignores. 2015-03-06T09:24:56+00:00
1157d1110
< Update svn ignores. 2015-02-27T12:58:52+00:00
1193d1145
< Special svn:ignore on src/orthography/. 2015-01-26T10:34:37+00:00
1207d1158
< Updated svn:ignore's. 2015-01-12T21:52:28+00:00
1223d1173
< Update ignores for src/morphology/. 2014-10-23T08:28:02+00:00
1261d1210
< Updated svn:ignore's. 2014-09-08T21:40:37+00:00
1278d1226
< Updated ignores. 2014-08-08T10:07:48+00:00
1281c1229,1453
< Moved Neeme's EST (Estonian) work to the new directory experiment-langs. 2013-12-11T12:30:40+00:00
---
> "olema" and "redel" plural 2013-12-06T15:48:22+00:00
> updating dict templates from und, to include the proper mobile/non-mobile spellrelax, orig_lang and semantic tag removal. 2013-12-06T01:14:18+00:00
> updating from template: r84648, two dict analysers, one with mobile spellrelax, and one without. Also removing certain semantic tags and orig_lang tags which prevent POS from being the first tag, and messing with lookups for NDS 2013-12-06T00:36:34+00:00
> 8 nouns starting with "a" inflecting like "redel" 2013-12-05T10:59:22+00:00
> noun lexicon 'redel' 2013-12-04T11:55:37+00:00
> [Template merge - und] Adding possibility to first look for specific regex creation shell script before falling back to a default shell script. This will allow us to create more complex or tailored regexes for certain tag sets (like the semantic tags), while having a reasonable fallback for other cases. 2013-12-02T09:06:13+00:00
> [Template merge - und] Keeping intermediate files didn't work, created an error. Now it works. 2013-12-01T21:15:02+00:00
> [Template merge - und] Fixed a make warning, made generated regex files survive the build. 2013-12-01T09:44:14+00:00
> [Template merge - und] Further cleanup of semantic tag filtering: no processing of semantic filters in the shared makefiles. 2013-11-28T10:45:26+00:00
> [Template merge - und] Added rules to generate regexes automatically from the list of extracted tags. First out is the regex to make semantic tags optional, and another to remove them completely. Also fixed file references in the relabel targets. 2013-11-25T17:31:05+00:00
> docu phon update. 2013-11-25T15:34:58+00:00
> docu 2013-11-24T15:08:13+00:00
> [Template merge - und] Only build one file of tags, using hfst or xfst depending on the configuration. Extract semantic tags. 2013-11-24T13:26:35+00:00
> [Template merge - und] Reverted a change to hfst lexc compilation - the -f option doesn't work. 2013-11-23T10:46:08+00:00
> [Template merge - und] Moved tag extraction from tagsets to filters, as it has a more general use as the basis for dynamic filter construction. Tag extraction now works with both Xerox and Hfst, and handles both prefixed and suffixed tags. 2013-11-22T22:06:46+00:00
> [Template merge - und] Xerox will now stop on lexc syntax errors (done by replacing lexc with xfst - it was impossible to get lexc to stop; this is also how it is done in the old infrastructure). Hfst will not until (hfst_)foma is fixed, because foma doesn't stop on syntax errors. But one is better than none. 2013-11-22T13:10:38+00:00
> [Template merge - und] Removed one harmless but irritating warning. 2013-11-20T19:51:01+00:00
> [Template merge - und] Commented out weighting of the acceptor fst for the zhfst speller file - it causes a segfault in hfst-ospell. 2013-11-19T13:27:08+00:00
> [Template merge - und] Added a filter to remove dynamic derivation. 2013-11-18T09:57:40+00:00
> [Template merge - und] YES! Finally got weighted automata working in the speller. Added missing hfst tools, and sorted all the hfst tools alphabetically. Updated the required hfst to version 3.5.1. Weighted speller automata are now the default (change the weights and what is weighted as needed pr. language). Thanks to Krister Lindén for giving instructions on how to get this working. 2013-11-15T09:25:30+00:00
> Estonian numbers 2013-11-13T10:58:45+00:00
> jooma 2013-11-08T15:13:47+00:00
> Estonian numbers 2013-11-07T12:23:30+00:00
> numbers 1-9, cardinals and ordinals 2013-11-01T21:50:58+00:00
> numbers and dates, first practicing 2013-11-01T09:06:48+00:00
> [Template merge - und] Changed build files to support Hfst 3.5, requires 3.5. 2013-10-28T09:24:09+00:00
> [Template merge - und] Added LexSub string filter. 2013-10-23T15:14:45+00:00
> [Template merge - und] Changed voikko compression back to zip - gzip isn't voikko compatible. 2013-10-21T18:56:40+00:00
> [Template merge - und] * FINALLY fixed the automake 1.11 vs 1.13 test incompatibilities. Now we can allow version 1.11, and still get the pretty output we want in newer automakes. * Fixed references to GTCORE in test scripts. Earlier we relied solely on it being set in the environment, now we take it from configure (which can take it from the environment or from a script). 2013-10-21T14:28:09+00:00
> [Template merge - und] One more gzip option fix. 2013-10-21T08:24:32+00:00
> [Template merge - und] Fixed argument structure of gzip - zipping was broken for hfst and gramcheck. 2013-10-21T07:42:57+00:00
> [Template merge - und] Consistently use gzip instead of zip, and find gzip outside any conditionals. 2013-10-18T16:56:26+00:00
> [Template merge - und] Redirected command feedback of the analyser shell script to stderr, to avoid cluttering the analysed text in pipe use. 2013-10-18T13:22:58+00:00
> [Template merge - und] The first lookup shell script added, with supporting infrastructure. Part 2 - now with shell script and Makefile. It's possible to make again. 2013-10-17T16:11:08+00:00
> [Template merge - und] The first lookup shell script added, with supporting infrastructure. Part 1 - no actual shell script, no Makefile. Coming in the next commit. NB! Right now making and building will break, sorry for the inconvenience. 2013-10-17T15:51:01+00:00
> [Template merge - und] Added option to automatically create a language home dir environment variable. The idea is that by setting this variable, we can reliably find transducers in the working copy dirs of the users. The default is to not do anything (but give a warning). As part of this change, I switched the shell from sh to bash, as I don't know how portable the extra code is with respect to other shells. 2013-10-17T07:56:17+00:00
> [Template merge - und:] Changed back the Automake requirement to 1.11 - 1.12 is creating too much trouble. We'll have to see what to do with the test output - the version requirements change must be followed by another change that will substantially degrade test reports on newer automakes. 2013-10-16T17:19:36+00:00
> [Template merge - und:] Made the check for GTCORE functional, looking for both the gt-core.sh script (and using its output if found), and the environment variable $GTCORE. This means that there is no need anymore to set the GTCORE variable as long as one runs configure, make and make install in the gtcore directory. 2013-10-16T16:50:46+00:00
> [Template merge] Corrected bug/feedback e-mail address to one actually working. 2013-10-14T07:00:14+00:00
> [template merge] Made LexC compilation break on error, at least for Xerox (Hfst only gives a warning for the same error tested). 2013-10-11T16:38:43+00:00
> [Template merge] Moved the compilation of remove-illegal-derivation-strings.regex from all langs to only the three Sámi langs actually using it. Even though potentially useful for more languages, it can hardly be considered a language universal... 2013-10-11T14:50:41+00:00
> [template merge] More build rules for the grammar checker. Now it will install. 2013-10-09T09:52:01+00:00
> [template merge] Corrected the --enable-grammarchecker option testing. 2013-10-08T17:02:38+00:00
> [template merge] Changed the order of the configure macros, to allow for testing for program availability when checking the enable options. 2013-10-08T16:55:28+00:00
> [template merge] Forgot to add the new Makefile to configure.ac. 2013-10-08T15:33:52+00:00
> [template merge] Added basic build infrastructure for a CG-based grammar checker. No template source files added yet, as this is still pretty experimental. The grammar checker is disabled by default (naturally). 2013-10-08T14:19:19+00:00
> Template merge: Copy-paste error introduced scanning of a subdir test that doesn't exist for any language but SME. Now corrected. 2013-10-04T15:25:56+00:00
> Template merge: Reorganised the phonetic build code to better support parallel phonetic transcription depending on the source language of loan words and foreign names. 2013-10-04T14:31:06+00:00
> Added check for the availability of 'see' when testing, to avoid bad fails on systems without 'see'. 2013-10-04T08:37:39+00:00
> Template merge: Added config feedback about vislcg3/syntactic parsing status. Added config check for the see tool (SubEthaEdit). 2013-10-04T07:53:49+00:00
> Template merge: Remove copying of the timestamp file for non-maintainers. It breaks the automatic merge, and requires a revision-explicit merge for each such language. Also added removal of originating language tags - they are only used in TTS. 2013-10-04T07:05:34+00:00
> Added compilation of the remove-orig_lang-tags filter. Sorted the filter targets alphabetically within each logical block. Template merge. 2013-10-04T05:20:26+00:00
> Improved and corrected configure feedback for spellers. Template merge. 2013-10-03T13:27:58+00:00
> Template merge: * Now all speller fst's are turned off by default (I missed a few in the previous commit). The configure feedback is slightly improved. * Corrected syntax error in a test. Improved config feedback further. 2013-10-03T12:44:18+00:00
> Template merge: Changed the default setup to only include morphological analysis and generation. This is done to reduce the build time during regular development. This means that to build spellers and other specialised fst's, they must now be enabled using ./configure. Cf. bugzilla #1710: http://giellatekno.uit.no/bugzilla/show_bug.cgi?id=1710. 2013-10-03T10:16:39+00:00
> editing. 2013-09-28T18:38:15+00:00
> Corrected filter order for the text2X transcriptors. Template merge. 2013-09-20T14:00:35+00:00
> Completely redid the text2num etc transducers. The previous solution was in the wrong place, and didn't incorporate the actual filtering. Now it does, but whether this is the way it should be needs to be tested. Template merge. 2013-09-20T13:19:16+00:00
> Another Xerox error correction - we're using LexC, not Xfst. Skipped the result stack - not needed. Finally the basic compilation works. Template merge. 2013-09-20T12:20:17+00:00
> Corrected Xerox error. Template merge. 2013-09-20T09:20:59+00:00
> Added the inverse transcriptors, to go from text to numerical expressions. Template merge. 2013-09-20T09:07:54+00:00
> Wrapped phonetic / IPA conversion in a configure option, default is 'no'. Now compiling SME with Xerox should be back to normal speed again. Template merge. 2013-09-19T19:34:37+00:00
> docu 2013-09-06T22:29:24+00:00
> link to usage docu 2013-09-06T22:25:35+00:00
> Added Remove ACR filter. Template merge. 2013-09-06T13:18:25+00:00
> Added compilation of the filters for the orthographic tags, and added removal of them and the IPA strings in all regular fst's. Template merge. 2013-09-06T10:18:27+00:00
> Added missing hfst tool hfst-fst2strings to the M4 autoconf macros. Template merge. 2013-09-03T09:37:36+00:00
> no-content 2013-09-02T21:09:02+00:00
> Removed reference to a file without content. 2013-09-02T21:08:09+00:00
> Forgot to rename a variable after copy-paste. Template merge. 2013-08-29T07:34:06+00:00
> Reorganised the build code for dictionaries, added a dictionary option for configure (disabled by default), and added the new filter for mobile keyboard spellrelax. 2013-08-29T05:09:48+00:00
> Dokumentation 2013-08-28T20:48:50+00:00
> Several bug fixes for the apertium build targets. Now it seems to work correctly for both sme and sms, and thus hopefully all languages. Template merge. 2013-08-19T07:31:48+00:00
> [bugfix] hfst-substitute can't take lookup-optimised fst's as input. Template merge. 2013-08-18T09:32:37+00:00
> [bugfix] Removed a sma-specific filter that had crept in and stopped compilation. Added att output fst to the default apertium analyser target. Template merge. 2013-08-17T13:25:23+00:00
> Added support for building apertium transducers to all languages. It requires the use of a configure flag, i.e. it is disabled by default - --enable-apertium if you want to test. 2013-08-17T12:25:27+00:00
> Added remove-variant-string.regex, for removing strings containing +v2, +v3, +v4, +v5, but not removing +v1. (template merge). 2013-08-14T07:54:51+00:00
> Change echo to printf for cross-platform compatibility. Template update. 2013-08-12T15:03:43+00:00
> Improved error handling in testing shell scripts. Template update. 2013-08-12T08:41:50+00:00
> Merged last template changes (tagset updates by Fran). 2013-08-10T17:34:12+00:00
> Added more files to the documentation. 2013-07-13T19:29:48+00:00
> docu 2013-07-13T18:14:22+00:00
> Added and formatted documentation. 2013-07-09T10:12:05+00:00
> Added ref to twolc documentation file. 2013-07-09T10:10:16+00:00
> Documentation update: Added a file WhatIsThis, it shall contain a short explanation to outsider, as the name tells. 2013-07-09T10:08:36+00:00
> Renamed refs to template dir in preparation for support for multiple template dirs. Template update. 2013-06-29T20:44:20+00:00
> Commented out examples of error models for string and word pairs - they would in most cases add symbols to the error model not found in the acceptor, and this combination would crash the speller badly. Template update. 2013-06-25T08:17:53+00:00
> Cleaned up speller fst building, removing all unnecessary inverts and streamlining the code. Prepared for the introduction of weights, but commented out for now because of bugs or inefficiencies in openfst. Renamed the included hfst speller build file, to follow an emerging naming standard for the include files. 2013-06-13T14:05:40+00:00
> Added support for making variant analysers and generators using the Apertium tag conventions. The generated transducers are still not fully Apertium-compatible but they are a major step forward. Template update. 2013-06-12T13:44:33+00:00
> Renamed analyser-raw-gt-desc.hfst to generator-raw-gt-desc.hfst, to make the behavior in hfst-lookup explicit and clear. Still, the "generator" behaves as the Xerox "analyser" in hfst when it comes to composition and filtering. Confusing, I know. Template update. 2013-06-11T09:48:17+00:00
> Build the filter to remove CLB strings from speller transducers, and use it. Template update. 2013-06-10T13:40:31+00:00
> Added missing hfst tools. Removed commented-out code in the index.xml file. Template update. 2013-06-10T12:25:27+00:00
> Removed the ocr error model from the zhfst building, it causes libvoikko 3.4 to segfault. Template update. 2013-06-07T06:28:04+00:00
> Added an explicit copy operation into the hfst speller dir, to facilitate local modifications of the speller transducer before further processing, by just replacing the copy operation with whatever is needed. Template update. 2013-06-07T00:03:18+00:00
> Added string pairs and whole-word corrections to the speller error model. Added support for an ocr error model. Removed obsolete Voikko config file. Corrected bugs in the hfst M4 macros. Template update. 2013-06-06T23:22:21+00:00
> Moved the initial spell checker processing to the top spellchecker dir, to serve as the default starting point for all spell checkers. Template update. 2013-06-06T07:58:35+00:00
> Added a tagset directory in preparation for generating Apertium transducers automatically. Corrected and expanded a few M4 macros for the hfst tools. Template update. 2013-06-05T12:41:41+00:00
> Added support for testing analysers and generators only. For several of our more specialised transducers, this is more practical and useful than always generating both pairs of transducers to test both directions. 2013-05-09T09:05:51+00:00
> Corrected the existing oahpa transducer. Added dummy hfst oahpa target. Template update. 2013-05-07T00:27:03+00:00
> [bugfix] Corrected a bug in the hyphenator hfst build: fst's must be inverted in hfst. Template update. 2013-04-30T20:42:31+00:00
> [bugfix] Corrected another copy-paste error that broke speller fst's. Template update. 2013-04-27T07:36:43+00:00
> Split and renamed the remove-morph-border filter. Rewrote a number of targets to reflect this. There are now three filters instead of one, to allow for more flexible fst building for speech processing. 2013-04-26T12:49:21+00:00
> Added gzip compression of foma speller transducer, and proper checks for prerequisites. Foma spellers can now be disabled, they are enabled by default. Template update. 2013-04-24T11:21:36+00:00
> Corrected a bug when building foma-based spellers. Changed one fst filename to follow the naming scheme for the new infra. Improved building of the zfst speller file. 2013-04-24T07:02:32+00:00
> For some reason the und.timestamp file wasn't updated during a template merge earlier this week. Now done. 2013-04-19T11:32:09+00:00
> Added processing of new filters. Template update. 2013-04-18T10:55:34+00:00
> Do not try to build hfst-based tools if hfst building is not enabled. Template update. 2013-04-15T09:46:48+00:00
> [feature] Moved some of the fst-speller building one level up, and added support for building foma-based spellers. Template update. 2013-04-11T16:51:27+00:00
> Renamed phonetics source and target files to reflect the actual purpose. Template update. 2013-04-10T05:50:34+00:00
> Add possibility to build morph segmenting automaton. Template update. 2013-04-09T21:56:28+00:00
> Added a top-level misc/ dir to hold private / non-svn files needed during development of the language. All files are ignored. 2013-04-09T19:54:22+00:00
> [bugfix] Corrected hfst text2ipa fst: the final fst needs to be inverted before being used in lookup. Template update. 2013-04-08T06:58:53+00:00
> [bugfix] Corrected the homonymy and variant filters used for generators - those tags should be optional, not completely removed. Template update. 2013-04-05T12:35:06+00:00
> [infra] We require gawk specifically, not any awk whatsoever. Improved config feedback. Template update. 2013-04-05T10:16:54+00:00
> [bugfix-infra] Corrected reference to the built fst's. Template update. 2013-03-20T17:03:42+00:00
> Updated the zhfst building to reflect recent changes in Voikko. There is now official support for zhfst speller files, but with a new location and no *.pro file. Also added simple support for local loading of the zhfst file - voikkospell requires that the file is located within a dir named '3'. 2013-03-18T09:34:55+00:00
> Further improvements to the test run output. Template update. 2013-03-13T18:26:55+00:00
> More tweaks to make the test output compact and readable. Template update. 2013-03-13T16:27:15+00:00
> Moved Oahpa transducer compilation to a separate (included) file, and added support for compiling dictionary transducers, also in a separate include file. Template update. 2013-03-13T11:48:35+00:00
> We need the last part of the path to properly identify the lexc file tested. Template update. 2013-03-13T10:16:59+00:00
> Made the morph-tester test runner (LexC and YAML tests) less verbose. All messages are one-liners, except for FAILs. Commented the code. Template update. 2013-03-13T09:26:17+00:00
> More thorough cleaning in src/morphology/. Template update. 2013-03-12T08:03:23+00:00
> Moved the definitions of the transducer variables to the Makefile.am, to make it possible to extend them by local modifications. Template update. 2013-03-11T09:13:03+00:00
> Forgot to update the src/filter/Makefile.am file. Template update. 2013-03-07T15:04:55+00:00
> Split the filter 'remove-dictionary-tags' in two to remove homonymy and variant tags separately. Template update. 2013-03-07T14:47:53+00:00
> Added filter to remove NGminip strings, ie paths that should not be used for generating miniparadigms in dictionaries. Template update. 2013-03-07T11:11:14+00:00
> Added infrastructure for building fst's for list-based spellers. The actual building is not yet implemented. Template update. 2013-03-06T07:19:50+00:00
> compiled 2013-03-04T08:03:23+00:00
> Remove doc build dir when cleaning. Template update. 2013-02-27T07:12:03+00:00
> Deleted files in obsolete locations. Moved one file not previously moved. 2013-02-26T23:57:43+00:00
> Forgot to update the config file. Template update. 2013-02-26T23:45:39+00:00
> Reorganised the tools/ dir to fit better with coming development. Template update. 2013-02-26T23:24:11+00:00
> Second part of update to handle validation of generated documentation. Essentially whenever new documentation is created due to source files being changed, a forrest site is built. During that build process, any blocking issues with the generated jspwiki pages will be revealed, thus going a long way towards ensuring that such errors do not end up in svn and from there block the building of our public sites. 2013-02-21T12:57:26+00:00
> Added check for forrest as part of configuring the documentation extraction. Forrest will be used to validate the jspwiki documents during the build, to avoid that invalid documents enter the svn repository and corrupts the web page building. First step towards that goal. 2013-02-18T18:08:44+00:00
> Checking in generated docu files. Follow this one, and monitor whether it breaks the forrest build. 2013-02-17T14:54:36+00:00
> Checking in generated docu files. Follow this one, and monitor whether it breaks the forrest build. 2013-02-17T14:54:22+00:00
> Trying to make docu compile, cf. bug #1617. 2013-02-16T14:15:54+00:00
> Added build instructions for affix file documentation, cf. bug #1617 in Bugzilla, note that the files themselves are not checked in. 2013-02-16T14:15:20+00:00
> Upped the required automake version from 1.11 to 1.12, to avoid all hassles with the test harnesses and backwards compatibility. Template update. 2013-02-14T15:45:55+00:00
> These files should have been removed in the earlier commits regarding changes to the test bench, but were lost during the template merge earlier today, and not noticed until now. Finally deleted. 2013-02-14T15:26:19+00:00
> Even more portable testing... 2013-02-14T15:14:52+00:00
> Even more portable testing... 2013-02-14T10:44:31+00:00
> Improved portability & correctness of conditional tests in the morphology testing. Template merge. 2013-02-14T09:13:00+00:00
> Major update to the LexC testing. Now test data directly in the LexC code is supported by the python test script morph-tester.py (it reads the lexc files directly), which solves the bugs with multiple wordforms for the same morphosyntactic inflection. It is also a bit faster than the awk solution, and allows an unlimited number of different transducers to be tested dynamically directly in the lexc code. 2013-02-13T18:24:04+00:00
> Two more source files copied from gt/sme/src/. Template update. 2013-02-11T14:10:37+00:00
> Updated and harmonised documentation files. 2013-01-27T08:47:50+00:00
> The test case for Estonian verb "olema" works now 2013-01-24T21:33:49+00:00
> Corrected fst reference in the affixes/nouns.lexc test data. 2013-01-24T10:36:04+00:00
> Added yaml test data to the affixes/nouns.lexc file. Still doesn't produce any functional test though, looks like a bug in the awk script. 2013-01-24T10:21:22+00:00
> Fixed a syntax error in the yaml file - word forms identical with yaml (or python?) function words need to be written within quotes, to ensure they are read as strings, and not some function word. 2013-01-24T10:08:46+00:00
> Inserted whitespace to ease debugging. 2013-01-24T09:52:47+00:00
> generated docu 2013-01-24T08:07:17+00:00
> Finally found out how to get the old test behaviour back. We want the serial tests, because it gives direct feedback to the linguists. Automake 1.13 uses parallel testing by default, which logs all test results to files. 2013-01-23T23:23:07+00:00
> Added link to the generated documentation files. 2013-01-23T22:12:58+00:00
> Added support for processing twolc files for documentation extraction. 2013-01-23T21:41:04+00:00
> Some files may contain digits in their filename. Extended the filename match pattern for the Links target. Template update. 2013-01-23T21:02:34+00:00
> Added support for automatically building a file with links to each individual jspwiki file generated based. Template update. 2013-01-23T20:24:41+00:00
> Aajege has only been involved with the SMA source... 2013-01-23T16:14:04+00:00
> Forgot to add the jspwiki preamble file. Now added. 2013-01-23T15:11:44+00:00
> Moving paradigm files to test/data 2013-01-23T14:01:11+00:00
> izh in wrong cat 2013-01-23T14:00:23+00:00
> Documentation checked in, check (sic.) 2013-01-23T13:59:53+00:00
> Forgot to add support for the conditional CAN_DOCC in the previous commit. Template update. 2013-01-22T17:38:39+00:00
> Added final newline. 2013-01-22T17:22:15+00:00
> * Added initial support for extracting documentation from comments in the source code. Only jspwiki supported initially. * Also added initial support for extracting test data from source code comments. Only yaml tests in lexc is supported initially. 2013-01-22T16:48:30+00:00
> added 'ning' and 'et' to conjunctions 2013-01-22T15:44:58+00:00
> Added test case for "olema" 2013-01-22T14:00:36+00:00
> Adjective test for Estonian works now 2013-01-18T13:22:58+00:00
> A comparative works for Estonian (punane) 2013-01-18T11:09:24+00:00
> A-punane_gt-norm.yaml full paradigm (w.o derivates) 2013-01-18T10:17:04+00:00
> added A-punane_gt-norm.yaml (adjective test) 2013-01-17T16:54:16+00:00
> added N-hobune_gt-norm.yaml test 2013-01-17T13:19:09+00:00
> restored lost changes to estonian stems/nouns.lexc, adjectives.lexc, verbs.lexc 2013-01-17T12:57:43+00:00
> restored lexc files 2013-01-17T12:33:47+00:00
> These are merely generated files. 2013-01-17T09:16:02+00:00
> new stems: N: rebane, inimene, naine V: jooma, lööma, looma, tooma A: sinine, kollane 2013-01-16T17:38:34+00:00
> V-sooma and N-hobune tests (full paradigm) works now 2013-01-16T14:52:02+00:00
> The file had been committed with an unresolved conflict. 2013-01-14T08:30:08+00:00
> Removed cyrillic acronyms. 2013-01-13T14:05:27+00:00
> A solution for vowel raising sööma:süüa has been made. The solution does not allow _sööma:sööa_. The solution is to provide a rule that describes %^RVws:0 <=> [ ö:ü | o:u ] _ %>: a ; 2013-01-13T09:50:17+00:00
> sööma paradigm works now added e together with a to vowel change to ü 2013-01-11T20:42:40+00:00
> Ind, Impr, Cond tests run ok 2013-01-11T18:37:20+00:00
> verb 'sööma' full paradigm, test case 2013-01-11T17:09:11+00:00
> The final fix to get the XML-to-LexC conversion working on Cygwin. Template update. 2013-01-11T14:35:43+00:00
> Verb sööma test suite, full paradigm according to Eesti keele käsiraamat http://www.eki.ee/books/ekk09 2013-01-11T14:32:13+00:00
> Concatenate all LexC source files into one file explicitly, instead of letting hfst-lexc do it. This is more robust cross-platform, and makes the file used for transducer compilation easily available for debugging. Template update. 2013-01-11T12:47:29+00:00
> Corrected the host detection test for Cygwin. Template update. 2013-01-11T09:51:07+00:00
> Added empty line at the end. 2013-01-11T09:35:17+00:00
> Template updates: * Added support for XSL conversion of XML source files on Cygwin. * Made spell-relax a language-specific file. 2013-01-11T09:24:52+00:00
> Added empty line at the end. 2013-01-11T08:17:59+00:00
> Added files to use for paradigm generation, modeled on sma (for Baltic-Finnic languages) and myv (for the rest). 2013-01-11T06:01:23+00:00
> Rueter's suggestions have been commented out of the yaml test file. 2013-01-10T21:44:34+00:00
> Several Uralic languages have what has come to be known as the _connegative_ form of the verb. The verb _sööma_ has the _+ConNeg_ form _söö_, which is part of the negative syntagma _ei söö_ +V+Ind+Prs+ConNeg; the reading +V+Ind+Prt+ConNeg is _söönud_. In Estonian this might readily be debated, however. 2013-01-10T21:35:44+00:00
> We have had problems using _0_ epsilon before. If possible, we will avoid it. It is, however, useful when using multiple flag diacritics. 2013-01-10T21:29:52+00:00
> sööma full paradigms work, but they may contain a hack (pöörded and pöörded-ita) 2013-01-10T15:16:23+00:00
> The nouns _koer_ and _hobune_ both have continuation lexicons that allow for +N+Pl+Par. 2013-01-10T12:25:40+00:00
> Vowel loss before morpheme boundary followed by i. The ArchiVowels %^Pli and %^Prti have been added. 2013-01-10T12:24:34+00:00
> Vowel loss before morpheme boundary followed by i. The ArchiVowels %^Pli and %^Prti have been added. 2013-01-10T12:24:07+00:00
> Estonian 'sööma' passes tests (all with 'söö->') V-sooma_gt-norm.yaml near full paradigm 2013-01-10T10:51:44+00:00
> Made Voikko support optional instead of required. Template update. 2013-01-10T10:38:26+00:00
> Rewrote LexC and TwolC Xerox rules to make them work on Cygwin: the Windows Xerox tools need a script file as input; the scripts can't be piped in as on *nix systems. Removed the hack in the previous commit. The bug can be worked around by avoiding linebreaks in the piped script. 2013-01-10T09:40:39+00:00
> sööma test up to Verb main imper present second singular active affirmative; added 4th person as impersonal 2013-01-09T14:57:24+00:00
> moving here. 2013-01-09T11:44:55+00:00
> _%>_ and _#_ are not removed here. There should be no transformation to _0_. 2013-01-09T10:44:28+00:00
> Added some verbs to "sööma" 2013-01-09T09:15:50+00:00
> The rule on vowel raising has been generalized to affect both öö:üü and oo:uu. 2013-01-09T09:05:09+00:00
> A new verb has been added. 2013-01-09T09:03:58+00:00
> The infinitive tags have been corrected so that _ma_ is marked +Sup and _Da_ is marked +Inf. 2013-01-09T09:03:14+00:00
> Whitespace changes for easier readability. 2013-01-08T17:30:24+00:00
> Added empty line at the end of the file. 2013-01-08T17:28:12+00:00
> test V-sooma full paradigm; koer in nouns (stem and affixes) 2013-01-08T10:12:06+00:00
> Added hack to work around a very strange bug in LexC transducer saving - the filename is slightly garbled if the save command is passed in from a script generated by a make file (but the same command passed in from a manually typed script works correctly). This hack is required for the new infra to work on the virtual Linux machines gtlab, gtoahpa, at least. The hack should be removed as soon as we have a correctly working LexC (the broken LexC is the newest one). 2013-01-08T10:06:02+00:00
> An indicative present Sg1 form has been added for the Estonian verb _sööma_. 2013-01-07T16:36:38+00:00
> Testing, yks-kaks 2013-01-07T13:18:01+00:00
> Now the twolc only has rules applying to Estonian. The xml has been introduced. _lexc_ files are generated in the est/src/morphology/stems directory. 2013-01-06T18:11:48+00:00
> More robust Saxon/Java setup: no need to define CLASSPATH. The M4 macros will look for a couple of predefined pathnames, and pick the first saxon9he.jar file they find. More locations should be added as needed. Also corrected the logic for reporting whether xslt transformation could be enabled or not, and added a warning if xml source files are found but no xml transformations could be enabled. 2013-01-04T17:18:53+00:00
> Better sma than fao, but both of course improper. 2013-01-03T17:54:53+00:00
> Require at least HFST 3.4 - it includes all backends, and simplifies dependency handling quite a bit. Template update. 2013-01-03T16:32:23+00:00
> Adding the silent lexc files. 2013-01-03T15:54:00+00:00
> Adding src/syntax/disambiguagion.cg3 2013-01-03T15:38:37+00:00
> Clock, Date and Numbers. 2013-01-03T15:24:42+00:00
> Errors: missing contlex etc. have been corrected. 2013-01-03T14:58:39+00:00
> Adding more files, lexc, twolc. 2013-01-03T13:52:37+00:00
> The licence. 2013-01-03T13:50:23+00:00
> The Estonian language has been initialized on the basis of Ingrian. There is at least one word representing each part of speech. 2013-01-03T07:16:36+00:00