OCR: Add IAM corpus with unk decoding support #6

aarora8 · 2017-11-15T18:49:20Z

No description provided.

hhadian · 2017-11-16T15:03:36Z

Ashish, could you please rebase against the ocr branch?
It's showing all the files that are already there. Also it has conflicts.
Please update the headers for the new recipes you add.

hhadian · 2017-11-16T15:46:37Z

egs/iam/s5/local/chain/run_cnn_chainali_1a.sh

+
+  num_targets=$(tree-info $tree_dir/tree | grep num-pdfs | awk '{print $2}')
+  learning_rate_factor=$(echo "print 0.5/$xent_regularize" | python)
+  common1="required-time-offsets= height-offsets=-2,-1,0,1,2 num-filters-out=36"


you can remove required-time-offsets= altogether

aarora8 · 2017-11-16T18:40:33Z

Thanks, rebased it against ocr branch. updated headers in the new recipes.

hhadian

Some notes about the headers

hhadian · 2017-11-19T15:54:31Z

egs/iam/s5/local/chain/run_cnn_1a.sh

@@ -29,8 +29,8 @@ alignment_subsampling_factor=1
 chunk_width=340,300,200,100


Please update the results for this recipe if it's not already updated

hhadian · 2017-11-19T16:00:21Z

egs/iam/s5/local/chain/run_cnn_chainali_1a.sh

@@ -33,8 +33,8 @@ alignment_subsampling_factor=1
 chunk_width=340,300,200,100


Also for this recipe.
Change the description to "chainali_1a is as 1a except it uses chain alignments (using 1a system) instead of gmm alignments" and then append the output (and the command itself) of compare_wer.sh for 1a and chainali_1a (after 1 blank line)

hhadian · 2017-11-19T16:03:03Z

egs/iam/s5/local/chain/run_cnn_chainali_1b.sh

@@ -0,0 +1,235 @@
+#!/bin/bash
+
+# chainali_1b uses chain model for lattice instead of gmm-hmm model. It has more cnn layers as compared to 1a


change this to "chainali_1b is as chainali_1a except it has 3 more cnn layers."
Then append the compare_wer.sh output (with the command) after adding a blank line

hhadian · 2017-11-19T16:04:28Z

egs/iam/s5/local/chain/run_cnn_chainali_1c.sh

@@ -0,0 +1,226 @@
+#!/bin/bash


please remove this and 1d for now. The improvements are not significant.

aarora8 · 2017-11-20T15:12:22Z

sorry. updated headers for run_cnn_1a.sh, run_cnn_chainali_1a.sh, run_cnn_chainali_1b.sh. removed run_cnn_chainali_1c.sh , run_cnn_chainali_1d.sh.

hhadian · 2017-11-20T18:31:47Z

Thanks. Merging...

* OCR: Add IAM corpus with unk decoding support (#3) * Add a new English OCR database 'UW3' * Some minor fixes re IAM corpus * Fix an issue in IAM chain recipes + add a new recipe (#6) * Some fixes based on the pull request review * Various fixes + cleaning on IAM * Fix LM estimation and add extended dictionary + other minor fixes * Add README for IAM * Add output filter for scoring * Fix a bug RE switch to pyhton3 * Add updated results + minor fixes * Remove unk decoding -- gives almost no gain * Add UW3 OCR database * Fix cmd.sh in IAM + fix usages of train/decode_cmd in chain recipes * Various minor fixes on UW3 * Rename iam/s5 to iam/v1 * Add README file for UW3 * Various cosmetic fixes on UW3 scripts * Minor fixes in IAM

hhadian reviewed Nov 16, 2017

View reviewed changes

aarora8 force-pushed the iam branch 2 times, most recently from 6a93702 to f8eb4fd Compare November 16, 2017 18:39

hhadian requested changes Nov 19, 2017

View reviewed changes

modifications cnn-tdnn architecture for improving wer

922f66f

aarora8 force-pushed the iam branch from 8195205 to 922f66f Compare November 20, 2017 15:20

hhadian approved these changes Nov 20, 2017

View reviewed changes

hhadian merged commit aa7c19a into hhadian:ocr Nov 20, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OCR: Add IAM corpus with unk decoding support #6

OCR: Add IAM corpus with unk decoding support #6

aarora8 commented Nov 15, 2017

hhadian commented Nov 16, 2017

hhadian Nov 16, 2017

aarora8 commented Nov 16, 2017

hhadian left a comment

hhadian Nov 19, 2017 •

edited

Loading

hhadian Nov 19, 2017

hhadian Nov 19, 2017

hhadian Nov 19, 2017

aarora8 commented Nov 20, 2017

hhadian commented Nov 20, 2017

		@@ -29,8 +29,8 @@ alignment_subsampling_factor=1
		chunk_width=340,300,200,100

		@@ -33,8 +33,8 @@ alignment_subsampling_factor=1
		chunk_width=340,300,200,100

		@@ -0,0 +1,235 @@
		#!/bin/bash

		# chainali_1b uses chain model for lattice instead of gmm-hmm model. It has more cnn layers as compared to 1a

OCR: Add IAM corpus with unk decoding support #6

OCR: Add IAM corpus with unk decoding support #6

Conversation

aarora8 commented Nov 15, 2017

hhadian commented Nov 16, 2017

hhadian Nov 16, 2017

Choose a reason for hiding this comment

aarora8 commented Nov 16, 2017

hhadian left a comment

Choose a reason for hiding this comment

hhadian Nov 19, 2017 • edited Loading

Choose a reason for hiding this comment

hhadian Nov 19, 2017

Choose a reason for hiding this comment

hhadian Nov 19, 2017

Choose a reason for hiding this comment

hhadian Nov 19, 2017

Choose a reason for hiding this comment

aarora8 commented Nov 20, 2017

hhadian commented Nov 20, 2017

hhadian Nov 19, 2017 •

edited

Loading