Forms processing captures the information entered into the fields of a document form and converts it into an electronic format. Form fields are typically organized into tabular structures, where line segments separate the individual fields. Once the exact position of a form field in the image is known (by locating the neighbouring line segments), OCR can be run to capture the information inside the field automatically.
Using the provided dataset of 200 input images, your task is to:
- Process the input images to enhance and amplify the image elements that belong to line segments
- Classify image pixels into those that belong to line segments and those that belong to the image background
- Group the image pixels that belong to a single line segment
- Compute all line segments in the input image in parametric form (e.g., Line Segment 1: Start Point(34, 14), End Point(128, 92) = [34, 14, 128, 92])
- Use the provided MATLAB app to visually compare the obtained line segments with the input image.
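The classify, group, and compute steps can be illustrated with a deliberately simplified sketch. It is written in Python rather than MATLAB, handles only horizontal runs of dark pixels, and is not the expected solution. The function name `detect_horizontal_segments`, the `threshold` and `min_length` parameters, the dark-lines-on-light-background convention, and the (x, y) coordinate order are all assumptions made for illustration.

```python
def detect_horizontal_segments(image, threshold=128, min_length=3):
    """Return [x1, y1, x2, y2] for each horizontal run of dark pixels.

    `image` is a list of rows of grayscale values (0 = black, 255 = white).
    Assumption: line pixels are darker than `threshold`.
    """
    segments = []
    for y, row in enumerate(image):
        start = None  # x position where the current run of line pixels began
        for x, value in enumerate(row):
            is_line = value < threshold  # classify: line pixel vs. background
            if is_line and start is None:
                start = x  # a new run begins
            elif not is_line and start is not None:
                # the run ended at x - 1; keep it only if it is long enough
                if x - start >= min_length:
                    segments.append([start, y, x - 1, y])
                start = None
        # close a run that reaches the right image border
        if start is not None and len(row) - start >= min_length:
            segments.append([start, y, len(row) - 1, y])
    return segments


# Tiny hypothetical 6x4 image: one long run in row 1, a short and a long run in row 3.
image = [
    [255, 255, 255, 255, 255, 255],
    [255,   0,   0,   0,   0, 255],
    [255, 255, 255, 255, 255, 255],
    [  0,   0,   0, 255,   0, 255],
]
segments = detect_horizontal_segments(image)
print(segments)
```

A real solution would also need to handle vertical and slanted lines, noise, and broken strokes; the sketch only shows how per-pixel classification and grouping lead to the `[x1, y1, x2, y2]` output format.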
A very fast method is required: the running time must stay under 40 ms per image on a typical PC.
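One way to check a candidate against the 40 ms budget is to average the running time over several calls. The sketch below is in Python and uses a hypothetical placeholder detector; the helper name `mean_runtime_ms`, the repeat count, and the 64x64 test image are assumptions, and the same idea carries over to MATLAB timing tools.

```python
import time


def mean_runtime_ms(detector, image, repeats=20):
    """Average wall-clock time of detector(image) in milliseconds."""
    detector(image)  # warm-up call, excluded from the measurement
    start = time.perf_counter()
    for _ in range(repeats):
        detector(image)
    return (time.perf_counter() - start) * 1000.0 / repeats


def placeholder_detector(image):
    """Hypothetical stand-in for a real detector; returns no segments."""
    return []


blank = [[255] * 64 for _ in range(64)]  # hypothetical blank test image
elapsed_ms = mean_runtime_ms(placeholder_detector, blank)
within_budget = elapsed_ms < 40.0  # budget taken from the task description
print(f"{elapsed_ms:.4f} ms per image, within budget: {within_budget}")
```

Measured times depend on the machine and on image size, so the check is most meaningful when run on the actual dataset images.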
A dataset of 200 images is provided for algorithm evaluation. Five samples are manually labeled and can be used to measure how well the method behaves in real-life situations. A sample MATLAB application is provided for visual inspection of the unlabeled samples.
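The task does not prescribe an evaluation metric for the labeled samples. One possible approach, sketched here in Python, counts a detected segment as correct when both of its endpoints lie within a pixel tolerance of a labeled segment's endpoints, in either order, and then reports precision and recall. The function names, the tolerance of 3 pixels, and the example segments are assumptions.

```python
import math


def endpoints_match(a, b, tol=3.0):
    """True if segments a and b ([x1, y1, x2, y2]) share endpoints within tol pixels."""
    direct = max(math.hypot(a[0] - b[0], a[1] - b[1]),
                 math.hypot(a[2] - b[2], a[3] - b[3]))
    # the same segment may be reported with start and end points swapped
    swapped = max(math.hypot(a[0] - b[2], a[1] - b[3]),
                  math.hypot(a[2] - b[0], a[3] - b[1]))
    return min(direct, swapped) <= tol


def precision_recall(detected, labeled, tol=3.0):
    """Greedy one-to-one matching of detected segments to labeled segments."""
    unmatched = list(labeled)
    hits = 0
    for seg in detected:
        for ref in unmatched:
            if endpoints_match(seg, ref, tol):
                unmatched.remove(ref)  # each labeled segment is matched at most once
                hits += 1
                break
    precision = hits / len(detected) if detected else 0.0
    recall = hits / len(labeled) if labeled else 0.0
    return precision, recall


# Hypothetical example: one detection matches (with swapped endpoints), one does not.
labeled = [[34, 14, 128, 92], [10, 10, 10, 80]]
detected = [[128, 91, 35, 14], [200, 5, 240, 5]]
precision, recall = precision_recall(detected, labeled)
print(precision, recall)
```

Endpoint matching is strict for fragmented detections; an overlap-based measure may be more forgiving if a long labeled line is returned as several shorter pieces.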
- http://www.ipol.im/pub/art/2012/gjmr-lsd/
- http://www.eecs.berkeley.edu/~yang/software/line_detector.tar.gz
Contains a dummy implementation of the line segment detection method. You should replace it with your own implementation.
Script used for visual testing of the lineSegmentDetect function. The script loads dataset.mat, shows each input image in the dataset, and draws the line segments returned by the lineSegmentDetect function.
Draws input line segments on the image.
Used for labeling images. Should you require exact manual labeling of the images, you can use this script.
Returns an array of image filenames in a specified folder.
Contact us at jurica.cerovec@photopay.net and boris.trubic@photopay.net with any inquiries or questions.