- Create Random Data based on EBNF Syntax description
- Validate EBNF definition (simpleparse's part)
- Validate some data against EBNF (simpleparse's part)
EBNF-Parser: simpleparse (EBNF-Grammars: http://simpleparse.sourceforge.net/simpleparse_grammars.html)
Our simple INI-file looks like this:
[MyFirstSection]
myKey ="myValue9_test-test2"
myKey2 ="myValue9_test-test2"
myKey3 ="myValue9_test-test2"
myKey4 ="myValue9_test-test2"
[MySecond-Section]
myKey11 ="myValuexx33_test-test2"
EBNF Declaration for a simple INI-File-Syntax:
file := (section, entry+, '\n\n')+
section := '[', wordspecial, ']','\n'
entry := word, '\t=', '"', wordspecial , '"', '\n'
alpha := [a-zA-Z]
alphanum := [a-zA-Z0-9]
alphanumspecial := [a-zA-Z0-9_-]
word := alpha,alphanum*
wordspecial := alpha,alphanumspecial*
Description:
'file' ... our root definition (aka production) which describes the overall file syntax
'section' ... Ini-file section header
'entry' ... one key=value entry
'alpha' ... alpha-chars only
'alphanum' ... alphanumerical chars
'alphanumspecial'... alphanumerical and _- chars
'word' ... begins with alpha and continues with none or more alphanums
'wordspecial' ... begins with alpha and continues with none or more alphanumspecial chars
EBNFSpill will now take any EBNF defintion that is valid and will try to create random data that matches the described syntax. For the above EBNF declaration this would be random data like this:
Code:
declaration = """
file := (section, entry+, '\n\n')+
section := '[', wordspecial, ']','\n'
entry := word, '\t=', '"', wordspecial , '"', '\n'
alpha := [a-zA-Z]
alphanum := [a-zA-Z0-9]
alphanumspecial := [a-zA-Z0-9_-]
word := alpha,alphanum*
wordspecial := alpha,alphanumspecial*
"""
from EBNFSpill import EBNFSpill
s = EBNFSpill()
# process declaration (feeds simpleparse with declaration and defines 'file' as entrypoint)
s.setDeclaration(declaration,production="file")
# lets generate some random data
print b.generate()
Output:
[JaW-I]
MEk ="RIO_ZC"
KG1HWR0BcYATBF0CpgS5h ="IzOI5ECekj"
c8bDI3ozx3LnwnjpM2lz8 ="go8iJsKkJ44JI"
shX3KIJLQkvNEEq1Ja ="E3usLXmAEifbDWOt"
[y7aM2A1QIpcThzPRc0ryFU]
Ng7fMShX4 ="Xq"
d43i42sbool3BnI24e ="t1d5GHD"
[EJ4ll]
NuCG ="J0WrZdTnheSQq"
Rm6STRGxllBxlf9fLlEmEtwbgH ="cU82WpNkrHR0CqseBltmGOETdf"
LSadRHDXJ0 ="G0Ir0FufaQJdSg9F"
R5T02vYsf ="JaJaPIUPh0zSSYSM4wfA8pjOq"
r6zhJQ7K ="GdYp9hMJjA8"
XMwdm ="wMjUQ0ADRjkT7MPV5zG"
[PXj0N0hrfu9e6dKM9-ujmEHuoU]
d1QFqfbTuqHnVQbK ="wwQ0-sykooiJp201HCwm73fD3"
fQuzvHP2pVwvS8G7UB6s ="PjofofSqLK4Nv25baoAi_RR7D"
jtrOtloyXUHJUKe ="k8P3WxPBRbwPrrh0"
uAuSDmzSVkSHNp0 ="MBLM6Lw"
zEAwWEmfNpt ="ZkecJcD"
LcOuVkhLGJkCQYbI ="gnj_n8_FE"
I7TjmZC6 ="ci1YzwboZz"
[IMw_fjT1jv]
cIXGM8TQuM29aWnXi9aYK ="UwL_q7RC0JKv7lkFPY"
mmNGyBCO6qTF2yKvelBki77Wz ="b1UZDdc7vT"
TPDcJMTv6mMuOKAoNS1xLf ="TSvbaGyW7K5nV"
aA6qLkQC08 ="H3UBl852kRFXlsmoqoX3nO4eGZ"
z7xPM9YLNOro6FKvc1QqkL6zl ="VwiMf6EdCdK6cG7MtCLdof9h"
EBNFSpill Example #2 - show prettyprint of AST-Table, Step-by-Step walk the AST generated by Simpleparse
Code:
declaration = """
file := (section, entry+, '\n\n')+
section := '[', wordspecial, ']','\n'
entry := word, '\t=', '"', wordspecial , '"', '\n'
alpha := [a-zA-Z]
alphanum := [a-zA-Z0-9]
alphanumspecial := [a-zA-Z0-9_-]
word := alpha,alphanum*
wordspecial := alpha,alphanumspecial*
"""
from EBNFSpill import EBNFSpill
s = EBNFSpill()
# process declaration (feeds simpleparse with declaration and defines 'file' as entrypoint)
s.setDeclaration(declaration,production="file")
# lets generate some random data
x= 0
for i in s.walk():
x+=1
# print <number>, <uid>, <human_readable_AST_element>
print x,id(i),s.process(i)
Output:
1 36385136 (None, 'MATCH_SUBTABLE', (('section', 203, ((None, 21, '['), ('wordspecial', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanumspecial', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_-', 2, 1), (None, 101, 1, -1, 1))), (None, 21, ']'), (None, 21, '\n'))), ('entry', 203, (('word', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanum', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789', 2, 1), (None, 101, 1, -1, 1))), (None, 21, '\t='), (None, 21, '"'), ('wordspecial', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanumspecial', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_-', 2, 1), (None, 101, 1, -1, 1))), (None, 21, '"'), (None, 21, '\n'))), ('entry', 203, (('word', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanum', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789', 2, 1), (None, 101, 1, -1, 1))), (None, 21, '\t='), (None, 21, '"'), ('wordspecial', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanumspecial', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_-', 2, 1), (None, 101, 1, -1, 1))), (None, 21, '"'), (None, 21, '\n')), 2, 1), (None, 101, 1, -1, 1), (None, 21, '\n\n')))
2 36743928 ('section', 'MATCH_TABLE', ((None, 21, '['), ('wordspecial', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanumspecial', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_-', 2, 1), (None, 101, 1, -1, 1))), (None, 21, ']'), (None, 21, '\n')))
3 36325576 (None, 'MATCH_WORD', '[')
4 36742248 ('wordspecial', 'MATCH_TABLE', (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanumspecial', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_-', 2, 1), (None, 101, 1, -1, 1)))
5 36742008 ('alpha', 'MATCH_ISIN', 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ')
6 36746368 ('alphanumspecial', 'MATCH_ISIN', 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_-', 2, 1)
7 36746416 (None, 'MATCH_EOF', 1, -1, 1)
8 36742488 (None, 'MATCH_WORD', ']')
9 36742848 (None, 'MATCH_WORD', '\n')
10 36384896 ('entry', 'MATCH_TABLE', (('word', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanum', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789', 2, 1), (None, 101, 1, -1, 1))), (None, 21, '\t='), (None, 21, '"'), ('wordspecial', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanumspecial', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_-', 2, 1), (None, 101, 1, -1, 1))), (None, 21, '"'), (None, 21, '\n')))
11 36326336 ('word', 'MATCH_TABLE', (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanum', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789', 2, 1), (None, 101, 1, -1, 1)))
12 36326216 ('alpha', 'MATCH_ISIN', 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ')
13 36746464 ('alphanum', 'MATCH_ISIN', 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789', 2, 1)
14 36746512 (None, 'MATCH_EOF', 1, -1, 1)
15 36326776 (None, 'MATCH_WORD', '\t=')
16 36326416 (None, 'MATCH_WORD', '"')
17 36326136 ('wordspecial', 'MATCH_TABLE', (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanumspecial', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_-', 2, 1), (None, 101, 1, -1, 1)))
18 36292208 ('[RECURSION of Node=36741928]', 'MATCH_RECURSION', 36741928)
19 36385336 (None, 'MATCH_WORD', '"')
20 36385096 (None, 'MATCH_WORD', '\n')
21 36746560 ('entry', 'MATCH_TABLE', (('word', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanum', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789', 2, 1), (None, 101, 1, -1, 1))), (None, 21, '\t='), (None, 21, '"'), ('wordspecial', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanumspecial', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_-', 2, 1), (None, 101, 1, -1, 1))), (None, 21, '"'), (None, 21, '\n')), 2, 1)
22 36292208 ('[RECURSION of Node=36304296]', 'MATCH_RECURSION', 36304296)
23 36746608 (None, 'MATCH_EOF', 1, -1, 1)
24 36384936 (None, 'MATCH_WORD', '\n\n')
25 36746704 (None, 'MATCH_SUBTABLE', (('section', 203, ((None, 21, '['), ('wordspecial', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanumspecial', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_-', 2, 1), (None, 101, 1, -1, 1))), (None, 21, ']'), (None, 21, '\n'))), ('entry', 203, (('word', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanum', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789', 2, 1), (None, 101, 1, -1, 1))), (None, 21, '\t='), (None, 21, '"'), ('wordspecial', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanumspecial', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_-', 2, 1), (None, 101, 1, -1, 1))), (None, 21, '"'), (None, 21, '\n'))), ('entry', 203, (('word', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanum', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789', 2, 1), (None, 101, 1, -1, 1))), (None, 21, '\t='), (None, 21, '"'), ('wordspecial', 203, (('alpha', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'), ('alphanumspecial', 14, 'abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_-', 2, 1), (None, 101, 1, -1, 1))), (None, 21, '"'), (None, 21, '\n')), 2, 1), (None, 101, 1, -1, 1), (None, 21, '\n\n')), 2, 1)
26 36292208 ('[RECURSION of Node=36746656]', 'MATCH_RECURSION', 36746656)
27 36746752 (None, 'MATCH_EOF', 1, -1, 1)