Hun-Law

⚠️ WARNING
This project has been deprecated in favor of the Rust version, and is currently unmaintained.

Hun-Law

A small project for parsing Hungarian Law. It does the following thigs:

Parse PDF files into lines using pdfminer. It does so much more accurately than other pdf2txt implementations.
Parse "Magyar Közlöny" PDFs into individual Acts
Separate Acts into structural elements (Articles, subpoints, etc.)
Parse internal and external references in legal text
Parse special phrases like amendments and repeals into easy-to-use objects
Generate simple TXT, JSON and HTML version of the parsed documents

Usage

After cloning the repository, simply run ./generate_output.py:

./generate_output.py txt 2013/31
./generate_output.py json 2018/123 --output-dir /tmp/acts_as_json

Interesting Magyar Közlöny issues can be found in act_to_mk_issue.csv

To be able to actually use html output, you will have to copy or symlink the style.css:

./generate_output.py html 2014/91 2014/92 2014/93 --output-dir /var/www/hun_law
cp style.css /var/www/hun_law

Things planned:

Export into Akoma Ntoso format
Export into epub or mobi format

Contribution

Feel free to open issues for feature reqests or found bugs. Merge Requests are more than welcome too, as long as all tests and static analysis passes.

Name		Name	Last commit message	Last commit date
Latest commit History 436 Commits
.vim		.vim
hun_law		hun_law
tests		tests
.gitignore		.gitignore
.pep8		.pep8
.pylintrc		.pylintrc
LICENSE		LICENSE
README.md		README.md
act_to_mk_issue.csv		act_to_mk_issue.csv
create_venv.sh		create_venv.sh
fixup_editor.py		fixup_editor.py
generate_act_to_mk_issue.py		generate_act_to_mk_issue.py
generate_output.py		generate_output.py
mypy.ini		mypy.ini
requirements.txt		requirements.txt
run_static_analysis.sh		run_static_analysis.sh
run_tests.sh		run_tests.sh
style.css		style.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hun-Law

Usage

Things planned:

Contribution

About

Releases

Packages

Languages

License

badicsalex/hun_law_py

Folders and files

Latest commit

History

Repository files navigation

Hun-Law

Usage

Things planned:

Contribution

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages