Skip to content

A high-level intermediate representation of regex in Python

License

Notifications You must be signed in to change notification settings

DexterHill0/regex_hir

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

regex_hir

A high-level intermediate representation of regex in Python.

PyPi | Github Source

Installation

Requires Python >=3.10. Also requires the unicategories library for easy access to categorised Unicode characters.

Description

This library constructs an intermediate representation of the regex AST created by the built-in re module. This functions similary to the Rust regex_syntax crate, which was completely the inspiration for this module.
All of the syntax supported by re is supported by this module.

Usage

import regex_hir

hir = regex_hir.hir(r"[abc]")
hir.dumps()
# CharacterClass(
#     [
#         CharacterRange(start=97, end=97)
#         CharacterRange(start=98, end=98)
#         CharacterRange(start=99, end=99)
#     ]
#     negate=False
#     ignore_case=False
# )

About

A high-level intermediate representation of regex in Python

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages