StringToExpression allows you to create methods that take strings and outputs .NET expressions. It is highly configurable allowing you to define your own language with your own syntax.
Two languages are provided out of the box, an ArithmeticLanguage
for performing algebra and an ODataFilterLanguage
for parsing OData filter expressions.
A basic arithmetic language is provided. It can be used as is, or extended with customer function by extending ArithmeticLanguage
var language = new ArithmeticLanguage();
Expression<Func<decimal>> expressionFunction = language.Parse("(4 - 2) * 5 + 9 / 3");
Func<decimal> function = expressionFunction.compile();
Assert.Equal(13, function());
OData filtering is a nice way to pass generic filtering requirements into a WebAPI, although parsing the the filter expression can be cumbersome. StringToExpression can be used as a lightweight parser
public async Task<IHttpActionResult> GetDoohickies([FromUri(Name = "$filter")] string filter = "name eq 'discount' and rating gt 18")
{
var language = new ODataFilterLanguage()
Expression<Func<Doohicky, bool>> predicate = language.Parse<Doohickey>(filter);
//can either pass this expression into either IQueryable or IEnumerable where clauses
return await DataContext.Doohickies.Where(predicte).ToListAsync();
}
StringToExpression
has the advantage of being configurable; if the OData parser doesnt support methods you want, (or it supports methods you dont want) it is very easy to extend ODataFilterLanguage
and modify the configuration
Operators | Name | Example |
---|---|---|
eq | Equal | City eq 'Redmond' |
ne | Not equal | City ne 'London' |
gt | Greater than | Price gt 20 |
ge | Greater than or equal | Price ge 10 |
lt | Less than | Price lt 20 |
le | Less than or equal | Price le 100 |
and | Logical and | Price le 200 and Price gt 3.5 |
or | Logical or | Price le 3.5 or Price gt 200 |
not | Logical negation | not endswith(Description,'milk') |
add | Addition | Price add 5 gt 10 |
sub | Subtraction | Price sub 5 gt 10 |
mul | Multiplication | Price mul 2 gt 2000 |
div | Division | Price div 2 gt 4 |
mod | Modulo | Price mod 2 eq 0 |
( ) | Precedence grouping | (Price sub 5) gt 10 |
/ | Property access | Address/City eq 'Redmond' |
String Functions | Example |
---|---|
bool substringof(string po, string p1) | substringof('day', 'Monday') eq true |
bool endswith(string p0, string p1) | endswith('Monday', 'day') eq true |
bool startswith(string p0, string p1) | startswith('Monday', 'Mon') eq true |
int length(string p0) | length('Monday') eq 6 |
int indexof(string p0, string p1) | indexof('Monday', 'n') eq 2 |
string replace(string p0, string find, string replace) | replace('Monday', 'Mon', 'Satur') eq 'Saturday' |
string substring(string p0, int pos) | substring('Monday', 3) eq 'day' |
string substring(string p0, int pos, int length) | substring('Monday', 3, 2) eq 'da' |
string tolower(string p0) | tolower('Monday') eq 'monday' |
string toupper(string p0) | toupper('Monday') eq 'MONDAY' |
string trim(string p0) | trim(' Monday ') eq 'Monday' |
string concat(string p0, string p1) | concat('Mon', 'day') eq 'Monday' |
Date Functions | Example |
---|---|
int day(DateTime p0) | day(datetime'2000-01-02T03:04:05') eq 2 |
int hour(DateTime p0) | hour(datetime'2000-01-02T03:04:05') eq 3 |
int minute(DateTime p0) | minute(datetime'2000-01-02T03:04:05') eq 4 |
int month(DateTime p0) | month(datetime'2000-01-02T03:04:05') eq 1 |
int second(DateTime p0) | second(datetime'2000-01-02T03:04:05') eq 5 |
int year(DateTime p0) | year(datetime'2000-01-02T03:04:05') eq 2000 |
Math Functions | Example |
---|---|
double round(double p0) | round(10.4) eq 10 round(10.6) eq 11 round(10.5) eq 10 round(11.5) eq 12 |
double floor(double p0) | floor(10.6) eq 10 |
decimal floor(decimal p0) | month(datetime'2000-01-02T03:04:05') eq 1 |
double ceiling(double p0) | ceiling(10.4) eq 11 |
Languages are defined by a set of GrammerDefintions
. These define both how the string is broken up into tokens as well as the behaviour of each token. There are many subclasses of GrammerDefinition
that makes implementing standard language features very easy.
An example of a very simple arithmetic language is as follows
ListDelimiterDefinition delimeter;
BracketOpenDefinition openBracket, sqrt;
language = new Language(new [] {
new OperandDefinition(
name:"DECIMAL",
regex: @"\-?\d+(\.\d+)?",
expressionBuilder: x => Expression.Constant(decimal.Parse(x))),
new BinaryOperatorDefinition(
name:"ADD",
regex: @"\+",
orderOfPrecedence: 2,
expressionBuilder: (left,right) => Expression.Add(left, right)),
new BinaryOperatorDefinition(
name:"SUB",
regex: @"\-",
orderOfPrecedence: 2,
expressionBuilder: (left,right) => Expression.Subtract(left, right)),
new BinaryOperatorDefinition(
name:"MUL",
regex: @"\*",
orderOfPrecedence: 1, //multiply should be done before add/subtract
expressionBuilder: (left,right) => Expression.Multiply(left, right)),
new BinaryOperatorDefinition(
name:"DIV",
regex: @"\/",
orderOfPrecedence: 1, //division should be done before add/subtract
expressionBuilder: (left,right) => Expression.Divide(left, right)),
sqrt = new FunctionCallDefinition(
name:"FN_SQRT",
regex: @"sqrt\(",
argumentTypes: new[] {typeof(double) },
expressionBuilder: (parameters) => {
return Expression.Call(
null,
method:typeof(Math).GetMethod("Sqrt"),
arguments: new [] { parameters[0] });
}),
openBracket = new BracketOpenDefinition(
name: "OPEN_BRACKET",
regex: @"\("),
delimeter = new ListDelimiterDefinition(
name: "COMMA",
regex: ","),
new BracketCloseDefinition(
name: "CLOSE_BRACKET",
regex: @"\)",
bracketOpenDefinitions: new[] { openBracket, sqrt },
listDelimeterDefinition: delimeter)
new GrammerDefinition(name: "WHITESPACE", regex: @"\s+", ignore: true) //we dont want to process whitespace
});
Some of the out of the box grammer defintions are detailed below
Name | Description | Properties |
---|---|---|
GrammerDefintion |
Base class for all defintions. Does not perform any functionality during the parsing |
|
OperandDefinition |
Defines the smallest atomic piece in your language, used to represent items like numbers or strings |
|
BinaryOperatorDefintion |
An operation that takes parameters from the left and right of it. Often represents arithmetic operaitons (+ , - , * , / ) or equality checks (== , != , < > ) or boolean logic (and , or ) |
|
UnaryOperator |
An operation that takes a single parameter, used for operations such as not |
|
BracketOpenDefinition |
Defines an open bracket, functionally does not do much unless paried with a BracketCloseDefinition |
|
ListDelimeterDefinition |
The seperator to use to denote lists within bracktes (a , in most languages) functionally does not do much unless paired with a BracketCloseDefinition |
|
BracketCloseDefinition |
The expression between the brackets is evaluated first |
|
FunctionCallDefinition |
Defines a function that takes in a list of operands. Also acts as a bracket BracketOpenDefinition definition |
|
If your language is more complicated than the provided GrammerDefinitions
you are able to define your own by extending GrammerDefintion
. You best read the Nuts and bolts section to determine the best way to implement your definition.
All parsing exceptions extend ParseException
. A ParseException
will contain both a readable message and a StringSegment
that represents what token(s) in the original input string caused the error.
The StringSegment
allows pinpointing of issues such as where operands are missing, which function has too many parameters or where the unexpected character is. It provides a useful feedback for wherever you are getting your original strings from.
Under the hood StringToExpression
implements a shunting-yard alogrithm.
The internal parsing state contains a Stack<Operand>
and a Stack<Operator>
that are built up during the parsing
Operand
- Represent a .NET expression. This may be a simpleConstantExpression
, or the root of a complicated Tree ofBinaryExpressions
Operator
- Is a function that can be run. Generally these functions when run will consume one or more operands and produce one operand. Such that run operators reduces the number of operands on the stack.
The parsing is done in roughly three steps
-
Tokenize - The string is parsed through a tokenizer which uses the regular expressions defined in the
GrammerDefinitions
to break the strings intoTokens
. AToken
knows theGrammerDefinition
that created it and the string value it represents. -
Apply
GrammerDefinitions
- AllGrammerDefinition
has anvoid Apply(Token token, ParseState state)
method. We first read all theTokens
in sequentially and run eachGrammerDefinition
Apply
method. The apply method can make any modifications to the state it wants, this can range from something simple like pushing an Operand on to the stack, to something more complicated like executing operands. -
Execute Operators -Once all the tokens are Applied we will start poping Operators off the stack and executing them. When an
Operator
executes its generally expected that it will consume one or moreOperands
and create oneOperand
. This way by the time we apply all the operators we should only have a singleOperand
on the stack, that is our result.
To customize you can make your own GrammerDefinition
and implment the Apply
method to meet your purposes.