DIRegEx is a library of components and procedures that implement regular expression pattern matching using the same syntax and semantics as Perl for Delphi (Embarcadero / CodeGear / Borland).
DIRegEx offers two regular expression algorithms with their corresponding wrapper classes for easy usage:
- TDIDfaRegEx implements the DFA matching algorithm. Considerer this as a special purpose algorithm. If finds all possible matches and in particular, it finds the longest. It never needs to backtrack and supports partial matching much better, even for very long subject strings in several pieces.
Both classes descend from a common ancestor TDIRegEx which implements the complete functionality for working with regular expressions, regardless of which algorithm is currently in effect:
- Matching and extraction of matches / substrings from the source text.
- Searching for regular expressions within streams and memory buffers. To search within streams or files (of virtually unlimited size), use one of the specially optimized
- Replacements on full and partial matches / substrings.
- Listing of full and partial matches / substrings.
- Formatting of matches.
The implemented regular expression syntax is Perl 5.10 compatible and covers the following:
- Character Types
- General Category Property Codes for \p and \P
- Script Names for \p and \P
- Character Classes
- Anchors and Simple Assertions
- Match Point Reset
- Atomic Groups
- Option Setting
- Lookahead and Lookbehind Assertions
- Subroutine References (possibly recursive)
- Conditional Patterns
- Backtracking Control
- Newline Conventions
- What \R Matches
DIRegEx can do a Format and Replace based on regular expressions, a feature often required for advanced text editors and word processors. It is possible to include parts of the originally matched text into the replacement.
The List function concatenates all reg-ex matches to a single string. It allows to build lists of phone numbers, e-mail addresses, etc. with a single function call only.
RegEx Mask Controls
DIRegEx also includes two regular expression edits: TDIRegExMaskEdit and TDIRegExComboBox. Both controls validate all keyboard input against a regular expression pattern. They work similar to Delphi's TMaskEdit, but are much more flexible and powerful.
The regular expression mask controls allow to:
- Accept / reject certain characters at certain positions.
- Allow / reject particular characters if they follow any character(s) of your choice.
- Force the input text to begin / end with any character(s).
- Not accept particular characters at the beginning / end of the input text.
- Plus many more countless possibilities …
TDIRegExMaskEdit is an edit control that validates all keyboard input on a keystroke by keystroke basis. It accepts characters entered only if the new text matches the regular expression mask pattern. If the new text does not match, the character is silently rejected.
TDIRegExMaskComboBox is a combo box control with essentially the same regular expression validation as TDIRegExMaskEdit.
The DIRegEx Workbench helps you to design and test your regular expressions. It allows to set all available options, measure execution times, and to save and load all settings for later use.
The DIRegEx Workbench is available as a
- Design-Time Component Editor
- Standalone Application
DIRegEx 8.7.1 – 14 Jun 2016
- Fix an intermittent access violation when compiling a pattern with JIT, reported on the mailing list. This affected Win32 only.
- Fix a racing condition in JIT.