StringMatch Pro is a utility in the System Utilities::Other category, developed by Istvan Szendro. After our trial and test, the software was found to be official, secure and free. Here is the official description for StringMatch Pro: StringMatch Pro analyzes a text file to either (1) find all text blocks that appear multiple times in the file (Repetitions mode) or (2) find all unique terms within the file (Terms mode).
Unlike usual text comparison programs that proceed character by character, StringMatch proceeds by text units of word, sentence, or paragraph.
It is blazing fast and can process text containing millions of words.
Characters that StringMatch should consider valid in a file are specified by the user in a plain-text file. Any other characters in the source file will be considered word separators during processing.
So with an alphabet file containing letters only, the strings '15 men on the dead man's chest' and '10 men on the dead man's chest' will be reported as identical.
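The effect of the alphabet file can be illustrated with a minimal Python sketch. It is not StringMatch's actual code: the `tokenize` function and the `LETTERS` alphabet are hypothetical names, and the sketch assumes that every character outside the alphabet simply ends the current word.

```python
import string

def tokenize(text, alphabet):
    """Split text into words; any character outside `alphabet` acts as a separator."""
    words, current = [], []
    for ch in text:
        if ch in alphabet:
            current.append(ch)
        elif current:
            # A non-alphabet character closes the word collected so far.
            words.append("".join(current))
            current = []
    if current:
        words.append("".join(current))
    return words

# Assumed alphabet: ASCII letters only, so digits, spaces and apostrophes separate words.
LETTERS = set(string.ascii_letters)

a = tokenize("15 men on the dead man's chest", LETTERS)
b = tokenize("10 men on the dead man's chest", LETTERS)
print(a == b)
```

Because the digits are not in the alphabet, both strings reduce to the same word list, which is why they would be reported as identical.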
You can specify the minimum length of matching strings you want to appear in the results. This does not affect the maximum length of results -- StringMatch will find the longest possible matching strings.
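To make the role of the minimum length concrete, here is a brute-force Python sketch of repetition finding over word units. It is an illustration only, not the algorithm StringMatch uses; `repeated_ngrams` is a hypothetical name, and overlapping occurrences are counted as separate matches.

```python
from collections import Counter

def repeated_ngrams(words, min_len):
    """Return {word tuple: count} for sequences of at least min_len words seen 2+ times."""
    repeats = {}
    n = min_len
    while n <= len(words):
        counts = Counter(tuple(words[i:i + n]) for i in range(len(words) - n + 1))
        found = {gram: c for gram, c in counts.items() if c > 1}
        if not found:
            break  # no repeat of length n means no repeat of length n + 1 either
        repeats.update(found)
        n += 1
    return repeats

words = "the cat sat on the mat and the cat sat on the rug".split()
repeats = repeated_ngrams(words, min_len=3)
longest = max(repeats, key=len)
print(" ".join(longest), repeats[longest])
```

The minimum length only filters out short matches; the loop keeps growing the sequence length until no repetition remains, so the longest match is still found.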
Processing can be case-sensitive or case-insensitive.
*** Unique features in Repetitions mode:
- Deep Analysis
StringMatch analyzes the file using a two-pass process.
Once Pass 1 has found all repetitive strings, Pass 2 performs deep analysis on the Pass 1 results to identify their internal structure, looking for repetitions within, and overlaps between, those results.
StringMatch uses this fine-grain information to present final results in an intelligent and meaningful way.
Your file may have sentences or entire text blocks that differ in only a few words (e.g. in device type identifiers). To capture these near-identical strings, you can use synonyms to make those few differing words equivalent to each other for the purposes of the analysis.
For example, if you specify Intel, Microsoft and IBM as synonyms of enterprise, then StringMatch will find that the three strings
Intel is a company
IBM is a company
Microsoft is a company
are identical, and will report that "enterprise is a company" occurs 3 times.
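The synonym example can be sketched in Python as follows. This is only an illustration of the idea, assuming synonyms are applied by replacing each word with its canonical term before counting; the `SYNONYMS` mapping and the `canonicalize` function are hypothetical and do not reflect StringMatch's synonyms file format.

```python
from collections import Counter

# Hypothetical synonym table: each listed word maps to its canonical term.
SYNONYMS = {"Intel": "enterprise", "Microsoft": "enterprise", "IBM": "enterprise"}

def canonicalize(line, synonyms):
    """Replace every word that has a synonym entry with its canonical term."""
    return " ".join(synonyms.get(word, word) for word in line.split())

lines = ["Intel is a company", "IBM is a company", "Microsoft is a company"]
counts = Counter(canonicalize(line, SYNONYMS) for line in lines)
print(counts["enterprise is a company"])
```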
You can specify the text unit StringMatch should proceed by: word, sentence or paragraph.
Use 'Word' to perform fine-grain analysis and find even short repetitive patterns.
Use 'Sentence' or 'Paragraph' if you are only interested in larger-scale repetitions.
You can specify the minimum number of text units (words, sentences or paragraphs) a result should have to make it into the final results. The available range is 1 - 100.
- Sorted results
Results are presented sorted by length, frequency, or weight (length*frequency).
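The three sort orders can be sketched in Python. The result data below is made up for illustration, length is assumed to be measured in words, and descending order is an assumption, since the description does not state the sort direction.

```python
def length(result):
    """Length of a result in words (assumed unit)."""
    text, _frequency = result
    return len(text.split())

def weight(result):
    """Weight as described: length multiplied by frequency."""
    _text, frequency = result
    return length(result) * frequency

# Hypothetical (text, frequency) results.
results = [
    ("enterprise is a company", 3),
    ("on the", 4),
    ("the cat sat on the", 2),
]

by_length = sorted(results, key=length, reverse=True)
by_frequency = sorted(results, key=lambda r: r[1], reverse=True)
by_weight = sorted(results, key=weight, reverse=True)

for text, frequency in by_weight:
    print(weight((text, frequency)), text)
```

Sorting by weight favors results that are both long and frequent, which a pure length or pure frequency sort would rank lower.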
*** Unique features in Terms mode:
- Terminology extraction
You can use synonyms to quickly and easily extract all meaningful unique strings from the file. A few iterations of extending your synonyms file will yield the desired terminology list.
StringMatch also provides features that help you quickly identify and eliminate words that are NOT candidates for a terminology list.
Setting the length to a fixed value will extract unique strings exactly Len long.
Setting the length to Auto will extract unique strings of ANY length.
The available range is 1 - 10, or Auto.
- Sorted results
The sorting options are designed to make it easy to quickly identify words you don't want, and to mass-lift them into your synonyms file.
Results are presented sorted alphabetically, by frequency (subsorted alphabetically), by length, or backwards-alphabetically (so words ending in 'a' appear first).
Frequency sorting will list the most frequent words first, enabling you to quickly find words like 'the', 'a', 'an', etc. that are normally not parts of a terminology.
Backwards alphabetical sorting will present words with similar endings in contiguous groups, so e.g. all gerunds (ending in '-ing') will be in one block.
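Frequency sorting and backwards alphabetical sorting can be sketched in Python. The word list is invented for illustration, and backwards alphabetical order is assumed to mean comparing words by their reversed spelling.

```python
from collections import Counter

# Hypothetical extracted words, with repeats.
words = ["running", "the", "sofa", "sing", "the", "jumping", "data", "the", "a", "a"]
counts = Counter(words)

# Most frequent first; ties are broken alphabetically.
by_frequency = sorted(counts, key=lambda w: (-counts[w], w))

# Compare words from their last letter backwards, so similar endings group together.
by_ending = sorted(counts, key=lambda w: w[::-1])

print(by_frequency)
print(by_ending)
```

In the second list the three '-ing' words end up next to each other, which makes it easy to select them as a block.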
|Platforms:||Windows XP, Windows NT/2000/2003|
|Publisher:||Istvan Szendro|
|Downloads:||0 last month, 23 total|
|Last updated:||More than a year ago|