User: Aaron -- 2013-01-24
Type: Text file parser
I would like to extract all proper names and nouns that start with an upper case letter, with duplicates removed. A proper name may consist of one, two or three upper case words in a row, for example: Barry Maxwell Brown.
If possible, I would also like to ignore common words by using a stop list, for example "He".
Input sample:
Barry Maxwell Brown was late for school. He gave a note to his teacher.
Sally was also late but the teacher didn't ask for one. Bill Jenkins was always on time and didn't like Sally or Barry.
Output sample:
Barry Maxwell Brown
Hint: You need to download and install "Replace Pioneer" on Windows to complete the following steps.
1. press ctrl-o to open the text file
2. press ctrl-h to open the 'replace' dialogue
* set 'search for pattern' to:
* set 'replace with pattern' to:
* uncheck the "print unmatched units" option
* fill "if" entry with stop list, like:
3. click "replace"; all words that start with an upper case letter will be extracted.
4. select menu "Replace->Replace by template->Line replace->Remove all duplicate lines", click "ok", then click "replace"; all duplicate lines will be removed.
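For readers without Replace Pioneer, the same steps can be sketched in Python. This is only a minimal sketch and not the tool's own method: the regular expression, the STOP_WORDS set and the extract_names function are all assumptions made for illustration, since the original search and replace patterns are not shown in the steps above.

```python
import re

# Hypothetical stop list of capitalized common words; extend as needed.
STOP_WORDS = frozenset({"He", "She", "It", "They", "The", "A", "An"})

# Assumed pattern: one to three capitalized words in a row,
# separated by single spaces (e.g. "Barry Maxwell Brown").
NAME_PATTERN = re.compile(r"\b[A-Z][a-z]+(?: [A-Z][a-z]+){0,2}\b")


def extract_names(text, stop_words=STOP_WORDS):
    """Return unique capitalized names in order of first appearance."""
    seen = set()
    names = []
    for match in NAME_PATTERN.finditer(text):
        name = match.group()
        # Skip matches that are exactly a stop word, such as "He".
        if name in stop_words:
            continue
        # Remove duplicates while keeping the original order.
        if name not in seen:
            seen.add(name)
            names.append(name)
    return names


if __name__ == "__main__":
    sample = (
        "Barry Maxwell Brown was late for school. He gave a note to his teacher. "
        "Sally was also late but the teacher didn't ask for one. "
        "Bill Jenkins was always on time and didn't like Sally or Barry."
    )
    for name in extract_names(sample):
        print(name)
```

On the input sample this prints Barry Maxwell Brown, Sally, Bill Jenkins and Barry, one per line. Note that "Barry" and "Barry Maxwell Brown" are treated as different entries, and that only a match consisting entirely of a stop word is skipped; a stop word followed directly by a name (for example "The Sally") would be captured as one unit.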
Screenshot 1: Replace_Window