| User: editor -- 2011-12-12 << 894 896 >> |
| Hits: 245 |
| Type: Text file parser |
| Search all Text file parser examples |
| Description: |
| How to extract all H1 text from multiple html files and write the output to a single file? |
| Input Sample: |
| <html>... <h1>text 1 with h1 mark</h1> ... <h1>text 2 with h1 mark</h1> .... |
| Output Sample: |
| text 1 with h1 mark text 2 with h1 mark |
| Answer: |
| Hint: You need to Download and install "Replace Pioneer" on windows platform to finish following steps. |
| 1. open "Tools->Batch Runner" window 2. drag multiple html files from windows file browser to "Batch Runner" window 3. click "Fast Replace" button * click "Add" to add a new rule * set "search" to: * set "replace" to: * click "ok" 4. check option of "reg exp", "ignore cases" and "extract" 5. click "start", and click "output to single file", all text that enclosed by <h1> and </h1> will be save to a single file. |
Screenshot 1: Fast_Replace_Window |