User: editor -- 2011-06-24 |
Hits: 4372 |
Type: Text file parser |
Description: |
How can you extract all HTTP links from many websites at once? Given thousands of web pages, how do you extract every link from them? |
Input Sample: |
http://www.site1.com
http://www.site2.com
...
http://www.site1000.com |
Output Sample: |
http://www.site1-link1.htm
http://www.site1-link2.htm
...
http://www.site2-link1.htm
http://www.site2-link2.htm
...
http://www.site1000-link1.htm
http://www.site1000-link2.htm
... |
Answer: |
Hint: You need to download and install "Replace Pioneer" on Windows to complete the following steps. |
1. Prepare a file url.txt containing the HTTP addresses, for example:
   http://www.mind-pioneer.com
   http://www.google.com
   ...
2. Click the "Tools->Batch Runner" menu, then click "Import" to import url.txt.
3. Select the first URL and click the "Edit" button, change "Download As" to "links", check the "Apply to All" option, and click "OK".
4. Click the "Fast Replace" button:
   * Click "Add", set "Search" to: set "Replace" to: and click "OK".
   * Check the "reg exp", "cross line" and "Extract" options.
   * Click "Start", then click the "output to single file" button.
5. All links are extracted to a new file. |
Screenshot 1: Fast_Replace_Window |
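For readers who want to script the same task, the steps above can be roughly sketched in Python. This is not how Replace Pioneer works internally; it is a minimal sketch that assumes url.txt holds one address per line, writes to a hypothetical output file links.txt, and collects the href of every <a> tag with the standard html.parser module instead of a search/replace pattern. The names LinkCollector, extract_links and extract_all are illustrative only.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag, resolved against base_url."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Turn relative links such as "/page.htm" into full URLs.
                absolute = urljoin(self.base_url, value)
                # Keep only http/https links (drops mailto:, javascript:, ...).
                if absolute.startswith(("http://", "https://")):
                    self.links.append(absolute)


def extract_links(html, base_url):
    """Return all http(s) links found in one page, in document order."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    return parser.links


def extract_all(url_file="url.txt", out_file="links.txt"):
    """Download every URL listed in url_file and write all links to out_file.

    Assumption: url_file contains one address per line.
    """
    with open(url_file, encoding="utf-8") as f:
        urls = [line.strip() for line in f if line.strip()]

    with open(out_file, "w", encoding="utf-8") as out:
        for url in urls:
            try:
                with urlopen(url, timeout=10) as resp:
                    html = resp.read().decode("utf-8", errors="replace")
            except (OSError, ValueError) as exc:
                # Unreachable or malformed address: report it and move on.
                print(f"skipped {url}: {exc}")
                continue
            for link in extract_links(html, url):
                out.write(link + "\n")
```

Calling extract_all() in the folder that contains url.txt would download each page in turn and append its links to the single output file, which mirrors the "output to single file" step. For thousands of pages, the downloads run one after another, so expect it to take a while.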