Replace Pioneer Home   All Examples   Free Download

 New request --free  RSS: Replace Pioneer Examples

807.Text file parser -- How to extract all http links from multiple web sites in one time?

User: editor -- 2011-06-24          << 806  808 >>
Hits: 3633
Type: Text file parser   
Search all Text file parser examples
Description:
How to extract all http links from multiple web sites in one time? There are thousands of webpages, how to extract all links from them?
Input Sample:
http://www.site1.com 
http://www.site2.com 
... 
http://www.site1000.com 
Output Sample:
http://www.site1-link1.htm 
http://www.site1-link2.htm 
... 
http://www.site2-link1.htm 
http://www.site2-link2.htm 
... 
http://www.site1000-link1.htm 
http://www.site1000-link2.htm 
... 
 
 
Answer:
Hint: You need to Download and install "Replace Pioneer" on windows platform to finish following steps.
1. Prepare a file url.txt containing http address like: 
http://www.mind-pioneer.com 
http://www.google.com 
... 
2. Click "Tools->Batch Runner" menu, click "Import" to import url.txt 
3. Select first url and click "Edit" button, change the "Download As" to "links", and check option of "Apply to All", click "OK" 
4. Click "Fast Replace" button 
* click "Add" 
set "Search" to:  
set "Replace" to:  
click "ok" 
* check option of "reg exp", "cross line" and "Extract" 
* click "Start", and click "output to single file" button 
 
5. All links will be extracted to a new file.

Screenshot 1:  Fast_Replace_Window


Similar Examples:
How to extract all http links from multiple web sites in one time? (100%)
How to extract all image links from multiple html files?  (70%)
How to extract last 100 lines from multiple files? (67%)
How to extract first line from multiple files and generate a new file? (66%)
How to extract all links from a web page?   (66%)
How to extract all image links from a list of webpages? (63%)
How to extract all text enclosed by H1 tag from multiple html files? (63%)
How to extract all c++ comments from multiple cpp files? (63%)

Check Demo of Text file parser
Keywords:
google  webpages  how to extract all http links  extract all http links  site  url  extract all links from  extract all links  pages  webpage  extract multiple webpages  extract address from webpages  url extract  extract url  extract line containing  webpages line  extract address multiple  extract http links