Replace Pioneer Home   All Examples   Free Download

 New request --free  RSS: Replace Pioneer Examples

425.Text file parser -- How to extract/parse all http addresses from text/html file?

User: editor -- 2010-02-19          << 424  426 >>
Hits: 2917
Type: Text file parser   
Search all Text file parser examples
Description:
How to extract/parse all http addresses from text/html file? 
I have some html files that contain some web addresses that started from http, 
how to extract all the http addresses and form a new text file?
Input Sample:
.... 
http://www.mind-pioneer.com 
...<a href="http://www.test.com"> 
....
Output Sample:
http://www.mind-pioneer.com 
http://www.test.com
Answer:
Hint: You need to Download and install "Replace Pioneer" on windows platform to finish following steps.
1. ctrl-o open html/text file 
2. ctrh-h open 'replace' window 
* in 'search for pattern', fill: 
 
* in 'replace with pattern', fill: 
 
* uncheck 'Print unmatch unit' option 
3. click 'Replace', done! 
4. ctrl-s save to file. 
 
Note: uncheck 'Print unmatched unit' option means  extract/parse the matched text ony.
Download Script:  scripts/425.rst.zip

Screenshot 1:  Replace_Window


Similar Examples:
How to extract all html address from a text file? (80%)
How to extract all IP address from a text file? (77%)
How to extract unique ip addresses from multiple text/html files? (73%)
How to extract all email addresses from many text files? (73%)
How to extract/parse all upper cases Acronyms from a text file? (73%)
How to extract all upper cases words from a text file? (67%)
How to sort all lines of ip addresses in a text file? (66%)
How to add a port 80 for all IP addresses in a text file? (63%)

Check Demo of Text file parser
Keywords:
http addresses from  addresses  files that contain  web  unmatched  address  unmatch  addr  http  print  extract addresses from html  extract html addresses from a text file  addresses from html  extract all http addresses  extract http addresses from text  extract all addresses from a  extract text with html  text file parser html