First of all, you need to convert all word files to text format(you must install MS word in advance to enable doc to txt operation): Secondly, you can extract email addresses and remove duplicate with following procedure: http://www.mind-pioneer.com/services/736_Text_file_parser.html
|