Searching through W...
 
Notifications
Clear all

Searching through Word meta data

7 Posts
5 Users
0 Likes
347 Views
(@jonathan)
Posts: 878
Prominent Member
Topic starter
 

We have 10s of thousands of MS Word documents and need to pull out only those documents that were created by a specific user.

Would like to receive recommendations for the best/quickest way to do this.

Thanks.

 
Posted : 08/03/2006 4:40 pm
keydet89
(@keydet89)
Posts: 3568
Famed Member
 

Get the FileMSWord Perl module here

http//www.cpan.org/modules/by-authors/id/H/HC/HCARVEY/

Drop the .pm file in the correct directory for your platform (on Windows, if Perl is in C\Perl, put the .pm file in the C\Perl\site\lib\File directory.

Create/modify a Perl script to suit your needs…with that many documents, you may need to wait overnight, regardless of what application you use.

HTH,

Harlan

 
Posted : 08/03/2006 5:54 pm
(@jonathan)
Posts: 878
Prominent Member
Topic starter
 

Thanks Harlan, will give that a go.

Jonathan

 
Posted : 08/03/2006 6:26 pm
(@nigel)
Posts: 13
Active Member
 

You may want to use DT Search.

 
Posted : 09/03/2006 6:41 am
 koko
(@koko)
Posts: 21
Eminent Member
 

the dirtiest, cheapest way to do it though, if you're interested, is to just display the Author property (i assume this is the property you care about) in explorer and then sort by it. of course this is assuming that the files are all in one directory (or a manageable number).

there must be some way to specify a search by Author in the built-in windows search but i can't figure it out.

i'm not aware of dtsearch allowing you to specify the Author property of a file.

 
Posted : 16/03/2006 1:40 am
(@jonathan)
Posts: 878
Prominent Member
Topic starter
 

Sometimes the most straightforward things elude us!

Thanks Koko.

 
Posted : 16/03/2006 2:42 pm
(@fsmith)
Posts: 1
New Member
 

Sorry, a bit late to this discussion, but that is exactly what our forager tool does. See www.inforenz.com where you can download an evaluation version.

 
Posted : 28/03/2006 10:45 pm
Share: