How to read text from power point and index it

  • I'm working on a project to structure unstructured data.  What i need to be able to do is find a way to read information from a power point deck and index it.  My users want to be able to use a "search engine" that will allow them to browse their ppt's by content.  The kicker is that they don't want to pay for google or the like.  Can sql server open a ppt and read it or do i need to find another program to convert the ppt into a text file?

  • I'd say your best bet would be to use VBA in Access.  You could then loop through all the ppt files and use the PowerPoint object model to inspect the slides and write the results to a SQL table.

    Jof

  • You could use the Windows Indexing Service. It has APIs that you can use to manage and query the index data programmatically.

    http://msdn.microsoft.com/library/default.asp?url=/library/en-us/indexsrv/html/indexingservicestartpage_6td1.asp

    -- J.Kozloski, MCDBA, MCITP

Viewing 3 posts - 1 through 2 (of 2 total)

You must be logged in to reply to this topic. Login to reply