Search PDF title and content using MOSS 2007

Introduction

By default the SharePoint 2007 Search indexed only the meta data of a PDF document. By installing and configuring a PDF IFilter the Search will also index the contents of the PDF document. This allows users to find documents based on text inside the document. This process is called full text indexing.

[Indexing Server]: the server(s) in the SharePoint Farm that has/have the “Indexing” Role assigned. In a small farm this can be a single server for all roles.

[Web Front End Server]: the server(s) in the SharePoint Farm that has/have the “Web Front End” Role assigned. In a small farm this can be a single server for all roles.

[Indexing Server]

1.Install the PDF IFilter (see below for a list of available IFilters)
2.Add the .pdf file type to the index list:
1.Go to Central Administration, then to the Shared Services Administration Web of the current SSP, go to Search Settings and next to File Type
2.Add a new file type pdf
3.[This step only applies to 64 bit servers]
3.1.Go to
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0\Search\Setup\ContentIndexCommon\Filters\Extension\.pdf
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\12.0\Search\Setup\ContentIndexCommon\Filters\Extension\.pdf
NOTE: The first registry key entry will enable the title search and the second one will enable the content of the PDF.
3.2.Change the (Default) key value
3.2.1.Old value: {4C904448-74A9-11D0-AF6E-00C04FD8DC02}
3.2.2.(Foxit x64 PDF IFilter) New value: {987F8D1A-26E6-4554-B007-6B20E2680632}
3.2.3.(Adobe x64 PDF IFilter) New value: {E8978DA6-047F-4E3D-9C78-CDBE46041603}
NOTE: Add the curly bracket with the value.
4.Perform an iisreset
5.Perform a Full Update on the Search content indexes
5.1.Open a Command Prompt on the Indexing Server
5.2.net stop osearch
5.3.net start osearch
5.4.Go to Central Administration, then to the Shared Services Administration Web of the current SSP, go to Search Settings and start a full crawl of all locations containing PDF files

[Web Front End Server]

1.Copy the ICPDF.GIF file to “C:\Program Files\Common Files\Microsoft Shared\Web Server Extensions\12\Template\Images”
2.Edit the file C:\Program Files\Common Files\Microsoft Shared\Web server extensions\12\Template\Xml\DOCICON.XML
1.Add an entry for the .pdf extension entry just before </ByExtension>,
<Mapping Key="pdf" Value="icpdf.gif"/>

Reference:
http://jasoncoltrin.com
www.moss2007.be
www.alpesh.nakars.com
http://www.adobe.com/support/downloads/detail.jsp?ftpID=2611

Diganta Kumar is an experienced Technical Program Manager with a passion for technology. He has architected and developed software for over a decade for a broad range of industries. Diganta is a founder of two online IT businesses. He likes to help, mentor, and manage software development teams to improve and produce great software. He currently works as a Principal Program Manager for Microsoft. Before joining Microsoft, he was with AWS for five years, where he managed large cross-functional programs on a global scale.

Posted in SharePoint

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.

%d bloggers like this: