Skip to main content
Trans Am

Todd Klindt's SharePoint Admin Blog

Go Search
Home
Blog
Netcast homepage
SharePoint Terminology Wiki
  

Todd Klindt's home page > Todd Klindt's SharePoint Admin Blog > Posts > Indexing files larger than 16 MB in SharePoint 2010
Indexing files larger than 16 MB in SharePoint 2010

As with previous versions of SharePoint, SharePoint 2010 will not index the contents of files larger than 16 MB. There are a couple of reasons for this such as network usage pulling large files across and the time it takes to break them apart. While the file itself isn't indexed, the metadata is. So you'll be able to find the location of a 17 MB or larger file by searching for its name, or its author, you won't be able to find it by searching for words that exist in it.

With previous versions of SharePoint, the fix for this was to add a Registry key called "MaxDownloadSize" and put a number between 17 and 64 in it. That tells the search engine to ignore the 16 MB limit, and go ahead and index files all the way up to 64 MB in size. However, in SharePoint 2010 this has changed a bit. The indexer still doesn't download files larger than 16 MB, so that's the same. The way to fix it though is different now. Thanks to the invention of PowerShell we can do that instead of getting our hands dirty in the Registry.

Here's the PowerShell code:

$s = Get-SPEnterpriseSearchServiceApplication

$s.GetProperty("MaxDownloadSize")

$s.SetProperty("MaxDownloadSize",25)

$s.Update()

Restart-Service osearch14

 

This is what it looks like in practice:

We can see here the default value is still 16 MB, but that is easily changed to something like 25 MB. We also need to bounce the search service for this to take effect. Then after your next full crawl the data in files larger than 16 MB will be indexed.

How do you know if you have documents larger than 16 MB? Unfortunately that seems to have changed for the worse in SharePoint 2010. In SharePoint 2007 if the indexer came across a file larger than 16 MB it would throw a warning in the crawl log. SharePoint 2010 doesn't do this. I haven't found a way to determine which files are skipped because that are larger than the current MaxDownloadSize setting. If anyone knows how to determine this, let me know.

tk

Comments

Perfect timing!

Hot topic for me, thanks Todd. 
at 7/21/2010 9:57 AM

The warning is still logged

Hi Todd,

I was following up on this, and found that (at least in the environment I was testing) the warning does still get recorded with the message "The file reached the maximum download limit. Check that the full text of the document can be meaningfully crawled."

 - Woody -
at 7/21/2010 10:16 AM

Re: The warning is still logged Edit

Hey Woody,
Is it logged in the crawl log? I swear it wasn't there when I checked.

tk
Todd O. Klindt at 7/22/2010 10:11 PM

Max size limit in SP 2010

Hi Tk,
As you said you will extend the size limit in 2007 till 64 MB. What about SP 2010? How much SP 2010 can extend the size limit?

Thanks
Ani
at 8/8/2010 7:44 PM

Add Comment

Items on this list require content approval. Your submission will not appear in public views until approved by someone with proper rights. More information on content approval.

Title


Body *


Today's date *

Please enter today's date so I know you are a real person
Attachments