Last Updated on Monday, 27 December 2010 00:58 Written by Joe Rinehart, SEO Sunday, 27 June 2010 13:38
Kevin Ireland is the publisher of http://www.InsiteGainesville.com and http://www.GainesvilleBizReport.com, which represent his Gainesville, Florida print media (both sites use the Joomla content management system). He asked:
Hey Joe, I'm trying to figure a way that I can make the online PDFs of my
magazines searchable by Google. By that I mean if someone plugs "inventor
John Smith" into Google, the PDF of my magazine that includes an article
with inventor John Smith will come up high in the returned results. We've
already saved all PDFs in searchable form, so someone who opens the
magazine can search for specific words but we can't figure a way to get
Google to drill down into the pages to identify specific key words. Do you
know of a method?
My emailed response, which I reserve the right to edit for the benefit of everyone down the road:
PDFs are searchable by default these days, and Google has gotten really good at indexing them. However, if the PDF is created in Photoshop as opposed to Adobe PageMaker or MS Word, it'll be one big image, and Google can't index the text within images. So Adobe, MS Word, or any word processor or text editor that will convert to PDF is the only way to go. FYI, all the articles within Joomla can be converted to PDF, assuming your development company didn't turn the feature off.
The example you gave regarding a John Smith is, well, not the best example, because the last name Smith is one of the most common ;) However, I'd suggest that if someone typed John Smith in Gainesville, you'd have a shot.
The fastest method of getting Google to index your PDFs is to first have a sitemap, and in Joomla I'd recommend Xmap. Then go to Google Webmaster Tools, using your Google account or one you have established for all the Google goodies and your site(s), and make sure your site is registered.
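If you're curious what a sitemap that lists PDFs looks like, here's a minimal sketch in Python. It isn't how Xmap works internally; it only builds a file in the standard sitemaps.org format by hand. The `build_sitemap` function and the example.com PDF addresses are made up for illustration, so substitute your own URLs.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a sitemap XML string with one <url><loc> entry per URL."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical PDF locations, for illustration only.
pdf_urls = [
    "http://www.example.com/magazines/2010-06.pdf",
    "http://www.example.com/magazines/2010-07.pdf",
]

sitemap_xml = build_sitemap(pdf_urls)
print(sitemap_xml)
```

You'd save the output as something like sitemap.xml on your site and submit that address in Webmaster Tools. In practice a sitemap extension does this for you; the point is simply that each PDF gets its own entry.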
Another thing to be aware of is that Google doesn't actually index every single word! There are what's known as stop words, and here's a URL listing the most common:
http://www.link-assistant.com/seo-stop-words.html
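To get a feel for what's left of a phrase once stop words are dropped, here's a rough Python sketch. The `STOP_WORDS` set is a tiny sample I picked for illustration, not Google's actual list (which isn't public in this form), and `significant_terms` is a made-up helper name.

```python
# A small illustrative sample of stop words; an assumption, not Google's list.
STOP_WORDS = {"a", "an", "and", "in", "of", "the", "to"}


def significant_terms(phrase):
    """Lowercase the phrase and drop any word found in STOP_WORDS."""
    return [word for word in phrase.lower().split() if word not in STOP_WORDS]


terms = significant_terms("The inventor of the year in Gainesville")
print(terms)  # ['inventor', 'year', 'gainesville']
```

The words that survive are the ones worth working into your headlines and article text.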