Database Driven Sites And How To Get Them Listed

James Zigrino
by Head of IT
James Zigrino
It's common knowledge in the SEO industry that query string URL's are a bad thing, and that spider-based search engine listings are impossible to achieve in some search engines because of them.

That much is common knowledge. However, two things are blatantly obvious -

For those of you that are new to URL structure, a query string is the part of a URL that follows a ? or an &:
eg. www.example.com?var1=23&var2=47.

The query string is used to pass information from one page in a site to the next. Often this is used as part of a database query, the resulting data from which will become the basis of the next dynamic page.

Much search engine spider software was written several years ago, before there was a real requirement to maintain variables states between pages. Often search engine indexing algorithms make the assumption that data contained in a query string is transient and unlikely to remain current for long enough to be worth indexing (some spiders can take weeks to list your site after submission, and then weeks before re-visiting).

Well, the problem is clear, the question is - is there a way that we can solve this problem?

Take a look at the bigmouthmedia site. This is database driven. Take a look at www.amazon.com; this again is obviously database driven. But notice that neither site uses query strings in any of their URLs.

The trick to running a dynamic or database driven site is to find alternative ways to communicate your variables and data via URLs without scrificing the functionality of your site.

Bigmouthmedia, and Amazon have used two completely different approaches:

Bigmouthmedia uses a specially developed system of 'virtual static pages'. Each URL contains enough text to uniquely associate with a record in the database; a call to that page extracts the information from that record and uses it to build the page 'on the fly'. As far as the browser or visiting spider is concerned the resulting page is as real as any other static HTML page. One advantage of this technique is that it can be used on almost any web-server and requires no special set-up or the installation of any extra software.

Amazon on the other hand disguises variable data inside the directory-path part of the url, and uses specially configured web server software to decode and extract the information when it arrives, before passing a new, dynamically translated url on for processing. This means that all the URLs for their site appear static, and yet behave as if they contained query strings.

A third technique, that can often be successful is to use the ability most web server software offers of customising the 404-Page Not Found message. This allows a useful page to be returned no matter url is entered, and futher customisation could, for instance, parse the requested URL for clues as to what data you were trying to request and dyanmically return a page accordingly. Unfortunately, the web server would also return the 404 error code, causing some search engines search spiders and some web browsers to ignore it anyway, although some web server software, including Coldfusion and Microsoft IIS allow you to suppress this.

www.evolt.org say the advantages of "clean" URLs are -

So you can see that using clean URLs affords other benefits to your site also.




This article was first published on 09 April 2002 and does not necessarily match current events or the current opinions and views of bigmouthmedia ltd.
  • Print this page
  • Send this page to a friend
  • Digg this article
  • Post this article to Reddit
  • Bookmark this article in Del.icio.us
  • Add this article to Sphinn
  • Add this article to Furl
  • Add this article to Magnolia
  • Add this article to StumbleUpon
  • Bookmark this article in Google
bigmouthmedia - database software driven site? No Problem!
© bigmouthmedia 2008