10 May

Indexing Obstacles for Dynamic Web Sites

URL Parameters, Session ID’s, Reserved Characters and Deep Nested web pages can make it harder for search engines to fully index your website. If your site uses any of the following you may find it hard to get indexed by the search engines:

Parameters

If your URL’s include parameters (end with ?a=1&b=2) then the search engines may not index these pages. This is because the spider can get caught in an infinite loop, indexing the same page hundreds of times with exactly the same content.

It used to be that no search engines would index pages with parameters. This is now much improved to how it used to be, however to ensure your site is indexed by all the search engine spiders always limit to a maximum of two parameters, but if possible use none.… Read the rest

06 Sep

Has Google Reached It’s Limit?

By Dylan Downhill

Google went public last month but that’s not the big news for Google. Nor is the settlement of the PPC patent fight with Overture. The big news is that Google may have hit the limit on the number of pages it can store. In a nutshell the number of pages Google says it has indexed (currently it reads ‘Searching 4,285,199,774 web pages’ is not that far from the largest number an integer in Unix//Linux can handle – around 10 million off – see full article here http://www.w3reports.com/index.php?itemid=549. Whether they really can’t add new pages without deleting old ones or whether this is bologna (after all, they are a bunch of uber-techies – surely they increased the size of the index when they saw this coming); only Google knows.… Read the rest

17 Aug

Publishing Through A RSS Feed – Quick Guide

This is a quick guide to publishing your web content through an RSS feed. It is not meant to be extensive, it is meant to get your feet in the door of publishing content using RSS in the quickest time possible.

RSS File Format

A RSS feed is simply an XML file containing information on pages within your site. The RSS file format is as follows:

<?xml version=”1.0″ ?>
<rss version=”2.0″>
<channel>
<title>Title Text </title>
<link>Link to site’s home page </link>
<description>Description of the feed</description>

<item>
<title>Page Title</title>
<description>Page Description</description>
<link>Page Link</link>
<author>Email to Contact You On</author>
<pubDate>Published Date</pubDate>
</item>

</channel>
</rss>

Where the contents from <item> to </item> are repeated for all the content you want to publish through the RSS file.… Read the rest