Why You Should Validate Your HTML



If you're a regular reader of the MarketPosition newsletter, then you already know about the potential pitfalls of using frames, Flash, and dynamically generated pages on your site. Search engine spiders struggle with these technologies, so using them heavily can limit your success in the search engines.

Well, here's another item to add to that list: invalid HTML code. Bad HTML can hurt your site in the search engines without you ever realizing it.What exactly is HTML validation? It's the process of checking the syntax of your HTML code to find places where you've violated the rules of the language. The official rules for writing HTML are defined by the World Wide Web Consortium (W3C). Those rules include strict definitions stating which HTML tags are legitimate parts of the language, and how you should structure your HTML documents.

HTML errors that violate these rules include things like badly nested tags (where you incorrectly close one element before another), content model violations (where you nest tags that aren't allowed inside one another), and badly formed tables.

Sound confusing? Don't worry - many HTML editors include a built-in validator that will check your page and point out this sort of error. In addition, online services like WDG HTML Validator and W3C itself offer free page validation.

So what exactly is the impact of this sort of error? It depends upon who's reading the page. Errors may have no impact at all in your browser, or they could cause the text to appear in the wrong place or in the wrong size on the page. At their worst, HTML errors can keep sections of your Web page from displaying.

To be honest, validation is about as much fun as a trip to the dentist. Sometimes it feels like you're getting a root canal. The first time you validate your page, you could see dozens of errors. That's especially true if you coded your pages by hand which tends to result in more errors. Even if you use FrontPage or another WYSIWYG editor, they don't always produce code that validates cleanly. Of course, there's some assurance in the idea that a search engine would try to be compatible with common HTML errors created by the most popular editors like FrontPage, it's still not a sure bet.

What makes matters worse is that it's hard to see the value of fixing all of these problems when your page displays just fine under Internet Explorer. Indeed, the whole reason why the W3C stresses validation is because following the official rules of the HTML language makes it easier for browsers to interpret your page correctly. If the latest browsers do that already, why bother with HTML validation?

The reason is simple: search engine spiders also need to interpret your HTML. And while the Microsoft and Netscape browsers are very forgiving of your HTML errors, search engine spiders aren't nearly as kind. It helps to think of a search engine spider as a web browser - just like a browser, the spider needs to interpret your page and figure out what you're saying. Only then can it properly index your page. Search engine spiders also care about the structure of your Web page because they give extra weight to keywords placed inside certain HTML tags.

But there's a big difference between web browsers and search engine spiders: web browsers are under pressure from the marketplace to correctly display as many Web pages as possible. Any browser hoping to be taken seriously needs to understand the latest web technologies and be able to understand all those badly written pages out there on the Web. Users would quickly abandon any browser that fails to render the average Web page.

One would think that same market pressure would push the search engines to improve their spiders, making them more forgiving. After all, there's a great deal of competition in the search engine world. Yet that doesn't seem to be happening, partly because its difficult to tell when this issue has come into play. It's surprising how far search engine spiders lag behind the major browsers.

I don't want to give you the impression that any and all HTML errors will wreck your search engine ranking. Spiders do tend to forgive many errors, such as badly nested tags. But I have direct experience with bad HTML hurting a search engine ranking. A few months ago I helped a webmaster who had lost his Top 10 ranking because of a simple typo in his HTML. One badly placed angle bracket kept Googlebot from correctly parsing the home page, causing it to fall completely out of the index. The page displayed correctly under all the major browsers, but it still caused problems for Googlebot.

So validating your pages is a wise precaution, particularly if you write the HTML code by hand. Clean, well-written HTML is important if you want to ensure a good search engine ranking. It also helps guarantee that your page will be displayed properly on older, or more obscure browsers that are less forgiving. Therefore, you have two compelling reasons to validate your HTML today.

Christine Churchill is President of KeyRelevance.com a full service search engine marketing firm. She is also on the Board of Directors of the Search Engine Marketing Professional Organization ( SEMPO) and serves as co-chair of the SEMPO Technical Committee.

This article is copyrighted and has been reprinted with permission from FirstPlace Software, the makers of WebPosition Gold. FirstPlace Software helped define the SEO industry with the introduction of the first product to track your rankings on the major search engines and to help you improve those rankings. A free trial of WebPosition Gold is available from their Web site.

back to top

Search Engine optimization articles

What is "organic" search and how can it help your company?

How to increase your sales by optimizing for local markets.

Seven Steps of Search Engine Optimization

Why You Should Validate Your HTML

22 Reasons Why Your Page Did Not Get Indexed

Make Your Dynamic Web site Search Engine Friendly

Is Your WebSite Guilty By Association?

Improving Table Prominence for Higher Rankings

The Top Five Strategies for SEO

Search Engine Marketing 102: Boosting Prominence

Finally a Cost-Effective Way to Track Your Sales

Using Miva Merchant and not Ranking Well in Google? Learn Why!

Watch Out for Fancy Menu Systems

Ten Ways to Gain More SEO Clients: Beginning with your local market

Search Engine Marketing 101: What Search Engines See When They Visit Your Web Site

Advanced Tip: Improving Rankings via Server Side Includes (SSI)

Are You Losing Visibility by Duplicating Titles?

Increase your Click-Throughs With Killer Title Tags

Future Outlook: Will all Submissions Soon Become Paid?

Thou Shalt Not Spam! The 12 Commandments of Search Engine Marketing

Top Reasons Why You May Not Be Indexed

TIP: Expand Your Traffic Through Misspellings - Free Service!

The Top 5 Tips and the Top 5 Mistakes of Search Engine Marketing

How Often Should I Submit?

Benefits of Organizing Your Content Into Separate Domains

How and Why to Build a Robots.txt

Which is Better: Manual Submission or Automated?

Could my competitors be spamming me?

Should I include dashes in my domain name?

How to Host Multiple Domains to Maximize your Rankings

How to Get Your Pages Indexed and Then Keep Them That Way

A Marketing Technique You Should Avoid

How to Make Better Use of Images

Selectively Excluding Pages from Being Indexed

Why some pages rank high for no apparent reason!

Avoid Pitfalls with Frames

How to Avoid Trouble with the Engines

Link Tags: Overlooked Way to Improve your Score

How to Create Effective Page Descriptions


Check our complete range of services: SEO Optimization India, Web Design India, Web Hosting India, Internet Marketing & Offshore Software Web Development Company India

back to top

New Delhi
E-47/7, Second Floor,
Okhla Indl. Area, Phase II,
New Delhi - 110 020, India
T: 91-11-41610066/ 77
F: 91-11-41720246
E: sales@candidinfo.com
Chicago
2614 N California,
Chicago,
IL 60647
T: 1-800-717-3943
F: 313-481-6787
E: sales@candidinfo.com
Copyright © 1999-2008 Candid-Offshore Development India. | Resources | Legal | Privacy | Domain Registration