Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
Home
Discussion GroupsGeneralPHPASPPerlColdFusionFlashHTML, CSS, ScriptsBrowsers

Webmaster Forum / General / Search Engines / October 2004



Tip: Looking for answers? Try searching our database.

Rating

Thread view: 
Enable EMail Alerts  Start New Thread
Thread rating: 
Rob - 29 Oct 2004 19:07 GMT
Hi,

I have a budding Web site, built with the help of MS FrontPage.  For
visual effects, I added a few extra lines of DHTML and VB Script
myself.  In IE browser I think it looks OK.  But when I open a page in
IE SOURCE Window it's another story. It seems the HTML code becomes
somewhat bloated as shared borders and command buttons are placed on
the pages.

My questions are: Would all this extra code that gets there adversely
affect how search engines would rate my Web site? Or would they stay
long enough to at least register my address?

I saw comments somewhere that claimed search engines are picky.  They
want to see clean uncluttered HTML source.  I would really appreciate
some expert advice.

Thanking in advance,
R.
http://www.securepw.ca
C.W. - 29 Oct 2004 20:36 GMT
>Hi,
>
[quoted text clipped - 12 lines]
>want to see clean uncluttered HTML source.  I would really appreciate
>some expert advice.

Not an expert ... but:

Clean markup just helps to make sure that browsers and SE spiders,
alike, encounter less problems with the markup.

Cluttered code makes the file larger than it needs to be - sometimes 2
to 4 times larger.

I tightened up the HML on my sites as they gradually worked up through
the rankings on SE's ... took the pages to 1/3 their original size
[same looks and visual appearances and all - just more tight in the
markup thoughts] and worked on them validating for HTML strict -- but
I don't rest all the eggs in the basket of that my tightening of the
HTML was the key. I think it helped to a degree but not a large degree
,... although some people feel SE's like tight pages [in terms of file
size thoughts]. Jury may still be out on that one, huh?

But if the HTML &/or code used is making up the vast majority of he
file size - may wanna rethink some of the touches really being needed
or how you can tighten things up?

I know SE placement is a major thought for a site - but if the site
looks crappy or hard to read [outside of IE v6  or design choice
itself] then SE placement won't matter or be of help to the site if
part of the people [going to it via a SE search]  can't use it. Some
visual effects may look cool to the site owner but not be impressive
on the user's side of the deal, too.

Carol
Big Bill - 29 Oct 2004 21:42 GMT
>>Hi,
>>
[quoted text clipped - 17 lines]
>Clean markup just helps to make sure that browsers and SE spiders,
>alike, encounter less problems with the markup.

No it doesn't, Sweetie, it helps to ensure that they index the page
correctly and don't miss rgeat gobs of content because it's not
properly coded.

BB

www.kruse.co.uk
The home of SEO that's shiny!
C.W. - 29 Oct 2004 22:02 GMT
[snip]

>>Not an expert ... but:
>>
[quoted text clipped - 4 lines]
>correctly and don't miss rgeat gobs of content because it's not
>properly coded.

Isn't that what I said, though? If the markup is clean then the SE
spiders wouldn't be thrown off by improperly shared markup or things
buried in certain kinds of markup or code that it can't figure
our/handle yet.

Carol

P.S. I like the sig line, Bill.
Big Bill - 29 Oct 2004 23:55 GMT
>[snip]
>>>
[quoted text clipped - 11 lines]
>buried in certain kinds of markup or code that it can't figure
>our/handle yet.

Well you weren't specifying that exactly.

>Carol
>
>P.S. I like the sig line, Bill.

Thanks Hon. I went through a couple while you were away actually. And
I'm not done yet!

BB
www.kruse.co.uk
The home of SEO that's shiny!
SEO Dave - 30 Oct 2004 00:27 GMT
>I think it helped to a degree but not a large degree
>,... although some people feel SE's like tight pages [in terms of file
>size thoughts]. Jury may still be out on that one, huh?

My experience without doing real tests is Google doesn't care how big
a file is (as long as under 101KB) as long as it can understand it. I
type the vast majority of the code and with my own sites don't always
check it (too busy, especially as I create dozens to thousands of
pages a day) and so upload mistakes all the time. To date none of the
small errors I've made have had a disastrous affect on a current SERP.

Funny one I did that meant page 80 of every book resulted in a server
error because when I was creating the template I pasted a number in
the wrong place (it was a SSI and so was disastrous). Sites have been
up 6+ months and only realised a week or two ago (still fixing it as
it's lots of pages :-))

Also the pages I create tend to be very lean on markup, I don't even
use tab or spaces to indent code (I hate having to edit indented
code!), so my stuff tends to be more content than code. I have not
noticed any improvement just because I've removed markup.

When I first got into SEO the site I was working on relied on a lot of
javascript that couldn't be replaced and couldn't be put in external
files. It was also table based and had a LOT of extra markup, pages
with just 10 products could hit 70KB quite easily!!

That site did better than a lot of very clean coded sites I've created
since. If Google considered this extra code in it's algo that site
would of been screwed! I can't prove it (I could, but not had the time
to do the tests), but reasonably sure Google ignores markup that's not
part of it's algo. So ignores javascript, applets and anything not
part of the content.

Just doesn't make sense for a search engine as sophisticated as Google
to not being able to distinguish <table> from the word table.

My advice would be to validate your code so you know it's got no
mistakes just in case you do something that does result in a search
engine completely misunderstanding your site. You should add a Doctype
and charset unless the server adds one automatically (if in doubt add
it). This way not only will all the different versions of browsers be
able to view the site as intended, spiders will be able to understand
what your markup means.

David
Signature

http://www.search-engine-optimization-services.co.uk/

Big Bill - 30 Oct 2004 16:07 GMT
>>I think it helped to a degree but not a large degree
>>,... although some people feel SE's like tight pages [in terms of file
[quoted text clipped - 6 lines]
>pages a day) and so upload mistakes all the time. To date none of the
>small errors I've made have had a disastrous affect on a current SERP.

That you know of, anyway.

>Funny one I did that meant page 80 of every book resulted in a server
>error because when I was creating the template I pasted a number in
>the wrong place (it was a SSI and so was disastrous). Sites have been
>up 6+ months and only realised a week or two ago (still fixing it as
>it's lots of pages :-))

Hey - nothing disastrous then!

>That site did better than a lot of very clean coded sites I've created
>since. If Google considered this extra code in it's algo that site
[quoted text clipped - 5 lines]
>Just doesn't make sense for a search engine as sophisticated as Google
>to not being able to distinguish <table> from the word table.

Except when it sees something like <table and therefore< might assume>
all sorts and so maybe cannot afford the page the rank status that it
might have done had it been able to reassure itself it knew what it
was indexing on the basis that the code was syntactically correct.

>My advice would be to validate your code so you know it's got no
>mistakes just in case you do something that does result in a search
>engine completely misunderstanding your site.

which is my point, so how come you don't?

>You should add a Doctype
>and charset unless the server adds one automatically (if in doubt add
>it).

Er, um, so then the server validates the page? against what as it
comes with no doc-type? I'm just asking here. I never heard of servers
adding doc-types.

> This way not only will all the different versions of browsers be
>able to view the site as intended,

I really don't think it's that simple as they all display stuff
different anyway without a zillion workarounds. I'm sorely tempted to
sally over to alt.html and tell'em "Hey! Dave's arse hanging in the
breeze over here!".
Since you've already had your quota of nutters for this week I won't
though.

> spiders will be able to understand
>what your markup means.

Not if it isn't syntactically correct they won't and hanging a
doc-type and a charset on the walls won't help that.

BB
www.kruse.co.uk
The home of SEO that's shiny!
SEO Dave - 30 Oct 2004 19:01 GMT
>>>I think it helped to a degree but not a large degree
>>>,... although some people feel SE's like tight pages [in terms of file
[quoted text clipped - 8 lines]
>
>That you know of, anyway.

When I've noticed an error I've always checked to see if it had an
impact. With small errors no noticeable impact, the one large error
resulted in the page not existing to search engines. So for those
pages it was very bad. It was the 80th page of a book, so not a big
deal to SEO, would be much more irritating for a visitor finding page
80 missing though!!

I recall another error I made on a site to over 1000 pages. I changed
a header from H3 to h4, but didn't change the </h3> to </h4> so
basically 90% of the page was after an open <h4>. Result was the SERPs
went up! I eventually fixed it and the SERPs continued to go up at
roughly the same rate (suggesting it made little to no difference).

>>Funny one I did that meant page 80 of every book resulted in a server
>>error because when I was creating the template I pasted a number in
[quoted text clipped - 3 lines]
>
>Hey - nothing disastrous then!

That wasn't a small error, that was a very big error. I think you'd
agree a server error is big, missing a > off a tag is small (the page
will still get indexed).

>>That site did better than a lot of very clean coded sites I've created
>>since. If Google considered this extra code in it's algo that site
[quoted text clipped - 10 lines]
>might have done had it been able to reassure itself it knew what it
>was indexing on the basis that the code was syntactically correct.

Yes, but I also know from experience search engines can to a degree
cope with small errors like these. If you did have <table for example
it might be treated as text. Not a big deal if you added a bit of text
saying <table. Same if you miss the same from a header <h1 you've
probably changed a header into a bit of body text, so it's not a major
disaster, but IF H1 headers confer extra benefit you just lost the
benefit. But the page will still be spidered.

>>My advice would be to validate your code so you know it's got no
>>mistakes just in case you do something that does result in a search
>>engine completely misunderstanding your site.
>
>which is my point, so how come you don't?

You ever tried validating 5000 pages in a day? Even 100 pages takes up
more time than I have.

>>You should add a Doctype
>>and charset unless the server adds one automatically (if in doubt add
[quoted text clipped - 3 lines]
>comes with no doc-type? I'm just asking here. I never heard of servers
>adding doc-types.

I didn't say the server adds a Doctype, you can have the charset dealt
with by the server.

>> This way not only will all the different versions of browsers be
>>able to view the site as intended,
>
>I really don't think it's that simple as they all display stuff
>different anyway without a zillion workarounds.

Yes, but if you miss your Doctype you just made it worse. Do you know
how your pages will be served to a visitor using a very old browser,
what happens if the browser guesses wrong? Why leave it to guess work
when a simple line of code will almost guarantee what browsers and
most likely search engine spiders will do.

>I'm sorely tempted to
>sally over to alt.html and tell'em "Hey! Dave's arse hanging in the
>breeze over here!".

I think they'd agree with me since most will see adding a Doctype as
the norm.

>Since you've already had your quota of nutters for this week I won't
>though.

So you'll be waiting until the end of the week then, thanks :-)

>> spiders will be able to understand
>>what your markup means.
>
>Not if it isn't syntactically correct they won't and hanging a
>doc-type and a charset on the walls won't help that.

If it's correctly coded (validates) then I believe it doesn't matter
how much markup you add to a page, search engines will only count
content and any markup they consider important (it's generally believe
headers falls under the above).

I'll quickly add I don't know for sure, it might be spiders completely
ignore the Doctype and charset.

I really should test it. I think I'll create two identical text pages,
but with one fill it with tables and fonts. Maybe font every word
until it hits 100KB!!

Hmmm, just noticed a flaw in your argument. The reason for not
including the Doctype, charset and removing as much markup as possible
is because you believe Google etc... counts everything in the code of
a page. Yes?

If you believe Google counts the above, wouldn't that suggest it might
have some value? If Google completely ignores it, it doesn't matter if
it's there or not, so as it helps with other things (like validating
and browsers displaying the page correctly) doesn't it make sense to
add a Doctype at least?

>BB

David
Signature

http://www.search-engine-optimization-services.co.uk/

C.W. - 30 Oct 2004 19:43 GMT
[snip]

>>>My advice would be to validate your code so you know it's got no
>>>mistakes just in case you do something that does result in a search
[quoted text clipped - 4 lines]
>You ever tried validating 5000 pages in a day? Even 100 pages takes up
>more time than I have.

I don't do all the pages in one day.

I validate the template that I will use for site ... I use HomeSite as
an HTML editor, even thouhg I hand code, and will click to validate
the page to make sure I don't have any errors in the new content added
to the template. Or I may use the Web Dev toolbar on Firefox to see if
the page validates HTML Strict with the new content added to the
already validated template.

But I will say "that's just me".

I also use DocType [doesn't add that much at all to the page and more
reasons for it than against it] and META keywords and descript on my
pages. Although I don't put a lot of value on META keywords out of the
two METAs. *shrug*

[snip]
Big Bill - 30 Oct 2004 21:17 GMT
>Hmmm, just noticed a flaw in your argument. The reason for not
>including the Doctype, charset and removing as much markup as possible
>is because you believe Google etc... counts everything in the code of
>a page. Yes?

I've been wondering about this for ages and have been working on the
basis of erring on the side of caution. I've had it inmind to ask you
and Will and separately the boys at Planet Oceran to run tests to
determine this for engines as I know you all enjoy that kind of stuff.
But, Google's going all weird again so we may well be seeing a new one
soon and MSN are threatening again to unleash their beta on a
permanent basis so now is not the time.

>If you believe Google counts the above, wouldn't that suggest it might
>have some value? If Google completely ignores it, it doesn't matter if
>it's there or not, so as it helps with other things (like validating
>and browsers displaying the page correctly) doesn't it make sense to
>add a Doctype at least?

I get no. 1 rankings without it so why bother?
But I think really we should be testing for this and every line of
code to determine what's diluting the S/N ratio and what isn't. I try
to allow for CSS as opposed to fonts etc. as a matter of course now
and I'm kind of umming and awwing about going to total css. Bar the
odd table. Maybe.

And I don't care about old browsers. If they're bleedin' Luddites
neither I nor my clients want to know them. It was different when I
was working for the Tibet people because then you had to think of
monks hiding out who only had access to Netscape 2 and the like, but
these days those days are the bad old days.

BB
www.kruse.co.uk
The home of SEO that's shiny!
Big Bill - 29 Oct 2004 21:42 GMT
>Hi,
>
[quoted text clipped - 16 lines]
>R.
>http://www.securepw.ca

Go to www.htmlvalidator.com and download the lite version. Then use
it. You'll learn a great deal.

BB
www.kruse.co.uk
The home of SEO that's shiny!
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2010 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.