phantomLine™ — the ultimate stealth

Cloaking FAQS — Page 3

What is keyword density?

A: One of the standard elements of web page optimization is Keyword Density: up until very recently the ratio of keywords to rest of body text was generally deemed to be one of the most important factors employed by search engines to determine a web site's ranking.

However, this basically linear approach is gradually changing now: as mathematical linguistics and automatic content recognition technology progresses, the major search engines are shifting their focus towards "theme" biased algorithms that do not rely on analysis of individual web pages anymore but, rather, will evaluate whole web sites to determine their topical focus or "theme" and its relevance in relation to users' search requests.

This is not to say that keyword density is losing in importance, quite the contrary. However, it is turning into a lot more complex technology than a simple computation of word frequency per web page can handle.

Context analysis is now being determined by a number of auxiliary linguistic disciplines and technology, for example:

All these are now contributing to the increasing sophistication of the relevance determination process. If you feel this is beginning to sound too much like rocket science for comfort, you may not be very far from the truth: it seems that the future of search engine optimization will be determined by what the industry is fond to term the “word gurus”.

A sound knowledge of fundamental linguist methodology plus more than a mere smattering of statistical calculus will most probably be paramount to achieve successful search engine rankings in the foreseeable future. Merely repeating the well worn mantra “content is king!”, as some of the lesser qualified SEO professionals and very many amateurs are currently doing, may admittedly have a welcome sedative effect by creating a feeling of fuzzy warmth and comfort. But to all practical purposes it is tantamount to whistling in the dark and fails miserably in doing justice to the overall complexity of the process involved.

It should be noted that we are talking presence AND future here: many of the classical techniques of search engine optimization are still working more or less successfully, but there is little doubt that they are rapidly losing their cutting edge and will probably be as obsolete in a few months' time as spamdexing or invisible text — both optimization techniques well worth their while throughout the 90s — have become today.

So where does keyword density come into this equation? And how is it determined anyway?

There's the rub: the term “keyword density” is by no means as objective and clear-cut as many people (some SEO experts included) will have it! The reason for this is the inherent structure of hypertext markup language (HTM) code: as text content elements are embedded in clear text command tags governing display and layout, it is not easy to determine what should or should not be factored into any keyword density calculus.

The matter is complicated further by the fact that the meta tags inside a HTML page's header may contain keywords and description content: should these be added to the total word count or not? Seeing that some search engines will ignore meta tags altogether (e.g. Lycos, Excite and Fast/Alltheweb), whereas others are still considering them (at least partially), it gets even more confusing. What may qualify for a keyword density of 2% under one frame of reference (e.g. including meta tags, graphics ALT tags, comment tags, etc.) may easily be reduced to 1% or less under another.

Further questions arise: will meta tags following the Dublin Convention (“D.C. tags”) be counted in or not? And what about HTTP-EQUIV tags? Would you really bet the ranch that TITLE tags in tables, forms or DIV elements will be ignored? Etc., etc.

Another fundamental factor generating massive fuzziness left, right and center, is the issue of semantic delimiters: what's a “word” and what isn't? Determing a lexical unity (aka a “word”) by punctuation is a common though pretty low tech method which may lead to some rather unexpected results.

Say you are featuring an article by an author named “John Doe” who happens to sport a master's degree in arts, commonly abbreviated as “M.A.”. While most algorithms will correctly count “John” and “Doe” as separate words, the “M.A.” string is quite another story. Some algorithms will actually count this for two words (“M” and “A”) because of the period (dot) is considered a delimiter — whereas others (surprise!) will not. But how would you know which search engines are handling it in which way? Answer: you don't, and that's exactly where the problems start.

The only feasible approach to master this predicament is trial and error. The typical beginner's inquiry “What's the best keyword density for AltaVista?”, understandable and basically rational as it may be, is best answered with the fairly frustrating but ultimately precise: “It all depends — your mileage may vary.” It is only by experimenting with keyword densities under standardized, comparable conditions yourself that you will be able to come to significant and viable conclusions.

To get going, here are some links to pertinent programs that will help you determine (and, in one case, even generate) keyword densities.

KeyWord Density Analyzer (KDA)

An all time classic of client based keyword density software is Roberto Grassi's powerful KeyWord Density Analyzer (KDA). It is immensely configurable and offers a fully featured free evaluation version for download. Find it here:

http://www.grsoftware.net/search_engines/software/grkda.html

(Expect to pay appr. $99 for the registered version.)

Concordance

Concordance is a powerful client based text analysis tool for making word lists and concordances from electronic texts. A trial version can be downloaded here:

http://www.concordancesoftware.co.uk/

(Expect to pay appr. $89 for the registered version.)

fantomas keyMixer™

Our own fantomas keyMixer(TM) is the world's first automatic keyword density generator, enabling you to create web pages with ultra precise densities to the first decimal digit. Read more about this server based Perl/CGI application here:

http://fantomaster.com/fakmixer0.html

(Expect to pay appr. $99 for the registered version.)

How can I cloak themes?

A: There is a marked tendency with major search engines to rank web sites according to their pages' overall or “theme” relevancy. Though this technology seems to be still very much in the making there is little doubt that it will play a major role within the foreseeable future.

So where does that leave us cloakers? The actual maths behind theme or context relevancy based algorithms is a very involved and complex affair. Generating theme based web pages according to all the rules in the book is bound to become a major industry in its own right some day, but this is well beyond the scope of this FAQ. (See a graphical overview of the basic mechanics at: http://contextbase.com/cbintro03.html.)

So for practicability's sake we'll assume that you have a fairly focused web site up and running which you wish to optimize on a theme base. One trick to do this is a technique we will term Topical Clustering or TC for short.

The mechanics of TC are fairly straightforward to start with:

  1. Determine your site's sub topics Determine which pages belong to which common set of keyword phrases. Most multi page web sites feature several sub topics at least. Make sure to distinguish between them, even if they do relate to each other on a meta level. E.g. if your site is focused on “second hand cars” as primary or meta topic, you will want to optimize it for the overall theme “automobiles”. However, you will probably have structured the site into several sub topics — each of which constitutes a Topic Cluster. Sub topics will typically be reflected in a site's navigation scheme: “vans”, “limos”, “Ford”, “Chrysler”, “vintage cars”, “Italian sports cars”, etc.
  2. Generate filler text files by Topic Cluster Generate as many different filler text files per Topic Cluster as you can. And here, “different” actually means different! Merely switching some keyword phrases for their alternatives won't do.
    Neither should your filler text files consist of mere dictionary lists — while this technique used to work fine until quite recently, expect it to lose its clout as search engine algos become more sophisticated, checking spidered pages for plausible linguistic syntax, punctuation and other natural language features plus, of course, for content.
    So in a nutshell: use as much “real” and relevant text as you can come up with.
    It is recommended to work from fairly long filler text files — while your mileage may vary, we tend to generate pages sized 10-15K for our clients' sites. The reason for this is that it gives you a wider scope for expounding on the targeted theme simply because you can boost semantic and context relevance more easily that way.
  3. Keyword optimize your cloaked pages Next, you may proceed to optimize your cloaked pages by keyword density. (For an extensive discussion of this technique see “What is keyword density?”.) Using several differing but topically related filler text files, you will generate a set of TC pages.
  4. Cross linking cloaked pages Cross link your content related cloaked pages by Topic Cluster so that they will point to each other. Don't forget to include at least one link to your index page on one or all of them. You may also want to link to at least one if not all of them by placing an invisible link on your index page.
    Additionally, you may also create a cloaked contents page per Topic Cluster linking to your newly generated cloaked pages (and, again, to your index page). Finally, fine tune the keyword density of all pages covered here again if required.
  5. Submit your topic cluster pages Finally, submit your TC pages to the search engines.
How do I cloak meta tags?

A: Cloaking web page meta tags follows a different procedure than the IP delivery and full page cloaking method discussed so far.

Server Requirements

To take advantage of this procedure you must be able to make use of Server Side Includes (SSI) on your web server.

Note IIS/4.0 users: The code presented here is an extended SSI expression which is not supported under IIS/4.0.

Meta tag cloaking is effected by excluding browser from viewing certain parts of a web page, specifically the header where meta tags are positioned by default.

Browsers are determined by their UserAgent variable. Once properly cloaked, it won't make any difference whether you read the source code online or whether you download it for viewing offline — the meta tag code will remain hidden, the browser will not be able to read it and will therefore not download it either.

Browser UserAgents

Here is a list of UserAgents as used by popular web browsers:

Activating SSI

The .htaccess File

If your web server is not configured for SSI by default, you will need to upload a file named “.htaccess” (please note the period/dot at the beginning of the file name!) to your server directory. This can be done by Telnet or FTP.

The .htaccess file should have the following content:

Options Includes +ExecCGI
AddType text/x-server-parsed-html .html

Note that many web servers will not require the specfication "Includes", meaning you can omit it altogether. However, since it won't do any harm to keep it in your file, we suggest you do not change the above entry. Thus, should you switch servers some day, you will not have to readjust your .htaccess file.

After you have uploaded the modified .htaccess file (MUST be in Ascii mode!), you're ready to go.

In the HEADER section of the web page whose meta tags you wish to protect, place the following code:

<!--# if expr="\"$HTTP_USER_AGENT\" != /Mozilla
|MSIE|Opera|Lynx|WebTV|NCSA Mosaic/" -->

VERY IMPORTANT (1)

The above must be in one SINGLE line! As browsers tend to word wrap lines which are too long for display, we had to word wrap the example. So please take this into account when cutting and pasting the above code.

Under this first header entry, you may now add the actual meta tags you wish to protect.

When you are done, you must close the protected section with the last header entry, or the rest of your page won't be displayed either!

<!--# endif -->

VERY IMPORTANT (2)

If you have other entries in your page header (e.g. for an external CSS style sheet, an external JavaScript applet, etc.) you MUST place these OUTSIDE the protected area (but WITHIN the header tags) or they will not work unless you are operating with a browser sporting a UserAgent not included in the code above.

So what does it do?

The SSI code outlined above will determine the accessing browser by its UserAgent variable. If it is recognized, the system will skip the content within the exclusion tags, effectively preventing the meta tags from being displayed.

Search engine spiders not using common browser UserAgent variables (most don't) will still get to read the meta tags nevertheless, which is, of course, what you want them to do.

Limitations

The method outlined above may well qualify for “poor man's cloaking” — it is NOT an industrial-strength protection against code snoops, the more so as UserAgents can easily be forged (“spoofed”). But it will at least cover about 95% of all ordinary browsers and their users without putting an undue strain on server load and, hence, system performance.

Bear in mind, too, that meta tags are gradually losing in importance as many search engines have stopped indexing them because of massive abuse by keyword spamming (“spamdexing”) and irrelevant description tags in the past.

Other uses

This technique can also be used to prevent email harvester bots (address extractors) from culling email addresses from textarea fields, e.g. if you happen to display your ezine complete with editors' addresses in text format on your web site.

«Previous | Next»

Cloaking Technology

Featured as an ongoing multi part section in our fantomNews™ newsletter (formerly published as an e-zine, now an online weblog), we are offering you all the stuff the search engines don't want you to know, straight from the horse's mouth!

Learn the secrets of the pros – it's free!

Get your copy of “Stealth, Cloaking, Phantom Technology: How it works – and how it doesn't” and lots of other SEO / SEM information by downloading our complete e-zine archive here!

Recommend Us! Spread it! Recommended us!
fantomas shadowMaker™ fantomas Software shadowMaker Special Deal The all-powerful 100% automatic Shadow Domain™ generator for effective heavy duty industrial-strength cloaking.

This beauty generates 100% relevant and unique content for your Shadow Domains™, creating an unlimited number of highly optimized unique pages including site maps in fully customizable keyword density and page weight

Then submit them to the search engines and redirect search engine generated human traffic in realtime (i.e. without delay) by search phrase to any URL you wish

fantomas spiderSpy™
searchbotBase
Service The world's largest database of search engine and other spiders, updated every six hours, seven days a week

The fantomas spiderSpy™ botBase Service is an indispensable backend tool for IP based search engine optimization software developers and agencies.

It is also a must have for server stats analysis tools and services to ascertain reliable spider access traffic data.

fantomas shadowSniper™
IP Delivery
Program Highly efficient and reliable IP based cloaking technology for optimized search engine rankings.

If you're developing web content in highly competitive areas, this software is your only chance to achieve good to smashing positioning!

fantomas keyMixer™
Keyword Density Generator
Program Generates precise customizable keyword density for optimized search engine positioning.

The world's first tool of its kind, this cutting edge server based application automates the task of generating high precision keyword density for your gateway and stealth pages.

fantomas subFrog PRO™
Engine Submission
Program Fully customizable and user defineable server sided automatic search engine submitter.

Submit as many URLs as you like to any or all of dozens of search engines — and let the process run in the background, leaving you all the time in the world for other tasks.

fantomInfo About Us Mission Statement Privacy Policy Contact Office Hours

At fantomaster.com we are committed to aiding internet and Web professionals in achieving their goals in today's and tomorrow's increasingly competitive technological environment.

fantomNews Weblog siteFlash: What's New Here? Archive

Read the latest info on our products and services in our fantomNews™ online newsletter focusing on IP delivery (cloaking), search engine optimization, webmaster tricks, etc

fantomProducts Overview Downloads TechSpecs Manuals Price List

Check out our fine product line of webmaster software, Perl and CGI scripts, many of them world time firsts in their class. See our documentation and test our demo versions in real time.

fantomTips FAQs Tutorials Cloaking and IP Delivery Resources Free Content

Our information gold mine: search engine positioning, IP delivery, cloaking technology, search engine spider IPs, FAQs, link popularity, resources and links to boost your web presence.

fantomServices Overview spiderSpy™ Anti-Spam Anti Code Napping Anti-Fraud

Profit from our research and development efforts! Get the world's most comprehensive database of search engine spiders for top notch search engine optimization and traffic analysis.

fantomFreestuff Overview Services Downloads FAQs Tutorials

Giving back to the community: our free cutting edge applications for webmasters and IT professionals. With thousands of downloads per year, we're helping to make the Web a better place.

fantomOrders Overview Ordering Online PayPal Ordering Offline Price List Special Deals

Need we say more?

We offer the industry's widest variety of secure options for payment, download and registration of our products and services. Order online via our state-of-the-art SSL-secured enhanced Apache server or via PayPal

Alternatively, you may order by fax, by email, by phone or by snail mail.

fantomCrew™ Affiliates Overview FAQ Links & Banners Terms Join Up! Member Login

Teaming up with success: excellent established products, lifetime commissions, zero setup fee, enlightened support — if you can make web professionals listen, speak with us and join up!

fantomTech™ OEM Program Overview Contact

The fantomTech™ Mighty Engines OEM Licensing Program offers cutting edge power engines and value added services for software developers and service providers. Full support available.

fantomMedia™ Center Press Releases

Media workers: stay informed and up-to-date by reading our fantomNews™ online newsletter, special press releases and digests. Consult with our world renowned experts.
Interview inquiries welcome.