fantomNews™ — the ultimate know



fantomTip:
“The Googleness of Being” Further Qualified

(rt) Very geeky stuff, as was Michael Martinez’s original piece, On the Googleness of Being. This follow-up is a full three-page read and certainly not the easiest of undertakings, even for the experienced SEO. But he’s done it yet again: this is probably the most important reverse-engineering-based analysis of Google’s ranking algorithm published in the past two years or so.

Things have changed even since February (when “The Googleness of Being” was originally published), so he takes into account Google’s recently detected lifting of the 101K limit for cached pages.

More noteworthy is his inclusion of Google’s recent patent application and what it may imply. While many of his original assertions seem to be borne out by Google’s paper, he introduces a set of additional concepts to better describe the overall ranking process: Fixed Content (referring to originally fresh content turned static), Reassociation (a switch in a document’s relevance from one expression to another), Productive History (a site’s performance in search results), and several others.

To give an example of his reasoning:

Since millions of queries are conducted across Google each day, the simplest method of measuring a URL’s performance would be to create a ranking vector which records every position from 1 to 1000 that a URL is returned for (without regard for the queries). This could lead to an exorbitant amount of data for extremely popular sites, but the daily vector could be averaged and then stored in a monthly vector. The monthly vector would have up to 31 elements, of which the first 28 would be most significant (alternatively, Google could just go with an artificial 28-day month to match its approximate weekly update cycles and allow for a 4-week rebuild process). The average of a URL’s vector performances could be used as a measure of the site’s popularity, scope of content, and general importance.
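To make the bookkeeping in that passage a little more tangible, here is a minimal Python sketch of the daily and monthly ranking vectors Michael describes. It is purely our own illustration: the function names, the 28-day default and the handling of days without any appearances are assumptions on our part, not anything Google or the paper specifies.

```python
from statistics import mean


def daily_average(positions):
    """Average every position (1 to 1000) a URL was returned at during one day.

    Positions outside 1..1000 are ignored; a day without appearances yields None.
    """
    valid = [p for p in positions if 1 <= p <= 1000]
    return mean(valid) if valid else None


def monthly_vector(daily_positions, days=28):
    """Collapse each day's positions into one element of the monthly vector.

    `days=28` mirrors the artificial 28-day month suggested in the quote
    (an assumption for this sketch; a calendar month would use up to 31).
    """
    return [daily_average(day) for day in daily_positions[:days]]


def monthly_score(vector):
    """Average the monthly vector, skipping days with no recorded positions."""
    observed = [v for v in vector if v is not None]
    return mean(observed) if observed else None


# Hypothetical data: three days of recorded positions for a single URL.
example_vector = monthly_vector([[1, 3], [10], []])
example_score = monthly_score(example_vector)
```

A lower score means the URL tends to show up nearer the top, regardless of which queries returned it. Whether days without any appearances should be skipped or penalized is left open in the quote, so the sketch simply skips them.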

Some other interesting inferences and conclusions based on actual search results:

[…] we can conclude that Google is indeed adapting user behavior to modify its ranking algorithm, which implies that search results rankings will be more dependent upon where query results terminate in chains of successive searches than upon other off-page factors (such as inbound link anchor text). Google may be attempting to learn what is relevant by watching for how users refine their queries. [1]

[1] This seems to fit seamlessly into Google’s freshly beta-launched “personalized search” tracking setup, My Search History, effectively creating a prime self-serving data resource. This in turn, of course, fits in neatly with Google’s ambitions towards becoming the world’s #1 data mining company. See our recent op-ed:
↗ fantOpEd:
Search Engines as Data Gobblers

If his assumption should prove correct, Michael foresees the development of query spam as a means of influencing Google’s then-predetermined weighting of popular search queries and the pre-listed results tied to them.

He further articulates a very soft-spoken but no less devastating critique of Google’s proprietary PageRank algorithm:

[…] they have demonstrated a resourcefulness in refining what is essentially a flawed concept. Relevance and authority have never been ascertainable by link analysis because the vast majority of Web documents are unaware of each other. That is, studies have shown that popular Web sites get links faster than other Web sites, creating a self-sustaining cycle of favoring well-known sites over poorly known sites.

Who gets linked to, therefore, is not determined by quality, but by visibility, and visibility is no measure of quality. Good content can only overcome its competition by achieving high visibility, but if it has no visibility to begin with, it will accrue less visibility than any site already visible.
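The self-sustaining cycle he describes can be shown with a deliberately crude toy model. In the Python sketch below, each round’s new links are handed out strictly in proportion to the links a site already has. That proportional rule, the function name and the numbers are all assumptions made for illustration; this is neither Google’s method nor Michael’s, and real link growth is certainly noisier.

```python
def simulate_link_growth(initial_links, new_links_per_round, rounds):
    """Hand out new links in proportion to each site's current link count.

    A toy "visibility breeds visibility" model: quality plays no role at all.
    """
    links = [float(n) for n in initial_links]
    for _ in range(rounds):
        total = sum(links)
        # Each site's share of the new links equals its share of existing links.
        links = [n + new_links_per_round * n / total for n in links]
    return links


# Hypothetical starting point: a well-known site with 100 links, a newcomer with 1.
start = [100, 1]
after = simulate_link_growth(start, new_links_per_round=50, rounds=10)
gap_before = start[0] - start[1]
gap_after = after[0] - after[1]
```

Under this rule the newcomer’s share of all links never improves, while the absolute gap between the two sites keeps widening. That is exactly the point of the quote: with no visibility to begin with, a page accrues less visibility than any site already visible.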

Next comes an equally lethal criticism of the prevalent “link building frenzy” amongst the SEO crowd: if Google should develop, as Michael anticipates, a set of methodologies for ascertaining relevance, “each with approximate equal probability of success to the others, they can cycle through the methodologies in satisfying queries” – effectively leaving SEOs out in the cold, at least until they have caught up with this approach and have succeeded in developing new strategies to overcome it.

Not that they would have too many choices: Google could capitalize on its databases of historical search queries by structuring them into blocks with parametrized classes, making it fairly easy to detect attempts at manipulation via botnets spreading weight-skewing spam queries. Such queries would in any case probably have to be spread across the globe so as not to trigger IP class alerts and the like, a daunting task well beyond the resources of most present-day SEO agencies.

The only minor contention we have with this view is the sheer volume of queries. While combinations of search terms (keywords) may not actually be infinite, they do come pretty close. This would entail massive scaling problems not easily addressed.

Also, changing search behavior must be factored into the overall methodology: if searchers were still wont to type in one- or two-term queries only a few years ago, they seem to have become a lot more sophisticated these days, or maybe they are simply fed up with being delivered tons and tons of generic search results not reflecting their real targets. In any case, as research has shown, four-term queries seem to deliver the highest sales conversions, indicating a radical evolution of differentiated search operations. Thus, historical data will probably be of limited use.

On the other hand, Google’s prime advantage over any and all SEOs is its direct interface with user behavior. Moreover, their My Search History service, once launched in its final version and adopted by millions of users, will enable them to gauge individualized search behavior to a degree of precision never before achieved. (And of course, the chief advantage of this portalization rests in the fact that all data will be server rather than client based: no having to rely on deletable cookies anymore!)

Another interesting point he makes is the importance of outgoing rather than incoming links for new sites (which won’t have many inbound links organically to begin with anyway):

Google can assess a document’s purpose as much by what the document links to as by what links to the document. A newer page, having few if any inbound links, should be able to influence the determination of its relevance by what it links to. That is, the page makes an initial footprint in its history of association with other pages by saying, “These are the documents I want to be associated with”.

 
There is lots more, including a brief outlook on viable future optimization strategies, and we strongly recommend you not only peruse but actually study this paper scrupulously if you want to get a shrewd indication of the world of search to come.

 
This is the (abridged) version on Spider-Food Forum:
 “Paper: Changes in Google Ranking Strategies”

Read the full paper here:
 Google: Changes in Google Ranking Strategies

And here’s our original coverage of Michael’s first version:
↗ fantomTip: 
Rating the Googleness of Being

You’ll also find our reference to Google’s recently published patent application here:
↗ Where Google Is Heading – the Final Road Map?

There’s a budding thread on it now at Threadwatch:
 “Trust & Inheritance – Testing Hypotheticals and Google”

 


Trackback link: http://fantomaster.com/fantomNews/archives/2005/04/20/fantomtip-the-googleness-of-being-further-qualified/trackback/


