Wednesday, September 22, 2004

Search Engine Reverse Engineering Project - online experiments

Search Engine Reverse Engineering Project

Limited because: Unlike many reverse engineering projects, we cannot "take apart" our object of study, the search engines. We will have to limit our research to examining their behavior from afar...

Known Limitations of Reverse Engineering Search Engines

It is very difficult achieve certainty in many desirable answers regarding search engine behavior due to factors such as:

1. Search engine algorithms change constantly.


The technique that worked on Google yesterday may not work today.


2. Off-page factors are uncontrollable.

If you make a change to a web page and wait to see the changes, you test will be contaminated if a single other web page on the Internet links to your web page, or if an existing link to your web page changes in Page Rank. Page Rank is a sliding scale which is constantly in flux.


3. Timing is unknown


If you make a change to one of your web pages right now, it will take some time for each search engine to crawl the page. The search engine will then take an unknown amount of time to include the page in it's database. Different page factors may be included in the database at different times. The page may be placed in a temporary holding area in the database, where it shows up in the SERPs, before eventually being placed in the main database.

Google
Creative Commons Licence
This work is licensed under a Creative Commons License.