![]() |
|
Google Tend To Crawl & Index HTML Forms!
By: Navneet Kaushal 2008-04-14 On Friday, April11 2008 Google Webmasters Blog revealed that Google had been testing a new search related technology that would enable Google crawl agents to explore some HTML... forms in an attempt to discover new web pages and URLs which have not yet been found and indexed. Google's crawling and indexing team explained, "In the past few months we have been exploring some HTML forms to try to discover new web pages and URLs that we otherwise couldn't find and index for users who search on Google. Specifically, when we encounter a <form>element on a high-quality site, we might choose to do a small number of queries using the form. For text boxes, our computers automatically choose words from the site that has the form; for select menus, check boxes, and radio buttons on the form, we choose from among the values of the HTML. Having chosen the values for each input, we generate and then try to crawl URLs that correspond to a possible query a user may have made. If we ascertain that the web page resulting from our query is valid, interesting, and includes content not in our index, we may include it in our index much as we would include any other web page." The Google crawling agent also known as 'Googlebot' while searching for these unknown sites, still adheres strictly to 'robots.txt', 'nofollow', and 'noindex' commands. In the very same fashion, Google does not retrieve forms that may require any sort of user information. Also, forms that have a password input or that use terms commonly associated with personal information such as logins, user ids, contacts, etc. are also avoided by Googlebot. The crawling for these yet unknown pages does not affect those websites that are already a part of the crawling process, thus eliminating any chances of a fall in PageRank. These pages that are hidden deep in the online abyss are also referred to as Deep Web, Hidden Web or Invisible Web. CommentsTag: Google Add to Del.icio.us | Digg | Reddit | Furl Have a bookmark! - About the Author: Nav is the founder and CEO of PageTraffic, a premier search engine company known for its assured SEO service, web design and development, copywriting and full time SEO professionals. Navneet has wide experience in natural search engine optimization, internet marketing and PPC campaigns. He is a prolific writer and his articles can be found in the "Best Articles" section of many websites and article banks. As a search engine analyst , he has over 9 years of experience and his knowledge is in application here. |
|
||||||||||||||||||||||||||||
| SearchNewz
is an iEntry, Inc. ® publication
©
$line) {
echo $line ;
}
?>
All Rights Reserved. Newsletter Archive - Privacy Policy - Legal - Sitemap - Contact Us - RSS Feeds - Newsletter Signup |