The Robot Exclusion Standard, also known as the Robots Exclusion Protocol or robots.txt protocol, is a convention to prevent cooperating web spiders and other web robots from accessing all or part of a website which is otherwise publicly viewable. Robots are often used by search engines to categorize and archive web sites, or by webmasters to proofread source code. The standard is unrelated to, but can be used in conjunction with, sitemaps, a robot inclusion standard for websites.
Diagnose Crawling with Google Webmaster Tools - Adam Lasnik
referencement site web - robots.txt
Can I disallow crawling of my CSS and JavaScript files?
Uncrawled URLs & Yellow Pages
Requesting reconsideration using Google Webmaster Tools
referencement site web-robots.txt et adsense
Why does Google index blogs faster than other
Google for Webmasters Tutorial: Discoverability