The Robot Exclusion Standard, also known as the Robots Exclusion Protocol or robots.txt protocol, is a convention to prevent cooperating web spiders and other web robots from accessing all or part of a website which is otherwise publicly viewable. Robots are often used by search engines to categorize and archive web sites, or by webmasters to proofread source code. The standard is unrelated to, but can be used in conjunction with, sitemaps, a robot inclusion standard for websites.
Advanced Robots.txt Generator Tutorial Vol1
Uncrawled URLs in search results
Web Design Tutorial - robots txt file
KeywordEnvy Tutorial #1: robots.txt
Online Marketing Quick Tip #1 - Search Engine Optimization - Robots.txt files
Removing Pages with the Robots.txt File
add sitemap tag to robots.txt
Will a link to a page disallowed in robots txt transfer PageRank