Not logged in [ Register for account ] [ Login ]  
Cornell University

The Web Laboratory: GetPages Tool

This tool's documentation is available here.

NOTICE: You are not logged in. You have to be log in to run this tool.

NOTICE: You are not logged in.   
Select by Collection
Choose the Collection Name first:    
Field Name Negate Restriction "NOT LIKE" Restriction String Description of each field
CrawlID NOT LIKE
CrawlID Description  What is CrawlID?
ArchiveTime NOT LIKE
start date
end date
ArchiveTime Description  What is ArchiveTime?
Select by URL
Field Name Negate Restriction "NOT LIKE" Restriction String Description of each field
URL:Protocol NOT LIKE URL:Protocol Description  What is URL:Protocol?
URL:Host NOT LIKE URL:Host Description  What is URL:Host?
URL:Port NOT LIKE URL:Port Description  What is URL:Port?
URL:Extension NOT LIKE .htm_
URL:Path NOT LIKE %/pic/%
Select by Page Metadata
Field Name Negate Restriction "NOT LIKE" Restriction String Description of each field
Document Title NOT LIKE Document Title Description  What is Document Title?
MIMEType NOT LIKE MIMEType Description  What is MIMEType?
Language NOT LIKE Language Description  What is Language?
Select by ID
Field Name Negate Restriction "NOT LIKE" Restriction String Description of each field
PageID NOT LIKE PageID Description  What is PageID?
UrlID NOT LIKE UrlID Description  What is UrlID?
HostID NOT LIKE HostID Description  What is HostID?
IPAddress NOT LIKE IP Address Description  What is IP Address?
NOTICE: You are not logged in.   

For more details see the documentation.