Not logged in [ Register for account ] [ Login ]  
Cornell University

WebLab TSV File Format Specification

The WebLab TSV (Tab-Separated Values) file format is primarily used by the tools that save some subset of the data from the WebLab database to a file. It is the format used to download subsets to Web browsers. The format can be read by a wide variety of applications packages, e.g. Excel.

The file format is:

  • A WebLab TSV file is a UTF8 encoded, plain text file with windows-style newlines (CR+LF or \r\n) and a .tsv file extension.
  • A WebLab TSV file consists of one or more columns of data. In most applications, these columns correspond directly to columns in the WebLab database. You can find more details about them in the database schema.
  • The first line of the TSV file is the header. This consists of one column name (plain text) for each column, separated by the tab character (\t).
  • The following lines of the TSV file each consist of one entry for each column, separated by the tab character (\t).
  • If the column's data type is plain text, the entry is the plain text.
  • If the column's data type is numerical, the entry is the number converted to a textual representation.
  • If the column's data type is binary (such as PageID, URLID, LinkID etc), the entry is encoded using Base64 encoding.

For definition of the indivdual data fields, see the database schema.