Skip to content

Cell LoadDataset

Oliver Kennedy edited this page Dec 31, 2023 · 2 revisions

The Load Dataset Cell

The LOAD DATASET cell creates a new dataset from an uploaded file or a URL.

  • File or URL: The URL or file path to the dataset you want to load. You can use the graphical file picker below to find a file instead.
  • Dataset Name: The name of the dataset artifact to create. Defaults to the filename.
  • Format: The format of the file. Defaults based on the file extension.
  • Load Parameters:
    • File Has Headers (CSV only): Check to mark that the file has headers (equivalent to the Spark CSV "Headers" option).
    • Field Delimiter (CSV only): Check to indicate the field delimiter (equivalent to the Spark CSV "Delimiter" option).
    • Guess Schema: Check to guess the types and names of attributes that are not provided.
    • Spark Load Options: Key value pairs to pass directly to Spark's dataset loader (Don't use unless you know what you're doing).
    • Schema: Names and types of columns. If left empty, Vizier will use Spark's guesses for names.