Cell LoadDataset
Oliver Kennedy edited this page Dec 31, 2023 · 2 revisions
The LOAD DATASET cell creates a new dataset from an uploaded file or a URL.
- File or URL: The URL or file path to the dataset you want to load. You can use the graphical file picker below to find a file instead.
- Dataset Name: The name of the dataset artifact to create. Defaults to the filename.
- Format: The format of the file. Defaults to a format chosen from the file extension.
- Load Parameters:
  - File Has Headers (CSV only): Check to mark that the first row of the file contains column headers (equivalent to the Spark CSV `header` option).
  - Field Delimiter (CSV only): The character that separates fields within a row (equivalent to the Spark CSV `sep` option, also accepted as `delimiter`).
- Guess Schema: Check to guess the types and names of attributes that are not provided.
- Spark Load Options: Key-value pairs to pass directly to Spark's dataset loader (don't use unless you know what you're doing).
- Schema: Names and types of columns. If left empty, Vizier will use Spark's guesses for names.
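
As a rough illustration of how the CSV parameters above relate to Spark, the sketch below builds the option dictionary that a Spark CSV reader would receive. It is not Vizier's implementation: `spark_csv_options` is a hypothetical helper, and the assumption that entries from Spark Load Options override the other settings on conflict is ours. The keys `header`, `sep`, and `quote` are standard Spark CSV reader options.

```python
def spark_csv_options(has_headers=True, delimiter=",", extra_options=None):
    """Hypothetical helper: map the cell's CSV parameters to Spark CSV options."""
    options = {
        # "File Has Headers" corresponds to Spark's `header` option.
        "header": "true" if has_headers else "false",
        # "Field Delimiter" corresponds to Spark's `sep` option.
        "sep": delimiter,
    }
    # "Spark Load Options" are passed through as-is. Assumption: they take
    # precedence over the settings above if the same key appears twice.
    options.update(extra_options or {})
    return options


# Example: a semicolon-separated file with a header row and single-quoted fields.
opts = spark_csv_options(has_headers=True, delimiter=";", extra_options={"quote": "'"})
print(opts)
```

With PySpark available, a dictionary like this could be handed to the reader as `spark.read.options(**opts).csv(path)`, where `spark` is an existing `SparkSession` and `path` is the file to load.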