Skip to content

Latest commit

 

History

History
28 lines (22 loc) · 1.35 KB

File metadata and controls

28 lines (22 loc) · 1.35 KB

Take a Llama 2 model for a spin

Overview

  • Example of Llama 2 models available at Databricks Marketplace Llama2.
  • Demonstrates how to do both real-time and batch model inference.
  • GPU cluster configuration:
    • For batch, use g4dn.xlarge instance type (AWS).
    • For model serving use:
      • Workload type - GPU_MEDIUM
      • Workload size - Small

Notebooks

Enhanced Databricks Marketplace Sample

Github

Last updated: 2023-12-10