Llama Large Language Models

Llama 3.3 70B (70.00B)

Parameters: 70.00B
VRAM (FP16): 140.0 GB
VRAM (INT4): 35.0 GB
Context: 128000 tokens
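
The two VRAM figures above are consistent with a simple weights-only estimate: parameter count multiplied by bytes per parameter (2 bytes for FP16, 0.5 bytes for INT4). The sketch below reproduces that arithmetic. The function name `weight_vram_gb` is hypothetical, and the estimate assumes decimal gigabytes and ignores the KV cache, activations, and runtime overhead, so real usage will likely be higher.

```python
def weight_vram_gb(params_billions: float, bits_per_param: float) -> float:
    """Estimate memory for model weights only, in decimal gigabytes.

    Hypothetical helper for illustration; it does not account for
    KV cache, activations, or framework overhead.
    """
    bytes_per_param = bits_per_param / 8
    # 1e9 parameters per billion and 1e9 bytes per GB cancel out.
    return params_billions * bytes_per_param


fp16_gb = weight_vram_gb(70.0, 16)  # 16 bits = 2 bytes per parameter
int4_gb = weight_vram_gb(70.0, 4)   # 4 bits = 0.5 bytes per parameter

print(f"FP16: {fp16_gb:.1f} GB")  # prints "FP16: 140.0 GB"
print(f"INT4: {int4_gb:.1f} GB")  # prints "INT4: 35.0 GB"
```

The same function can be called with other bit widths (for example 8 for INT8) to get a rough weights-only figure under the same assumptions.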

Quantization Options

Quantization | VRAM Required | Min GPU
No quantization options available

Model Details

Family: Llama
Category: Large Language Models
Parameters: 70.00B
Context Length: 128000 tokens