Can I run BGE-Small-EN on AMD RX 7900 XTX?

check_circle
Perfect
Yes, you can run this model!
GPU VRAM
24.0GB
Required
0.1GB
Headroom
+23.9GB

VRAM Usage

0GB 0% used 24.0GB

Performance Estimate

Tokens/sec ~63.0
Batch size 32

info Technical Analysis

The AMD RX 7900 XTX, equipped with 24GB of GDDR6 VRAM and based on the RDNA 3 architecture, offers ample resources for running the BGE-Small-EN embedding model. BGE-Small-EN, with its modest 0.03B parameters and FP16 VRAM requirement of only 0.1GB, presents virtually no memory constraints for this GPU. The RX 7900 XTX's 0.96 TB/s memory bandwidth further ensures efficient data transfer, minimizing potential bottlenecks during inference. While the 7900 XTX lacks dedicated Tensor Cores like NVIDIA GPUs, its substantial compute power and memory bandwidth still allow for respectable performance, as indicated by the estimated 63 tokens/sec.

lightbulb Recommendation

Given the comfortable VRAM headroom, users can experiment with larger batch sizes (up to 32) to maximize throughput. While the model fits easily into VRAM, optimizing the inference framework is key. Consider using ONNX Runtime or a similar framework optimized for AMD GPUs to leverage the RDNA 3 architecture effectively. For further optimization, explore quantization techniques like INT8 or even lower precisions, although the performance gain might be marginal given the model's small size. Monitor GPU utilization and temperature to ensure thermal throttling doesn't impact performance during extended use.

tune Recommended Settings

Batch_Size
32
Context_Length
512
Other_Settings
['Enable graph optimization in ONNX Runtime', 'Monitor GPU temperature', 'Experiment with different thread counts']
Inference_Framework
ONNX Runtime, ROCm
Quantization_Suggested
INT8 (optional)

help Frequently Asked Questions

Is BGE-Small-EN compatible with AMD RX 7900 XTX? expand_more
Yes, BGE-Small-EN is fully compatible with the AMD RX 7900 XTX.
What VRAM is needed for BGE-Small-EN? expand_more
BGE-Small-EN requires approximately 0.1GB of VRAM when using FP16 precision.
How fast will BGE-Small-EN run on AMD RX 7900 XTX? expand_more
You can expect approximately 63 tokens/sec with the default settings. This can be improved with optimizations.