AI Performance

Distributed

AI workloads can be distributed across many processors, with each processor handling part of the data or part of the model.
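
As a rough illustration of splitting work across processors, the sketch below shards a batch of inputs, processes each shard on its own worker, and concatenates the results in order. Everything in it is an assumption made for illustration: the names shard, score, and run_data_parallel are hypothetical, score stands in for real per-device work such as a forward pass, and a thread pool stands in for separate processors or GPUs.

```python
from concurrent.futures import ThreadPoolExecutor


def shard(items, num_workers):
    """Split items into num_workers contiguous, near-equal chunks."""
    base, extra = divmod(len(items), num_workers)
    chunks, start = [], 0
    for i in range(num_workers):
        # The first `extra` chunks take one additional item each.
        size = base + (1 if i < extra else 0)
        chunks.append(items[start:start + size])
        start += size
    return chunks


def score(chunk):
    """Hypothetical stand-in for per-device work (e.g. a forward pass)."""
    return [x * x for x in chunk]


def run_data_parallel(items, num_workers=4):
    """Process each shard on its own worker, then concatenate in order."""
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        # pool.map preserves the order of the input shards.
        partials = list(pool.map(score, shard(items, num_workers)))
    return [y for part in partials for y in part]
```

Calling run_data_parallel(list(range(10)), num_workers=3) splits the ten inputs into shards of 4, 3, and 3 items. A real deployment would replace the thread pool with separate processes or devices, where moving data between workers becomes part of the cost.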

Distribution

Parallel

llama.cpp can now run across multiple GPUs: