Performance Model Comparison

Performance Model Comparison

Comparisons of peak memory, time to first token (ms), and tokens per second for GPT-OSS 20B, GPT-OSS 120B, and HyperNova 60B 2602.

Format

PNG

Source

Multiverse Computing

Downloads