Just a minor clarification: DeepSeek's current pricing for this model is temporary, set to match their previous model. They announced [1] that after February 8 it will be:
DeepSeek - 0.27$ per million tokens input, 1.10$ per million tokens output (66 tokens/s)
That's still much cheaper than the others on input pricing, though.
DeepSeek - 0.14$ per million tokens input, 0.28$ per million tokens output (66 tokens/s)
Fireworks - 0.9$ per million tokens input, 0.9$ per million tokens output (23 tokens/s)
DeepInfra - 1$ per million tokens input, 2$ per million tokens output (1.27 tokens/s)
Compared to Llama 3.1 405B (smaller model than this afaik):
Cheapest is 0.8$/0.8$ at 24 t/s, all the way up to 4$/4$ at 8 t/s
So third-party cost seems similar, but there aren't many people hosting DeepSeek right now.
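
To put the per-million-token prices quoted above into concrete terms, here is a rough sketch of the arithmetic. The prices are the ones listed in this thread; the workload (10M input tokens, 2M output tokens) and the names `PRICES` and `cost_usd` are made up purely for illustration.

```python
# Prices in USD per million tokens as (input, output), copied from the
# figures quoted in this thread. Not an official price list.
PRICES = {
    "DeepSeek (temporary)": (0.14, 0.28),
    "DeepSeek (after Feb 8)": (0.27, 1.10),
    "Fireworks": (0.90, 0.90),
    "DeepInfra": (1.00, 2.00),
}


def cost_usd(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Total cost in USD for a given number of input and output tokens."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload, chosen only to make the comparison concrete.
    input_tokens, output_tokens = 10_000_000, 2_000_000
    for provider in PRICES:
        total = cost_usd(provider, input_tokens, output_tokens)
        print(f"{provider}: ${total:.2f}")
```

For that assumed workload it comes out to roughly $1.96 at the temporary DeepSeek price, $4.90 after February 8, $10.80 on Fireworks and $14.00 on DeepInfra. The gap narrows as the share of output tokens grows, since the post-February output price is the part that rises the most.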