
Per available providers on OpenRouter right now:

DeepSeek - $0.14 per million input tokens, $0.28 per million output tokens (66 tokens/s)

Fireworks - $0.90 per million input tokens, $0.90 per million output tokens (23 tokens/s)

DeepInfra - $1 per million input tokens, $2 per million output tokens (1.27 tokens/s)

Compared to Llama 3.1 405B (smaller model than this afaik):

Cheapest is $0.80/$0.80 (input/output per million tokens) at 24 t/s, all the way up to $4/$4 at 8 t/s

So third-party pricing seems similar, but there aren't many providers hosting DeepSeek right now.
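To make the per-token prices easier to compare, here is a rough sketch of the arithmetic in Python. The prices are the ones quoted above; the workload (10M input tokens, 2M output tokens) and the names `PRICES` and `cost_usd` are just assumptions for illustration, not anything from OpenRouter's API.

```python
# Prices in USD per million tokens (input, output), as quoted in this comment.
PRICES = {
    "DeepSeek": (0.14, 0.28),
    "Fireworks": (0.90, 0.90),
    "DeepInfra": (1.00, 2.00),
}


def cost_usd(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Total cost in USD for a given token count at the quoted prices."""
    price_in, price_out = PRICES[provider]
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 10M input tokens, 2M output tokens.
    for name in PRICES:
        print(f"{name}: ${cost_usd(name, 10_000_000, 2_000_000):.2f}")
```

For that made-up workload, DeepSeek comes out around $1.96 versus $10.80 on Fireworks and $14.00 on DeepInfra, so the gap depends heavily on whose prices you take as representative.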



Just a minor clarification: DeepSeek's pricing for this model is temporary, to match their previous model. They announced [1] that it will be the following after February 8:

DeepSeek - $0.27 per million input tokens, $1.10 per million output tokens (66 tokens/s)

Still much cheaper than the others though for input pricing.

[1] https://api-docs.deepseek.com/news/news1226#-api-pricing-upd...
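For a sense of scale, a small sketch of how much the announced change moves the numbers. The prices come from this thread; the 10M input / 2M output workload and the names `PROMO`, `REGULAR`, and `workload_cost` are assumptions made up for the example.

```python
# DeepSeek prices in USD per million tokens (input, output), as quoted in this thread.
PROMO = (0.14, 0.28)    # temporary pricing
REGULAR = (0.27, 1.10)  # announced pricing after February 8


def workload_cost(prices: tuple[float, float], input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for a workload at the given (input, output) per-million prices."""
    price_in, price_out = prices
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


if __name__ == "__main__":
    # How many times more expensive each direction becomes.
    print(f"input:  {REGULAR[0] / PROMO[0]:.2f}x")
    print(f"output: {REGULAR[1] / PROMO[1]:.2f}x")

    # Hypothetical workload: 10M input tokens, 2M output tokens.
    before = workload_cost(PROMO, 10_000_000, 2_000_000)
    after = workload_cost(REGULAR, 10_000_000, 2_000_000)
    print(f"workload: ${before:.2f} -> ${after:.2f}")
```

Input roughly doubles and output roughly quadruples, which puts the $1.10 output price above the $0.90 quoted for Fireworks, while input stays well below it.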


This is at peasant 8k context size too.


Yeah, I was assuming they are selling for cheap to get people to try the model.

But still certainly cheaper than everyone else at the moment.



