Hacker Timesnew | past | comments | ask | show | jobs | submitlogin

That doesn't sound right. Were you using the actual Deepseek provider? The one time I spent 3 dollars on Deepseek in a day, I had 615k output tokens, 96M cache hit input tokens, and 5M cache miss output tokens.


It's not unheard of for "more expensive" models (on a per-token basis) to end up cheaper than weaker models (on a per-task basis).

Kimi K2.5 is roughly double the price (per token) of DeepSeek v4 Pro, but cost $0.05 vs $0.16 (for the same score) on my own benchmark.

https://sql-benchmark.nicklothian.com/?highlight=moonshotai_...

https://sql-benchmark.nicklothian.com/?highlight=deepseek_de...


Yeah, I struggle to use more than a few dollars a day using Deepseek V4 Pro (max reasoning).

* Some people suggest not using max reasoning due to overthinking and looping issues, this may consume more tokens than needed.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: